Having established a general framework for evaluation, inspired by ISO 9126, and intended to be embodied by a PTB, the question arises how to set up an evaluation according to this framework or to construct a PTB to be applied to a specific class of NLP systems.
A number of ingredients is essential to fulfill this task and we will describe these briefly here.