Browsing by Author "Keen, Michael"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Open Access Aslib Cranfield research project - Factors determining the performance of indexing systems; Volume 1, Design; Part 1, Text(1966) Cleverdon, Cyril W.; Mills, Jack; Keen, MichaelItem Open Access Aslib Cranfield research project - Factors determining the performance of indexing systems; Volume 1, Design; Part 2, Appendices(1966) Cleverdon, Cyril W.; Mills, Jack; Keen, MichaelThe appendices which follow provide complete informationconcerning the collection of documents and the set of questions used in the investigation, and are included with the intention that it should be possible for anyone - if they should wish to do so - to repeat the test.Item Open Access Aslib Cranfield research project - Factors determining the performance of indexing systems; Volume 2, Test results(1966) Cleverdon, Cyril W.; Keen, MichaelThe test results are presented for a number of different index languages using various devices which affect recall or precision. Within the environment of this test, it is shown that the best performance was obtained with the group of eight index languages which used single terms. The group of fifteen index languages which were based on concepts gave the worst performance, while a group of six index languages based on the Thesaurus of Engineering Terms of the Engineers Joint Council were intermediary. Of the single term index languages, the only method of improving performance was to group synonyms and word forms, and any broader groupings of terms depressed performance. The use of precision devices such as links gave no advantage as compared to the basic device of simple coordination. All results have to be considered within the context of the experimental environment, but they can be said to substantiate or clarify many of the findings of Cranfield I. It is conclusively shown that an inverse relationship exists between recall and precision, whatever the variable may be that is being changed. The two factors which appear most likely to affect performance are the level of exhaustivity of indexing and the level of specificity of the terms in the index language. For any given operational situation, the optimum levels cannot be categorically stated in advance, but can only be determined by an evaluation of the system, the main consideration probably being the subject field. It would be unusual if the characteristics of the subject field used for this test were such as to make it unique, so the high performance obtained with the single terms in natural language can be considered to be of some importance in regard to the use of natural language text as input to mechanised systems.