In vitro evaluation of a program for machine-aided indexing

被引:5
作者
Jacquemin, C
Daille, B
Royauté, J
Polanco, X
机构
[1] CNRS, LIMSI, F-91403 Orsay, France
[2] Univ Paris 11, F-91403 Orsay, France
[3] Univ Nantes, IRIN, F-44322 Nantes 3, France
[4] CNRS, INIST, Unite Rech & Innovat, F-54514 Vandoeuvre Les Nancy, France
关键词
human evaluation; machine-aided indexing; controlled indexing; free indexing; multi-word terms; key-phrase indexing;
D O I
10.1016/S0306-4573(01)00050-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents the human evaluation of ILIAD, a program for machine-aided indexing (MAI). It consists of two language engineering modules and is designed to assist expert librarians in computer-aided indexing and document analysis. Our aim is the expert evaluation of automatic multi-word term indexing. Evaluation is performed by documentary engineers. Cataloging and indexing are their principal tasks. They also have a good scientific knowledge of the domain to which the indexed documents belong. We first present the ILIAD program and the two systems submitted to this evaluation, the methodology (protocol) adopted, the differences between the protocol and the implementation, and the results of these evaluations. Human evaluation is divided into three parts: firstly the evaluation of controlled indexing, then free indexing and finally term variant extraction performed during controlled indexing. Finally, we analyze the relevance of this evaluation by calculating the agreement frequency and the Kappa coefficient and propose some future developments. (C) 2002 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:765 / 792
页数:28
相关论文
共 27 条
[1]  
[Anonymous], 1971, SMART RRETRIEVAL SSY
[2]  
Daille Beatrice, 1996, BALANCING ACT COMBIN, P49
[3]  
Dunning T., 1993, Computational Linguistics, V19, P61
[4]  
FROEHLICH TJ, 1994, J AM SOC INFORM SCI, V45, P124, DOI 10.1002/(SICI)1097-4571(199404)45:3<124::AID-ASI2>3.0.CO
[5]  
2-8
[6]  
Grefenstette G., 1994, Proceedings of the 3rd International Conference on Computational Lexicography, P79
[7]   OVERVIEW OF THE 2ND TEXT RETRIEVAL CONFERENCE (TREC-2) [J].
HARMAN, D .
INFORMATION PROCESSING & MANAGEMENT, 1995, 31 (03) :271-289
[8]  
Hodge G. M., 1994, Indexer, V19, P23
[9]  
L'Homme M.-C., 1996, Terminology, V3, P291, DOI 10.1075/term.3.2.04hom
[10]   MEASUREMENT OF OBSERVER AGREEMENT FOR CATEGORICAL DATA [J].
LANDIS, JR ;
KOCH, GG .
BIOMETRICS, 1977, 33 (01) :159-174