Recovering traceability links between code and documentation

被引:518
作者
Antoniol, G
Canfora, G
Casazza, G
De Lucia, A
Merlo, E
机构
[1] Univ Sannio, Res Ctr Software Technol, Dept Engn, I-82100 Benevento, Italy
[2] Univ Naples Federico II, Dept Informat & Sistemist, I-80125 Naples, Italy
[3] Ecole Politech, Dept Elect & Comp Engn, Montreal, PQ, Canada
关键词
redocumentation; traceability; program comprehension; object orientation; information retrieval;
D O I
10.1109/TSE.2002.1041053
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software system documentation is almost always expressed informally in natural language and free text. Examples include requirement specifications, design documents, manual pages, system development journals, error logs, and related maintenance reports. We propose a method based on information retrieval to recover traceability links between source code and free text documents. A premise of our work is that programmers use meaningful names for program items, such as functions, variables, types, classes, and methods. We believe that the application-domain knowledge that programmers process when writing the code is often captured by the mnemonics for identifiers; therefore, the analysis of these mnemonics can help to associate high-level concepts with program concepts and vice-versa. We apply both a probabilistic and a vector space information retrieval model in two case studies to trace C++ source code onto manual pages and Java code to functional requirements. We compare the results of applying the two models, discuss the benefits and limitations, and describe directions for improvements.
引用
收藏
页码:970 / 983
页数:14
相关论文
共 54 条
[11]   RE(2) - REVERSE-ENGINEERING AND REUSE REENGINEERING [J].
CANFORA, G ;
CIMITILE, A ;
MUNRO, M .
JOURNAL OF SOFTWARE MAINTENANCE-RESEARCH AND PRACTICE, 1994, 6 (02) :53-72
[12]   REVERSE ENGINEERING AND DESIGN RECOVERY - A TAXONOMY [J].
CHIKOFSKY, EJ ;
CROSS, JH .
IEEE SOFTWARE, 1990, 7 (01) :13-17
[13]  
Cormen T. H., 1990, INTRO ALGORITHMS
[14]  
COVER TM, 1992, ELEMENTS INFORMATION
[15]  
De Mori R., 1998, SPOKEN DIALOGUES COM
[16]   Identifying design-code inconsistencies in object-oriented software: A case study [J].
Fiutem, R ;
Antoniol, G .
INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE, PROCEEDINGS, 1998, :94-102
[17]  
FRAKES W, 1992, INFORMATION RETRIEVA
[18]  
Frakes W. B., 1987, Proceedings of the Twentieth Hawaii International Conference on System Sciences 1987, P530
[19]  
Fyson MJ, 1998, J SOFTW MAINT-RES PR, V10, P93, DOI 10.1002/(SICI)1096-908X(199803/04)10:2<93::AID-SMR169>3.0.CO
[20]  
2-6