Recovering traceability links in software artifact management systems using information retrieval methods

被引:173
作者
De Lucia, Andrea [1 ]
Fasano, Fausto [1 ]
Oliveto, Rocco [1 ]
Tortora, Genoveffa [1 ]
机构
[1] Univ Salerno, Dipartimento Matemat & Informat, I-84084 Fisciano, SA, Italy
关键词
documentation; management; software artifact management; traceability management; impact analysis; latent semantic indexing; SOURCE-CODE; IMPACT ANALYSIS; DESIGN; TOOL; MODELS;
D O I
10.1145/1276933.1276934
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The main drawback of existing software artifact management systems is the lack of automatic or semi-automatic traceability link generation and maintenance. We have improved an artifact management system with a traceability recovery tool based on Latent Semantic Indexing (LSI), an information retrieval technique. We have assessed LSI to identify strengths and limitations of using information retrieval techniques for traceability recovery and devised the need for an incremental approach. The method and the tool have been evaluated during the development of seventeen software projects involving about 150 students. We observed that although tools based on information retrieval provide a useful support for the identification of traceability links during software development, they are still far to support a complete semi-automatic recovery of all links. The results of our experience have also shown that such tools can help to identify quality problems in the textual description of traced artifacts.
引用
收藏
页数:50
相关论文
共 79 条
[1]  
Alexander Ian., 2002, Proc of the 1st Int Workshop on Traceability, P26
[2]   Recovering traceability links between code and documentation [J].
Antoniol, G ;
Canfora, G ;
Casazza, G ;
De Lucia, A ;
Merlo, E .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (10) :970-983
[3]   Design-code traceability for object-oriented systems [J].
Antoniol, G ;
Caprile, B ;
Potrich, A ;
Tonella, P .
ANNALS OF SOFTWARE ENGINEERING, 2000, 9 (1-4) :35-58
[4]   Traceability recovery by modeling programmer behavior [J].
Antoniol, G ;
Casazza, G ;
Cimitile, A .
SEVENTH WORKING CONFERENCE ON REVERSE ENGINEERING - PROCEEDINGS, 2000, :240-247
[5]   Working session: Identification of lower-level artifacts [J].
Antoniol, G ;
Casazza, G ;
Merlo, AD .
8TH INTERNATIONAL WORKSHOP ON PROGRAM COMPREHENSION (IWPC 2000), PROCEEDINGS, 2000, :253-253
[6]  
ARNOLD S.P., 1988, Software Reuse: Emerging Technology, P138
[7]  
Aversano L, 2003, P 15 INT C SOFTW ENG, P497
[8]  
Baeza-Yates R., 1999, Modern Information Retrieval, V463
[9]   DESIGN RECOVERY FOR MAINTENANCE AND REUSE [J].
BIGGERSTAFF, TJ .
COMPUTER, 1989, 22 (07) :36-49
[10]   Active artefact management for distributed software engineering [J].
Boldyreff, C ;
Nutter, D ;
Rank, S .
26TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, PROCEEDINGS, 2002, :1081-1086