A new perspective on data homogeneity in software cost estimation: a study in the embedded systems domain

被引:24
作者
Bakir, Ayse [1 ]
Turhan, Burak [2 ]
Bener, Ayse B. [1 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
[2] Natl Res Council Canada, Inst Informat Technol, Software Engn Grp, Ottawa, ON K1A 0R6, Canada
关键词
Application domain; Cost estimation; Data homogeneity; Embedded software; Machine learning; DESIGN; MODEL;
D O I
10.1007/s11219-009-9081-z
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Cost estimation and effort allocation are the key challenges for successful project planning and management in software development. Therefore, both industry and the research community have been working on various models and techniques to accurately predict the cost of projects. Recently, researchers have started debating whether the prediction performance depends on the structure of data rather than the models used. In this article, we focus on a new aspect of data homogeneity, "cross-versus within-application domain'', and investigate what kind of training data should be used for software cost estimation in the embedded systems domain. In addition, we try to find out the effect of training dataset size on the prediction performance. Based on our empirical results, we conclude that it is better to use cross-domain data for embedded software cost estimation and the optimum training data size depends on the method used.
引用
收藏
页码:57 / 80
页数:24
相关论文
共 38 条
[1]  
Albrecht A.J., 1979, Em Proceedings of the Joint SHARE, GUIDE, and IBM Application Development Symposium, P83
[2]  
Alpaydin E., 1998, Proceedings of Engineering of Intelligent Systems, V2, P6
[3]  
ANGELIS L, 2000, J EMPIRICAL SOFTWARE, V5, P35, DOI DOI 10.1023/A:1009897800559
[4]  
[Anonymous], 2004, Introduction to Machine Learning
[5]  
BASKELES B, 2007, SOFTWARE EFFORT ESTI, P1
[6]  
BOEHM BW, 1999, COCOMO 2 COQUALMO DA
[7]  
BOEHM BW, 1981, SOFTWARE ENG EC ADV
[8]  
BOEHM BW, 2009, COCOMO 2 MODEL DEFIN
[9]  
Boetticher G., 2007, The PROMISE Repository of Empirical Software Engineering Data
[10]  
BOETTICHER GD, 2001, 1 INT WORKSH MOD BAS, P17