Review and empirical analysis of software effort estimation

Rahman, Mizanur and Sarwar, Hasan and Kader, Md Abdul and Gonagalves, Teresa and Tin, Ting Tin (2024) Review and empirical analysis of software effort estimation. IEEE Access. p. 1. ISSN 2169-3536. (In Press / Online First) (In Press / Online First)

[img]
Preview
Pdf
Review and Empirical Analysis of Software Effort Estimation.pdf
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (3MB) | Preview

Abstract

The average software company spends a huge amount of its revenue on R&D for how to deliver software on time. Accurate software effort estimation is critical for successful project planning, resource allocation, and on-time delivery within budget for sustainable software development. However, both overestimation and underestimation pose significant challenges in software development, necessitating continuous improvement in estimation techniques. This study reviews recent machine learning approaches exploited to enhance software effort estimation (SEE) accuracy, focusing on research published between 2020 and 2023. The literature review employed an approach to identify pertinent research on machine learning techniques for software estimation efforts. Additionally, comparative experiments were conducted employing five commonly used ML methods: K-Nearest Neighbor, Support Vector Machine, Random Forest, Logistic Regression, and LASSO Regression. These techniques were assessed using five widely employed accuracy metrics such as Mean Squared Error (MSE), Mean Magnitude of Relative Error (MMRE), R-squared, Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE) on seven benchmark datasets (Albrecht, Desharnais, China, Kemerer, Mayazaki94, Maxwell, COCOMO). By carefully reviewing study quality, analyzing results across the literature, and rigorously evaluating experimental outcomes, clear conclusions were drawn about the most promising techniques for achieving state-of-the-art accuracy in estimating software effort. This study makes three key contributions to the field: firstly, it furnishes a thorough overview of recent machine learning research in software effort estimation (SEE); secondly, it provides data-driven guidance for researchers and practitioners to select optimal methods for accurate effort estimation; and thirdly, it demonstrates the performance of publicly available datasets through experimental analysis. Enhanced estimation supports the development of better predictive models for software project time, cost, and staffing needs. The findings aim to focus future research directions and tool development toward the most accurate machine learning approaches for modeling software development effort, costs, and delivery schedules.

Item Type: Article
Additional Information: Indexed by Scopus
Uncontrolled Keywords: Estimation; KNN; LASSO; Linear Regression; Machine learning algorithms; R&D investment; Random Forest; Research and development; Reviews; Software; Software algorithms; Software Development Efforts Estimation; Software Effort Estimation; Software reliability; Support Vector Machine
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
T Technology > T Technology (General)
T Technology > TA Engineering (General). Civil engineering (General)
Faculty/Division: Faculty of Computing
Depositing User: Mr Muhamad Firdaus Janih@Jaini
Date Deposited: 31 Jul 2024 01:53
Last Modified: 31 Jul 2024 01:53
URI: http://umpir.ump.edu.my/id/eprint/41601
Download Statistic: View Download Statistics

Actions (login required)

View Item View Item