Int J Performability Eng ›› 2026, Vol. 22 ›› Issue (4): 188-199.doi: 10.23940/ijpe.26.04.p2.188199


A Rigorous Empirical Benchmark of Machine Learning Models for Software Effort Estimation

Jaskirat Kaur* and Navdeep Kaur   

  1. Sri Guru Granth Sahib World University, Punjab, India
  • Contact: * E-mail address: jaskiratkaurcomp2018@sggswu.edu.in

Abstract: Accurate software effort estimation is critical for effective project planning, resource allocation, and cost control. Reliable prediction nevertheless remains challenging because software project data are heterogeneous, noisy, and nonlinear, which often leads to schedule delays and cost overruns. This study presents a systematic empirical comparison of machine learning and ensemble-based models for software effort estimation, focusing on performance consistency, robustness across datasets, and the practical value of ensemble complexity under both tuned and untuned settings. An extensive experimental evaluation is conducted on five widely used benchmark datasets (DESHARNAIS, CHINA, ISBSG, COCOMO81, and MAXWELL), covering traditional single learners, strong tree-based ensembles, and stacking approaches. Models are evaluated with multiple accuracy and robustness metrics, including Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), Median Magnitude of Relative Error (MdMRE), PRED(0.25), and Standardized Accuracy (SA), and nonparametric statistical tests, including Friedman's rank test, are applied to ensure a rigorous comparative analysis. The findings indicate that ensemble-based models consistently outperform traditional single learners across all datasets; however, model rankings remain largely stable between tuned and untuned configurations, suggesting that performance gains are not primarily driven by hyperparameter optimization. Among all evaluated methods, Extra Trees demonstrates the most robust and consistent performance, with the best overall Friedman rank and minimal sensitivity to tuning, whereas stacking ensembles fail to provide statistically significant or consistent improvements despite their higher computational cost.
Overall, the results provide strong empirical evidence that well-designed tree-based ensemble models offer the best balance of accuracy, robustness, and efficiency, challenging the presumed advantages of increased ensemble complexity in practical software effort estimation.
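The accuracy metrics named in the abstract follow standard definitions from the effort-estimation literature. A minimal sketch of how they could be computed is shown below; the function name is illustrative, and the random-guessing baseline used for Standardized Accuracy is an assumption following the commonly used Shepperd and MacDonell formulation (SA = 1 − MAE / MAE of a random-guessing predictor), not a detail stated in this paper:

```python
import numpy as np

def effort_metrics(y_true, y_pred, n_baseline_runs=1000, seed=0):
    """Common software-effort-estimation accuracy metrics (illustrative sketch)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    abs_err = np.abs(y_true - y_pred)
    mre = abs_err / y_true                       # magnitude of relative error

    # Standardized Accuracy: improvement over a random-guessing baseline
    # whose predictions are drawn from the observed efforts themselves
    # (assumed baseline, per the common Shepperd-MacDonell definition).
    rng = np.random.default_rng(seed)
    guesses = rng.choice(y_true, size=(n_baseline_runs, len(y_true)))
    mae_p0 = np.mean(np.abs(guesses - y_true))
    mae = abs_err.mean()

    return {
        "MAE": mae,
        "RMSE": np.sqrt(np.mean(abs_err ** 2)),
        "MdMRE": np.median(mre),                 # median MRE
        "PRED(0.25)": np.mean(mre <= 0.25),      # share of estimates within 25%
        "SA": 1.0 - mae / mae_p0,
    }
```

For the nonparametric comparison across datasets, `scipy.stats.friedmanchisquare` can then be applied to each model's per-dataset error scores to test whether the observed rank differences are statistically significant.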

Key words: software effort estimation, machine learning, ensemble learning, tree-based models, extra trees, empirical evaluation, model robustness