Int J Performability Eng ›› 2025, Vol. 21 ›› Issue (3): 149-156.doi: 10.23940/ijpe.25.03.p4.149156

Previous Articles     Next Articles

Data Driven Software Quality Assessment: Correlation Analysis of Code Metrics and Fault-Proneness

Seema Kalonia* and Amrita Upadhyay   

  1. Department of Computer Science, Banasthali Vidyapith, Rajasthan, India
  • Submitted on ; Revised on ; Accepted on
  • Contact: * E-mail address: pgphb21069_seema@banasthali.in

Abstract: Predicting software faults is essential for raising program quality and cutting maintenance expenses. Debugging efforts can be minimized, software failures can be avoided, and overall software reliability can be increased via early detection of problematic modules. Code metrics from NASA's Metrics Data Program (MDP) datasets are analyzed in this research in order to find trends and connections between software complexity and defectiveness. We examine how different code complexity indicators and software flaws are related using statistical methods and exploratory data analysis. We discover that defect-prone modules are highly correlated with cyclomatic complexity, decision density, and unique operands. By determining threshold values for these important indicators, we offer information on the quality of the software and possible places where code maintainability could be improved. This analysis emphasizes the value of empirical investigation, statistical validation, and organized feature selection in defect prediction. We lay the groundwork for future defect avoidance efforts by providing useful suggestions to lower software complexity and increase reliability through comparative analysis across several NASA datasets. By offering data-driven insights that can assist developers in optimizing code architectures and reducing defect risks, the study advances software engineering. Furthermore, our analysis highlights how important it is to comprehend software complexity early on in the development process so that teams may proactively enhance maintainability and code quality. Software engineers, quality assurance teams, and companies looking to create more reliable and fault-resistant software systems can use the research's findings as a guide. Software teams can improve software lifecycle management, reduce post-release problems, and increase productivity by methodically identifying defect-prone modules based on predetermined thresholds. Future developments in real-time monitoring and automated flaw detection systems can bolster these initiatives even more, increasing the effectiveness and dependability of software development.

Key words: software fault prediction, NASA MDP dataset, code metrics, software quality