Abstract
How can I deliver defect-free software? Can I achieve more with less resources? How can I reduce time, effort, and cost involved in developing software? Software defect prediction is an important area of research which can significantly help the software development teams grappling with these questions in an effective way. A small increase in prediction accuracy will go a long way in helping software development teams improve their efficiency. In this paper, we have proposed a framework which uses PCA for dimensionality reduction and Naïve Bayes classification algorithm for building the prediction model. We have used seven projects from NASA Metrics Data Program for conducting experiments. We have seen an average increase of 10.3% in prediction accuracy when the learning algorithm is applied with the key features extracted from the datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dhamayanthi N, Lavanya B (2019) Improvement in software defect prediction outcome using principal component analysis and ensemble machine learning algorithms. In: Hemanth J, Fernando X, Lafata P, Baig Z (eds) International Conference on Intelligent Data Communication Technologies and Internet of Things (ICICI) 2018. ICICI 2018. Lecture notes on data engineering and communications technologies, vol 26. Springer, Cham. https://link.springer.com/chapter/10.1007/978-3-030-03146-6_44
Murillo-Morera J, Castro-Herrera C, Arroyo J, Fucntcs-Fernandez R (2016) An automated defect prediction framework using genetic algorithms: a validation of empirical studies. Inteligencia Artif 19(57):114–137
Song Q, Jia Z, Shepperd M, Ying S, Liu J (2011) A general software defect-proneness prediction framework. IEEE Trans Softw Eng 37(3):356–370
Shirabad JS, Menzies TJ (2005) The PROMISE repository of software engineering databases. School of Information Technology and Engineering, University of Ottawa, Canada. Available: http://promise.site.uottawa.ca/SERepository
Wang S, Ping HE, Zelin L (2016) An enhanced software defect prediction model with multiple metrics and learners. Int J Ind Syst Eng 22(3):358–371
Shatnawi R, Li W (2016) An empirical investigation of predicting fault count, fix cost and effort using software metrics. (IJACSA) Int J Adv Comput Sci Appl 7(2)
Menzies T, Greenwald J, Frank A (2007) Data mining static code attributes to learn defect predictors. IEEE Trans Softw Eng 33(1)
Jiang Y, Lin J, Cukic B, Menzies T (2009) Variance analysis in software fault prediction models. In: 20th international symposium of software reliability engineering
Koru AG, Liu H (2005) An investigation of the effect of module size on defect prediction using static measures. In: Promise ’05
Singh P, Verma S (2014) An efficient software fault prediction model using cluster based classification. Int J Appl Inf Syst (IJAIS) 7(3)
Zhang H, Nelson A, Menzies T (2010) On the value of learning from defect dense components for software defect prediction. In: Promise 2010, 12–13 Sept
Jin C, Dong E-M, Qin L-N (2010) Software fault prediction model based on adaptive dynamical and median particle swarm optimization. In: Second international conference on multimedia and information technology
Witten IH, Frank E (2005) Data mining, practical machine learning tools and techniques. Morgan Kaufmann, San Francisco
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dhamayanthi, N., Lavanya, B. (2019). Software Defect Prediction Using Principal Component Analysis and Naïve Bayes Algorithm. In: Chaki, N., Devarakonda, N., Sarkar, A., Debnath, N. (eds) Proceedings of International Conference on Computational Intelligence and Data Engineering. Lecture Notes on Data Engineering and Communications Technologies, vol 28. Springer, Singapore. https://doi.org/10.1007/978-981-13-6459-4_24
Download citation
DOI: https://doi.org/10.1007/978-981-13-6459-4_24
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6458-7
Online ISBN: 978-981-13-6459-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)