Software Defect Prediction Using Principal Component Analysis and Naïve Bayes Algorithm

  • Conference paper
Proceedings of International Conference on Computational Intelligence and Data Engineering

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 28)

Abstract

How can I deliver defect-free software? Can I achieve more with fewer resources? How can I reduce the time, effort, and cost involved in developing software? Software defect prediction is an important area of research that can help software development teams grappling with these questions. Even a small increase in prediction accuracy can go a long way toward improving their efficiency. In this paper, we propose a framework that uses principal component analysis (PCA) for dimensionality reduction and the Naïve Bayes classification algorithm for building the prediction model. We conducted experiments on seven projects from the NASA Metrics Data Program. We observed an average increase of 10.3% in prediction accuracy when the learning algorithm was applied to the key features extracted from the datasets.
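
The general shape of such a pipeline (reduce the metric vectors with PCA, then train Naïve Bayes on the reduced features) can be illustrated with a minimal sketch. It is not the authors' implementation. The synthetic data stands in for the NASA datasets, and the Gaussian form of Naïve Bayes, the standardization step, the 70/30 split, the choice of three components, and all function names are assumptions made for illustration only.

```python
import numpy as np


def fit_pca(X, n_components):
    """Return the training mean and the top principal directions of X."""
    mean = X.mean(axis=0)
    # Rows of vt are principal directions, ordered by decreasing variance.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def apply_pca(X, mean, components):
    """Project X onto the retained principal directions."""
    return (X - mean) @ components.T


def fit_gaussian_nb(X, y):
    """Estimate class priors and per-class feature means and variances."""
    classes = np.unique(y)
    priors = np.array([np.mean(y == c) for c in classes])
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    # Small constant keeps the variances strictly positive.
    variances = np.array([X[y == c].var(axis=0) for c in classes]) + 1e-9
    return classes, priors, means, variances


def predict_gaussian_nb(model, X):
    """Pick the class with the highest log prior plus log likelihood."""
    classes, priors, means, variances = model
    diff = X[:, None, :] - means[None, :, :]  # shape (samples, classes, features)
    log_lik = -0.5 * (np.log(2 * np.pi * variances) + diff ** 2 / variances).sum(axis=2)
    return classes[np.argmax(np.log(priors) + log_lik, axis=1)]


def run_demo(seed=0, n_modules=400, n_metrics=20, n_components=3):
    """Compare Naive Bayes on all metrics with Naive Bayes on PCA features."""
    rng = np.random.default_rng(seed)
    # Hypothetical data: correlated "code metrics" driven by three latent factors.
    latent = rng.normal(size=(n_modules, 3))
    mixing = rng.normal(size=(3, n_metrics))
    X = latent @ mixing + 0.5 * rng.normal(size=(n_modules, n_metrics))
    # Hypothetical defect label tied to the first latent factor.
    y = (latent[:, 0] + 0.5 * rng.normal(size=n_modules) > 0.8).astype(int)

    split = n_modules * 7 // 10
    X_train, X_test = X[:split], X[split:]
    y_train, y_test = y[:split], y[split:]

    # Standardize using training statistics only, to avoid leaking test data.
    mu, sigma = X_train.mean(axis=0), X_train.std(axis=0)
    X_train, X_test = (X_train - mu) / sigma, (X_test - mu) / sigma

    baseline = fit_gaussian_nb(X_train, y_train)
    acc_all = np.mean(predict_gaussian_nb(baseline, X_test) == y_test)

    mean, components = fit_pca(X_train, n_components)
    Z_train = apply_pca(X_train, mean, components)
    Z_test = apply_pca(X_test, mean, components)
    reduced = fit_gaussian_nb(Z_train, y_train)
    acc_pca = np.mean(predict_gaussian_nb(reduced, Z_test) == y_test)

    return {
        "all_features": float(acc_all),
        "pca_features": float(acc_pca),
        "reduced_shape": Z_test.shape,
    }


if __name__ == "__main__":
    print(run_demo())
```

Because the data here is synthetic, the two accuracies it prints say nothing about the 10.3% figure reported above. The sketch only shows where the dimensionality reduction sits relative to the classifier, and that the PCA directions are fitted on the training portion alone.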

Corresponding author

Correspondence to N. Dhamayanthi.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Dhamayanthi, N., Lavanya, B. (2019). Software Defect Prediction Using Principal Component Analysis and Naïve Bayes Algorithm. In: Chaki, N., Devarakonda, N., Sarkar, A., Debnath, N. (eds) Proceedings of International Conference on Computational Intelligence and Data Engineering. Lecture Notes on Data Engineering and Communications Technologies, vol 28. Springer, Singapore. https://doi.org/10.1007/978-981-13-6459-4_24
