Hyper-Quadtree-Based K-Means Algorithm for Software Fault Prediction

Sasidharan, Rakhi; Sriram, Padmamala

doi:10.1007/978-81-322-1680-3_12


Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 246)


Abstract

Software faults are recoverable errors in a program that arise from programming errors. Software fault prediction is hampered by problems such as the non-availability of fault data, which makes supervised techniques difficult to apply. In such cases, unsupervised techniques are helpful. In this paper, a hyper-quadtree-based K-means algorithm is applied to predict faults in program modules. The approach has two parts. First, the hyper-quadtree is applied to the software fault prediction dataset to initialize the K-means clustering algorithm; an input parameter Δ governs the initial number of clusters and the cluster centers. Second, the cluster centers and the number of clusters obtained from the initialization algorithm are used as input to the K-means clustering algorithm, which predicts the faults in the software modules. The overall error rate of this prediction approach is compared with that of other existing algorithms.
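
The two-part scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how Δ is applied, so the sketch assumes a hypothetical `delta` that acts as a bucket capacity: a cell holding more than `delta` points is split at its midpoint along every dimension, and the centroids of the non-empty leaf cells become the initial centers. The function names, the `max_depth` guard, and the sample metric values are likewise illustrative assumptions.

```python
from math import dist  # Euclidean distance, Python 3.8+


def quadtree_centers(points, delta, max_depth=8):
    """Assumed initialization: centroids of the leaves of a 2^d-way midpoint split."""
    dim = len(points[0])
    lo = [min(p[i] for p in points) for i in range(dim)]
    hi = [max(p[i] for p in points) for i in range(dim)]

    def centroid(pts):
        return tuple(sum(x) / len(pts) for x in zip(*pts))

    def split(pts, lo, hi, depth):
        # Assumption: `delta` is the largest bucket that is left unsplit.
        if len(pts) <= delta or depth == max_depth:
            return [centroid(pts)]
        mid = [(l + h) / 2 for l, h in zip(lo, hi)]
        cells = {}  # only non-empty sub-cells are ever created
        for p in pts:
            key = tuple(p[i] > mid[i] for i in range(dim))
            cells.setdefault(key, []).append(p)
        centers = []
        for key, sub in cells.items():
            sub_lo = [mid[i] if key[i] else lo[i] for i in range(dim)]
            sub_hi = [hi[i] if key[i] else mid[i] for i in range(dim)]
            centers.extend(split(sub, sub_lo, sub_hi, depth + 1))
        return centers

    return split(list(points), lo, hi, 0)


def kmeans(points, centers, max_iter=100):
    """Plain Lloyd iterations started from the supplied centers."""
    centers = [tuple(c) for c in centers]
    labels = []
    for _ in range(max_iter):
        # Assignment step: index of the nearest center for each point.
        labels = [min(range(len(centers)), key=lambda k: dist(p, centers[k]))
                  for p in points]
        # Update step: mean of each cluster; an empty cluster keeps its center.
        new_centers = []
        for k, c in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == k]
            new_centers.append(
                tuple(sum(x) / len(members) for x in zip(*members)) if members else c
            )
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels


# Hypothetical two-metric measurements for six program modules.
modules = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 9.0), (8.5, 9.5), (9.0, 8.8)]
init = quadtree_centers(modules, delta=3)
centers, labels = kmeans(modules, init)
```

In this sketch the number of clusters is never passed in directly: it falls out of how many leaf cells the split produces, which matches the abstract's statement that the initialization supplies both the centers and their count. Deciding which resulting cluster represents faulty modules is a separate step that the abstract does not describe and the sketch does not attempt.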



Author information

Authors and Affiliations

Department of Computer Science and Engineering, Amrita University, Kollam, Kerala, India
Rakhi Sasidharan & Padmamala Sriram

Corresponding author

Correspondence to Rakhi Sasidharan.

Editor information

Editors and Affiliations

Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
G. Sai Sundara Krishnan
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. Anitha
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
R. S. Lekshmi
Applied Mathematics and Computational Sciences, PSG College of Technology, Coimbatore, Tamil Nadu, India
M. Senthil Kumar
Department of Mathematics, Ryerson University, Toronto, Ontario, Canada
Anthony Bonato
University of Basque Country, Paseo Manuel de Lardizabal 1, San Sebastian, Spain
Manuel Graña

About this paper

Cite this paper

Sasidharan, R., Sriram, P. (2014). Hyper-Quadtree-Based K-Means Algorithm for Software Fault Prediction. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_12


DOI: https://doi.org/10.1007/978-81-322-1680-3_12
Published: 27 November 2013
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1679-7
Online ISBN: 978-81-322-1680-3
eBook Packages: Engineering, Engineering (R0)
