Abstract
Software faults are recoverable errors in a program that occur due to the programming errors. Software fault prediction is subject to problems like non-availability of fault data which makes the application of supervised technique difficult. In such cases, unsupervised techniques are helpful. In this paper, a hyper-quadtree-based K-means algorithm has been applied for predicting the faults in the program module. This paper contains two parts. First, the hyper-quadtree is applied on the software fault prediction dataset for the initialization of the K-means clustering algorithm. An input parameter Δ governs the initial number of clusters and cluster centers. Second, the cluster centers and the number of cluster centers obtained from the initialization algorithm are used as the input for the K-means clustering algorithm for predicting the faults in the software modules. The overall error rate of this prediction approach is compared with the other existing algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
P.S. Bishnu and V. Bhattacherjee, Member, IEEE, “Software Fault Prediction Using Quad Tree-Based K-Means Clustering Algorithm” IEEE Transactions on Knowledge and Data Engineering, Vol. 24, No. 6, June 2012.
N. Seliya, T. M. Khoshgoftaar, “Software quality estimation with limited fault labels: a supervised learning perspective”, Software quality Journal Vol. 15, no. 3, 2007, pp. 377–344.
C. Catal, B. Diri, “Investigating the effect of dataset size, metrics set, and feature selection techniques on software fault prediction problem,” Information Sciences, Vol. 179, no. 8, pp. 1040–1058, 2009.
N. Seliya, “Software quality analysis with limited prior knowledge of faults,” Graduate Seminar, Wayne State University Department of Computer Science, 2006.
J. Han and M. Kamber, “Data Mining Concepts and Techniques, second ed,” pp. 401–404. Morgan Kaufmann Publisher, 2007.
S. Zhong, T.M. Khoshgoftaar, and N. Seliya, “Unsupervised Learning for Expert- Based Software Quality Estimation,” Proc. IEEE Eighth Int’l Symp. High Assurance Systems Eng., pp. 149–155, 2004.
P.S. Bishnu and V. Bhattacherjee, “Application of K-Medoids with kd –Tree for software fault prediction,” ACM Software Eng. Note. Vol. 36, pp. 1–6, Mar. 2011.
Yannis Manolopoulos, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Yannis Theodoridis, “R-Trees: Theory and Application” pp. 1–6, Dec. 2010.
N. Seliya and T.M. Khoshgoftaar, “Software Quality Classification Modelling Using the SPRINT Decision Algorithm,” Proc. IEEE 14th Int’l Conf. Tools with Artificial Intelligence, pp. 365–374, 2002.
C. Catal, U. Sevim, and B. Diri, “Clustering and Metrics Threshold Based Software Fault Prediction of Unlabeled Program Modules,” Proc. Sixth Int’l Conf. Information Technology: New Generations, pp. 199–204, 2009.
H.Samet, The design and Analysis of Spatial Data structures. Reading, Mass Addison- Wesley, 2000.
http://promisedata.org/, 2012.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer India
About this paper
Cite this paper
Sasidharan, R., Sriram, P. (2014). Hyper-Quadtree-Based K-Means Algorithm for Software Fault Prediction. In: Krishnan, G., Anitha, R., Lekshmi, R., Kumar, M., Bonato, A., Graña, M. (eds) Computational Intelligence, Cyber Security and Computational Models. Advances in Intelligent Systems and Computing, vol 246. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1680-3_12
Download citation
DOI: https://doi.org/10.1007/978-81-322-1680-3_12
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1679-7
Online ISBN: 978-81-322-1680-3
eBook Packages: EngineeringEngineering (R0)