Abstract
A modified variable string length genetic algorithm, called MVGA, is proposed for text clustering in this paper. Our algorithm has been exploited for automatically evolving the optimal number of clusters as well as providing proper data set clustering. The chromosome is encoded by special indices to indicate the location of each gene. More effective version of evolutional steps can automatically adjust the influence between the diversity of the population and selective pressure during generations. The superiority of the MVGA over conventional variable string length genetic algorithm (VGA) is demonstrated by providing proper text clustering.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Maulik, Bandyopadhyay: Genetic Algorithm Based Clustering Technique. Pattern Recognition 33(9), 1455–1465 (2000)
Bandyopadhyay, S., Mauilk, U.: Nonparametric Genetic Clustering: Comparison of Validity Indices. IEEE Transactions on System, Man, and Cybernetics-Part C Applications and Reviews 31(1) (2001)
Mauilk, U., Bandyopadhyay, S.: Performance Evaluation of Some Clustering Algorithms and Validity Indices. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(12) (2002)
Yao, X., liu, Y., Lin, G.: Evolutionary Programming Made Faster. IEEE Trans, Evolutionary Computation 3(2) (1999)
Davies, D.L., Bouldin, D.W.: A Cluster Separation Measure. IEEE Trans. Patt. Anal. Mach. Intell. 1, 224–227, pp. ABSTRACT–INSPEC (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Song, W., Park, S.C. (2006). Genetic Algorithm-Based Text Clustering Technique. In: Jiao, L., Wang, L., Gao, Xb., Liu, J., Wu, F. (eds) Advances in Natural Computation. ICNC 2006. Lecture Notes in Computer Science, vol 4221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11881070_103
Download citation
DOI: https://doi.org/10.1007/11881070_103
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45901-9
Online ISBN: 978-3-540-45902-6
eBook Packages: Computer ScienceComputer Science (R0)