A Genetic Approach to Data Dimensionality Reduction Using a Special Initial Population

Tümer, M. Borahan; Demir, Mert C.

doi:10.1007/11499305_32

M. Borahan Tümer¹⁸ &
Mert C. Demir¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3562))

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

2103 Accesses

Abstract

Accurate classification of data sets is an important phenomenon for many applications. While multi-dimensionality to a certain point contributes to the classification performance, after a point, incorporating more attributes degrades the quality of the classification. In a pattern classification problem, by determining and excluding the least effective attribute(s) the performance of the classification is likely to improve. The task of the elimination of the least effective attributes in pattern classification is called ”data dimensionality reduction (DDR)”. DDR using Genetic Algorithms (DDR-GA) aims at discarding the less useful dimensions and re-organizing the data set by means of genetic operators. We show that a wise selection of the initial population improves the performance of the DDR-GA considerably and introduce a method to implement this approach. Our approach focuses on using information obtained a priori for the selection of initial chromosomes. Our work then compares the performance of the GA initiated by a randomly selected initial population to the performance of the ones initiated by a wisely selected one. Furthermore, the results indicate that our approach provides more accurate results compared to the purely random one in a reasonable amount of time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alpaydin, E.: Introduction to Machine Learning. MIT Press, Cambridge (2004)
Google Scholar
Bishop, C.M.: Neural networks for pattern recognition. Oxford University Press, Oxford (1995)
Google Scholar
Bollacker, K.D., Ghosh, J.: Mutual information feature extractors for neural classifiers, IEEE Int. Conf. Neural Networks 3, 1528–1533 (1996)
Google Scholar
Eads, D.R., Williams, S.J., Theiler, J., Porter, R., Harvey, N.R., Perkins, S.J., Brumby, S.P., David, N.A.: A multimodal approach to feature extraction for image and signal learning problems. In: Proceedings of SPIE (2004)
Google Scholar
Fu, X., Wang, L.: Data Dimensionmality Reduction with Application to Simplify RBF Network Sutructure and Improving Classificaiton Performance. IEEE Transactions on Systems, Man and Cybernetics: Part B 33(3), 399–409 (2003)
Article Google Scholar
Goldberg, D.E.: Genetic algorithms in search, optimization and machine learning. Addison-Wesley, Reading (1989)
MATH Google Scholar
Hallinan, J., Jackway, P.: Simultaneous evolution of feature subset and neural classifier on high-dimensional data. In: Conference on Digital Image Computing and Applications (1999)
Google Scholar
Kambhatla, N., Leen, T.K.: Dimension reduction by local principal component analysis. Neural Computation 9, 1493–1516 (1997)
Article Google Scholar
Kononenko, I.: Estimating Attributes: Analysis and Extensions of RELIEF. In: Bergadano, F., De Raedt, L. (eds.) ECML 1994. LNCS, vol. 784, pp. 171–182. Springer, Heidelberg (1994)
Google Scholar
Lopez-Rubio, E., de Lazcano-Lobato, J.M.O., Munoz-Perez, J., Antonio Gomez-Ruiz, J.: Principal components analysis competitive learning. Neural Computation 16(11), 2459–2481 (2004)
Article MATH Google Scholar
Murphy, P.M., Aha, D.: UCI repository of machine learning databases (1994), http://www.ics.uci.edu/~mlearn/MLRepository.html
Ramos, V., Muge, F.: Less is more: Genetic optimisation of nearest neighbour classifiers. In: F. Muge, C. Pinto, and M. Piedade, editors, 10th Portuguese Conference on Pattern Recognition, pp. 293–301. Technical University of Lisbon (March 1988)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: an Introduction. MIT Press, Cambridge (1999)
Google Scholar

Download references

Author information

Authors and Affiliations

Marmara University Faculty of Engineering, Göztepe Campus, 34722, Kadiköy, İstanbul, Turkey
M. Borahan Tümer & Mert C. Demir

Authors

M. Borahan Tümer
View author publications
You can also search for this author in PubMed Google Scholar
Mert C. Demir
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

E.T.S.I. Informática, Universidad Nacional de Educación a Distancia, 28040, Madrid, Spain
José Mira
E.T.S. de Ingeniería Informática, Departamento de Intelifencia Artificial, Universidad Nacional de Educación a Distancia, Juan del Rosal, 16, 28040, Madrid, Spain
José R. Álvarez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tümer, M.B., Demir, M.C. (2005). A Genetic Approach to Data Dimensionality Reduction Using a Special Initial Population. In: Mira, J., Álvarez, J.R. (eds) Artificial Intelligence and Knowledge Engineering Applications: A Bioinspired Approach. IWINAC 2005. Lecture Notes in Computer Science, vol 3562. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499305_32

Download citation

DOI: https://doi.org/10.1007/11499305_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26319-7
Online ISBN: 978-3-540-31673-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics