Sample Classification Based on Gene Subset Selection

Das, Sunanda; Das, Asit Kumar

doi:10.1007/978-81-322-2734-2_24

Sunanda Das⁴ &
Asit Kumar Das⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 410))

1005 Accesses

Abstract

Microarray datasets contain genetic information of patients analysis of which can reveal new findings about the cause and subsequent treatment of any disease. With an objective to extract biologically relevant information from the datasets, many techniques are used in gene analysis. In the paper, the concepts like functional dependency and closure of an attribute of database technology are applied to find the most important gene subset and based on which the samples of the gene datasets are classified as normal and disease samples. The gene dependency is defined as the number of genes dependent on a particular gene using gene similarity measurement on collected samples. The closure of a gene is computed using gene dependency set which helps to know how many genes are logically implied by it. Finally, the minimum number of genes whose closure logically implies all the genes in the dataset is selected for sample classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alon, U., Barkai, N., Notterman, D.A.: Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. PNAS 96, 6745–6750 (1999)
Article Google Scholar
Ding, C., Dubchak, I.: Multi-class protein fold recognition using support vector machines and neural networks. Bioinformatics 17, 349–358 (2001)
Article Google Scholar
Tsky-Shapiro, G., Smyth, P., Uthurusamy, R.: From Data Mining to Knowledge Discovery: An Overview in Advances in Knowledge Discovery and Data Mining, pp. 1–36. (1996)
Google Scholar
Lavrajc, N., Keravnou, E., Zupan, B.: Intelligent Data Analysis in Medicine and Pharmacology. Kluwer Academic Publishers (1997)
Google Scholar
Wolf, S., Oliver, H., Herbert, S., Michael, M.: Intelligent data mining for medical quality management. In: Proceedings of the Fifth International Workshop on Intelligent Data Analysis in Medicine and Pharmacology, Berlin, Germany (2000)
Google Scholar
Ye, C.Z., Yang, J., Geng, D.Y., Zhou, Y., Chen, N.Y.: Fuzzy rules to predict degree of malignancy in brain glioma. Med. Biol. Comput. Eng. 40(2), 145–152 (2002)
Google Scholar
Das, S., Das, A.K.: An approach towards most cancerous gene Selection from microarray data. ICCIDM 3, 641–648 (2014)
Google Scholar
Das, A.K., Pati, S.K.: Gene subset selection for cancer classification using statistical and rough set approach, pp. 294–302. Evol. Memetic Comput., Swarm (2012)
Google Scholar
Das, A.K., Pati, S.K., Chakrabarty, S.: Reduct generation of microarray dataset using rough set and graph theory for unsupervised learning. In: Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology, pp. 555–561. (2012)
Google Scholar
Kerber, R., ChiMerge.: Discretization of numeric attributes. In: Proceedings of AAAI-92. Ninth International Conference on Artificial Intelligence, pp. 123–128. AAAI-Press, (1992)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Neotia Institute of Technology, Management and Science, Diamond Harbour, South 24-Pargana, 743368, India
Sunanda Das
Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, 711103, India
Asit Kumar Das

Authors

Sunanda Das
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sunanda Das .

Editor information

Editors and Affiliations

Veer Surendra Sai University of Tech., Dept of Computer Sci Eng & IT, Sambalpur, Odisha, India
Himansu Sekhar Behera
National Institute of Technology, Dept. of Computer Science & Engineering, Rourkela, Odisha, India
Durga Prasad Mohapatra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Das, S., Das, A.K. (2016). Sample Classification Based on Gene Subset Selection. In: Behera, H., Mohapatra, D. (eds) Computational Intelligence in Data Mining—Volume 1. Advances in Intelligent Systems and Computing, vol 410. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2734-2_24

Download citation

DOI: https://doi.org/10.1007/978-81-322-2734-2_24
Published: 09 December 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2732-8
Online ISBN: 978-81-322-2734-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics