Abstract
Worldwide, breast cancer is the second leading cause of cancer deaths in women. To gain insight into the processes related to the course of the disease, human genetic data can be used to identify associations between gene expression and prognosis. Moreover, the expression data of groups of genes may be aggregated to metagenes that may be used for investigating complex diseases like breast cancer. Here we introduce a cluster analytic approach for identification of potentially relevant metagenes. In a first step of our approach we used gene expression patterns over time of erbB2 breast cancer MCF7 cell lines to obtain promising sets of genes for a metagene calculation. For this purpose, two cluster analytic approaches for short time-series of gene expression data – DIB-C and STEM – were applied to identify gene clusters with similar expression patterns. Among these we next focussed on groups of genes with transcription factor (TF) binding site enrichment or associated with a GO group. These gene clusters were then used to calculate metagenes of the gene expression data of 766 breast cancer patients from three breast cancer studies. In the last step of our approach Cox models were applied to determine the effect of the metagenes on the prognosis. Using this strategy we identified new metagenes that were associated with metastasis-free survival patients.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dortet-Bernadet JL, Wicker N (2008) Model-based clustering on the unit sphere with an illustration using gene expression profiles. Biostatistics 9(1):66–80
Elkon R, Linhart C, Sharan R, Shamir R, Shiloh Y (2003) Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res 13:773–780
Ernst J, Nau GJ, Bar-Joseph Z (2005) Clustering short time series gene expression data. Bioinformatics 21(1):159–168
Freis E, Selinski S, Weibert B, Krahn U, Schmidt M, Gehrmann M, Hermes M, Maccoux L, West J, Schwender H, Rahnenfhrer J, Hengstler J, Ickstadt K (2009) Effects of metagene calculation on survival: An integrative approach using cluster and promoter analysis. In: Sixth International Workshop on Computational Systems Biology, Tampere, Finland, TICSP series 48, pp 47–50
Glahn F, Schmidt-Heck W, Zellmer S, Guthke R, Wiese J, Golka K, Hergenroder R, Degen GH, Lehmann T, Hermes M, Schormann W, Brulport M, Bauer A, Bedawy E, Gebhardt R, Hengstler JG, Foth H (2008) Cadmium, cobalt and lead cause stress response, cell cycle deregulation and increased steroid as well as xenobiotic metabolism in primary normal human bronchial epithelial cells which is coordinated by at least nine transcription factors. Arch Toxicol 82:513–24
Hermes M (2007) Konditionale Expression von Her2/NeuT: Einfluss auf die Zell- und Tumorenentwicklung. PhD thesis, University of Leipzig
Kim J, Kim JH (2007) Difference-based clustering of short time-course microarray data with replicates. Bioinformatics 8:253
Krahn U (2008) Identifikation von Clustern in Gene-Expressions-Zeitreihen zur Analyse der Zellentwicklung. Diploma thesis, TU Dortmund
Petry IB, Fieber E, Schmidt M, Gehrmann M, Gebhard S, Hermes M, Schormann W, Selinski S, Freis E, Schwender H, Brulport M, Ickstadt K, Rahnenfuhrer J, Maccoux L, West J, Kolbl H, Schuler M, Hengstler JG (2010) ERBB2 induces an antiapoptotic expression pattern of Bcl-2 family members in node-negative breast cancer. Clin Cancer Res 16(2):451–460
R Development Core Team (2010) R: A language and environment for statistical computing. Vienna, Austria, URL http://www.R-project.org
Rody A, Holtrich U, Pusztai L, Liedtke C, Gaetje R, Ruckhaeberle E, Solbach C, Hanker L, Ahr A, Metzler D, Engels K, Karn T, Kaufmann M (2009) T-cell metagene predicts a favourable prognosis in estrogen receptor negative and HER2 positive breast cancers. Breast Cancer Res 11:R15
Schmidt M, Bohm D, von Torne C, Steiner E, Puhl A, Pilch H, Lehr HA, Hengstler JG, Kolbl H, Gehrmann M (2008) The humoral immune system has a key prognostic impact in node-negative breast cancer. Cancer Res 68:5405–5413
Shamir R, Maron-Katz A, Tanay A, Linhart C, Steinfeld I, Sharan R, Shiloh Y, Elkon R (2005) EXPANDER - an integrative program suite for microarray data analysis. BMC Bioinformatics 6:232
Slamon DJ, Clark GM, Wong SG, Levin WJ, Ullrich A, McGuire WL (1987) Human breast cancer: Correlation of relapse and survival with amplification of the HER-2/neu oncogene. Science 235:177–182
Tanay A (2005) Computational analysis of transcriptional programs: Function and evolution. PhD thesis, Tel Aviv University
Trost TM, Lausch EU, Fees SA, Schmitt S, Enklaar T, Reutzel D, Brixel LR, Schmidtke P, Maringer M, Schiffer IB, Heimerdinger CK, Hengstler JG, Fritz G, Bockamp EO, Prawitt D, Zabel BU, Spangenberg C (2005) Premature senescence is a primary fail-safe mechanism of ERBB2-driven tumorigenesis in breast carcinoma cells. Cancer Res 65:840–849
Wang X, Wu M, Li Z, Chan C (2008) Short time-series microarray analysis: Methods and challenges. BMC Syst Biol 2:58
Winter SC, Buffa FM, Silva P, Miller C, Valentine HR, Turley H, Shah KA, Cox GJ, Corbridge RJ, Homer JJ, Musgrove B, Slevin N, Sloan P, Price P, West CM, Harris AL (2007) Relation of a hypoxia metagene derived from head and neck cancer to prognosis of multiple cancers. Cancer Res 67(7):3441–3449
Acknowledgements
We would like to thank Ulrike Krahn, Marcus Schmidt, Mathias Gehrmann, Matthias Hermes, Lindsey Maccoux, Jonathan West, and Holger Schwender for collaboration and numerous helpful discussions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Freis, E., Selinski, S., Hengstler, J.G., Ickstadt, K. (2012). Cluster Analytic Strategy for Identification of Metagenes Relevant for Prognosis of Node Negative Breast Cancer. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24466-7_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-24466-7_48
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24465-0
Online ISBN: 978-3-642-24466-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)