Abstract
Chromosomal aneuploidy and segmental copy number changes are common genomic aberrations in cancer. Copy number alterations (CNAs) arise from deletions, insertions, or duplications resulting in chromosomal aberrations and aneuploidy. Genomes of normal cells also exhibit variable copy number called germline copy number variants (CNVs). CNVs in the general population tend to confound interpretation of predictions when attempting to extract relevant driver somatic events in cancer. In large studies of CNAs in cancer patients, it becomes necessary to accurately identify and separate CNAs and CNVs so as to prioritize candidate tumor suppressors and oncogenes. We have developed a probabilistic approach, HMM-Dosage, for segmenting and distinguishing CNAs and CNVs as separate, discrete events in cancer SNP genotyping array data. We outline the steps and computer code for the analysis of whole-genome cancer DNA hybridized to SNP genotyping arrays, focusing on distinguishing somatic CNA and germline CNVs, and describe the combined approach of HMM-Dosage for probabilistic inference and classification of somatic and germline copy number changes.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Stratton MR, Campbell PJ, Futreal PA (2009) The cancer genome. Nature 458:719–724
Negrini S, Gorgoulis VG, Halazonetis TD (2010) Genomic Instability—an evolving hallmark of cancer. Nat Rev Mol Cell Biol 11:220–228
Cancer Genome Atlas Research Network (2008) Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455:1061–1068
Cancer Genome Atlas Research Network (2011) Integrated genomic analyses of ovarian carcinoma. Nature 474:609–615
Bignell GR, Greenman CD, Davies H et al (2010) Signatures of mutation and selection in the cancer genome. Nature 463:893–898
Beroukhim R, Mermel CH, Porter D et al (2010) The landscape of somatic copy-number alteration across human cancers. Nature 463:899–905
Chin SF, Teschendorff AE, Marioni JC et al (2007) High-resolution aCGH and expression profiling identifies a novel genomic subtype of ER negative breast cancer. Genome Biol 8:R215
Curtis C, Shah SP, Chin S et al (2012) The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature 486:346–352
Sebat J, Lakshmi B, Troge J et al (2004) Large-scale copy number polymorphism in the human genome. Science 305:525–528
Tuzun E, Sharp AJ, Bailey JA et al (2005) Fine-scale structural variation of the human genome. Nat Genet 37:727–732
Redon R, Ishikawa S, Fitch KR et al (2006) Global variation in copy number in the human genome. Nature 444:444–454
Kidd JM, Cooper GM, Donahue WF et al (2008) Mapping and sequencing of structural variation from eight human genomes. Nature 453:56–64
Conrad DF, Pinto D, Redon R et al (2010) Origins and functional impact of copy number variation in the human genome. Nature 464:704–712
Sharp AJ, Locke DP, McGrath SD et al (2005) Segmental duplications and copy-number variation in the human genome. Am J Hum Genet 77:78–88
Genomes Project Consortium, Durbin RM, Abecasis GCR et al (2010) A map of human genome variation from population-scale sequencing. Nature 467:1061–1073
Mills RE, Walter K, Stewart C et al (2011) Mapping copy number variation by population-scale genome sequencing. Nature 470:59–65
Iafrate AJ, Feuk L, Rivera MN et al (2004) Detection of large-scale variation in the human genome. Nat Genet 36:949–951
Friedman JM, Baross A, Delaney AD et al (2006) Oligonucleotide microarray analysis of genomic imbalance in children with mental retardation. Am J Hum Genet 79:500–513
Sebat J, Lakshmi B, Malhotra D et al (2007) Strong association of de novo copy number mutations with autism. Science 316:445–449
Lee C, Iafrate AJ, Brothman AR (2007) Copy number variations and clinical cytogenetic diagnosis of constitutional disorders. Nat Genet 39:S48–S54
Sharp AJ, Mefford HC, Li K et al (2008) A recurrent 15q13.3 microdeletion syndrome associated with mental retardation and seizures. Nat Genet 40:322–328
The International HapMap 3 Consortium and Principal investigators and Altshuler, David M and Gibbs, Richard A and Peltonen, Leena and Project coordination leaders and Altshuler, David M and Gibbs, Richard A and Peltonen, Leena and Dermitzakis, Emmanouil and Manuscript writing group and Schaffner, Stephen F and Yu, Fuli and Peltonen, Leena and Dermitzakis, Emmanouil and Bonnen, Penelope E and Altshuler, David M and Gibbs, Richard A and Genotyping, QC, de Bakker, Co-Leader, Paul IW et al (2010) Integrating common and rare genetic variation in diverse human populations. Nature 467:52–58
Bengtsson H, Irizarry R, Carvalho B et al (2008) Estimation and assessment of raw copy numbers at the single locus level. Bioinformatics 24:759–767
Bengtsson H, Wirapati P, Speed TP (2009) A single-array preprocessing method for estimating full-resolution raw copy numbers from all affymetrix genotyping arrays including GenomeWideSNP 5 & 6. Bioinformatics 25:2149–2156
Ortiz-Estevez M, Bengtsson H, Rubio A (2010) ACNE: a summarization method to estimate allele-specific copy numbers for Affymetrix SNP arrays. Bioinformatics 26:1827–1833
Scharpf RB, Ruczinski I, Carvalho B et al (2011) A multilevel model to address batch effects in copy number estimation using SNP arrays. Biostatistics 12:33–50
Ritchie ME, Carvalho BS, Hetrick KN et al (2009) R/Bioconductor software for Illumina’s Infinium whole-genome genotyping BeadChips. Bioinformatics 25:2621–2623
Staaf J, Vallon-Christersson J, Lindgren D et al (2008) Normalization of Illumina Infinium whole-genome SNP data improves copy number estimates and allelic intensity ratios. BMC Bioinformatics 9:409–409
Peiffer DA, Le JM, Steemers FJ et al (2006) High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. Genome Res 16:1136–1148
Venkatraman ES, Olshen AB (2007) A faster circular binary segmentation algorithm for the analysis of array CGH data. Bioinformatics 23:657–663
Olshen AB, Venkatraman ES, Lucito R et al (2004) Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5:557–572
Shah SP, Xuan X, DeLeeuw RJ et al (2006) Integrating copy number polymorphisms into array CGH analysis using a robust HMM. Bioinformatics 22:e431–e439
Greenman CD, Bignell G, Butler A et al (2010) PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data. Biostatistics 11:164–175
Yau C, Mouradov D, Jorissen RN et al (2010) A statistical approach for detecting genomic aberrations in heterogeneous tumor samples from single nucleotide polymorphism genotyping data. Genome Biol 11:R92
Li A, Liu Z, Lezon-Geyda K et al (2011) GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays. Nucleic Acids Res 39:4928–4941
Ostrovnaya I, Nanjangud G, Olshen AB (2010) A classification model for distinguishing copy number variants from cancer-related alterations. BMC Bioinformatics 11:297–297
International HapMap Consortium, Frazer KA, Ballinger DG et al (2007) A second generation human haplotype map of over 3.1 million SNPs. Nature 449:851–861
Shah SP, Roth A, Goya R et al (2012) The clonal and mutational evolution spectrum of primary triple-negative breast cancers. Nature 486:395–399
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Ha, G., Shah, S. (2013). Distinguishing Somatic and Germline Copy Number Events in Cancer Patient DNA Hybridized to Whole-Genome SNP Genotyping Arrays. In: Banerjee, D., Shah, S. (eds) Array Comparative Genomic Hybridization. Methods in Molecular Biology, vol 973. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-62703-281-0_22
Download citation
DOI: https://doi.org/10.1007/978-1-62703-281-0_22
Published:
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-62703-280-3
Online ISBN: 978-1-62703-281-0
eBook Packages: Springer Protocols