Abstract
Consistency checking is a fundamental computational problem in genetics. Given a pedigree and information on the genotypes of (some of) the individuals in it, the aim of consistency checking is to determine whether these data are consistent with the classic Mendelian laws of inheritance. This problem arose originally from the geneticists' need to filter erroneous information out of their input data, and is well motivated from both a biological and a sociological viewpoint. This paper shows that consistency checking is NP-complete, even when attention is restricted to a single gene and only three alleles are present. Several other results on the computational complexity of problems from genetics that are related to consistency checking are also offered. In particular, it is shown that checking the consistency of pedigrees over two alleles, and of pedigrees without loops, can be done in polynomial time.
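To make the problem statement concrete, the following is a minimal brute-force sketch of single-gene consistency checking. It is not the paper's construction or any of its algorithms; the function names, the dictionary-based pedigree encoding, and the genotype representation (an unordered pair of alleles stored as a 2-tuple) are all assumptions made for illustration. The search tries every genotype assignment for the untyped individuals, so its running time is exponential in their number.

```python
from itertools import combinations_with_replacement, product


def mendelian_ok(child, father, mother):
    """True if the child can receive one allele from each parent."""
    a, b = child
    return (a in father and b in mother) or (b in father and a in mother)


def is_consistent(parents, known, alleles):
    """Brute-force single-gene consistency check (illustrative only).

    parents: dict mapping every individual to (father, mother),
             or to None for founders
    known:   dict mapping the typed individuals to a genotype,
             given as a 2-tuple of alleles
    alleles: iterable of allele names
    """
    # All unordered genotypes over the allele set, e.g. AA, AB, BB.
    genotypes = list(combinations_with_replacement(sorted(alleles), 2))
    unknown = [p for p in parents if p not in known]

    # Try every assignment of genotypes to the untyped individuals.
    for choice in product(genotypes, repeat=len(unknown)):
        assignment = {**known, **dict(zip(unknown, choice))}
        if all(
            mendelian_ok(assignment[c], assignment[pm[0]], assignment[pm[1]])
            for c, pm in parents.items()
            if pm is not None
        ):
            return True
    return False


# A hypothetical nuclear family: two founders and one child.
family = {"father": None, "mother": None, "child": ("father", "mother")}
typed = {"mother": ("A", "A"), "child": ("A", "B")}
print(is_consistent(family, typed, {"A", "B"}))
```

In this usage example the father is untyped, so the check succeeds as soon as some genotype for him (for instance one carrying allele B) explains the child's data.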
Author information
Luca Aceto is an associate professor at the Department of Computer Science, Aalborg University, and is a board member of BRICS. He has previously held various academic positions at the University of Sussex (UK), INRIA-Sophia Antipolis (France), Hewlett-Packard Laboratories (Pisa Science Center, Italy), and BRICS. His main research interests include: semantics of concurrency, logic in computer science, applications of equational logic in computer science, structural operational semantics, and computational complexity of verification problems and of problems in bioinformatics.
Anna Ingólfsdóttir is an associate professor at the Department of Computer Science, Aalborg University. She also works as a researcher at deCODE Genetics, and is a board member of BRICS. She has previously held various visiting academic positions at the University of Sussex (UK) and the University of Florence (Italy). Her research interests include: semantics of concurrency, linkage analysis, bioinformatics, logic in computer science, applications of equational logic in computer science, structural operational semantics, and computational complexity of verification problems and of problems in bioinformatics.
Jens A. Hansen, Jacob Johnsen and John Knudsen hold Master's degrees in computer science from Aalborg University.
About this article
Cite this article
Aceto, L., Hansen, J.A., Ingólfsdóttir, A. et al. The complexity of checking consistency of pedigree information and related problems. J. Comput. Sci. & Technol. 19, 42–59 (2004). https://doi.org/10.1007/BF02944784