Skip to main content

Ancestral Maximum Likelihood of Evolutionary Trees Is Hard

  • Conference paper
Algorithms in Bioinformatics (WABI 2003)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 2812))

Included in the following conference series:

Abstract

Maximum likelihood (ML) (Felsenstein, 1981) is an increasingly popular optimality criterion for selecting evolutionary trees. Finding optimal ML trees appears to be a very hard computational task – in particular, algorithms and heuristics for ML take longer to run than algorithms and heuristics for maximum parsimony (MP). However, while MP has been known to be NP-complete for over 20 years, no such hardness result has been obtained so far for ML.

In this work we make a first step in this direction by proving that ancestral maximum likelihood (AML) is NP-complete. The input to this problem is a set of aligned sequences of equal length and the goal is to find a tree and an assignment of ancestral sequences for all of that tree’s internal vertices such that the likelihood of generating both the ancestral and contemporary sequences is maximized. Our NP-hardness proof follows that for MP given in (Day, Johnson and Sankoff, 1986) in that we use the same reduction from Vertex Cover; however, the proof of correctness for this reduction relative to AML is different and substantially more involved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Day, W.: Computational Complexity of Inferring Phylogenies from Dissimilarity Matrices. Bulletin of Mathematical Biology 49(4), 461–467 (1987)

    MATH  MathSciNet  Google Scholar 

  2. Day, W., Johnson, D., Sankoff, D.: The Computational Complexity of Inferring Rooted Phylogenies by Parsimony. Mathematical Biosciences 81, 33–42 (1986)

    Article  MATH  MathSciNet  Google Scholar 

  3. Day, W., Sankoff, D.: Computational Complexity of Inferring Phylogenies by Compatability. Systematic Zoology 35(2), 224–299 (1986)

    Article  Google Scholar 

  4. Felsenstein, J.: Evolutionary Trees from DNA Sequences: A Maximum Likelihood Approach. Journal of Molecular Evolution 17, 368–376 (1981)

    Article  Google Scholar 

  5. Fitch, W.: Towards defining the course of evolution: minimum change for a specific tree topology. Systematic Zoology 20, 406–416 (1971)

    Article  Google Scholar 

  6. Foulds, L., Graham, R.: The Steiner problem in phylogeny is NP-complete. Advances in Applied Mathematics 3, 43–49 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  7. Goldman, N.: Maximum likelihood inference of phylogenetic trees, with special reference to a Poisson process model of DNA substitution and to parsimony analyses. Systematic Zoology 39(4), 345–361 (1990)

    Article  Google Scholar 

  8. Graham, R., Foulds, L.: Unlikelihood that minimal phylogenies for a realistic biological study can be constructed in reasonable computational time. Mathematical Biosciences 60, 133–142 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  9. Koshi, M., Goldstein, R.: Probabilistic reconstruction of ancestral protein sequences. Journal of Molecular Evolution 42, 313–320 (1996)

    Article  Google Scholar 

  10. Neyman, J.: Molecular studies of evolution: A source of novel statistical problems. In: Gupta, S., Jackel, Y. (eds.) Statistical Decision Theory and Related Topics, pp. 1–27. Academic Press, New York (1971)

    Google Scholar 

  11. Pupko, T., Pe’er, I., Shamir, R., Graur, D.: A Fast Algorithm for Joint Reconstruction of Ancestral Amino Acid Sequences. Molecular Biology and Evolution 17(6), 890–896 (2000)

    Google Scholar 

  12. Pupko, T., Pe’er, I., Hasegawa, M., Graur, D., Friedman, N.: A branch-and-bound algorithm for the inference of ancestral amino-acid sequences when the replacement rate varies among sites: Application to the evolution of five gene families. Bioinformatics 18(8), 1116–1123 (2002)

    Article  Google Scholar 

  13. Sankoff, D., Cedergren, R.: Simultaneous comparison of three or more sequences related by a tree. In: Sankoff, D., Kruskal, J. (eds.) Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, pp. 253–263. Addison-Wesley Publishing Company, Reading (1983)

    Google Scholar 

  14. Steel, M.: The complexity of reconstructing trees from qualitative characters and subtrees. Journal of Classification 9, 71–90 (1992)

    Article  MATH  MathSciNet  Google Scholar 

  15. Swofford, D., Maddison, W.: Parsimony, Character-State Reconstructions, and Evolutionary Inferences. In: Mayden, R. (ed.) Systematics, Historical Ecology, and North American Freshwater Fishes, pp. 186–223. Stanford University Press, Stanford (1992)

    Google Scholar 

  16. Swofford, D., Olsen, G., Waddell, P., Hillis, D.: Phylogenetic Inference. In: Hillis, D., Moritz, C., Mable, B. (eds.) Molecular Systematics, 2nd edn., pp. 407–514. Sinauer Associates, Sunderland (1996)

    Google Scholar 

  17. Wareham, T.: On the Computational Complexity of Inferring Evolutionary Trees. Technical Report 93-01, Department of Computer Science, Memorial University of Newfoundland (1993)

    Google Scholar 

  18. Yang, Z., Kumar, S., Nei, M.: A new method of inference of ancestral nucleotide and amino acid sequences. Genetics 141, 1641–1650 (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Addario-Berry, L., Chor, B., Hallett, M., Lagergren, J., Panconesi, A., Wareham, T. (2003). Ancestral Maximum Likelihood of Evolutionary Trees Is Hard. In: Benson, G., Page, R.D.M. (eds) Algorithms in Bioinformatics. WABI 2003. Lecture Notes in Computer Science(), vol 2812. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39763-2_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39763-2_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20076-5

  • Online ISBN: 978-3-540-39763-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics