Skip to main content

Cluster Analysis of Untargeted Metabolomic Experiments

  • Protocol
  • First Online:
Microbial Metabolomics

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1859))

Abstract

Untargeted metabolite profiling based upon LC-MS methodology can be used to identify unique metabolic phenotypes associated with stress, disease or environmental exposure of cells using mathematical clustering. Here, we show how unsupervised data analysis is a powerful tool for both quality control and answering simple biological questions. We will demonstrate how to format untargeted mass spectrometry data for import into R, a programming language and software environment for statistical computing (R Development Core Team. R: A language and environment for statistical computing, reference index version 2.15. R Foundation for Statistical Computing, Vienna, 2012). Using R, we transform untargeted metabolite data using hierarchical clustering and principal component analysis (PCA) to create visual representations of change between biological samples and explore how these can be used predictively, in determining environmental stress, health and metabolic insight.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Protocol
USD 49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Development Core Team R (2012) R: A language and environment for statistical computing, reference index version 2.15.1. R Foundation for Statistical Computing, Vienna

    Google Scholar 

  2. Patti GJ, Tautenhahn R, Siuzdak G et al (2012) Meta-analysis of untargeted metabolomic data from multiple profiling experiments. Nat Protoc 7(3):508–516

    Article  CAS  Google Scholar 

  3. Patti GJ, Yanes O, Shriver LP, Courade J, Tautenhahn R, Manchester M, Siuzdak G et al (2012) Metabolomics implicates altered sphingolipids in chronic pain of neuropathic origin. Nat Chem Biol 8(3):232–234

    Article  CAS  Google Scholar 

  4. Everitt B (1974) Cluster analysis. Heinemann Educational Books, London

    Google Scholar 

  5. Hartigan JA (1975) Clustering algorithms. Wiley, New York

    Google Scholar 

  6. Anderberg MR (1973) Cluster analysis for applications. Academic Press, New York

    Google Scholar 

  7. Murtagh F (1985) Multidimensional Clustering Algorithms. In: COMPSTAT Lectures 4. Physica-Verlag, Wuerzburg

    Google Scholar 

  8. Becker RA, Chambers JM, Wilks AR (1988) The new S language. Wadsworth & Brooks/Cole Advanced Books & Software, Monterey

    Google Scholar 

  9. Mardia KV, Kent JT, Bibby JM (1979) Multivariate analysis. Academic Press, London

    Google Scholar 

  10. Venables WN, Ripley BD (2002) Modern applied statistics with S. Springer-Verlag, Berlin

    Book  Google Scholar 

  11. Heinemann J, Hamerly T, Maaty WS, Movahed N, Steffens JD, Reeves BD, Hilmer JK, Therien J, Grieco PA, Peters JW, Bothner B et al (2014) Expanding the paradigm of thiol redox in the thermophilic root of life. Biochim Biophys Acta 1840:80–85

    Article  CAS  Google Scholar 

  12. Maaty WS, Wiedenheft B, Tarlykov P, Schaff N, Heinemann J, Robison-Cox J, Valenzuela J, Bothner B et al (2009) Something old, something new, something borrowed; how the thermoacidophilic archaeon Sulfolobus solfataricus responds to oxidative stress. PLoS One 4(9):e6964

    Article  Google Scholar 

  13. Gordon AD (1999) Classification. Chapman and Hall / CRC, London

    Google Scholar 

  14. McQuitty LL (1966) Similarity analysis by reciprocal pairs for discrete and continuous data. Educ Psychol Meas 26:825–831

    Article  Google Scholar 

  15. Kessner D, Chambers M, Burke R, Agus D, Mallick P et al (2008) ProteoWizard: open source software for rapid proteomics tools development. Bioinformatics 24(21):2534–2536

    Article  CAS  Google Scholar 

  16. Tautenhahn R, Böttcher C, Neumann S et al (2008) Highly sensitive feature detection for high resolution LC/MS. BMC bioinformatics 9:504

    Article  Google Scholar 

  17. Yanes O, Tautenhahn R, Patti GJ, Siuzdak G et al (2011) Expanding coverage of the metabolome for global metabolite profiling. Anal Chem 83(6):2152–2161

    Article  CAS  Google Scholar 

  18. https://stat.ethz.ch/R-manual/R-devel/library/stats/html/prcomp.html

  19. https://www.rdocumentation.org/packages/rgl/versions/0.97.0/topics/plot3d

  20. http://stat.ethz.ch/R-manual/R-devel/library/stats/html/hclust.html

Download references

Acknowledgments

The authors would also like to acknowledge that this work was part of the DOE Joint BioEnergy Institute (http://www.jbei.org) supported by the US Department of Energy, Office of Science, Office of Biological and Environmental Research, through contract DE-AC02-05CH11231 between Lawrence Berkeley National Laboratory and the US Department of Energy.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joshua Heinemann .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Heinemann, J. (2019). Cluster Analysis of Untargeted Metabolomic Experiments. In: Baidoo, E. (eds) Microbial Metabolomics. Methods in Molecular Biology, vol 1859. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-8757-3_16

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-8757-3_16

  • Published:

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-8756-6

  • Online ISBN: 978-1-4939-8757-3

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics