Abstract
Larger numbers of germline cell divisions can increase the number of mutations inherited by offspring. Therefore, in systems where the number of offspring is dependent on the number of germline cell divisions, a higher overall rate of molecular evolution may be expected. Here, I examine whether colony size in social insects, which varies from tens to millions, influences molecular evolutionary rates via this mechanism. Comparative analyses of whole genomes from three clades of social insects, including eight species in the ant genus Pseudomyrmex, seven fungus-growing ants, and 11 bees, reveal that rates of molecular evolution are positively correlated with colony size. The additional germline cell divisions necessary to maintain large colony sizes may lead to mutation accumulation in the germlines of queens of these species, a process similar to that which occurs in aging human males. Among species with large colonies, I also find a weak signal of intensified constraint on DNA repair genes. This pattern suggests the intriguing possibility that the additional mutations that occur in these taxa may increase selective pressure for replication fidelity. Finally, I find that colony size is negatively associated with GC-content in five highly conserved genes across 115 ant genera, a pattern consistent with a positive relationship between substitution rate and colony size. Colony size, a fundamental facet of eusociality, appears to play a previously unappreciated role in genome evolution.
Similar content being viewed by others
Data availability statement
Genome sequences have been deposited in the NCBI genomes database under BioProject number PRJNA268384.
References
Bartosch-Härlid A, Berlin S, Smith NGC, Møller AP, Ellegren H (2003) Life history and the male mutation bias. Evolution 57:2398–2406. https://doi.org/10.1554/03-036
Blanchard BD, Moreau CS (2017) Defensive traits exhibit an evolutionary trade-off and drive diversification in ants. Evolution 71:315–328. https://doi.org/10.1111/evo.13117
Boyle EI, Weng S, Gollub J, Jin H, Botstein D, Cherry JM, Sherlock G (2004) GO::TermFinder–open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics 20:3710–3715. https://doi.org/10.1093/bioinformatics/bth456
Bradford PT, Goldstein AM, Tamura D, Khan SG, Ueda T, Boyle J, Oh K-S, Imoto K, Inui H, Moriwaki S-I et al (2011) Cancer and neurologic degeneration in xeroderma pigmentosum: long term follow-up characterises the role of DNA repair. J Med Genet 48:168–176. https://doi.org/10.1136/jmg.2010.083022
Bradley RK, Roberts A, Smoot M, Juvekar S, Do J, Dewey C, Holmes I, Pachter L (2009) Fast statistical alignment. PLoS Comput Biol 5:e1000392. https://doi.org/10.1371/journal.pcbi.1000392
Brady SG, Schultz TR, Fisher BL, Ward PS (2006) Evaluating alternative hypotheses for the early evolution and diversification of ants. Proc Natl Acad Sci 103:18172–18177. https://doi.org/10.1073/pnas.0605858103
Brand P, Saleh N, Pan H, Li C, Kapheim KM, Ramírez SR (2017) The nuclear and mitochondrial genomes of the facultatively eusocial orchid bee Euglossa dilemma. G3-Genes Genomes Genet 7:2891–2898. https://doi.org/10.1534/g3.117.043687
Branstetter MG, Danforth BN, Pitts JP, Faircloth BC, Ward PS, Buffington ML, Gates MW, Kula RR, Brady SG (2017) Phylogenomic insights into the evolution of stinging wasps and the origins of ants and bees. Curr Biol 27:1019–1025. https://doi.org/10.1016/j.cub.2017.03.027
Bromham L (2009) Why do species vary in their rate of molecular evolution? Biol Lett 5:401–404. https://doi.org/10.1098/rsbl.2009.0136
Bromham L (2011) The genome as a life-history character: why rate of molecular evolution varies between mammal species. Philos Trans R Soc B Biol Sci 366:2503–2513. https://doi.org/10.1098/rstb.2011.0014
Bromham L, Cowman PF, Lanfear R (2013) Parasitic plants have increased rates of molecular evolution across all three genomes. BMC Evol Biol 13:126. https://doi.org/10.1186/1471-2148-13-126
Bromham L, Leys R (2005) Sociality and the rate of molecular evolution. Mol Biol Evol 22:1393–1402. https://doi.org/10.1093/molbev/msi133
Büning J (1994) The Insect Ovary: Ultrastructure, previtellogenic growth and evolution, 1st edn. Springer, Dordrecht
Cantarel BL, Korf I, Robb SMC, Parra G, Ross E, Moore B, Holt C, Sanchez Alvarado A, Yandell M (2007) MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res 18:188–196. https://doi.org/10.1101/gr.6743907
Castresana J (2000) Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 17:540–552. https://doi.org/10.1093/oxfordjournals.molbev.a026334
Chikina M, Robinson JD, Clark NL (2016) Hundreds of genes experienced convergent shifts in selective pressure in marine mammals. Mol Biol Evol 33:2182–2192. https://doi.org/10.1093/molbev/msw112
Chomicki G, Ward PS, Renner SS (2015) Macroevolutionary assembly of ant/plant symbioses: Pseudomyrmex ants and their ant-housing plants in the Neotropics. Proc R Soc B Biol Sci 282:20152200. https://doi.org/10.1098/rspb.2015.2200
Creighton WS (1955) Observations on Pseudomyrmex elongata Mayr (Hymenoptera: Formicidae). J N Y Entomol Soc 6:17–20
Crow JF (1997) The high spontaneous mutation rate: is it a health risk? Proc Natl Acad Sci 94:8380–8386. https://doi.org/10.1073/pnas.94.16.8380
Crozier R (1979) Genetics of sociality. Soc Insects. Academic Press, New York, pp 223–286
Drost JB, Lee WR (2006) Biological basis of germline mutation: comparisons of spontaneous germline mutation rates among drosophila, mouse, and human. Environ Mol Mutagen 25:48–64. https://doi.org/10.1002/em.2850250609
Elsik CG, Worley KC, Bennett AK, Beye M, Camara F, Childers CP, de Graaf DC, Debyser G, Deng J, Devreese B et al (2014) Finding the missing honey bee genes: lessons learned from a genome upgrade. BMC Genomics 15:86. https://doi.org/10.1186/1471-2164-15-86
Fonseca CR (1993) Nesting space limits colony size of the plant-ant Pseudomyrmex concolor. Oikos 67:473. https://doi.org/10.2307/3545359
Gao Z, Moorjani P, Sasani TA, Pedersen BS, Quinlan AR, Jorde LB, Amster G, Przeworski M (2019) Overlooked roles of DNA damage and maternal age in generating human germline mutations. Proc Natl Acad Sci 116:9491–9500. https://doi.org/10.1073/pnas.1901259116
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q et al (2011) Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol 29:644–652. https://doi.org/10.1038/nbt.1883
Graur D (1985) Gene diversity in Hymenoptera. Evolution 39:190–199. https://doi.org/10.1111/j.1558-5646.1985.tb04091.x
Haas BJ (2003) Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res 31:5654–5666. https://doi.org/10.1093/nar/gkg770
Haas BJ, Papanicolaou A, Yassour M, Grabherr M, Blood PD, Bowden J, Couger MB, Eccles D, Li B, Lieber M et al (2013) De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc 8:1494–1512. https://doi.org/10.1038/nprot.2013.084
Hoff KJ, Lange S, Lomsadze A, Borodovsky M, Stanke M (2016) BRAKER1: Unsupervised RNA-Seq-based genome annotation with GeneMark-ET and AUGUSTUS. Bioinformatics 32:767–769. https://doi.org/10.1093/bioinformatics/btv661
Hollstein MC, Metcalf RA, Welsh JA, Montesano R, Harris CC (1990) Frequent mutation of the p53 gene in human esophageal cancer. Proc Natl Acad Sci 87:9958–9961. https://doi.org/10.1073/pnas.87.24.9958
Holt C, Yandell M (2011) MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinform 12:491. https://doi.org/10.1186/1471-2105-12-491
Hua X, Bromham L (2017) Darwinism for the genomic age: connecting mutation to diversification. Front Genet 8:12. https://doi.org/10.3389/fgene.2017.00012
Janzen DH (1973) Evolution of polygynous obligate acacia-ants in western Mexico. J Anim Ecol 42:727. https://doi.org/10.2307/3134
Jones P, Binns D, Chang H-Y, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G et al (2014) InterProScan 5: genome-scale protein function classification. Bioinformatics 30:1236–1240. https://doi.org/10.1093/bioinformatics/btu031
Jónsson H, Sulem P, Kehr B, Kristmundsdottir S, Zink F, Hjartarson E, Hardarson MT, Hjorleifsson KE, Eggertsson HP, Gudjonsson SA et al (2017) Parental influence on human germline de novo mutations in 1,548 trios from Iceland. Nature 549:519–522. https://doi.org/10.1038/nature24018
Kapheim KM, Pan H, Li C, Salzberg SL, Puiu D, Magoc T, Robertson HM, Hudson ME, Venkat A, Fischman BJ et al (2015) Genomic signatures of evolutionary transitions from solitary to group living. Science 348:1139–1143. https://doi.org/10.1126/science.aaa4788
Keller L (1998) Queen lifespan and colony characteristics in ants and termites. Insectes Soc 45:235–246. https://doi.org/10.1007/s000400050084
Keller L, Genoud M (1997) Extraordinary lifespans in ants: a test of evolutionary theories of ageing. Nature 389:958–960. https://doi.org/10.1038/40130
Kim D, Langmead B, Salzberg SL (2015) HISAT: a fast spliced aligner with low memory requirements. Nat Methods 12:357–360. https://doi.org/10.1038/nmeth.3317
Kong A, Frigge ML, Masson G, Besenbacher S, Sulem P, Magnusson G, Gudjonsson SA, Sigurdsson A, Aslaug J, Adalbjorg J et al (2012) Rate of de novo mutations and the importance of father’s age to disease risk. Nature 488:471–475. https://doi.org/10.1038/nature11396
Kowalczyk A, Meyer WK, Partha R, Mao W, Clark NL, Chikina M (2019) RERconverge: an R package for associating evolutionary rates with convergent traits. Bioinformatics 35:4815–4817. https://doi.org/10.1093/bioinformatics/btz468
Kramer BH, Schaible R (2013) Colony size explains the lifespan differences between queens and workers in eusocial Hymenoptera. Biol J Linn Soc 109:710–724. https://doi.org/10.1111/bij.12072
Lanfear R, Kokko H, Eyre-Walker A (2014) Population size and the rate of evolution. Trends Ecol Evol 29:33–41. https://doi.org/10.1016/j.tree.2013.09.009
Lanfear R, Welch JJ, Bromham L (2010) Watching the clock: studying variation in rates of molecular evolution between species. Trends Ecol Evol 25:495–503. https://doi.org/10.1016/j.tree.2010.06.007
Lechner M, Findeis S, Steiner L, Marz M, Stadler PF, Prohaska SJ (2011) Proteinortho: detection of (co-) orthologs in large-scale analysis. BMC Bioinform 12:124. https://doi.org/10.1186/1471-2105-12-124
Li W-H, Ellsworth DL, Krushkal J, Chang BH-J, Hewett-Emmett D (1996) Rates of nucleotide substitution in primates and rodents and the generation–time effect hypothesis. Mol Phylogenet Evol 5:182–187. https://doi.org/10.1006/mpev.1996.0012
Lynch M (2007) The origins of genome architecture, 1st edn. Sinauer Associates, Sunderland
Mohrenweiser HW, Wilson DM, Jones IM (2003) Challenges and complexities in estimating both the functional impact and the disease risk associated with the extensive genetic variation in human DNA repair genes. Mutat Res Mol Mech Mutagen 526:93–125. https://doi.org/10.1016/S0027-5107(03)00049-6
Moreau CS, Bell CD (2013) Testing the museum versus cradle biological diversity hypothesis: phylogeny, diversification, and ancestral biogeographic range evolution of the ants. Evolution 67:2240–2257. https://doi.org/10.1111/evo.12105
Moreau CS, Bell CD, Vila R, Archibald SB, Pierce NE (2006) Phylogeny of the ants: diversification in the age of angiosperms. Science 312:101–104. https://doi.org/10.1126/science.1124891
Nygaard S, Hu H, Li C, Schiøtt M, Chen Z, Yang Z, Xie Q, Ma C, Deng Y, Dikow RB et al (2016) Reciprocal genomic evolution in the ant–fungus agricultural symbiosis. Nat Commun 7:12233. https://doi.org/10.1038/ncomms12233
Ohta T (1987) Very slightly deleterious mutations and the molecular clock. J Mol Evol 26:1–6. https://doi.org/10.1007/BF02111276
Pamilo P (1991) Life span of queens in the ant Formica exsecta. Insectes Soc 38:111–119. https://doi.org/10.1007/BF01240961
Pamilo P, Crozier RH (1997) Population biology of social insect conservation. Mem Mus Vic 56:411–419. https://doi.org/10.24199/j.mmv.1997.56.32
Paradis E, Schliep K (2019) ape 5.0: an environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics 35:526–528. https://doi.org/10.1093/bioinformatics/bty633
Partha R, Chauhan BK, Ferreira Z, Robinson JD, Lathrop K, Nischal KK, Chikina M, Clark NL (2017) Subterranean mammals show convergent regression in ocular genes and enhancers, along with adaptation to tunneling. Elife 6:e25884. https://doi.org/10.7554/eLife.25884
Peng Y, Leung HCM, Yiu SM, Chin FYL (2012) IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics 28:1420–1428. https://doi.org/10.1093/bioinformatics/bts174
Pennell MW, Eastman JM, Slater GJ, Brown JW, Uyeda JC, FitzJohn RG, Alfaro ME, Harmon LJ (2014) geiger v2.0: an expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics 30:2216–2218. https://doi.org/10.1093/bioinformatics/btu181
R Core Team (2018) R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/
Ramírez SR, Roubik DW, Skov C, Pierce NE (2010) Phylogeny, diversification patterns and historical biogeography of euglossine orchid bees (Hymenoptera: Apidae). Biol J Linn Soc 100:552–572. https://doi.org/10.1111/j.1095-8312.2010.01440.x
Rehan SM, Glastad KM, Lawson SP, Hunt BG (2016) The genome and methylome of a subsocial small carpenter bee, Ceratina calcarata. Genome Biol Evol 8:1401–1410. https://doi.org/10.1093/gbe/evw079
Romiguier J, Gayral P, Ballenghien M, Bernard A, Cahais V, Chenuil A, Chiari Y, Dernat R, Duret L, Faivre N et al (2014) Comparative population genomics in animals uncovers the determinants of genetic diversity. Nature 515:261–263. https://doi.org/10.1038/nature13685
Rubin BER, Jones BM, Hunt BG, Kocher SD (2019) Rate variation in the evolution of non-coding DNA associated with social evolution in bees. Philos Trans R Soc B Biol Sci 374:20180247. https://doi.org/10.1098/rstb.2018.0247
Rubin BER, Moreau CS (2016) Comparative genomics reveals convergent rates of evolution in ant–plant mutualisms. Nat Commun 7:12679. https://doi.org/10.1038/ncomms12679
Sadd BM, Barribeau SM, Bloch G, de Graaf DC, Dearden P, Elsik CG, Gadau J, Grimmelikhuijzen CJ, Hasselmann M, Lozier JD et al (2015) The genomes of two key bumblebee species with primitive eusocial organization. Genome Biol 16:76. https://doi.org/10.1186/s13059-015-0623-3
Schmid VS (2012) Reproductive conflict among workers of the ant species Pseudomyrmex gracilis (Hymenoptera: Formicidae). Universität Regensburg, Regensburg
Seppey M, Manni M, Zdobnov EM (2019) BUSCO: assessing genome assembly and annotation completeness. Methods Mol Biol 1962:227–245. https://doi.org/10.1007/978-1-4939-9173-0_14
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM (2015) BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. https://doi.org/10.1093/bioinformatics/btv351
Smith SA, Donoghue MJ (2008) Rates of molecular evolution are linked to life history in flowering plants. Science 322:86–89. https://doi.org/10.1126/science.1163197
Stanke M, Tzvetkova A, Morgenstern B (2006) AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol 7:S11. https://doi.org/10.1186/gb-2006-7-s1-s11
Tanaka ED, Hartfelder K (2004) The initial stages of oogenesis and their relation to differential fertility in the honey bee (Apis mellifera) castes. Arthropod Struct Dev 33:431–442. https://doi.org/10.1016/j.asd.2004.06.006
Thomas JA, Welch JJ, Lanfear R, Bromham L (2010) A generation time effect on the rate of molecular evolution in invertebrates. Mol Biol Evol 27:1173–1180. https://doi.org/10.1093/molbev/msq009
Ward PS (2017) A review of the Pseudomyrmex ferrugineus and Pseudomyrmex goeldii species groups: acacia-ants and relatives (Hymenoptera: Formicidae). Zootaxa 4227:524. https://doi.org/10.11646/zootaxa.4227.4.3
Ward PS, Downie DA (2005) The ant subfamily Pseudomyrmecinae (Hymenoptera: Formicidae): phylogeny and evolution of big-eyed arboreal ants. Syst Entomol 30:310–335. https://doi.org/10.1111/j.1365-3113.2004.00281.x
Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM (2018) BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol 35:543–548. https://doi.org/10.1093/molbev/msx319
Weinstock GM, Robinson GE, Gibbs RA, Weinstock GM, Weinstock GM, Robinson GE, Worley KC, Evans JD, Maleszka R, Robertson HM et al (2006) Insights into social insects from the genome of the honeybee Apis mellifera. Nature 443:931–949. https://doi.org/10.1038/nature05260
Weller C, Wu M (2015) A generation-time effect on the rate of molecular evolution in bacteria. Evolution 69:643–652. https://doi.org/10.1111/evo.12597
Wertheim JO, Murrell B, Smith MD, Kosakovsky Pond SL, Scheffler K (2015) RELAX: detecting relaxed selection in a phylogenetic framework. Mol Biol Evol 32:820–832. https://doi.org/10.1093/molbev/msu400
Yang Z (1997) PAML: a program package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci CABIOS 13:555–556
Yang Z (2007) PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24:1586–1591. https://doi.org/10.1093/molbev/msm088
Zdobnov EM, Tegenfeldt F, Kuznetsov D, Waterhouse RM, Simão FA, Ioannidis P, Seppey M, Loetscher A, Kriventseva EV (2017) OrthoDB v9.1: cataloging evolutionary and functional annotations for animal, fungal, plant, archaeal, bacterial and viral orthologs. Nucleic Acids Res 45:D744–D749. https://doi.org/10.1093/nar/gkw1119
Acknowledgements
I thank Lindell Bromham, Paul Durst, Deren Eaton, Wynn Meyer, Corrie Moreau, Luisa Pallares, Tom Stewart, and Benjamin Winger for providing analytical insights and valuable feedback on earlier versions of this manuscript. I thank members of the Kocher Lab at Princeton University for their input. This work was supported by postdoctoral fellowship Grant No. 2018-67012-28085 from the USDA National Institute of Food and Agriculture.
Author information
Authors and Affiliations
Corresponding author
Additional information
Significance statement: The causes of genomic change are manifold and difficult to diagnose. Here, I find that colony size, a fundamental facet of insect sociality, is positively correlated with rates of molecular evolution, genome-wide. In social insect taxa with the largest colonies, a queen may produce hundreds of millions of offspring, requiring vastly more divisions in germline stem cells than in taxa with small colonies, leading to mutation accumulation. In addition, genes involved in DNA repair experience increased purifying selection in taxa with large colony sizes, suggesting that selective pressures work to compensate for the inherent change in substitution rate. These findings reveal a fundamental force in genome evolution which may serve to cap the size and complexity of social insect colonies.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Rubin, B.E.R. Social insect colony size is correlated with rates of molecular evolution. Insect. Soc. 69, 147–157 (2022). https://doi.org/10.1007/s00040-022-00859-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00040-022-00859-3