Joint inference of evolutionary transitions to self-fertilization and demographic history using whole-genome sequences

  1. Stefan Strütt
  2. Thibaut Sellinger
  3. Sylvain Glémin
  4. Aurélien Tellier (corresponding author)
  5. Stefan Laurent (corresponding author)
  1. Max Planck Institute for Plant Breeding Research, Germany
  2. Technical University of Munich, Germany
  3. Université Rennes 1, CNRS, France

Abstract

The evolution from outcrossing to selfing is a transition that has occurred recurrently throughout the eukaryote tree of life, in plants, animals, fungi, and algae. Despite some short-term advantages, selfing is thought to be an evolutionary dead end in the long term, and its tippy distribution on phylogenies suggests that most selfing species are of recent origin. However, dating such transitions is challenging, even though it is central to testing this hypothesis. We build on previous theory to make explicit the differential effects of past changes in selfing rate and in population size on the probability of recombination events along the genome. This allows us to develop two methods that use whole-genome polymorphism data to (1) test whether a transition from outcrossing to selfing occurred and (2) infer its age. The sequentially Markov coalescent-based method (teSMC) and the Approximate Bayesian Computation method (tsABC) share a common framework based on a transition matrix that summarizes the distribution of times to the most recent common ancestor along the genome, which makes it possible to estimate changes over time in the ratio of the population recombination rate to the population mutation rate. We first demonstrate that our methods can disentangle past changes in selfing rate from past changes in demographic history. Second, we assess the accuracy of our methods and show that transitions to selfing as old as approximately 2.5Ne generations can be identified from polymorphism data. Third, we show that our estimates are robust to the presence of linked negative selection on coding sequences. Finally, as a proof of principle, we apply both methods to three populations of Arabidopsis thaliana, recovering a transition to selfing that occurred approximately 600,000 years ago. Our methods pave the way for studying recent transitions to predominant self-fertilization in selfing organisms and for better accounting for variation in mating systems in demographic inference.
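
The reason the ratio of population recombination to mutation rates separates selfing from demography can be illustrated with a minimal sketch. It assumes the standard coalescent rescaling for partial selfing, which is not spelled out in the abstract: inbreeding coefficient F = s / (2 - s), effective population size N / (1 + F), and effective recombination rate r (1 - F). The function names and all parameter values below are hypothetical and are not taken from teSMC or tsABC.

```python
"""Sketch: how selfing and population size rescale theta, rho and rho/theta.

Assumes the standard approximation for partial selfing (not the authors' code):
F = s / (2 - s), Ne = N / (1 + F), effective recombination = r * (1 - F).
"""


def inbreeding_coefficient(selfing_rate: float) -> float:
    """Equilibrium inbreeding coefficient F = s / (2 - s) for selfing rate s."""
    if not 0.0 <= selfing_rate <= 1.0:
        raise ValueError("selfing_rate must lie in [0, 1]")
    return selfing_rate / (2.0 - selfing_rate)


def rescaled_rates(n_diploid: float, mu: float, r: float, selfing_rate: float):
    """Return (theta, rho, rho / theta) per site under partial selfing."""
    f = inbreeding_coefficient(selfing_rate)
    ne = n_diploid / (1.0 + f)          # selfing reduces the effective size
    theta = 4.0 * ne * mu               # population mutation rate
    rho = 4.0 * ne * r * (1.0 - f)      # population recombination rate
    return theta, rho, rho / theta


if __name__ == "__main__":
    # Hypothetical per-site rates, chosen only for illustration.
    mu, r = 7e-9, 3.5e-8
    scenarios = [
        ("outcrossing, N = 100,000", 100_000, 0.0),
        ("outcrossing, N = 50,000", 50_000, 0.0),
        ("99% selfing, N = 100,000", 100_000, 0.99),
    ]
    for label, n, s in scenarios:
        theta, rho, ratio = rescaled_rates(n, mu, r, s)
        print(f"{label}: theta={theta:.3e} rho={rho:.3e} rho/theta={ratio:.4f}")
```

Under these assumptions, halving the population size scales theta and rho by the same factor and leaves rho/theta unchanged, whereas a shift to selfing lowers rho/theta by the factor (1 - F). A change in this ratio through time is therefore a signature of a change in mating system rather than of population size.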

Data availability

The current manuscript is a computational study, so no data have been generated for this manuscript. Modelling code is available at the following repositories:

  • tsABC: https://github.com/sstruett/tsABC
  • teSMC: https://github.com/TPPSellinger/eSMC2
  • Scripts used for the analyses in Strütt and Sellinger et al.: https://github.com/laurentlab-mpipz/struett_and_sellinger_et_al

The following previously published data sets were used

Article and author information

Author details

  1. Stefan Strütt

    Max Planck Institute for Plant Breeding Research, Cologne, Germany
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0002-2785-2815
  2. Thibaut Sellinger

    Department of Life Science Systems, Technical University of Munich, Munich, Germany
    Competing interests
    The authors declare that no competing interests exist.
  3. Sylvain Glémin

    ECOBIO (Ecosystèmes, Biodiversité, Evolution), Université Rennes 1, CNRS, Rennes, France
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0001-7260-4573
  4. Aurélien Tellier

    Department of Life Science Systems, Technical University of Munich, Munich, Germany
    For correspondence
    aurelien.tellier@tum.de
    Competing interests
    The authors declare that no competing interests exist.
  5. Stefan Laurent

    Max Planck Institute for Plant Breeding Research, Cologne, Germany
    For correspondence
    laurent@mpipz.mpg.de
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0003-4016-5427

Funding

Max Planck Institute for Plant Breeding Research (open access funding)

  • Stefan Strütt
  • Stefan Laurent

No external funding was received for this work.

Reviewing Editor

  1. Vincent Castric, Université de Lille, France

Version history

  1. Preprint posted: August 1, 2022
  2. Received: August 2, 2022
  3. Accepted: May 8, 2023
  4. Accepted Manuscript published: May 11, 2023 (version 1)
  5. Version of Record published: June 28, 2023 (version 2)

Copyright

© 2023, Strütt et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

Stefan Strütt, Thibaut Sellinger, Sylvain Glémin, Aurélien Tellier, Stefan Laurent (2023)
Joint inference of evolutionary transitions to self-fertilization and demographic history using whole-genome sequences
eLife 12:e82384.
https://doi.org/10.7554/eLife.82384
