Emergence of catalytic function in prebiotic information-coding polymers

Version of Record

The authors declare this version of their article to be the Version of Record.

Download
Cite
Share
CommentOpen annotations (there are currently 0 annotations on this page).

Version of Record published: March 26, 2024 (This version)
Reviewed preprint version 2: March 7, 2024 (Go to version)
Reviewed preprint version 1: December 1, 2023 (Go to version)
Sent for peer review: August 18, 2023
Preprint posted: July 26, 2023 (Go to version)

1. Of interest
Stochastic parabolic growth promotes coexistence and a relaxed error threshold in RNA-like replicator populations

Mátyás Paczkó, Eörs Szathmáry, András Szilágyi

Research Article Apr 26, 2024
Further reading

Abstract
eLife assessment
Introduction
Model and results
Discussion
Appendix 1
Appendix 2
Data availability
References
Article and author information
Metrics

Abstract

Life as we know it relies on the interplay between catalytic activity and information processing carried out by biological polymers. Here we present a plausible pathway by which a pool of prebiotic information-coding oligomers could acquire an early catalytic function, namely sequence-specific cleavage activity. Starting with a system capable of non-enzymatic templated replication, we demonstrate that even non-catalyzed spontaneous cleavage would promote proliferation by generating short fragments that act as primers. Furthermore, we show that catalytic cleavage function can naturally emerge and proliferate in this system. Specifically, a cooperative catalytic network with four subpopulations of oligomers is selected by the evolution in competition with chains lacking catalytic activity. The cooperative system emerges through the functional differentiation of oligomers into catalysts and their substrates. The model is inspired by the structure of the hammerhead RNA enzyme as well as other DNA- and RNA-based enzymes with cleavage activity that readily emerge through natural or artificial selection. We identify the conditions necessary for the emergence of the cooperative catalytic network. In particular, we show that it requires the catalytic rate enhancement over the spontaneous cleavage rate to be at least 10²–10³, a factor consistent with the existing experiments. The evolutionary pressure leads to a further increase in catalytic efficiency. The presented mechanism provides an escape route from a relatively simple pairwise replication of oligomers toward a more complex behavior involving catalytic function. This provides a bridge between the information-first origin of life scenarios and the paradigm of autocatalytic sets and hypercycles, albeit based on cleavage rather than synthesis of reactants.

eLife assessment

This valuable study uses a model to determine when catalytic self-replication of polymers can emerge from a random pool of replicating polymers. The model accounts for the folding and function of polymers in addition to abstract evolutionary dynamics, providing solid evidence for the claims of the authors. The work will be of relevance to those interested in the origin of life, artificial cells, and evolutionary dynamics.

https://doi.org/10.7554/eLife.91397.3.sa0

About eLife assessments

Introduction

One of the most intriguing mysteries in science is the origin of life. Despite extensive research in this area, we are still far from understanding how life has emerged on Earth. One promising hypothesis is the RNA world theory (Gilbert, 1986; Doudna and Szostak, 1989; Orgel, 2004; Lincoln and Joyce, 2009; Robertson and Joyce, 2012; Higgs and Lehman, 2015), inspired by the discovery of ribozymes (Kruger et al., 1982), that is, RNA molecules capable of enzymatic activity. According to this hypothesis, all processes in early life were carried out by the RNA, which was used both to store information and catalyze biochemical reactions. In particular, specific ribozymes could have catalyzed the self-replication of arbitrary RNA sequences, a function currently performed by specialized protein-based enzymes. However, based on the results of the existing experiments (Bartel and Szostak, 1993; Horning and Joyce, 2016), such a catalytic function requires rather long and carefully designed RNA sequences, which are highly unlikely to arise spontaneously. In contrast, one of the simplest catalytic activities of ribozymes is their ability to cleave an RNA sequence at a specific site. Indeed, such ribozymes independently evolved in multiple branches of life (de la Peña and García-Robles, 2010) and have been shown to emerge rapidly and repeatedly from artificial selection (Williams et al., 1995; Salehi-Ashtiani and Szostak, 2001). DNA molecules have also been shown to be capable of site-specific cleavage targeting either RNA (Breaker and Joyce, 1994) or DNA sequences (Silverman, 2005; Chandra et al., 2009).

In this article, we consider a population of information-carrying polymers capable of templated non-enzymatic replication (Szostak, 2012; Kim et al., 2021). This may have been the state of the proto-RNA world before the emergence of ribozymes. This could involve heteropolymers chemically distinct from present-day RNA (Kim et al., 2021) and/or inorganic catalysts such as mineral surfaces (Ferris, 2005; Jerome et al., 2022). We demonstrate that (i) even spontaneous cleavage promotes replication by generating short fragments used as primers for templated growth and (ii) catalytic cleavage activity naturally emerges in these populations and gets selected by the evolution (Lukin, 2010).

In a series of previous studies, we have shown that non-enzymatic templated replication can lead to the formation of longer chains (Tkachenko and Maslov, 2015) as well as to a reduction in sequence entropy (Tkachenko and Maslov, 2018). Such a reduction in entropy has subsequently been observed experimentally for templated ligation of DNA oligomers (Kudella et al., 2021). This selection in sequence space can be seen as a first step toward Darwinian evolution. However, this does not necessarily imply the emergence of a catalytic function. In this article, we build on these findings and further investigate the potential for the evolution of catalytic activity in the proto-RNA world.

Model and results

In our model, we consider the population dynamics of a pool of heteropolymers analogous to the familiar nucleic acids (RNA or DNA) but capable of enzyme-free templated polymerization. The basic processes in this scenario are similar to those in polymerase chain reaction (PCR), where the system is driven out of equilibrium by cyclic changes in the environment (e.g., temperature), which we refer to as ‘night’ and ‘day’ phases. During the night phase, heteropolymers hybridize with each other following Watson–Crick-like complementarity rules. If the terminus of one chain hybridizes with the middle section of another chain, the former can be gradually elongated by the virtue of non-enzymatic templated polymerization (Sulston et al., 1968; Weimann et al., 1968; Lohrmann et al., 1980; Duzdevich et al., 2020). We will refer to this type of hybridization as ‘end-to-middle’, the former chain as the ‘primer’ and the latter chain as the ‘template’. During the day phase, the hybridized structures melt and all the heteropolymers separate from each other. During the next night, they hybridize with new partners, providing them with the opportunity to elongate further. Unlike the classical PCR process, we assume that the polymerization in our system occurs without any assistance from enzymes and may proceed in either direction along the chain. Equivalently, instead of polymerization, the elongation could rely on ligation with very short chain segments. It is important to note that in the context of RNA, such bidirectional elongation requires chemical activation of the phosphate group at the 5′ end of the primer to provide free energy for the newly formed covalent bond. Like the polymerization process itself, achieving this without enzymes is biochemically challenging. One might speculate that prebiotic evolution relied on inorganic catalysis, such as on mineral surfaces, or involved polymers other than today’s RNA.

The elongation of primers naturally leads to the copying of information from the template’s sequence. The obvious limiting factor for this process is the availability of primers and the likelihood of end-to-middle hybridization resulting in elongation. The key observation behind our model is that the breakup (cleavage) of a chain creates a new pair of potential primers. Each of them could be elongated during subsequent nights. Thus, somewhat counterintuitively, breakup of chains results in their proliferation.

Our previous theoretical (Tkachenko and Maslov, 2015; Tkachenko and Maslov, 2018) and experimental (Kudella et al., 2021) results demonstrated that templated-assisted replication of heteropolymers has a generic tendency to substantially reduce their sequence entropy. Such reduction has important consequences in the context of the current work: it significantly increases the likelihood of end-to-middle hybridization of chains during the night phase. That in turn creates an evolutionary pressure to further decrease sequence entropy. A detailed study of this fascinating mechanism falls beyond the scope of the current study. However, below we will assume the logical end of this dynamics where the pool of sequences is composed of fragments of one or several nearly non-overlapping master sequences and their complementaries. Note that for any two overlapping chains, such that the sequence of the first one is a fragment of the master sequence while the other is a fragment of the complementary master sequence, the end-to-middle binding is essentially guaranteed. The exception to this rule is when both chains terminate at the same points so that they are exact complements of each other.

Random cleavage model

Based on the argument presented above, we focus on the case of a system populated with chains that are fragments of a single master sequence or its complement. We denote the total concentration of fragments of the master sequence as $c (t)$ , while the concentration of all fragments of the complementary sequence as $\bar{c} (t)$ . Our system operates in a chemostat, that is, a reservoir constantly supplied with fresh monomers at the concentration $m_{0}$ and diluted at the rate $δ$ . Let $M (t)$ (respectively $\bar{M} (t)$ ) be the concentration of all monomers incorporated into chains of the subpopulation $c (t)$ (respectively $\bar{c} (t)$ ). The concentration of free monomers not incorporated into any chains is given by:

m = m_{0} - M - \bar{M}

It is convenient to introduce a minimal length $l_{0}$ of a chain that would hybridize with its complementary partner during the night phase, and use it as the unit of chain length, instead of a single monomer. In effect, this leads to the renormalization of all monomer concentrations as $m = [m] / l_{0}$ , where $[m]$ is the conventional molarity. $M$ , $\bar{M}$ , and $m_{0}$ are similarly renormalized, while the polymer concentration remains unmodified: $c = [c]$ . In what follows, we renormalize all lengths and concentrations so that $l_{0} = 1$ .

We assume that nights are sufficiently long and that the hybridization rate is sufficiently fast so that early on during each night phase most chains find partners with a complementary overlap. That assumes that the total concentration of all master sequence fragments, $c$ , is lower than the total concentration of all fragments in the complementary subpopulation, $\bar{c}$ . If this is not the case, that is, if $c > \bar{c}$ , only a fraction $\bar{c} / c$ of all fragments of the master sequence find a partner, while the rest of the $c$ -subpopulation remains unpaired and thus does not elongate. For simplicity, our model neglects the possibility of the formation of hybridized complexes involving more than two chains.

It is well known that self-replication based on template-based polymerization or ligation is vulnerable to product inhibition, that is, re-hybridization of the products intended to act as templates (Szostak, 2012; Tupper and Higgs, 2021). Indeed, since full-length templates and their complements would bind more strongly to each other than any shorter fragment, the primers would typically be displaced from the templates by longer chains, leading to effective ‘template’ poisoning. In our model, however, the master sequence fragments act as both primers and templates. Of two oligomers in a hybridized pair, the one that extends further in a particular direction acts as a template for the growth of the other. At the same time, the partner may also act as a template for the growth of the first oligomer in the other direction. In either case, there will typically be two growing ends per duplex. Full inhibition occurs only when two hybridized chains terminate at exactly the same point in both directions. Since this is a rather rare event, we neglect its effect.

As discussed earlier, two hybridized chains whose sequences are fragments of the master sequence and its complementary would typically have two ends that undergo templated growth at a certain rate proportional to monomer concentration $m$ (see Figure 1). The exception to this rule is when these two hybridized chains terminate at exactly the same point. In our model, the average primer elongation rate is $r \cdot m (t)$ . The rate parameter $r$ accounts for the finite probability of primer binding to a template during the night phase as well as for a finite night-to-day ratio. Note that the value of $r$ does not change after our renormalization $l_{0} \to 1$ . The $M (t)$ dynamics is thus given by

\frac{d M}{d t} = r m \cdot m i n (c, \bar{c}) - δ \cdot M

Figure 1

Download asset Open asset

Illustration of two model variants.

(A) Random cleavage model. A random break in a chain of type $c$ generates two primer fragments, which are elongated to give rise to two chains of type $c$ . Elongation requires a complementary template of type $\bar{c}$ . (B) An example of catalyzed cleavage given by hammerhead ribozyme (Pley et al., 1994). Note that the right cleavage fragment is perfectly complementary to the blue sequence, while the left one contains an extra non-complementary base C. (C) Catalyzed cleavage model. A cleavage of the red chain $a$ catalyzed by the blue chain $\bar{b}$ gives rise to two primers $a_{L}$ (red) and $a_{R}$ (purple). Because of an extra non-complementary base (see B), the $a_{L}$ primer can only elongate to $a$ , while the $a_{R}$ primer – to either $a$ or $b$ depending on its first hybridization partner. Similar processes involving complementary chains $\bar{a}$ and $\bar{b}$ (not shown) result in the replication of templates.

The equation for the complementary subpopulation is obtained by replacing $M$ with $\bar{M}$ . Note that $m$ includes only monomers, and therefore is not subdivided into two complementary subpopulations.

We assume that the breakup of chains is completely random and happens at a constant rate $β_{0}$ at any internal bond along the chain. The concentration of these breakable bonds is given by $M$ . Because of our choice of the unit length $l_{0}$ , $β_{0} = l_{0} β_{0}^{'}$ , where $β_{0}^{'}$ is the cleavage rate of a single bond. Since each fragmentation of a chain creates one new primer, the equation governing the overall concentration of chain fragments in the subpopulation $c$ is given by

\frac{d c}{d t} = β_{0} M - δ c

Once again, the equation for the complementary subpopulation is obtained by replacing $c$ with $\bar{c}$ and $M$ with $\bar{M}$ .

Combining Equations 2 and 3, we observe that the steady state is a symmetric mixture $c = \bar{c}$ . The average length (in units of $l_{0}$ ) of all chain fragments in the subpopulation $c$ is given by

⟨ L ⟩ = \frac{M}{c} = \frac{δ}{β_{0}} = \frac{r m^{*}}{δ} .

This in turn determines the steady-state concentrations:

m^{*} = \frac{δ^{2}}{β_{0} r}

c = \bar{c} = \frac{1}{2} (\frac{β_{0}}{δ} m_{0} - \frac{δ}{r})

To obtain Equation 6, we combined Equations 1 and 4. Note that the mutually templating chains survive only when the concentration of free monomers supplied to the system $m_{0}$ exceeds $m^{*}$ .

Model with catalyzed cleavage

In the model considered above, the random breakage of chains led to their proliferation. It is therefore reasonable to expect that the ability of a heteropolymer to catalyze cleavage would be selected by the evolution. Incidentally, some of the simplest known RNA-based enzymes (ribozymes) have exactly this function (Kruger et al., 1982; Prody et al., 1986; Hutchins et al., 1986; Scott et al., 1995; Williams et al., 1995; Salehi-Ashtiani and Szostak, 2001; de la Peña and García-Robles, 2010; Scott et al., 2013).

Here we consider a simple model in which a heteropolymer capable of catalyzing cleavage spontaneously emerges from a pool of mutually templating chains. Our model is inspired by the real-world examples of naturally occurring hammerhead ribozyme (Prody et al., 1986; Hutchins et al., 1986), as well as artificially selected DNA-cleaving DNA enzymes (Chandra et al., 2009). The minimal structure of the hammerhead ribozyme (Pley et al., 1994; Scott et al., 1995; Scott et al., 2013) consists of a core region of 15 (mostly) invariant nucleotides flanked by three helical stems formed by mutually complementary RNA sequences. The cleavage happens at a specific site of this structure, located immediately adjacent to one of these stems. While a classical hammerhead ribozyme consists of a single RNA chain capable of self-cleavage, the same structure could be assembled from two chains, one (labeled $b$ in Figure 1) containing a hairpin and capable of hybridizing with and subsequently cleaving the other chain (labeled $a$ in Figure 1). Furthermore, such two-chain structuresare realized in certain DNA-cleaving DNA enzymes (Chandra et al., 2009).

We consider a scenario in which a master sequence spontaneously emerges from a random pool and subsequently diverges into two closely related subpopulations $a$ and $b$ and their respective complementarities $\bar{a}$ and $\bar{b}$ . The sequences of $a$ and $b$ are mostly identical except for a short insert in the chains $b$ and $\bar{b}$ , rendering them catalytically active. That is to say, when $\bar{b}$ is bound to a chain from the subpopulation $a$ , it induces a cleavage at a specific site of that chain. Inspired by the hammerhead ribozyme, we assume that the cleavage site in $a$ is immediately adjacent to the start of the catalytic insert in $b$ (see Figure 1).

We further assume that $b$ -chains are capable of cleaving $\bar{a}$ -chains at the same site. This symmetry most likely does not apply to highly optimized hammerhead ribozymes, but it is reasonable in enzymes with only a modest level of catalytic efficiency. Let the cleavage within $\bar{b}$ - $a$ or $b$ - $\bar{a}$ duplexes occur at rates $β$ and $\bar{β}$ , respectively.

The cleavage of a chain of type $a$ by $\bar{b}$ produces two pieces, right and left. The concentrations of these fragments are referred to as $a_{R}$ and $a_{L}$ , respectively. Inspired by the example of the hammerhead ribozyme (illustrated in Figure 1B), we assume that one of them ( $a_{L}$ ) can serve as a primer only for $a$ . Conversely, the other fragment ( $a_{R}$ ) can serve as a primer for either $a$ or $b$ depending on the first templating chain with which it will hybridize (respectively $\bar{a}$ or $\bar{b}$ ). Assuming a random first encounter, the probability for $a_{R}$ to serve as a primer to $b$ is thus given by $ϕ = \bar{b} / (\bar{a} + \bar{b}) < 1$ , while the probability of it to grow into $a$ is $1 - ϕ = \bar{a} / (\bar{a} + \bar{b})$ . Similarly, conversion probabilities in the complementary subpopulation ${\bar{a}}_{R}$ are determined by $\bar{ϕ} = b / (a + b)$ .

At this point, the issue of template poisoning due to product rehybridization should be revisited. In the context of random cleavage, we have argued that it is avoided due to the low probability of binding of two fragments terminating at exactly the same sites. However, catalyzed cleavage produces primers $a_{R} / {\bar{a}}_{R}$ and $a_{L} / {\bar{a}}_{L}$ that terminate in the same region, and therefore they would typically be displaced from the respective templates in favor of the formation of $a - \bar{a}$ and $b - \bar{b}$ duplexes. In Appendix 1, we analyze the binding kinetics during the night phase and come to an important conclusion. If the concentration of primers (such as $a_{R}$ ) is small compared to that of templates (such as $\bar{a}$ ), despite the strand displacement, a finite and substantial fraction of primers will remain hybridized to their respective templates for a significant time. More specifically, they would remain hybridized until the concentration of free templates drops to the level of the concentration of primers. This indicates that template poisoning has only a moderate effect if the night phase is not too long.

As in the random cleavage model, the concentration of hybridized duplexes is given by the smaller of two concentrations $a + b$ and $\bar{a} + \bar{b}$ . This is captured by the factor $χ = m i n (a + b, \bar{a} + \bar{b}) / (a + b) \leq 1$ in the elongation rate of a primer: $r m χ$ . Here, as before, this rate is measured in $l_{0}$ bases per unit time, and $m$ is the free monomer concentration.

We observe that in order for a cleavage fragment to work as a primer, it needs to exceed the minimal primer length $l_{0}$ . Therefore, a newly formed cleavage product $a_{L}$ needs to grow by at least $l_{0}$ bases before it can be considered a part of the $a$ subpopulation. Indeed, if it has not grown by that length, another catalyzed cleavage at the same site would not increase the number of primers in the system. Therefore, the rate at which chains in the subpopulation $a_{L}$ are converted to $a$ is given by $r m χ$ . The rate of conversion of $a_{R}$ to $a$ is similar, up to the factor $1 - ϕ$ discussed above: $r m χ (1 - ϕ)$ . If a segment $a_{R}$ was first hybridized to $\bar{b}$ , it will eventually grow to be a part of the subpopulation $b$ . However, in order to become functional, this chain has to grow at least by the length of the catalytic insert, which is distinct from $l_{0}$ . Furthermore, the rate of elongation of $b$ is slowed down by the presence of a hairpin in the catalytic domain of the $\bar{b}$ structure. Both effects can be captured by a factor λ in the conversion rate from $a_{R}$ to $b$ given by $r m χ ϕ / λ$ , relative to that of $a_{R}$ to $a$ given by $r m χ (1 - ϕ)$ .

Altogether, the dynamics of our model is described by the following equations:

\dot{c} = β_{0} M + β ϕ a - δ c

\dot{b} = β_{0} M \bar{ϕ} + \frac{r m χ ϕ}{λ} a_{R} - δ b

{\dot{a}}_{L} = β ϕ a - r m χ a_{L} - δ a_{L}

\begin{aligned} {\dot{a}}_{R} & = β ϕ a - r m χ (1 - ϕ) a_{R} - \frac{r m χ ϕ}{λ} a_{R} - δ a_{R} \end{aligned}

Here, $c \equiv a + b + a_{l} + a_{R}$ . Note that Equation 7 is obtained by first writing the kinetic equation for $a$ and then adding it up to the sum of Equations 8–10. The first terms in the r.h.s. of Equations 7 and 8 represent random non-catalyzed cleavage that occurs at rate $β_{0}$ at any location of any chain (compare to Equation 3). Similarly to Equation 2, the dynamics of the number density $M$ of monomers incorporated into chains $a$ , $a_{L}$ , $a_{R}$ , and $b$ is described by

\dot{M} = r m χ c - δ M .

This in turn determines the concentration of free monomers remaining in the solution as given by Equation 1.

Additional equations are obtained by replacing variables $a$ , $a_{L}$ , $a_{R}$ , $b$ , and $M$ with their complementary counterparts: $\bar{a}$ , $\bar{a_{L}}$ , $\bar{a_{R}}$ , $\bar{b}$ , and $\bar{M}$ , respectively. In addition, β, φ, and $χ$ should be replaced with $\bar{β}$ , $\bar{ϕ}$ , and $\bar{χ} = m i n (a + b, \bar{a} + \bar{b}) / (\bar{a} + \bar{b})$ , respectively.

System dynamics

A crucial feature of this multicomponent system is that catalytic cleavage depends on the cooperativity between all four subpopulations $a$ , $\bar{a}$ , $b$ , and $\bar{b}$ . To understand when such a cooperative steady state exists, we have numerically solved Equations 7–11 for $β_{0} = 0.015$ , $λ = 2$ , $δ = 1$ and different values of the catalytic cleavage rate $β$ . Figure 2A–C show the dynamic trajectories of our system in a–b space for a wide range of initial conditions. For $β$ too small (e.g., $β = 6$ in Figure 2A) or too large (e.g., $β = 18$ in Figure 2C), the only steady-state solutions correspond to the survival of either the $a / \bar{a}$ or $b / \bar{b}$ subpopulation. In these non-cooperative fixed points, marked with red and blue stars in Figure 2, one set of chains drives the other to extinction. Thus, they propagate by random rather than catalytic cleavage. The concentrations of monomers and a complementary pair of surviving chains at a non-cooperative fixed point are given by Equations 5 and 6, respectively.

Figure 2

Download asset Open asset

Dynamical phase portraits for different catalytic cleavage rates β.

(A) The phase portrait for a small catalytic cleavage rate $β = 6$ has two non-cooperative steady-state solutions marked with red and blue stars corresponding to pure $a / \bar{a}$ and pure $b / \bar{b}$ subpopulations, respectively. These solutions are maintained by random rather than catalytic cleavage. (B) The phase portrait for intermediate catalytic cleavage rate $β = 10$ in addition to two non-cooperative steady states marked with red and blue stars has a cooperative steady state marked with the green star in which all four subpopulations coexist. One can reach this state, for example, starting from the non-cooperative $b / \bar{b}$ steady state (the blue star) and adding a relatively small subpopulation of $a / \bar{a} > 2 e - 5$ crossing the saddle point separating blue and green trajectories. (C) The phase portrait for a large catalytic cleavage rate $β = 18$ again has only two non-cooperative cleavage steady states marked with red and blue stars. All three panels were obtained by numerically solving dynamical Equations 7–11 with random cleavage rate $β_{0} = 0.015$ , elongation asymmetry factor $λ = 2$ , and dilution factor $δ = 1$ .

For an intermediate value of β (e.g., for $β = 10$ shown in Figure 2B), we observe the emergence of a new cooperative fixed point (marked by the green star), where all four subpopulations survive at concentrations $a = \bar{a} > 0$ and $b = \bar{b} > 0$ . This fixed point is mainly maintained by catalytic cleavage.

The phase portrait of our system shown in Figure 2B suggests a plausible scenario for the emergence and subsequent evolution of this cooperative system. This evolutionary scenario starts with $b / \bar{b}$ subpopulations existing alone. Eventually, copying errors might result in the emergence of a small subpopulation of $a$ or $\bar{a}$ at a concentration $a ≪ b$ . If this concentration is greater than a certain threshold separating blue and green trajectories in Figure 2B, the dynamics of the system would drive it to the cooperative fixed point (green star in Figure 2B). For specific parameters used to generate Figure 2B, the minimal ratio between concentrations $a$ and $b$ is roughly 0.01.

Properties of the cooperative steady state

To better understand the conditions for the existence of the cooperative regime, we analytically derived the steady-state solutions of Equations 7–11. The key result is the relationship between the steady-state monomer concentration, $m_{c o o p}^{*}$ , and the catalytic cleavage rate β for given values of $β_{0}$ , λ, δ, and $r$ derived in Appendix 1. Figure 3A shows this dependence for $λ = 2$ , and different values of $β_{0} / δ$ alongside with data points (open circles) obtained by direct numerical solution of dynamical (Equations 7–11).

Figure 3

Download asset Open asset

Properties of the cooperative state.

(A) The relationship between parameters of the cooperative state. $m_{c o o p}^{*} / m^{*}$ plotted vs. $β$ for $λ = 2$ , $δ = 1$ and increasing values of $β_{0}$ : 0.015 (green), 0.003 (purple), and 0.0045 (red). Lines are given by the parametric equation describing the state and derived in the SI Appendix (Equation S14), while open circles are obtained by direct numerical solution of dynamical equations (7–11). Monotonically increasing branches (solid lines) correspond to the stable cooperative fixed point, while the decreasing branches (dashed lines) – to the dynamically unstable saddle points separating different steady-state solutions in Figure 2B. (B) Phase diagram of the cooperative state. The shaded region marks the values of $β / δ$ and $β_{0} / δ$ for which the cooperative solution exists. Green, purple, and red lines show the ranges of $β$ for which the cooperative solution exists for the corresponding value of $β_{0}$ shown in (A). Increasing the parameter $β_{0}$ makes the range of $β$ for which the cooperative solution exists progressively smaller until it altogether disappears above. $β_{0} / δ \approx 0.057$ .

The stable fixed point corresponds to the monotonically increasing branch of the graph $m_{c o o p}^{*}$ vs. $β$ (solid lines in Figure 3A), while two decreasing branches (dashed lines in Figure 3A) correspond to two dynamically unstable saddle points separating different steady-state solutions (see Figure 2B).

Note that the stability of our cooperative fixed point is a non-trivial result. For example, in a related model by Kamimura et al., 2019, the fixed point corresponding to a viable composite replicase is dynamically unstable and requires additional stabilization, for example, by cell-like compartments.

Increasing the parameter $β_{0} / δ$ , for example, by making the dilution rate $δ$ smaller, makes the range of $β$ for which the cooperative solution exists progressively smaller until it altogether disappears above $β_{0} / δ \approx 0.057$ (for $λ = 2$ ).

Figure 3B shows the ranges of $β / δ$ and $β_{0} / δ$ for which the cooperative solution exists (as before, for $λ = 2$ ). Solid lines of different colors correspond to three values of $β_{0}$ used in Figure 3A.

While the full set of our analytical results described in Appendix 1 is rather convoluted, here we present a simplified expression for the range of values of $β / δ$ , where the cooperative solution exists, and the corresponding range of $m_{c o o p}^{*}$ :

4 λ < \frac{β}{δ} < \frac{(λ - 1) δ}{4 β_{0}}

\frac{2 λ δ}{(λ - 1) β_{0}} < \frac{m_{c o o p}^{*}}{m^{*}} < \frac{1}{2}

These conditions were derived in the limit $β_{0} / δ ≪ 1 / λ$ (see Appendix 1 for details).

Evolutionary dynamics

The competition for monomers is the main mechanism of natural selection operating in our system. The steady state with the lowest level of monomer concentration is favored by evolution since the competing states would not be able to proliferate at that level of monomers. Equation 13 implies that the monomer concentration $m_{c o o p}^{*}$ in the cooperative solution is always less than a half of its value $m^{*}$ in the absence of the catalytic cleavage. Thus, once the cooperative state emerges, it drives out all chain sequences that still rely on non-catalytic cleavage for replication. The continuously increasing ‘fitness’ of the system can be quantified, for example, by the ratio $m^{*} / m_{c o o p}^{*}$ . Note that the non-cooperative solution has a fitness of 1, while the cooperative solution has a fitness higher than 2. Note that this fitness is defined at the level of the ecosystem comprising all the sequences in the chemostat and may not necessarily be attributable to individual members of that population. Over time, similar to microbial ecosystems, this population changes according to the laws of competitive exclusion (Gause, 1934; Tilman, 1982).

The fitness landscape of our system shown in Figure 4 depends on two parameters of the catalytic cleavage: β and λ. Its preeminent feature is a relatively narrow fitness ridge (orange and red color in Figure 4). For a fixed value of λ, this ridge corresponds to a sharp fitness maximum located at the lowest possible catalytic cleavage rate β for which the cooperative state still exists (see the red region in Figure 4B). In other words, for a given λ, the selective pressure would drive β down to the lower boundary separating cooperative and non-cooperative regions.

Figure 4

Download asset Open asset

The fitness landscape of our system.

(A) the three-dimensional plot and (B) the heatmap of the fitness $m^{*} / m_{c o o p}^{*}$ of the cooperative state in our system as a function of the catalytic cleavage rate enhancement $β / β_{0}$ and elongation asymmetry λ. The co-evolution of λ and β would increase together. A typical evolutionary trajectory in which small changes in these parameters are independent of each other is shown as a dashed line.

However, λ and β are expected to co-evolve together. Indeed, $λ > 1$ quantifies the ratio by which the structural properties of the type- $b$ chain (its excess length, the hairpin unzipping free energy, etc.) slow down its replication compared to that of the type- $a$ chain. Thus, it is reasonable to assume that λ could be easily modified in the course of the evolution. As a consequence of the selective pressure, both λ and β are expected to increase in the course of the evolution driving the system up the ridge in Figure 4. This intuition is confirmed by a direct numerical simulation in which we model the evolution as a Monte Carlo process with fitness playing the role of negative energy Parameters λ and β were allowed to vary randomly and independently of each other. A sample evolutionary trajectory is shown as a dashed line in Figure 4.

One can imagine several possible pathways leading to this self-sustaining cooperative system. (i) A pair of mutually complementary non-catalytic chains ( $a$ and $\bar{a}$ in our notation) gains function due to a copying error, giving rise to a small subpopulation of ‘sister’ chains ( $b$ and $\bar{b}$ in our notation) with nascent cleavage activity directed toward $\bar{a}$ and $a$ , respectively. (ii) A pair of catalytically active chains $b$ and $\bar{b}$ emerges first, subsequently losing the catalytic inserts due to a copying error, thus giving rise to a small subpopulation of substrate chains $a$ and $\bar{a}$ . According to the dynamic phase portrait of our system illustrated in Figure 2B, the second pathway is more plausible than the first one. Indeed, at least for the parameters used in Figure 2B, the conversion of only a few percent of $b$ chains to $a$ brings the systems into the basin of attraction of the cooperative fixed point marked green in the figure. The other scenario is less likely since it requires a much higher ratio of emergent over ancestral population sizes.

Our results indicate that the cooperative steady state emerges when the catalytic rate enhancement over the spontaneous cleavage rate is at least 10²–10³. This is a relatively modest gain compared to the 10⁹ enhancement reported for a highly optimized hammerhead ribozyme Scott et al., 2013. However, it is comparable to the rate enhancement observed after only five rounds of in vitro selection from an unbiased sample of random RNA sequences (Salehi-Ashtiani and Szostak, 2001). In the course of subsequent evolution, two main parameters of our model β and λ are expected to increase in tandem. The first parameter, β, is the catalytic cleavage rate, which, as we know, can increase over multiple orders of magnitude (Salehi-Ashtiani and Szostak, 2001). The second parameter, λ, quantifies the relative delay in the elongation of $b$ -types chains compared to $a$ -type chains. It may be caused at least in part by the difficulty of unzipping the hairpin, thus λ can be dramatically increased by making the hairpin stem longer. One should note that our model requires the catalytic activity of both $b$ chains and their complementary partner, $\bar{b}$ chains. Thus, sequences are expected to evolve to simultaneously optimize these two catalytic rates. The need for this compromise would likely prevent the catalysts from reaching the maximum efficiency observed, for example, in fully optimized ribozymes (Scott et al., 2013). The drive to further optimize cleavage activity might trigger a transition to more complex catalytic networks, for example, to an increase in the number of chains involved.

Discussion

The proposed scenario is certainly not the only plausible pathway that could lead to the emergence of functional heteropolymers in the prebiotic world. To illustrate some of the unique features of the presented model, it is useful to place it in the context of other recent proposals. One of the most intriguing possibilities is the virtual circular genome (VCG) model recently proposed in Zhou et al., 2021. It is based on the observation that a relatively long ancestral genome can be stored as a collection of short overlapping RNA fragments of a circular master sequence or its complement. The model assumes unidirectional non-enzymatic replication of these fragments. On the one hand, it explains how a collection of relatively short RNA fragments (10–12 nucleotides each) could store a large amount of genetic information. On the other hand, computer simulations indicate that the VCG is susceptible to so-called sequence scrambling, where the appearance of repeats in the master sequence results in the loss of integrity of an entire circular genome (Chamanian and Higgs, 2022). The proliferation of the VCG is quite sensitive to sequence scrambling since the model assumes unidirectional polymerization, so that in order to return to copying a particular segment, one must copy the entire virtual circle.

Our model, while sharing some elements with the VCG model, differs in two important aspects: (i) it assumes that the non-enzymatic templated polymerization is bidirectional, and (ii) the functional activity of the heteropolymer is localized in a relatively short sequence region that is catalytically cleaved and thus replicated first. This implies that possible scrambling outside of the narrow functional region does not affect the viability of our autocatalytic system. Indeed, assuming the minimal hybridization length $l_{0} = 6$ and random statistics of the master sequence, one gets the scrambling-free length $\sqrt{2 \cdot 4_{0}^{l}} + l_{0} ≃ 100$ . This is an order of magnitude larger than both $l_{0}$ and the length of the core region of the hammerhead ribozyme.

It may be possible to incorporate the selection mechanism proposed in this article into the VCG model. Such a hybrid approach would avoid the need for the biochemically problematic bidirectional growth, while explaining the emergence of early catalytic activity unaffected by sequence scrambling. This in turn could pave the way for the emergence of the rolling cycle, a model proposed as an alternative to VCG. (Tupper and Higgs, 2021; Rivera-Madrinan et al., 2022). The rolling cycle relies on strand displacement rather than cyclic melting of hybridized duplexes and requires a more sophisticated setup, including a pre-selected sequence pool and the availability of cleavage enzymes. Our model provides a plausible pathway for how this catalytic function could have evolved.

Our model aims to describe the early stages of the evolution of life on Earth based on non-enzymatic polymerization. While it may seem challenging to test it for conditions relevant to the origin of life, our main conclusions can still be verified experimentally. RNA or DNA can be used as model polymers in such experiments, as both have demonstrated catalytic abilities in cleavage reactions (de la Peña and García-Robles, 2010; Williams et al., 1995; Salehi-Ashtiani and Szostak, 2001; Breaker and Joyce, 1994; Silverman, 2005; Chandra et al., 2009). To simulate primordial polymerization driven by day/night cycling, the experiment would have to rely on enzymatic polymerization or ligation as used, for example, in PCR (Mullis and Faloona, 1987; Saiki et al., 1988) or ligase chain reaction (LCR) (Barany, 1991). However, it is important to note that our model assumes polymerization in both the 5′-to-3′ and 3′-to-5′ directions, unlike traditional PCR, which only adds new nucleotides in the 5′-to-3′ direction. This problem of bidirectional polymerization was solved by evolution using Okazaki fragments (Okazaki et al., 1968). Inspired by this discontinuous synthesis of the lagging strand of DNA, we propose a possible experimental implementation of our system based on ligation rather than polymerization enzymes. In this scenario, the system would be supplied with ultrashort random DNA segments. These segments, which are much shorter than the minimal primer length ( $l_{0}$ ), would play the role of ‘monomers’ and bidirectional primer extension would occur through a sequence of ligation steps connecting adjacent ultrashort segments to each other. Another important consideration for experimental implementation is the need to activate the nucleotides to provide free energy for polymerization. Thus, both the short fragments supplied to the system and the new primers formed by cleavage must be chemically activated.

Conclusions

Our results suggest a plausible pathway by which a pool of information-carrying polymers could acquire the catalytic function, thereby bringing this system closer to the onset of the RNA world. We start with a pool of polymers capable of non-enzymatic templated polymerization and subjected to cyclic change of conditions (day/night cycles). The replication in our system is carried out during the night phase of the cycle via the elongation of primers hybridized with their complementary templates. We first observe that any cleavage of chains generates new primers and thus promotes their replication. The mutual replication of complementary chains is sustainable as long as cleavage and elongation rates are large compared to the dilution rate: $β_{0} m_{0} r > δ^{2}$ . This suggests that a faster, catalyzed cleavage would be selected by the evolution. Furthermore, DNA or RNA sequences capable of catalyzing site-specific cleavage are known to be relatively simple and readily arise via either natural or artificial selection (de la Peña and García-Robles, 2010; Williams et al., 1995; Salehi-Ashtiani and Szostak, 2001; Breaker and Joyce, 1994; Silverman, 2005; Chandra et al., 2009). The oligomer replication based on catalyzed cleavage is not trivial as it requires cooperativity between multiple chain types. Our study shows that a stable cooperative solution can be achieved with as few as four subpopulations of chains. Furthermore, we demonstrate that there is a wide range of conditions under which this catalytic network proliferates and significantly outcompetes non-catalytic ancestors of the constituent chains.

Appendix 1

Effect of template rehybridization

Consider a simple model in which two mutually complementary sequences act as templates for each other’s polymerization. Let $C$ and $\bar{C}$ be the concentrations of these unhybridized template chains. Let $c$ and $\bar{c}$ be the concentrations of unhybridized fragments of these two sequences, respectively. They may act as primers once bound to their complementary template. At the beginning of the ‘night’ phase, we assume all the chains to be in an unhybridized state, that is, $C = C_{0}$ , $\bar{C} = {\bar{C}}_{0}$ , $c = c_{0}$ , and $\bar{c} = c_{0}$ . We will describe the system’s kinetics during the ‘night’ phase by assuming that hybridization is essentially irreversible, except for the possibility of a primer strain displacement by a full chain. We only focus on the regime when the total concentration of primers, $c_{0}$ and ${\bar{c}}_{0}$ , is much smaller than that of free templates, $C$ and $\bar{C}$ . Note that the concentrations of bound primers are $c_{0} - c$ and ${\bar{c}}_{0} - \bar{c}$ , respectively.

The kinetics of this simple model is described by the following rate equations:

\dot{C} = - κ_{0} C \bar{C}

\dot{c} = - κ_{1} \bar{C} c + κ_{2} C (c_{0} - c)

The first equation accounts for the hybridization of mutually complementary template chains (assuming $c ≪ C$ and $\bar{c} ≪ \bar{C}$ , $κ_{0}$ is the corresponding association rate). The second equation accounts for the hybridization of primers with their complementary substrates and for the primer strand displacement due to the hybridization of the template with its full-length complementary. The respective association rates for these two processes, $κ_{1}$ and $κ_{2}$ , are dependent on the length and sequences of the chains involved, but generally comparable. The other two equations, for $\bar{C}$ and $\bar{c}$ , are obtained by replacing all the concentrations with their complementaries.

We first consider the asymmetric case, when $C_{0} > {\bar{C}}_{0}$ (without loss of generality). This implies that for a long enough duration of the night, the number of hybridized chains is limited by ${\bar{C}}_{0}$ . Thus, only one type of the template chain will remain in the solution, $C = C_{0} - {\bar{C}}_{0}$ , while the other will be completely hybridized, that is, $\bar{C} = 0$ . In turn, this implies that the situation will be even more drastic for the primers: $\bar{c} = 0$ (completely hybridized), while $c = c 0$ (completely free). As a result, the minority fraction, $\bar{C}$ would replicate faster and the balance would be restored, so that $C_{0} = {\bar{C}}_{0}$ .

Now, we will analyze the above set of differential equations for a symmetric case, $C_{0} = {\bar{C}}_{0}$ . The fraction of free template chains vanishes with time as a power law (rather than exponentially):

C (t) = \bar{C} (t) = \frac{C_{0}}{1 + C_{0} κ_{0} t}

As to the concentration of free fragments, it reaches a steady-state value between 0 and $c_{0}$ :

c = \frac{c_{0}}{1 + κ_{1} / κ_{2}}

Since the association rates $κ_{1}$ and $κ_{2}$ are comparable, we come to a surprising conclusion that a finite fraction of fragments, of the order of 1, will stay hybridized, despite the effect of strand displacement by longer chains:

\frac{c_{0} - c}{c_{0}} = \frac{1}{1 + κ_{2} / κ_{1}} \sim 1

The major assumption behind this calculation is that primer concentration remains small compared to that of templates. Based on Equation S3, this leads to the following estimate of the optimal duration of the night phase in terms of primer concentration:

t_{n i g h t} ≲ \frac{1}{κ c_{0}}

By assuming $κ \sim 10^{- 7} M^{- 1} s^{- 1}$ , typical for RNA, and $t_{n i g h t} \sim 10^{4} s$ (several hours), we conclude that the described mechanism is relevant for concentrations of specific primers, such as $a_{R} / a_{L}$ of the order of 10 pM. Note that (i) the concentration of templates (e.g., $a / \bar{a})$ would typically be significantly larger, (ii) we only count chains with specific sequences, and (iii) this refers to the concentration of oligomers, not overall monomer concentration. This implies that the typical monomer concentration should be of nM scales or greater.

Appendix 2

Cooperative steady-state solution

The steady state of Equations S7–S11 from the main text must satisfy the following set of equations:

β ϕ a = δ c - β_{0} M

δ b = β_{0} M ϕ + \frac{r m ϕ}{λ} a_{R}

β ϕ a = (r m + δ) a_{L}

β ϕ a = (r m (1 - ϕ + \frac{ϕ}{λ}) + δ) a_{R}

r m c = δ M

Let us define

μ = \frac{δ}{r m} .

By using Equation S11 to exclude variable $M$ , and replacing $b$ in Equation S8 with $ϕ (a + b)$ , the fixed point conditions can be rewritten as

β ϕ a = (δ - β_{0} / μ) c

a + b = \frac{1}{μ} (\frac{a_{R}}{λ} + \frac{β_{0}}{δ} c)

a_{L} = \frac{(μ - β_{0} / δ) c}{1 + μ}

a_{R} = \frac{(μ - β_{0} / δ) c}{1 + μ - (1 - λ^{- 1}) ϕ}

As $a + b$ , $a_{L}$ , and $a_{R}$ are now expressed in terms of $c$ , we use its definition, $c = a + b + a_{L} + a_{R}$ , to get

\begin{array}{ll} (1 - \frac{β_{0}}{μ δ}) c = (1 + \frac{1}{λ μ}) a_{R} + a_{L} = (1 - \frac{β_{0}}{μ δ}) \cdot \\ \cdot (\frac{1 + λ μ}{1 + μ - (λ - 1) ϕ} + \frac{1}{1 + μ}) \end{array}

This yields a compact expression for φ in terms of μ and λ only, thus invariant with respect to both cleavage rates, $β_{0}$ and β:

ϕ = (1 - \frac{λ μ}{λ - 1}) (1 + μ)

Another relationship is obtained by using Equation S14 and S16 to express $a = (1 - ϕ) (a + b)$ in terms of $c$ , and substituting it into Equation S13:

\frac{δ}{β} = ϕ (1 - ϕ) (\frac{1}{λ (1 - ϕ + μ) + ϕ} + \frac{1}{μ δ / β_{0} - 1})

By substituting Equation S18 into Equation S19, one gets the analytic relationship that allows to compute β for arbitrary values of μ, $β_{0}$ , δ, and λ:

\frac{δ}{β} = \frac{μ}{λ - 1} (1 - \frac{λ μ}{λ - 1}) (1 + (1 + μ) (1 + λ μ) \frac{β_{0}}{μ δ - β_{0}})

In practice, it is the monomer concentration that gets adjusted to its steady-state value $m^{*}$ , thus μ is the variable that has to satisfy the above equation, for given values of other parameters.

The stable fixed points only appear on the decreasing segment of the function $β (μ)$ . Thus, the limits of stability correspond to zero derivatives of the r.h.s. of Equation S20. This leads to the following condition:

(1 - \frac{2 λ μ}{λ - 1}) z^{2} + μ A^{'} (μ) z - A (μ) = 0

Here we defined $z = μ δ / β_{0} - 1$ ,

\begin{array}{ll} A (μ) = (1 - \frac{λ μ}{λ - 1}) (1 + μ) (1 + λ μ) \\ = 1 + \frac{(λ^{2} - λ - 1) μ - 2 λ μ^{2} - λ^{2} μ^{3}}{λ - 1} \end{array}

A^{'} (μ) = \frac{(λ^{2} - λ - 1) - 4 λ μ - 3 λ^{2} μ^{2}}{λ - 1}

The positive solution to the quadratic Equation S21 is given by

\begin{array}{ll} z (μ) = \frac{(λ - 1) μ A^{'} (μ)}{2 (λ - 1 - 2 λ μ)} \\ \cdot (\sqrt{1 + 4 (1 - \frac{2 λ μ}{λ - 1}) \frac{A (μ)}{μ^{2} A^{'} (μ)^{2}}} - 1) \end{array}

Now, by using Equation S20 and our definition of $x$ , both β and $β_{0}$ can be parameterized by μ:

β_{0} (μ) = \frac{μ δ}{z (μ) + 1}

β (μ) = \frac{δ (λ - 1)}{μ (1 - \frac{λ μ}{λ - 1} + \frac{A (μ)}{z (μ)})}

This parametric curve in $(β, β_{0})$ space defines the boundary of the region in which the non-trivial fixed point solution exists.

A simplified asymptotic relationship between β, $β_{0}$ , and μ can be obtained in the limit of $μ ≪ 1$ . In this case, Equation S24 implies $z (μ) \approx 1$ . By substituting this result into Equations S25 and S26, one obtains

\begin{aligned} μ & \approx 2 β_{0} / δ \end{aligned}

\begin{aligned} β & \approx \frac{(λ - 1) δ^{2}}{4 β_{0}} \end{aligned}

This result gives the lower bounds for μ and the upper bound for β consistent with the cooperative solution, for a given $β_{0}$ . The approximation is valid in the limit of $μ ≪ 1$ , that is, $β_{0} / δ ≪ 1$ .

Another asymptotic result can be obtained for vanishingly small non-catalytic cleavage rate $β_{0}$ . In that limit, Equation S20 turns into quadratic equation since $(1 + μ) (1 + λ μ) / μ ≪ δ / β_{0}$ . Its two solutions correspond to unstable and stable fixed points. Specifically, the stable branch is given by

μ \approx \frac{λ - 1}{2 λ} (1 - \sqrt{1 - \frac{4 λ δ}{β}})

The solution only exists for $β / δ \geq 4 λ$ , which sets the lower bound for β. This critical point corresponds to $μ = \frac{λ - 1}{2 λ}$ , which is the upper bound of μ. Note that this approximation is valid as long as

\frac{β_{0}}{δ} ≪ \frac{μ}{(1 + μ) (1 + λ μ)} \approx \frac{2 (λ - 1)}{(3 λ - 1) (λ + 1)}

To summarize, the range of β for which the cooperative solution exists, and the respective range of the steady-state monomer concentration $m_{c o o p}^{*}$ , can be approximated as

4 λ < \frac{β}{δ} < \frac{(λ - 1) δ}{4 β_{0}}

\frac{2 λ δ}{(λ - 1) β_{0}} < \frac{m_{c o o p}^{*}}{m^{*}} < 1 / 2

Here, $m^{*} = δ^{2} / (r β_{0})$ is the steady-state monomer concentration in the non-cooperative regime given by Equation 5.

The fitness parameter for the cooperative regime

\frac{m^{*}}{m_{c o o p}^{*}} = \frac{μ δ}{β_{0}}

Data availability

The current manuscript is a theoretical study, so no data have been generated for this manuscript.

References

1. Barany F
(1991) Genetic disease detection and DNA amplification using cloned thermostable ligase
PNAS 88:189–193.
https://doi.org/10.1073/pnas.88.1.189
- PubMed
- Google Scholar
1. Bartel DP
2. Szostak JW
(1993) Isolation of new ribozymes from a large pool of random sequences [see comment]
Science 261:1411–1418.
https://doi.org/10.1126/science.7690155
- PubMed
- Google Scholar
1. Breaker RR
2. Joyce GF
(1994) A DNA enzyme that cleaves RNA
Chemistry & Biology 1:223–229.
https://doi.org/10.1016/1074-5521(94)90014-0
- PubMed
- Google Scholar
1. Chamanian P
2. Higgs PG
(2022) Computer simulations of Template-Directed RNA Synthesis driven by temperature cycling in diverse sequence mixtures
PLOS Computational Biology 18:e1010458.
https://doi.org/10.1371/journal.pcbi.1010458
- PubMed
- Google Scholar
(2009) DNA-catalyzed sequence-specific hydrolysis of DNA
Nature Chemical Biology 5:718–720.
https://doi.org/10.1038/nchembio.201
- PubMed
- Google Scholar
1. de la Peña M
2. García-Robles I
(2010) Ubiquitous presence of the hammerhead ribozyme motif along the tree of life
RNA 16:1943–1950.
https://doi.org/10.1261/rna.2130310
- PubMed
- Google Scholar
1. Doudna JA
2. Szostak JW
(1989) RNA-catalysed synthesis of complementary-strand RNA
Nature 339:519–522.
https://doi.org/10.1038/339519a0
- PubMed
- Google Scholar
(2020) Deep sequencing of non-enzymatic RNA primer extension
Nucleic Acids Research 48:e70.
https://doi.org/10.1093/nar/gkaa400
- PubMed
- Google Scholar
1. Ferris JP
(2005) Mineral catalysis and prebiotic synthesis: Montmorillonite-catalyzed formation of RNA
Elements 1:145–149.
https://doi.org/10.2113/gselements.1.3.145
- Google Scholar
1. Gause GF
(1934) Experimental analysis of vito volterra’s mathematical theory of the struggle for existence
Science 79:16–17.
https://doi.org/10.1126/science.79.2036.16-a
- PubMed
- Google Scholar
1. Gilbert W
(1986) Evolution of antibodies: The road not taken
Nature 320:485–486.
https://doi.org/10.1038/320485a0
- PubMed
- Google Scholar
1. Higgs PG
2. Lehman N
(2015) The RNA World: molecular cooperation at the origins of life
Nature Reviews. Genetics 16:7–17.
https://doi.org/10.1038/nrg3841
- PubMed
- Google Scholar
1. Horning DP
2. Joyce GF
(2016) Amplification of RNA by an RNA polymerase ribozyme
PNAS 113:9786–9791.
https://doi.org/10.1073/pnas.1610103113
- PubMed
- Google Scholar
(1986) Self-cleavage of plus and minus RNA transcripts of avocado sunblotch viroid
Nucleic Acids Research 14:3627–3640.
https://doi.org/10.1093/nar/14.9.3627
- PubMed
- Google Scholar
1. Jerome CA
2. Kim HJ
3. Mojzsis SJ
4. Benner SA
5. Biondi E
(2022) Catalytic synthesis of polyribonucleic acid on prebiotic rock glasses
Astrobiology 22:629–636.
https://doi.org/10.1089/ast.2022.0027
- PubMed
- Google Scholar
(2019) Horizontal transfer between loose compartments stabilizes replication of fragmented ribozymes
PLOS Computational Biology 15:e1007094.
https://doi.org/10.1371/journal.pcbi.1007094
- PubMed
- Google Scholar
(2021) The emergence of RNA from the heterogeneous products of prebiotic nucleotide synthesis
Journal of the American Chemical Society 143:3267–3279.
https://doi.org/10.1021/jacs.0c12955
- PubMed
- Google Scholar
1. Kruger K
2. Grabowski PJ
3. Zaug AJ
4. Sands J
5. Gottschling DE
6. Cech TR
(1982) Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena
Cell 31:147–157.
https://doi.org/10.1016/0092-8674(82)90414-7
- PubMed
- Google Scholar
(2021) Structured sequences emerge from random pool when replicated by templated ligation
PNAS 118:e2018830118.
https://doi.org/10.1073/pnas.2018830118
- PubMed
- Google Scholar
1. Lincoln TA
2. Joyce GF
(2009) Self-sustained replication of an RNA enzyme
Science 323:1229–1232.
https://doi.org/10.1126/science.1167856
- PubMed
- Google Scholar
(1980) Efficient metal-ion catalyzed template-directed oligonucleotide synthesis
Science 208:1464–1465.
https://doi.org/10.1126/science.6247762
- PubMed
- Google Scholar
Book
1. Lukin M
(2010)
The Idea of Possible Selection for Cleavage Ribozymes in Prebiotic Evolution Was First Suggested to Us by Dr. Mark Lukin of Stony Brook University

Stony Brook University.
- Google Scholar
1. Mullis KB
2. Faloona FA
(1987) Specific synthesis of DNA in vitro via a polymerase-catalyzed chain reaction
Methods in Enzymology 155:335–350.
https://doi.org/10.1016/0076-6879(87)55023-6
- Google Scholar
1. Okazaki R
2. Okazaki T
3. Sakabe K
4. Sugimoto K
5. Kainuma R
6. Sugino A
7. Iwatsuki N
(1968) In vivo mechanism of DNA chain growth
Cold Spring Harbor Symposia on Quantitative Biology 33:129–143.
https://doi.org/10.1101/SQB.1968.033.01.017
- Google Scholar
1. Orgel LE
(2004) Prebiotic chemistry and the origin of the RNA world
Critical Reviews in Biochemistry and Molecular Biology 39:99–123.
https://doi.org/10.1080/10409230490460765
- PubMed
- Google Scholar
(1994) Three-dimensional structure of a hammerhead ribozyme
Nature 372:68–74.
https://doi.org/10.1038/372068a0
- PubMed
- Google Scholar
(1986) Autolytic processing of dimeric plant virus satellite RNA
Science 231:1577–1580.
https://doi.org/10.1126/science.231.4745.1577
- PubMed
- Google Scholar
(2022) Rolling Circles as a Means of Encoding Genes in the RNA World
Life 12:1373.
https://doi.org/10.3390/life12091373
- PubMed
- Google Scholar
1. Robertson MP
2. Joyce GF
(2012) The origins of the RNA world
Cold Spring Harbor Perspectives in Biology 4:a003608.
https://doi.org/10.1101/cshperspect.a003608
- PubMed
- Google Scholar
1. Saiki RK
2. Gelfand DH
3. Stoffel S
4. Scharf SJ
5. Higuchi R
6. Horn GT
7. Mullis KB
8. Erlich HA
(1988) Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase
Science 239:487–491.
https://doi.org/10.1126/science.2448875
- PubMed
- Google Scholar
1. Salehi-Ashtiani K
2. Szostak JW
(2001) In vitro evolution suggests multiple origins for the hammerhead ribozyme
Nature 414:82–84.
https://doi.org/10.1038/35102081
- PubMed
- Google Scholar
1. Scott WG
2. Finch JT
3. Klug A
(1995) The crystal structure of an all-RNA hammerhead ribozyme: A proposed mechanism for RNA catalytic cleavage
Cell 81:991–1002.
https://doi.org/10.1016/s0092-8674(05)80004-2
- PubMed
- Google Scholar
(2013) The hammerhead ribozyme: structure, catalysis, and gene regulation
Progress in Molecular Biology and Translational Science 120:1–23.
https://doi.org/10.1016/B978-0-12-381286-5.00001-9
- PubMed
- Google Scholar
1. Silverman SK
(2005) In vitro selection, characterization, and application of deoxyribozymes that cleave RNA
Nucleic Acids Research 33:6151–6163.
https://doi.org/10.1093/nar/gki930
- PubMed
- Google Scholar
(1968) Nonenzymatic synthesis of oligoadenylates on a polyuridylic acid template
PNAS 59:726–733.
https://doi.org/10.1073/pnas.59.3.726
- PubMed
- Google Scholar
1. Szostak JW
(2012) The eightfold path to non-enzymatic RNA replication
Journal of Systems Chemistry 3:2.
https://doi.org/10.1186/1759-2208-3-2
- Google Scholar
Book
1. Tilman D
(1982) Resource Competition and Community Structure
Princeton university press.
https://doi.org/10.1515/9780691209654
- Google Scholar
1. Tkachenko AV
2. Maslov S
(2015) Spontaneous emergence of autocatalytic information-coding polymers
The Journal of Chemical Physics 143:045102.
https://doi.org/10.1063/1.4922545
- PubMed
- Google Scholar
1. Tkachenko AV
2. Maslov S
(2018) Onset of natural selection in populations of autocatalytic heteropolymers
The Journal of Chemical Physics 149:134901.
https://doi.org/10.1063/1.5048488
- PubMed
- Google Scholar
1. Tupper AS
2. Higgs PG
(2021) Rolling-circle and strand-displacement mechanisms for non-enzymatic RNA replication at the time of the origin of life
Journal of Theoretical Biology 527:110822.
https://doi.org/10.1016/j.jtbi.2021.110822
- PubMed
- Google Scholar
(1968) Template-directed synthesis with adenosine-5’-phosphorimidazolide
Science 161:387.
https://doi.org/10.1126/science.161.3839.387
- PubMed
- Google Scholar
(1995) Selection of novel Mg(2+)-dependent self-cleaving ribozymes
The EMBO Journal 14:4551–4557.
https://doi.org/10.1002/j.1460-2075.1995.tb00134.x
- PubMed
- Google Scholar
1. Zhou L
2. Ding D
3. Szostak JW
(2021) The virtual circular genome model for primordial RNA replication
RNA 27:1–11.
https://doi.org/10.1261/rna.077693.120
- Google Scholar

Article and author information

Author details

Alexei V Tkachenko

Center for Functional Nanomaterials, Brookhaven National Laboratory, Upton, United States

Contribution
Conceptualization, Software, Formal analysis, Investigation, Visualization, Writing – original draft, Writing – review and editing

For correspondence
oleksiyt@bnl.gov

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0003-1291-243X
Sergei Maslov
1. Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, United States
2. Department of Physics, University of Illinois Urbana-Champaign, Urbana, United States
3. Carl R Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, United States
Contribution
Conceptualization, Software, Formal analysis, Investigation, Visualization, Writing – original draft, Writing – review and editing

For correspondence
maslov@illinois.edu

Competing interests
No competing interests declared

"This ORCID iD identifies the author of this article:" 0000-0002-3701-492X

Funding

Department of Energy Office of Science (DE-SC0012704)

Alexei V Tkachenko

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

This research used resources of the Center for Functional Nanomaterials, which is a US DOE Office of Science User Facility, at Brookhaven National Laboratory under contract no. DE-SC0012704.

Version history

Preprint posted: July 26, 2023 (view preprint)
Sent for peer review: August 18, 2023
Preprint posted: December 1, 2023 (view preprint)
Preprint posted: March 7, 2024 (view preprint)
Version of Record published: March 26, 2024 (version 1)

Cite all versions

You can cite all versions using the DOI https://doi.org/10.7554/eLife.91397. This DOI represents all versions, and will always resolve to the latest one.

Copyright

This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.