<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms12053085</article-id>
<article-id pub-id-type="publisher-id">ijms-12-03085</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Required Levels of Catalysis for Emergence of Autocatalytic Sets in Models of Chemical Reaction Systems</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Hordijk</surname><given-names>Wim</given-names></name><xref ref-type="aff" rid="af1-ijms-12-03085"><sup>1</sup></xref><xref ref-type="corresp" rid="c1-ijms-12-03085"><sup>★</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Kauffman</surname><given-names>Stuart A.</given-names></name><xref ref-type="aff" rid="af2-ijms-12-03085"><sup>2</sup></xref><xref ref-type="aff" rid="af3-ijms-12-03085"><sup>3</sup></xref><xref ref-type="aff" rid="af4-ijms-12-03085"><sup>4</sup></xref></contrib>
<contrib contrib-type="author">
<name><surname>Steel</surname><given-names>Mike</given-names></name><xref ref-type="aff" rid="af5-ijms-12-03085"><sup>5</sup></xref></contrib></contrib-group>
<aff id="af1-ijms-12-03085">
<label>1</label> Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland</aff>
<aff id="af2-ijms-12-03085">
<label>2</label> Tampere University of Technology, Finland; E-Mail: <email>stukauffman@gmail.com</email></aff>
<aff id="af3-ijms-12-03085">
<label>3</label> University of Vermont, 85 South Prospect Street Burlington, VT 05405, USA</aff>
<aff id="af4-ijms-12-03085">
<label>4</label> Santa Fe Institute, 1399 Hyde Park Road Santa Fe, NM 87501, USA</aff>
<aff id="af5-ijms-12-03085">
<label>5</label> Department of Mathematics and Statistics, University of Canterbury, Private Bag 4800, Christchurch, New Zealand; E-Mail: <email>mike.steel@canterbury.ac.nz</email></aff>
<author-notes>
<corresp id="c1-ijms-12-03085">
<label>★</label> Author to whom correspondence should be addressed; E-Mail: <email>wim@santafe.edu</email>; Tel.: +41-21-692-4268.</corresp></author-notes>
<pub-date pub-type="epub">
<day>12</day>
<month>5</month>
<year>2011</year></pub-date>
<pub-date pub-type="collection">
<year>2011</year></pub-date>
<volume>12</volume>
<issue>5</issue>
<fpage>3085</fpage>
<lpage>3101</lpage>
<history>
<date date-type="received">
<day>18</day>
<month>2</month>
<year>2011</year></date>
<date date-type="rev-recd">
<day>18</day>
<month>4</month>
<year>2011</year></date>
<date date-type="accepted">
<day>5</day>
<month>5</month>
<year>2011</year></date></history>
<permissions>
<copyright-statement>© 2011 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
<copyright-year>2011</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>The formation of a self-sustaining autocatalytic chemical network is a necessary but not sufficient condition for the origin of life. The question of whether such a network could form “by chance” within a sufficiently complex suite of molecules and reactions is one that we have investigated for a simple chemical reaction model based on polymer ligation and cleavage. In this paper, we extend this work in several further directions. In particular, we investigate in more detail the levels of catalysis required for a self-sustaining autocatalytic network to form. We study the size of chemical networks within which we might expect to find such an autocatalytic subset, and we extend the theoretical and computational analyses to models in which catalysis requires template matching.</p></abstract>
<kwd-group>
<kwd>autocatalytic sets</kwd>
<kwd>chemical reaction networks</kwd>
<kwd>template-based catalysis</kwd>
<kwd>origin of life</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>In previous work we introduced and investigated a mathematical model of catalytic reaction systems and autocatalytic sets [<xref ref-type="bibr" rid="b1-ijms-12-03085">1</xref>,<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>]. It was shown, both theoretically and computationally, that a linear growth rate in the level of catalysis (with increasing length <italic>n</italic> of the largest molecules in the system) is sufficient for autocatalytic sets to arise spontaneously [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>,<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>] in a well-known binary polymer model of catalytic reaction systems [<xref ref-type="bibr" rid="b4-ijms-12-03085">4</xref>,<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>].</p>
<p>In this paper we take a closer and more detailed look at our model and its results. First, we introduce a small modification to our mathematical definition of autocatalytic sets and the corresponding algorithm for finding them in general catalytic reaction systems (Section 3). This modification makes both the definition and the algorithm slightly simpler, and includes some specific (although probably rare) cases of autocatalytic sets which were previously left out. However, we show (formally, and in simulations) that this modified algorithm does not invalidate any previous results or conclusions.</p>
<p>Second, we show that there is a discrepancy between the theoretical and simulation results (Section 4). Both results show that a linear growth rate in level of catalysis is sufficient for the emergence of autocatalytic sets. However, there is a difference in the parameter values of these linear relations. Here, we recalculate and compare the required levels of catalysis in more detail and under different scenarios.</p>
<p>Third, we show how our model and algorithm can be used to answer other interesting questions relating to the emergence of autocatalytic sets (Section 5). In particular: What is the minimum required size of the molecule set for autocatalytic sets to emerge given a fixed (known) probability of a molecule catalyzing an arbitrary reaction?</p>
<p>Fourth, we show how more chemical realism can be included in our model, for example by considering template-based catalysis (Section 6). Even though this makes the model harder to analyze, it still generates interesting and useful results.</p>
<p>The next section briefly reviews our previously introduced model and definitions. The four sections following it will present the model modifications, extensions, and additional results mentioned above. The final section summarizes the main conclusions and discusses future directions. Mathematical proofs are provided in an <xref ref-type="app" rid="app1">Appendix</xref>.</p>
<p>Our study fits within a large and growing body of work that aims to formally model how self-sustaining biochemical systems necessary for life might have emerged. This is an area that has been investigated from many angles over the last three decades. Some approaches that are similar in scope but different in their specific details from the one we study here include models based on Petri-nets [<xref ref-type="bibr" rid="b6-ijms-12-03085">6</xref>], algebraic approaches based on metabolic closure (such as Rosen’s “(M,R) systems”, [<xref ref-type="bibr" rid="b7-ijms-12-03085">7</xref>–<xref ref-type="bibr" rid="b9-ijms-12-03085">9</xref>]), computer simulations of autocatalytic networks involving artificial chemistry [<xref ref-type="bibr" rid="b10-ijms-12-03085">10</xref>] or metabolic networks [<xref ref-type="bibr" rid="b11-ijms-12-03085">11</xref>], differential equation modeling [<xref ref-type="bibr" rid="b12-ijms-12-03085">12</xref>–<xref ref-type="bibr" rid="b15-ijms-12-03085">15</xref>], and Erdös-Renyi style random graph theory [<xref ref-type="bibr" rid="b16-ijms-12-03085">16</xref>]. The idea of autocatalytic sets as a precursor to life has certainly not been without criticism [<xref ref-type="bibr" rid="b17-ijms-12-03085">17</xref>–<xref ref-type="bibr" rid="b19-ijms-12-03085">19</xref>], but recent (exciting) experimental evidence shows that they are a real possibility [<xref ref-type="bibr" rid="b20-ijms-12-03085">20</xref>–<xref ref-type="bibr" rid="b24-ijms-12-03085">24</xref>]. Thus, we believe that our theoretical and computational studies and results are of direct relevance in the larger context of the origin of life [<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>,<xref ref-type="bibr" rid="b25-ijms-12-03085">25</xref>,<xref ref-type="bibr" rid="b26-ijms-12-03085">26</xref>].</p></sec>
<sec>
<label>2.</label>
<title>RAF Sets</title>
<p>Autocatalytic sets may have played an important role in the origin of life [<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>,<xref ref-type="bibr" rid="b15-ijms-12-03085">15</xref>,<xref ref-type="bibr" rid="b26-ijms-12-03085">26</xref>,<xref ref-type="bibr" rid="b27-ijms-12-03085">27</xref>], and are a necessary, although not sufficient, condition for life. Here, <italic>autocatalytic sets</italic> are defined more formally as <italic>RAF sets</italic> as follows (see [<xref ref-type="bibr" rid="b1-ijms-12-03085">1</xref>,<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>] for the full mathematical definition and notations). Given a <italic>catalytic reaction system</italic> (CRS), <italic>i.e.</italic>, a network of molecule types and catalyzed chemical reactions, a (sub)set <italic>ℛ</italic> of such reactions (plus the molecules involved in the reactions in <italic>ℛ</italic>) is called:
<list list-type="order">
<list-item>
<p><italic>Reflexively autocatalytic</italic> (RA) if every reaction in <italic>R</italic> is catalyzed by at least one molecule involved in any of the reactions in <italic>ℛ</italic>;</p></list-item>
<list-item>
<p><italic>F-generated</italic> (F) if every reactant in <italic>ℛ</italic> can be constructed from a small “food set” <italic>F</italic> by successive applications of reactions from <italic>ℛ</italic>;</p></list-item>
<list-item>
<p><italic>Reflexively autocatalytic and F-generated</italic> (RAF) if it is both RA and <italic>F</italic>.</p></list-item></list>The food set <italic>F</italic> contains molecules that are assumed to be freely available in the environment. Thus, an RAF set formally captures the notion of “catalytic closure”, <italic>i.e.</italic>, a self-sustaining set supported by a steady supply of (simple) molecules from some food set. <xref ref-type="fig" rid="f1-ijms-12-03085">Figure 1</xref> shows a simple example. In [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], we also introduced a polynomial-time algorithm to find RAF sets in general catalytic reaction systems.</p>
<p>Note (as already stated earlier [<xref ref-type="bibr" rid="b26-ijms-12-03085">26</xref>]) that this notion of an autocatalytic <italic>set</italic> is somewhat different from the (chemical) term autocatalytic <italic>reaction</italic> in which a molecule directly catalyzes its own production. With an autocatalytic set we do <italic>not</italic> mean a set of autocatalytic reactions, but rather a set of molecules and reactions which is collectively autocatalytic in the sense that all molecules help in producing each other (through mutual catalysis, and supported by a food set). Because of this confusion in terminology, we prefer to use the term “RAF set”.</p>
<sec>
<title/>
<sec>
<title>A Model of Catalytic Reaction Systems</title>
<p>In [<xref ref-type="bibr" rid="b4-ijms-12-03085">4</xref>,<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>], a random CRS model was introduced using binary polymers (bit strings) of length at most <italic>n</italic> as molecule types, ligation and cleavage reactions, and a probability <italic>p</italic> of any molecule catalyzing any reaction (<italic>n</italic> and <italic>p</italic> are parameters of the model). It was argued that in a “sufficiently complex” system (<italic>i.e.</italic>, large enough diversity of molecule types), autocatalytic sets would appear “spontaneously”. This was later criticized for requiring an exponential increase (with <italic>n</italic>) in the (average) number of reactions catalyzed by any one molecule [<xref ref-type="bibr" rid="b17-ijms-12-03085">17</xref>], which is chemically unrealistic. However, in [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>] it was shown computationally (by applying the RAF algorithm to instances of the random CRS model), and then confirmed theoretically in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>], that only a <italic>linear</italic> growth rate (with <italic>n</italic>) in the level of catalysis is sufficient for RAF sets to appear with high probability. Furthermore, with these results it is possible to quantify “sufficiently complex” in terms of the size of the molecule set (or maximum molecule length <italic>n</italic>) and the level of catalysis (average number of reactions catalyzed by any molecule). Here, we continue our investigations of this random CRS model, which we will refer to throughout as the <italic>binary polymer model</italic>.</p></sec></sec></sec>
<sec>
<label>3.</label>
<title>Modification of the Original Definition and Algorithm</title>
<p>In the original (mathematical) definition of RAF sets [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], the “RA” part is stated in terms of the <italic>support</italic> supp(<italic>ℛ</italic>) of a set of reactions <italic>ℛ</italic>, which is defined as all the molecules (reactants and products) that are involved in at least one of the reactions in <italic>ℛ</italic>. For a set of reactions <italic>ℛ</italic> to be reflexively autocatalytic (RA), it is required that every reaction in <italic>ℛ</italic> is catalyzed by at least one molecule in supp(<italic>ℛ</italic>).</p>
<p>Similarly, the “F” part of the definition is stated in terms of the <italic>closure</italic> cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>) of the food set <italic>F</italic> relative to the set of reactions <italic>ℛ</italic>, which is defined as <italic>F</italic> together with all molecules that can be constructed from <italic>F</italic> by repeated applications of reactions in <italic>ℛ</italic>. For a set of reactions <italic>ℛ</italic> to be <italic>F</italic>-generated (<italic>F</italic>), it is required that the reactants of every reaction in <italic>ℛ</italic> are in cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>).</p>
<p>Now imagine a situation where all reactions in <italic>ℛ</italic> conform to the above two requirements, except for one reaction <italic>r</italic> ∈ <italic>ℛ</italic> which is catalyzed by a molecule <italic>x</italic> ∈ <italic>F</italic> which is not involved in any other way in any reaction in <italic>ℛ</italic>. So, <italic>x</italic> ∈ cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>), but <italic>x</italic> ∉ supp(<italic>ℛ</italic>). In words, the molecule <italic>x</italic> is in the closure of the food set (by default, as it is part of the food set), but it is not in the support of the reaction set. So, according to the original definition, this set <italic>ℛ</italic> is not RAF (it does not conform to the RA part of the definition), even though logically one would consider this case to be a proper autocatalytic set.</p>
<p>To remedy this, and make sure these (probably rare but relevant) cases are also included, we propose a slight modification to our original definition of RAF sets as follows:</p>
<sec>
<label>3.1.</label>
<title>RAF Definition</title>
<p>Given a catalytic reaction system <italic>𝒬</italic> = (<italic>X, ℛ, C</italic>), with molecule set <italic>X</italic>, reaction set <italic>ℛ</italic>, catalyzation <italic>C</italic> (a set of molecule-reaction pairs indicating which molecules catalyze which reactions), a nonempty subset <italic>ℛ′</italic> of <italic>ℛ</italic> is said to be:
<list list-type="bullet">
<list-item>
<p><italic>Reflexively autocatalytic</italic> (RA) if for all reactions <italic>r</italic> ∈ <italic>ℛ′</italic> there exists a molecule <italic>x</italic> ∈ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>) such that (<italic>x, r</italic>) ∈ <italic>C</italic>;</p></list-item>
<list-item>
<p><italic>F-generated</italic> if ρ(<italic>ℛ′</italic>) ⊆ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>), where ρ(<italic>ℛ′</italic>) is the set of all reactants in <italic>ℛ′;</italic></p></list-item>
<list-item>
<p><italic>Reflexively autocatalytic and F -generated (RAF)</italic> if <italic>ℛ′</italic> is both RA and <italic>F</italic>.</p></list-item></list></p>
<p>In other words, a set of reactions <italic>ℛ′</italic> is RAF if each reaction in <italic>ℛ′</italic> is catalyzed by at least one molecule that can be produced by the set itself (starting from the food molecules), and all reactants (of all reactions in <italic>ℛ′</italic>) can also be produced by the set itself. So, instead of using supp(<italic>ℛ</italic>) in the RA part of the definition and cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>) in the F part, as in the original definition, the modified definition uses cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>) in both the RA and the F part, thus simplifying it slightly. Note that an RAF in the original setting of [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>] is still an RAF under this new definition by virtue of the following result, a proof of which is provided in the <xref ref-type="app" rid="app1">Appendix</xref>.</p>
<p><bold>Lemma 3.1</bold> <italic>Any set of reactions that forms an RAF under the earlier definition of [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>] is an RAF in the modified definition above.</italic></p>
<p>This may seem like a minor point, but it could be an important one. Consider, for example, the (reverse) citric acid cycle, which has been argued to have (possibly) been a major step in the origin of life by synthesizing the basic building blocks of organic molecules [<xref ref-type="bibr" rid="b28-ijms-12-03085">28</xref>,<xref ref-type="bibr" rid="b29-ijms-12-03085">29</xref>]. Even though at present the catalysts that drive the reactions in this cycle are enzymes (proteins), at the early stages these reactions could very well have been catalyzed by much simpler, naturally occurring elements, or ones that are easily synthesized (by basic chemistry) from freely available inorganic molecules [<xref ref-type="bibr" rid="b30-ijms-12-03085">30</xref>]. Thus, these original catalysts can be considered food molecules, but they are not involved (as reactants or products) in any of the reactions in the (reverse) citric acid cycle (<italic>i.e.</italic>, they are not in the support). As a consequence, this important metabolic network of reactions would not be classified as an RAF with our original definition, but it is indeed a true RAF according to our modified definition.</p>
<p>Lemma 3.1 has several other desirable properties. First, it ensures that the recent result that Rosen’s (M,R) systems can be viewed and studied within the RAF framework [<xref ref-type="bibr" rid="b9-ijms-12-03085">9</xref>], still holds. Second, it tells us that if a system has no RAF under the new definition, then it clearly has none under the original one, nor, indeed, under more embellished definitions of an RAF that impose further conditions so as to avoid “trivialities”. One such condition (from [<xref ref-type="bibr" rid="b1-ijms-12-03085">1</xref>]) would be to require that not all reactions in an RAF are catalyzed by molecules in <italic>F</italic> (or in some larger subset <italic>S</italic> of molecules that can be generated from <italic>F</italic> by catalysis from <italic>F</italic> and resulting molecules). However, we can accommodate this additional condition within the new RAF framework as follows. The algorithm we will describes below constructs a unique maximal RAF (under the new definition), and from this one can easily check whether an RAF exists that satisfies the additional non-triviality condition (simply check whether this maximal RAF contains a reaction that does not have all its catalysts in <italic>F</italic>, or the larger set <italic>S</italic>). Third, spontaneous reactions (those that can proceed without any catalyst) can be allowed (and form part of an RAF) if we formally extend <italic>F</italic> by an extra “token” element with directed (catalysis) arrows from that token to all such spontaneous reactions. And there are various other extensions, restrictions, or variations that can be imposed on our more general framework.</p>
<p>Of course there are many more properties relevant to the origin of self-sustaining biochemistry beyond such ad-hoc conditions to exclude trivialities—for example, the dynamics of the reactions (the quantity of reagents and products (stoichiometry) along with thermodynamic considerations) and the effect of inhibition and degrading side reactions [<xref ref-type="bibr" rid="b18-ijms-12-03085">18</xref>,<xref ref-type="bibr" rid="b19-ijms-12-03085">19</xref>]. However, our view here is that RAF sets should be regarded very much as minimal <italic>necessary</italic> conditions for such systems, rather than in any sense <italic>sufficient</italic>. Identifying RAF sets, and establishing conditions for their existence is thus a natural question on the road to finding viable candidates for the origin of early biochemistry.</p>
<p>An additional benefit of our modified RAF definition is that it significantly simplifies the corresponding algorithm for finding RAF sets in general catalytic reaction systems, and its correctness proof. The original algorithm is based on repeatedly (and alternately) applying two reduction steps (starting from the full reaction set) [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>]:
<list list-type="order">
<list-item>
<p>Remove all reactions that do not conform to the RA requirement;</p></list-item>
<list-item>
<p>Remove all reactions that do not conform to the <italic>F</italic> requirement.</p></list-item></list>However, since in our modified definition both the RA part and the <italic>F</italic> part are stated in terms of the closure, these two reduction steps can now be merged into one. As a consequence, the algorithm can be simplified to the following:</p></sec>
<sec>
<label>3.2.</label>
<title>RAF Algorithm</title>
<list list-type="order">
<list-item>
<p>Start with the complete set of reactions <italic>ℛ</italic> and the food set <italic>F</italic>;</p></list-item>
<list-item>
<p>Compute the closure of the food set cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>) relative to the current set of reactions <italic>ℛ</italic>;</p></list-item>
<list-item>
<p>For each reaction <italic>r</italic> ∈ <italic>ℛ</italic> for which (1) all catalysts, or (2) one or more reactants, are not in cl<italic><sub>ℛ</sub></italic>(<italic>F</italic>), remove <italic>r</italic> from <italic>ℛ</italic>;</p></list-item>
<list-item>
<p>Repeat steps 2 and 3 until no more reactions can be removed.</p></list-item></list>
<p>The resulting (reduced) reaction set <italic>ℛ</italic> is either the (maximal) RAF set contained in the given catalytic reaction system, or it is empty, in which case there is no RAF. This was already proved for the original algorithm in [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], and a (simpler) proof for the modified version is provided in the <xref ref-type="app" rid="app1">appendix</xref>. Note, however, that the overall running time of this modified algorithm is still the same as in the original case (<italic>𝒪</italic>(|<italic>ℛ</italic>|<sup>2</sup> log |<italic>ℛ</italic>|) worst-case, but shown to be sub-quadratic on average in practice [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>]), as it is dominated by step 2 (computing the closure of the food set). All results presented in this paper are generated with this new version of the RAF algorithm.</p></sec></sec>
<sec>
<label>4.</label>
<title>Required Levels of Catalysis</title>
<p>In [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], we showed through computer simulations that a linear growth rate in the level of catalysis (with <italic>n</italic>, the length of the largest molecules in the system) appears to be sufficient for RAF sets to occur with high probability in the binary polymer model with ligation and cleavage reactions and random catalysis. This was subsequently confirmed theoretically in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>]. However, even though both the computational and theoretical results give a linear relation for the required level of catalysis, there appears to be a significant difference in the values of the parameters of these linear functions.</p>
<p>This discrepancy can partly be explained by the fact that the theoretical analysis in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>] actually assumes RAF sets that involve <italic>all</italic> molecule types in the system, <italic>i.e.</italic>, an RAF that contains the entire molecule set <italic>X</italic> (but not necessarily all reactions). This, of course, is a much stronger assumption than used in the simulation studies in [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], where the RAF algorithm was used to find <italic>any</italic> RAF set, regardless of how many molecules or reactions it contains. But the question remains whether the discrepancy can be explained entirely by this difference.</p>
<p>To answer this, we repeated the original simulations, using the RAF algorithm to find <italic>any</italic> RAF set, but this time with the modified RAF algorithm. Then we also applied the RAF algorithm to look for RAF sets that involve <italic>all</italic> molecule types in <italic>X</italic>. In both cases, we collected statistics for the average number <italic>f</italic> (<italic>n</italic>) of reactions catalyzed by any molecule (<italic>i.e.</italic>, the level of catalysis) for which there is a probability <italic>P<sub>n</sub></italic> = 0.50 (or close to 0.50) of finding an RAF set in a number of instances of the random catalytic reaction model [<xref ref-type="bibr" rid="b31-ijms-12-03085">31</xref>]. From these statistics, we then estimated a linear function <italic>f</italic> (<italic>n</italic>) = <italic>a</italic> + <italic>bn</italic> using an ordinary least squares regression. We compare these results with the theoretical linear relation which can be calculated from Theorem 4.1 (ii) in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>] (using <italic>P<sub>n</sub></italic> = 0.50 and <italic>k</italic> = <italic>t</italic> = 2).</p>
<p>So, in short, we compare the required levels of catalysis for RAF sets to occur with high probability for three cases:
<list list-type="alpha-upper">
<list-item>
<p>Computational case for <italic>any</italic> RAF;</p></list-item>
<list-item>
<p>Computational case for <italic>all-molecule</italic> RAFs;</p></list-item>
<list-item>
<p>Theoretical case for <italic>all-molecule</italic> RAFs.</p></list-item></list></p>
<p>The computational values were calculated over 100 to 1000 (depending on the value of <italic>n</italic>) instances of the random catalytic reaction model, for <italic>n</italic> = 7, . . ., 20 (because of an exponential increase in the number of molecules |<italic>X</italic>| and reactions |<italic>ℛ</italic>| with <italic>n</italic>, we are computationally limited to about <italic>n</italic> = 20 in these simulations).</p>
<p><xref ref-type="table" rid="t1-ijms-12-03085">Table 1</xref> presents the linear relations estimated from the simulation data or calculated from the theoretical analysis. There is a difference of almost 2 orders of magnitude between the observed (simulation) slope of the required growth rate (case <bold>A</bold>) and the theoretical one (case <bold>C</bold>). However, as case <bold>B</bold> shows, this difference cannot be fully explained by the fact that the theoretical analysis assumes RAFs involving <italic>all</italic> molecules. Even with this stronger assumption, the theoretical slope (case <bold>C</bold>) is still more than twice as large as the one from the simulations (case <bold>B</bold>).</p>
<p><xref ref-type="fig" rid="f2-ijms-12-03085">Figure 2</xref> shows the complete data (dots for simulation values, lines for estimated or calculated linear relations). Clearly, even though the theoretically predicted value for <italic>f</italic> (<italic>n</italic>) grows quite fast with <italic>n</italic> (slope = 1.6339), the actual value grows only very slowly (slope = 0.0189). So, for example, for <italic>n</italic> = 20 the theoretically expected value for the average number of reactions that need to be catalyzed by any molecule to have RAF sets occurring with probability at least <italic>P<sub>n</sub></italic> = 0.50 is <italic>f<sub>C</sub></italic> (20) = 32.678, which seems unrealistically high. However, the actual level of catalysis required is only <italic>f<sub>A</sub></italic>(20) = 1.475, which is chemically much more plausible.</p></sec>
<sec>
<label>5.</label>
<title>Required Size of the Molecule Set</title>
<p>In addition to the level of catalysis required for RAF sets to emerge, we can use the RAF algorithm to answer other interesting, and related, questions. For example, one could ask what the minimum required size of the molecule set is (or, in the binary polymer model, the minimum size <italic>n</italic> of the largest molecules in the system) to get RAF sets with high probability, given a fixed (known) probability <italic>p</italic> of a molecule catalyzing an (arbitrary) reaction.</p>
<p>Suppose we fix the probability of catalysis at <italic>p</italic> = 0.00001, or perhaps even <italic>p</italic> = 0.000001 (one in a million), which is, for example, roughly the probability in phage display of a random peptide binding an arbitrary ligand [<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>,<xref ref-type="bibr" rid="b32-ijms-12-03085">32</xref>], and “the ease of polypeptide evolution with a small number of arbitrary sequences indicates that a significant fraction of all possible sequences may have functions, at least binding activity in correlation with catalytic activity” [<xref ref-type="bibr" rid="b33-ijms-12-03085">33</xref>]. In the binary polymer model, with these probabilities of catalysis, what value of <italic>n</italic> is required to get a probability of, say, <italic>P<sub>n</sub></italic> ≥ 0.50 of RAF sets occurring?</p>
<p><xref ref-type="fig" rid="f3-ijms-12-03085">Figure 3</xref> shows the results of applying the (modified) RAF algorithm to instances of the random binary polymer model (Section 2) for the two given values of <italic>p</italic> and for <italic>n</italic> = 5, . . ., 20. With <italic>p</italic> = 0.00001 (left curve in <xref ref-type="fig" rid="f3-ijms-12-03085">Figure 3</xref>), <italic>P<sub>n</sub></italic> = 0 for <italic>n ≤</italic> 12, but <italic>P</italic><sub>13</sub> = 0.982, and <italic>P<sub>n</sub></italic> = 1 for <italic>n</italic> ≥ 14. So, in this case a value of at least <italic>n</italic> = 13 is required to get RAF sets with high probability. For only a slightly higher probability of catalysis (<italic>p</italic> = 0.00002), a value of <italic>n</italic> = 12 would be sufficient (results not shown).</p>
<p>Similarly, with <italic>p</italic> = 0.000001 (right curve in <xref ref-type="fig" rid="f3-ijms-12-03085">Figure 3</xref>), <italic>P<sub>n</sub></italic> = 0 for <italic>n ≤</italic> 15, <italic>P</italic><sub>16</sub> = 0.939, and <italic>P<sub>n</sub></italic> = 1 for <italic>n ≥</italic> 17. So, in this case a value of at least <italic>n</italic> = 16 is required to get RAF sets with high probability. Again, for only a slightly higher probability of catalysis (<italic>p</italic> = 0.000002), a value of <italic>n</italic> = 15 would be sufficient (results not shown). The size of the molecule set in this case would be |<italic>X</italic>| = 65534.</p></sec>
<sec>
<label>6.</label>
<title>Template-Based Catalysis</title>
<p>One could argue that the binary polymer model used in our studies so far is perhaps somewhat oversimplified to be biologically or chemically realistic. However, the model serves as a useful starting point with which precise mathematical statements can be formulated and proved, or at least tested computationally. Furthermore, our RAF definition and algorithm are independent of the particular model that is used, and can in principle also be applied to real catalytic reaction systems (for example metabolic networks, of which the already mentioned citric acid cycle is a core element). And, equally importantly, it is actually not very difficult to add more chemical realism into our mathematical models.</p>
<p>As one particular example, we have considered <italic>template-based catalysis</italic> [<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>]. The idea here is that, in order to act as a catalyst, a molecule must match at least a certain area around the reaction site according to some template-based matching rule. Similar to, for example, base-pair complementarity in RNA and DNA, we could require a catalyst to match the complement of at least four positions around the reaction site of a ligation or cleavage (two on either side). Consider the ligation reaction
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:mn>00101</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>+</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>0011</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mo>&gt;</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>001010011</mml:mn></mml:mrow></mml:math></disp-formula>The reaction site template in this case is 0100 (the last two bits of the first binary polymer, plus the first two bits of the second polymer). A given molecule can only act as a catalyst for this reaction if, somewhere along its binary string representation, it contains the complement of this template, <italic>i.e.</italic>, 1011.</p>
<p>In line with the original random CRS model, and some initial simulations with such template-based catalysis [<xref ref-type="bibr" rid="b4-ijms-12-03085">4</xref>,<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>], we have included this idea as follows. For each combination of a molecule <italic>x</italic> ∈ <italic>X</italic> and a reaction <italic>r</italic> ∈ <italic>ℛ</italic>, if <italic>x</italic> matches (anywhere along its length) the complement of the reaction site template of length four of <italic>r</italic>, then with probability <italic>p</italic> the pair (<italic>x, r</italic>) is included in the set of catalyzation <italic>C</italic>. We present analytical results for this model in Section 6.1 below. However, in our simulations (Section 6.2 below), we used a slightly less constrained version as follows: If a molecule happens to be shorter than length four, then it only has to match part of the complement of the reaction site template, but we require catalysts to be of at least length two. So, the molecule 01 could also catalyze the reaction in the above example. This is mainly done to give some of the food molecules (which are all bit strings up to length two in our simulations) also a chance to act as catalysts.</p>
<p>Note that this template matching requirement is almost the same as in the original simulations [<xref ref-type="bibr" rid="b5-ijms-12-03085">5</xref>,<xref ref-type="bibr" rid="b34-ijms-12-03085">34</xref>–<xref ref-type="bibr" rid="b36-ijms-12-03085">36</xref>], except that we do not allow partial matches here. So, we are considering a slightly “stronger”, or more constrained case. Some initial results on similar simulations were reported in [<xref ref-type="bibr" rid="b37-ijms-12-03085">37</xref>].</p>
<sec sec-type="results">
<label>6.1.</label>
<title>Theoretical Results</title>
<p>Theorem 4.1 (ii) of Mossel and Steel (2005) shows that for polymers of length up to <italic>n</italic> over an alphabet of size <italic>κ</italic> ≥ 2, and with a food set of polymers of length up to <italic>t</italic>, the probability <italic>P</italic> (<italic>n</italic>) that there exists an all-molecule RAF is at least:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≥</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi></mml:mrow></mml:msup></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mi>t</mml:mi></mml:msup></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>when each molecule catalyzes on average (at least) <italic>λn</italic> reactions (provided also that <italic>λ</italic> &gt; log<italic><sub>e</sub></italic>(<italic>κ</italic>)). Notice that <italic>P</italic> (<italic>n</italic>) can be chosen as close to 1 as we wish by selecting <italic>λ</italic> large enough (and independently of <italic>n</italic>!). Thus, this result justifies the statement that the average number of reactions each molecule catalyzes needs to grow only linearly with <italic>n</italic> in order for there to be a given (high) probability of generating an RAF.</p>
<p>We now describe how this result modifies if catalysis is required to be template-based, as described above. Suppose that a polymer <italic>x</italic> can catalyze a given reaction only if <italic>x</italic> contains a substring of length <italic>s</italic> = <italic>s</italic><sub>1</sub> + <italic>s</italic><sub>2</sub> that is complementary to the end-segment (of length <italic>s</italic><sub>1</sub>) and the initial segment (of length <italic>s</italic><sub>2</sub>) of the two molecules involved in the cleavage or ligation. Thus, for the above set-up we have: <italic>s</italic><sub>1</sub> = <italic>s</italic><sub>2</sub> = 2 and so <italic>s</italic> = 4. We also assume that the probability that a molecule <italic>x</italic> catalyzes a reaction <italic>r</italic> with a complementary template just depends on <italic>x</italic> and not on <italic>r</italic> (this is the analogue of the template-free model assumption (<italic>R</italic><sub>2</sub>) in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>]).</p>
<p>The following result shows that RAFs will still arise with high probability under linear growth in the average number of reactions each molecule catalyzes, provided the constant involved is increased by a factor of <italic>κ<sup>s</sup></italic> (this factor would be 16 for the binary model with template size four). More precisely we have the following result, whose proof is provided in the <xref ref-type="app" rid="app1">Appendix</xref>.</p>
<p><bold>Theorem 6.1</bold> <italic>Let P<sub>s</sub>(n) be the probability that there exists an all-molecule RAF under this template matching model. Suppose that each molecule catalyzes (on average) at least λ<sub>s</sub>n reactions, where λ<sub>s</sub></italic> <italic>= κ<sup>s</sup>λ and λ &gt;</italic> log<italic><sub>e</sub>(κ). Then P<sub>s</sub>(n) satisfies the same inequality as P (n), namely</italic>
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≥</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi></mml:mrow></mml:msup></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mi>t</mml:mi></mml:msup></mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>κ</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula></p></sec>
<sec sec-type="results">
<label>6.2.</label>
<title>Computational Results</title>
<p>As with the original (random) model (see Section 4), we expect there to be a difference between the theoretically predicted required level of catalysis and the observed level (from simulations) in case of template-based catalysis. Unfortunately, we are computationally even more restricted with this template-based catalysis case as with the original model. The running time of our RAF algorithm is polynomial in the size of the reaction set |<italic>ℛ</italic>|, as was shown in [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>]. However, since |<italic>ℛ</italic>| is <italic>exponential</italic> in the size of the largest molecules <italic>n</italic>, <italic>i.e.</italic>, |<italic>ℛ</italic>| ∝ 2<italic><sup>n</sup></italic>, we can only go up to about <italic>n</italic> = 20 to get computational results in a reasonable amount of time (even on a parallel cluster) for the original model (Section 4). But for the template-based catalysis as described above, it is even worse. We now have to check for each pair of molecule <italic>x</italic> ∈ <italic>X</italic> and reaction <italic>r</italic> ∈ <italic>ℛ</italic> whether there is a template match between any part of the molecule and the complement of the (4-site) reaction template. Since both <italic>|X|</italic> and <italic>|ℛ|</italic> are ∝ 2<italic><sup>n</sup></italic>, this means that |<italic>X</italic> × <italic>ℛ</italic>| ∝ 2<sup>2</sup><italic><sup>n</sup></italic>, and in practice this means that we can only go up to about <italic>n</italic> = 16 with our template-based catalysis simulations.</p>
<p><xref ref-type="fig" rid="f4-ijms-12-03085">Figure 4</xref> shows the results of these simulations, using our modified algorithm to find RAF sets (of any size) in the template-based catalysis case, compared to the original model (case <bold>A</bold> in Section 4). As can be expected, for smaller values of <italic>n</italic>, a higher level of catalysis is needed to find RAF sets with high probability (again, <italic>P<sub>n</sub></italic> = 0.50 is taken as the transition point) in the case of template-based catalysis compared to the purely random model. Since each molecule type <italic>x</italic> ∈ <italic>X</italic> is now restricted, to some extent, in terms of which reactions it can catalyze, the system as a whole is more constrained, and it will be harder to get RAF sets.</p>
<p>However, for longer and longer molecule types, this restriction becomes less of a problem, as a longer molecule has an increasingly higher probability of matching a given 4-site template somewhere along its length. Indeed, the required level of catalysis for the template-based case tapers off as <italic>n</italic> increases, and converges to that of the original (purely random) model, reaching the same level at <italic>n</italic> = 16. Given that the purely random model is the “limiting” case for the template-based model, we expect that the level of catalysis for this template-based model will follow the same pattern as the base model for <italic>n &gt;</italic> 16, and follow a linear relation.</p></sec></sec>
<sec sec-type="conclusions">
<label>7.</label>
<title>Conclusions</title>
<p>Building on our previous work, we have investigated in more detail the levels of catalysis required for the emergence of autocatalytic sets in models of chemical reaction systems. First, we have shown that there is a discrepancy between the theoretically predicted levels and the computationally observed ones. Although both results yield a linear relation between the required level of catalysis and the size <italic>n</italic> of the largest molecules in the system, in practice this required level is almost two orders of magnitude smaller than the predicted one. However, this discrepancy cannot be fully explained by the fact that the theoretical result is based on a much stronger assumption (requiring the RAF sets to contain <italic>all</italic> molecules in the system). Even for large systems (<italic>n</italic> = 20, containing several millions of molecule types), each molecule only needs to catalyze (on average) between one and two reactions to have RAF sets appear with high probability, which is chemically highly plausible.</p>
<p>Next, we looked at the minimum size of the molecule set (<italic>i.e.</italic>, number of different types of molecules) necessary to get RAF sets given a fixed probability of catalysis. With a (realistic) probability on the order of one in a million of any molecule catalyzing any given reaction, we only need molecules up to length <italic>n</italic> = 15 or <italic>n</italic> = 16, or about 65,000 different molecule types. Again, this number is well within a plausible (experimental) range.</p>
<p>Finally, we studied an extension of the original model, including template-based catalysis. We established formally that in this case a linear growth rate in the level of catalysis also suffices for RAF sets to appear with arbitrary high probability. However, the simulations show that for smaller values of <italic>n</italic> (the length of the largest molecules in the system), this linear relation is not exact, and that a higher level of catalysis is necessary compared to the original (purely random) model to get RAF sets with a similar probability. But, as <italic>n</italic> increases, the template matching constraint becomes less of an issue, and the required levels of catalysis converge to those of the original model. This example shows how more chemical realism can be included in our model.</p>
<p>We intend to continue studying the emergence of autocatalytic sets in chemical reaction systems under various scenarios, models, and extensions. However, so far we have mainly studied the (static) underlying graph structures of such systems. One particularly important issue we hope to address in the future is the actual molecular dynamics in a given (catalytic) reaction system. Also, next to studying models of chemical reaction systems, we would like to apply our RAF framework to real (bio)chemical systems such as metabolic networks, or the collection of all known (organic) substrates and reactions. It is our hope that this line of work will help provide more insight into the (possible) origin of life in general.</p></sec></body>
<back>
<ack>
<p>The computations were performed at the Vital-IT (<ext-link xlink:href="http://www.vital-it.ch" ext-link-type="uri">http://www.vital-it.ch</ext-link>) center for high-performance computing of the Swiss Institute of Bioinformatics. We thank Jotun Hein for stimulating discussions. MS also thanks the Royal Society of New Zealand (James Cook fellowship).</p></ack>
<app-group>
<app id="app1">
<title>Appendix</title>
<sec>
<label>A1.</label>
<title>Proof of Lemma 3.1</title>
<p>Let RA denote the original definition of “Reflexively Autocatalytic” from [<xref ref-type="bibr" rid="b2-ijms-12-03085">2</xref>], and [RA<sub>1</sub>] the new one as given here in Section 3. We need to show that [RA] + [F-gen] implies [RA<sub>1</sub>], so suppose that <italic>r</italic> ∈ <italic>R′</italic>. By [RA] there exists <italic>x</italic> ∈ <italic>supp</italic>(<italic>ℛ′</italic>) : (<italic>x, r</italic>) ∈ <italic>C.</italic> Either <italic>x</italic> ∈ <italic>ρ</italic>(<italic>R′</italic>) or <italic>x</italic> ∈ <italic>π</italic>(<italic>R′</italic>) (the molecules that are products of at least one reaction in <italic>ℛ′</italic>). In the first case, by [F-gen], we have <italic>x</italic> ⋲ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>) so [RA<sub>1</sub>] holds. In the second case, where <italic>x</italic> ∈ <italic>π</italic>(<italic>R′</italic>) there exists (<italic>A, B</italic>) ∈ <italic>R′</italic> for which <italic>x</italic> ∈ <italic>B</italic>. By [F-gen], <italic>A</italic> ⊆ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>) and so <italic>B</italic> ⊆ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>) and thus <italic>x</italic> ∈ cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>). So once again [RA<sub>1</sub>] holds.</p></sec>
<sec>
<label>A2.</label>
<title>Proof of RAF Algorithm Correctness</title>
<p>Let <italic>ℛ′</italic> be the output of the RAF algorithm. We need to establish two claims, namely that (i) if <italic>ℛ′</italic> is empty then <italic>ℛ</italic> contains no RAF, and (ii) if <italic>ℛ′</italic> is nonempty then <italic>ℛ′</italic> is an RAF, and indeed the unique maximal RAF present in <italic>ℛ</italic>.</p>
<p>To establish claim (i) we first justify a third claim (iii): If a reaction <italic>r</italic> lies in any RAF (say <italic>ℛ</italic><sub>1</sub>) that is contained within an arbitrary subset (say <italic>ℛ</italic><sub>2</sub>) of <italic>ℛ</italic>, then <italic>r</italic> is not eliminated if step 3 of the RAF algorithm is applied with <italic>ℛ</italic><sub>2</sub> being taken as the “current set of reactions”. To see this, the fact that <italic>ℛ</italic><sub>1</sub> is an RAF implies that there exists a molecule <italic>x</italic> ∈ cl<sub><italic>ℛ</italic><sub>1</sub></sub> (<italic>F</italic>) that catalyzes <italic>r</italic>, and that the reactants of <italic>r</italic> are contained within cl<sub><italic>ℛ</italic><sub>1</sub></sub> (<italic>F</italic>). Now, <italic>ℛ</italic><sub>1</sub> ⊆ <italic>R</italic><sub>2</sub> and so cl<sub><italic>ℛ</italic><sub>1</sub></sub> (<italic>F</italic>) ⊆ cl<sub><italic>ℛ</italic><sub>2</sub></sub> (<italic>F</italic>); thus step 3 will not eliminate <italic>r</italic> from <italic>ℛ</italic><sub>2</sub>. This establishes claim (iii) which implies, by induction, the further claim (iv): If a reaction <italic>r</italic> lies within any RAF that is contained within <italic>ℛ</italic>, then <italic>r</italic> is never eliminated at step 3 of the RAF algorithm at any stage starting with <italic>ℛ</italic>. Claim (i) now follows immediately.</p>
<p>To establish claim (ii) we first note that, since the algorithm has terminated at some non-empty set of reactions <italic>ℛ′</italic>, no reaction in <italic>ℛ′</italic> can be further eliminated by step 3. By definition, this means that each reaction <italic>r</italic> in <italic>ℛ′</italic> has the properties that (1) at least one catalyst of <italic>r</italic> is in cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>) and (2) all reactants of <italic>r</italic> are in cl<italic><sub>ℛ′</sub></italic> (<italic>F</italic>). Condition (1) implies that <italic>ℛ′</italic> satisfies the (RA) condition, and property (2) shows that <italic>ℛ′</italic> satisfies the F-generated condition. Thus <italic>ℛ′</italic> is an RAF; moreover it is the unique maximal RAF by virtue of claim (iv) established in the previous paragraph.</p></sec>
<sec>
<label>A3.</label>
<title>Proof of Theorem 6.1</title>
<p>We first recall two quantities (<italic>x<sub>m</sub>, r<sub>m</sub></italic>) from [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>]. Let
<disp-formula id="FD1">
<label>(1)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>m</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>m</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula>denote the number of polymers of length at most <italic>m</italic> over an alphabet of size <italic>κ</italic> and let <italic>r<sub>m</sub></italic> denote the number of forward (ligation) reactions that produce polymers of length at most <italic>m</italic>. We have:
<disp-formula id="FD2">
<label>(2)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mi>m</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>m</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mi>m</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula></p>
<p>For any polymer <italic>x</italic> of length at most <italic>n</italic> let <italic>T<sub>s</sub></italic>(<italic>x</italic>) denote the set of polymers of length <italic>s</italic> that occur within <italic>x</italic>, and for any polymer <italic>t</italic> of length <italic>s</italic> let <italic>X<sub>n</sub></italic>(<italic>t</italic>) denote the set of polymers of length at most <italic>n</italic> that contain at least one copy of <italic>t</italic>. Thus,
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>∈</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>⇔</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>x</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>∈</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>n</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>Clearly,
<disp-formula id="FD3">
<label>(3)</label>
<mml:math display="block">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msup></mml:mrow></mml:math></disp-formula>Also, for <italic>n ≥ s</italic>, and <italic>t</italic> any polymer of length <italic>s</italic> we have:
<disp-formula id="FD4">
<label>4</label>
<mml:math display="block">
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>X</mml:mi></mml:mrow>
<mml:mi>n</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≥</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>+</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>κ</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>+</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mi> </mml:mi>
<mml:mo>+</mml:mo>
<mml:mi> </mml:mi>
<mml:mo>⋯</mml:mo>
<mml:mi> </mml:mi>
<mml:mo>+</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msup>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula></p>
<p>Given a (forward) reaction <italic>r</italic>, let <italic>t<sub>r</sub></italic> denote the complement of the template polymer of length <italic>s</italic> = <italic>s</italic><sub>1</sub> + <italic>s</italic><sub>2</sub> (which is also polymer of length <italic>s</italic>). If we let <italic>R<sub>n</sub></italic>(<italic>t</italic>) denote the set of forward reactions <italic>r</italic> with <italic>t<sub>r</sub></italic> = <italic>t</italic>, then:
<disp-formula id="FD5">
<label>(5)</label>
<mml:math display="block">
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mi>n</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula></p>
<p>In the template polymerization model, <italic>x</italic> can catalyze <italic>r</italic> only if <italic>x</italic> ∈ <italic>X</italic>(<italic>t<sub>r</sub></italic>) (or, equivalently, if <italic>r</italic> ∈ <italic>R<sub>n</sub></italic>(<italic>t</italic>) for some <italic>t</italic> ∈ <italic>T<sub>s</sub></italic>(<italic>x</italic>)). Let <italic>p</italic>(<italic>x, r</italic>) be the probability that molecule <italic>x</italic> catalyzes reaction <italic>r</italic>. We have assumed that each molecule that could catalyze reaction <italic>r</italic> (by virtue of template matching) has the same probability of doing so, that is:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>{</mml:mo>
<mml:mrow>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mtext>if</mml:mtext>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>∈</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mi>n</mml:mi></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mtext>otherwise</mml:mtext></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>p<sub>x</sub></italic> just depends on <italic>x</italic> and not <italic>r</italic> (the analogue of requirement (R2) in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>]).</p>
<p>Thus, if we let <italic>μ<sub>x</sub></italic> be the expected number of (forward) reactions (<italic>r</italic> ∈ <italic>R</italic><sub>+</sub>(<italic>n</italic>)) that a molecule <italic>x</italic> catalyzes, then:
<disp-formula id="FD6">
<label>(6)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:munder>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>∈</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mo>+</mml:mo></mml:msub>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:munder>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>·</mml:mo>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula>where <italic>S</italic>(<italic>x</italic>) = {<italic>r</italic> ∈ <italic>R</italic><sub>+</sub>(<italic>n</italic>) : <italic>x</italic> ∈ <italic>X</italic> (<italic>t<sub>r</sub></italic>)}. Note that <italic>|S</italic>(<italic>x</italic>)<italic>|</italic> is bounded above by the number of pairs (<italic>t, r</italic>) where <italic>t</italic> ∈ <italic>T<sub>s</sub></italic>(<italic>x</italic>) and <italic>r</italic> ∈ <italic>R</italic><sub>+</sub>(<italic>n</italic>) : <italic>r<sub>t</sub></italic> = <italic>t</italic>. By virtue of Inequality (3) and <xref ref-type="disp-formula" rid="FD5">Equation (5)</xref> this set of pairs has size at most <italic>κ<sup>s</sup>r<sub>n–s</sub></italic> and so
<disp-formula id="FD7">
<label>(7)</label>
<mml:math display="block">
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msup>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula></p>
<p>Now, Inequality (2) applied to <italic>m</italic> = <italic>n – s</italic>, and (7) gives:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:mrow>
<mml:mo>|</mml:mo>
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>x</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>|</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msup>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>)</mml:mo></mml:mrow></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>s</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></disp-formula>Thus, by the assumption that <italic>μ<sub>x</sub></italic> ≥ <italic>λ<sub>s</sub>n</italic> and applying (6), we obtain:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>·</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>n</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>μ</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mi>s</mml:mi></mml:msup>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>s</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>·</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>n</mml:mi></mml:mrow></mml:math></disp-formula>Since <italic>λ<sub>s</sub></italic> = <italic>κ<sup>s</sup>λ</italic> this last inequality can be written as:
<disp-formula id="FD8">
<label>(8)</label>
<mml:math display="block">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub>
<mml:mi> </mml:mi>
<mml:mo>≥</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mi>λ</mml:mi>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>s</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:math></disp-formula></p>
<p>Now, following [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>], for a forward reaction <italic>r</italic> ∈ <italic>R</italic><sub>+</sub>(<italic>n</italic>) let 
<inline-formula>
<mml:math>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>q</mml:mi></mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>*</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> be the probability that <italic>r</italic> is not catalyzed by any molecule. Then
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>q</mml:mi></mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>*</mml:mo></mml:msubsup>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:munder>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>∈</mml:mo>
<mml:mi>X</mml:mi>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:munder>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>r</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>=</mml:mo>
<mml:mi> </mml:mi>
<mml:munder>
<mml:mo>∏</mml:mo>
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mo>∈</mml:mo>
<mml:mi>X</mml:mi>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi></mml:mrow>
<mml:mi>r</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:munder>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mi>x</mml:mi></mml:msub></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow></mml:mrow></mml:math></disp-formula></p>
<p>By Inequalities (4) and (8) and the inequality (1 – <italic>x</italic>)<italic><sup>y</sup></italic> ≤ exp(–<italic>xy</italic>) for <italic>x, y &gt;</italic> 0 we obtain:
<disp-formula>
<mml:math display="block">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>q</mml:mi></mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>*</mml:mo></mml:msubsup>
<mml:mi> </mml:mi>
<mml:mo>≤</mml:mo>
<mml:mi> </mml:mi>
<mml:mtext>exp</mml:mtext>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>·</mml:mo>
<mml:mi> </mml:mi>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>κ</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>+</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:msup>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mi>s</mml:mi>
<mml:mo>/</mml:mo>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi></mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>−</mml:mo>
<mml:mi>s</mml:mi></mml:mrow></mml:msub>
<mml:mi> </mml:mi>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi>κ</mml:mi>
<mml:mi> </mml:mi>
<mml:mo>−</mml:mo>
<mml:mi> </mml:mi>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mfrac></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mi> </mml:mi>
<mml:mo>&lt;</mml:mo>
<mml:mi> </mml:mi>
<mml:mtext>exp</mml:mtext>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mo>−</mml:mo>
<mml:mi>λ</mml:mi></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></disp-formula>where the second inequality follows by replacing <italic>x<sub>n–s</sub></italic> by the expression given by <xref ref-type="disp-formula" rid="FD1">Equation (1)</xref> for <italic>m</italic> = <italic>n</italic> – <italic>s</italic> and checking that the ratio that post-multiplies <italic>λ</italic> is greater than 1. The remainder of the proof now follows the argument in [<xref ref-type="bibr" rid="b3-ijms-12-03085">3</xref>] for the proof of Theorem 4.1 (ii).</p></sec></app></app-group>
<ref-list>
<title>References and Notes</title>
<ref id="b1-ijms-12-03085"><label>1.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steel</surname><given-names>M</given-names></name></person-group><article-title>The emergence of a self-catalysing structure in abstract origin-of-life models</article-title><source>Appl. Math. Lett</source><year>2000</year><volume>3</volume><fpage>91</fpage><lpage>95</lpage></citation></ref>
<ref id="b2-ijms-12-03085"><label>2.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hordijk</surname><given-names>W</given-names></name><name><surname>Steel</surname><given-names>M</given-names></name></person-group><article-title>Detecting autocatalytic, self-sustaining sets in chemical reaction systems</article-title><source>J. Theor. Biol</source><year>2004</year><volume>227</volume><fpage>451</fpage><lpage>461</lpage><pub-id pub-id-type="doi">10.1016/j.jtbi.2003.11.020</pub-id><pub-id pub-id-type="pmid">15038982</pub-id></citation></ref>
<ref id="b3-ijms-12-03085"><label>3.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mossel</surname><given-names>E</given-names></name><name><surname>Steel</surname><given-names>M</given-names></name></person-group><article-title>Random biochemical networks: The probability of self-sustaining autocatalysis</article-title><source>J. Theor. Biol</source><year>2005</year><volume>233</volume><fpage>327</fpage><lpage>336</lpage><pub-id pub-id-type="doi">10.1016/j.jtbi.2004.10.011</pub-id><pub-id pub-id-type="pmid">15652142</pub-id></citation></ref>
<ref id="b4-ijms-12-03085"><label>4.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kauffman</surname><given-names>SA</given-names></name></person-group><article-title>Autocatalytic sets of proteins</article-title><source>J. Theor. Biol</source><year>1986</year><volume>119</volume><fpage>1</fpage><lpage>24</lpage><pub-id pub-id-type="doi">10.1016/S0022-5193(86)80047-9</pub-id><pub-id pub-id-type="pmid">3713221</pub-id></citation></ref>
<ref id="b5-ijms-12-03085"><label>5.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kauffman</surname><given-names>SA</given-names></name></person-group><source>The Origins of Order</source><publisher-name>Oxford University Press</publisher-name><publisher-loc>New York, NY, USA</publisher-loc><year>1993</year></citation></ref>
<ref id="b6-ijms-12-03085"><label>6.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sharov</surname><given-names>A</given-names></name></person-group><article-title>Self-reproducing systems: Structure, niche relations and evolution</article-title><source>BioSystems</source><year>1991</year><volume>25</volume><fpage>237</fpage><lpage>249</lpage><pub-id pub-id-type="doi">10.1016/0303-2647(91)90022-D</pub-id><pub-id pub-id-type="pmid">1742472</pub-id></citation></ref>
<ref id="b7-ijms-12-03085"><label>7.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Letelier</surname><given-names>JC</given-names></name><name><surname>Soto-Andrade</surname><given-names>J</given-names></name><name><surname>Abarzúa</surname><given-names>FG</given-names></name><name><surname>Cornish-Bowden</surname><given-names>A</given-names></name><name><surname>Cárdenas</surname><given-names>ML</given-names></name></person-group><article-title>Organizational invariance and metabolic closure: Analysis in terms of (M,R) systems</article-title><source>J. Theor. Biol</source><year>2006</year><volume>238</volume><fpage>949</fpage><lpage>961</lpage><pub-id pub-id-type="doi">10.1016/j.jtbi.2005.07.007</pub-id><pub-id pub-id-type="pmid">16122760</pub-id></citation></ref>
<ref id="b8-ijms-12-03085"><label>8.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cornish-Bowden</surname><given-names>A</given-names></name><name><surname>Cárdenas</surname><given-names>ML</given-names></name><name><surname>Letelier</surname><given-names>JC</given-names></name><name><surname>Soto-Andrade</surname><given-names>J</given-names></name></person-group><article-title>Beyond reductionism: Metabolic circularity as a guiding vision for a real biology of systems</article-title><source>Proteomics</source><year>2007</year><volume>7</volume><fpage>839</fpage><lpage>845</lpage><pub-id pub-id-type="doi">10.1002/pmic.200600431</pub-id><pub-id pub-id-type="pmid">17370262</pub-id></citation></ref>
<ref id="b9-ijms-12-03085"><label>9.</label><citation citation-type="other"><person-group person-group-type="author"><name><surname>Jaramillo</surname><given-names>S</given-names></name><name><surname>Honorato-Zimmer</surname><given-names>R</given-names></name><name><surname>Pereira</surname><given-names>U</given-names></name><name><surname>Contreras</surname><given-names>D</given-names></name><name><surname>Reynaert</surname><given-names>B</given-names></name><name><surname>Hernández</surname><given-names>V</given-names></name><name><surname>Soto-Andrade</surname><given-names>J</given-names></name><name><surname>Cárdenas</surname><given-names>M</given-names></name><name><surname>Cornish-Bowden</surname><given-names>A</given-names></name><name><surname>Letelier</surname><given-names>J</given-names></name></person-group><article-title>(M,R) Systems and RAF Sets: Common Ideas, Tools and Projections</article-title><conf-name>Proceedings of the Alife XII Conference</conf-name><conf-loc>Odense, Denmark</conf-loc><conf-date>19–23 August, 2010</conf-date><fpage>94</fpage><lpage>100</lpage></citation></ref>
<ref id="b10-ijms-12-03085"><label>10.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flamm</surname><given-names>C</given-names></name><name><surname>Ullrich</surname><given-names>A</given-names></name><name><surname>Ekker</surname><given-names>H</given-names></name><name><surname>Mann</surname><given-names>M</given-names></name><name><surname>Hoegerl</surname><given-names>D</given-names></name><name><surname>Rohrschneider</surname><given-names>M</given-names></name><name><surname>Sauer</surname><given-names>S</given-names></name><name><surname>Scheuermann</surname><given-names>G</given-names></name><name><surname>Klemm</surname><given-names>K</given-names></name><name><surname>Hofacker</surname><given-names>IL</given-names></name><etal/></person-group><article-title>Evolution of metabolic networks: A computational framework</article-title><source>J Syst Chem</source><year>2010</year><volume>1</volume><pub-id pub-id-type="doi">10.1186/1759-2208-1-4.</pub-id></citation></ref>
<ref id="b11-ijms-12-03085"><label>11.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kun</surname><given-names>A</given-names></name><name><surname>Papp</surname><given-names>B</given-names></name><name><surname>Szathmáry</surname><given-names>E</given-names></name></person-group><article-title>Computational identification of obligatorily autocatalytic replicators embedded in metabolic networks</article-title><source>Genome Biol</source><year>2008</year><volume>9</volume><pub-id pub-id-type="doi">10.1186/gb-2008-9-3-r51.</pub-id></citation></ref>
<ref id="b12-ijms-12-03085"><label>12.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Awazu</surname><given-names>A</given-names></name><name><surname>Kaneko</surname><given-names>K</given-names></name></person-group><article-title>Discretness-induced transition in catalytic reaction networks</article-title><source>Phys Rev E</source><year>2007</year><volume>76</volume><fpage>041915:1</fpage><lpage>041915:8</lpage></citation></ref>
<ref id="b13-ijms-12-03085"><label>13.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brogioli</surname><given-names>D</given-names></name></person-group><article-title>Marginally stable chemical systems as precursors of life</article-title><source>Phys Rev Lett</source><year>2010</year><volume>105</volume><fpage>058102:1</fpage><lpage>058102:4</lpage></citation></ref>
<ref id="b14-ijms-12-03085"><label>14.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartsev</surname><given-names>SI</given-names></name><name><surname>Mezhevikin</surname><given-names>VV</given-names></name></person-group><article-title>On initial steps of chemical prebiotic evolution: Triggering autocatalytic reaction of oligomerization</article-title><source>Adv. Space Res</source><year>2008</year><volume>42</volume><fpage>2008</fpage><lpage>2013</lpage><pub-id pub-id-type="doi">10.1016/j.asr.2008.07.002</pub-id></citation></ref>
<ref id="b15-ijms-12-03085"><label>15.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dyson</surname><given-names>FJ</given-names></name></person-group><article-title>A model for the origin of life</article-title><source>J. Mol. Evol</source><year>1982</year><volume>18</volume><fpage>344</fpage><lpage>350</lpage><pub-id pub-id-type="doi">10.1007/BF01733901</pub-id><pub-id pub-id-type="pmid">7120429</pub-id></citation></ref>
<ref id="b16-ijms-12-03085"><label>16.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bollobas</surname><given-names>B</given-names></name><name><surname>Rasmussen</surname><given-names>S</given-names></name></person-group><article-title>First cycles in random directed graph processes</article-title><source>Discret. Math</source><year>1989</year><volume>75</volume><fpage>55</fpage><lpage>68</lpage><pub-id pub-id-type="doi">10.1016/0012-365X(89)90078-2</pub-id></citation></ref>
<ref id="b17-ijms-12-03085"><label>17.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lifson</surname><given-names>S</given-names></name></person-group><article-title>On the crucial stages in the origin of animate matter</article-title><source>J. Mol. Evol</source><year>1997</year><volume>44</volume><fpage>1</fpage><lpage>8</lpage><pub-id pub-id-type="doi">10.1007/PL00006115</pub-id><pub-id pub-id-type="pmid">9073329</pub-id></citation></ref>
<ref id="b18-ijms-12-03085"><label>18.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Szathmary</surname><given-names>E</given-names></name></person-group><article-title>The evolution of replicators</article-title><source>Philos. Trans. R. Soc. Lond. B</source><year>2000</year><volume>355</volume><fpage>1669</fpage><lpage>1676</lpage><pub-id pub-id-type="doi">10.1098/rstb.2000.0730</pub-id></citation></ref>
<ref id="b19-ijms-12-03085"><label>19.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Orgel</surname><given-names>LE</given-names></name></person-group><article-title>The implausibility of metabolic cycles on the prebiotic earth</article-title><source>PLoS Biol</source><year>2008</year><volume>6</volume><fpage>5</fpage><lpage>13</lpage></citation></ref>
<ref id="b20-ijms-12-03085"><label>20.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sievers</surname><given-names>D</given-names></name><name><surname>von Kiedrowski</surname><given-names>G</given-names></name></person-group><article-title>Self-replication of complementary nucleotide-based oligomers</article-title><source>Nature</source><year>1994</year><volume>369</volume><fpage>221</fpage><lpage>224</lpage><pub-id pub-id-type="doi">10.1038/369221a0</pub-id><pub-id pub-id-type="pmid">8183342</pub-id></citation></ref>
<ref id="b21-ijms-12-03085"><label>21.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname><given-names>DH</given-names></name><name><surname>Severin</surname><given-names>K</given-names></name><name><surname>Ghadiri</surname><given-names>MR</given-names></name></person-group><article-title>Autocatalytic networks: The transition from molecular self-replication to molecular ecosystems</article-title><source>Curr. Opin. Chem. Biol</source><year>1997</year><volume>1</volume><fpage>491</fpage><lpage>496</lpage><pub-id pub-id-type="doi">10.1016/S1367-5931(97)80043-9</pub-id><pub-id pub-id-type="pmid">9667892</pub-id></citation></ref>
<ref id="b22-ijms-12-03085"><label>22.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashkenasy</surname><given-names>G</given-names></name><name><surname>Jegasia</surname><given-names>R</given-names></name><name><surname>Yadav</surname><given-names>M</given-names></name><name><surname>Ghadiri</surname><given-names>MR</given-names></name></person-group><article-title>Design of a directed molecular network</article-title><source>Proc. Nat. Acad. Sci. USA</source><year>2004</year><volume>101</volume><fpage>10872</fpage><lpage>10877</lpage><pub-id pub-id-type="doi">10.1073/pnas.0402674101</pub-id><pub-id pub-id-type="pmid">15256596</pub-id></citation></ref>
<ref id="b23-ijms-12-03085"><label>23.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hayden</surname><given-names>EJ</given-names></name><name><surname>von Kieddrowski</surname><given-names>G</given-names></name><name><surname>Lehman</surname><given-names>N</given-names></name></person-group><article-title>Systems chemistry on ribozyme self-construction: Evidence for anabolic autocatalysis in a recombination network</article-title><source>Angew. Chem. Int. Ed</source><year>2008</year><volume>120</volume><fpage>8552</fpage><lpage>8556</lpage><pub-id pub-id-type="doi">10.1002/ange.200802177</pub-id></citation></ref>
<ref id="b24-ijms-12-03085"><label>24.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lincoln</surname><given-names>TA</given-names></name><name><surname>Joyce</surname><given-names>GF</given-names></name></person-group><article-title>Self-Sustained Replication of an RNA Enzyme</article-title><source>Science</source><year>2009</year><volume>323</volume><fpage>1229</fpage><lpage>1232</lpage><pub-id pub-id-type="doi">10.1126/science.1167856</pub-id><pub-id pub-id-type="pmid">19131595</pub-id></citation></ref>
<ref id="b25-ijms-12-03085"><label>25.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Penny</surname><given-names>D</given-names></name></person-group><article-title>An interpretive review of the origin of life research</article-title><source>Biol. Philos</source><year>2005</year><volume>20</volume><fpage>633</fpage><lpage>671</lpage><pub-id pub-id-type="doi">10.1007/s10539-004-7342-6</pub-id></citation></ref>
<ref id="b26-ijms-12-03085"><label>26.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hordijk</surname><given-names>W</given-names></name><name><surname>Hein</surname><given-names>J</given-names></name><name><surname>Steel</surname><given-names>M</given-names></name></person-group><article-title>Autocatalytic sets and the origin of life</article-title><source>Entropy</source><year>2010</year><volume>12</volume><fpage>1733</fpage><lpage>1742</lpage><pub-id pub-id-type="doi">10.3390/e12071733</pub-id></citation></ref>
<ref id="b27-ijms-12-03085"><label>27.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Dyson</surname><given-names>FJ</given-names></name></person-group><source>Origins of Life</source><publisher-name>Cambridge University Press</publisher-name><publisher-loc>Cambridge, UK</publisher-loc><year>1985</year></citation></ref>
<ref id="b28-ijms-12-03085"><label>28.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morowitz</surname><given-names>HJ</given-names></name><name><surname>Kostelnik</surname><given-names>JD</given-names></name><name><surname>Yang</surname><given-names>J</given-names></name><name><surname>Cody</surname><given-names>GD</given-names></name></person-group><article-title>The origin of intermediary metabolism</article-title><source>Proc. Nat. Acad. Sci. USA</source><year>2000</year><volume>97</volume><fpage>7704</fpage><lpage>7708</lpage><pub-id pub-id-type="doi">10.1073/pnas.110153997</pub-id><pub-id pub-id-type="pmid">10859347</pub-id></citation></ref>
<ref id="b29-ijms-12-03085"><label>29.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Smith</surname><given-names>E</given-names></name><name><surname>Morowitz</surname><given-names>HJ</given-names></name></person-group><article-title>Universality in intermediary metabolism</article-title><source>Proc. Nat. Acad. Sci. USA</source><year>2004</year><volume>101</volume><fpage>13168</fpage><lpage>13173</lpage><pub-id pub-id-type="doi">10.1073/pnas.0404922101</pub-id><pub-id pub-id-type="pmid">15340153</pub-id></citation></ref>
<ref id="b30-ijms-12-03085"><label>30.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morowitz</surname><given-names>HJ</given-names></name><name><surname>Srinivasan</surname><given-names>V</given-names></name><name><surname>Smith</surname><given-names>E</given-names></name></person-group><article-title>Ligand field theory and the origin of life as an emergent feature of the periodic table of elements</article-title><source>Biol. Bull</source><year>2010</year><volume>219</volume><fpage>1</fpage><lpage>6</lpage><pub-id pub-id-type="pmid">20813983</pub-id></citation></ref>
<ref id="b31-ijms-12-03085"><label>31.</label><citation citation-type="other"><comment>It was shown in [2] that <italic>P<sub>n</sub></italic> = 0.50 can be taken as the “transition point” between RAFs not existing at all and RAFs occurring with high probability.</comment></citation></ref>
<ref id="b32-ijms-12-03085"><label>32.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scott</surname><given-names>JK</given-names></name><name><surname>Smith</surname><given-names>GP</given-names></name></person-group><article-title>Searching for peptide ligands with an epitope library</article-title><source>Science</source><year>1990</year><volume>249</volume><fpage>286</fpage><lpage>390</lpage></citation></ref>
<ref id="b33-ijms-12-03085"><label>33.</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yamauchi</surname><given-names>A</given-names></name><name><surname>Nakashima</surname><given-names>T</given-names></name><name><surname>Tokuriki</surname><given-names>N</given-names></name><name><surname>Hosokawa</surname><given-names>M</given-names></name><name><surname>Nogami</surname><given-names>H</given-names></name><name><surname>Arioka</surname><given-names>S</given-names></name><name><surname>Urabe</surname><given-names>I</given-names></name><name><surname>Yomo</surname><given-names>T</given-names></name></person-group><article-title>Evolvability of random polypeptides through functional selection within a small library</article-title><source>Protein Eng</source><year>2002</year><volume>15</volume><fpage>619</fpage><lpage>626</lpage><pub-id pub-id-type="doi">10.1093/protein/15.7.619</pub-id><pub-id pub-id-type="pmid">12200545</pub-id></citation></ref>
<ref id="b34-ijms-12-03085"><label>34.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bagley</surname><given-names>RJ</given-names></name></person-group><source>A Model of Functional Self Organization</source><comment>PhD Thesis</comment><publisher-name>University of California, San Diego</publisher-name><publisher-loc>CA, USA</publisher-loc><year>1991</year></citation></ref>
<ref id="b35-ijms-12-03085"><label>35.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bagley</surname><given-names>RJ</given-names></name><name><surname>Farmer</surname><given-names>JD</given-names></name></person-group><article-title>Spontaneous Emergence of a Metabolism</article-title><source>Artificial Life II</source><person-group person-group-type="editor"><name><surname>Langton</surname><given-names>CG</given-names></name><name><surname>Taylor</surname><given-names>C</given-names></name><name><surname>Farmer</surname><given-names>JD</given-names></name><name><surname>Rasmussen</surname><given-names>S</given-names></name></person-group><publisher-name>Addison-Wesley</publisher-name><publisher-loc>Upper Saddle River, NJ, USA</publisher-loc><year>1991</year><fpage>93</fpage><lpage>140</lpage></citation></ref>
<ref id="b36-ijms-12-03085"><label>36.</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bagley</surname><given-names>RJ</given-names></name><name><surname>Farmer</surname><given-names>JD</given-names></name><name><surname>Fontana</surname><given-names>W</given-names></name></person-group><article-title>Evolution of a metabolism</article-title><source>Artificial Life II</source><person-group person-group-type="editor"><name><surname>Langton</surname><given-names>CG</given-names></name><name><surname>Taylor</surname><given-names>C</given-names></name><name><surname>Farmer</surname><given-names>JD</given-names></name><name><surname>Rasmussen</surname><given-names>S</given-names></name></person-group><publisher-name>Addison-Wesley</publisher-name><publisher-name>Upper Saddle River, NJ, USA</publisher-name><year>1991</year><fpage>141</fpage><lpage>158</lpage></citation></ref>
<ref id="b37-ijms-12-03085"><label>37.</label><citation citation-type="web"><person-group person-group-type="author"><name><surname>Andersen</surname><given-names>IT</given-names></name><name><surname>Nan</surname><given-names>L</given-names></name><name><surname>Kjaersgaard</surname><given-names>MIS</given-names></name></person-group><article-title>Search for life in catalytic reaction systems</article-title><comment>Available online: <ext-link xlink:href="http://www.stats.ox.ac.uk/research/genome/projects/pastprojects" ext-link-type="uri">http://www.stats.ox.ac.uk/research/genome/projects/pastprojects</ext-link> (accessed on 5 May 2011).</comment></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Table</title>
<fig id="f1-ijms-12-03085" position="float">
<label>Figure 1.</label>
<caption>
<p>A simple example of a <italic>catalytic reaction system</italic> (CRS) with seven molecule types {<italic>a, b, c, d, e, f, g</italic>} (solid nodes) and four reactions {<italic>r</italic><sub>1</sub>, <italic>r</italic><sub>2</sub><italic>, r</italic><sub>3</sub><italic>, r</italic><sub>4</sub>} (open nodes). The food set is <italic>F</italic> = {<italic>a, b</italic>}. Solid arrows indicate reactants going into and products coming out of a reaction, dashed arrows indicate catalysis. The subset <italic>ℛ</italic> = {<italic>r</italic><sub>1</sub><italic>, r</italic><sub>2</sub>} (shown with bold arrows) is RAF.</p></caption>
<graphic xlink:href="ijms-12-03085f1.gif"/></fig>
<fig id="f2-ijms-12-03085" position="float">
<label>Figure 2.</label>
<caption>
<p>The linear relations for the three cases.</p></caption>
<graphic xlink:href="ijms-12-03085f2.gif"/></fig>
<fig id="f3-ijms-12-03085" position="float">
<label>Figure 3.</label>
<caption>
<p>The probability <italic>P<sub>n</sub></italic> of finding RAF sets for different (fixed) catalysis probabilities <italic>p</italic> and values of <italic>n</italic>.</p></caption>
<graphic xlink:href="ijms-12-03085f3.gif"/></fig>
<fig id="f4-ijms-12-03085" position="float">
<label>Figure 4.</label>
<caption>
<p>The level of catalysis <italic>f</italic> (<italic>n</italic>) required for the template-based catalysis case compared to the original (purely random) case, for different values of <italic>n</italic>.</p></caption>
<graphic xlink:href="ijms-12-03085f4.gif"/></fig>
<table-wrap id="t1-ijms-12-03085" position="float">
<label>Table 1.</label>
<caption>
<p>The empirical (cases <bold>A</bold> and <bold>B</bold>) and theoretical (case <bold>C</bold>) linear relations.</p></caption>
<table frame="hsides" rules="rows">
<tbody>
<tr>
<td align="center" valign="middle"><bold>A</bold></td>
<td align="center" valign="middle"><italic>f<sub>A</sub></italic>(<italic>n</italic>) =</td>
<td align="right" valign="middle">1.0970 + 0.0189<italic>n</italic></td></tr>
<tr>
<td align="center" valign="middle"><bold>B</bold></td>
<td align="center" valign="middle"><italic>f<sub>B</sub></italic> (<italic>n</italic>) =</td>
<td align="right" valign="middle">–0.4736 + 0.7012<italic>n</italic></td></tr>
<tr>
<td align="center" valign="middle"><bold>C</bold></td>
<td align="center" valign="middle"><italic>f<sub>C</sub></italic> (<italic>n</italic>) =</td>
<td align="right" valign="middle">1.6339<italic>n</italic></td></tr></tbody></table></table-wrap></sec></back></article>
