Abstract
This paper aims to explain the transition to multicellularity as a consequence of the evolutionary response to stress. The proposed model is composed of three parts. The first part details stochastic biochemical kinetics within a reactor (potentially compartmentalized), where kinetic rates are influenced by random stress parameters, such as temperature, toxins, oxidants, etc. The second part of the model is a feedback mechanism governed by a genetic regulation network (GRN). The third component involves stochastic dynamics that describe the evolution of this network. We assume that the organism remains viable as long as the concentrations of certain key reagents are maintained within a defined range (the homeostasis domain). For this model, we calculate the probability estimate that the system will stay within the homeostasis domain under stress impacts. Under certain assumptions, we show that a GRN expansion increases the viability probability in a very sharp manner. It is shown that multicellular organisms increase their viability due to compartment organization and stem cell activity. By the viability probability estimates, an explanation of the Peto paradox is proposed: why large organisms are stable with respect to cancer attacks.
MSC:
34D20; 92E20; 34H15
1. Introduction
In this paper, we focus on three enigmatic problems of biology: (I) why multicellular organisms emerge; (II) why complex multicellular organisms can be constructed with a relatively small number of genes, and (III) the Peto paradox on cancer and lifespan, i.e., why they are stable with respect to cancer attacks.
The origin of multicellular organisms presents a significant challenge for biological evolution theory. For instance, E. Koonin notes “The emergence and evolution of complexity at the levels of the genotype and the phenotype, and the relationship between the two, is a central (if not the central) problem in biology. Even leaving aside for now the problem of the actual origin of the very substantial complexity associated with the cellular level of organization, one cannot help wondering why the evolution of life did not stop at the stage of the simplest autotrophic prokaryotes, with 1000 to 1500 genes. Why instead did evolution continue, to produce complex prokaryotes possessing more than 10,000 genes and, far more strikingly, eukaryotes, with their huge, elaborately regulated genomes; multiple tissue types; and even ability to develop mathematical theories of evolution?” (see [1], Ch. 8, p. 250). Some approaches to the evolution of multicellularity were proposed in ([2,3]), describing this evolution as a result of an interaction between hosts and parasites. The emergence of multicellular organisms can be considered as a major evolutionary transition [4].
The second enigma is connected to several protein-coding genes. In fact, the human genome comprises approximately 22,000 protein-coding genes, a number that is comparable to the genomes of fruit flies and nematodes. Surprisingly, more complex multicellular organisms do not necessitate a higher number of genes, despite having a greater number of phenotypic traits and the need to adapt to numerous environmental constraints. These facts challenge the classical ideas of modern evolutionary synthesis. According to the celebrated Fisher geometric model, it can be demonstrated that the likelihood of improving fitness through random mutations diminishes as the organism’s complexity increases ([5]). Building upon Fisher’s approach, Orr estimated the adaptation rate as a function of the number of environmental constraints M, with F representing the average population fitness ([6], 2000, [7], 2005). Orr’s findings reveal that the adaptation rate becomes exponentially small when , a phenomenon known as the complexity cost or complexity barrier. Therefore, for large values of M, the complexity barrier becomes exponentially large.
Richard Peto was the first to notice that the cancer incidence does not correlate with the number of organism cells [8]. The cancer incidence in humans is much higher than the incidence of cancer in whales, despite whales having more cells than humans. For simplicity, suppose that the probabilities of cancer driver mutations are constant for all cells. Then whales should die from cancer more often than people and at an early age. In reality, they live for hundreds of years. No statistically significant relationship has been found between body size and cancer incidence, supporting Peto’s observation. This problem III is connected to I and II. In fact, the existence of multicellular organisms requires the suppression of cancer [9]. There exists a connection between the origins of multicellularity and cancer [10,11]. In order to build larger bodies, organisms must suppress cancer. According to ([12]), large organisms have more anticancer genes.
Let us outline the proposed model, which aims to address problems I–III. The first part describes stochastic biochemical kinetics within a reactor (potentially compartmentalized), where kinetic rates are influenced by random stress parameters such as temperature, toxins, oxidants, etc. The second part of the model is a feedback mechanism governed by a genetic regulation network (GRN). The third component is stochastic dynamics, which describe the evolution of this network. We assume that an organism remains viable as long as the concentrations of certain key reagents are maintained within a defined range (the homeostasis domain).
To describe GRNs that control responses to stress, radial basis function networks (RBFNs) are used. The inputs to these networks are the stress parameter values and the states of the system. By external stress, we mean any deviation in environmental parameters that could potentially harm the organism (whether multicellular or a single cell). The GRN produces an output dependent on its architecture and, ultimately, on the genetic code, s, which determines the architecture and interconnections within the GRN. Following [13], we assume that the GRN output and stress impact neutralize each other. Connections between units in these networks depend on coding sequences , of Boolean genes. Such a stress response model is not new; however, herein, it uses a new key element connected to stem cell activity in the stress response. Multicellular organisms consist of many types of differentiated (somatic) cells, which form tissues, and stem cells. Stem cells can produce differentiated cells. In this paper, we provide mathematical justification for the well-known idea that cell differentiation emergence in evolution is a response to stress, particularly oxidative stress [14]. In different eukaryotic microorganisms, the induction of antioxidant mechanisms is associated with development. Note that stem cells are particularly susceptible to stresses ([15]) and they are regulated by stress. Experimental data show ([15]) that stem cell division provides tissue turnover and response to damage. Different stress factors induce stem cell division and differentiation. Many pathways that regulate stem cell functioning are also stress–response pathways [15]. So, stress is involved in the long-term maintenance of cell populations [15,16]. In this paper, to describe models for gene regulation in stem cells, we take into account experimental observations and ideas [14,15,16,17]. We propose that the biochemical response of multicellular organisms to stress is facilitated by different gene networks activated in various tissues. We imagine the entire organism as divided into compartments, with stem cell activity aiding in regulating each compartment’s response. Contemporary morphogenesis models explain the emergence of such compartments [18,19]. The model proposed can be considered to be a far-reaching extension of L. Valiant’s model of the ideal answer [20]. In Valiant’s model, organisms are viewed as Boolean circuits modeling responses to environmental stress through a GRN. Our proposed model builds upon this approach while also incorporating real biochemistry, the existence of compartments, and stem cell dynamics. We assume that an organism remains viable as long as the concentrations of certain key reagents are maintained within a specific domain (the homeostasis domain).
The results can be sketched as follows. For such a model, we determine the estimate of the probability that the system does not leave the homeostasis domain under the impact of stress. These estimates are given by Theorem 1 for models without compartments and stem cells and Theorem 2 for models with compartments and stem cells. Furthermore, it is shown that the RBF network growth makes biosystems stable for long time periods. We show that a state with a GRN of the maximal possible size is the most probable, with overwhelming probability. This indicates that evolution favors transitions to increasingly larger GRNs. Multicellular organisms exhibit higher viability compared to a simple colony of cells due to compartment organization and stem cell activity. By these results, we propose an explanation for Peto’s paradox. Moreover, we estimate the number of genes needed to encode GRNs supporting anti-stress responses.
1.1. Innovations
This article continues the series of works (see, for example, [21], 2006, [13], 2021), where the replicative stability concept is investigated. This concept is proposed by M. Gromov and A. Carbone [22]. It was shown that some properties of evolution could be explained by the need for biosystems to maintain homeostasis under the influence of stress and fluctuations (both internal and external), and this homeostasis support must be ensured through subsequent replications. Indeed, many aspects of evolution can be interpreted this way, for example, the tendency to increase the number of genes (see a review in ([23], 2014)). The key question, however, is to explain the complexity of genetic networks. An attempt to resolve this problem was made in ([13], 2021), where a model of the biochemical system’s response to stress was introduced. In this study, the entropy of a stressful environment is defined, and it is demonstrated that an effective response to stress is achievable if the binary logarithm of the size of the regulatory genetic network is at least as large as this entropy. Therefore, according to ([13], 2021), the main reason for the growth of regulatory networks involves the complexity and diversity of the stress environment.
In this paper, the main innovation, compared to previous works, is the use of a compartment model to describe a response to stresses. This model accounts for the modular structure of an organism and stem cell activity. The organism’s response to stress is mediated through cell populations in these compartments, with their dynamics governed by the standard Lotka–Volterra model. Such an approach allows us to address one of the key biological problems, the Peto paradox, because it will be shown that the organism size directly affects stress response mechanisms. The second innovation, with respect to ([13], 2021), is that we no longer need the complexity of the environment to explain the complexity of an organism’s genetic network. If the organism’s environment fluctuates, it is enough to initiate an increase in the complexity of the genetic network. The third intriguing innovative element is that when an organism is divided into compartments, there is no longer a need to fine-tune genetic regulatory networks (as in connectionist models [19]). Instead, it is enough to adjust an appropriate interaction between the compartments.
1.2. Organization of the Paper
The paper is organized as follows. Section 2 describes the different models. Furthermore, we outline the methods. In Section 4, we state the concept of viability under stochastic perturbations. The main results on viability (Theorems 1 and 2) can be found in Section 5. Furthermore, the proof of these results follows based on the standard estimates of the Ventsel–Freidlin theory ([24], 1984). By these viability estimates, in Section 7, we show that evolution has sufficient time to construct complex structures. In Section 8, we outline some contemporary morphogenesis models and explain why multicellular organisms can be encoded by a few genes. An explanation of Peto’s paradox is stated in Section 9. Discussions and conclusions can be found in the last section. Technical estimates are relegated in Appendix A.
2. Model
In this section, we describe the model. It unfolds in a few steps.
2.1. Chemical Kinetics with Random Stress Parameters
In this subsection, following ([13]), we consider systems of differential equations where the right-hand sides depend on a random process, . These systems read
where , where D is a compact domain in with a smooth boundary , and represent reagent concentrations. We assume that are sufficiently smooth functions of v; for example, multivariate polynomials in v and smooth functions of . Functions are trajectories of random processes, which are piecewise constant in t and take values in a compact subset of , where the quantity is a dimension of the stress parameter . These Markov processes describe a random stress produced by an environment. For system (1), we set the following initial conditions:
Reaction terms, f, can be defined by the well-known models of chemical kinetics and population dynamics. The simplest choice from the law of mass action is
where is a multi-index with integers , are finite subsets of , and denote coefficients that determine kinetic rates. One also uses rational nonlinearities, for example,
where and are polynomials. We suppose that are separated from zero:
for all and . Systems (1) with defined by (3) or (4) often occur in biochemistry and population dynamics.
To ensure the existence and uniqueness of solutions to the Cauchy problem (1), (2) for all , we make the following assumption. Let
where is a unit normal vector directed outside D at the point . Then solutions to the Cauchy problem (1), (2) exist and are unique for all .
Polynomial and rational functions may not fulfill conditions (6). Since our aim is to investigate the local stability of local equilibria, we consider narrow neighborhoods of those equilibria. Let be the equilibrium under consideration. For , we introduce a -neighborhood of that equilibrium by
where we assume that , where is a viability (homeostasis) domain. We truncate nonlinearities in the setting , where is defined as follows: is a Heaviside-like function, which is equal to 0 outside of an open neighborhood of the equilibrium, and is equal to 1 inside this neighborhood. Here, we assume that the neighborhood radius is so small that this neighborhood lies inside the homeostasis domain.
Moreover, let us note that an explicit form of is not essential in the subsequent statement; however, certain assumptions on the existence of the equilibria of system (1) and their stability should be fulfilled.
2.2. Extended Model with Anti-Stress Response Encoded by Genes
Different complicated models of gene regulation were proposed, for example, refs. [19,25] for Drosophila morphogenesis. Suppose reaction terms involve feedback, u:
Here, , where
and is a positive integer, is a constant. Here, the input is the averaged value of on an interval:
where is a parameter. We are seeking regulators defined by a radial basis function network (RBFN):
where the function class is defined in Appendix A.2.
We suppose that all parameters in are encoded by a Boolean genetic code s, where is a gene expression vector (see Appendix A.3). We have N Boolean genes ; therefore, s are elements of the Boolean hypercube . The i-th gene may be switched on () or switched off (). The strings, s, can depend on time: at certain time moments, they change as a result of mutations, gene drift, and other evolution forces.
2.3. Assumptions
Let us formulate natural assumptions to stress , chemical kinetics, and the feedback.
Assumption 1.
Assumption to ξ.
We suppose that ξ is the sum of a trend and a small multiplicative noise:
where κ is a small positive parameter, are functions, and , where are independent standard Wiener processes (therefore, are white noises).
To describe the trend , we use the model of subsequent environmental shocks. We assume that are piecewise constant functions. Let us denote as an infinitely increasing sequence of time moments . Let . We suppose and , where and are the average times of GRN reaction to stress and biochemical dynamics, respectively. At the moments, we have environment shocks, where an interaction between the environment and the biosystem changes. The assumptions on , and are natural. It means that changes in the organism’s environment are not very frequent and, therefore, organisms have time to react to them. Otherwise, it is difficult to expect that evolution could be successful.
Let us assume that, for , the parameter , where at each step j is chosen randomly. More precisely, at each interval , the value is a constant random vector, which is distributed according to a continuous density probability function (pdf) with a support, which has a support in a compact, non-empty domain .
Let us denote by the open half-planes of the complex plane .
Assumption 2.
Equilibrium existence and stability.
We assume that for each and each , the system
has a hyperbolic rest point , smoothly depending on η and u. This means the following: One has
Consider the corresponding linearized operator :
The operator is uniformly hyperbolic and stable: there is a , such that
This means that the biochemical equilibrium is uniformly stable in .
To formulate the next assumption, let us introduce important auxiliary functions. Let us define by
where and , where is a positive integer, and are smooth functions.
Assumption 3.
Existence of an ideal feedback
We suppose for each and , there exists a solution of the system
which is a smooth function defined on the domain .
This assumption means that there is an ideal response to the stress impact . The GRNs should approximate this response to support the organism’s viability, as described in the last assumption.
Assumption 4.
We suppose that for sufficiently large sizes, N, the ideal feedback (which exists according to Assumption 3) can be approximated with a polynomial accuracy:
by a network , where constants are uniform in N.
This assumption can be justified by estimates stated in Appendix A.2. Moreover, Assumptions 3 and 4 show that our model can be considered an extension of L. Valiant’s circuit model of the ideal answer ([20], 2006, [26], 2009), where we take into account the biochemical kinetics, compartment existence, and the cell dynamics (which will be described below).
2.4. Stochastic Equations for Perturbations Induced by Stress Fluctuations
Taking into account our assumptions on and the existence of the equilibrium, we represent v as
where is a new unknown function, representing a small correction to the equilibrium. By Assumption 2, one has
where
Using the Taylor expansion of f and the definition (14) of the linear operator , we obtain
where
One has the following formal asymptotics for :
Using (11), one finds that
We remove the terms in and substitute the obtained shorted into (19), then one has
where is defined by (16).
Thus, for each time interval, where does not change, we have the system of the following stochastic Ito equations
where are independent standard Wiener processes. We suppose that in these equations, u is defined by (8). Note that our derivation of these equations is only formal; for example, in (21), we remove the terms proportional to , which involve unbounded quantities (the Wiener processes are not smooth). This difficulty can be circumvented ([23], 2014). Nonetheless, we will use Equation (23) because it has a classical form of the Ito equations, which are well-studied and fundamental in applications; see [27].
2.5. Model with Compartments
In biology and medicine, compartment models are popular for biochemical applications, for example, to describe pharmacokinetics (see, for example, [28]). We consider a model consisting of compartments. Each compartment consists of differentiated cells of the k-th type, where , and is the number of cell types. Reaction terms in k-th compartments are denoted by . Furthermore, each cell produces its own response to a stress disturbance , which depends on the cell type. We denote by the corresponding feedback (where we omit a dependence on the gene code s to simplify the notation). Let be a relative abundance of cells of k-type, which produces an answer to stress. One can assume that the dynamics of these quantities are governed by the multispecies Lotka–Volterra dynamics with parameters depending on the stress parameter :
where are growth-mortality coefficients, which take into account the apoptosis and production of differentiated cells by stem cells, and denotes interaction coefficients. We have a dependence on because the behavior of stem cells depends on the stress factors (see Introduction and [14,15,16,17]). However, real organisms consist of cells; therefore, these variables, , which are relative concentrations, take discrete values. For each positive integer, M, let us define the sets:
In a discrete model, we suppose that , where denotes the set and is the total number of cells in the organism. To describe the dynamics, where cell abundances take discrete values, we can replace (24) with the stochastic Markov dynamics, which simulate system (24) by the Gillespie algorithm [29]. We suppose that the total anti-stress response of the whole organism is proportional to and :
Within the framework of this weakly nonlinear approximation, all information about the interaction (coupling–decoupling) of reactants is contained in matrix . We can define the coupling graph , where the set V of vertices is the set of reactants, and an edge lies on the edge set E if the i-th reactant interacts with the j-th one. One can consider various cases of the interactions of reactants that are relevant from a biological point of view. For example, we can consider the case of a Michaelis–Menten reaction chain. In this case, the analysis of the spectrum of the operator specified by the matrix is relatively simple. A more complex option involves several parallel reactions or parallel chains of Michaelis–Menten reactions. For systems with a large number of reactants, the case of random pairing (of a random E) is also interesting. A more detailed analysis of the coupling problem has been postponed for future work.
In the coming subsection, we describe how the population dynamics for can produce an effective stress response.
2.6. Associated Optimization Problems
Let be a smooth function defined on . When we try to approximate an ideal answer by GRNs acting independently in each compartment, we obtain the risk function , defined by
where is a vector with real-valued components and denotes the standard Euclidean norm in . The approximation problem reduces to
Relaxed optimization problem RP To find X giving the minimum of under conditions
and
Integer optimization problem IP To find X giving the minimum of under conditions
This last problem involves finding the anti-stress response via the correct activation of discrete cell sets.
First, consider the problem RP. If this problem has a solution, then that solution can be found by the population dynamics defined by system (24) under an appropriate choice of coefficients and . To show it, let us define a square matrix M of size and the vector b:
where is the Euclidean scalar product in . Let us consider the Cauchy problem defined by the system of equations
with and positive initial conditions , where can be chosen at random. It is clear that this system can be rewritten in the form (24). System (32) enjoys the remarkable properties. First, the quantities remain positive for all t; therefore, condition (28) holds. Furthermore, for fixed , the function is a Lyapunov function for system (32), decreasing along trajectories . In fact, Equation (32) implies
We observe that is a convex function of X defined on the convex domain, . Therefore, the optimization problem RP has a unique solution. This solution can be found by system (32). Indeed, the Lyapunov function is convex. Therefore, it has a single local minimum . According to classical results on gradient-like dynamical systems, all the trajectories converge to [30]. We note that the vector gives the solution to the optimization problem RP. Finally, we conclude that population dynamics give the solution to that optimization problem RP. We suppose that the domain, , is convex, as we would like to deal with a convex optimization problem, where the global minimum is unique. To achieve this uniqueness property, not only must the objective function be convex, but the feasible set must be convex as well. In our biological context, the most natural choice for the domain is the simplex , , where is a constant.
A good approximation for solutions of the problem IP can be found for large , as follows: First we solve QP and obtain the solution (as described above). Then we make the standard relaxation procedure, replacing each with the corresponding closest element of the set . We denote an X obtained by . We note that
where C is a constant. If for certain feedback, , we have
Let us find sufficient conditions under which this estimate is satisfied. Note that the set
is the simplex with vertices . For each let us consider the set of all possible ideal answers
Proposition 1.
Let . Assume that for each , one has the inclusion
where . Then for each , there is a , such that and, thus, (34) holds.
Proof.
It follows at once from the definitions. □
This claim indicates that adapting to stresses does not necessitate fine-tuning genetic networks within the differentiated cells of multicellular organisms. To create an effective response to stress, it is enough to correctly encode interactions between cells in different compartments. Moreover, the Carathéodory Theorem leads to the following conclusion: to satisfy (34), it is sufficient to have more cell types than the regulator dimension: .
2.7. Stochastic Equation for Feedback Evolution
In this subsection, we describe a model for the evolution of the GRN evolution. Let be feedback maps, where all parameters are encoded by binary genes (see Appendix A.3). At the initial moment, we have a random binary string . Consider the time interval , where is big. We choose a sequence of random moments such that Ȧt these moments, mutations occur in s. To describe this process, we can use an analog of the master equation basic in physics. Let represent the probability that the system is in state s at time t. The probability of transitioning from s to is denoted by . We also introduce a special state corresponding to the destruction of the system, 0, and the probability of transitioning from s for 0 will be denoted by . We suppose that the likelihood of survival is completely determined by s (the theory of gene-trait maps, pioneered by R. Fisher ([5], 1930); see also [7,31,32]). Then we have
To make this general model mathematically tractable, we simplify these equations as follows. Instead of the evolution of the genetic code, we consider an evolution in a space of GRN sizes, neglecting details of gene coding. The evolutionary step is conceptualized as an increase in the GRN size as a result of mutations (possibly, a few mutations are necessary).
We simplify the master equation as follows. Instead of the states s, we consider states defined by positive integers that define the network sizes, , where and . We consider the transitions ahead only (the networks can extend and cannot shrink). For brevity, the corresponding transition probabilities are denoted by . We obtain the following equations:
where we suppose that , , , and is the probability of having a network of size m. These conditions mean that, in particular, the maximal network size is . This equation can be written down in a short matrix form, as , where , and is the transition matrix for (37).
3. Materials and Methods
The classical methods of mathematical analysis and theory of differential equations are applied together with the results on approximations of smooth functions by RBF networks [33]. Our approach is based on the following:
(i) The idea of M. Gromov–A. Carbone on replicative stability in systems supporting homeostasis [22];
(ii) The theory of large deviations and stochastic transitions in random dynamical systems (A. Ventsel, M. Freidlin, Yu. Kifer et al.) ([24], 1984);
(iii) Models of morphogenesis proposed by A. Turing ([34]) and others ([19,35,36,37]);
(iv) Universal approximations by networks [33,38,39].
4. Stochastic Stability
Let us formulate the stochastic stability problem.
Following [13], we introduce probability, where for a given feedback , the solution of the Cauchy problem defined by (1) with initial data (2) lies in within the time interval :
We suppose that the initial value is the equilibrium: . The probability can be considered a measure of stochastic stability. A more relevant measure of stochastic robustness is given by the minimum over all network regulators, u:
where the infimum is taken over all feedback functions defined by RBFN of size N.
In the multi-compartment case, we take infimum over all and :
where
5. Main Results
Let us formulate the main results. First, we choose in a special way. If it is necessary, we diminish to satisfy the following condition:
Theorem 1.
Let (41) hold. Consider the stochastic Equation (23) under Assumptions 1–4. Let the feedback satisfy estimate (18).
Then one has the following estimate of the stochastic stability via the network size N, as :
where
where is a positive constant, depending on f, and uniform in .
The second theorem concerns the multi-compartment case.
Theorem 2.
Consider the multi-compartment system (26) under Assumptions 1–4. Let conditions of Proposition 1 hold. Then one has the following estimate of the stochastic stability via the numbers as :
where
where is a positive constant that is uniform in and .
6. Proof of Theorems 1 and 2
Proof.
Let us estimate the probability that the trajectory exits within the time interval under the condition that . We use the following estimate (see Appendix A.4) for the probability :
where
Let us estimate , considering the size of the network N. According to Assumption 3, there exists a , such that
Due to (18)
thus
where are constants. This last estimate and (46) imply the assertion of Theorem. □
To prove Theorem 2, we repeat the same arguments in the proof of Theorem 1 and use definition of .
7. Transitions to Complex GRN
Using estimates of viability from Theorems 1 and 2 and the simplified Master Equation (37), we can estimate the probability of reaching the maximally complex regulation network state, .
Let us denote by N the size of the gene network of the regulation. In real genetic networks, the degrees of the most nodes are bounded. Therefore, we need a bounded number of mutations, when the network increases its size from N to as a result of the gene code modification . It is natural, therefore, to assume that the transition probabilities admit the estimate:
where is a probability of a single mutation and a constant is uniform in the network size m. It is shown by Theorem 1 that
where is uniform in m. Under these assumptions, the following claim holds:
Proposition 2.
Proof.
By estimates (47) and (48), system (37) can be investigated by the standard algebraic method. Let denote the eigenvectors of the linear operator and denote the same ones for the conjugate operator . These functions form a biorthogonal system:
where stands for the Kronecker symbol. The corresponding eigenvalues are . Suppose that all are different. Under conditions (47) and (48), the minimum of absolute values is , where
The solution of (37) with the initial data has the form
8. Morphogenesis Models and the Number of Coding Genes
In this section, we first present a short review of different mathematical models of morphogenesis. Different approaches were developed: reaction–diffusion models [34,36,37], mechanochemical models, and connectionist models [18,25,40].
8.1. Brief Overview of Models
Reaction–diffusion models have received great attention and become popular due to seminal work [34]. These models successfully describe pattern formation in biology, physics, chemistry [36,41,42,43,44,45], wave propagation, chaos ([46]), and other important phenomena. Turing’s instability mechanism proposed in [34] can explain the emergence of layered periodical patterns, such as zebra strips, etc. [41]. However, the famous biologist S. Brenner noted in [47] that biological support for Turing’s idea has been marginal. Patterns of Drosophila development do not fit the Turing theory. Moreover, this Turing approach involves no information on gene expression. Nonetheless, the reaction–diffusion models describe many observed patterns and effects, for example, somitogenesis [35,43,48,49,50,51]. The model from ([43]) describes waves, which generate periodical layered patterns. Moreover, it is shown that reaction–diffusion models are capable of generating any cell pattern, i.e., they potentially have a formidable pattern capacity. In fact, if we combine Turing’s ideas with the celebrated Wolpert concept of positional information (see [52]), then the reaction–diffusion models (even with two components ) are capable of generating all possible cell patterns (see [53]), not only periodical. The mechanism working in the model ([53]) is based on the presence of long-range interactions.
However, it is well-known that real morphogenesis is not only based on biochemical interactions but also on mechanical effects. It is an extremely complicated process; see [54] for a recent review. Biological media can be characterized as viscous–elastic, and can be fluid or solid states; there are possible transitions between solid and fluid states. Tissues form media that can change their mechanical properties during the morphogenesis process [54]. Such complicated models can be studied mainly via numerical simulations. However, in [55], a simple mathematically tractable model of an excitable mechanochemical medium is described, where waves resembling moving Turing machines arise and facilitate cell differentiation.
Many reaction–diffusion and mechanochemical models do not use explicitly genetic information, only general ideas about morphogens and cell differentiation. One can suppose that their parameters are encoded by a genetic code. Connectionist models, similar to neural networks, are capable of describing real patterns of gene expression in the Drosophila segmentation process ([18,19,25,40,56]) and their robustness [57]. In neural networks, connections define neuron interactions; similarly, in connectionist morphogenesis models, pair interactions between transcription factors (TFs) are determined by a matrix. Similar to classical neural nets, these models do not take into account triple interactions. In fact, one can show that a practically arbitrary reaction–diffusion model can be simulated by a connectionist system with a sufficiently large interaction matrix and pair interactions.
8.2. Number of Genes
Although we are still far from a comprehensive description of morphogenesis that incorporates genes, biochemistry, and mechanical effects, we can nevertheless draw interesting conclusions, which will be used in the next section to enhance our understanding of problem II. First, how many genes are needed for morphogenesis and stress responses, as described in Section 2? For example, models proposed in [37,55] can be encoded by a few genes , the number weakly depends on the organism size and its complexity. The models from [37,55] look like toy ones; nonetheless, one can suppose that this conclusion about the number of coding genes is applicable to more realistic systems. In fact, it is well-known that mammals and insects have many similar genes involved in morphogenesis.
To encode compartment structures and their cell dynamics, as considered in Section 2.5, only a small number of genes is needed. In fact, to define dynamics (24), it is necessary to encode coefficients and . If an organism consists of cell types, the corresponding number of genes has the order .
9. Multicellularity and Peto’s Paradox
9.1. How Multicellularity Supports Homeostasis
To understand why multicellularity is advantageous for adaptation and how it could have arisen in evolution, we use the results of the previous sections. Let us consider an organism consisting of cells, and let cells of that organism replicate times. The number is an integer, which corresponds to the number of divisions of cells during the organism’s life. Since somatic cells can die and have the Hayflick limit, one can take as a theoretical Hayflick limit. Let be the probability of homeostasis violations for an average somatic cell, which depends on two main parameters, M and (the size of the gene regulation networks). Then the probability that all cells survive together within replications is
The asymptotics of are given by Theorem 2. Since is small, we obtain
where is a constant. This computation is elementary but it leads to the following interesting conclusions. An increase in the gene regulation network produces an exponential gain in homeostasis stability, and the same effect gives an increase in the organism’s mass. One can suppose that mass growth is limited by ecological resources only. We also conclude that it is useful to diminish . Multicellular organisms could have emerged because the time required to build a network of size M depends polynomially on that size, and although this time is large, it is much less compared to the exponential gain in adaptability.
9.2. Peto’s Paradox: When a Large Mass Can Help
Let us consider now the Peto paradox. To this end, we again consider an organism consisting of cells making replications. Let be the probability of a mutation per generation, which can produce cancer (driver mutation). It is assumed that in order to modify a healthy cell into a cancerous one, driver mutations are necessary (see [58], where different methods of finding are considered). One thinks that, typically, cancer appears as a result of 1 to 6 driver mutations [58]. Note that most mutations are not driver ones and are not so dangerous.
The probability that within divisions a cell will acquire mutations and that cell will be modified in a cancer cell, is
where the Bernoulli law is replaced with the Poisson distribution because the probability of a driver mutation is very small. We suppose that each driver mutation can appear as a result of a replication error. This probability, as well as the number , depends on the cancer type, the organism, and the stress level. A cancer emerges in a tissue. If an organism consists of cells, one can suppose that the number of cells where the cancer can appear is proportional to : where a coefficient depends on the tissue. The logarithm of the probability that this cancer does not emerge can be roughly approximated by
(the logarithm is replaced by its asymptotic because is small). It is a very simplified estimate, which does not take into account apoptosis, immune system reactions, etc. The next step is an approximation of . It is natural to suppose that, in an ideal environment, has a minimal value . Fluctuations of external fields, chemical reagents, toxins, and other stress factors can increase . Some experimental data on the dependence of mutation rates on toxin concentrations were obtained in [59], where it was found that the response of the mutation rate in bacteria depends linearly on the toxin concentration. According to Theorem 2, the magnitudes of random fluctuations of reagent concentrations have the order , and the number of cells, in turn, is proportional to the body mass M. Thus, one can expect that decreases with the body mass. We consider a general dependence
where and are coefficients. So, we expect that cells of organisms of larger sizes have lower mutation rates. This conclusion, qualitatively, is consistent with experimental data, see [60]. According to [60], for somatic cells in humans, we have about mutations per base per mitosis, and for a mouse, one has ; that is, for humans, the mutation frequency is 20 times less. If a typical human weight is ≈70,000 g and a typical mouse is 20 g, then we can assume that lies within intervals .
Relations (53)–(55) show that the probability of acquiring a cancer of a fixed type can increase or decrease in mass; it depends on the parameters , where is the number of driver mutations, and are coefficients in (55). The numerical computations conducted by these relations are shown in Figure 1. This analysis and the simulations do not take into account the ecological factors connected to the resources and prey–predator interaction. On the one hand, a large mass increases the amount of food consumed, on the other hand, can improve the defense against predators.
Figure 1.
This image shows the probability that cancer does not emerge in an organism as a function of for the different numbers of driver mutations. The probability is computed by relations (53)–(55). The number of cells runs the interval , where (in a typical mouse, the number of cells is ). For each , parameter takes values , respectively. The number of cell divisions is . The parameters are , , and . We see that the effect dramatically depends on the parameter . The mass increase is profitable if is not too small.
The dependence of the survival probability on the stress parameter is complex. The parameter determines the type of stress (such as cooling or heat shock, exposure to toxins, radiation, etc.) and the stress level (for example, concentrations of toxins). The probability of survival depends on through the parameters (see relation (55)). It is difficult to determine this dependence within the framework of the simple model considered in this section. In the calculations, the parameters were chosen so that the survival probability took on more or less realistic values between 0 and 1. To investigate the dependence , we need more advanced spatially extended models, taking into account immune reactions and other phenomena.
10. Concluding Remarks
The three interconnected biological problems are considered: the origins of multicellular organisms, the Peto paradox on cancer and lifespan, and why complex multicellular organisms use a relatively small number of coding genes. The model, proposed in the paper, describes the response of a complex biochemical system to stress and can be viewed as a generalization of the Valiant model of the ideal answer [20]. The Valiant model considers organisms as circuits generating responses to environmental challenges. In the suggested model, some contemporary ideas on cell differentiation, stress response, and morphogenesis are applied, taking into account, to some extent, the real structures of multicellular organisms: biochemical kinetics, decomposition into compartments (tissues), stem cell activity, and gene regulation. It is found that the number of Boolean genes needed to encode an organism does not correlate with the organism’s size. The most intriguing results are obtained in Section 2.5 and Section 2.6. They show that evolutionary success in adaptation does not require fine-tuning genetic networks in differentiated cells of multicellular organisms. To create an effective response to stress, it is enough to correctly encode the interactions between cells. Evolution does not need to have too many genes for this coding. An analysis of the stress response mechanism leads to the conclusion that species with larger organism masses have larger viability: when the mass increases as a linear function, the viability grows as an exponential function. It is important to note that within the same species, we have an inverse dependence: small dogs live longer than large dogs, an effect that is not yet explained.
Complex traits can evolve adaptively or non-adaptively; the discussion regarding the nature of evolution has been ongoing for many years, starting with Kimura’s seminal work [61]; see ([1], 2011) for an overview. Large multicellular organisms form populations of relatively small effective sizes. This suggests that genetic drift in such populations is stronger than in populations formed by prokaryotes. This fact supports the idea that the evolution of these organisms might have been non-adaptive (constructive neutral evolution). The estimates from this paper do not allow us to draw a definite conclusion about the nature of evolution; however, they show that the emergence of multicellular organisms was not completely improbable.
In fact, they show that even a slight increase in network size can exponentially enhance the probability of maintaining homeostasis, provided the initial GRN was sufficiently large. The likelihood of a series of mutations leading to a more efficient network is small, but it is not exponentially small. Even if almost all mutations were non-adaptive, the process of successive replications leading to an organism with complex genetic regulation is possible.
In conclusion, I would like to express my gratitude to the referees for their useful comments and remarks.
Funding
The author received support from the Ministry of Science and Higher Education of the Russian Federation under agreement no. 075-15-2022-291, dated 15 April 2022. This agreement provided a grant in the form of subsidies from the federal budget for state support in establishing and developing the world-class scientific center, Pavlov Center “Integrative physiology for medicine, high-tech healthcare, and stress-resilience technologies”.
Data Availability Statement
No new data were created or analyzed in this study. Data sharing is not applicable to this article.
Conflicts of Interest
The author declares no conflict of interest.
Appendix A
Appendix A.1. Estimates for Approximations by RBF
Let us review some results for approximations by radial basis function networks (RBFNs) [33]. Let us introduce a useful notation.
Let be a unit ball in centered at 0, and let denote the Euclidean norm of the vector . Let . Let us denote by the Banach space of all measurable functions on the compact domain with the norm
For , we use the supremum norm. The Sobolev class is defined by
where is a multi-index with non-negative integer components, , and
We denote by the class of Hölder functions with exponent . For two function classes, U and W, the distance between those classes is defined by
The RBFN class consists of functions of the form
where are coefficients, are localization centers for radial basis function , which is a well-localized smooth function (for example, a Gaussian peak), and N is the network size.
Appendix A.2. Estimates for RBFN
The following estimate can be obtained (see, for example, [33]):
where a positive constant depends on p and d only.
Appendix A.3. Encoding GRN
Consider RBF networks (see (A1)). We are going to encode the parameters of this network by a binary string s. According to ([13]), for smooth , this binary coding problem can be resolved as follows. Each we approximate by M bits within a precision :
where
Similarly, we proceed for and . We obtain , where all parameters are replaced by their binary approximations. Therefore,
for bounded inputs v and some constants.
Remark. According to [13], for a network with size N, we should use binary genes to construct the network .
Appendix A.4. Estimate of Ventsel–Freidlin Distance
In this subsection, we derive the estimate (46) from the proof of Theorem 1
This estimate can be obtained in different ways, and really, it is almost obvious. In fact, if we remove the terms in (23), this equation becomes a linear stochastic equation, which defines a vector-valued Ornstein–Uhlenbeck process. This process is Gaussian with zero mean; hence, it is fully characterized by its covariance matrix. This matrix can be derived in a standard manner, linked to a fundamental physical principle, the fluctuation–dissipation theorem (see, for example, [62]). The second variant involves investigating the Fokker–Planck equation corresponding to (23). For small , we can obtain an asymptotic expansion of the solutions of that Fokker–Planck equation at equilibrium. This solution characterizes a probability density function, resembling a slightly perturbed multivariate Gaussian distribution, predominantly localized at the equilibrium.
We use the theory of small random perturbations of dynamical systems developed in ([24], 1984). Let us define the matrix with entries by
This matrix is symmetric and positively definite, and exists. We note that
where w is a vector and denotes the standard scalar product and the norm in Euclidean space . Indeed, let . Then
By this notation, (A4) can be rewritten as
We note that
By the Cauchy–Schwartz inequality,
which is equivalent to (A5).
Let us introduce the Ventsel–Freidlin distance between two points, w and v, by
where the infimum is taken over all differentiable trajectories , which connect and v:
According to classical results ([24], 1984), this distance allows one to find an asymptotic estimate of large deviations from the equilibrium for the system (23). We set and let . Then
To estimate , we define a trajectory defined by
Estimate (A4) gives
which is equal to
Using (20) and taking into account for all , one has
which gives
where
and
The relation for can be transformed into
where is an operator conjugate to . One has
The operator is self-adjoint and has negative real-valued eigenvalues; thus, we can use the spectral decomposition. Let be orthogonal eigenfunctions of , and let be the corresponding eigenvalues. Let us denote the Fourier coefficients by . Then
Similarly,
where .
We are going to find the minimum of under the condition . This condition can be rewritten as
This minimization problem can be resolved. The Euler–Lagrange equations have the following form:
They are independent for each k. Solutions of these equations have the following form:
where are unknown real numbers and
for small . We substitute these relations into (A13) and . As a result, we obtain the following minimization problem with respect to unknown coefficients, , to find the minimum of
under condition
The minimization problem can be resolved, and we obtain
This implies (A3).
References
- Koonin, E.V. The Logic of Chance: The Nature and Origin of Biological Evolution; FT Press: NewYork, NY, USA, 2011. [Google Scholar]
- Iranzo, J.; Lobkovsky, A.E.; Wolf, Y.I.; Koonin, E.V. Virus-host arms race at the joint origin of multicellularity and programmed cell death. Cell Cycle 2014, 13, 3083–3088. [Google Scholar] [CrossRef]
- Koonin, E.V. Viruses and mobile elements as drivers of evolutionary transitions. Phil. Trans. R. Soc. B 2016, 371, 20150442. [Google Scholar] [CrossRef] [PubMed]
- Maynard Smith, J.; Schatzmary, E. The Major Transitions in Evolution; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
- Fisher, R.A. The Genetical Theory of Natural Selection; Clarendon Press: Oxford, UK, 1930. [Google Scholar]
- Orr, H.A. Adaptation and cost of complexity. Evolution 2000, 54, 13–20. [Google Scholar] [CrossRef] [PubMed]
- Orr, H.A. The genetic theory of adaptation: A brief history. Nat. Rev. Genet. 2005, 6, 119–127. [Google Scholar] [CrossRef] [PubMed]
- Peto, R.; Roe, F.J.C.; Lee, P.N.; Levy, L.; Clack, J. Cancer and ageing in mice and men. Br. J. Cancer 1975, 32, 411–426. [Google Scholar] [CrossRef] [PubMed]
- Caulin, A.F.; Maley, C.C. Peto’s Paradox: Evolution’s prescription for cancer prevention. Trends Ecol. Evol. 2011, 26, 175–182. [Google Scholar] [CrossRef]
- Kobayashi, H.; Man, S. cquired multicellular-mediated resistance to alkylating agents in cancer. Proc. Natl. Acad. Sci. USA 1993, 90, 3294–3298. [Google Scholar] [CrossRef]
- Domazet-Loso, T.; Tautz, D. Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa. BMC Biol. 2010, 8, 66. [Google Scholar] [CrossRef]
- Dang, C. Links between metabolism and cancer. Genes Dev. 2012, 26, 877–890. [Google Scholar] [CrossRef]
- Vakulenko, S.; Grigoriev, D. Deep gene networks and response to stress. Mathematics 2021, 9, 3028. [Google Scholar] [CrossRef]
- Hansberg, W.; Aguirre, J.; Rís-Momberg, M.; Rangel, P.; Peraza, L.; Montes de Oca, Y.; Cano-Domínguez, N. Cell differentiation as a response to oxidative stress. Br. Mycol. Soc. Symp. Ser. 2008, 27, 235–257. [Google Scholar]
- Tower, J. Stress and stem cells. Wiley Interdiscip. Rev. Dev. Biol. 2012, 1, 789–802. [Google Scholar] [CrossRef] [PubMed]
- Bornstein, S.; Steenblock, C.; Chrousos, G.P.; Schally, A.V.; Beuschlein, F.; Kline, G.; Krone, N.P.; Licinio, J.; Wong, M.L.; Ullmann, E.; et al. Stress-inducible-stem cells: A new view on endocrine, metabolic and mental disease? Mol. Psychiatry 2019, 24, 2–9. [Google Scholar] [CrossRef] [PubMed]
- Greaves, R.B.; Dietmann, S.; Smith, A.; Stepney, S.; Halley, J.D. A conceptual and computational framework for modelling and understanding the non-equilibrium gene regulatory networks of mouse embryonic stem cells. PLoS Comput. Biol. 2017, 13, e1005713. [Google Scholar] [CrossRef]
- Jiang, P.; Ludwig, M.L.; Kreitman, M.; Reinitz, J. Natural variation of the expression pattern of the segmentation gene even-skipped in Drosophila melanogaster. Dev. Biol. 2015, 405, 173–181. [Google Scholar] [CrossRef]
- Mjolsness, E.; Sharp, D.H.; Reinitz, J. A connectionist model of development. J. Theor. Biol. 1991, 152, 429–453. [Google Scholar] [CrossRef]
- Valiant, L.G. Evolvability. J. ACM 2006, 120, 1–19. [Google Scholar]
- Vakulenko, S.; Grigoriev, D. Instability, complexity, and evolutionz. Zap. Nauchn. Sem. POMI 2008, 360, 31–69. [Google Scholar]
- Gromov, M.; Carbone, A. Mathematical slices of molecular biology. Gaz. MathéMaticiens 2001, 88A. [Google Scholar]
- Vakulenko, S. Complexity and Evolution of Dissipative Systems; de Gruyter: Berlin, Germany, 2014. [Google Scholar]
- Ventsel, D.A.; Freidlin, M.I. Random Perturbations of Dynamic Systems; Springer: New York, NY, USA, 1984. [Google Scholar]
- Reinitz, J.; Sharp, D.H. Mechanism of formation of eve stripes. Mech. Dev. 1995, 49, 133–158. [Google Scholar] [CrossRef]
- Valiant, L.G. Evolvability. J. ACM 2009, 56, 1–21. [Google Scholar] [CrossRef]
- Horsthemke, W.; Lefever, R. Noise-induced Transitions; Springer: Berlin/Heidelberg, Germany, 1984. [Google Scholar]
- Chen, B.; Abuassba, A.O. Compartmental Models with Application to Pharmacokinetics. Procedia Comput. Sci. 2021, 187, 60–70. [Google Scholar] [CrossRef]
- Gillespie, D.T. Exact Stochastic Simulation of Coupled Chemical Reactions. J. Phys. Chem. 1977, 81, 2340–2361. [Google Scholar] [CrossRef]
- Henry, D. Geometric Theory of Semilinear Parabolic Equations; Springer: Berlin/Heidelberg, Germany, 1981. [Google Scholar]
- Szendro, I.G.; Schenk, M.F.; Franke, J.; Krug, J.; de Visser, J.A.G. Quantitative analyses of empirical fitness landscapes. J. Stat. Mech. Theory Exp. 2013, 2013, P01005. [Google Scholar] [CrossRef]
- Jiang, P.; Reinitz, J.; Kreitman, M. The effect of mutational robustness on the evolvability of multicellular organisms and eukaryotic cells. J. Evol. Biol. 2023, 36, 906–924. [Google Scholar] [CrossRef] [PubMed]
- Lin, S.; Liu, X.; Rong, Y.; Xu, Z. Almost optimal estimates for approximation and learning by radial basis function networks. Mach. Learn. 2014, 95, 147–164. [Google Scholar] [CrossRef]
- Turing, A. The chemical basis of morphogenesis. Phil. Trans. Roy. Soc. B 1952, 237, 37–72. [Google Scholar]
- Cooke, J.; Zeeman, E. A Clock and Wavefront model for control of the number of repeated structures during animal morphogenesis. J. Theor. Biol. 1976, 58, 455–476. [Google Scholar] [CrossRef]
- Meinhardt, H. Models of Biological Pattern Formation; Academic Press: London, UK, 1982; pp. 1–215. [Google Scholar]
- Baker, R.; Schnell, S.; Maini, P. A clock and wavefront mechanism for somite formation. Dev. Biol. 2006, 293, 116–126. [Google Scholar] [CrossRef]
- Kishan, K.; Rui, L.; Cui, F.; Yu, Q.; Haake, A.R. GNE: A deep learning framework for gene network inference by aggregating biological information. BMC Syst. Biol. 2019, 13, 38. [Google Scholar]
- Shen, Z.; Yang, H.; Zhang, S. Nonlinear Approximation via Compositions. Neural Netw. 2019, 119, 74–84. [Google Scholar] [CrossRef] [PubMed]
- Surkova, S.; Spirov, A.V.; Gursky, V.V.; Janssens, H.; Kim, A.R.; Radulescu, O.; Vanario-Alonso, C.E.; Sharp, D.H.; Samsonova, M.; Reinitz, J.; et al. Canalization of gene expression in the Drosophila blastoderm by gap gene cross regulation. PLoS Biol. 2009, 7, e1000049. [Google Scholar]
- Murray, J.D. Mathematical Biology, 3rd ed.; Springer: Berlin/Heidelberg, Germany, 2002. [Google Scholar] [CrossRef]
- Crampin, E.J.; Gaffney, E.A.; Maini, P.K. Reaction and diffusion on growing domains: Scenarios for robust pattern formation. Bull. Math. Biol. 1999, 61, 1093–1120. [Google Scholar] [CrossRef] [PubMed]
- Krause, A.L.; Gaffney, E.A.; Maini, P.K.; Kika, V. Modern Perspectives on Near-Equilibrium Analysis of Turing Systems. Philos. Trans. A 2019, 379, 20200268. [Google Scholar] [CrossRef]
- Nicolis, G.; Prigogine, I. Self-Organization in Nonequilibrium Systems: From Dissipative Structures to Order through Fluctuations; Wiley and Sons: New York, NY, USA; London, UK; Sydney, Australia; Toronto, ON, Canada, 1977. [Google Scholar]
- Haken, H. Synergetics: An Introduction: Nonequilibrium Phase Transitions and Self-Organization in Physics, Chemistry, and Biology; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 1978. [Google Scholar]
- Vakulenko, S.; Volpert, V. Generalized travelling waves for perturbed monotone reaction-diffusion systems. Nonlinear Anal. 2001, 46, 757–776. [Google Scholar] [CrossRef]
- Brenner, S. Life’s code script. Nature 2012, 482, 461. [Google Scholar] [CrossRef]
- Maroto, M.; Bone, R.A.; Dale, J. Somitogenesis. Development 2012, 139, 2453–2456. [Google Scholar] [CrossRef]
- Resende, T.; Ferreira, M.; Teillet, M.A.; Tavares, A.T.; Andrade, R.P.; Palmeirim, I. Sonic hedgehog in temporal control of somite formation. Proc. Natl. Acad. Sci. USA 2010, 107, 12907–12912. [Google Scholar] [CrossRef]
- Pourquié, O. The chick embryo: A leading model for model in somitogenesis studies. Mech. Dev 2004, 121, 1069–1079. [Google Scholar] [CrossRef]
- Heltberg, M.L.; Krishna, S.; Jensen, M. On chaotic dynamics in transcription factors and the associated effects in differential gene regulation. Nat. Commun. 2019, 10, 71. [Google Scholar] [CrossRef]
- Wolpert, L.; Tickle, C.; Jessell, T. Principles of development; Oxford University Press: Cambridge, UK, 2002. [Google Scholar]
- Reinitz, J.; Vakulenko, S.; Sudakow, I.; Grigoriev, D. Robust morphogenesis by chaotic dynamics. Sci. Rep. 2023, 13, 7482. [Google Scholar] [CrossRef] [PubMed]
- Maroudas-Sacks, Y.; Keren, K. Mechanical Patterning in Animal Morphogenesis. Annu. Rev. Cell Dev. Biol. 2021, 37, 469–493. [Google Scholar] [CrossRef] [PubMed]
- Sudakow, I.; Vakulenko, S.; Grigoriev, D. Excitable media store and transfer complicated information via topological defect motion. Commun. Nonlinear Sci. Numer. Simul. 2023, 116, 106844. [Google Scholar] [CrossRef]
- Vakulenko, S.; Grigoriev, D. Complexity of gene circuits, Pfaffian functions and the morphogenesis problem. Compte Rendu Math. 2003, 337, 721–724. [Google Scholar] [CrossRef]
- Vakulenko, S.; Radulescu, O.; Reinitz, J. Size Regulation in the Segmentation of Drosophila: Interacting Interfaces between Localized Domains of Gene Expression Ensure Robust Spatial Patterning. Phys. Rev. Lett. 2009, 103, 168102–168106. [Google Scholar] [CrossRef]
- Iranzo, J.; Martincorena, I.; Koonin, E. Cancer-mutation network and the number and specificity of driver mutations. Proc. Natl. Acad. Sci. USA 2018, 115, E6010–E6019. [Google Scholar] [CrossRef]
- Longa, H.; Miller, S.F.; Strauss, C.; Zhao, C.; Cheng, L.; Ye, Z.; Griffin, K.; Te, R.; Lee, H.; Chen, C.; et al. Antibiotic treatment enhances the genome-wide mutation rate of target cells. Proc. Natl. Acad. Sci. USA 2016, 113. [Google Scholar] [CrossRef]
- Milholland, B.; Dong, X.; Zhang, L.; Hao, X.; Suh, Y.; Vijg, J. Differences between germline and somatic mutation rates in humans and mice. Nat Commun. 2017, 8, 15183. [Google Scholar] [CrossRef]
- Kimura, M. The Neutral Theory of Molecular Evolution; Cambridge University Press: Cambridge, UK, 1983. [Google Scholar]
- Keizer, J. Statistical Thermodynamics of Nonequilibrium Processes; Springer: New York, NY, USA, 1987. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).