Next Article in Journal
Microbiota-Derived Extracellular Vesicles as Potential Mediators of Gut–Brain Communication in Traumatic Brain Injury: Mechanisms, Biomarkers, and Therapeutic Implications
Previous Article in Journal
EC359 Enhances Trametinib Efficacy in Ras/Raf-Driven Ovarian Cancer by Suppressing LIFR Signaling
Previous Article in Special Issue
Recycling of Uridylated mRNAs in Starfish Embryos
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization

1
Acquisition, Technology & Logistics Agency, Shinjuku-ku 162-8870, Japan
2
Department of Applied Life Sciences, Faculty of Agriculture, Shizuoka University, Shizuoka 422-8529, Japan
3
Research and Education Center for Natural Science, Keio University, Yokohama 223-8521, Japan
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Biomolecules 2025, 15(10), 1397; https://doi.org/10.3390/biom15101397
Submission received: 31 August 2025 / Revised: 25 September 2025 / Accepted: 27 September 2025 / Published: 30 September 2025
(This article belongs to the Special Issue Gametogenesis and Gamete Interaction, 2nd Edition)

Abstract

Allorecognition, or distinguishing between the self and nonself within the same species, is observed in both animals and plants, particularly in the context of immune reactions and self-incompatibility in sexual reproduction. Polymorphic recognition molecules are known to be responsible for such allorecognition during fertilization. Previous studies have reported that in ascidians and flowering plants, inbreeding avoidance relies on a pair of polymorphic recognition molecules with a receptor-ligand relationship that are encoded at a single locus, the S locus (Self-incompatibility locus), but the process by which such pairs of recognition molecules emerge and evolve to become polymorphic is not known. Here, a population genetics study was carried out as a novel approach for investigating allorecognition. To study the process by which self-recognition emerges, we simulated a situation in which an allorecognizing genotype is generated from a nonallorecognizing genotype through mutation and then analyzed whether the two genotypes could coexist. The conditions under which the numbers of allorecognition alleles could increase over evolutionary time were investigated, and the generational dynamics of nonallorecognizing genotypes were analyzed. Subsequent modeling was carried out to reproduce the allorecognition mechanism in Ciona, and consistency between the simulation results and experimental data was observed. Our approach provides new insight into the evolutionary process of allorecognition.

Graphical Abstract

1. Introduction

Fertilization, or gamete fusion, is a pivotal event in sexual reproduction in animals and plants. In general, a male gamete, or sperm, is released and attracted to a female gamete, or egg. Whereas most flowering plants are hermaphrodites, a majority of the animals are dioecious, with some exceptions, including ascidians. Among angiosperms, approximately half of all species, including rice and Arabidopsis, are self-compatible [1,2,3]. Similarly, both self-incompatible and self-compatible species of ascidians have been characterized: Ciona intestinalis and Halocynthia roretzi are well-known self-incompatible species, whereas Phallussia mammillata is self-compatible [4,5].
Self-compatibility appears to be advantageous for population growth, since it allows female gametes to be fertilized with male gametes from the same animal even in the absence of a mating partner: a good example of a self-compatible animal species is Caenorhabditis elegans [6]. On the other hand, individuals produced by self-fertilization have lower genetic diversity than those produced by nonself-fertilization, and as a result, self-fertilization causes inbreeding depression [7]. For example, in Ciona, self-fertilization results in the production of only a small number of normal embryos and larvae [8]. Self-fertilization in hermaphroditic colonial tunicates of the genus Botryllus also results in the production of an extremely small number of progeny due to embryonic death [9]. Therefore, although the prevention of self-fertilization has a certain disadvantage in terms of reducing opportunities for reproduction, it appears to have a significant advantage in eliminating harmful genes and promoting the spread of beneficial genes.
Although some species avoid self-fertilization by modulating reproductive behavior, such as by releasing male and female gametes at different times [10,11], many species prevent self-fertilization through allogenic recognition (allorecognition) mechanisms. In Ciona intestinalis type A (another name Ciona robusta), sperm can recognize eggs from the same individual upon attachment to the vitelline coat (VC), causing detachment from the VC or eventual cessation of motility, ultimately preventing self-gamete fusion. The sperm-VC recognition protein s-Themis, a PKD-like transmembrane protein, and the VC protein v-Themis, a fibrinogen-like protein, play key roles in this process [4]. Three pairs of s-Themis and v-Themis genes, s/v-Themis-A, s/v-Themis-B, and s/v-Themis-B2, are found at two loci: s/v-Themis-A on chromosome 2q (locus A) and s/v-Themis-B and s/v-Themis-B2 on chromosome 7q (locus B). In every case, v-Themis genes are located in the first intron of s-Themis genes but are transcribed in the opposite direction [4,5,12,13]. If three allelic pairs (A, B and B2) of s-Themis and v-Themis are matched, i.e., the same haplotype, fertilization is prevented, enabling self-incompatibility [12].
Similarly, in Papaveraceae, if self-pollen reaches the stigma, pollen germination and pollen tube elongation do not occur, and as a result, self-fertilization never takes place. The pollen recognition protein PrpS and the stigmatic recognition protein PrsS, genes of which are encoded in a single diverse genetic locus, the S-locus, in close proximity (within 0.5 kbp) to one another, are responsible for self-incompatibility in Papaveraceae [1,14].
In both cases, male and female recognition molecules, encoded by genes at one or a few loci, play pivotal roles in ligand-receptor interactions. When VC or stigmatic ligand molecules interact with receptors on self-sperm or self-pollen, respectively, intracellular Ca2+-mediated signal transduction occurs, resulting in the prevention of self-fertilization [15,16]. Both male and female allorecognition proteins are thought to be highly polymorphic and mediate specific interactions [17]. The polymorphic alleles responsible for such allorecognition are presumed to have originated from a single ancestral gene, encoding a pair of ancestral male and female recognition molecules believed to have diverged to produce polymorphisms during coevolution [18].
Here, we investigated the emergence and evolution of allorecognition molecules using a population genetics approach. For the emergence process, we evaluated whether a primitive self-recognition gene that emerged by mutation was maintained from generation to generation. The process, by which a pair of self-recognition proteins acquire variation by mutation was then simulated using a model in which a male or female mutation preceded the creation of a new allorecognition gene pair. Finally, we examined how the proportions of allorecognition alleles and nonallorecognition alleles changed when the number of allorecognition alleles or loci increased.

2. Materials and Methods

For this study we considered three preconditions for our modeling, and the model represents phenomena that occur in animals. Therefore, the sperm and the egg in the animal model correspond to the pollen and the pistil in plants, respectively.
The first precondition is that the pair of allorecognition molecules is specialized for fertilization and is not shared with other types of allorecognition, such as nonself recognition in the immune system. At least in the ascidian, Halocynthia roretzi, different molecular mechanisms have been suggested for fertilization and the immune response, since the characteristics of self-incompatibility and allogenic refusion of somatic cells (contact reaction) differ [19,20]. Additionally, the Fu/HC gene, which plays a key role in histocompatibility, was not expressed in germline cells in the ascidian Botryllus schlosseri [21].
The second precondition is that male recognition molecules are expressed from haploid cells, whereas female recognition molecules are derived from diploid cells, whether from eggs arrested at the first metaphase of meiosis in ascidians or from the somatic cells surrounding eggs, as reported in C. intestinalis [5,22]. In plants, this type of self-recognition in mating is called gametophytic, distinguished from sporophytic, in which male recognition molecules are synthesized in the anther tapetum (somatic cells) and attached to the surface of the pollen, although both gametophytic and sporophytic self-recognition have been reported [1,23].
The third precondition for modeling was that neither mutations nor gene migration occured during the simulation period and that generations did not overlap, as is typical in the modeling of population genetics [24]. This is to focus our examination on each step of emergence and the evolution of allorecognition, which consists of the appearance of a certain new allele by mutation and subsequent convergence to equilibrium through the spread of the selection of alleles in the population, rather than simulating the mutation process itself in real-time. In the simulation, genotypes compete through natural selection rather than genetic drift.

2.1. Modeling for the Emergence Process of Allorecognition Genes

The model for the process of self-recognition gene emergence is shown in Figure 1. X represents a nonallorecognition allele, Y represents an allorecognition allele that emerged as a result of the mutation, and a n and b n indicate the proportions of individuals with the XX and XY genotypes, respectively, in the n-th generation, where a n + b n = 1 , because the YY genotype does not exist (as described below).
Since the numbers of sperm and eggs should be proportional to the number of individuals, a n X-genotype sperm are produced from XX individuals, while b n / 2 X-genotype and b n / 2 Y-genotype sperm are produced from XY individuals (Figure 1A). Similarly, the ratio of XX-genotype eggs to XY-genotype eggs can be regarded as a n vs. b n . Although X-genotype sperm can fertilize any egg, sperm carrying the Y allele can fertilize only XX-genotype eggs (Figure 1B).
Assuming that all the eggs undergo fertilization with an excess amount of sperm, the number of offspring in the next generation produced by each combination of gametes can be expressed as the product of the proportions of eggs and sperm, as shown in Figure 1C. Here, YY-type offspring cannot occur, since Y-type sperm cannot fuse with XY-type eggs. Before the number of offspring for each genotype is summed, the survival probability α must be adjusted if the parental genotypes are the same because inbreeding depression must be considered for offspring of pairs of the same genotype, i.e., self-fertilized offspring. For cases of inbreeding depression, the value of α should be between 0 and 1, where α = 0 indicates that inbreeding is completely lethal, α = 1 indicates that inbred and outcrossed offspring exhibit equal survival, and α > 1 indicates that inbred offspring have greater survival than outcrossed offspring do, which is unlikely. Finally, the sums of each genotype are defined as a n + 1 and b n + 1 , respectively; then, a n + 1 and b n + 1 , which are normalized values obtained by dividing each value by their sum, defined as A l l n + 1 , are the proportions of XX and XY genotype individuals of the n+1st generation (Figure 1D).
For the simulation, a 0 and b 0 were set to arbitrary initial values, and the generation calculation in Figure 1D was simply repeated as many times as necessary for the analysis. The reproducibility and robustness of the convergence points of each population were confirmed by the simulation using randomized initial conditions; the same was applied to the following models.

2.2. Modeling of the Evolution of Allorecognition Genes

2.2.1. Divergence Through Mutation

To study the evolutionary process of allorecognition genes, namely, the generation of variation in allorecognition alleles, the process of allorecognition allele divergence must be considered first. Thus, situations in which a mutation occurred in a gene encoding the male or female recognition molecule of a primitive allorecognition gene in the previous model (Figure 1) were modeled as shown in Figure 2, Appendix A.1, and Supplementary Material S1.
In addition to the definition in Figure 1, we defined an allele YF, in which a mutation occurred in a gene encoding the female recognition molecule on allele Y and whose function was lost, and an allele YM, in which a mutation occurred in a gene encoding the male recognition molecule (Figure 2A). A mutated female recognition molecule derived from the YF allele does not lead to fertilization failure for sperm with a male recognition molecule encoded by either the Y allele or the YF allele, whereas the sperm of YF cannot fertilize eggs with the Y allele. Therefore, this mutation results in one-way sterility. With respect to the YM allele, sperm with a mutated male recognition molecule derived from the YM allele are permitted to fertilize eggs with the Y allele or YM allele, whereas eggs with the YM allele cannot be fertilized by sperm with the Y allele, similar to eggs with the Y allele. Models for alleles YF or YM were subsequently constructed (Figure 2B,D, respectively), and the mating characteristics of the genotypes in these models are shown in Figure 2C,E.
On the basis of these features, the relationships of the proportions of the n-th generation and the n+1-st generation were derived as Equations (A1) and (A2) through works similar to those in Figure 1C,D (see Figure S1A,B). Changes in the population occupancy rates of the YF allele or the YM allele were analyzed by repeating these generation calculations.
For further analysis of the divergence process, a situation in which a new allele encoding a male recognition molecule adapted to the mutated female molecule appears, must be considered. This new allele was defined as allele Z, and the populations consisting of X, Y, Z and YF were modeled as shown in Figure 2F. In this case, the XX, XY, XZ, YZ, XYF, YYF, ZYF, and YFYF genotypes are present.
Only the X-type sperm can be accepted by any egg, the Y-type sperm can be fertilized with only the XX- and XZ-type eggs, and the Z-type and YF-type sperm can fertilize XX, XY, XYF, YYF, and YFYF genotype eggs (Figure 2G). On the basis of these features, the relationship between the proportions of the n-th generation and the n+1-st generation was derived by Equation (A3) through a process similar to those in Figure 1C,D (see Figure S1C). Changes in the population occupancy rates of alleles X, Y, YF, and Z were simulated by repeating the generation calculations.

2.2.2. Model with Divergent Alleles

To simulate a situation in which the number of allorecognition alleles increases, it is necessary to consider three groups of genotypes, namely, a homozygous genotype of the nonallorecognition allele, a heterozygous genotype comprising the no allorecognition allele and one of the allorecognition alleles, and a heterozygous genotype comprising two different allorecognition alleles ( a , b , and c in Figure 3A, respectively).
As shown in Supplementary Material S2, a preliminary study in which two types of allorecognition alleles (Y1 and Y2 in Figure S2A) coexisted with the nonallorecognition allele X (defined as M = 2 ) revealed that heterogenotypic populations of a nonallorecognition allele and one of the allorecognition alleles ( b 1 n and b 2 n in Figure S2C,D), placed under the same conditions, converged to the same proportion as the number of generations increased, even if the simulations started from different initial proportions ( b 1 0 and b 2 0 ). Additionally, a preliminary study with three types of allorecognition alleles (defined as M = 3 ) revealed that the heterogenotypic populations consisted of two different allorecognition alleles ( c 1 n , c 2 n , and c 3 n in Figure S2G,H) converged to the same proportion, since these genotype populations were selected under the same conditions.
On the basis of the results of these preliminary studies, the simulation model was extended to M > 3 and generalized. The characteristics of the model with “M” types of allorecognition alleles are shown in Figure 3. Here, heterogenotypic populations under the same conditions were substituted as b 1 n = b 2 n = b 3 n = = b M n = b n and c 1 n = c 2 n = c 3 n = = c MC 2 n = c n (Figure 3A). Afterward, the relationship between the proportions of the n-th generation and the n+1-st generation was derived by Equation (A4) through procedures similar to those in Figure 1C,D (see Supplementary Material S3). Changes in the occupancy rates of alleles X, and Ys were simulated by repeating the generation calculations.

2.3. Modeling the 2-Locus Situation

To investigate the effects of the presence of multiple allorecognition loci, a 2-locus model was constructed according to the characteristics of C. intestinalis. To simulate the fertilization of C. intestinalis, sperm cannot fertilize eggs that share the same alleles at all the loci (Figure 4A).
The definitions of the genotypes and their characteristics are described in Figure 4B and Figure S4A. Loci A and B have “M” and “L” types of allorecognition alleles, respectively. As in the model with divergent alleles above, genotypes under the same conditions were combined; e.g., b n represents M types of populations, with homononallorecognition in locus A and heteroallorecognition in locus B, under the same conditions.
The relationships between the proportions of the n-th generation and the n+1-st generation were subsequently derived by Equation (A5) through a process similar to that in Figure 1C,D. Changes in the population occupancy rates were simulated by repeating the generation calculation.

3. Results

3.1. Emergence Process of Allorecognition Genes

Using a model for the emergence process described in the Materials and Methods (Figure 1), we observed the proportions of genotypes across generations.
The changes in the proportion of XY genotype individuals ( b n ) across generations when the initial value b 0 is fixed at 0.01 and the survival probability α varies between 0.01 and 1.1 are shown in Figure 5A. As shown in Figure 5A, b n converges to a certain value when 1 > α > 0 , whereas it converges to 0 for α > 1 and decreases monotonically for α = 1 , indicating that genotypes XX and XY coexist as long as 1 > α .
As shown in Figure 5B, when α is fixed at 0.2, simulations with any initial value b 0 yielded convergence of the XY proportion at the same value of approximately 0.45 after multiple generations. In other cases where α < 1 , the proportion of XY genotype individuals which converged over generations was dependent on only α regardless of the b 0 values, and reproducibility was confirmed by the simulation using randomized initial conditions, as shown in Supplementary Material S5.
In conclusion, the allorecognition allele can be fixed in the population at a certain proportion that depends on α . In such a situation, the allorecognition allele (Y) and nonallorecognition allele (X) are in a state of balanced selection. Although heterozygous dominance is known to lead to balanced selection as in the case of sickle-cell polymorphisms in malaria-endemic regions [25,26], the relationship between X and Y here involves a different mechanism.

3.2. Process of Allorecognition Gene Evolution

3.2.1. Process of Divergence Through Mutations in Male/Female Genes

When polymorphism of allorecognition alleles occurs, it is unlikely that a gene pair encoding male and female recognition molecules will simultaneously undergo mutation to generate a new gene pair that encodes a pair of functional recognition molecules. Therefore, it is natural to hypothesize that mutation occurs first within only one sex (gene encoding a male or female recognition molecule), followed by adaptive mutation of the gene encoding the corresponding molecule. Under this hypothesis, we simulated situations in which only the male or female allorecognition molecule underwent loss-of-function mutation via the models described in the Materials and Methods (Figure 2), and the resulting changes in the allele occupancy rates were determined.
First, when the YF allele emerges from the mutation in the population consisting of alleles X and Y, the YF allele is fixed in the population at a certain rate, and the population transitions to a state comprising X, Y, and YF alleles when α < 1 . At this time, the converged proportion values of each genotype depended not on the initial value but rather on α . Representative results of the simulations are shown in Figure 6A, and the summarized results of the simulation at convergence are shown in Figure 6B. These results indicate that the YF allele coexists with the X and Y alleles.
In contrast, when the case in which the YM allele appears in the population consisting of alleles X and Y is simulated, the Y allele tends to disappear from the population. As shown in a representative result (Figure 6C), genotypes with the Y allele, namely, XY and YYM, converged to 0 in the presence of the YM allele at α = 0.5 . The summarized results of the simulations (Figure 6D) show the converged proportion values of each genotype for various α values, indicating that the Y allele does not survive in all the cases. Therefore, the elimination of the original Y allele occurs through mutation of the gene encoding the male recognition molecule on Y allele.
Considering the evolution of polymorphisms in the allorecognition gene, we focus first on the case in which the mutation occurs in the gene encoding the female recognition molecule, since this case can be a transition state to the creation of a new allorecognition allele, whereas mutation in the male gene leads to the disappearance of the original allorecognition allele.
To create a new allorecognition allele, the male recognition molecule must adapt to the mutated female recognition molecule encoded by the YF allele. This requires a mutation in the gene encoding the male recognition molecule of allele YF, and an allele that accomplishes such a mutation is defined as allele Z, as shown in Figure 2A. Therefore, simulations for the state comprising alleles X, Y, Z and YF were conducted using the model described in the Materials and Methods (Figure 2).
An example of the results is shown in Figure 6E; the proportions of each genotype tended to converge to a fixed value, and none of them were eliminated when α < 1 . The convergence proportions depend on α regardless of the initial value, as in previous simulations. The relationships between α and the proportions of alleles after different numbers of generations are shown in Figure 6F, indicating that all the genotypes coexist in all the cases where α 1 . A comparison of the data in Figure 6B,F revealed that the proportion of YF allele was significantly reduced by the emergence of the Z allele even at the same α .
The simulations in this section suggest that the creation of the new allorecognition allele consists of two steps. First, one of the existing alleles is mutated, and a “transition allele” encoding a new female recognition molecule and the original male recognition molecule is generated and can be maintained in the population. In the second step, a “new allorecognition allele” encoding a male recognition molecule that can recognize the new female recognition molecule, encoded by the transition allele, is generated by the mutation and coexists in the population, accompanied by a significant reduction in the proportion of transition alleles, which ultimately can be eliminated by genetic drift.

3.2.2. Behavior of the Model with Divergent Alleles

In the above studies, allorecognition alleles and nonallorecognition allele coexisted across generations, which was related to balancing selection. Next, we focused on how this relationship changes when the number of allorecognition alleles increases.
Using the models described in the Section 2 (Figure 3), the proportion of nonallorecognition allele after different numbers of generations was simulated in the presence of multiple allorecognition alleles. When the number of types of allorecognition alleles is defined as “M”, there are M types of heterozygous genotypes of the nonallorecognition allele and one of the allorecognition alleles (“ b n ”) and MC2 types of heterozygous genotypes of the two different allorecognition alleles (“ c n ”), as described in Figure 3A.
The results of the simulation up to M = 50 are summarized in Figure 7. The proportions of the XX genotype at the point of convergence against M for several α values are shown in Figure 7A, indicating that even when M increases to 15, the XX genotype still accounts for a certain percentage of the whole population. Even if α = 0.01 , meaning that cross-fertilization between the different genotypes is 100 times more advantageous for survival than fertilization between the same genotypes, more than 10% of the populations exhibited the XX genotype at M = 15. Additionally, a large proportion of individuals possessed the nonallorecognition allele X alone (Figure 7B). Further simulation up to M = 50 (Figure 7C,D) revealed that the proportion of the XX genotype and the X allele occupancy rate decreased monotonically with increasing M, although the decrease slowed. The data in Figure 7C,D indicate that even when M increases, a considerable number of nonallorecognition allele remains in heterozygosity with allorecognition alleles (“b” in Figure 7C).
Therefore, as the number of allorecognition alleles increases, the occupancy rate of the nonallorecognition allele decreases, but the allele survives in the population, mainly in heterozygosity with allorecognition alleles, and direct selection does not occur. These findings suggest that there is a high likelihood that nonallorecognition allele persists in populations of animals and plants that avoid self-mating by allorecognition.

3.3. Simulation Using the 2-Locus Model

In C. intestinalis, three gene pairs encoding allorecognition molecules, s/v-Themis-A in locus A and s/v-Themis-B and s/v-Themis-B2 in locus B, are similar to each other and have been suggested to share a common ancestral locus. In this section, we analyzed the effects of the presence of multiple allorecognition loci, especially the occupancy rates of nonallorecognition alleles at different loci.
A 2-locus model simulating C. intestinalis was constructed as described in the Materials and Methods (Figure 4), and changes in the occupancy rates of nonallorecognizing alleles were observed. The results of the simulations are shown in Figure 8, in which “L” is the number of alleles at locus A and “M” is the number of alleles at locus B. When the numbers of alleles at both loci were the same ( L = M ) and the initial proportions of the nonallorecognition alleles at both loci were equal ( X 0 : Y 0 = x 0 : y 0 ), all the genotypes, as defined in Figure 4B, coexisted after multiple generations (Figure 8A,B). However, when either L = M or X 0 : Y 0 = x 0 : y 0 was unbalanced, the proportion of the nonallorecognition allele (X or x) at the locus with the greater number of allorecognition allele types (L and M) or the smaller initial proportion of the nonallorecognition allele was greatly reduced, trending toward disappearance. Figure 8C shows an example where L M (M > L), and Figure 8D shows a case where X 0 : Y 0 x 0 : y 0 ( b 0 > c 0 ). The proportion of nonallorecognition alleles, which decreased in this way, continued to decrease monotonically even after five thousand generations, suggesting a very high probability for the elimination of such nonallorecognition alleles through genetic drift. Such alleles are no longer in a state of balanced selection but rather in a state of direct selection. These reductions did not depend on the value of α (Figure 8E). In addition, in simulations of situations in which the nonallorecognition allele of one locus had disappeared, the nonallorecognition allele of the remaining other locus tended to persist (Supplementary Material S4 (Figure S4D–G)).
Therefore, when each of the two loci is polymorphic, the nonallorecognition allele tends to be eliminated at one locus, and survive at the other locus. In C. intestinalis, a nonallorecognition allele has been observed only at locus B (B2-3 allele in Ref. [12]), indicating that our model is consistent with observed phenomena.

4. Discussion

4.1. Validation of Simulations with Analytical Results

The reproducibility and robustness of the convergence points of each population were confirmed by the simulation using randomized initialization as shown in Table S1 and Supplementary Material S6. All the results showed consistent convergence to identical equilibrium states under various random initial conditions, indicating the reliability and robustness of the modeling.
The analytical solution of the model for the emergence process shown in Figure 1 was examined. When the proportion of the XX genotype converges, a n and a n + 1 should be equal. Therefore, the equilibrium points were calculated by substituting the equations in Figure 1C into the simultaneous equations a n = a n + 1 and a n + b n = 1 . Then, real numbers between 0 and 1 satisfying Equation (1) are the equilibrium points for the model.
a n 1 2 1 α a n 3 + 5 4 α a n 2 + 3 α 1 a n α = 0
When 1 > α > 0, Equation (1) has two equilibrium points. Although one ( a = 1) is an unstable equilibrium point, the other is a locally asymptotically stable equilibrium point such that the proportion of the genotype XX will converge to this stable equilibrium point regardless of the initial conditions. On the other hand, when α > 1 , a = 1 is the only equilibrium point of Equation (1), a state in which only the XX genotype exists. As shown in Table 1, the convergence points from the simulations match the stable equilibrium points obtained as the analytical solution.
Additionally, the analytical solution for the model with diversified alleles in Figure 3 was considered. As in the model for the emergence process, the following simultaneous Equation (2) were obtained by substituting the equations in Equation (A4) into simultaneous equations a n = a n + 1 , b n = b n + 1 , and a n + M b n + C 2 c n = 1 M   . Then, “ a ” and “ b ”, which are between 0 and 1 and satisfy the equations in Equation (2), are the equilibrium points for the model.
4 1 α a 4 + 2 1 α M a 3 b + 4 M 1 α + 2 α 1 a 3 + 2 1 + α M a 2 b + 3 2 α M 2 a b 2 + 4 M 1 α 1 a 2 + 4 M M 1 a b + M + α 1 M 2 b 2 + 4 M 1 a = 0 8 1 α M a 4 b + 8 1 α M 2 a 3 b 2 + 2 1 α M 3 a 2 b 3 8 a 4 + 4 3 M 4 1 α 3 M a 3 b + 2 3 M 4 1 α 3 M 2 a 2 b 2 + 3 4 α M 3 a b 3 + 2 1 α M 4 b 4 8 2 M 3 a 3 4 M 1 M 7 + α M 2 M a 2 b 2 4 M α M 2 a b 2 + M 6 α 2 M 5 M 3 b 3 8 M 1 M 3 a 2 4 M 2 M 2 4 M + 1 a b 2 M 2 + M 6 M 2 α M 2 b 2 + 8 M 1 2 a + 4 M M 1 b = 0
As shown in Table 2, the convergence points from the simulations coincided with the stable equilibrium points obtained as the analytical solution. These alignments indicate that the simulation model and the approach of this study are appropriate for understanding the emergence and evolutionary processes in the allorecognition system.
An increase in the number of types of alleles leads to an increase in the probability of elimination of nonallorecognition alleles, especially if there are two or more loci. With respect to the one-locus situation, theoretically, when the number of allorecognition allele types (M) increases to infinity, the following simultaneous equations (Equation (3)) are obtained by considering only the highest-order terms, namely, the cubic and quartic terms of Equation (2).
b 2 = 0   2 1 α b 4 + 1 2 α b 3 2 b 2 = 0
The solution of Equation (3) is b = 0. Since a = 1 or a = 0 satisfies Equation (2), ( a , b , c ) = (1, 0, 0) and ( a , b , c ) = (0, 0, 1/MC2) are equilibrium points. Whereas (1, 0, 0) is unstable, (0, 0, 1/MC2) is a stable equilibrium point. Therefore, as “M” continues to increase to infinity, the proportions of genotypes “ a ” and “ b ” converge to zero; that is, the nonallorecognition allele X will theoretically be eliminated from the population.

4.2. Comparison with Observed Phenomena

The simulations in this study suggest that the nonallorecognition allele may not be eliminated from the population; in nature, such an allele (B2-3 allele in C. intestinalis) exists and cannot perform the allorecognition function [12]. The sequence of this B2-3 allele is not particularly close to that of any of the other alleles of Themis-B2 (Supplementary Material S7), suggesting that the B2-3 allele has been independent for many generations.
The simulation indicates that the fitness of the nonallorecognition allele generally depends on α . There are no data comparing the fitness of offspring of the same allorecognition genotype with different genotypes, and the size of α may change depending on the size of the population, since the probability that mating between the same genotypes represents self-mating decreases with increasing population size. However, as mentioned above, in ascidians, self-fertilized embryos very often fail to develop [6,9], and abnormalities in seed formation caused by self-pollination have been reported in plants [27]. These observations indicate that self-fertilization carries considerable risk, and α must be very small.

4.3. Asymmetric Characteristics of Female and Male Recognition Molecules

Mutation of the gene encoding a female recognition molecule allows the coexistence of a transition allele (YF) and an original allele (Y) over a wide range of values of α . Therefore, the divergence of allorecognition alleles may originate from mutation of the gene encoding the female recognition molecule. In contrast, mutation of the gene encoding the male recognition molecule must be somewhat suppressed, since mutations that inactivate the male recognition molecule reduce the diversity of allorecognition loci, namely, the number of alleles, through selection for the original allele.
In sexual conflict, since the female typically exerts selection on the maleside, the male side undergoes changes to its elements or behaviors to expand the opportunities for mating or to exclude others, whereas the female side changes its elements or behaviors strictly in opposition to the changes on the male side [28,29]. Conversely, as revealed in this study, during the process of allorecognition molecule divergence, mutations in the gene encoding the female recognition molecule occur first, although they increase the risk of self-fertilization, and are not desirable for females. This phenomenon in self-sterility would be an interesting example from the standpoint of sexual conflict, since genes encoding female molecules are mutated first, followed by mutation in the male molecules.

4.4. Tolerance to Mutations and Molecular Evolution

In C. intestinalis, locus B consists of the tandem genes s/v-Themis-B and -B2 [12]. The ancestral locus is thought to have duplicated. With tandem loci, loss of the allorecognition function through a mutation in one copy can be complemented by the other, although s/v-Themis-B and -B2 are not always linked (manuscript in preparation). The tolerance of mutations in this way may promote the creation of variation in such loci, since mutation is necessary for generating polymorphisms, as shown in this study. In fact, the number of polymorphisms is greater in locus B than in locus A. This finding is consistent with previous statistical research suggesting that mutation and recombination play key roles in the creation and maintenance of polymorphisms in allorecognition loci [30].

5. Conclusions

In this research, we introduced a new modeling and simulation approach to investigate the emergence and evolution of allorecognition and observed its consistency with real-world phenomena, such as the coexistence of nonallorecognition allele.
Our study suggests that allorecognition genes, newly generated by mutation, and nonallorecognition allele established a balanced selection state, after which allorecognition diversified to construct the current allorecognition system. We also revealed that mutations in genes encoding male and female allorecognition molecules showed different characteristics and predicted the diversification process of the allorecognition molecules in which female molecules differentiate first.
Although the current study was based on the ascidian C. intestinalis, self-incompatibility has also been studied in other animals and plants such as B. campestris, and the detailed mechanisms differ among species; for example, both gametophytic and sporophytic mechanisms have been reported in plants. Therefore, we expect that our models could be tailored to different situations in different species to provide novel tools for furthering our knowledge and understanding of allorecognition.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biom15101397/s1; Supplementary Material S1: Details of the models for mutation in male or female genes. Supplementary Material S2: Preliminary studies for the model with divergent alleles. Supplementary Material S3: Derivation process of the model with divergent alleles. Supplementary Material S4: Details of the 2-locus model. Supplementary Material S5: Supplemental results for Figure 5. Supplementary Material S6: Verification of the reproducibility of the models. Supplementary Material S7: Phylogenetical analysis for Themis-B2 alleles.

Author Contributions

Conceptualization, M.N.; methodology, M.N.; simulation, M.N.; sequence analysis, T.S.; writing, M.N.; supervision, M.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by JSPS KAKENHI, grant numbers JP22112520 (MM), JP23H04738, JP23K05779 and JP25H01817 (TS).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

We thank the National BioResource Project (AMED. Japan) for providing C. intestinalis. We express our sincere thanks to Hitoshi Sawada for providing his wide knowledge on observed self-incompatible phenomena.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
allorecognitionallogenic recognition
VCvitelline coat

Appendix A. Formulae for Generation Calculations

Appendix A.1. Models for Mutations in Male or Female Genes

For the model comprising alleles X, Y, and YF, the following equations were derived to calculate the n+1-st generation from the n-th generation on the basis of the data in Figure 2B,C, and Supplementary Material S1 (Figure S1A).
a n + 1 = α a n 2 + c n 2 4 + a n b n + 2 c n 2 + α b n 2 + b n 2 a n + c n 2 2 a n + b n + c n + b n c n 4 b n + 1 = a n 2 + c n 4 b n + d n + α b n 2 + 2 a n + c n b n + d n + b n d n 2 2 a n + b n + c n c n + 1 = α c n 2 2 + a n c n + 2 e n + c n e n + a n d n + b n e n + d n 2 + c n b n + d n 4 d n + 1 = c n 4 + e n 2 b n + d n e n + 1 = α c n 2 4 + e n 2 + e n c n + d n 2 + c n d n + 2 e n 4
In Equation (A1), a n e n indicate the population of each genotype in the n-th generation (defined in Figure 2B), and α represents the survival probability.
For the model comprising alleles of X, Y, and YM, the following equations were derived to calculate the n+1-st generation from the n-th generation on the basis of the data in Figure 2D,E, and Supplementary Material S1 (Figure S1B).
a n + 1 = α a n 2 + a n b n + c n 2 + α b n 2 + c n 2 + 2 a n b n + a n c n + b n c n 2 2 b n d n   b n + 1 = a n b n + d n 2 + α b n 2 + 2 a n + c n b n + d n + b n d n 2 2 b n d n   c n + 1 = a n e n + a n c n + d n 2 + 2 α c n 2 + 2 a n + b n c n + 2 a n + b n + c n d n + 2 e n 2 2 b n d n d n + 1 = α d n 2 + b n c n + d n + 2 e n + d n c n + 2 e n 2 2 b n d n   e n + 1 = α c n 2 + d n 2 + 4 e n 2 + 4 e n c n + d n + 2 c n d n 2 2 b n d n
In Equation (A1), a n e n indicate the population of each genotype in the n-th generation (defined in Figure 2B), and α represents the survival probability.
For the model comprising alleles of X, Y, Z, and YF, the following equations were derived to calculate the n+1-st generation from the n-th generation on the basis of the data in Figure 2F,G, and Supplementary Material S1 (Figure S1C).
a n + 1 = α a n 2 + a n b n + c n + e n 2 + α c n 2 + e n 2 + 2 a n + b n c n + e n + 2 c n e n 2 2 c n d n g n + α b 2 + b n 2 a n + c n + e n 2 1 + a n + c n f n h n b n + 1 = a n b n + d n + f n + d n + f n 2 + c n + e n b n + d n + f n 2 2 c n d n g n + α b 2 + b n 2 a n + c n + e n 2 1 + a n + c n f n h n   c n + 1 = a n c n + d n + g n + d n 2 + α c n 2 + 2 a n + b n + e n c n + g n + c n g n 2 2 c n d n g n + b n c n + d n + g n 2 1 + a n + c n f n h n d n + 1 = c n + g n b n + d n + f n 2 2 c n d n g n + b n c n + d n + g n 2 1 + a n + c n f n h n e n + 1 = a n e n + f n + g n + 2 h n + f n 2 + 2 α e n 2 + c n + e n f n + g n + 2 h n + e n 2 a n + b n + 2 c n + g n + 2 h n 2 a n + b n + c n + e n 2 2 c n d n g n f n + 1 = b n + d n + f n e n + g n + 2 h n 2 2 c n d n g n g n + 1 = α g n 2 + c n + g n e n + f n + 2 h n + c n g n 2 2 c n d n g n h n + 1 = α e n 2 + g n 2 + 4 h n 2 + e n + g n f n + 4 h n + 2 e n g n + 2 f n h n 2 2 c n d n g n
In Equation (A3), a n h n indicate the population of each genotype in the n-th generation (defined in Figure 2F), and α represents the survival probability.

Appendix A.2. Model with Divergent Alleles

For the model consisting of a nonallorecognition allele (X) and “M” types of allorecognition alleles (Yks), the characteristics of the X and Yk alleles are described in Figure 3. The following equations were then derived to calculate the n+1-st generation from the n-th generation on the basis of the data in Figure 3A,B, and Supplementary Material S3.
a n + 1 = α a n 2 + M a n b n 2 + M 2 · α b n 2 + 2 a n b n + M 1 b n 2 2 b n M 1 c n   b n + 1 = a n b n + M 1 a n c n 2 + α b n 2 + 2 a n b n + 2 M 1 b n 2 + M 1 2 b n c n 2 2 b n M 1 c n + M 1 2 a n c n + M b n c n 2 2 2 b n 2 M 1 c n   c n + 1 = M 1 b n c n + b n 2 2 b n M 1 c n + M 2 b n c n + M 1 c n 2 2 2 b n 2 M 1 c n
In Equation (A4), a n c n indicate the population of each genotype in the n-th generation (defined in Figure 3A). α and M represent the survival probability and the number of types of allorecognition alleles respectively.

Appendix A.3. Model with 2-Loci

For the model with 2 allorecognition loci, the following equations were derived to calculate the n+1-st generation from the n-th generation on the basis of Supplementary Material S4 (Figure S4A).
a n + 1 = α ( a n 2 + M b n 2 + L c n 2 4 + L M d n 2 4 4 P ) + a n M b n + L c n + L M d n 4 + L M b n c n 2 + M M 1 b n 2 + L L 1 c n 2 4 + M b n + L c n 8 L M d n + 2 a n + M b n + L c n 2 4 P L M d n + L M L M 1 d n 2 4 4 P b n + 1 = α ( b n 2 2 + L d n 2 2 4 P ) + a n b n + M 1 g n + 2 k n + M b n + L c n k n + L b n c n + a n l n + M 1 b n 2 + M b n + L c n g n 2 + L a n d n + L c n l n + M b n l n + d n b n + k n 4 + L M 1 2 a n + M b n + L c n h n + M d n g n 8 + L 2 c n d n 8 + 2 a n + M b n + L c n + L M d n 4 P L l n + 2 a n + 2 M b n + k n + L c n + M M 1 g n 2 4 P L d n + L M M 1 h n + 2 L M 1 d n 4 4 P L d n + L M 1 2 a n + M b n + L c n + 0.5 L M d n 4 2 P h n c n + 1 = α ( c n 2 2 + M d n 2 2 4 P ) + a n c n + L 1 e n + 2 i n + M b n + L c n i n + M b n c n + a n j n + L 1 c n 2 + M b n + L c n e n 2 + M a n d n + M b n j + L c n j n + d n c n + i n 4 + M L 1 2 a n + M b n + L c n f n + L d n e n 8 + M 2 b n d n 8 + 2 a n + M b n + L c n + L M d n 4 P M j n + 2 a n + M b n + 2 L c n + i n + L L 1 e n 2 4 P M d n + L M L 1 f n + 2 L M 1 d n 4 4 P M d n + M L 1 2 a n + M b n + L c n + 0.5 L M d n 4 2 P f n d n + 1 = 3 α 4 4 P d n 2 + 2 c n + 4 i n + M d n + 2 j n + L 1 2 e n + M f n b n + M 1 g n + 2 k n 8 + 2 b n + 4 k n + L d n + 2 l n + M 1 2 g n + L h n c n + L 1 e n + 2 i n 8 + d n + 2 j n + l n + L 1 f n + 2 o n + M 1 h n + 2 n n + L 1 M 1 m n 8 · 2 a n + M b n + L c n + d n + 2 j n 4 4 P + L 1 f n 8 2 P { 4 a n + 4 M b n + k n + 2 L c n + 2 L M d n + l n + M M 1 2 g n + L h n } + d n + 2 l n 4 4 P + M 1 h n 8 2 P { 4 a n + 2 M b n + 4 L c n + i n + 2 L M d n + j n + L L 1 2 e n + M f n } + d n 4 4 P [ 4 a n + d n 2 M b n + L c n + L M 1 L 1 f n + 2 o n + M 1 h n + 2 n n + 2 j n + l n + L 1 M 1 m n ] + L 1 M 1 m n 16 1 P + M 1 n n + L 1 o n 4 2 P 4 a n + 2 M b n + L c n + L M d n e n + 1 = α ( e n 2 2 + M f n 2 2 4 P ) + 2 c n + i n i n + M i n j n + L 1 c n + 2 i n e n + c n 2 + M c n j n + d n i n 2 + M L 1 e n j n + f n i n + L L 2 e n 2 2 + M c n d n + M L 1 c n f n + d n e n + L 1 e n f n 4 + M d n c n + 2 i n + L 1 d n e n + M f n j n + 2 c n + M d n + L 1 e n + 2 i n + M j n j n 4 P + M 2 d n + L 1 f n 2 4 P d n + M L 1 c n + 0.5 M d n + L 1 e n + 2 i n + M j n 2 2 P f n + M M L 1 2 1 4 2 P f n 2 f n + 1 = α f n 2 4 2 P + P c n + L 1 e n + 2 i n 4 + c n + 2 i n + M d n + 2 j n + l n d n + 2 j n 4 P + L 1 e n + M f n + o n + M M 1 0.5 h n + n n + 0.5 L 1 m n d n + 2 j n 4 P + 2 c n + 2 i n + L 1 2 e n + M f n l n 4 P + M L 1 1 d n + M 1 h n + 2 n n + 2 j n + l n + L 1 M 1 m n + 2 o n 4 2 P f n + L L 2 + L 1 2 M 1 L 1 4 2 P f n 2 + L 1 f n + 2 o n + M 1 h n + 2 n n 2 2 P + L 1 M 1 m n 4 1 P { c n + 2 i n + M 0.5 d n + j n + L 1 e n + 0.5 M f n } g n + 1 = α ( g n 2 2 + L h n 2 2 4 P ) + 2 b n + k n k n + L k n l n + M 1 b n + 2 k n g n + b n 2 + L b n l n + d n k n 2 + L M 1 g n l n + h n k n + M M 2 g n 2 2 + L b n d n + L M 1 b n h n + d n g n + M 1 g n h n 4 + L d n b n + 2 k n + M 1 d n g n + L h n l n + 2 c n + L d n + M 1 g n + 2 k n + L l n l n 4 P + L 2 d n + M 1 h n 2 4 P d n + L M 1 b n + 0.5 L d n + M 1 g n + 2 k n + L l n 2 2 P h n + L L M 1 2 1 4 2 P h n 2 h n + 1 = α h n 2 4 2 P + P b n + M 1 g n + 2 k n 4 + b n + 2 k n + L d n + 2 j n + l n d n + 2 l n 4 P + M 1 g n + L h n + n n + L L 1 0.5 f n + o n + 0.5 M 1 m n d n + 2 l n 4 P + 2 b n + 2 k n + M 1 2 g n + L h n j n 4 P + L M 1 1 d n + L 1 f n + 2 o n + 2 j n + l n + M 1 L 1 m n + 2 n n 4 2 P h n + M M 2 + L 1 M 1 2 M 1 4 2 P h n 2 + L 1 f n + 2 o n + M 1 h n + 2 n n 2 2 P + L 1 M 1 m n 4 1 P { b n + 2 k n + L 0.5 d n + l n + M 1 g n + 0.5 L h n } i n + 1 = α { c n 2 + L 1 e n 2 4 + i n 2 + M d n 2 + 4 j n 2 4 4 P + M L 1 f n 2 8 2 P } + c n i n + L 1 e n i n + L 1 c n e n 2 + L 1 L 2 4 e n 2 + M d n + L 1 f n + 2 j n c n + L 1 e n + 2 i n 8 + M d n i n + c n + M d n + L 1 e n + 2 i n + M 1 j n j n 4 P + M c n d n + L 1 d n e n + M f n j n 2 4 P + M M 1 d n + M L 1 f n d n 4 4 P + M L 1 2 c n + M 0.5 d n + j n + L 1 e n + 2 i n + L 1 M 1 + L 2 f n 8 2 P f n j n + 1 = α d n 2 + 4 j n 2 4 4 P + L 1 f n 2 8 2 P + P c n + L 1 e n + 2 i n 8 + l n 2 4 P + 2 L 1 o n + M 1 h n + 2 n n 8 2 P + L 1 M 1 m n 16 1 P { 2 c n + 4 i n + M d n + 2 j n + L 1 2 e n + M f n } + d n + 2 j n 4 4 P + L 1 f n 8 2 P [ 2 c n + 4 i n + M 1 d n + l n + L 1 2 e n + M f n + M 1 2 h n + 2 n n + L 1 M 1 f n + 2 o n + M 1 m n + 2 2 M 1 j n ] + d n 2 j n j n 2 4 P + L 1 d n f n f n 8 2 P k n + 1 = α b n 2 + M 1 g n 2 4 + k n 2 + L d n 2 + 4 l n 2 4 4 P + L M 1 h n 2 8 2 P + b n k n + M 1 g n k n + M 1 b n g n 2 + M 1 M 2 4 g n 2 + L d n + M 1 h n + 2 l n b n + M 1 g n + 2 k n 8 + L d n k n + b n + L d n + M 1 g n + 2 k n + L 1 l n l n 4 P + L b n d n + M 1 d n g n + L h n l n 2 4 P + L L 1 d n + L M 1 h n d n 4 4 P + L M 1 2 b n + L 0.5 d n + l n + M 1 g n + 2 k n + L 1 M 1 + M 2 h n 8 2 P h n l n + 1 = α d n 2 + 4 l n 2 4 4 P + M 1 h n 2 8 2 P + P b n + M 1 g n + 2 k n 8 + j n 2 4 P + 2 M 1 n n + L 1 f n + 2 o n 8 2 P + L 1 M 1 m n 16 1 P { 2 b n + 4 k n + L d n + 2 l n + M 1 2 g n + L h n } + d n + 2 l n 4 4 P + M 1 h n 8 2 P [ 2 b n + 4 k n + L 1 d n + j n + M 1 2 g n + L h n + L 1 2 f n + 2 o n + L 1 M 1 h n + 2 n n + L 1 m n + 2 2 L 1 l n ] + d n 2 l n l n 2 4 P + M 1 d n h n h n 8 2 P m n + 1 = P d n + 2 j n + l n 4 P + L 1 f n + 2 o n + M 1 h n + 2 n n 2 2 P + L M L M m n 4 1 P n n + 1 = P d n + 2 j n + l n 2 4 P + L 1 f n + 2 o n + M 2 h n + 2 n n 4 2 P + L 1 M 2 m n 8 1 P o n + 1 = P d n + 2 j n + l n 2 4 P + L 2 f n + 2 o n + M 1 h n + 2 n n 4 2 P + L 2 M 1 m n 8 1 P Here   P   is   defined   as ;   P = d n + L 1 f n + 2 o n + M 1 h n + 2 n n + 2 j n + l n + L 1 M 1 m n
In Equation (A5), a n o n indicate the population of each genotype in the n-th generation (defined in Figure 4B), and α represents the survival probability. L and M represent the numbers of types of allorecognition alleles in loci A and B respectively.

References

  1. Iwano, M.; Takayama, S. Self/non-self discrimination in angiosperm self-incompatibility. Curr. Opin. Plant Biol. 2012, 15, 78–83. [Google Scholar] [CrossRef]
  2. Fujii, S.; Kubo, K.; Takayama, S. Non-self- and self-recognition models in plant self-incompatibility. Nat. Plants 2016, 2, 1–9. [Google Scholar] [CrossRef]
  3. Zhang, D.; Li, Y.Y.; Zhao, X.; Zhang, C.; Liu, D.K.; Lan, S.; Yin, W.; Liu, Z.J. Molecular insights into self-incompatibility systems: From evolution to breeding. Plant Comm. 2024, 5, 100719. [Google Scholar] [CrossRef]
  4. Sawada, H.; Saito, T. Mechanisms of Sperm-Egg Interactions: What Ascidian Fertilization Research Has Taught Us. Cells 2022, 11, 2096. [Google Scholar] [CrossRef]
  5. Harada, Y.; Takagaki, Y.; Sunagawa, M.; Saito, T.; Yamada, L.; Taniguchi, H.; Shoguchi, E.; Sawada, H. Mechanism of Self-Sterility in a Hermaphroditic Chordate. Science 2008, 320, 548–550. [Google Scholar] [CrossRef] [PubMed]
  6. Sawada, H.; Saito, T.; Shimada, Y.; Nishimura, H. Fertilization mechanisms in hermaphroditic ascidians and nematodes: Common mechanisms with mammals and plants. Curr. Top. Dev. Biol. 2025, 162, 55–115. [Google Scholar]
  7. Schoen, D.J.; Baldwin, S.J. Self-incompatibility and the genetic architecture of inbreeding depression. New Phytol. 2023, 237, 1040–1049. [Google Scholar] [CrossRef]
  8. Morgan, T.H. The genetic and the physiological problems of self-sterility in Ciona. J. Exp. Zool. 1942, 90, 199–228. [Google Scholar] [CrossRef]
  9. Scofield, V.L.; Schlumpberger, J.M.; Weissman, I.L. Colony Specificity in the Colonial Tunicate Botryllus and the Origins of Vertebrate Immunity. Am. Zool. 1982, 22, 783–794. [Google Scholar] [CrossRef]
  10. Fischer, E.A.; Hardison, D. The timing of spawning and egg production as constraints on male mating success in a simultaneously hermaphroditic fish. Environ. Biol. Fishes 1987, 20, 301–310. [Google Scholar] [CrossRef]
  11. Shlesinger, T.; Shenkar, N.; Loya, Y. Gamete spawning of the ascidian Phallusia nigra in the Red Sea. Bull. Mar. Sci. 2017, 93, 959–960. [Google Scholar] [CrossRef]
  12. Sawada, H.; Yamamoto, K.; Yamaguchi, A.; Yamada, L.; Higuchi, A.; Nukaya, H.; Fukuoka, M.; Sakuma, T.; Yamamoto, T.; Sasakura, Y.; et al. Three multi-allelic gene pairs are responsible for self-sterility in the ascidian Ciona intestinalis. Sci. Rep. 2020, 10, 2514. [Google Scholar] [CrossRef] [PubMed]
  13. Saito, T.; Sawada, H. Fertilization of Ascidians: Gamete Interaction, Self/Nonself Recognition and Sperm Penetration of Egg Coat. Front. Cell Dev. Biol. 2022, 9, 827214. [Google Scholar] [CrossRef] [PubMed]
  14. Takayama, S.; Shimosato, H.; Shiba, H.; Funato, M.; Che, F.S.; Watanabe, M.; Iwano, M.; Isogai, A. Direct ligand-receptor complex interaction controls Brassica self-incompatibility. Nature 2001, 413, 534–538. [Google Scholar] [CrossRef]
  15. Bosch, M.; Franklin-Tong, V.E. Self-incompatibility in Papaver: Signaling to trigger PCD in incompatible pollen. J. Exp. Bot. 2008, 59, 481–490. [Google Scholar] [CrossRef]
  16. Saito, T.; Shiba, K.; Inaba, K.; Yamada, L.; Sawada, H. Self-incompatibility response induced by calcium increase in sperm of the ascidian Ciona intestinalis. Proc. Natl. Acad. Sci. USA 2012, 109, 4158–4162. [Google Scholar] [CrossRef]
  17. Lin, Z.; Eaves, D.J.; Sanchez-Moran, E.; Franklin, F.C.H.; Franklin-Tong, V.E. The Papaver rhoeas S determinants confer self-incompatibility to Arabidopsis thaliana in planta. Science 2015, 350, 684–687. [Google Scholar] [CrossRef]
  18. Paganini, J.; Pontarotti, P. Search for MHC/TCR-Like Systems in Living Organisms. Front. Immunol. 2021, 12, 635521. [Google Scholar] [CrossRef]
  19. Ema, M.; Okada, T.; Takahashi, M.; Uchiyama, M.; Kubo, H.; Moriyama, H.; Miyakawa, H.; Matsumoto, M. A self-marker-like protein governs hemocyte allorecognition in Halocynthia roretzi. Zool. Lett. 2019, 5, 34. [Google Scholar] [CrossRef] [PubMed]
  20. Sawada, H.; Tanaka, E.; Ban, S. Self/nonself recognition in ascidian fertilization: Vitelline coat protein HrVC70 is a candidate allorecognition molecule. Proc. Natl. Acad. Sci. USA 2004, 101, 15615–15620. [Google Scholar] [CrossRef]
  21. De Tomaso, A.W.; Nyholm, S.V.; Palmeri, K.J.; Ishizuka, K.J.; Ludington, W.B.; Mitchel, K.; Weissman, I.L. Isolation and Characterization of a Protochodate Histocompatibility Locus. Nature 2005, 438, 454–459. [Google Scholar] [CrossRef]
  22. Murabe, N.; Hoshi, M. Re-examination of Sibling Cross-Sterility in the Ascidian, Ciona intestinalis: Genetic Background of the Self-Sterility. Zool. Sci. 2002, 19, 527–538. [Google Scholar] [CrossRef]
  23. Newbigin, E.; Anderson, M.A.; Clarke, A.E. Gametophytic Self-Incompatibility Systems. Plant Cell. 1993, 5, 1315–1324. [Google Scholar] [CrossRef]
  24. Slatkin, M. Simulating genealogies of selected alleles in a population of variable size. Genet. Res. 2001, 78, 49–57. [Google Scholar] [CrossRef]
  25. Allison, A.C. Polymorphism and Natural Selection in Human Populations. Cold Spring Harb. Symp. Quant. Biol. 1964, 29, 137–149. [Google Scholar] [CrossRef]
  26. Piel, F.B.; Patil, A.P.; Howes, R.E.; Nyangiri, O.A.; Gething, P.W.; Willliams, T.N.; Weatherall, D.J.; Hay, S.I. Global distribution of the sickle cell gene and geographical confirmation of the malaria hypothesis. Nat. Commun. 2010, 1, 1–7. [Google Scholar] [CrossRef] [PubMed]
  27. Franklin, E.C. Inbreeding as a means of genetic improvements of loblolly pines. In Proceedings of the 10th Southern Forest Tree Improvement Conference, Houston, TX, USA, 17–19 June 1969. [Google Scholar]
  28. Arnqvist, G.; Rowe, L. Sexual Conflict; Princeton University Press: Princeton, NJ, USA, 2005. [Google Scholar]
  29. Hosken, D.J.; Archer, C.R.; Mank, J.E. Sexual conflict. Curr. Biol. 2019, 29, R451–R455. [Google Scholar] [CrossRef] [PubMed]
  30. Nydam, M.L.; De Tomaso, A.W. Creation and maintenance of variation in allorecognition loci: Molecular analysis in various model systems. Front. Immunol. 2011, 2, 79. [Google Scholar]
Figure 1. Model for the process of self-recognition gene emergence. (A) Definitions of the genotypes and alleles in the model. (B) Fertilization characteristics of genotypes in the model. X-genotype sperm can fertilize any egg (✔ Fertile). Y-genotype sperm can not fertilize XY-genotype eggs (Biomolecules 15 01397 i001 Infertile). (C) Calculation of the next generation in the model. (D) Flow and equations for the calculation of the n+1-st generation from the n-th generation.
Figure 1. Model for the process of self-recognition gene emergence. (A) Definitions of the genotypes and alleles in the model. (B) Fertilization characteristics of genotypes in the model. X-genotype sperm can fertilize any egg (✔ Fertile). Y-genotype sperm can not fertilize XY-genotype eggs (Biomolecules 15 01397 i001 Infertile). (C) Calculation of the next generation in the model. (D) Flow and equations for the calculation of the n+1-st generation from the n-th generation.
Biomolecules 15 01397 g001
Figure 2. Models for the divergence of the allorecognition locus. (A) Two possible routes for the creation of the new allele and allele definitions. In the upper route, mutation in the gene encoding the female recognition molecule occurs first, after which the gene encoding the male recognition molecule adapts to this change through its own mutation. In contrast, in the lower route, the male gene is mutated first. “i” in the egg represents any other allele; inhibition of fertilization by such an allele is not described here. (B) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, and YF. (C) Characteristics of genotypes in the model, comprising alleles X, Y, and YF. (D) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, and YM. (E) Characteristics of genotypes in the model, comprising alleles X, Y, and YM. (F) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, Z, and YF. (G) Characteristics of genotypes in the model, comprising alleles X, Y, Z, and YF.
Figure 2. Models for the divergence of the allorecognition locus. (A) Two possible routes for the creation of the new allele and allele definitions. In the upper route, mutation in the gene encoding the female recognition molecule occurs first, after which the gene encoding the male recognition molecule adapts to this change through its own mutation. In contrast, in the lower route, the male gene is mutated first. “i” in the egg represents any other allele; inhibition of fertilization by such an allele is not described here. (B) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, and YF. (C) Characteristics of genotypes in the model, comprising alleles X, Y, and YF. (D) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, and YM. (E) Characteristics of genotypes in the model, comprising alleles X, Y, and YM. (F) Definitions of the genotypes and alleles in the model, comprising alleles X, Y, Z, and YF. (G) Characteristics of genotypes in the model, comprising alleles X, Y, Z, and YF.
Biomolecules 15 01397 g002
Figure 3. Model with “M” types of allorecognition alleles. (A) Definitions of the genotypes and alleles in the model. (B) Characteristics of the genotypes in the model.
Figure 3. Model with “M” types of allorecognition alleles. (A) Definitions of the genotypes and alleles in the model. (B) Characteristics of the genotypes in the model.
Biomolecules 15 01397 g003
Figure 4. Model with two loci. (A) Recognition between gametes in the 2-locus model. In this model, the egg rejects only the sperm that has the same alleles as the egg at both loci (red cross), although sperm that share only one allele at one locus with the egg can fertilize the egg. (B) Definitions of the genotypes and alleles in the model.
Figure 4. Model with two loci. (A) Recognition between gametes in the 2-locus model. In this model, the egg rejects only the sperm that has the same alleles as the egg at both loci (red cross), although sperm that share only one allele at one locus with the egg can fertilize the egg. (B) Definitions of the genotypes and alleles in the model.
Biomolecules 15 01397 g004
Figure 5. Representative results of simulations of the process of allorecognition gene emergence. The proportions of XY individuals (vertical axis) were plotted against the number of generations (horizontal axis). (A) Results of the cases where the initial proportion of XY individuals was fixed at b 0 = 0.01   and α was varied between 0.01 and 1.1. (B) Results of the cases where α was fixed at 0.2 and the initial proportion of XY individuals was varied between 0.01 and 1. In all the cases, the proportions of XY individuals converged to the same value, approximately 0.37, after 5–10 generations.
Figure 5. Representative results of simulations of the process of allorecognition gene emergence. The proportions of XY individuals (vertical axis) were plotted against the number of generations (horizontal axis). (A) Results of the cases where the initial proportion of XY individuals was fixed at b 0 = 0.01   and α was varied between 0.01 and 1.1. (B) Results of the cases where α was fixed at 0.2 and the initial proportion of XY individuals was varied between 0.01 and 1. In all the cases, the proportions of XY individuals converged to the same value, approximately 0.37, after 5–10 generations.
Biomolecules 15 01397 g005
Figure 6. Results of the simulation of the allorecognition locus divergence process. (A,C,E) Representative simulation results. The proportions of each genotype (vertical axis) were plotted against the number of generations (horizontal axis). The initial proportions of each genotype are listed in each figure. α was fixed at 0.5. (B,D,F) Summarized results of the simulations. The proportions of each allele in the populations (vertical axis) were plotted against the value of α (horizontal axis). (A,B) Results of the model with alleles X, Y, and YF. (C,D) Results of the model with alleles X, Y, and YM. After multiple generations, only the Y allele was eliminated from the population when 1 > α . (E,F) Results of the model with alleles X, Y, Z and YF.
Figure 6. Results of the simulation of the allorecognition locus divergence process. (A,C,E) Representative simulation results. The proportions of each genotype (vertical axis) were plotted against the number of generations (horizontal axis). The initial proportions of each genotype are listed in each figure. α was fixed at 0.5. (B,D,F) Summarized results of the simulations. The proportions of each allele in the populations (vertical axis) were plotted against the value of α (horizontal axis). (A,B) Results of the model with alleles X, Y, and YF. (C,D) Results of the model with alleles X, Y, and YM. After multiple generations, only the Y allele was eliminated from the population when 1 > α . (E,F) Results of the model with alleles X, Y, Z and YF.
Biomolecules 15 01397 g006
Figure 7. Summarized results of the simulation using the model with divergent alleles. (A) The proportions of XX individuals that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M), up to M = 15; α varied between 0.01 and 0.8. (B) The proportions of X alleles that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 15; α was varied between 0.01 and 0.8. (C) Proportions of each genotype that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 50; α was fixed at 0.5. (D) Proportions of nonallorecognition allele X and the sum of allorecognition alleles (Yks) converging across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 50; α was fixed at 0.5.
Figure 7. Summarized results of the simulation using the model with divergent alleles. (A) The proportions of XX individuals that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M), up to M = 15; α varied between 0.01 and 0.8. (B) The proportions of X alleles that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 15; α was varied between 0.01 and 0.8. (C) Proportions of each genotype that converged across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 50; α was fixed at 0.5. (D) Proportions of nonallorecognition allele X and the sum of allorecognition alleles (Yks) converging across generations (vertical axis) plotted against the number of allorecognition alleles (M) up to M = 50; α was fixed at 0.5.
Biomolecules 15 01397 g007
Figure 8. Results of the simulation using the 2-locus model. (AD) Representative results of the simulation. The proportions of each genotype (vertical axis) were plotted against the number of generations (horizontal axis). L, M and the initial proportions are shown in each figure. α was fixed at 0.2. (B) Magnified chart of (A) showing the separation lower on the vertical axis. (E) Summarized result of the simulation. The ratios of X vs. Y at locus A and x vs. y at locus B after 5000 generations (vertical axis) were plotted against the value of α (horizontal axis). Here, Y and y represent the total proportions of allorecognition alleles at loci A and B, respectively. As shown in the left column (“balanced”), when both L = M and X 0 : Y 0 = x 0 : y 0 , X and Y at locus A and x and y at locus B coexist after many generations at α < 1 . When L = M was unbalanced, the proportion of nonallorecognition alleles at one locus (x) greatly decreased and that at the other locus (X) increased, after many generations (center column “ L M ”). A similar tendency was observed when X 0 : Y 0 = x 0 : y 0 was unbalanced (right column “ X 0 : Y 0 x 0 : y 0 ”).
Figure 8. Results of the simulation using the 2-locus model. (AD) Representative results of the simulation. The proportions of each genotype (vertical axis) were plotted against the number of generations (horizontal axis). L, M and the initial proportions are shown in each figure. α was fixed at 0.2. (B) Magnified chart of (A) showing the separation lower on the vertical axis. (E) Summarized result of the simulation. The ratios of X vs. Y at locus A and x vs. y at locus B after 5000 generations (vertical axis) were plotted against the value of α (horizontal axis). Here, Y and y represent the total proportions of allorecognition alleles at loci A and B, respectively. As shown in the left column (“balanced”), when both L = M and X 0 : Y 0 = x 0 : y 0 , X and Y at locus A and x and y at locus B coexist after many generations at α < 1 . When L = M was unbalanced, the proportion of nonallorecognition alleles at one locus (x) greatly decreased and that at the other locus (X) increased, after many generations (center column “ L M ”). A similar tendency was observed when X 0 : Y 0 = x 0 : y 0 was unbalanced (right column “ X 0 : Y 0 x 0 : y 0 ”).
Biomolecules 15 01397 g008
Table 1. Comparison of the convergence points from the simulations and analytical solutions (the model for the emergence process).
Table 1. Comparison of the convergence points from the simulations and analytical solutions (the model for the emergence process).
α Proportion of the Genotype XX
(Simulation After 1000 Generations)
Proportion of the Genotype XX
(Analytical Solution)
1.1 11
1 0.998021
0.9 0.858170.85817
0.7 0.714840.71484
0.5 0.631220.63122
0.3 0.570860.57086
0.1 0.521990.52199
0.01 0.502150.50215
Table 2. Comparison of the convergence points from the simulations and analytical solutions for the model with diversified alleles.
Table 2. Comparison of the convergence points from the simulations and analytical solutions for the model with diversified alleles.
M α Simulation After 1000 GenerationsAnalytical Solution
abcabc
5 0.1 0.21930.11080.02270.21930.11080.0227
5 0.5 0.31280.10640.01550.31280.10640.0155
5 0.8 0.47340.08980.00780.47380.08980.0077
10 0.1 0.14680.05090.00760.14650.05090.0077
10 0.5 0.21650.05260.00570.21580.05260.0057
10 0.8 0.35290.04990.00330.34960.050.0033
15 0.5 0.17590.0340.0030.17220.03390.003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Naruse, M.; Saito, T.; Matsumoto, M. Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization. Biomolecules 2025, 15, 1397. https://doi.org/10.3390/biom15101397

AMA Style

Naruse M, Saito T, Matsumoto M. Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization. Biomolecules. 2025; 15(10):1397. https://doi.org/10.3390/biom15101397

Chicago/Turabian Style

Naruse, Masahiro, Takako Saito, and Midori Matsumoto. 2025. "Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization" Biomolecules 15, no. 10: 1397. https://doi.org/10.3390/biom15101397

APA Style

Naruse, M., Saito, T., & Matsumoto, M. (2025). Population Genetics of the Emergence and Evolution of Allogenic Recognition During Fertilization. Biomolecules, 15(10), 1397. https://doi.org/10.3390/biom15101397

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop