Next Article in Journal
Reproductive and Pollination Characteristics of Three Understory Impatiens Species
Previous Article in Journal
Research on High-Accuracy, Lightweight, Superfast Model for Nitrogen Diagnosis and Plant Growth in Lettuce (Lactuca sativa L.)
Previous Article in Special Issue
Research Progress on the Genetic and Molecular Mechanisms of Cucumber (Cucumis sativus L.) Powdery Mildew Resistance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Evolutionary Forces Shaping Trans-Species Polymorphisms in Genus Cucumis

1
Engineering Laboratory of Genetic Improvement of Horticultural Crops of Shandong Province, College of Horticulture, Qingdao Agricultural University, Qingdao 266109, China
2
College of Horticulture, Nanjing Agricultural University, Nanjing 210095, China
3
College of Horticulture, Xinjiang Agricultural University, Urumqi 830052, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Horticulturae 2025, 11(5), 452; https://doi.org/10.3390/horticulturae11050452
Submission received: 12 March 2025 / Revised: 15 April 2025 / Accepted: 22 April 2025 / Published: 23 April 2025
(This article belongs to the Special Issue Germplasm and Breeding Innovations in Cucurbitaceous Crops)

Abstract

:
Trans-species polymorphisms (TSPs) are fundamental to preserving ancient genetic diversity, yet the evolutionary forces driving their long-term maintenance remain largely unexplored. Here, we investigate genome-wide TSPs in two Cucumis species, cucumber and melon, using whole-genome sequencing data from over 1200 accessions. A total of 5149 TSPs were identified, which predominantly located in genic and promoter regions. Coalescent analysis indicated that both gene flow and balancing selection have contributed to the persistence of these ancestral alleles. Moreover, among the 99 genes with shared coding-region polymorphisms, two genes that cluster by alleles rather than by species provide evidence of long-term balancing selection. These genes are involved in the immune and stress response processes with pleiotropic effects. Our findings elucidate the complex evolutionary forces driving TSPs in Cucumis, providing mechanistic insights into the maintenance of intraspecific genetic diversity in plants across deep evolutionary timescales.

1. Introduction

Trans-species polymorphisms (TSPs) are ancestral allelic variants that have been maintained across diverging lineages since their split from a common ancestor [1,2]. The persistence of these ancient polymorphisms over deep phylogenetic timescales challenges the theoretical expectation that genetic drift and directional selection would eliminate such variation over evolutionary time. This paradox raises fundamental questions about the mechanisms that allow specific alleles to evade both selective sweeps and purifying selection [3]. Studying TSPs thus offers critical insights into the balance between neutral evolutionary processes and adaptive forces, while also providing a unique window into the long-term maintenance and functional relevance of genetic diversity.
TSPs play a crucial role in adaptive evolution. In plants, for example, TSPs are often associated with resistance (R) genes and self-incompatibility (S) genes, where they are maintained through balancing selection, thus contributing to the species’ resistance to pathogens [4]. Moreover, TSPs are also involved in environmental adaptability. In animals, TSPs in major histocompatibility complex (MHC) genes, for instance, are maintained through negative frequency-dependent selection, allowing organisms to adapt to a constantly changing pathogen environment [5].
The evolutionary origins and maintenance of TSPs involve complex mechanisms. TSPs can easily be confounded with recurrent mutations or introgression, as both processes can lead to shared polymorphisms in closely related species [6]. Recurrent mutations primarily contribute to polymorphism sharing in recently diverged lineages, as the probability of identical mutation events decreases over longer evolutionary periods. Interspecific introgression, on the other hand, typically shows incomplete lineage sorting patterns, which are inconsistent with the retention of ancestral polymorphisms. These mutations tend to be progressively eroded by selection over macro-evolutionary timescales. In contrast, the long-term preservation of TSPs is generally attributed to balancing selection, which sustains allelic diversity through various selective forces, such as negative frequency-dependent selection, heterozygote advantage, and spatiotemporally fluctuating selection [1,7,8]. Balancing selection leaves distinctive genomic signatures, including local increases in nucleotide diversity that exceed neutral expectations, allele frequency spectra skewed toward intermediate frequencies, and accelerated linkage disequilibrium decay around selected loci [9]. In particular, orthologous sequences from different species under strong balancing selection cluster by allele, as tight linkage limits recombination-mediated allele segregation. This contrasts with neutral shared polymorphisms (e.g., recurrent mutations), where phylogenetic signals predominantly follow species divergence patterns. Classical examples, including the major histocompatibility complex in vertebrates (MHC) [8,10] and disease resistance genes in plants [4,11,12,13,14], underscore the adaptive significance of balancing selection. However, current understanding of such evolutionary paradox remains constrained by methodological limitations in distinguishing true TSPs from confounding signals and insufficient systematic analyses across diverse taxonomic groups [7].
The genus Cucumis offers a good system for investigating TSPs, particularly through its two economically significant and phylogenetically divergent species: cucumber (C. sativus) and melon (C. melo) [15]. These two species diverged approximately 4–14 million years ago (MYA) [16,17,18,19,20,21], providing an ideal evolutionary timescale to study ancestral polymorphism retention. Cucumber and melon originated in India and Africa, respectively [22,23]. Both species underwent a domestication process from wild ancestors to cultivated forms, resulting in significant alterations to fruit traits. In cucumber, key selection targets during domestication included fruit length, size, and the loss of bitterness [23]. In melon, substantial changes occurred in fruit size, characteristics, and flavor [22]. The domestication history of melon is particularly complex, involving three independent domestication events in Africa and India, which led to the formation of different cultivated subspecies, such as C. melo ssp. melo and C. melo ssp. agrestis [22]. During domestication, key genes in both cucumber and melon were subjected to strong selective pressures. For instance, in melon, genes associated with fruit quality, size, and bitterness, such as CmBi and CmBt, were fixed during domestication [22]. Similarly, in cucumber, the Bt gene, which is linked to fruit bitterness, also exhibited significant selective signals [23]. These genetic changes not only shaped the phenotypic traits of cultivated varieties but also profoundly influenced their genetic structure. Recent advances in cucurbit genomics have established Cucumis as a model for evolutionary genetics, with chromosome-level reference genomes and extensive population resequencing data available [22,23,24,25,26]. These resources have resolved the phylogenetic divergence of both cucumber and melon clearly. Despite their agricultural importance and well-characterized domestication histories, systematic investigations of TSPs remain absent in these species.
Here, we systematically identify genome-wide TSPs within the genus Cucumis, focusing on two representative species, cucumber and melon. Our analyses reveal that, despite their divergent demographic histories, these species retain a substantial number of shared ancestral polymorphisms. The persistence of these TSPs appears to be shaped by multiple evolutionary forces, with no single mechanism fully explaining their long-term maintenance. Furthermore, genomic regions under balancing selection were significantly enriched for loci associated with stress responses and developmental processes. Notably, we identified two candidate genes under long-term balancing selection that may contribute to adaptive variations and environmental fitness in both species.

2. Materials and Methods

2.1. Species Sampling

Sequencing data for 1280 core accessions of C. sativus L. and C. melo L. were downloaded from NCBI under BioProject PRJNA565104 and the Short Read Archive (SRA) under accession SRA056480.

2.2. Variant Calling

Raw reads from 115 cucumber accessions and 1165 melon accessions were aligned to the C. melo reference genome (DHL92 v4.0) [27] using BWA (version 0.7.15) [28] with default parameters. Duplicate reads were identified and marked using the MarkDuplicates tool from Picard (https://broadinstitute.github.io/picard/, accessed on 12 December 2024). The alignments were subsequently converted to BAM format using SAMtools (version 1.10) [29]. Genome-wide SNPs were called and filtered following the Best Practices pipeline by GATK v4.2.0.0 [30].

2.3. Population Genetic Analysis

Phylogenetic analysis was performed using the cucumber and melon accessions described above, with six watermelon accessions included as the outgroup. The watermelon accessions were retrieved from NCBI under BioProject PRJNA527790, with accession numbers SRR8751612 to SRR8751617. Four-fold degenerate sites were extracted from the whole-genome SNP dataset using an in-house script. These 4D SNPs were then used to construct a phylogenetic tree with IQ-TREE [31], utilizing the ‘-m MFP -B 1000’ parameters. Principal component analysis (PCA) was performed using EIGENSOFT (v6.0.2) [32]. ADMIXTURE (v1.3.0) [33] was employed to estimate the optimal number of genetic clusters (K) among the cucumber and melon accessions.

2.4. Identification and Statistic of Trans-Species Polymorphisms

To ensure the reliability of TSPs, the minor allele frequency (MAF) was initially restricted to >0.05, and the site missing rate was limited to <10% for both cucumber and melon groups, using VCFTOOLS (v0.1.16) [34]. Variants that passed these filters were required to be biallelic SNPs. Regions with high (DP ≥ 35) or low (DP ≤ 8) mean site read depths were excluded, and chromosomal endpoints and large gap regions were removed. Repetitive elements identified in the melon genome were classified using RepeatMasker (v4.0.8) [35], and SNPs located within these regions were discarded. Finally, snpEff (v4.3) [36] was employed to annotate the variations. The distribution of TSPs was calculated using an in-house script. Gene Ontology (GO) enrichment analysis was performed using ClusterProfiler (v4.0) [37].

2.5. Demographic Analysis

To investigate the demographic history of cucumber and melon, demographic histories were then inferred using the continuous-time sequential Markovian coalescent approximation implemented in fastsimcoal2 (v2.7) [38]. Orthologous four-fold degenerate sites were used to generate the joint site frequency spectrum (SFS) for the two species with easySFS.py (https://github.com/isaacovercast/easySFS, accessed on 10 December 2024). fsc27 was applied to estimate the folded SFS under two evolutionary models, including scenarios of population divergence with or without ancient gene flow. For each model, a spontaneous mutation rate of 7 × 10−9 per base pair per generation was applied [39]. Demographic parameters for each model were optimized through 100 independent runs, each incorporating 100,000 coalescent simulations (-n 100,000) and 40 iterations (-L 40) for likelihood maximization.

2.6. Introgression Analysis

Gene introgression was assessed using Dsuite (v0.5) [40] to calculate Patterson’s D (ABBA-BABA) and f4-ratio statistics. Introgression regions were detected using the subcommand ‘Dsuite Dinvestigate’ with default parameters.

2.7. Identification of Trans-Species Polymorphisms Under Long-Term Balancing Selection

To investigate trans-species polymorphisms within cucumber and melon, only SNPs located in orthologous genic regions shared by both species were considered. To minimize the potential influence of recurrent mutations among trans-species polymorphisms, genes with more than one TSP and at least one TSP in coding regions were selected. Finally, for each retained TSP, 500 base pairs (bp) of the flanking sequence were extracted using bcftools (v1.8) [41]. Multi-sequence alignment was performed using MAFFT (v7.407) [42], and the aligned sequences were then used to construct phylogenetic trees with IQ-TREE. TSP allele trees discordant with the species tree were selected as candidates under long-term balancing selection.

3. Results

3.1. Trans-Species Polymorphisms Between Cucumber and Melon

Whole-genome sequencing data were collected from 1280 accessions, including 115 cucumber accessions [23] and 1165 melon accessions [22] from around the world, and downloaded from public databases to identify TSPs. High-quality reads were aligned to the melon reference genome DHL92 (v4.0) [27] using a standardized pipeline. Variant calling and stringent filtering resulted in a total of 3,342,859 high-confidence biallelic SNPs (see Section 2).
To assess the reliability of SNP datasets, phylogenetic analysis between cucumber and melon was constructed. A phylogenetic tree constructed from four-fold degenerate SNPs clearly separated cucumber and melon into two distinct clades (Figure 1a). The melon clade resolved into six subclades (wild African, cultivated African, wild agrestis, cultivated agrestis, wild melo, and cultivated melo), and the cucumber clade divided into four subclades (Indian, Xishuangbanna, Eurasian, and East Asian), consistent with prior studies [22,23]. Principal Component Analysis (PCA) further confirmed genetic differentiation between and within species (Figure 1b). Principal Component Analysis (PCA) further confirmed the genetic differentiation between and within species (Figure 1b). The first two principal components distinctly separated the cucumber and melon groups, highlighting species-level divergence and validating the robustness of the dataset.
Following comprehensive filtering, a total of 5149 TSPs (identical allele pairs at specific orthologous loci) were identified between cucumber and melon (Figure 1c). These TSPs were unevenly distributed across all chromosomes, with a significant enrichment in genic and promoter regions (the 3000 bp upstream of the start codon), which together accounted for approximately 60% of the total TSPs (Figure 1d). In contrast, intergenic regions displayed a markedly lower density of TSPs (40%). Furthermore, these TSPs spanned 2816 genes, which were significantly enriched in biological processes such as the cellular response to nitrogen compounds (GO:1901699), the metabolic process of organic hydroxy compounds (GO:1901615), and gene silencing by RNA (GO:0031047) (Figure 1e, Table S1). Taken together, these results highlight the functional significance of TSPs in shaping key adaptive traits, contributing to the evolution of cucumber and melon.

3.2. Demographic History of the Two Species

A comprehensive understanding of demographic history is essential for accurately detecting true TSP signals in cucumber and melon. Given the prolonged divergence time and distinct chromosome numbers (Seven vs. Twelve), these two species are highly unlikely to have experienced recent introgression. Therefore, coalescent simulations were conducted under two distinct models (Figure 2a). Model selection was based on Akaike's Information Criterion (AIC), which revealed that the model incorporating ancient gene flow (M2) provided a better fit, as evidenced by a lower AIC value (36,024.143 vs. 49,999.916). Under model M2, cucumber and melon were estimated to have diverged from a common ancestor approximately 7.3 million years ago, with a 95% confidence interval (CI) of 6.5 MYA to 8.9 MYA, consistent with previous studies. Additionally, an ancient migration rate of 3 × 10−4 per generation was inferred between the two species. This evidence of ancient gene flow suggests that introgression segments may still persist in their genomes despite the deep evolutionary divergence between cucumber and melon. Furthermore, we identified introgression regions using the ABBA-BABA method (Figure 2b, Table S2). Notably, 250 of the 5149 TSPs were located within these introgression regions, indicating the potential influence of ancient introgression on observed TSPs.

3.3. Trans-Species Polymorphisms Under Balancing Selection

To characterize TSPs under balancing selection in cucumber and melon, we implemented a stepwise filtering strategy (Figure 3a). The analysis focused on 1996 TSPs within 12,626 single-copy orthologous genes, ensuring that the analyzed variants were evolutionarily conserved between the two species. By restricting the dataset to genic regions, which are more likely to be functionally relevant, a total of 824 TSPs were retained for further investigation. To strengthen the evidence for long-term balancing selection, we required the presence of at least two TSPs per genic region, with at least one located within a coding sequence. This filtering process resulted in 99 genes containing 259 TSPs. Consistent with expectations for ancestral polymorphisms maintained by selection, these genes exhibited higher Tajima’s D values compared to genome-wide averages (Figure 3b), indicating a deviation from neutral mutations. Gene ontology analyses revealed that these genes were primarily associated with plant growth and development, as well as response to environmental stress, and cell structure and function (Figure S1, Table S3).
If ancient balanced polymorphisms originated prior to species divergence, allele phylogenies might deviate from the species tree. This discordance could arise when multiple linked polymorphisms cause alleles to cluster by sequence similarity rather than species origin. We reconstructed haplotype phylogenies for all 99 genes with TSPs. Two genes exhibited allele clusters that crossed species boundaries, with each subclade containing haplotypes from distinct lineages (Figure 3c–e). In MELO3C007588 (CsaV3_6G043590), we detected two completely linked polymorphisms (r2 = 0.47): a nonsynonymous SNP and a 3′ UTR SNP (Figure 3c,d). This gene encodes a secreted peptide precursor with dual functions in both immune modulation and salt tolerance [43]. It was also priorly identified as a candidate resistance gene against Fusarium oxysporum infection in melon based on transcriptomic analysis [44]. Another gene, MELO3C035245 (CsaV3_6G006620), similarly showed allele-specific clustering patterns across species, driven by two nonsynonymous TSPs (r2 = 0.45) (Figure 3e). This gene is a leucine-rich repeat (LRR) receptor-like protein (RLP) belonging to a family known to regulate both developmental processes and defense responses, as evidenced by studies in Arabidopsis thaliana [45]. These results suggest that the two genes have undergone long-term balancing selection in cucumber and melon. Their functional versatility indicates that they could act as evolutionary toggles, maintaining pleiotropic fitness by resolving environmental trade-offs. However, further functional validation is required to unravel the specific molecular mechanisms underlying their pleiotropic effects.

4. Discussion

In this study, a substantial number of TSPs were identified using large-scale whole-genome sequencing data from globally representative accessions of cucumber and melon, which diverged over an extensive evolutionary timescale, providing an exceptional opportunity to explore TSPs. The results indicate that various mechanisms, such as introgression and balancing selection, contribute to the persistence of TSPs across cucumber and melon. Additionally, long-term balancing selection is likely to play a crucial role in the formation of adaptive traits throughout the evolutionary trajectories of both species.
Given the current challenges associated with identifying TSPs [2,9], a rigorous multi-step filtering approach was employed to identify whole-genome TSPs. Findings indicate that TSPs are predominantly situated in functionally significant regions and are closely linked to the formation of adaptive traits. The discovery of TSPs within these regions emphasizes their potential role in regulatory mechanisms that influence key plant traits. TSPs located in promoter regions often affect gene expression by altering transcription factor binding or affecting the chromatin state, which can influence adaptive responses to environmental challenges. Similarly, TSPs within untranslated regions (UTRs) may regulate post-transcriptional processes such as mRNA stability, translation efficiency, and localization, which are crucial for fine-tuning gene expression during stress responses, TSPs in intronic regions can play a role in alternative splicing, potentially generating protein isoforms that may be critical for functional flexibility, particularly in response to pathogens or environmental stressors [46,47]. A recent study on Midas cichlid fishes identified a transposon insertion in an intron region, which was associated with a color polymorphism under long-term balancing selection, reflecting the broader functional implications of TSPs in maintaining adaptive traits across generations [46].
The evolution and maintenance of TSPs can involve various mechanisms [1,47,48]. By evaluating demographic parameters, evidence of ancient gene flow was discovered in both cucumber and melon. Subsequently, introgression regions from cucumber into melon were identified, indicating that introgression contributes to the persistence of TSPs. The significance of balancing selection in maintaining genetic variation has garnered considerable attention in evolutionary biology [7,13]. However, detecting balancing selection signals across genomes poses significant challenges [9]. Consequently, relatively few genes have been identified as being under balancing selection, particularly in plants. To address this gap, a multi-step filtering approach was employed to detect TSPs under long-term balancing selection. After filtering, 99 genes exhibited evidence of being under balancing selection, characterized by higher Tajima’s D values that indicate balancing selection signals. These genes were found to be enriched in pathways related to plant growth and environmental stress responses, which is consistent with previous studies [5,10,11,12,14,47]. Finally, two genes associated with immune modulation and stress tolerance were identified as being under long-term balancing selection, potentially contributing to the adaptability of cucumber and melon in fluctuating environments. These findings suggest that the identified genes may play a crucial role in enhancing the environmental resilience of both species. However, while our analysis indicates functional similarities between the two species, a more detailed comparison of the functional metabolic pathways in cucumber and melon is required to determine any species-specific differences. This comparison could reveal whether these genes have divergent roles in the metabolic processes of each species. Further experimental validation, particularly through metabolic profiling and functional genomics, will be necessary to confirm these differences and fully elucidate the contribution of these genes to their adaptive traits.
The TSPs in Cucumis species are notably enriched in genes involved in immune responses and stress tolerance. This is consistent with findings from other plant studies, where TSPs associated with disease resistance genes or self-incompatibility systems are maintained by balancing selection due to their pleiotropic effects on survival in fluctuating environments [12,13,14]. For instance, ton of TSPs in genes like those regulating plant immune responses, including resistance to pathogens, is often a result of negative frequency-dependent selection, where rare alleles confer a survival advantage against evolving pathogens [49]. Similarly, the adaptive of TSPs in Cucumis species likely contributes to their environmental adaptability, as these polymorphisms may enhance the plant’s ability to withstand various biotic and abiotic stressors.
Our study provides a deeper understanding of the mechanisms maintaining TSPs in Cucumis species, which have significant implications for agricultural production and crop improvement. The TSPs identified in our study, particularly those associated with immune responses and stress tolerance, could inform the development of resistant and resilient crop varieties. For instance, TSPs in genes related to disease resistance could serve as genetic resources for breeding plants with enhanced pathogen resistance. This aligns with findings from other studies on disease resistance in plants, where balancing selection at resistance loci has been critical for maintaining genetic diversity and sustainable resistance to pathogens [4]. The potential to harness these ancient polymorphisms in breeding programs offers a novel approach to addressing challenges in crop production, particularly in the face of climate change and emerging plant diseases. Furthermore, the identification of TSPs in genes involved in stress tolerance may also contribute to improving crop resilience under diverse environmental conditions, such as drought, salinity, or temperature stress. However, the practical application of these findings in crop improvement will require further functional validation of the identified TSPs, as well as genetic mapping to identify the most promising targets for breeding.
However, the focus was placed on orthologous genes with two or more shared SNPs under long-term balancing selection in both species. This approach may have overlooked some non-coding TSPs and sites where only a single TSP was initially targeted by balancing selection prior to species divergence. In addition, to avoid false positives, our filtering strategy may be too stringent.
Loci that are shared between species are critical for understanding adaptive mechanisms in organisms. However, due to limited time, weak selection strength, and other factors [50,51,52], detecting these TSPs and understanding their evolutionary forces remain challenging. Therefore, there is a growing need for more powerful methods to detect TSPs. Moving forward, integrating functional validation, comparative epigenomics, and expanded population sampling will be essential to elucidate further the adaptive significance of these shared polymorphisms and their roles in the evolutionary dynamics of species.

5. Conclusions

This study presents a comprehensive investigation of TSPs in two phylogenetically distinct Cucumis species, cucumber (C. sativus L.) and melon (C. melo L.), utilizing large-scale whole-genome sequencing data. The analysis reveals that a significant number of ancestral polymorphisms are conserved across these species, predominantly within genic and promoter regions, which are integral to the maintenance of adaptive traits. The persistence of these TSPs is driven by complex evolutionary forces, including balancing selection and gene flow, which collectively contribute to the preservation of genetic diversity over extended evolutionary timescales. These findings provide valuable insights into the role of TSPs in the evolutionary dynamics of plant species, particularly their contribution to disease resistance and stress tolerance. Notably, candidate genes under long-term balancing selection were identified, which may play a pivotal role in enhancing the environmental adaptability of both cucumber and melon. These results open promising avenues for breeding programs focused on advancing sustainable agriculture, particularly in improving crop resilience to environmental stresses and pathogen pressures. Although significant insights have been gained regarding the evolutionary significance of TSPs, further research is required to functionally validate the identified polymorphisms and explore their practical applications in crop improvement. Ultimately, the study underscores the critical role of ancient genetic diversity in shaping the evolutionary trajectories of plants and provides a valuable resource for future efforts in genetic conservation and crop breeding strategies.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/horticulturae11050452/s1, Figure S1: GO enrichment analysis of TSPs under balancing selection. Table S1: GO enrichment analysis of TSPs; Table S2: Introgression regions from cucumber into melon; Table S3: GO enrichment analysis of TSPs under balancing selection.

Author Contributions

Conceptualization, X.S. and K.X.; methodology, X.S., Y.L. (Yueting Li) and K.X.; software, X.C. and Q.Y.; validation, Y.L. (Yueting Li), M.H. and T.Y.; formal analysis, X.S. and Y.L. (Yueting Li); investigation, Y.L. (Yi Liu), H.W. and S.C.; resources, S.C. and H.W.; data curation, X.S.; writing—original draft preparation, X.S.; writing—review and editing, X.S. and K.X.; visualization, X.S.; supervision, K.X.; project administration, K.X.; funding acquisition, K.X. and H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Natural Science Foundation of Shandong Province, grant number ZR2021QC075 (founder: K.X.); and the Major Science and Technology Special Project of Xinjiang Uygur Autonomous Region, grant number 2024A02007-1 (founder: H.W.).

Data Availability Statement

Data are contained within the article or Supplementary Materials.

Conflicts of Interest

Authors declare that they have no conflicting interests.

References

  1. Cornetti, L.; Fields, P.D.; Du Pasquier, L.; Ebert, D. Long-term balancing selection for pathogen resistance maintains trans-species polymorphisms in a planktonic crustacean. Nat. Commun. 2024, 15, 5333. [Google Scholar] [CrossRef] [PubMed]
  2. Murray, C.S.; Karram, M.; Bass, D.J.; Doceti, M.; Becker, D.; Nunez, J.C.; Ratan, A.; Bergland, A.O. Trans-Specific Polymorphisms Between Cryptic Daphnia Species Affect Fitness and Behavior. Mol. Ecol. 2025, 34, e17632. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, M.; Zhang, L.; Zhang, Z.; Li, M.; Wang, D.; Zhang, X.; Xi, Z.; Keefover-Ring, K.; Smart, L.B.; DiFazio, S.P. Phylogenomics of the genus Populus reveals extensive interspecific gene flow and balancing selection. New Phytol. 2020, 225, 1370–1382. [Google Scholar] [CrossRef] [PubMed]
  4. Roux, C.; Pauwels, M.; Ruggiero, M.-V.; Charlesworth, D.; Castric, V.; Vekemans, X. Recent and ancient signature of balancing selection around the S-locus in Arabidopsis halleri and A. lyrata. Mol. Biol. Evol. 2012, 30, 435–447. [Google Scholar] [CrossRef]
  5. Ségurel, L.; Thompson, E.E.; Flutre, T.; Lovstad, J.; Venkat, A.; Margulis, S.W.; Moyse, J.; Ross, S.; Gamble, K.; Sella, G. The ABO blood group is a trans-species polymorphism in primates. Proc. Natl. Acad. Sci. USA 2012, 109, 18493–18498. [Google Scholar] [CrossRef]
  6. Tellini, N.; De Chiara, M.; Mozzachiodi, S.; Tattini, L.; Vischioni, C.; Naumova, E.S.; Warringer, J.; Bergström, A.; Liti, G. Ancient and recent origins of shared polymorphisms in yeast. Nat. Ecol. Evol. 2024, 8, 761–776. [Google Scholar] [CrossRef]
  7. Bitarello, B.D.; Brandt, D.Y.; Meyer, D.; Andrés, A.M. Inferring balancing selection from genome-scale data. Genome Biol. Evol. 2023, 15, evad032. [Google Scholar] [CrossRef]
  8. Aqil, A.; Speidel, L.; Pavlidis, P.; Gokcumen, O. Balancing selection on genomic deletion polymorphisms in humans. Elife 2023, 12, e79111. [Google Scholar] [CrossRef]
  9. Fijarczyk, A.; Babik, W. Detecting balancing selection in genomes: Limits and prospects. Mol. Ecol. 2015, 24, 3529–3545. [Google Scholar] [CrossRef]
  10. Těšický, M.; Vinkler, M. Trans-species polymorphism in immune genes: General pattern or MHC-restricted phenomenon? J. Immunol. Res. 2015, 2015, 838035. [Google Scholar] [CrossRef] [PubMed]
  11. Bechsgaard, J.S.; Castric, V.; Charlesworth, D.; Vekemans, X.; Schierup, M.H. The transition to self-compatibility in Arabidopsis thaliana and evolution within S-haplotypes over 10 Myr. Mol. Biol. Evol. 2006, 23, 1741–1750. [Google Scholar] [CrossRef]
  12. Guo, Y.-L.; Zhao, X.; Lanz, C.; Weigel, D. Evolution of the S-locus region in Arabidopsis relatives. Plant Physiol. 2011, 157, 937–946. [Google Scholar] [CrossRef]
  13. Delph, L.F.; Kelly, J.K. On the importance of balancing selection in plants. New Phytol. 2014, 201, 45–56. [Google Scholar] [CrossRef]
  14. Le Veve, A.; Burghgraeve, N.; Genete, M.; Lepers-Blassiau, C.; Takou, M.; De Meaux, J.; Mable, B.K.; Durand, E.; Vekemans, X.; Castric, V. Long-term balancing selection and the genetic load linked to the self-incompatibility locus in Arabidopsis halleri and A. lyrata. Mol. Biol. Evol. 2023, 40, msad120. [Google Scholar] [CrossRef] [PubMed]
  15. Lija, M.; Beevy, S.S. A Review on the diversity of Melon. Plant Sci. Today 2021, 8, 995–1003. [Google Scholar] [CrossRef]
  16. Schaefer, H.; Heibl, C.; Renner, S.S. Gourds afloat: A dated phylogeny reveals an Asian origin of the gourd family (Cucurbitaceae) and numerous oversea dispersal events. Proc. R. Soc. B Biol. Sci. 2009, 276, 843–851. [Google Scholar] [CrossRef] [PubMed]
  17. Vanneste, K.; Baele, G.; Maere, S.; Van de Peer, Y. Analysis of 41 plant genomes supports a wave of successful genome duplications in association with the Cretaceous–Paleogene boundary. Genome Res. 2014, 24, 1334–1347. [Google Scholar] [CrossRef]
  18. Barrera-Redondo, J.; Ibarra-Laclette, E.; Vázquez-Lobo, A.; Gutierrez-Guerrero, Y.T.; de la Vega, G.S.; Piñero, D.; Montes-Hernández, S.; Lira-Saade, R.; Eguiarte, L.E. The genome of Cucurbita argyrosperma (silver-seed gourd) reveals faster rates of protein-coding gene and long noncoding RNA turnover and neofunctionalization within Cucurbita. Mol. Plant 2019, 12, 506–520. [Google Scholar] [CrossRef]
  19. Li, H.-T.; Yi, T.-S.; Gao, L.-M.; Ma, P.-F.; Zhang, T.; Yang, J.-B.; Gitzendanner, M.A.; Fritsch, P.W.; Cai, J.; Luo, Y. Origin of angiosperms and the puzzle of the Jurassic gap. Nat. Plants 2019, 5, 461–470. [Google Scholar] [CrossRef]
  20. Xie, D.; Xu, Y.; Wang, J.; Liu, W.; Zhou, Q.; Luo, S.; Huang, W.; He, X.; Li, Q.; Peng, Q. The wax gourd genomes offer insights into the genetic diversity and ancestral cucurbit karyotype. Nat. Commun. 2019, 10, 5158. [Google Scholar] [CrossRef]
  21. Guo, X.; Fang, D.; Sahu, S.K.; Yang, S.; Guang, X.; Folk, R.; Smith, S.A.; Chanderbali, A.S.; Chen, S.; Liu, M. Chloranthus genome provides insights into the early diversification of angiosperms. Nat. Commun. 2021, 12, 6930. [Google Scholar] [CrossRef] [PubMed]
  22. Zhao, G.; Lian, Q.; Zhang, Z.; Fu, Q.; He, Y.; Ma, S.; Ruggieri, V.; Monforte, A.J.; Wang, P.; Julca, I. A comprehensive genome variation map of melon identifies multiple domestication events and loci influencing agronomic traits. Nat. Genet. 2019, 51, 1607–1615. [Google Scholar] [CrossRef]
  23. Qi, J.; Liu, X.; Shen, D.; Miao, H.; Xie, B.; Li, X.; Zeng, P.; Wang, S.; Shang, Y.; Gu, X. A genomic variation map provides insights into the genetic basis of cucumber domestication and diversity. Nat. Genet. 2013, 45, 1510–1515. [Google Scholar] [CrossRef] [PubMed]
  24. Liu, S.; Gao, P.; Zhu, Q.; Zhu, Z.; Liu, H.; Wang, X.; Weng, Y.; Gao, M.; Luan, F. Resequencing of 297 melon accessions reveals the genomic history of improvement and loci related to fruit traits in melon. Plant Biotechnol. J. 2020, 18, 2545–2558. [Google Scholar] [CrossRef] [PubMed]
  25. Li, H.; Wang, S.; Chai, S.; Yang, Z.; Zhang, Q.; Xin, H.; Xu, Y.; Lin, S.; Chen, X.; Yao, Z. Graph-based pan-genome reveals structural and sequence variations related to agronomic traits and domestication in cucumber. Nat. Commun. 2022, 13, 682. [Google Scholar] [CrossRef]
  26. Oren, E.; Dafna, A.; Tzuri, G.; Halperin, I.; Isaacson, T.; Elkabetz, M.; Meir, A.; Saar, U.; Ohali, S.; La, T. Pan-genome and multi-parental framework for high-resolution trait dissection in melon (Cucumis melo). Plant J. 2022, 112, 1525–1542. [Google Scholar] [CrossRef]
  27. Castanera, R.; Ruggieri, V.; Pujol, M.; Garcia-Mas, J.; Casacuberta, J.M. An improved melon reference genome with single-molecule sequencing uncovers a recent burst of transposable elements with potential impact on genes. Front. Plant Sci. 2020, 10, 1815. [Google Scholar] [CrossRef]
  28. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  29. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R.; 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  30. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef]
  31. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; Von Haeseler, A.; Lanfear, R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  32. Price, A.L.; Patterson, N.J.; Plenge, R.M.; Weinblatt, M.E.; Shadick, N.A.; Reich, D. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006, 38, 904–909. [Google Scholar] [CrossRef] [PubMed]
  33. Alexander, D.H.; Novembre, J.; Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009, 19, 1655–1664. [Google Scholar] [CrossRef]
  34. Danecek, P.; Auton, A.; Abecasis, G.; Albers, C.A.; Banks, E.; DePristo, M.A.; Handsaker, R.E.; Lunter, G.; Marth, G.T.; Sherry, S.T. The variant call format and VCFtools. Bioinformatics 2011, 27, 2156–2158. [Google Scholar] [CrossRef] [PubMed]
  35. Chen, N. Using Repeat Masker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinform. 2004, 5, 4–10. [Google Scholar] [CrossRef] [PubMed]
  36. Cingolani, P. Variant annotation and functional prediction: SnpEff. In Variant Calling: Methods and Protocols; Springer: New York, NY, USA, 2012; pp. 289–314. [Google Scholar]
  37. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar] [CrossRef]
  38. Excoffier, L.; Marchi, N.; Marques, D.A.; Matthey-Doret, R.; Gouy, A.; Sousa, V.C. fastsimcoal2: Demographic inference under complex evolutionary scenarios. Bioinformatics 2021, 37, 4882–4885. [Google Scholar] [CrossRef]
  39. Ossowski, S.; Schneeberger, K.; Lucas-Lledó, J.I.; Warthmann, N.; Clark, R.M.; Shaw, R.G.; Weigel, D.; Lynch, M. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 2010, 327, 92–94. [Google Scholar] [CrossRef]
  40. Malinsky, M.; Matschiner, M.; Svardal, H. Dsuite-Fast D-statistics and related admixture evidence from VCF files. Mol. Ecol. Resour. 2021, 21, 584–595. [Google Scholar] [CrossRef]
  41. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M. Twelve years of SAMtools and BCFtools. Gigascience 2021, 10, giab008. [Google Scholar] [CrossRef]
  42. Katoh, K.; Standley, D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013, 30, 772–780. [Google Scholar] [CrossRef] [PubMed]
  43. Zhou, H.; Xiao, F.; Zheng, Y.; Liu, G.; Zhuang, Y.; Wang, Z.; Zhang, Y.; He, J.; Fu, C.; Lin, H. Pamp-induced secreted peptide 3 modulates salt tolerance through receptor-like kinase 7 in plants. Plant Cell 2022, 34, 927–944. [Google Scholar] [CrossRef] [PubMed]
  44. Yang, T.; Liu, J.; Li, X.; Amanullah, S.; Lu, X.; Zhang, M.; Zhang, Y.; Luan, F.; Liu, H.; Wang, X. Transcriptomic analysis of Fusarium oxysporum stress-induced pathosystem and screening of Fom-2 interaction factors in contrasted melon plants. Front. Plant Sci. 2022, 13, 961586. [Google Scholar] [CrossRef]
  45. Steidele, C.; Stam, R. Multi-omics approach highlights differences between RLP classes in Arabidopsis thaliana. BMC Genomics 2021, 22, 557. [Google Scholar] [CrossRef]
  46. Kratochwil, C.F.; Kautt, A.F.; Nater, A.; Härer, A.; Liang, Y.; Henning, F.; Meyer, A. An intronic transposon insertion associates with a trans-species color polymorphism in Midas cichlid fishes. Nat. Commun. 2022, 13, 296. [Google Scholar] [CrossRef] [PubMed]
  47. Wu, Q.; Han, T.-S.; Chen, X.; Chen, J.-F.; Zou, Y.-P.; Li, Z.-W.; Xu, Y.-C.; Guo, Y.-L. Long-term balancing selection contributes to adaptation in Arabidopsis and its relatives. Genome Biol. 2017, 18, 217. [Google Scholar] [CrossRef]
  48. Wang, B.; Mitchell-Olds, T. Balancing selection and trans-specific polymorphisms. Genome Biol. 2017, 18, 231. [Google Scholar] [CrossRef]
  49. Zhou, C.; Zhang, Q.; Chen, Y.; Huang, J.; Guo, Q.; Li, Y.; Wang, W.; Qiu, Y.; Guan, W.; Zhang, J. Balancing selection and wild gene pool contribute to resistance in global rice germplasm against planthopper. J. Integr. Plant Biol. 2021, 63, 1695–1711. [Google Scholar] [CrossRef]
  50. Fournier-Level, A.; Korte, A.; Cooper, M.D.; Nordborg, M.; Schmitt, J.; Wilczek, A.M. A map of local adaptation in Arabidopsis thaliana. Science 2011, 334, 86–89. [Google Scholar] [CrossRef]
  51. Hancock, A.M.; Brachi, B.; Faure, N.; Horton, M.W.; Jarymowycz, L.B.; Sperone, F.G.; Toomajian, C.; Roux, F.; Bergelson, J. Adaptation to climate across the Arabidopsis thaliana genome. Science 2011, 334, 83–86. [Google Scholar] [CrossRef]
  52. Li, Y.; Wang, N.; Feng, J.; Liu, Y.; Wang, H.; Deng, S.; Dong, W.; Liu, X.; Lv, B.; Sun, J. Enhancing genetic transformation efficiency in cucurbit crops through AtGRF5 overexpression: Mechanistic insights and applications. J. Integr. Plant Biol. 2025. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Genetic differentiation between cucumber and melon and identification of trans-species polymorphisms. (a) Phylogenetic tree of cucumber and melon based on 168,829 SNPs at fourfold degenerate sites. (b) Principal component analysis (PCA) plot illustrating the genetic differentiation between cucumber and melon groups based on the first and second principal components. (c) The chromosome density plot of TSPs. (d) Genomic distribution of trans-species polymorphisms. (e) GO enrichment analysis of trans-species polymorphisms.
Figure 1. Genetic differentiation between cucumber and melon and identification of trans-species polymorphisms. (a) Phylogenetic tree of cucumber and melon based on 168,829 SNPs at fourfold degenerate sites. (b) Principal component analysis (PCA) plot illustrating the genetic differentiation between cucumber and melon groups based on the first and second principal components. (c) The chromosome density plot of TSPs. (d) Genomic distribution of trans-species polymorphisms. (e) GO enrichment analysis of trans-species polymorphisms.
Horticulturae 11 00452 g001
Figure 2. Demographic history and introgression of cucumber and melon. (a) Demographic parameter estimates for two models of species divergence. Arrows indicate gene flow, and TDIV represents divergence time. (b) Genomic introgression regions from cucumber into melon.
Figure 2. Demographic history and introgression of cucumber and melon. (a) Demographic parameter estimates for two models of species divergence. Arrows indicate gene flow, and TDIV represents divergence time. (b) Genomic introgression regions from cucumber into melon.
Horticulturae 11 00452 g002
Figure 3. Identification of trans-species polymorphisms under long-term balancing selection. (a) SNP filtering pipeline to identify trans-species polymorphisms under long-term balancing selection. (b) Candidate TSPs exhibit higher Tajima’s D values compared to the genome-wide average. (c) Candidate genes and their TSP information. M indicate missense variant. *** indicates a statistically significant difference with a p value of less than 0.001. (d,e) Two candidate genes produce allelic trees rather than species trees, with each clade in the allelic trees displaying distinct genotypes. Light green bars represent the reference allele, while light orange bars represent the alternative allele. (d) Allelic tree of MELO3C007588. (e) Allelic tree of MELO3C035245.
Figure 3. Identification of trans-species polymorphisms under long-term balancing selection. (a) SNP filtering pipeline to identify trans-species polymorphisms under long-term balancing selection. (b) Candidate TSPs exhibit higher Tajima’s D values compared to the genome-wide average. (c) Candidate genes and their TSP information. M indicate missense variant. *** indicates a statistically significant difference with a p value of less than 0.001. (d,e) Two candidate genes produce allelic trees rather than species trees, with each clade in the allelic trees displaying distinct genotypes. Light green bars represent the reference allele, while light orange bars represent the alternative allele. (d) Allelic tree of MELO3C007588. (e) Allelic tree of MELO3C035245.
Horticulturae 11 00452 g003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Su, X.; Liu, Y.; Li, Y.; Hu, M.; Yu, T.; Yu, Q.; Wang, H.; Chen, X.; Chai, S.; Xu, K. Evolutionary Forces Shaping Trans-Species Polymorphisms in Genus Cucumis. Horticulturae 2025, 11, 452. https://doi.org/10.3390/horticulturae11050452

AMA Style

Su X, Liu Y, Li Y, Hu M, Yu T, Yu Q, Wang H, Chen X, Chai S, Xu K. Evolutionary Forces Shaping Trans-Species Polymorphisms in Genus Cucumis. Horticulturae. 2025; 11(5):452. https://doi.org/10.3390/horticulturae11050452

Chicago/Turabian Style

Su, Xiaofeng, Yi Liu, Yueting Li, Minghe Hu, Tao Yu, Qing Yu, Huilin Wang, Xinxiu Chen, Sen Chai, and Kuipeng Xu. 2025. "Evolutionary Forces Shaping Trans-Species Polymorphisms in Genus Cucumis" Horticulturae 11, no. 5: 452. https://doi.org/10.3390/horticulturae11050452

APA Style

Su, X., Liu, Y., Li, Y., Hu, M., Yu, T., Yu, Q., Wang, H., Chen, X., Chai, S., & Xu, K. (2025). Evolutionary Forces Shaping Trans-Species Polymorphisms in Genus Cucumis. Horticulturae, 11(5), 452. https://doi.org/10.3390/horticulturae11050452

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop