Next Article in Journal
Identification of Chalcone Synthase Genes and Their Responses to Salt and Cold Stress in Poncirus trifoliata
Previous Article in Journal
Valorization of Posidonia oceanica Biomass Extract as an Elicitor to Mitigate Aphid-Induced Stress in Sweet Pepper Plants
Previous Article in Special Issue
Research Advances in AP2/ERF Transcription Factors in Rice Growth and Development
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

DArTseq Analysis of Cypriot Common Bean Germplasm Unveils an Assortment of Unexplored Genetic Variability

by
Maria-Dimitra Tsolakidou
1,
Angelos C. Kyratzis
2 and
Nikolaos Nikoloudakis
1,*
1
Department of Agricultural Science, Biotechnology and Food Science, Cyprus University of Technology, Limassol 3036, Cyprus
2
Vegetable Crop Sector, Agricultural Research Institute, Ministry of Agriculture, Rural Development and Environment, Nicosia 1516, Cyprus
*
Author to whom correspondence should be addressed.
Plants 2025, 14(19), 3000; https://doi.org/10.3390/plants14193000
Submission received: 29 August 2025 / Revised: 22 September 2025 / Accepted: 26 September 2025 / Published: 28 September 2025
(This article belongs to the Special Issue Crop Germplasm Resources, Genomics, and Molecular Breeding)

Abstract

The common bean (Phaseolus vulgaris L.) is a globally significant crop with a well-documented domestication history and a critical role in food security. Here, we present the first whole-genome genetic characterization of Cypriot common bean landraces and heirloom varieties, originated from remote mountainous areas, using DArTseq-based SNP genotyping. A total of 13,215 high-quality SNPs were investigated from 50 genotypes, indicating a moderate linkage disequilibrium, high incidences of private and fixed alleles, and an overall low heterozygosity. The comparison of varieties indicated that dry and green bean varieties consisted of genetically distinct clusters, reinforced by phylogenetic and Bayesian structure analyses. A few of the varieties, such as “Gliastro” and “Stringless Blue Lake,” demonstrated an intense genetic diversity and/or inbreeding, whereas others showed evidence of admixture. The outcomes highlight the unique genetic make-up of the Cypriot bean germplasm and its worth as a tool for breeding, conservation, and upcoming genomic-assisted improvement programs.

1. Introduction

The common bean (Phaseolus vulgaris L.) is globally recognized as a cornerstone staple crop with a millennia-long contribution to the human diet and a remarkable diversity of types, cultivation schemes, and consumption patterns. In parallel, the adaptability of beans to diverse edaphoclimatic conditions, coupled with their high nutritional value (rich in protein and vitamins), has established them as a crucial crop for food security and food sovereignty, especially in developing countries [1]. A rise in bean demand during the last three decades has upped its production (60% increase) and harvested area (more than 30%), as recently reported (FAO, 2023). In terms of worldwide diffusion, the dry bean yield and harvested area (2023) were estimated to be around 28.7 million metric tons/38.4 million hectares (dry beans) and 1.4 million metric tons/134.2 thousand hectares (string beans), respectively. Nowadays, Asia (mainly India and China) is the continent that produces the most green and dried beans, accounting for 91.9% and 49.3% of global output, respectively [2]. India is leading the global production of these legumes (dry or string beans), which accounts for more than 43% of the total output.
Despite current trends, the common bean’s evolutionary drive began in the Americas, where independent domestication events shaped its genetic diversity. Specifically, P. vulgaris underwent domestication in two distinct regions: Mesoamerica and the mountainous Andes [3,4]. The beans’ cultivation was later introduced to the Old World (Europe) via conquistadors following Columbus’s voyaging explorations [5]. Germplasm dissemination has probably followed several rounds of introduction from both the Andean as well as the Mesoamerican germplasm, while local factors have formed heterologous populations (landraces) with distinct characteristics [3]. However, the genetic landscape of beans in Europe is not entirely defined by these initial introductions. The subsequent hybridization, adaptation, and gene flow among the Andean and Mesoamerican gene pools have profoundly shaped the genetic structure of European common bean populations, creating heirloom varieties [6].
This centuries-long presence in Europe has created a secondary tailormade domestication effect, resulting in a unique genetic, phenotyping, and agronomic diversity [7,8,9]. Across the several conclusions that were outlined from research focusing on the European germplasm, a key finding is the consistent observation of genotype-by-environment interactions, highlighting the adaptive plasticity of these landraces and their ability to thrive in diverse and/or specific environments. As a result, the conservation of bean landraces is of paramount importance in order to maintain biodiversity, and at the same time ensure future food security, since these gene pools represent a vast reservoir of genetic diversity, harboring valuable traits not found in modern cultivars [10,11,12]. In addition, such lines can further be used via Genomics-assisted breeding (GAB) and the utilization of landraces’ desirable genes [13,14]. In that sense, studies that genetically characterize the Phaseolus spp. germplasm align with that scope.
Several marker systems have been used until now to decipher the genomic complexity of beans. Among these, studies employing AFLPs [15,16], ISSRs [17], RAPD [18,19], RFLPs [20], and SSRs [21,22] stand out. Notwithstanding, many of these marker systems encompass several caveats, such as low density, genomic coverage, repeatability, or the inability to detect heterozygous genotypes. Moreover, the information-to-cost index is quite low in contrast to novel genotyping technologies such as Next Generation Sequencing (NGS), which nonetheless pose a significant cost for large-scale analyses [23].
Nowadays, many scientific efforts focus on DArTseq as an economic yet powerful marker analysis employing thousands of SNPs. DArTseq is characterized as an inexpensive, straightforward, and efficient genotyping-by-sequencing method, allowing for a genome-wide marker identification through a restriction enzyme-mediated genome complexity reduction and sequencing of the restriction fragments [24]. This procedure has generated significant insights for the common bean germplasm across countries like Brazil [25], Colombia [26], Kenya [27], Uganda [28], Ethiopia [29], Croacia [30], Turkey [31,32], and Zimbabwe [33].
Cyprus, the third largest Mediterranean island, has a relatively high floristic diversity, and it is considered as a biodiversity hotspot. This is mainly the result of its geographical position at the crossroads of the Eurasian and African continents, its size (9251 km2), varied geology and geomorphology, climatic conditions, habitat diversity, long history of human presence, and geographic isolation [34]. For these reasons, several studies focusing on the unique genetic distinctiveness of Cypriot landraces and local groups across crops have provided significant insights into their unique properties and exploitations [35,36,37,38,39].
For centuries, legumes, including common beans, have been appraised as a stable food in Cyprus. At the beginning of the previous century, common beans were mainly cultivated in the mountainous areas of Troodos (Marathasa and Pitsilia region) and to a lesser extent in plains [40]. Between 1930 and 1950, a rapid expansion of the cultivation area was recorded [41] reaching up to 1750 ha and ranking as the 2nd most important legume after broad beans (Vicia faba). The two major cultivation centers were Troodos massif (summer crop) and Morphou plain (winter and summer crop). Several local landraces have been cultivated by farmers, with a distinction of those adapted to high-altitude mountainous regions and those adapted to low-altitude plains [42]. Nowadays, these landraces are severely threatened with extinction due to their replacement with modern varieties in the plains, the aging of the rural population, and the abandonment of the agricultural land in the mountainous areas.
The current work aims to characterize the local Cypriot common bean landraces and heirloom variety collection originating from remote mountainous areas by employing a DArTseq analysis. This will enable us to recognize the origin and genetic diversity of the indigenous bean germplasm, while creating guidelines/datasets to mitigate genetic erosion. Finally, based on the SNP analysis among dry and green bean genotypes, possible cross-hybridizations among the two lineages may be identified, aiding future breeding/conservation efforts.

2. Materials and Methods

2.1. Germplasm Collection and Sample Preparation

Healthy young leaves from heirloom and commercial bean varieties were collected across Cypriot prefectures from farms in mountainous areas (Table 1; Figure 1). With the exception of variety “Moraleda”, all other genetic material is conserved by farmers (Farm Saved Seed), including the commercial variety “Stringless Blue Lake”. Except for “Moraleda” and “Stringless Blue Lake”, all other genetic material is exclusively cultivated in the Troodos massif, and it depicts high local adaptation. Samples were placed in zip bags and transferred to the laboratory using cool boxes. Tissues were frozen at −80 °C and subsequently freeze-dried for three days. Roughly 500 mg of dried material was pulverized (45″ at full speed) using a steel ball Mixer Mill MM 200 (Retsch, Haan, Germany). We established that 2 mL Eppendorf tubes (with rounded bases) and flash-freezing with liquid N2 before bead-beating had an improved efficiency in producing fine powder samples.

2.2. DNA Extraction

DNA was extracted using a modified CTAB (CTAB buffer: 2% Cetyl-Trimethyl-Ammonium-Bromide, 100 mM Tris-HCl, 1.4 M NaCl, 20 mM EDTA, 0.1% 2-mercaptoethanol, 1% polyvinyl pyrrolidone) procedure. For each freeze-dried, homogenized tissue (20 mg), 500 µL of CTAB extraction buffer (with 1 µL of 100 mM RNAse) was used. Samples were mixed/vortexed before transferring to a 60 °C water bath for 30 min and 10 min at room temperature. Following the incubation period, 500 µL of phenol–chloroform (1:1) was added, and samples were vortexed thoroughly before centrifugation for 5 min at 14,000× g. The aqueous upper phase was transferred to a new tube, and chloroform extraction (1 volume) followed. Samples were briefly vortexed before centrifugation (2 min at 14,000× g). The aqueous upper phase was transferred to a fresh tube, and DNA was precipitated by adding 1 volume of cold isopropanol. Following incubation (room temperature for 15 min), samples were centrifuged (14,000× g for 10 min). The supernatant was carefully decanted without disturbing the pellet and subsequently washed with 500 µL ice-cold 70% ethanol. Ethanol residuals were removed by centrifugation and drying for 2 min at 60 °C. The pellet was dissolved in 50 µL ddH2O, and DNA quality/quantity was assessed using 1% agarose gel electrophoresis and nanodrop spectrophotometry.

2.3. DArTseq Genotyping

DNA samples were sent to Diversity Arrays Technology Pty. Ltd. (Canberra, Australia) and genotyped using a ddRAD (double digest Restriction Associated DNA) genome complexity reduction method (DArTseq) as previously reported [43]. DArTseq libraries were constructed for the 50 genotypes using 50 ng of gDNA as a template. Briefly, DNA was digested using a combination of restriction enzymes (PstI-MseI), and generated fragments were ligated with enzyme-compatible adaptors before PCR amplification [44]. The DArTseq data were analyzed to produce dominant SilicoDArTs (presence or absence of the hybridized fragment) markers as well as SNPs. An e-value of 5 × 10−7 and a minimum sequence percent identity of 70% to a common bean genome reference (Common_bean_v12phy) were applied. Several statistical indexes were assessed and retained to establish the markers’ quality and information capacity: AvgCountRef: The sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Reference allele row; AvgCountSnp: The sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the SNP allele row; AvgPIC: The average of the polymorphism information content (PIC) of the Reference and SNP allele rows; FreqHets: The proportion of samples which score as heterozygous; FreqHomRef: The proportion of samples which score as homozygous for the Reference allele; FreqHomSnp: The proportion of samples which score as homozygous for the SNP allele; OneRatioRef: The proportion of samples for which the genotype score is “1” in the Reference allele row; OneRatioSnp: The proportion of samples for which the genotype score is “1” in the SNP allele row; PICRef: The polymorphism information content (PIC) for the Reference allele row; PICSnp: The polymorphism information content (PIC) for the SNP allele row (Figure 2).

2.4. DArTseq Filtering and Analyses

Several concurrent cut-off values were used in order to trim data of lower quality. A call rate threshold of 0.95 (for loci) and 0.8 (for genotypes) was applied, while monomorphic loci were removed. This generated a matrix consisting of 50 genotypes and 13,215 retained loci, and a genlight object map (genotypes vs. SNPs) was created (Figure 3). The DArTseq statistic indexes across genotypes and loci alongside SNP density (bin window of 200 Kbp) were calculated (library “lattice”) across the 11 bean chromosomes, and a Circos plot graph was produced using shinyCircos-V2.0 [45]. Furthermore, several genetic attributes (private/fixed alleles per pair of populations, minor allele frequency (MAF) for each locus, diversity SNPs indexes, departure from Hardy–Weinberg equilibrium (HWE), and AMOVA) were calculated to characterize the Cypriot bean germplasm. DArTseq data were furthermore converted to PHYLIP/VCF formats, and phylogenetic/linkage disequilibrium (LD) analyses were performed using IQ-TREE [46] and Tassel 5 [47], respectively. To avoid a biased influence of the low frequency SNPs on LD, linkage disequilibrium (LD) was calculated using only SNPs with a MAF > 0.05. Linkage disequilibrium was assessed as the correlation of allele frequencies (r2) for every pairwise SNP comparison across each chromosome, and, thereafter, the mean values specific to the chromosome and genome were calculated. Inter-chromosomal linkage disequilibrium (unlinked loci) was assessed across the entire genome. The LD decay was assessed by creating a plot of the r2 values versus the genetic distance of loci pairs (Mbp) for each chromosome, and a trend line representing the LD decay was computed using locally weighted polynomial regression. All the above indexes were generated by using the DArTseq dedicated “dartRverse”/“dartR” R libraries and Rstudio (RStudio 2023.12.1 Build 402) as reported by the developers (https://green-striped-gecko.github.io/dartR/ (assessed on 19 September 2025)). Finally, a Bayesian statistic-based method for assessing genetic kinship was used employing the Structure 2.3.4 suite [48]. The admixture model was chosen, and five independent repeats for each K value (ranging from 1 to 8) were performed. Each run included 100,000 iterations of the burning period and 500,000 post-burn simulations as previously reported [35].

3. Results

3.1. SNP Indexes Across the Cypriot Bean Panel

The DArTseq analysis resulted in 21,111 silico DArT markers, as well as 10,260 SNP-containing sequence tags. In the present work, only SNPs were taken into consideration for the genetic analyses due to the non-binary/codominant nature of these markers. SNP markers presented an average call rate of 0.89 (before data trimming), and a low percentage (less than 5%) of null alleles. Selecting a minimum call rate of 0.95 resulted in a matrix of 13,215 SNPs, since various deviations from the reference genome were identified in several sequence tags (Figure 3). The mean reproducibility score was determined at nearly 99.9%, with an absolute value of 100% for most of the SNP markers (96%), while the lowest percentage was estimated at 88%.
The SNP markers were aligned to the 11 chromosomes of the bean reference genome, and an even genome-wide distribution was confirmed. The average SNP marker number per chromosome was 863.27 (±207.41), ranging from 588 in chromosome 4 to 1197 SNPs detected in chromosome 2 (Figure 2), corresponding to one SNP in every 40,651 Kbp (genome assembly of P. vulgaris v2.0 indicates a size of 537.2 Mbp, https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000499845.2/ assessed on 15 February 2025). A high density of markers was mainly observed at the chromosomal ends, even though chromosome 9 had a uniform marker dispersion and chromosome six presented a lower SNP density at the initial end (chromosome 6 is acrocentric). The reduced marker density in the rest of the chromosomes corresponded to their broader centromeric region, as expected. DArTSeq statistical indexes indicated that most genotypes were classified as homozygous to the Phaseolus vulgaris reference genome, since the FreqHomRef average value was 0.674. In contrast, FreqHomSnp had a mean score of 0.303, meaning that several genotypes were found to be homozygous to the alternative SNP. Interestingly, there were several cases where heterozygous genotypes were detected at a low (but significant) percentage (0.025), indicating a possible intercrossing within the Cypriot beans’ germplasm. SNP markers were proven to be highly informative, having an average Polymorphism Information Content (PIC) value of 0.295 (Figure 2).

3.2. Linkage Disequilibrium Across Markers

Linkage disequilibrium estimates (as r2) and the degree of LD decay were determined among the 50 Cypriot accessions by using all possible pair combinations of genome-wide physically mapped SNP markers on the 11 bean chromosomes. A significant LD was observed for several SNP pairs within the germplasm (p < 0.001), and r2 values higher than 0.90 were observed at several sites (Figure 4A) within the chromosomes. Nonetheless, many loci had r2 values less than 0.5, proving that LD is not heavily widespread in the Cypriot bean landraces. Specifically, average r2 values across the 11 chromosomes varied from 0.225 (chromosome 5) to 0.423 (chromosome 9), while 25.56% (ranging from 21.46% in chromosome 5 to 28.93% in chromosome 4) of marker-pairs demonstrated significant a LD score (p < 0.001). Hence, moderate LD levels in the panel of Cypriot beans were confirmed. Significant discrepancies among linked (p < 0.001) and unlinked (p > 0.001) markers were also noted, since r2 values in unlinked SNP markers were below the 0.1 threshold across chromosomes (0.78 on average, ranging from 0.064 in chromosome 1 to 0.093 in chromosome 9). Moreover, the extent of LD decay was calculated using a non-linear regression of r2 values versus the physical chromosomal distance in the 50 Cypriot bean panel (Figure 4B). Linkage disequilibrium followed a gradual declining trend as physical distance increased, even though a sharp drop within the first 10 Mbp was not established. The germplasm studied had intermediate LD values, and most r2 records (correlations among locus pairs) were below the 0.5 threshold, while bean genotypes sustained a significant LD level up to a physical distance of 20 Mbp.

3.3. Alleles, HWE, and Genetic Diversity

A survey for private and fixed alleles across varietal combinations was performed (Table 2) and a Sankey Diagram depicting the complex relationships among genotypes was produced (Figure 5A). The minimum number of fixed alleles was estimated between the “Louva White” and “Louva Red” varieties (6 alleles), while the maximum was found among “Kreatofasoulo” and “Louva White” genotypes (5165 alleles). Since low genetic diversity is often linked to allele fixation, the moderate values detected for Cypriot beans indicate a possible decreased adaptability for some genotypes. “Gliastro” was also characterized by many private alleles (when compared with other cultivars) since, on average, approximately 5920 unique alleles were recorded. Interestingly, “Louva” types had a small number of private alleles when compared in pairs (Figure 5A), indicating that, regardless of the color (red or white) or pod length (normal or long), these heirloom varieties have significant genetic affiliations. Besides exploring the private/fixed alleles among varieties, a search for alleles having a minor frequency (less than 0.5%) was conducted, and the MAF (minor allele frequency) histogram (Figure 5B) revealed several fixed variants (SNPs) in the Cypriot common bean genome. It was established that, overall, the majority of the MAF was evenly distributed across the 0.01% and 0.5% threshold (mean 0.217%; median 0.2%; first quantile 0.010%; third quantile 0.380%) although in several varieties (“Gliastro”, “Koutsouli White”, “Kreatofasoulo”, and “Stringless Blue Lake”) most minor alleles had a frequency less than 0.2%.
Concurrently, a test for Hardy–Weinberg disequilibrium was conducted to detect loci where a nonrandom assortment of alleles occurred. Interestingly, across the loci and varieties studied, we identified 10 loci under Hardy–Weinberg disequilibrium (Figure 6A; Table 3). “Stringless Blue Lake” was the variety where most departures were noted (six), followed by “Gliastro” (two), while for “Koutsouli White” and “Kreatofasoulo” only one locus was found in HW disequilibrium. The loci identified were further screened against the NCBI nr database using blastn (https://blast.ncbi.nlm.nih.gov/Blast.cgi, assessed on 2 March 2025) in order to identify possible functions. Seven (out of the ten loci) corresponded to disease-resistant genes (XM_068642443.1, XM_068615419.1, XM_068628962.1) or crucial enzymes implicated in the primary or secondary metabolism (Table 3).
Across the 13,078 loci retained for the analyses, approximately 12% were found to be polymorphic. Moreover, low levels of observed (Ho), expected (He), and unbiased heterozygosity (uHe) were detected across populations (Figure 6B; Table 4). “Koutsouli white” and “Louva Red Long” were the varieties where the lowest values of heterozygosity were detected (Ho = 0.018/uHe = 0.021 and Ho = 0.025/uHe = 0.021, respectively), while the highest rates were determined for “Gliastro” and “Stringless Blue Lake” (Ho = 0.030/uHe = 0.065 and Ho = 0.025/uHe = 0.069, respectively). Moreover, inbreeding coefficient analysis (FIS) indicated that “Stringless Blue Lake” and “Gliastro” varieties had a high probability of inbreeding (FIS = 0.634 and FIS = 0.540, correspondingly), while negative scores were determined for “Louva Red Long” and “Moraleda” (FIS = −0.187 and FIS = −0.152, respectively), signifying an evasion of inbreeding. Most varieties, however, had near zero values, suggesting that the degree of inbreeding was equal to the expected value based on allelic rates of the germplasm.

3.4. Varieties Associations and Genetic Structure

Genetic affiliations among the germplasm were calculated by using an approximate likelihood-ratio test (aLRT) branch algorithm, and a dendrogram with bootstrap support was produced. Genotypes of the same variety were generally grouped within the same cluster, and these associations were further supported by high branch probability values (Figure 7A). Louva types (“Louva White”, “Louva Red”, and “Louva Long Red”) formed a separate cluster that had “Koutsouli White” as an outgroup. Interestingly, most of these genotypes have a dry bean use, signifying that genetic kinship corresponds to selected agronomical traits. “Gliastro”, “Stringless Blue Lake”, and “Kreatofasoulo” also exhibited high genetic affiliations that corresponded to their use (green beans), as these genotypes formed a second larger cluster. Interestingly, “Stringless Blue Lake” genotypes formed two discrete subgroups depending on their origin (farmers’ site), since these constitute genetic resources that are conserved “on-farm”. Despite the fact that “Moraleda” is classified as a green bean variety, it had an intermediate genetic markup, as its genotypes had an intermediate classification.
A Bayesian structure analysis was additionally performed to survey the distribution of genetic clusters and the admixture within the bean germplasm. The optimal score for the ad hoc test, based on the second-order rate of change in the probability function regarding the deltaK index, was determined for K = 2 (ΔK = 7395). In general, bean genotypes were rather genetically homogeneous, although several cases of admixture were noted (Figure 7B). Specifically, the majority of green bean genotypes were confidently clustered in the first genetic group (lilac color), with the exception of the “Moraleda” genotypes that were admixtured; on average, we observed a 0.42 similarity to cluster 1 and a 0.48 identity to cluster 2 (cyan color). On the contrary, dry bean genotypes were clustered to the second genetic group (Figure 7B). Minor admixtures (less than 0.02) were also noted for genotypes of “Koutsouli White” (one genotype), “Louva Red” (one genotype), and “Gliastro” (two genotypes), signifying that genetic flow may exist among different varieties during pollination. Finally, the analysis of molecular variance (AMOVA) determined that significant variation exists between varieties and between samples within varieties, but a significant within-sample variation was not revealed (Supplementary Figure S1), suggesting that population structure occurs.
Finally, a PCA plot analysis was conducted (Figure 8). The PCA plot of the Phaseolus genotypes also demonstrated that there are differences between genetic backgrounds within the groups. It appears that the first axis (59.9% of variation) was the main axis in differentiating the other genotypes. On the far right of the PCA plot, the clustering of “Louva” genotypes suggests a close genetic similarity, and “Koutsouli White” is positioned in a way that suggests it is somewhat related to their genetic background. On the left side of the PCA plot, “Gliastro” and “Kreatofasoulo” both fall into the upper quadrant and represent a different cluster than the “Louva” genotypes. There is significant overlap in the middle of the plot with “Moraleda”, as it is positioned between “Louva” and “Stringless Blue Lake” in the lower left quadrant, and yet the positioning suggests some evidence of a distinction between them. “Stringless Blue Lake” falls just below the upper left quadrant, and, despite being close to the other genotypes, it also appears to be somewhat isolated, representing a distant cluster.

4. Discussion

This work presents the first comprehensive DArTseq-based genetic analysis of the Cypriot common bean (Phaseolus vulgaris L.) gene pool that aims to study the unexploited genetic variation within heirloom dry and green bean genotypes originating from remote mountainous areas. Supported by a high-density SNP dataset that contained 13,215 markers, we were able to confirm the unique status of Cypriot beans and provide evidence of their possible value for conservation as well as breeding programs.
The common bean was brought to Europe shortly after the Columbian Exchange, when Spanish and Portuguese conquistadors planted Andean and Mesoamerican landraces in the Old World during the 16th century [3,5]. These original introductions from each gene pool presumably underwent a number of rounds of dissemination and local selection throughout Europe. As a result, European landraces, especially in the south, such as the Mediterranean, formed unique genotypic groups due to ecological adaptation, farmer preference, and seed exchange behavior [6,7], creating also the base for heirloom varieties. Landraces and traditional cultivars tend to have a higher genetic diversity compared to modern cultivars due to their broader allelic base and minimal exposure to intensive breeding bottlenecks. This, in fact, was noted in the current study (Cypriot panel), wherein heirloom varieties such as “Gliastro” and “Kreatofasoulo” exhibited high frequencies of private alleles and polymorphic loci. This is in accordance with genetic resources in Ethiopia [29] and Turkey [32], wherein landraces were reported to possess a higher intra-population diversity compared to elite lines. The low values of observed heterozygosity (0.018 to 0.030 for Ho) in the Cypriot bean collection are in line with the self-pollinating nature of P. vulgaris, but the prevalence of high minor allele frequencies and a Hardy–Weinberg disequilibrium in certain loci indicates that genetic diversity is still preserved in the traditional cultivars.
Moreover, the observed average PIC value (0.295) is in correlation with other reports utilizing the DArTseq in beans. For example, Gelaw et al. [29] noted a mean PIC value of 0.30 in the Ethiopian germplasm (the study involved 11,480 SNPs), while Wahome and colleagues [27] reported slightly lower values (0.27) in 222 Kenyan genotypes. These similarities attest that the Cypriot germplasm has moderate to high levels of polymorphism, despite being geographically isolated. The very high reproducibility (mean 99.9%) and extensive coverage of the genome confirm the validity of the marker system employed. Linkage disequilibrium (LD) analysis indicated that there were relatively moderate levels of LD across the Cypriot panel, from 0.225 (Chromosome 5) to 0.423 (Chromosome 9). These are at the range obtained for Ethiopian genotypes (mean r2~0.35) [29], but are in contrast to the steeper LD decay in more bottlenecked populations such as those in Turkey [32,50], where LD was observed in larger distances. The Cypriot panel showed a more gradual decline in LD without an abrupt drop in the first 10 Mbp, which might reflect a previous recombination and a low selection pressure, possibly due to conventional seed exchange and in situ conservation practices.
Structure analysis (STRUCTURE and phylogenetic clustering) identified two major genetic groups based mostly on usage type: “Koutsouli White”, “Stringless Blue Lake”, “Moraleda”, “Louva White”, “Louva Red”; and “Gliastro”, “Kreatofasoulo”, “Koutsouli White”, “Stringless Blue Lake”, and “Moraleda”. This division suggests simultaneous domestication routes or divergence post-introduction through utilization and selection. A similar divergence has been earlier documented by Wahome et al. [27], who reported clustering by agronomic use lines. Despite this dichotomy, admixture was detected—predominantly in “Moraleda” and also in some “Gliastro” genotypes—reflecting an ongoing gene flow between bean varieties in Cyprus, which is highly likely to be an artifact of traditional farming practices. Notably, the “Stringless Blue Lake” group, which clusters into two sub-clusters by farm origin, is an illustration of the study by Correa Abondano et al. [26], where they demonstrated how sampling strategy may reveal micro-structure within landrace groups. This supports the factoring in of farm-level management and local selection when interpreting bean genetic diversity.
As opposed to the greater differentiation between the Mesoamerican and Andean gene pools in other regions, the Cypriot genotypes are less stratified and more admixed. This may be due to centuries of Mediterranean germplasm flow and farmers’ seed exchange, which diluted the dichotomy of the gene pools. The relatively low observed heterozygosity agrees with expectations in self-pollinated crops and is in agreement with results from other studies in European landraces [30]. Even at low values of Ho, signs of private alleles (particularly in “Gliastro”) and Hardy–Weinberg disequilibrium loci (particularly in “Stringless Blue Lake”) reveal a hidden variability and potential selection of gene flow events. The detection of disease-resistance and stress-response loci among HWE-violating SNPs echoes reports by [27], who found high GWAS associations within corresponding genomic regions. These findings show the functional promise of the Cypriot bean genome for adaptive traits.
Finally, of particular interest, DArTseq use possessed an adequate resolution to distinguish highly similar landraces like “Louva Red” and “Louva White,” which were distinguished by a six-allele difference, while unrelated types exceeded 5000. Such resolution explains the usefulness of DArTseq in defining cohesion-by-descent relationships and guiding conservation strategies, as also emphasized by Correa Abondano and colleagues [26].

5. Conclusions

This study provided a valuable Mediterranean dataset to the global P. vulgaris DArTseq literature. It highlighted Cyprus as a secondary hotspot of diversity with a unique, admixed germplasm that is of high priority for in situ and ex situ conservation. The genetic differentiation that is apparent between dry and green beans, the richness of landraces’ diversity, and the traces of former admixture further highlight the importance of conserving these genetic materials. Future work should combine phenotypic and metabolomic information in order to better determine the adaptive worth and function of these landraces.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/plants14193000/s1. Figure S1: AMOVA plots depicting the distribution of the variance components (using permutation tests) at three hierarchical levels: (top) among samples, (middle) among samples, and (bottom) among populations. Black diamonds represent the estimated variance component for each level. Sizeable deviations of the observed values from the null distribution reflect hierarchical genetic structure at the given level.

Author Contributions

Conceptualization, N.N.; methodology, M.-D.T., N.N. and A.C.K.; validation, M.-D.T., N.N. and A.C.K.; investigation, M.-D.T., N.N. and A.C.K.; data curation, M.-D.T., A.C.K. and N.N.; writing—original draft, M.-D.T. and N.N.; writing—review and editing, M.-D.T., A.C.K. and N.N.; visualization, M.-D.T., N.N. and A.C.K.; supervision, N.N.; project administration, N.N.; All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by the BIOVALUE project “Fork-to-farm agent-based simulation tool augmenting BIOdiversity in the agri-food VALUE chain”, that received funding from the European Union’s Horizon 2020 research and innovation program under the Grant Agreement No. 101000499.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Uebersax, M.A.; Cichy, K.A.; Gomez, F.E.; Porch, T.G.; Heitholt, J.; Osorno, J.M.; Kamfwa, K.; Snapp, S.S.; Bales, S. Dry beans (Phaseolus vulgaris L.) as a vital component of sustainable agriculture and food security—A review. Legume Sci. 2023, 5, e155. [Google Scholar] [CrossRef]
  2. Tassoni, A.; Tedeschi, T.; Zurlini, C.; Cigognini, I.M.; Petrusan, J.-I.; Rodríguez, Ó.; Neri, S.; Celli, A.; Sisti, L.; Cinelli, P.; et al. State-of-the-Art Production Chains for Peas, Beans and Chickpeas—Valorization of Agro-Industrial Residues and Applications of Derived Extracts. Molecules 2020, 25, 1383. [Google Scholar] [CrossRef] [PubMed]
  3. Schmutz, J.; McClean, P.E.; Mamidi, S.; Wu, G.A.; Cannon, S.B.; Grimwood, J.; Jenkins, J.; Shu, S.; Song, Q.; Chavarro, C.; et al. A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. 2014, 46, 707–713. [Google Scholar] [CrossRef] [PubMed]
  4. Mamidi, S.; Rossi, M.; Moghaddam, S.M.; Annam, D.; Lee, R.; Papa, R.; McClean, P.E. Demographic factors shaped diversity in the two gene pools of wild common bean Phaseolus vulgaris L. Heredity 2013, 110, 267–276. [Google Scholar] [CrossRef] [PubMed]
  5. Myers, J.R.; Formiga, A.K.; Janick, J. Iconography of Beans and Related Legumes Following the Columbian Exchange. Front. Plant Sci. 2022, 13, 851029. [Google Scholar] [CrossRef]
  6. Angioi, S.A.; Rau, D.; Attene, G.; Nanni, L.; Bellucci, E.; Logozzo, G.; Negri, V.; Spagnoletti Zeuli, P.L.; Papa, R. Beans in Europe: Origin and structure of the European landraces of Phaseolus vulgaris L. Theor. Appl. Genet. 2010, 121, 829–843. [Google Scholar] [CrossRef]
  7. Arteaga, S.; Yabor, L.; Torres, J.; Solbes, E.; Muñoz, E.; Díez, M.J.; Vicente, O.; Boscaiu, M. Morphological and Agronomic Characterization of Spanish Landraces of Phaseolus vulgaris L. Agriculture 2019, 9, 149. [Google Scholar] [CrossRef]
  8. Lazarević, B.; Karažija, T.; Petek, M.; Poljak, M.; Šatović, Z.; Liber, Z.; Carović-Stanko, K. Gas exchange capacity of croatian common bean landraces (Phaseolus vulgaris L.) is related to their origin and growth type. J. Elem. 2018, 23, 1043–1055. [Google Scholar] [CrossRef]
  9. Marotti, I.; Dinelli, G.; Bregola, V.; Bosi, S. Nutritional characterization of Italian common bean landraces (Phaseolus vulgaris L.): Fatty acid profiles for “genotype-niche diversity” fingerprints. AIMS Agric. Food 2020, 5, 543–562. [Google Scholar] [CrossRef]
  10. Smýkal, P.; Coyne, C.J.; Ambrose, M.J.; Maxted, N.; Schaefer, H.; Blair, M.W.; Berger, J.; Greene, S.L.; Nelson, M.N.; Besharat, N.; et al. Legume Crops Phylogeny and Genetic Diversity for Science and Breeding. CRC Crit. Rev. Plant Sci. 2015, 34, 43–104. [Google Scholar] [CrossRef]
  11. Catarcione, G.; Paolacci, A.R.; Alicandri, E.; Gramiccia, E.; Taviani, P.; Rea, R.; Costanza, M.-D.T.; De Lorenzis, G.; Puccio, G.; Mercati, F.; et al. Genetic Diversity and Population Structure of Common Bean (Phaseolus vulgaris L.) Landraces in the Lazio Region of Italy. Plants 2023, 12, 744. [Google Scholar] [CrossRef]
  12. Rendón-Anaya, M.; Montero-Vargas, J.M.; Saburido-Álvarez, S.; Vlasova, A.; Capella-Gutierrez, S.; Ordaz-Ortiz, J.J.; Aguilar, O.M.; Vianello-Brondani, R.P.; Santalla, M.; Delaye, L.; et al. Genomic history of the origin and domestication of common bean unveils its closest sister species. Genome Biol. 2017, 18, 60. [Google Scholar] [CrossRef]
  13. Assefa, T.; Assibi Mahama, A.; Brown, A.V.; Cannon, E.K.S.; Rubyogo, J.C.; Rao, I.M.; Blair, M.W.; Cannon, S.B. A review of breeding objectives, genomic resources, and marker-assisted methods in common bean (Phaseolus vulgaris L.). Mol. Breed. 2019, 39, 20. [Google Scholar] [CrossRef]
  14. Varshney, R.K.; Bohra, A.; Yu, J.; Graner, A.; Zhang, Q.; Sorrells, M.E. Designing Future Crops: Genomics-Assisted Breeding Comes of Age. Trends Plant Sci. 2021, 26, 631–649. [Google Scholar] [CrossRef] [PubMed]
  15. Boczkowska, M.; Bulińska-Radomska, Z.; Nowosielski, J. AFLP analysis of genetic diversity in five accessions of Polish runner bean (Phaseolus coccineus L.). Genet. Resour. Crop Evol. 2012, 59, 473–478. [Google Scholar] [CrossRef]
  16. Andrade, F.A.D.; Gonçalves, L.S.A.; Miglioranza, É.; Ruas, C.D.F.; Ruas, P.M.; Takahashi, L.S.A. AFLP analysis of genetic diversity in determinate and indeterminate snap bean accessions. Acta Sci. Agron. 2016, 38, 29–34. [Google Scholar] [CrossRef]
  17. Cabral, P.D.S.; de Souza, L.C.; da Costa, G.F.; Silva, F.H.L.; Soares, T.C.B. Investigation of the genetic diversity of common bean (Phaseolus vulgaris) cultivars using molecular markers. Genet. Mol. Res. 2018, 17, gmr18106. [Google Scholar] [CrossRef]
  18. Mavromatis, A.G.; Arvanitoyannis, I.S.; Korkovelos, A.E.; Giakountis, A.; Chatzitheodorou, V.A.; Goulas, C.K. Genetic diversity among common bean (Phaseolus vulgaris L.) Greek landraces and commercial cultivars: Nutritional components, RAPD and morphological markers. Span. J. Agric. Res. 2010, 8, 986–994. [Google Scholar] [CrossRef]
  19. Ligarreto, M.G.A.; Martínez, W.O. Identification of the variability of a common bean collection through morphological, Physiological, Biochemical, and molecular relationships. Agron. Colomb. 2014, 32, 159–169. [Google Scholar] [CrossRef]
  20. Velasquez, V.L.B.; Gepts, P. RFLP diversity of common bean (Phaseolus vulgaris) in its centres of origin. Genome 1994, 37, 256–263. [Google Scholar] [CrossRef]
  21. Scarano, D.; Rubio, F.; Ruiz, J.J.; Rao, R.; Corrado, G. Morphological and genetic diversity among and within common bean (Phaseolus vulgaris L.) landraces from the Campania region (Southern Italy). Sci. Hortic. 2014, 180, 72–78. [Google Scholar] [CrossRef]
  22. Valentini, G.; Gonçalves-Vidigal, M.C.; Elias, J.C.F.; Moiana, L.D.; Mindo, N.N.A. Population Structure and Genetic Diversity of Common Bean Accessions from Brazil. Plant Mol. Biol. Rep. 2018, 36, 897–906. [Google Scholar] [CrossRef]
  23. Bhat, J.A.; Ali, S.; Salgotra, R.K.; Mir, Z.A.; Dutta, S.; Jadon, V.; Tyagi, A.; Mushtaq, M.; Jain, N.; Singh, P.K.; et al. Genomic Selection in the Era of Next Generation Sequencing for Complex Traits in Plant Breeding. Front. Genet. 2016, 7, 221. [Google Scholar] [CrossRef]
  24. Edet, O.U.; Gorafi, Y.S.A.; Nasuda, S.; Tsujimoto, H. DArTseq-based analysis of genomic relationships among species of tribe Triticeae. Sci. Rep. 2018, 8, 16397. [Google Scholar] [CrossRef] [PubMed]
  25. Resende, R.T.; de Resende, M.D.V.; Azevedo, C.F.; e Silva, F.F.; Melo, L.C.; Pereira, H.S.; Souza, T.L.P.O.; Valdisser, P.A.M.R.; Brondani, C.; Vianello, R.P. Genome-wide association and Regional Heritability Mapping of plant architecture, lodging and productivity in Phaseolus vulgaris. G3 Genes Genomes Genet. 2018, 8, 2841–2854. [Google Scholar] [CrossRef] [PubMed]
  26. Correa Abondano, M.; Ospina, J.A.; Wenzl, P.; Carvajal-Yepes, M. Sampling strategies for genotyping common bean (Phaseolus vulgaris L.) Genebank accessions with DArTseq: A comparison of single plants, multiple plants, and DNA pools. Front. Plant Sci. 2024, 15, 1338332. [Google Scholar] [CrossRef] [PubMed]
  27. Wahome, S.W.; Githiri, M.S.; Kinyanjui, P.K.; Toili, M.E.M.; Angenon, G. Genome-wide association study of variation in cooking time among common bean (Phaseolus vulgaris L.) accessions using Diversity Arrays Technology markers. Legume Sci. 2023, 5, e184. [Google Scholar] [CrossRef]
  28. Amongi, W.; Nkalubo, S.T.; Ochwo-Ssemakula, M.; Badji, A.; Dramadri, I.O.; Odongo, T.L.; Nuwamanya, E.; Tukamuhabwe, P.; Izquierdo, P.; Cichy, K.; et al. Genetic clustering, and diversity of African panel of released common bean genotypes and breeding lines. Genet. Resour. Crop Evol. 2023, 70, 2063–2076. [Google Scholar] [CrossRef]
  29. Gelaw, Y.M.; Eleblu, J.S.Y.; Ofori, K.; Fenta, B.A.; Mukankusi, C.; Emam, E.A.; Offei, S. High-density DArTSeq SNP markers revealed wide genetic diversity and structured population in common bean (Phaseolus vulgaris L.) germplasm in Ethiopia. Mol. Biol. Rep. 2023, 50, 6739–6751. [Google Scholar] [CrossRef]
  30. Gunjača, J.; Carović-Stanko, K.; Lazarević, B.; Vidak, M.; Petek, M.; Liber, Z.; Šatović, Z. Genome-Wide Association Studies of Mineral Content in Common Bean. Front. Plant Sci. 2021, 12, 636484. [Google Scholar] [CrossRef]
  31. Altaf, M.-D.T.; Nadeem, M.A.; Ayten, S.; Aksoy, E.; Sönmez, F.; Çiftçi, V.; Baloch, F.S. Unveiling Genetic Basis Associated with Manganese Content in Turkish Common Bean (Phaseolus vulgaris L.) Germplasm Through a Genome-Wide Association Study. Plant Breed. 2024, 144, 285–309. [Google Scholar] [CrossRef]
  32. Nadeem, M.A.; Habyarimana, E.; Çiftçi, V.; Nawaz, M.A.; Karaköy, T.; Comertpay, G.; Shahid, M.Q.; Hatipoǧlu, R.; Yeken, M.Z.; Ali, F.; et al. Characterization of genetic diversity in Turkish common bean gene pool using phenotypic and whole-genome DArTseq-generated silicoDArT marker information. PLoS ONE 2018, 13, e0205363. [Google Scholar] [CrossRef] [PubMed]
  33. Mwale, S.E.; Shimelis, H.; Abincha, W.; Nkhata, W.; Sefasi, A.; Mashilo, J. Genetic differentiation of a southern Africa tepary bean (Phaseolus acutifolius A Gray) germplasm collection using high-density DArTseq SNP markers. PLoS ONE 2023, 18, e0295773. [Google Scholar] [CrossRef] [PubMed]
  34. Charalambous, I.; Ioannou, N.; Kyratzis, A.C.; Kourtellarides, D.; Hagidimitriou, M.; Nikoloudakis, N. Genome Size Variation across a Cypriot Fabeae Tribe Germplasm Collection. Plants 2023, 12, 1469. [Google Scholar] [CrossRef] [PubMed]
  35. Athinodorou, F.; Foukas, P.; Tsaniklidis, G.; Kotsiras, A.; Chrysargyris, A.; Delis, C.; Kyratzis, A.C.; Tzortzakis, N.; Nikoloudakis, N. Morphological diversity, genetic characterization, and phytochemical assessment of the cypriot tomato germplasm. Plants 2021, 10, 1698. [Google Scholar] [CrossRef]
  36. Kyriacou, M.C.; Ioannidou, S.; Nikoloudakis, N.; Seraphides, N.; Papayiannis, L.C.; Kyratzis, A.C. Physicochemical characterization and trait stability in a genetically diverse ex situ collection of pomegranate (Punica granatum L.) germplasm from Cyprus. Sci. Hortic. 2020, 263, 109116. [Google Scholar] [CrossRef]
  37. Grigoriou, A.; Tsaniklidis, G.; Hagidimitriou, M.; Nikoloudakis, N. The cypriot indigenous grapevine germplasm is a multi-clonal varietal mixture. Sci. Hortic. 2020, 9, 1034. [Google Scholar] [CrossRef]
  38. Kyratzis, A.C.C.; Nikoloudakis, N.; Katsiotis, A. Genetic variability in landraces populations and the risk to lose genetic variation. The example of landrace “Kyperounda” and its implications for ex situ conservation. PLoS ONE 2019, 14, e0224255. [Google Scholar] [CrossRef]
  39. Anestiadou, K.; Nikoloudakis, N.; Hagidimitriou, M.; Katsiotis, A. Monumental olive trees of Cyprus contributed to the establishment of the contemporary olive germplasm. PLoS ONE 2017, 12, e0187697. [Google Scholar] [CrossRef]
  40. Bevan, W. Notes on Agriculture in Cyprus and Its Products; Hazell, Watson & Viney, Ltd.: London, UK, 1919; Volume 17, pp. 1–105. [Google Scholar]
  41. Christodoulou, D. The World Land Use Survey. Monograph 2: Cyprus. The Evolution of the Rural Land Use Pattern in Cyprus, 1st ed.; Geographical Publications Limited: Cornwall, UK, 1959. [Google Scholar]
  42. Hadjichristodoulou, A. Evaluation of Common Bean (Phaseolus vulgaris L.) Varieties; Technical Bulletin; Agricultural Research Institute, Ministry of Agriculture, Natural Resources and Environment: Lefkosia, Cyprus, 1992; Volume 144. [Google Scholar]
  43. Jaccoud, D.; Peng, K.; Feinstein, D.; Kilian, A. Diversity arrays: A solid state technology for sequence information independent genotyping. Nucleic Acids Res. 2001, 29, e25. [Google Scholar] [CrossRef]
  44. Raman, H.; Raman, R.; Kilian, A.; Detering, F.; Carling, J.; Coombes, N.; Diffey, S.; Kadkol, G.; Edwards, D.; McCully, M.; et al. Genome-Wide Delineation of Natural Variation for Pod Shatter Resistance in Brassica napus. PLoS ONE 2014, 9, e101673. [Google Scholar] [CrossRef]
  45. Wang, Y.; Jia, L.; Tian, G.; Dong, Y.; Zhang, X.; Zhou, Z.; Luo, X.; Li, Y.; Yao, W. shinyCircos-V2.0: Leveraging the creation of Circos plot with enhanced usability and advanced features. iMeta 2023, 2, e109. [Google Scholar] [CrossRef] [PubMed]
  46. Trifinopoulos, J.; Nguyen, L.-T.L.-T.; von Haeseler, A.; Minh, B.Q. W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 2016, 44, W232–W235. [Google Scholar] [CrossRef]
  47. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef]
  48. Falush, D.; Stephens, M.; Pritchard, J.K. Inference of population structure using multilocus genotype data: Dominant markers and null alleles. Mol. Ecol. Notes 2007, 7, 574–578. [Google Scholar] [CrossRef]
  49. Wigginton, J.E.; Cutler, D.J.; Abecasis, G.R. A Note on Exact Tests of Hardy-Weinberg Equilibrium. Am. J. Hum. Genet. 2005, 76, 887–893. [Google Scholar] [CrossRef]
  50. Nemli, S. Genetic diversity of Turkish common bean landraces assessed by DArTseq. Turk. J. Agric. For. 2017, 41, 472–481. [Google Scholar]
Figure 1. Collection sites of the Cypriot bean germplasm in the Troodos region. Precice coordinates are indicated with red dots. Base map © OpenStreetMap contributors (www.openstreetmap.org), licensed under ODbL.
Figure 1. Collection sites of the Cypriot bean germplasm in the Troodos region. Precice coordinates are indicated with red dots. Base map © OpenStreetMap contributors (www.openstreetmap.org), licensed under ODbL.
Plants 14 03000 g001
Figure 2. Circos depicting statistical indexes across the Cypriot germplasm collection for DArTseq markers. A. SNP density using a 200 Kbp bin rolling window; B. OneRatioRef: proportion of samples for which the genotype score is 0 (homozygous reference); C. OneRatioSnp: proportion of samples for which the genotype score is 2 (homozygous alternate); D. FreqHomRef: proportion of homozygous samples for the Reference allele; E. FreqHomSnp: proportion of homozygous samples for the Alternate (SNP) allele; F. FreqHets: proportion of samples which score as heterozygous, that is, scored as 1; G. PICRef: polymorphism information content (PIC) for the Reference allele; H. PICSnp: polymorphism information content (PIC) for the SNP; I. AvgPIC: average of the polymorphism information content (PIC) of the Reference and SNP alleles; J. AvgCountRef: sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Reference allele row; K. AvgCountSnp: sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Alternate (SNP) allele row.
Figure 2. Circos depicting statistical indexes across the Cypriot germplasm collection for DArTseq markers. A. SNP density using a 200 Kbp bin rolling window; B. OneRatioRef: proportion of samples for which the genotype score is 0 (homozygous reference); C. OneRatioSnp: proportion of samples for which the genotype score is 2 (homozygous alternate); D. FreqHomRef: proportion of homozygous samples for the Reference allele; E. FreqHomSnp: proportion of homozygous samples for the Alternate (SNP) allele; F. FreqHets: proportion of samples which score as heterozygous, that is, scored as 1; G. PICRef: polymorphism information content (PIC) for the Reference allele; H. PICSnp: polymorphism information content (PIC) for the SNP; I. AvgPIC: average of the polymorphism information content (PIC) of the Reference and SNP alleles; J. AvgCountRef: sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Reference allele row; K. AvgCountSnp: sum of the tag read counts for all samples, divided by the number of samples with non-zero tag read counts, for the Alternate (SNP) allele row.
Plants 14 03000 g002
Figure 3. (A). Removing loci based on call rate (threshold = 0.95) resulted in the retention of 13,215 loci across 50 bean genotypes (eight populations). (B). Frequencies of SNPs detected across sequence tags depicting that a significant number of tags were polymorphic and that the majority of tags contained one SNP across the Cypriot bean germplasm. (C). Smear plot depicting color-coded loci by genotypes for DArTSeq scores of 0 (homozygous reference), 1 (heterozygous reference), 2 (homozygous alternate), and NA (null alleles).
Figure 3. (A). Removing loci based on call rate (threshold = 0.95) resulted in the retention of 13,215 loci across 50 bean genotypes (eight populations). (B). Frequencies of SNPs detected across sequence tags depicting that a significant number of tags were polymorphic and that the majority of tags contained one SNP across the Cypriot bean germplasm. (C). Smear plot depicting color-coded loci by genotypes for DArTSeq scores of 0 (homozygous reference), 1 (heterozygous reference), 2 (homozygous alternate), and NA (null alleles).
Plants 14 03000 g003
Figure 4. (A). Details of pairwise LD values of polymorphic sites are plotted on both the X and Y axes in the triangle plot created by TASSEL for pairwise LD between SNP sites across the 11 bean chromosomes (rolling window 500). The corresponding p-values from the rapid 1000 shuffle permutation test are shown below the diagonal, while r2 values are displayed above. Using color codes to indicate the existence of substantial LD, each cell compares two pairs of marker sites. Threshold values on both diagonals are indicated by colored bar codes. (B). Scatter plot showing the linkage disequilibrium (LD) decay across the genome of the Cypriot bean germplasm. The genetic distance in bp is plotted against the LD estimate (r2) for each pair of SNP markers. The red line represents the trend of non-linear regressions against physical distance. The horizontal and vertical lines indicate the crucial values for r2 (0.2) and LD decay levels, respectively.
Figure 4. (A). Details of pairwise LD values of polymorphic sites are plotted on both the X and Y axes in the triangle plot created by TASSEL for pairwise LD between SNP sites across the 11 bean chromosomes (rolling window 500). The corresponding p-values from the rapid 1000 shuffle permutation test are shown below the diagonal, while r2 values are displayed above. Using color codes to indicate the existence of substantial LD, each cell compares two pairs of marker sites. Threshold values on both diagonals are indicated by colored bar codes. (B). Scatter plot showing the linkage disequilibrium (LD) decay across the genome of the Cypriot bean germplasm. The genetic distance in bp is plotted against the LD estimate (r2) for each pair of SNP markers. The red line represents the trend of non-linear regressions against physical distance. The horizontal and vertical lines indicate the crucial values for r2 (0.2) and LD decay levels, respectively.
Plants 14 03000 g004
Figure 5. (A). A Sankey Diagram was utilized to display patterns of private alleles across populations and flows (private alleles) between nodes (varieties). Their connections are depicted by arcs with widths proportionate to the relevance of the flow (number of private alleles). (B). Minor allele frequency across the Cypriot bean germplasm.
Figure 5. (A). A Sankey Diagram was utilized to display patterns of private alleles across populations and flows (private alleles) between nodes (varieties). Their connections are depicted by arcs with widths proportionate to the relevance of the flow (number of private alleles). (B). Minor allele frequency across the Cypriot bean germplasm.
Plants 14 03000 g005
Figure 6. (A). Significant departures of HWE. HW equilibrium agreement was based on observed frequencies of reference homozygotes, heterozygotes, and alternate homozygotes as previously described [49]. (B). Indexes of heterozygosity and inbreeding across the Cypriot bean germplasm as calculated across 13,078 SNP loci. Observed heterozygosity (Ho) = number of homozygotes/n_Ind, where n_Ind is the number of genotypes without missing data. Expected heterozygosity (He) = 1 − (p2 + q2), where p is the frequency of the Reference allele and q is the frequency of the Alternative allele. Unbiased expected heterozygosity (uHe) = He × (2 × n_Ind/(2 × n_Ind − 1)). Inbreeding coefficient (FIS) = 1 − (mean(Ho)/mean(uHe)).
Figure 6. (A). Significant departures of HWE. HW equilibrium agreement was based on observed frequencies of reference homozygotes, heterozygotes, and alternate homozygotes as previously described [49]. (B). Indexes of heterozygosity and inbreeding across the Cypriot bean germplasm as calculated across 13,078 SNP loci. Observed heterozygosity (Ho) = number of homozygotes/n_Ind, where n_Ind is the number of genotypes without missing data. Expected heterozygosity (He) = 1 − (p2 + q2), where p is the frequency of the Reference allele and q is the frequency of the Alternative allele. Unbiased expected heterozygosity (uHe) = He × (2 × n_Ind/(2 × n_Ind − 1)). Inbreeding coefficient (FIS) = 1 − (mean(Ho)/mean(uHe)).
Plants 14 03000 g006
Figure 7. (A) Phylogenetic dendrogram based on DArTseq-derived SNP data, constructed using the approximate likelihood-ratio test (aLRT) method. Bootstrap support values are indicated as circles at each node, reflecting the confidence level of branching. Genotypes cluster primarily according to usage type (dry vs. green beans). (B) STRUCTURE bar plot representing genetic clustering (K = 2) across 50 Cypriot Phaseolus vulgaris genotypes. Each vertical bar represents one genotype, and colors indicate membership proportions to inferred genetic clusters. A clear separation between green and dry bean types is visible, with admixture observed in select varieties such as “Moraleda” and “Gliastro”.
Figure 7. (A) Phylogenetic dendrogram based on DArTseq-derived SNP data, constructed using the approximate likelihood-ratio test (aLRT) method. Bootstrap support values are indicated as circles at each node, reflecting the confidence level of branching. Genotypes cluster primarily according to usage type (dry vs. green beans). (B) STRUCTURE bar plot representing genetic clustering (K = 2) across 50 Cypriot Phaseolus vulgaris genotypes. Each vertical bar represents one genotype, and colors indicate membership proportions to inferred genetic clusters. A clear separation between green and dry bean types is visible, with admixture observed in select varieties such as “Moraleda” and “Gliastro”.
Plants 14 03000 g007
Figure 8. PCA analysis depicting the genetic associations across Cypriot bean genotypes.
Figure 8. PCA analysis depicting the genetic associations across Cypriot bean genotypes.
Plants 14 03000 g008
Table 1. Cypriot P. vulgaris accessions analyzed in this study, and their classification by origin and usage type.
Table 1. Cypriot P. vulgaris accessions analyzed in this study, and their classification by origin and usage type.
NoVariety NameOriginType
1KreatofasouloHeirloom variety collected from fieldGreen beans
2KreatofasouloHeirloom variety collected from fieldGreen beans
3KreatofasouloHeirloom variety collected from fieldGreen beans
4KreatofasouloHeirloom variety collected from fieldGreen beans
5KreatofasouloHeirloom variety collected from fieldGreen beans
6KreatofasouloHeirloom variety collected from fieldGreen beans
7KreatofasouloHeirloom variety collected from fieldGreen beans
8KreatofasouloHeirloom variety collected from fieldGreen beans
9KreatofasouloHeirloom variety collected from fieldGreen beans
10KreatofasouloHeirloom variety collected from fieldGreen beans
11Koutsouli WhiteHeirloom variety collected from fieldDry beans
12Koutsouli WhiteHeirloom variety collected from fieldDry beans
13Koutsouli WhiteHeirloom variety collected from fieldDry beans
14Koutsouli WhiteHeirloom variety collected from fieldDry beans
15Koutsouli WhiteHeirloom variety collected from fieldDry beans
16Koutsouli WhiteHeirloom variety collected from fieldDry beans
17Koutsouli WhiteHeirloom variety collected from fieldDry beans
18Koutsouli WhiteHeirloom variety collected from fieldDry beans
19Koutsouli WhiteHeirloom variety collected from fieldDry beans
20Koutsouli WhiteHeirloom variety collected from fieldDry beans
21Stringless Blue LakeVariety from farmer A collected from fieldGreen beans
22Stringless Blue LakeVariety from farmer A collected from fieldGreen beans
23Stringless Blue LakeVariety from farmer A collected from fieldGreen beans
24Stringless Blue LakeVariety from farmer A collected from fieldGreen beans
25Stringless Blue LakeVariety from farmer A collected from fieldGreen beans
26Stringless Blue LakeVariety from farmer B collected from fieldGreen beans
27Stringless Blue LakeVariety from farmer B collected from fieldGreen beans
28Stringless Blue LakeVariety from farmer B collected from fieldGreen beans
29Stringless Blue LakeVariety from farmer B collected from fieldGreen beans
30Stringless Blue LakeVariety from farmer B collected from fieldGreen beans
31MoraledaVariety from nursery collected from fieldGreen beans
32MoraledaVariety from nursery collected from fieldGreen beans
33MoraledaVariety from nursery collected from fieldGreen beans
34Louva WhiteHeirloom variety collected from fieldDry beans
35Louva Red LongHeirloom variety collected from fieldDry beans
36Louva Red LongHeirloom variety collected from fieldDry beans
37Louva Red LongHeirloom variety collected from fieldDry beans
38Louva RedHeirloom variety collected from fieldDry beans
39Louva RedHeirloom variety collected from fieldDry beans
40Louva RedHeirloom variety collected from fieldDry beans
41Louva RedHeirloom variety collected from fieldDry beans
42Louva RedHeirloom variety collected from fieldDry beans
43GliastroHeirloom variety collected from fieldDry beans
44GliastroHeirloom variety collected from fieldDry beans
45GliastroHeirloom variety collected from fieldDry beans
46GliastroHeirloom variety collected from fieldDry beans
47GliastroHeirloom variety collected from fieldDry beans
48GliastroHeirloom variety collected from fieldDry beans
49GliastroHeirloom variety collected from fieldDry beans
50GliastroHeirloom variety collected from fieldDry beans
Table 2. Pairwise comparisons of fixed/private alleles among the Cypriot bean collection. Allele frequency differences (AFD) and diversity estimates (Chao1/Chao2) illustrate genetic differentiation and distinctiveness among variety pairs.
Table 2. Pairwise comparisons of fixed/private alleles among the Cypriot bean collection. Allele frequency differences (AFD) and diversity estimates (Chao1/Chao2) illustrate genetic differentiation and distinctiveness among variety pairs.
NoVariety 1Variety 2FixedPriv1Priv2Chao1Chao2TotalprivAFD
1GliastroKoutsouli White44357191554571422012,7360.456
2GliastroKreatofasoulo4722858235771482952150.115
3GliastroLouva White4921782451597144312,9830.467
4GliastroLouva Red46387453580971447313,2620.466
5GliastroLouva Red Long47937677534871441213,0250.459
6GliastroMoraleda31515982373671430097180.323
7GliastroStringless Blue Lake4132453249471354149470.144
8Koutsouli WhiteKreatofasoulo46285788693322082912,7210.44
9Koutsouli WhiteLouva White77219709512204329210.083
10Koutsouli WhiteLouva Red6261728173022047334580.083
11Koutsouli WhiteLouva Red Long9502124144122041235650.099
12Koutsouli WhiteMoraleda24353634303422029966680.229
13Koutsouli WhiteStringless Blue Lake41705267695422054112,2210.443
14KreatofasouloLouva White5165758654228294313,0080.456
15KreatofasouloLouva Red48967225608282947313,3070.455
16KreatofasouloLouva Red Long50267418559082941213,0080.448
17KreatofasouloMoraleda33915722397782930096990.318
18KreatofasouloStringless Blue Lake3892011255382854145640.123
23Louva RedLouva Red Long390154786247341224090.053
24Louva RedMoraleda27703978337647329973540.247
25Louva RedStringless Blue Lake44975665735047354113,0150.463
26Louva Red LongMoraleda28613395347841230068730.243
27Louva Red LongStringless Blue Lake46155158752841254112,6860.456
19Louva WhiteLouva Red614511664247313110.019
20Louva WhiteLouva Red Long4576269624341215880.05
21Louva WhiteMoraleda2925315735764330067330.246
22Louva WhiteStringless Blue Lake4754499376994354112,6920.463
28MoraledaStringless Blue Lake28763435572229954191570.314
Table 3. Loci exhibiting significant deviations from Hardy–Weinberg equilibrium across the selected genotypes. Loci identified and their functional annotation from NCBI BLAST highlight the potential association to disease resistance and metabolic processes, mainly in cultivars.
Table 3. Loci exhibiting significant deviations from Hardy–Weinberg equilibrium across the selected genotypes. Loci identified and their functional annotation from NCBI BLAST highlight the potential association to disease resistance and metabolic processes, mainly in cultivars.
VarietyLocusHom_1HetHom_2NProbNCBI Accessione Value
Gliastro100062028-27-C/T20680.01538--
Koutsouli White100068280-6-T/C0100100.00691XM_068642443.1 19.00 × 10−21
Stringless Blue Lake13121316-28-G/A505100.00136XM_068640160.12 20.002
Stringless Blue Lake13121337-48-C/T505100.00136--
Stringless Blue Lake13121391-56-C/A41490.03645XM_068606144.1 37.00 × 10−29
Stringless Blue Lake8671140-23-T/C505100.00136XM_068641491.1 42.00 × 10−9
Stringless Blue Lake8671150-35-T/C190100.04549XM_068641720.1 56.00 × 10−22
Stringless Blue Lake8671153-41-A/C712100.04644XM_068615419.1 68.00 × 10−19
Kreatofasoulo8671195-24-A/C091100.04549--
Gliastro8671212-10-T/C30580.00699XM_068628962.1 70.005
1: Phaseolus vulgaris probable disease resistance protein At4g27220 (LOC137834386), mRNA; 2: Phaseolus vulgaris SNF1-related protein kinase regulatory subunit beta-2-like (LOC137832141), mRNA; 3: Phaseolus vulgaris small ribosomal subunit protein uS5w-like (LOC137806161), mRNA; 4: Phaseolus vulgaris probable N-acetyltransferase HLS1-like (LOC137833118), mRNA; 5: Phaseolus vulgaris non-specific lipid-transfer protein AP10-like (LOC137833370), mRNA; 6: Phaseolus vulgaris rust resistance kinase Lr10-like (LOC137813293), mRNA; 7: Phaseolus vulgaris putative disease resistance RPP13-like protein 1 (LOC137823721), transcript variant X1, mRNA.
Table 4. Genetic diversity indexes per bean cluster based on DArTseq-derived SNPs. Variables (Ho, He/uHe, and FIS) reflect the elevated intra-varietal genetic diversity.
Table 4. Genetic diversity indexes per bean cluster based on DArTseq-derived SNPs. Variables (Ho, He/uHe, and FIS) reflect the elevated intra-varietal genetic diversity.
VarietypolyLocmonoLocHoHeuHeFIS
Koutsouli White133111,7470.0180.0200.0210.151
Gliastro297710,1010.0300.0600.0650.540
Stringless Blue Lake301810,0600.0250.0660.0690.634
Louva Red133311,7450.0260.0270.0300.141
Kreatofasoulo247610,6020.0270.0310.0320.168
Louva Red Long64812,4300.0250.0170.021−0.187
Moraleda73112,3470.0270.0190.023−0.152
Louva White31212,7660.0240.0120.0240.000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tsolakidou, M.-D.; Kyratzis, A.C.; Nikoloudakis, N. DArTseq Analysis of Cypriot Common Bean Germplasm Unveils an Assortment of Unexplored Genetic Variability. Plants 2025, 14, 3000. https://doi.org/10.3390/plants14193000

AMA Style

Tsolakidou M-D, Kyratzis AC, Nikoloudakis N. DArTseq Analysis of Cypriot Common Bean Germplasm Unveils an Assortment of Unexplored Genetic Variability. Plants. 2025; 14(19):3000. https://doi.org/10.3390/plants14193000

Chicago/Turabian Style

Tsolakidou, Maria-Dimitra, Angelos C. Kyratzis, and Nikolaos Nikoloudakis. 2025. "DArTseq Analysis of Cypriot Common Bean Germplasm Unveils an Assortment of Unexplored Genetic Variability" Plants 14, no. 19: 3000. https://doi.org/10.3390/plants14193000

APA Style

Tsolakidou, M.-D., Kyratzis, A. C., & Nikoloudakis, N. (2025). DArTseq Analysis of Cypriot Common Bean Germplasm Unveils an Assortment of Unexplored Genetic Variability. Plants, 14(19), 3000. https://doi.org/10.3390/plants14193000

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop