Next Article in Journal
Correction: Yuan et al. Stage-Dependent Mineral Element Dynamics in ‘Junzao’ Jujube: Ionic Homeostasis and Selective Transport Under Graduated Saline-Alkali Stress. Horticulturae 2025, 11, 726
Previous Article in Journal
Integrated Physiological and Transcriptomic Analyses Reveal the Mechanism of Salt Acclimation-Induced Salinity Tolerance in Tomato Seedlings
Previous Article in Special Issue
Establishing a Leaf-Derived Tissue Culture and Rapid Propagation Method for Red Fruit Ginseng
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

ddRADseq Applications for Petunia × hybrida Clonal Line Breeding: Genotyping and Variant Identification for Target-Specific Assays

Department of Agronomy, Food, Natural Resources, Animals and Environment (DAFNAE), Campus of Agripolis, University of Padova, 35020 Legnaro, Italy
*
Author to whom correspondence should be addressed.
Horticulturae 2026, 12(2), 160; https://doi.org/10.3390/horticulturae12020160
Submission received: 15 December 2025 / Revised: 16 January 2026 / Accepted: 23 January 2026 / Published: 30 January 2026

Abstract

Molecular genotyping is a key factor for plant breeding programming and plant variety protection (PVP). However, its potential still remains to be elucidated when considering ornamental plants like Petunia × hybrida. In this study, a petunia breeding clone collection, including sister line groups, was genotyped through double digest Restriction-site Associated DNA sequencing (ddRADseq), and its genetic diversity and structure were studied. In addition to estimating the high genetic similarity observed among sister lines, this approach allowed the unique discrimination of each clone too. Molecular results agreed with genealogy data, supporting the assessment of genotyping effectiveness. In addition, the minimal number of variants able to uniquely discriminate and/or correctly cluster the experimental lines was investigated. The loci number could be reduced to eight to achieve line discrimination, and a method to identify the specific variant sets is presented. Conversely, to preserve the original clustering with minor adjustments, one hundred loci were required and were obtained through minor allele frequency (MAF) filtering. Moreover, analysis of the chromosomal distribution of variants revealed a predominant accumulation in distal regions. Genetic analyses were repeated considering only variants located in coding sequences and results were in agreement with what previously observed, disclosing the potential of the expressed regions for genotyping purposes. Eventually, the applied approach enabled the investigation of SNPs within genes putatively involved in traits of interest. Our findings encourage the adoption of high-throughput and cost-effective sequencing techniques for petunia genotyping aimed at achieving PVP, supporting new variety registration, and developing marker-assisted breeding (MAB) and marker-assisted selection (MAS) strategies.

1. Introduction

Petunia (Petunia × hybrida hort. ex E. Vilm.; synonym: Petunia × atkinsiana (Sweet) D. Don ex W. H. Baxter) is an annual herb belonging to the Solanaceae family, originating in South America [1], and it is well known worldwide for its ornamental use. A wholesale value of almost USD 160 million was reported in the US in 2020 for petunia ornamental varieties [2,3]. Commercial lines are derived from interspecific hybrids artificially obtained in the eighteenth century [4,5] between P. axillaris (Lamarck) Britton, Sterns and Poggemburg, and one or more species belonging the clade of P. integrifolia (Hooker) Schinz and Thellung [6,7]. Natural hybrids between the two species were never observed, in part due to different pollination syndromes. In particular, P. axillaris is pollinated by hawk moths, while P. integrifolia by bees [8,9]. Nevertheless, hybrids derived from artificial crosses can produce fertile progenies and commercial lines comprehend both seed-propagated F1 hybrids and cutting-propagated clonal lines [1]. Although varieties are more often diploid, tetraploids have been observed and selected in breeding programs [10,11]. Petunia (2n = 2x = 14) is a facultative allogamous species, in which gametophytic self-incompatibility [12,13] and cytoplasmic male sterility [14,15,16] were extensively studied. It is considered a model plant [17,18], especially for the study of flower development and biochemistry [4], as well as for tissue culture practices [1]. In addition, the application of genetic engineering techniques has led to one of the first and most well-known cases of genetically modified petunia, the orange-flowered A1-DFR lines [19]. It is also worth noting that the RNA interference mechanism was discovered for the first time in this species [20,21]. Finally, molecular resources developed for petunia comprehend high-density linkage maps with different marker types [22,23,24] and the draft genome assembly of two of its main species P. axillaris and P. integrifolia [25]. Recently, a chromosome-scale genome assembly and annotation of P. × hybrida was developed by combining short- and long-read sequencing, optical mapping and Hi-C technology [26].
In recent decades, breeding goals for petunia have mainly included aesthetic–sensorial aspects, particularly in relation to flower morphology and phenology [2,27,28]. More recently, breeding has also dealt with new emerging needs, such as ecosystem service provision, adaptability and sustainability [29,30]. The evaluation of these traits through the exclusive use of morphological and phenological measurements is often impossible. Nevertheless, the official protocols for variety registration aimed at assessing distinctness, uniformity and stability (DUS), such as those of the European Community Plant Variety Office (CPVO) and International Union for the Protection of New Varieties of Plants (UPOV) [31,32], are still exclusively based on this type of descriptor. In petunia, 34 and 32 descriptors, including plant growth habit, leaf shape and flower density, are provided respectively, by EU and US authorities for DUS tests [33,34]. Since official registration is necessary to achieve plant variety protection (PVP) and to guarantee plant breeders’ rights (PBRs) [35], incorporating molecular data into the registered variety profiles, in conjunction with specific phenotypic measurements, could provide significant benefits. For example, a clear enhancement in a sustainability-related trait (e.g., tolerance to low nutrient levels or reduced phytosanitary inputs) assessed through appropriate phenotypic evaluations (e.g., value for cultivation and use (VCU) tests), may not be reflected by visible morphological changes. Hence, traditional DUS tests may be insufficient for a successful evaluation. In this case, DUS tests could be carried out combining phenotypic measurements and genetic diversity analyses based on variety molecular profiles (value molecular DUS (vmDUS)) [36]. However, the possibility to exploit molecular genotyping for official variety registration is still limited to few specific functional markers, or it is used only to facilitate traditional DUS test procedures [37]. Molecular genotyping can also be useful for directly helping PVP, thus facilitating the identification of plant materials of unknown origin, essentially derived varieties (EDVs) and commercial frauds [35], as well as for Marker-Assisted Breeding (MAB) and Selection (MAS) [38,39,40].
Double digest Restriction-site Associated DNA sequencing (ddRADseq) is a Reduced Representation Sequencing (RRS) technology exploiting the gDNA digestion with two restriction enzymes, in order to reduce the fraction of the genome to be sequenced [41]. ddRADseq is reported as an approach able to ensure reliable genotyping results and relatively low cost and time consumption. Therefore, it can be particularly useful for agricultural species lacking economic convenience in performing deeper and hence more expensive analyses [42,43,44], like the majority of ornamental plants. The technique was successfully exploited in several plant species for different purposes, such as structure evaluation of wild populations [45,46,47], germplasm conservation [48,49,50], Quantitative Trait Locus (QTL) identification [51,52], Genome-Wide Association Study (GWAS) [53], phylogenesis [54], Marker-Assisted Selection (MAS) and Marker-Assisted Breeding (MAB) [38,39,40]. Regarding ornamentals, ddRADseq have been used for genotyping in several species, like Bougainvillea spp., lantana, orchid and primula [52,55,56,57,58,59,60]. The thousands of variants generated by ddRAD may be reduced down to an informative core (SNP core set), which, in turn, can be used for the development of target-specific assays, like Fluidigm, Kompetitive Allele-specific PCR (KASP) or AmpliSeq assays [61,62,63,64,65,66]. This can lead to a reduction in costs and procedure times for future analyses, as well as to an easier comparison of data from different populations and crop cycles. Furthermore, this may help the detection of specific regions of interest to be targeted for functional studies and MAS scheme development [66,67,68]. However, in ornamentals, very few works have selected SNP core sets and created target-specific assays [69,70,71]. Understanding how and to what extent molecular profiles should be filtered to produce an effective SNP core set can be challenging, especially when dealing with breeding populations characterized by highly related lines with reduced genetic and phenotypic variability.
In this study, a genotyping analysis using a ddRADseq approach was performed in a petunia breeding population of clonal lines consisting of sister line groups. Assuming a high level of intragroup genetic uniformity, the potential of this genotyping technique in uniquely discriminating the clones and in providing genetic information for the population was assessed. The aim was to evaluate the suitability of ddRADseq as a starting point to develop procedures for variety registration, protection and creation through MAB strategies. In addition, we determined to what extent and in which way the polymorphism number could be reduced while remaining informative, thereby identifying an SNP core set, without altering sample clustering. Eventually, the possibility of exploiting only expressed regions for genotyping analysis by selecting variants exclusively located in Coding DNA Sequences (CDSs) was also evaluated.

2. Materials and Methods

2.1. Plant Materials

Leaf samples from 36 petunia clonal lines were granted by Gruppo Padana S.S. (Paese, TV, Italy), a nursery company specializing in the production, selection and collection of young plants from seeds and cuttings. The analyzed clones included 32 breeding lines (BLs) and 4 benchmarks (BMs). BLs were divided into 4 groups of sister lines, i.e., clones sharing at least one of the two parental lines. BMs consisted of already developed varieties available on the market and exploited to evaluate the distinctness of BLs. Since the population consisted of clones, a single replicate per line was exploited. Genomic DNA (gDNA) was extracted from 100 mg of leaf tissue through DNeasy Plant mini kit (Qiagen, Valencia, CA, USA), following manufacturer instructions. Genomic DNA quality and concentration were assessed using NanoDrop 2000c UV-Vis spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and agarose gel electrophoresis with 1% agarose/1× TAE gel and 1× SybrSafe DNA stain (Life Technologies, Carlsbad, CA, USA).

2.2. ddRADseq Library Preparation and Sequencing

Double digestion Restriction-site Associated DNA sequencing (ddRADseq) libraries were produced following the protocol developed by Abed et al. [72], with some modifications. In detail, extracted gDNA was normalized to 20 ng/µL and a total of 200 ng per sample was used. PstI and MspI (Thermo Fisher Scientific) restriction enzymes were used, with 1 µL of each per reaction, for DNA digestion at 37 °C for 2 h, following the producer protocol. Adapter ligation was performed in 50 µL of reaction including 30 µL of digested DNA, 1 µL of T4 DNA ligase, 5 µL of 10× T4 ligation buffer (New England BioLabs, Ipswich, MA, USA), 5 µL of barcode adapters (0.1 µM) and common adapters (10 µM). Ligation was carried out with an incubation at room temperature for 2 h, followed by 20 min at 65 °C. Ligated DNA samples were grouped into two pools, each including 18 samples, and purified through QIAquick PCR and Gel Cleanup Kit (Qiagen) for the elimination of impurities, non-ligated adapters, small fragments and dimers. Size selection from 100 bp to 600 bp was performed with the CleanNGS magnetic beads (CleanNA, Waddinxveen, The Netherlands). Amplification of the obtained libraries was performed with Platinum SuperFi PCR Master MIX (Thermo Fisher Scientific), following the enrichment PCR protocol described by Abed et al. [72]. Amplified libraries were subjected to another step of purification with CleanNGS magnetic beads. Subsequently, they were quantified using the TapeStation System (Agilent Technologies, Santa Clara, CA, USA) with High Sensitivity D1000 DNA ScreenTape assays (Agilent Technologies) and diluted to 200 pM for the Ion Torrent sequencing. Libraries were loaded into Ion Chips 550 (Thermo Fisher Scientific) through the Ion Chef (Thermo Fisher Scientific) automated machine. Single-end sequencing was performed in two runs, one per pool, with the Ion GeneStudio S5 Plus System (Thermo Fisher Scientific) set to 200 bp of read length and 500 flows per run.

2.3. ddRADseq Data Processing

Raw reads were quality-checked by filtering out those with a Phred Quality score lower than 20, and then demultiplexed according to the adopted barcode recognition motif, through the Torrent Suite platform (Thermo Fisher Scientific) following the native implemented pipeline. After these steps, reads were mapped on the reference P. × hybrida v.01 genome assembly retrieved from the NCBI database (GCA_046563445.1) [26]. Mapping was performed exploiting the MEM algorithm of Burrows-Wheeler Aligner (BWA) v. 0.7.12 [73], followed by data conversion into BAM files through SAMtools and BCFtools v.1.13 software [74]. For the variant calling the gstacks and populations commands of Stacks v2.61 software [75] were used. gstacks was launched exploiting the ‘marukihigh’ model, with both a SNP discovery (--var-alpha) and a genotype call (--gt-alpha) with alpha thresholds of 0.10, while populations was run in order to keep loci present in at least 30% of individuals in a subpopulation (-r 0.30) and 65% of individuals across subpopulations (-R 0.65). Obtained SNPs were filtered through the poppr v2.9.6 package [76] for RStudio software v.4.2.3 (Posit, Boston, MA, USA), in order to retain only the loci present in all samples and with a minimal Minor Allele Frequency (MAF) of 0.1. The resulting SNPs composed the initial dataset for the genetic analyses.

2.4. Genetic Distance, Structure and Statistic Analyses

All analyses and graphic processing were carried out through RStudio, unless specified otherwise. Genetic Distance (GD) matrixes were produced through the Nei’s coefficient [77] considering all pairwise comparisons among sample SNP profiles and were made through poppr package’s nei.dist function. A Kinship-based INference for Genome-wide association studies (KING) matrix based on pairwise comparisons was calculated with the KING-robust method exploiting the PLINK2 [78] --make-king function [79,80]. Furthermore, 0.4 was chosen as a minimal threshold to hypothesize the same clone membership for a pair. The Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrograms for sample clustering were created based on Nei’s coefficient with the aboot function from the poppr package, considering 1000 bootstrap replicates. Clustering visualization was carried out through dendrograms created with the plot.phylo function from the ape package [81]. Principal Component Analysis (PCA) based on the GD analysis was performed using the gl.pcoa function from the dartR package v.2.9.9.5 [82] Structure v.2.2 [83] software was used for Bayesian clustering to infer the sample genetic structure. The following parameters were set: founding groups ranging from 1 to 10, 10 replicate simulations per each K, burn-in of 20,000, final run of 100,000 Markov Chain Monte Carlo (MCMC) steps. The K values characterized by the highest likelihood were selected using Structure Harvester software v. 0.6.94 [84]. Calculated genetic statistics included: observed (Ho), expected (Hs) and overall (Ht) Heterozygosity; inbreeding coefficient within subpopulations (Fis) and within the total population (Fit); subpopulation fixation index (Fst); gene flow (Nm); average number of observed (na) and expected (ne) alleles per locus; Private Alleles (PA) and Polymorphic Loci (PL) percentage, average locus Polymorphic Information Content (AvgPIC) and MAF. Statistics were measured either with the hierfstat package [85] basic.stat function, the dartR package loc.metrics function, or with a manually developed R script. Analysis of Molecular Variance (AMOVA) between and within subpopulations was performed using the ade4 method with the poppr.amova function and the significance of the results was validated through a Monte-Carlo test considering 999 permutations.

2.5. SNP Subset Selection for Core Set and Location in CDSs

Further filtering of the initial dataset for the SNP core set identification was performed by setting alternative combinations of MAF, AvgPIC and Hs. The filtering continued until the obtained dataset led to a BL clustering consistently different from the one obtained using the entire SNP set. The genotype accumulation curve was created through the genotype_curve function of the poppr R package v.2.9.8, setting a 10 k times loci resampling. The definition of the minimal set of SNPs able to discriminate all 36 genotypes was obtained with the subsequent method: a custom R function randomly orders all the SNPs and then, starting from the first, selects the minimum set of SNPs able to discriminate all 36 samples, storing it in memory. The function reiterates the same process, and each time, if the identified set is smaller than the stored one, it substitutes it. A second custom R function parallelizes the process and independently repeats it to obtain different minimum discriminating SNP sets. In this analysis 100 minimum discriminating SNP sets were generated, each resulting from 100,000 random iterations. The custom code written for this publication is publicly available [86]. In parallel, Coding DNA Sequences (CDSs) were retrieved from the reference genome annotation and were intersected to the initial SNP dataset using the bedtools intersect command of the Bedtools v.2.30.0 software [84], allowing to retain and use variants exclusively present in the coding sequences for clustering analyses. To visually represent them and allow the comparison of their distribution with the gene density, a genome map was created using Circos software v.0.69-10 (Michael Smith Genome Sciences Centre, Vancouver, BC, Canada) [87]. Protein sequences relative to the CDSs harboring SNPs were aligned by BLASTx v.2.12.0 (NCBI, Bethesda, MD, USA) against the P. axillaris (v1.6.2) [25] and Arabidopsis thaliana (TAIR10.1) [88] proteomes. Hits were retained using a minimum coverage threshold of 65% and an E-value cutoff of 1 × 10−5. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways relative to the resulting putative orthologs were then retrieved and a Gene Orthology (GO) enrichment through ShinyGO (v0.85.1) [89] was performed.

3. Results

3.1. Genetic Distance and Structure Analysis

A ddRADSeq analysis was performed on genotype 36 petunia BLs, comprehending four sister line groups, and four benchmarks (BMs). After quality filtering and mapping, more than 80 M reads with an average of 2.2 M reads per sample were obtained. This value corresponded to the 85.7% of the 12.3 Gb sequenced in total. The mean demultiplexed and trimmed read length resulted was 173 bp (Table 1; Supplementary Table S1).
Variant calling resulted in the identification of 42,236 SNPs located in a total of 15,655 loci. SNP filtering allowed the creation of a final dataset of 4349 variant sites in the 36 analyzed samples.
A GD analysis among samples through the Nei’s coefficient resulted in no pairwise comparison with complete genetic identity, and unique molecular profiles were produced for all the considered BLs. GD among samples ranged from 0.001 (PeHy-BL3-06 vs. PeHy-BL3-08) to 0.51 (PeHy-BL3-09 vs. PeHy-BM03), with an average of 0.23 ± 0.11 (Supplementary Table S2). Considering sister line groups as a whole, the lowest GD value, 0.07, was found comparing PeHy-BL1 and PeHy-BL2 subpopulations, while the highest, 0.29, was observed in the comparison between PeHy-BL2 and PeHy-BL3 (Table 2).
The kinship matrix created with the KING-robust method led to an average value of 0.46 with a SD of 0.52. Three pairwise comparisons exceeded the 0.4 value, used as a threshold for the identification of individuals belonging to the same clone [79,80]: PeHy-BL3-01 vs. PeHy-BL3-07; PeHy-BL3-06 vs. PeHy-BL3-08; PeHy-BL4-03 vs. PeHy-BL4-12 (Supplementary Table S3).
In the UPGMA dendrogram based on the GD analysis, three principal branches were observed: one grouping exclusively PeHy-BL3 lines, a second with PeHy-BL4 lines, and the last comprehending PeHy-BL1, PeHy-BL2 and the four BMs. All bootstrap values at the bases of the branches reached the maximum value (100). Within the last cluster, PeHy-BM3 was highly separated from the rest of the group, while a branch grouped together the other three BMs and two further branches hosted respectively PeHy-BL1 and PeHy-BL2 lines. All the relative nodes were characterized by very high bootstrap values (Figure 1A).
The genetic structure analysis showed the most likely K value, namely the most probable number of common ancestors, to be K = 3 (Delta K = 10,407.3), followed by K = 2 (Delta K = 3068.5) (Supplementary Figure S1). In both cases, each BL was predominantly associated with a specific ancestral population, with the exception of PeHy-BL3-04 and PeHy-BL3-02, which had an admixed result. PeHy-BM3 and the PeHy-BL4-07–PeHy-BL4-11 group were also found to be admixed, but only for one of the two K values (Figure 1B). When assuming K = 2, PeHy-BL3 was the only group assigned to one of the two inferred ancestral populations, while all other BLs and the four BMs exhibited a shared ancestry. For K = 3, in addition to the separate clustering of the PeHy-BL3 group, a further division was observed between the PeHy-BL4 group and the remaining lines (namely PeHy-BL1, PeHy-BL2 and the four benchmark populations) (Figure 1B). Results were in agreement with what was observed with the UPGMA clustering, and therefore with the genealogy data. The PCA based on GD confirmed the findings observed in the structure analysis and the UPGMA dendrogram. The first two components alone explained 50.3% of the total variability. PeHy-BL3 and PeHy-BL4 clustered apart, while the two other sister line groups and the benchmarks fell into the same group and were found to be poorly distinguishable (Figure 2).

3.2. Genetic Statistics and AMOVA

The sample pool was studied considering several genetic parameters. Average Ho per BL resulted in 26.0 ± 8.5%, with 1.26 na and 1.15 ne. The highest Ho values considering sister line groups were observed for PeHy-BL4 lines (33.6 ± 6.3%), while the lowest values were scored for PeHy-BL1 lines (19.1 ± 3.4%) and PeHy-BM4 (16.3%). Average Fis was negative and comprehended between −0.2 and −0.3, underlining an overall moderate heterozygosity excess for BLs in relation to what would be expected for the respective subpopulations (Table 3; Supplementary Table S2).
Fst ranged from 0.22 to 0.50, suggesting a high genetic fixation for sister line groups (Table 2). As a result of these two factors, a slightly positive inbreeding coefficient for BLs in comparison to the expected considering the entire plant collection was observed (Fit: 0.02), in addition to a limited gene flow (Nm: 0.05). The percentage of PA per sample, relative to the total number of PA in the population ranged from 0.4% for PeHy-BL4-12 to 20.1% for PeHyBM3. The other three BM lines also exhibited percentages of Private Alleles considerably higher than their BL counterparts, ranging from 7.33% to 19.49% (Supplementary Table S2). The PA percentages considering each BL as a whole ranged from 6.4% for PeHy-BL1 to 34.0% for PeHy-BL3. A minor deviation was noted considering PL percentage of the total population loci: from 63.8% (PeHy-BL1) to 81.9% (PeHy-BL4) (Table 3). The AMOVA carried out comparing molecular variance between and within subpopulations (sister line groups and BMs) demonstrated good reliability, showing a p-value of 0.001. The analysis revealed that a substantially larger proportion of the total observed variability was attributable to differences among subpopulations, rather than among samples within the same subpopulation. Respectively, the estimated variance quotes were 58.2% and 41.8% (Table 4).

3.3. GD Analysis Comparison Between Total and Core Set SNP Profiles

The 4349 SNPs used for the GD analysis (Figure 3A) were filtered multiple times by acting on MAF, PIC and Hs values, to find the lowest number of variants (SNP core set) able to correctly cluster the BLs.
The minimum SNP amount by which a unique molecular profile was conserved for all BLs with this approach was 1051, which was obtained retaining only those variants with a MAF greater than 0.3. The UPGMA clustering and the bootstrap values remained unchanged for the majority of the previously highlighted groups (Figure 3B). Only the hierarchical position of the PeHy-BL1 cluster in relation to the PeHy-BL2 group and to the one including PeHy-BM1, PeHy-BM2 and PeHy-BM4, changed from the initial results (i.e., those obtained considering the whole SNP set) (Figure 3A). By increasing the MAF threshold to 0.45, the number of SNPs drops sharply to 103 with an average PIC of 0.3. Based on these settings, PeHy-BL3-06 and PeHy-BL3-08 presented complete genetic identity and minor clustering changes were observed (Supplementary Table S4, Figure 3C). The main difference found in the UPGMA grouping (compared to that obtained using the complete dataset) was a shift in the hierarchical positions of the PeHy-BM3 and PeHy-BL4 clusters. The bootstrap value at the base of the branch separating PeHy-BM3 from the rest of the accessions remained high (82.5), while the other basal nodes showed values lower than 50 (Figure 3C). Beyond the MAF threshold of 0.45, significant grouping distortions occurred. The same occurred when stringent filters related to PIC and Hs were applied to the original dataset (Figure 3D). A second analysis was performed, aiming to quantify and identify the minimal set of SNPs able to discriminate all 36 lines. A genotype accumulation curve was created with the initial dataset to determine the minimal variant number able to discriminate all BLs. 8 polymorphic loci met the target criterion, whereas the curve reached a plateau at approximately 30 loci (Supplementary Figure S2). An extensive search for the minimal discriminating SNP sets led to the identification of 5 sets composed by 8 SNPs. The pairwise GD analysis performed with each set did not show in any case complete genetic identity (Supplementary Table S5).

3.4. GD Analysis Comparison Between Total and CDS-Located SNP Profiles

The initial SNP set was intersected with the CDSs annotated in the reference genome in order to retain only those variants that were located within coding sequences. Genotyping and clustering the clones using exclusively these variants allowed to assess possible differences with the previous results derived from the same analyses. 1975 SNPs were retained, corresponding to 45.4% of the total dataset. Their positions resulted uniformly distributed across the 7 chromosomes. Within each chromosome, distal areas presented higher variant site density in comparison to pericentromeric zones, reflecting gene distribution (Figure 4A).
Pairwise GD comparisons led to values very similar to those observed considering the initial dataset (Avg: 0.22 ± 0.10) (Supplementary Table S6). The Ho, Hs and Fis values also remained largely unchanged. The UPGMA dendrogram led to a completely identical clustering to that obtained with the original dataset. Bootstrap values were slightly lower in the main nodes, but continued to be significant (>80) (Supplementary Figure S3). The PCA explained 50.4% of the total genetic variation and subpopulation differentiation matched that obtained in the previous results. In particular, PeHy-BL3 and PeHy-BL4 groups were strongly separated, while PeHy-BL1, PeHy-BL2 and BMs samples resulted undistinguishable (Supplementary Figure S4). From the GO analysis of the genes whose CDSs were found to contain SNPs, 8 subcategories resulted enriched in the “Biological process” category, including the protein refolding (GO:0042026), the transmembrane transport (GO:0055085) and the response to hormone (GO:0009725) processes (Figure 4B), whereas 19 subcategories were enriched in the “Molecular function” category, including nucleic acid binding (GO:0003676), structural molecule activity (GO:0005198) and protein kinase activity (GO:0004672) (Figure 4C). Following the KEGG investigation, 390 genes harboring SNPs fell into at least one of the pathways. Of these, 50 ortholog genes resulted involved in secondary metabolism, 6 in flavonoid biosynthesis, 5 in terpenoid production, 4 in phenylpropanoid production, 18 in cofactor biosynthesis, 48 in plant hormone signal transduction (Supplementary Table S7).

4. Discussion

Molecular genotyping exploiting different types of molecular markers proved to be very useful for breeding plan setting and management in several ways. It can help retrieving genetic information about populations, such as genetic diversity and Heterozygosity, planning crosses, reducing inbreeding depression, exploiting heterosis, and favoring traits of interest’s selection [39,90,91,92,93]. In addition, the production of molecular profiles through genomic marker analysis can support PVP and variety registration, simplify DUS tests, help germplasm management, and aid the identification of GMOs, unknown plant materials and EDVs [35,37,50,68,94]. In petunia, several genotyping studies have been conducted using different types of molecular markers, including RAPDs [95], AFLPs [96,97], SSRs [8,98], SRAPs [99] and CAPSs [8,100]. GBS was used for genetic structure studies on petunia natural populations, including the parental species used for the production of the commercial interspecific hybrids, P. axillaris and P. integrifolia [101,102]. SNP genotyping was also successfully carried out on breeding collections for linkage map production and QTL analysis of flower-related traits, such as flowering timing, pollinator syndrome, and double flower development. Informative variants were retrieved from transcriptome data [7,103], targeted-sequencing techniques [2,22,104] and other GBS-based protocols [101,102,105,106]. By contrast, the use of molecular assays for the genotyping of experimental lines for breeding purposes, variety registration and variety protection, remains an underexplored area in petunia.
The present study focused on molecular genotyping through ddRADseq of 36 petunia BLs and BMs. The genotyping analysis proved to be effective and useful for PVP-related purposes, assigning a unique molecular profile to all lines of interest and discriminating each genotype from the others. In addition, it allowed us to estimate the GD values within the collection and other genetic parameters that are of interest for breeding plan management. BLs were subdivided into four sister line groups. As expected, sister lines with a relatively high homozygosity level were also characterized by a high genetic uniformity. Some lines (i.e., PeHy-BL3-01 vs. PeHy-BL3-07; PeHy-BL3-08 vs. PeHy-BL3-06; PeHy-BL4-03 vs. PeHy-BL4-12) turned out to be distinguished by a very limited polymorphism amount (respective GD: 1.7 × 10−2; 1.2 × 10−3; 0.29 × 10−3) suggesting the possibility that they might represent the same clone. In this case, it is crucial to determine standard thresholds of molecular similarity above which two accessions should be regarded as belonging to the same clonal lineage [107]. Some studies in different plant species, e.g., Populus tremuloides, Piper spp. and Callistemon kenmorrisonii, developed methods that exploited molecular diversity calculated with different markers for this purpose [108,109,110]. Thresholds were determined by estimating sequencing error through replicate sampling and therefore the results were population- and marker- specific. However, the possibility of using several thousands of SNPs for genotyping might have minimized the sequencing error impact, allowing them to normalize analyses and hence define standard values for clone identification. The exploitation of a kinship coefficient, based on the probability calculation that a pair of randomly sampled alleles is identical by descent (IBD) (i.e., due to genetic inheritance rather than by chance) [111,112] may help achieve this goal. Indeed, kinship coefficients return value near 0.5 in case of clones in plants—or of homozygous twins when considering animals—while values between 0.25 and 0.5 identify full siblings. Due to sequencing errors or somatic mutations, a kinship coefficient obtained between actual clones tends to be slightly lower than 0.5; therefore 0.4 was suggested as an effective threshold [113]. Up to now, kinship matrixes in plant breeding studies have been mainly exploited to identify the most genetically different genotypes in order to plan crosses [114] or to carry out the correction for relativeness in GWASs [115]. KING-robust [79,80] was described in a study comparing several kinship estimation methods in six non-model Australian species, as the best framework in case of high population structure and inbreeding, setting a maximum locus missingness below 10% and a minimal minor allele frequency threshold of 10% [116]. Exploiting this method, all three very genetically similar BL couples, and only these, presented kinship coefficients higher than the threshold value of 0.4. However, the approach has to be tested in other petunia populations exploiting preliminarily assessed clones. Nevertheless, the production of unique genotyping profiles for all lines demonstrated the approach’s capacity for discriminating between highly related genotypes. The SNP ability to uniquely genotyping very similar lines in petunia was previously shown by GBS studies for QTL analysis exploiting Recombinant Inbred Lines (RILs) and Near Isogenic Lines (NILs) [2,22,104]. However, to our knowledge this is the first study to exploit and review a ddRADseq method for petunia genotyping. Conversely, the methodology was used in several other species of ornamental interest, including Bougainvillea spp., Persian buttercup, Ceiba spp., flowering dogwood, lantana, lavender, orchid and primula, for MAB-oriented [56,58,117], ecology of wild populations [55,59] or trait association [52,57,60] purposes. These studies are consistent with our results, providing thousands of high-quality SNPs and confirming the efficiency of ddRADseq for genotyping.
The UPGMA clustering, the PCA and the genetic structure analysis led to consistent results and confirmed the genetic uniformity between sister lines, grouping them in the same branch, area of the plot, or ancestral group, respectively. The link with the genealogical data supported the reliability of the genotyping analysis. On the contrary, variability among sister line groups was significant. This was suggested by the AMOVA (variability between subpopulations > 50%), the fixation index calculation (Fst > 0.25 overall and in every pairwise comparison between sister line groups) and the genetic structure. However, despite the PCA explaining a percentage of total variability > 50%, the PeHy-BL1, PeHy-BL2 and BMs groups’ results were undistinguishable and were assigned to the same putative ancestor group. Therefore, although showing the highest presence of PAs, BMs were not entirely differentiable at the genetic level from all the other lines. This implies that, in cases where the breeding aim is to obtain new varieties that are highly genetically distinguishable from those already present in the market, PeHy-BL1 and PeHy-BL2 lines may not match this goal. The average GD of these groups with the three more-similar BMs were, resulted respectively, 0.13 and 0.14. To determine GD thresholds which allow us to assess whether experimental materials are genetically different from already developed varieties is a key factor for breeding and variety registration. This is particularly important in cases of experimental materials presenting phenotypic distinctiveness for only a single descriptor, in order to assess whether they may have been directly derived from other varieties (defined initial varieties, IVs). In this case, new varieties should be considered EDVs, assigning royalties to the IV breeder [35,118,119]. It is not possible to define standardized thresholds and formulas that discriminate between EDVs, as their determination also depends on the proportion of total genetic variability inherent to each species. Nevertheless, the definition of species-specific values, together with the development of general operational guidelines for their calculation—to be integrated with phenotypic assessments—is of paramount importance for variety registration and PBRs. To date, competent authorities such as UPOV and CPVO did not establish mandatory species-specific thresholds and standard procedures, but have suggested voluntary methods with case studies in maize and French bean for general variety registration [34]. In addition, they endorsed the EDV thresholds determined by the International Seed Federation (ISF) for several species, i.e., perennial ryegrass, oilseed rape, cotton, lettuce and maize [120,121]. In these cases, thresholds were alternatively set following alternatively three principles, defined, respectively, as follows: “calibration”, (i.e., based on similarities of pairs of genotypes known to have close genetic relationships), “tail” (i.e., exploiting a given percentile of the distribution of similarities among independent genotypes) and “pedigree” (i.e., defining the parental contribution expected) principle [119,122]. In particular, these approaches led to the introduction of a threshold of 0.96 in lettuce based on the Jaccard genetic similarity coefficient, determined with AFLPs [120,122]. In maize with a set of 3072 SNPs, the values 0.91 and 0.95 of Roger’s genetic diversity were set to delimit the three cases of definite, potential and no EDV [123,124]. A recent study on pear set the same thresholds at, respectively, 90% and 96% of the genetic similarity with the simple matching (SM) coefficient found by analyzing samples with 310 multiple nucleotide polymorphisms (MNPs) [125]. However, up to now no study has investigated EDV molecular identification in petunia.
The collection under study showed a relatively low average level of Ho, especially if we consider its origin (breeding population of clones with a consistent structure) and its mode of reproduction (facultative allogamous species). This finding would suggest that the occurrence of crosses between closely related lines within the population may be possible, and hence that the population would be at least partially self-compatible [126,127,128]. Nevertheless, the inbreeding coefficient within subpopulations was generally low, highlighting a breeding selection effort to increase heterozygosis. The fact that the BL inbreeding coefficient within the entire population (Fit) did not considerably deviate from the equilibrium is the result of the high subpopulation differentiation, counteracted by low inbreeding coefficient within subpopulations. Heterozygosis levels (10–15%) even lower than those found in our study were reported in a genotyping analysis performed through RNAseq on petunia cultivars, mostly reported as F1 hybrids. In the mentioned study, lines were representative of the total variation observable in the market considering the growth habit trait. Authors suggested a quite limited genetic diversity for the total commercial P. × hybrida germplasm pool [7]. Our findings supported the efficacy of ddRADseq in performing a genetic diversity analysis in petunia clonal populations. Although target-sequencing techniques methods are highly cost-effective and reproducible, as described below, genome-wide approaches such as ddRADseq for genotyping remain advantageous not only when prior marker information is lacking, but also when a more exploratory approach is needed. Indeed, the de novo SNP discovery facilitates unbiased analyses, like genetic structure and diversity, allows us to adapt the marker set to a specific population and is less sensitive to project design errors [41,129,130].
Given the relatively high genetic uniformity found for sister lines, assessing the minimal number of SNPs required to preserve the clustering patterns—representing the most valuable information for breeding purposes—was of particular interest. In order to also keep also a unique molecular profile for each BL, a final set of more than one thousand SNPs was revealed to be necessary and was obtained through MAF-based filtering. The clustering was overall respected: despite the presence of minor changes, no sister line separation or alteration of the BL group’s hierarchical order was observed. The need for a high minimum number of variants to simultaneously maintain line discrimination and clustering was expected, given the high within-group uniformity. When clustering preservation was not considered, a genotype accumulation curve showed that eight polymorphisms were sufficient to uniquely identify all the BLs. This may provide an indication that retaining clustering patterns weighs far more heavily than retaining discrimination when determining the number of variants to be used. However, preserving the unique identification of accessions, even if not their grouping, is useful for PVP purposes in order to detect fraud or unknown materials. With a manually developed method, five discriminating variant sets, each composed of eight SNPs, were identified, assessing the minimal locus number to characterize the collection. The custom method provided in this study is usable in other populations and species, in order to identify the minimal discriminating SNP sets and help genotyping analyses.
Keeping the same clustering effects described in the one-thousand dataset, and without observing major group disruptions, the variant reduction could be extended—using MAF filtering—down to approximately one hundred SNPs. In addition, the non-discriminated genotypes obtained with this set only included a couple of sister lines previously hypothesized as belonging to the same clone. Therefore, the dataset of one hundred variants obtained may be useful for both PVP- and MAB-related purposes. In particular, its most valuable application for breeding may lie in parental selection. In this case, indeed, crosses are designed between the most genetically differentiated genotypes in order to achieve greater phenotypic variability or to combine multiple traits of interest. Hence, clustering reliability is of pivotal importance. Studies aimed at selecting SNP core sets were performed in collections of several species of agronomic interest, including lettuce, corn, barley, cotton and tobacco [65,68,131,132,133], whereas in ornamental species such investigations have been reported for rose, Phalaenopsis spp. and orchids [69,70,71]. In particular, in polyploid rose, a core set of 18 PACE markers provided reliable fingerprinting across thousands of accessions from the German Genebank for Roses, while in Cymbidium ensifolium (L.) (Orchidaceae family) 11 KASP markers were selected and used for the genotyping of 83 commercial cultivars in China. In Phalaenopsis spp. a core set of SNPs—obtained from an initial dataset of thousands of variants with the following filtering parameters: MAF 0.1, max-missing 0.8, min-meanDP 3, and hwe 0.01—was effectively used to genotype 53 Chinese cultivars. The reliability of the results was assessed by observing the absence of differences in sample clustering with a Neighbor-Joining tree between the core SNP set and the initial dataset [69].
In general, core sets capable of discriminating between plant accessions are characterized by a few dozen markers. However, in the majority of the studies, only genetic diversity measurements were performed, without carrying out clustering analyses and hence not assessing the core set potential for MAB purposes. The SNP number is in fact consistent with that of our discriminating datasets. Furthermore, the accessions analyzed in the above-mentioned studies were part of germplasm collections, characterized by a genetic diversity far greater than that observed in our breeding population. This might explain why a high number of variants was required in our study for respecting the clustering of the main groups. In addition, in the other works PIC is considered a key parameter for defining a core set of markers to discriminate between different accessions (e.g., [68,71]). However, in our study we were able to identify a suitable SNP dataset by primarily adjusting the MAF thresholds. Our results would suggest that MAF might be more efficient for the discrimination of strongly related lines (i.e., characterized by low genetic diversity), whereas it may be less useful than PIC if the genetic differentiation degree of the entire population is higher. Nevertheless, further studies testing the approach in other petunia populations are required to extend these findings. SNP core sets can be used for the development of target-specific molecular genotyping assays, in order to reduce analysis costs, favor comparisons between populations and help identifying sequences potentially involved in trait of interest [61,62,63,64,66,134,135]. In particular, low-cost targeted panels include, among others, the following: PCR-based methods relying on allele-specific primers and fluorescence, e.g., KASP; multiplexed PCR with fluorescence microfluidic arrays, e.g., Fluidigm; and highly multiplexed PCR and Next Generation Sequencing (NGS) of the amplified products, e.g., AmpliSeq panels [66,136,137,138,139]. However, these platforms differ in scale, detection method, and multiplexing capacity, hence their suitability is subordinated to the specific breeding goal.
The selection of SNPs located in coding sequences led to retaining less than half of the initial dataset variants. This result is in agreement with what was reported for ddRADseq analyses exploiting the same restriction enzymes for library preparation [140]. Indeed, PstI and MspI are methylation-sensitive and therefore tend to favor the selection of coding sequences with greater probability than methylation-unsensitive enzymes. CDS-located SNPs were more present in the distal zones of chromosomes, as can be attested by the centromere positions reported for P. × hybrida chromosomes [26]. Variant density along chromosomes was in agreement with what was already reported for ddRADseq analyses with methylation-sensitive enzymes [49]. In addition, SNP distribution reflected annotated gene density, suggesting higher recombination rates and euchromatic features in distal regions, whereas pericentromeric zones are characterized by reduced presence of coding sequences, suppressed recombination and epigenetic silencing [141,142,143]. In petunia, recombination studies based on RIL populations were carried out before the release of a high-quality sequenced genome of P. × hybrida, and interpretations concerning recombination-suppressed regions were limited by the absence of a physical map [24,25,104]. Nevertheless, the inferences reported about the recombination and methylation of petunia chromosomes were corroborated in other Solanaceae species, i.e., tomato and pepper, in which also a higher abundance of transposable elements and repetitive sequences was also observed in the pericentromeric heterochromatic regions [144,145,146].
The finding that more than half of the retrieved SNPs in our collection were located in non-coding regions even using methylation-sensitive restriction enzymes is likely a consequence of the higher conservation degrees of CDSs due to functional reasons and their consequently lower levels of polymorphisms [147]. Despite this, the exclusive use of SNPs located in CDS regions for the genotyping of the petunia clonal lines led to results entirely comparable to those obtained with the initial dataset (considering the GD, UPGMA clustering, PCA and structure analysis) and to the genealogy data (i.e., similar genotyping outputs for sister lines). Hence, the obtained data assessed the suitability of the exclusive use of CDS-located SNPs for genotyping analyses in the petunia clonal population. In addition, the retrieval of CDS-SNPs enabled the investigation of variants within genes putatively associated with pathways of interest. Indeed, several putative CDSs related to the biosynthesis of terpenoids, flavonoids, phenylpropanoids, and other secondary metabolites, as well as plant hormones were found to harbor SNPs. The production and regulation of these compounds is of particular relevance, as they are involved in key biological processes, such as herbivore defense, flower opening and floral scent emission in petunia, both for the species itself and for its use as a model system to study these traits [148,149,150,151,152,153]. In particular, SNPs were identified in two PHYTOCHROME INTERACTING FACTORs (PIFs), PhPIF1a (Peaxi162Scf00052g01817; PHYG54490) and PhPIF3 (Peaxi162Scf00194g00213; PHYG60033), which have previously been characterized as playing a role in floral scent emission in petunia by transcriptionally activating key regulatory genes and biosynthetic enzymes involved in this process [154]. Overall, these results suggest that ddRAD sequencing represents a suitable approach for the identification of genetic variants putatively involved in trait manifestation, providing valuable candidates for subsequent functional analyses.

5. Conclusions

To summarize, ddRADseq was demonstrated to be effective in uniquely discriminating within an experimental petunia population including highly uniform sister line groups, in assessing their genetic relationship and in providing informative data for breeding purposes. A method to identify the minimal discriminating variant sets was developed. The selection of an SNP core set of one hundred variants able to correctly cluster the collection was achieved too, suggesting MAF as the best filtering parameter for a relatively uniform petunia population. In addition, the exclusive use of SNPs located in coding sequences for the population genotyping and genetic diversity analysis was demonstrated to be effective. CDS-SNPs, as annotated genes, were more present in the distal areas of chromosomes. Furthermore, the utilized approach seemed to be suitable to detect candidate variants for targeted in-depth analyses in order to assess the variant putative involvement in the traits of interest’s manifestation. In conclusion, results encourage the adoption of GBS techniques and target-specific assays for PVP to guarantee PBRs, for the registration of novel varieties whether coupled with DUS testing and other phenotype measurements, and for MAB approaches in petunia. The production of molecular profiles relative to experimental plant materials can help in achieving of innovative traits for cultivars. This aspect is particularly important nowadays for the ornamental sector, in order to select traits relative to sustainability for cultivation. The reduction in the agronomic input needs of plants leads to an enhancement of economic and environmental security. Further studies are needed in two goal directions: the enhancement of basic molecular knowledge relative to petunia, and the development of useful applications for breeders, such as assays able to predict traits of interest’s values. Eventually, GBS analyses need to be extended to understudied species, as is the case for the majority of the ornamental plants commercialized worldwide nowadays.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/horticulturae12020160/s1, Supplementary Figure S1: Definition of the most likely number of putative ancestral populations (K) based on a Structure analysis on molecular profiles of 36 Petunia Breeding clonal Lines (BLs) and BenchMarks (BMs), obtained with 4349 SNPs from a ddRADseq approach. The blue points indicate the mean LnP(D) ± SD values. Mean LnP(D) ± SD is a function of K, as L′(K) = ΔLnP(D) and mean Delta K is calculated as ∣L″(K)∣/(SD(L(K)); Supplementary Figure S2: Genotype accumulation curve relative to 4349 SNPs from a ddRADseq approach on 36 petunia breeding clonal lines (BLs), created with a 10 k times loci resampling; Supplementary Figure S3: Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrogram based on a Genetic Distance (GD) analysis performed with Nei’s coefficient on molecular profiles of 36 Petunia Breeding clonal Lines (BLs) and BenchMarks (BMs), obtained with 1975 SNPs located in CDSs from a ddRADseq approach. Numbers on branches represent bootstrap values and colors highlight different subpopulations; Supplementary Figure S4: Principal Component Analysis (PCA) centroid based on the Genetic Distance (GD) analysis performed with molecular profiles of 36 Petunia Breeding clonal Lines (BLs) and BenchMarks (BMs), obtained with 1975 SNPs located in Coding Sequences (CDSs) from a ddRADseq approach. Colors and ellipses represent sister line groups. Axes explain the two components with the highest genetic variability and the variability percentage of every component is shown in the line plot; Supplementary Table S1: Sequencing results, after raw read quality filtering and mapping, of ddRADseq libraries relative to 36 Petunia Breeding clonal Lines (BLs). Color scale ranges from deep green for the highest values to red for the lowest. Q20: Phred Quality score. MRL: Mean Read Length; Supplementary Table S2: Triangular Genetic Similarity (GS) matrix based on pairwise comparisons performed with Nei’s Genetic Distance (GD) coefficient relative to total SNP profiles of 36 Petunia Breeding clonal Lines (BLs) and BanchMarks (BMs), with sister line groups, GD values and color scale ranging from deep green for the highest values to red for the lowest. Observed Heterozigosity (Ho), percentage of Private Alleles (PA) of the total PAs in the population and average observed (na) and effetive (ne) alleles per locus per line are reported; Supplementary Table S3: Kinship-based INference for Genome-wide association studies (KING) matrix based on pairwise comparisons done with the KING-robust method, relative to total SNP profiles of 36 Petunia Breeding clonal Lines (BLs) and BanchMarks (BMs), with sister line groups, GD values and color scale ranging from deep green for the hlowest values to red for the highest; Supplementary Table S4: Genetic parameters of 103 loci relative to ddRADseq SNP profiles of 36 Petunia Breeding clonal Lines (BLs), filtered out sites presentng a Minor Allele Frequency (MAF) minor to 0.45, with: average Polymorphic Information Content per locus (AvgPIC); observed (Ho), expected in subpopulation (Hs) and expected in total population (Ht) Heterozygosity; inbreeding coefficnt (Fis); fixation index (Fst); Supplementary Table S5: Minimal variant subsets, each including 8 SNPs, discriminating a collection of 36 Petunia Breeding clonal Lines (BLs) and triangular Nei’s Genetic Diversity (GD) matrixes obtained with each set. GD values and color scale ranging from deep green for the highest values to red for the lowest; Supplementary Table S6: Triangular Genetic Similarity (GS) matrix based on pairwise comparisons with Nei’s Genetic Distance (GD) coefficient relative to molecular profiles of 36 Petunia Breeding clonal Lines (BLs) and BanchMarks (BMs), comprising 1975 SNP located in CDSs, with GD values and color scale ranging from deep green for the highest values to red for the lowest. Observed Heterozigosity (Ho) per BL (Sample) and sister line group (Pop), expected Heterozigosity (Hs) and inbreeding coefficient (Fis) are reported; Supplementary Table S7: KEGG-based functional annotation of putative P. × hybrida genes harboring CDS-SNPs, identified by ddRADseq in 36 Petunia breeding clonal lines (BLs), and aligned with orthologs from P. axillaris and A. thaliana.

Author Contributions

Conceptualization, methodology, formal analysis, A.B.; software, A.B., G.G., D.R. and F.S.; writing—original draft preparation, A.B.; writing—review and editing, A.B., F.P., S.F., D.R., G.G. and A.V.; supervision, F.P. and G.B.; project administration, G.B.; funding acquisition, G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This study was carried out within the research contract signed by the Gruppo Padana S.S. company (Paese, TV, Italy), and Department of Agronomy, Food, Natural resources, Animals and Environment (DAFNAE), University of Padua (Italy), within action IV.5_GREEN, PON 2014–2021.

Data Availability Statement

The datasets generated for this study can be found in GenBank under the BioProject accession number PRJNA1260944. The complete code of the custom R functions described in chapter 2.5 is publicly available on GitHub at https://github.com/Gabelberg/minimal_discriminating_marker_set (accessed on 15 January 2026). The exact version of the code used in this study corresponds to the GitHub release v1.0.0, which is permanently accessible at: https://github.com/Gabelberg/minimal_discriminating_marker_set/releases/tag/v1.0.0 (accessed on 15 January 2026).

Acknowledgments

The authors would like to thank Marco Gazzola (Gruppo Padana contact person) for funding and for providing plant material.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Aswath, C.; Bose, T.K.; Bhatia, R.; Saha, T.N.; Dutta, K. Commercial Flowers; Daya Publishing House, Ed.; Daya Publishing: Delhi, India, 2021; Volume 4, ISBN 9789354614163. [Google Scholar]
  2. Chen, Q.X.C.; Warner, R.M. Identification of QTL for Plant Architecture and Flowering Performance Traits in a Multi-Environment Evaluation of a Petunia axillaris × P. exserta Recombinant Inbred Line Population. Horticulturae 2022, 8, 1006. [Google Scholar] [CrossRef]
  3. USDA. Floriculture Crops 2020 Summary; USDA: Washington, DC, USA, 2021. [Google Scholar]
  4. Gebhardt, C. The Historical Role of Species from the Solanaceae Plant Family in Genetic Research. Theor. Appl. Genet. 2016, 129, 2281. [Google Scholar] [CrossRef]
  5. Sink, K.C. Petunia; Springer: Berlin/Heidelberg, Germany, 1984; Volume 9. [Google Scholar] [CrossRef]
  6. Geitmann, A. Petunia: Evolutionary, Developmental and Physiological Genetics. Ann. Bot. 2011, 107, vi–vii. [Google Scholar] [CrossRef]
  7. Guo, Y.; Warner, R.M. Dissecting Genetic Diversity and Genomic Background of Petunia cultivars with Contrasting Growth Habits. Hortic. Res. 2020, 7, 155. [Google Scholar] [CrossRef]
  8. Hermann, K.; Klahre, U.; Venail, J.; Brandenburg, A.; Kuhlemeier, C. The Genetics of Reproductive Organ Morphology in Two Petunia Species with Contrasting Pollination Syndromes. Planta 2015, 241, 1241–1254. [Google Scholar] [CrossRef]
  9. Rodrigues, D.M.; Caballero-Villalobos, L.; Turchetto, C.; Assis Jacques, R.; Kuhlemeier, C.; Freitas, L.B. Do We Truly Understand Pollination Syndromes in Petunia as Much as We Suppose? AoB Plants 2018, 10, ply057. [Google Scholar] [CrossRef] [PubMed]
  10. Regalado, J.J.; Carmona-Martín, E.; Querol, V.; Veléz, C.G.; Encina, C.L.; Pitta-Alvarez, S.I. Production of Compact Petunias through Polyploidization. Plant Cell Tissue Organ Cult. 2017, 129, 61–71. [Google Scholar] [CrossRef]
  11. Dermen, H. Polyploidy in Petunia. Am. J. Bot. 1931, 18, 250–261. [Google Scholar] [CrossRef]
  12. Robbins, T.P.; Harbord, R.M.; Sonneveld, T.; Clarke, K. The Molecular Genetics of Self-Incompatibility in Petunia hybrida. Ann. Bot. 2000, 85, 105–112. [Google Scholar] [CrossRef][Green Version]
  13. Williams, J.S.; Wu, L.; Li, S.; Sun, P.; Kao, T.-H.; Robbins, T.P.; Sims, T.L. Insight into S-RNase-Based Self-Incompatibility in Petunia: Recent Findings and Future Directions. Front. Plant Sci. 2015, 6, 41. [Google Scholar] [CrossRef]
  14. Yue, Y.; Du, J.; Li, Y.; Thomas, H.R.; Frank, M.H.; Wang, L.; Hu, H. Insight into the Petunia Dof Transcription Factor Family Reveals a New Regulator of Male-Sterility. Ind. Crops Prod. 2021, 161, 113196. [Google Scholar] [CrossRef]
  15. Nivison, H.T.; Hanson, M.R. Identification of a Mitochondrial Protein Associated with Cytoplasmic Male Sterility in Petunia. Plant Cell 1989, 1, 1121–1130. [Google Scholar] [CrossRef][Green Version]
  16. Farinati, S.; Draga, S.; Betto, A.; Palumbo, F.; Vannozzi, A.; Lucchin, M.; Barcaccia, G. Current Insights and Advances into Plant Male Sterility: New Precision Breeding Technology Based on Genome Editing Applications. Front. Plant Sci. 2023, 14, 1223861. [Google Scholar] [CrossRef]
  17. Gerats, T.; Vandenbussche, M. A Model System for Comparative Research: Petunia. Trends Plant Sci. 2005, 10, 251–256. [Google Scholar] [CrossRef] [PubMed]
  18. Melzer, R.; Janssen, B.J.; Dornelas, M.C.; Vandenbussche, M.; Chambrier, P.; Rodrigues Bento, S.; Morel, P. Petunia, Your Next Supermodel? Front. Plant Sci. 2016, 7, 72. [Google Scholar] [CrossRef]
  19. Meyer, P.; Heidmann, I.; Forkmann, G.; Saedler, H. A New Petunia Flower Colour Generated by Transformation of a Mutant with a Maize Gene. Nature 1987, 330, 677–678. [Google Scholar] [CrossRef] [PubMed]
  20. Mahmood-ur-Rahman, M.; Ali, I.; Husnain, T.; Riazuddin, S. RNA Interference: The Story of Gene Silencing in Plants and Humans. Elsevier Biotechnol. Adv. 2008, 26, 202–209. [Google Scholar] [CrossRef] [PubMed]
  21. Napoli, C.; Lemieux, C.; Jorgensen, R. Introduction of a Chimeric Chalcone Synthase Gene into Petunia Results in Reversible Co-Suppression of Homologous Genes in Trans. Plant Cell 1990, 2, 279–289. [Google Scholar] [CrossRef]
  22. Cao, Z.; Guo, Y.; Yang, Q.; He, Y.; Fetouh, M.I.; Warner, R.M.; Deng, Z. Genome-Wide Identification of Quantitative Trait Loci for Important Plant and Flower Traits in Petunia Using a High-Density Linkage Map and an Interspecific Recombinant Inbred Population Derived from Petunia integrifolia and P. axillaris. Hortic. Res. 2019, 6, 27. [Google Scholar] [CrossRef]
  23. Wang, G.; Zhang, W.; Ruan, Y.; Dai, B.; Yang, T.; Gou, T.; Liu, C.; Ning, G.; Liu, G.; Yu, Y.; et al. Construction of a High-Density Genetic Map and Mapping of Double Flower Genes in Petunia. Sci. Hortic. 2024, 329, 112988. [Google Scholar] [CrossRef]
  24. Bossolini, E.; Klahre, U.; Brandenburg, A.; Reinhardt, D.; Kuhlemeier, C.; Belzile, F. High Resolution Linkage Maps of the Model Organism Petunia Reveal Substantial Synteny Decay with the Related Genome of Tomato. Genome 2011, 54, 327–340. [Google Scholar] [CrossRef]
  25. Bombarely, A.; Moser, M.; Amrad, A.; Bao, M.; Bapaume, L.; Barry, C.S.; Bliek, M.; Boersma, M.R.; Borghi, L.; Bruggmann, R.; et al. Insight into the Evolution of the Solanaceae from the Parental Genomes of Petunia hybrida. Nat. Plants 2016, 2, 16074. [Google Scholar] [CrossRef]
  26. Saei, A.; Hunter, D.; Hilario, E.; David, C.; Ireland, H.; Esfandiari, A.; King, I.; Grierson, E.; Wang, L.; Boase, M.; et al. Chromosome-Level Genome Assembly and Annotation of Petunia hybrida. Sci. Data 2024, 12, 1262. [Google Scholar] [CrossRef] [PubMed]
  27. Kashikar, S.G.; Khalatkar, A.S. Breeding for Flower Colour in Petunia Hybrida Hort. In Proceedings of the Symposium on Vegetable and Flower seed Production 111; International Society for Horticultural Science (ISHS): Forli, Italy, 1981; pp. 35–40. [Google Scholar]
  28. Man-Zhu, D.; Se-Ping, B. Advances in Genetics and Breeding of Petunia hybrida Vilm. Chin. Bull. Bot. 2004, 21, 385. [Google Scholar]
  29. Masclaux-Daubresse, C.; Daniel-Vedele, F.; Dechorgnat, J.; Chardon, F.; Gaufichon, L.; Suzuki, A. Nitrogen Uptake, Assimilation and Remobilization in Plants: Challenges for Sustainable and Productive Agriculture. Ann. Bot. 2010, 105, 1141–1157. [Google Scholar] [CrossRef] [PubMed]
  30. Lütken, H.; Clarke, J.L.; Müller, R. Genetic Engineering and Sustainable Production of Ornamentals: Current Status and Future Directions. Plant Cell Rep. 2012, 31, 1141–1157. [Google Scholar] [CrossRef]
  31. CPVO Legislation in Force. Available online: https://cpvo.europa.eu/en/about-us/law-and-practice/legislation-in-force (accessed on 27 October 2024).
  32. UPOV Lex. Available online: https://upovlex.upov.int/en/convention (accessed on 27 October 2024).
  33. CPVO Protocol for Tests on Distinctness, Uniformity and Stability. Petunia Juss. × Petchoa J.M.H. Shaw. Available online: https://cpvo.europa.eu/sites/default/files/documents/petunia_2.pdf (accessed on 9 May 2025).
  34. UPOV Guidelines for the Conduct of Tests for Distinctness, Uniformity and Stability-Petunia (Petunia Juss.). Available online: https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://cpvo.europa.eu/sites/default/files/documents/petunia_2.pdf&ved=2ahUKEwj1uNWP7qaSAxVyhv0HHQz9OpIQFnoECBwQAQ&usg=AOvVaw2AfowugxhJGzXpqgTvDkKK (accessed on 15 January 2026).
  35. Yu, J.K.; Chung, Y.S. Plant Variety Protection: Current Practices and Insights. Genes 2021, 12, 1127. [Google Scholar] [CrossRef]
  36. Gilliland, T.J.; Annicchiarico, P.; Julier, B.; Ghesquière, M. A Proposal for Enhanced EU Herbage VCU and DUS Testing Procedures. Grass Forage Sci. 2020, 75, 227–241. [Google Scholar] [CrossRef]
  37. CPVO Guidance on the Use of Biochemical and Molecular Markers in the Examination of Distinctiveness, Uniformity and Stability (DUS). Available online: https://www.upov.int/edocs/tgpdocs/en/tgp_15.pdf (accessed on 4 May 2025).
  38. Patella, A.; Scariolo, F.; Palumbo, F.; Barcaccia, G. Genetic Structure of Cultivated Varieties of Radicchio (Cichorium intybus L.): A Comparison between F1 Hybrids and Synthetics. Plants 2019, 8, 213. [Google Scholar] [CrossRef]
  39. Seki, K.; Komatsu, K.; Hiraga, M.; Tanaka, K.; Uno, Y.; Matsumura, H. Development of PCR-Based Marker for Resistance to Fusarium Wilt Race 2 in Lettuce (Lactuca sativa L.). Euphytica 2021, 217, 126. [Google Scholar] [CrossRef]
  40. Zhigunov, A.V.; Ulianich, P.S.; Lebedeva, M.V.; Chang, P.L.; Nuzhdin, S.V.; Potokina, E.K. Development of F1 Hybrid Population and the High-Density Linkage Map for European Aspen (Populus tremula L.) Using RADseq Technology. BMC Plant Biol. 2017, 17, 180. [Google Scholar] [CrossRef]
  41. Peterson, B.K.; Weber, J.N.; Kay, E.H.; Fisher, H.S.; Hoekstra, H.E. Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species. PLoS ONE 2012, 7, e37135. [Google Scholar] [CrossRef]
  42. Chiurugwi, T.; Kemp, S.; Powell, W.; Hickey, L.T. Speed Breeding Orphan Crops. Theor. Appl. Genet. 2019, 132, 607–616. [Google Scholar] [CrossRef]
  43. Johnson, M.; Deshpande, S.; Vetriventhan, M.; Upadhyaya, H.D.; Wallace, J.G. Genome-Wide Population Structure Analyses of Three Minor Millets: Kodo Millet, Little Millet, and Proso Millet. Plant Genome 2019, 12, 190021. [Google Scholar] [CrossRef] [PubMed]
  44. Russell, J.; Hackett, C.; Hedley, P.; Liu, H.; Milne, L.; Bayer, M.; Marshall, D.; Jorgensen, L.; Gordon, S.; Brennan, R. The Use of Genotyping by Sequencing in Blackcurrant (Ribes nigrum): Developing High-Resolution Linkage Maps in Species without Reference Genome Sequences. Mol. Breed. 2014, 33, 835–849. [Google Scholar] [CrossRef]
  45. Praveen, P.; Gopal, R.; Ramakrishnan, U. The Population Structure of Invasive Lantana Camara Is Shaped by Its Mating System. eLife 2024, 14, RP104988. [Google Scholar] [CrossRef]
  46. Zhang, X.J.; Liu, X.F.; Liu, D.T.; Cao, Y.R.; Li, Z.H.; Ma, Y.P.; Ma, H. Genetic Diversity and Structure of Rhododendron meddianum, a Plant Species with Extremely Small Populations. Plant Divers. 2021, 43, 472–479. [Google Scholar] [CrossRef] [PubMed]
  47. Teixeira, T.M.; Nazareno, A.G. One Step Away From Extinction: A Population Genomic Analysis of A Narrow Endemic, Tropical Plant Species. Front. Plant Sci. 2021, 12, 730258. [Google Scholar] [CrossRef]
  48. Ramirez-Ramirez, A.R.; Bidot-Martínez, I.; Mirzaei, K.; Rasoamanalina Rivo, O.L.; Menéndez-Grenot, M.; Clapé-Borges, P.; Espinosa-Lopez, G.; Bertin, P. Comparing the Performances of SSR and SNP Markers for Population Analysis in Theobroma cacao L., as Alternative Approach to Validate a New DdRADseq Protocol for Cacao Genotyping. PLoS ONE 2024, 19, e0304753. [Google Scholar] [CrossRef]
  49. Molina, C.; Aguirre, N.C.; Vera, P.A.; Filippi, C.V.; Puebla, A.F.; Poltri, S.N.M.; Paniego, N.B.; Acevedo, A. DdRADseq-Mediated Detection of Genetic Variants in Sugarcane. Plant Mol. Biol. 2023, 111, 205–219. [Google Scholar] [CrossRef]
  50. McLean-Rodríguez, F.D.; Costich, D.E.; Camacho-Villa, T.C.; Pè, M.E.; Dell’Acqua, M. Genetic Diversity and Selection Signatures in Maize Landraces Compared across 50 Years of in Situ and Ex Situ Conservation. Heredity 2021, 126, 913–928. [Google Scholar] [CrossRef] [PubMed]
  51. Luo, H.; Guo, J.; Yu, B.; Chen, W.; Zhang, H.; Zhou, X.; Chen, Y.; Huang, L.; Liu, N.; Ren, X.; et al. Construction of DdRADseq-Based High-Density Genetic Map and Identification of Quantitative Trait Loci for Trans-Resveratrol Content in Peanut Seeds. Front. Plant Sci. 2021, 12, 644402. [Google Scholar] [CrossRef]
  52. Martina, M.; Acquadro, A.; Gulino, D.; Brusco, F.; Rabaglio, M.; Portis, E.; Lanteri, S. First Genetic Maps Development and QTL Mining in Ranunculus asiaticus L. through DdRADseq. Front. Plant Sci. 2022, 13, 1009206. [Google Scholar] [CrossRef]
  53. Ksouri, N.; Sánchez, G.; Forcada, C.F.I.; Contreras-Moreira, B.; Gogorcena, Y. DdRAD-Seq-Derived SNPs Reveal Novel Association Signatures for Fruit-Related Traits in Peach. bioRxiv 2023. [Google Scholar] [CrossRef]
  54. Daemi-Saeidabad, M.; Shojaeiyan, A.; Vivian-Smith, A.; Stenøien, H.K.; Falahati-Anbaran, M. The Taxonomic Significance of DdRADseq Based Microsatellite Markers in the Closely Related Species of Heracleum (Apiaceae). PLoS ONE 2020, 15, e0232471. [Google Scholar] [CrossRef]
  55. Reinula, I.; Träger, S.; Hernández-Agramonte, I.M.; Helm, A.; Aavik, T. Landscape Genetic Analysis Suggests Stronger Effects of Past than Current Landscape Structure on Genetic Patterns of Primula veris. Divers. Distrib. 2021, 27, 1648–1662. [Google Scholar] [CrossRef]
  56. Ma, X.; Wang, H.; Hu, Y.; Hu, Q.; Qian, R.; Zheng, J.; Zhang, X. DdRAD-Seq Data Reveals the Significant Population Structure and Potential Causes of Population Divergence in Bougainvillea. Sci. Rep. 2025, 15, 34200. [Google Scholar] [CrossRef] [PubMed]
  57. Han, B.; Huang, S.; Huang, G.; Wu, X.; Jin, H.; Liu, Y.; Xiao, Y.; Zhou, R. Genetic Relatedness and Association Mapping of Horticulturally Valuable Traits for the Ceiba Plants Using DdRAD Sequencing. Hortic. Plant J. 2023, 9, 826–836. [Google Scholar] [CrossRef]
  58. Betto, A.; Palumbo, F.; Riommi, D.; Vannozzi, A.; Barcaccia, G. Harnessing Genomics for Breeding Lantana camara L.: Genotyping and Ploidy Testing of Clonal Lines Through DdRADseq Applications. Int. J. Mol. Sci. 2025, 26, 4898. [Google Scholar] [CrossRef]
  59. Roy, S.C.; Moitra, K.; De Sarker, D. Assessment of Genetic Diversity among Four Orchids Based on DdRAD Sequencing Data for Conservation Purposes. Physiol. Mol. Biol. Plants 2017, 23, 169–183. [Google Scholar] [CrossRef]
  60. Moreau, E.P.; Honig, J.A.; Molnar, T.J. High-Density Linkage Mapping and Identification of Quantitative Trait Loci Associated with Powdery Mildew Resistance in Flowering Dogwood (Cornus florida). Horticulturae 2022, 8, 405. [Google Scholar] [CrossRef]
  61. Kim, H.; Yoon, J.B.; Lee, J. Development of Fluidigm SNP Type Genotyping Assays for Marker-Assisted Breeding of Chili Pepper (Capsicum annuum L.). Hortic. Sci. Technol. 2017, 35, 465–479. [Google Scholar] [CrossRef]
  62. Scaglione, D.; Pinosio, S.; Marroni, F.; Di Centa, E.; Fornasiero, A.; Magris, G.; Scalabrin, S.; Cattonaro, F.; Taylor, G.; Morgante, M. Single Primer Enrichment Technology as a Tool for Massive Genotyping: A Benchmark on Black Poplar and Maize. Ann. Bot. 2019, 124, 543–551. [Google Scholar] [CrossRef]
  63. Sun, X.; Liu, D.; Zhang, X.; Li, W.; Liu, H.; Hong, W.; Jiang, C.; Guan, N.; Ma, C.; Zeng, H.; et al. SLAF-Seq: An Efficient Method of Large-Scale De Novo SNP Discovery and Genotyping Using High-Throughput Sequencing. PLoS ONE 2013, 8, 58700. [Google Scholar] [CrossRef]
  64. Yang, Q.; Zhang, J.; Shi, X.; Chen, L.; Qin, J.; Zhang, M.; Yang, C.; Song, Q.; Yan, L.; Yang, Q.; et al. Development of SNP Marker Panels for Genotyping by Target Sequencing (GBTS) and Its Application in Soybean. Mol. Breed. 2023, 43, 26–27. [Google Scholar] [CrossRef]
  65. Dou, T.; Wang, C.; Ma, Y.; Chen, Z.; Zhang, J.; Guo, G. CoreSNP: An Efficient Pipeline for Core Marker Profile Selection from Genome-Wide SNP Datasets in Crops. BMC Plant Biol. 2023, 23, 580. [Google Scholar] [CrossRef]
  66. Sung, C.J.; Kulkarni, R.; Hillhouse, A.; Simpson, C.E.; Cason, J.; Burow, M.D. Reduced-Cost Genotyping by Resequencing in Peanut Breeding Programs Using Tecan Allegro Targeted Resequencing V2. Genes 2024, 15, 1364. [Google Scholar] [CrossRef]
  67. Broccanello, C.; Chiodi, C.; Funk, A.; Mcgrath, J.M.; Panella, L.; Stevanato, P. Comparison of Three PCR-Based Assays for SNP Genotyping in Plants. Plant Methods 2018, 14, 28. [Google Scholar] [CrossRef]
  68. Park, J.S.; Kang, M.Y.; Shim, E.J.; Oh, J.H.; Seo, K.I.; Kim, K.S.; Sim, S.C.; Chung, S.M.; Park, Y.; Lee, G.P.; et al. Genome-Wide Core Sets of SNP Markers and Fluidigm Assays for Rapid and Effective Genotypic Identification of Korean Cultivars of Lettuce (Lactuca sativa L.). Hortic. Res. 2022, 9, uhac119. [Google Scholar] [CrossRef]
  69. Chen, X.; Wang, Q.; Wang, F.; Wu, X.; Pan, Y.; Xue, L.; Duan, Y.; Wang, S.; Guan, Y.; Zhao, K.; et al. SNP Fingerprint Database and Makers Screening for Current Phalaenopsis Cultivars. Ornam. Plant Res. 2025, 5, e011. [Google Scholar] [CrossRef]
  70. Patzer, L.; Thomsen, T.; Wamhoff, D.; Schulz, D.F.; Linde, M.; Debener, T. Development of a Robust SNP Marker Set for Genotyping Diverse Gene Bank Collections of Polyploid Roses. BMC Plant Biol. 2024, 24, 1076. [Google Scholar] [CrossRef] [PubMed]
  71. Shen, B.; Shen, A.; Tan, Y.; Liu, L.; Li, S.; Tan, Z. Development of KASP Markers, SNP Fingerprinting and Population Genetic Analysis of Cymbidium ensifolium (L.) Sw. Germplasm Resources in China. Front. Plant Sci. 2024, 15, 1460603. [Google Scholar] [CrossRef]
  72. Abed, A.; Légaré, G.; Pomerleau, S.; St-Cyr, J.; Boyle, B.; Belzile, F.J. Genotyping-by-Sequencing on the Ion Torrent Platform in Barley. Methods Mol. Biol. 2019, 1900, 233–252. [Google Scholar] [CrossRef]
  73. Li, H. Aligning Sequence Reads, Clone Sequences and Assembly Contigs with BWA-MEM. arXiv 2013, arXiv:1303.3997. [Google Scholar] [CrossRef]
  74. Danecek, P.; Bonfield, J.K.; Liddle, J.; Marshall, J.; Ohan, V.; Pollard, M.O.; Whitwham, A.; Keane, T.; McCarthy, S.A.; Davies, R.M. Twelve Years of SAMtools and BCFtools. Gigascience 2021, 10, giab008. [Google Scholar] [CrossRef] [PubMed]
  75. Catchen, J.; Hohenlohe, P.A.; Bassham, S.; Amores, A.; Cresko, W.A. Stacks: An Analysis Tool Set for Population Genomics. Mol. Ecol. 2013, 22, 3124–3140. [Google Scholar] [CrossRef] [PubMed]
  76. Kamvar, Z.N.; Tabima, J.F.; Grünwald, N.J. Poppr: An R Package for Genetic Analysis of Populations with Clonal, Partially Clonal, and/or Sexual Reproduction. PeerJ 2014, 2014, e281. [Google Scholar] [CrossRef]
  77. Nei, M. Analysis of Gene Diversity in Subdivided Populations. Proc. Natl. Acad. Sci. USA 1973, 70, 3321–3323. [Google Scholar] [CrossRef]
  78. Chang, C.C.; Chow, C.C.; Tellier, L.C.A.M.; Vattikuti, S.; Purcell, S.M.; Lee, J.J. Second-Generation PLINK: Rising to the Challenge of Larger and Richer Datasets. Gigascience 2015, 4, s13742-015-0047-8. [Google Scholar] [CrossRef]
  79. Manichaikul, A.; Mychaleckyj, J.C.; Rich, S.S.; Daly, K.; Sale, M.; Chen, W.M. Robust Relationship Inference in Genome-Wide Association Studies. Bioinformatics 2010, 26, 2867–2873. [Google Scholar] [CrossRef]
  80. Sinnwell, J.P.; Therneau, T.M.; Schaid, D.J. The Kinship2 R Package for Pedigree Data. Hum. Hered. 2014, 78, 91–93. [Google Scholar] [CrossRef]
  81. Van Der Ploeg, A. Drawing Non-Layered Tidy Trees in Linear Time. Softw. Pract. Exp. 2014, 44, 1467–1484. [Google Scholar] [CrossRef]
  82. Gruber, B.; Unmack, P.J.; Berry, O.F.; Georges, A. Dartr: An r Package to Facilitate Analysis of SNP Data Generated from Reduced Representation Genome Sequencing. Mol. Ecol. Resour. 2018, 18, 691–699. [Google Scholar] [CrossRef] [PubMed]
  83. Pritchard, J.; Stephens, M.; Donnelly, P. Inference of Population Structure Using Multilocus Genotype Data. Genetics 2000, 155, 945–959. [Google Scholar] [CrossRef]
  84. Earl, D.A.; vonHoldt, B.M. STRUCTURE HARVESTER: A Website and Program for Visualizing STRUCTURE Output and Implementing the Evanno Method. Conserv. Genet. Resour. 2012, 4, 359–361. [Google Scholar] [CrossRef]
  85. Goudet, J. Hierfstat, a Package for R to Compute and Test Hierarchical F-Statistics. Mol. Ecol. Notes 2005, 5, 184–186. [Google Scholar] [CrossRef]
  86. Gabelli, G. Minimal Discriminating Marker Set Finder. Available online: https://github.com/Gabelberg/minimal_discriminating_marker_set/blob/main/README.md (accessed on 15 January 2026).
  87. Krzywinski, M.; Schein, J.; Birol, İ.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An Information Aesthetic for Comparative Genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef]
  88. Wang, B.; Yang, X.; Jia, Y.; Xu, Y.; Jia, P.; Dang, N.; Wang, S.; Xu, T.; Zhao, X.; Gao, S.; et al. High-Quality Arabidopsis Thaliana Genome Assembly with Nanopore and HiFi Long Reads. Genom. Proteom. Bioinform. 2022, 20, 4–13. [Google Scholar] [CrossRef]
  89. Ge, S.X.; Jung, D.; Yao, R. ShinyGO: A Graphical Gene-Set Enrichment Tool for Animals and Plants. Bioinformatics 2020, 36, 2628–2629. [Google Scholar] [CrossRef]
  90. Krishnan, G.S.; Singh, A.K.; Waters, D.L.E.; Henry, R.J. Molecular Markers for Harnessing Heterosis. Mol. Markers Plants 2012, 119–136. [Google Scholar] [CrossRef]
  91. Caballero, A.; Villanueva, B.; Druet, T. On the Estimation of Inbreeding Depression Using Different Measures of Inbreeding from Molecular Markers. Evol. Appl. 2020, 14, 416–428. [Google Scholar] [CrossRef]
  92. Nguyen, T.K.; Ha, S.T.T.; Lim, J.H. Analysis of Chrysanthemum Genetic Diversity by Genotyping-by-Sequencing. Hortic. Environ. Biotechnol. 2020, 61, 903–913. [Google Scholar] [CrossRef]
  93. Karikari, B.; Li, S.; Bhat, J.A.; Cao, Y.; Kong, J.; Yang, J.; Gai, J.; Zhao, T. Genome-Wide Detection of Major and Epistatic Effect QTLs for Seed Protein and Oil Content in Soybean Under Multiple Environments Using High-Density Bin Map. Int. J. Mol. Sci. 2019, 20, 979. [Google Scholar] [CrossRef]
  94. Yu, Q.; Ling, Y.; Xiong, Y.; Zhao, W.; Xiong, Y.; Dong, Z.; Yang, J.; Zhao, J.; Zhang, X.; Ma, X. RAD-Seq as an Effective Strategy for Heterogenous Variety Identification in Plants—A Case Study in Italian Ryegrass (Lolium multiflorum). BMC Plant Biol. 2022, 22, 231. [Google Scholar] [CrossRef]
  95. Peltier, D.; Farcy, E.; Dulieu, H.; Bervillé, A. Origin, Distribution and Mapping of RAPD Markers from Wild Petunia Species in Petunia hybrida Hort Lines. Theor. Appl. Genet. 1994, 88, 637–645. [Google Scholar] [CrossRef]
  96. Galliot, C.; Hoballah, M.E.; Kuhlemeier, C.; Stuurman, J. Genetics of Flower Size and Nectar Volume in Petunia Pollination Syndromes. Planta 2006, 225, 203–212. [Google Scholar] [CrossRef] [PubMed]
  97. Strommer, J.; Peters, J.; Zethof, J.; De Keukeleire, P.; Gerats, T. AFLP Maps of Petunia hybrida: Building Maps When Markers Cluster. Theor. Appl. Genet. 2002, 105, 1000–1009. [Google Scholar] [CrossRef] [PubMed]
  98. Vallejo, V.A.; Tychonievich, J.; Lin, W.K.; Wangchu, L.; Barry, C.S.; Warner, R.M. Identification of QTL for Crop Timing and Quality Traits in an Interspecific Petunia Population. Mol. Breed. 2015, 35, 2. [Google Scholar] [CrossRef]
  99. Liu, C.; He, Y.; Gou, T.; Li, X.; Ning, G.; Bao, M. Identification of Molecular Markers Associated with the Double Flower Trait in Petunia hybrida. Sci. Hortic. 2016, 206, 43–50. [Google Scholar] [CrossRef]
  100. Klahre, U.; Gurba, A.; Hermann, K.; Saxenhofer, M.; Bossolini, E.; Guerin, P.M.; Kuhlemeier, C. Pollinator Choice in Petunia Depends on Two Major Genetic Loci for Floral Scent Production. Curr. Biol. 2011, 21, 730–739. [Google Scholar] [CrossRef]
  101. Pezzi, P.H.; Guzmán-Rodriguez, S.; Giudicelli, G.C.; Turchetto, C.; Bombarely, A.; Freitas, L.B. A Convoluted Tale of Hybridization between Two Petunia Species from a Transitional Zone in South America. Perspect. Plant Ecol. Evol. Syst. 2022, 56, 125688. [Google Scholar] [CrossRef]
  102. Caballero-Villalobos, L.; Silva-Arias, G.A.; Turchetto, C.; Giudicelli, G.C.; Petzold, E.; Bombarely, A.; Freitas, L.B. Neutral and Adaptive Genomic Variation in Hybrid Zones of Two Ecologically Diverged Petunia Species (Solanaceae). Bot. J. Linn. Soc. 2021, 196, 100–122. [Google Scholar] [CrossRef]
  103. Guo, Y.; Wiegert-Rininger, K.E.; Vallejo, V.A.; Barry, C.S.; Warner, R.M. Transcriptome-Enabled Marker Discovery and Mapping of Plastochron-Related Genes in Petunia spp. BMC Genom. 2015, 16, 726. [Google Scholar] [CrossRef]
  104. Guo, Y.; Lin, W.K.; Chen, Q.; Vallejo, V.A.; Warner, R.M. Genetic Determinants of Crop Timing and Quality Traits in Two Interspecific Petunia Recombinant Inbred Line Populations. Sci. Rep. 2017, 7, 3200. [Google Scholar] [CrossRef]
  105. Elshire, R.J.; Glaubitz, J.C.; Sun, Q.; Poland, J.A.; Kawamoto, K.; Buckler, E.S.; Mitchell, S.E. A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species. PLoS ONE 2011, 6, e19379. [Google Scholar] [CrossRef] [PubMed]
  106. Berardi, A.E.; Esfeld, K.; Jäggi, L.; Mandel, T.; Cannarozzi, G.M.; Kuhlemeier, C. Complex Evolution of Novel Red Floral Color in Petunia. Plant Cell 2021, 33, 2273–2295. [Google Scholar] [CrossRef] [PubMed]
  107. Arnaud-Haond, S.; Duarte, C.M.; Alberto, F.; Serrão, E.A. Standardizing Methods to Address Clonality in Population Studies. Mol. Ecol. 2007, 16, 5115–5139. [Google Scholar] [CrossRef] [PubMed]
  108. Clifton, M.D.; D’Ombrain, T.H.; Watts-Parker, B.D.; Wills, A.R.; Hoebee, S.E. Genomic Evidence for a Hybrid Origin of the Critically Endangered Shrub Callistemon kenmorrisonii (Myrtaceae) and Persistence Involving Extreme Clonality. Conserv. Genet. 2025, 26, 1011–1028. [Google Scholar] [CrossRef]
  109. Lasso, E. The Importance of Setting the Right Genetic Distance Threshold for Identification of Clones Using Amplified Fragment Length Polymorphism: A Case Study with Five Species in the Tropical Plant Genus Piper. Mol. Ecol. Resour. 2008, 8, 74–82. [Google Scholar] [CrossRef]
  110. Flesher, K.N.; Jurgensen, M.F.; Gailing, O. Comparison of Phenotypic and Genetic Clone Delineation in Quaking Aspen, Populus tremuloides. Trees-Struct. Funct. 2016, 30, 1657–1667. [Google Scholar] [CrossRef]
  111. Coste, C.F.D.; Ronget, V.; Ramirez-Loza, J.-P.; Cubaynes, S.; Pavard, S.; Bienvenu, F. The Kinship Matrix: Inferring the Kinship Structure of a Population from Its Demography. Ecol. Lett. 2021, 24, 2750–2762. [Google Scholar] [CrossRef]
  112. Thompson, E.A. Identity by Descent: Variation in Meiosis, Across Genomes, and in Populations. Genetics 2013, 194, 301–326. [Google Scholar] [CrossRef]
  113. McMaster, E.S.; Yap, J.Y.S.; Chen, S.H.; Sherieff, A.; Bate, M.; Brown, I.; Jones, M.; Rossetto, M. On the Edge: Conservation Genomics of the Critically Endangered Dwarf Mountain Pine Pherosphaera fitzgeraldii. Basic Appl. Ecol. 2024, 80, 61–71. [Google Scholar] [CrossRef]
  114. Rueda, M.A.R.; Briceño-Pinzón, I.D.; Miguel, L.A.; Martínez, J.L.Q.; Padua, L.N.; de Lima Ribeiro, R.H.; Martins, V.S.; de Souza, L.C.; da Silva Junior, A.L.; de Souza Marçal, T. Multivariate Analysis of Genetic Variability in Advanced Potato Clones Under Tropical Conditions. Agric. Res. 2025, 1–10. [Google Scholar] [CrossRef]
  115. Woldeyohannes, A.B.; Iohannes, S.D.; Miculan, M.; Caproni, L.; Ahmed, J.S.; de Sousa, K.; Desta, E.A.; Fadda, C.; Pè, M.E.; Dell’acqua, M. Data-Driven, Participatory Characterization of Farmer Varieties Discloses Teff Breeding Potential under Current and Future Climates. Elife 2022, 11, e80009. [Google Scholar] [CrossRef] [PubMed]
  116. McMaster, E.S.; Lu-Irving, P.; van der Merwe, M.M.; Ho, S.Y.W.; Rossetto, M. Evaluating Kinship Estimation Methods for Reduced-Representation SNP Data in Non-Model Species. Mol. Ecol. Resour. 2025, 25, e70038. [Google Scholar] [CrossRef]
  117. Scariolo, F.; Palumbo, F.; Farinati, S.; Barcaccia, G. Pipeline to Design Inbred Lines and F1 Hybrids of Leaf Chicory (Radicchio) Using Male Sterility and Genotyping-by-Sequencing. Plants 2023, 12, 1242. [Google Scholar] [CrossRef]
  118. UPOV Explanatory Notes on Essentially Derived Varieties Under the 1991 Act of the UPOV Convention. Available online: https://www.upov.int/edocs/expndocs/en/upov_exn_edv.pdf (accessed on 4 May 2025).
  119. Noli, E.; Teriaca, M.S.; Conti, S. Criteria for the Definition of Similarity Thresholds for Identifying Essentially Derived Varieties. Plant Breed. 2013, 132, 525–531. [Google Scholar] [CrossRef]
  120. Jamali, S.H.; Cockram, J.; Hickey, L.T. Insights into Deployment of DNA Markers in Plant Variety Protection and Registration. Theor. Appl. Genet. 2019, 132, 1911–1929. [Google Scholar] [CrossRef] [PubMed]
  121. ISF View on Intellectual Property. Available online: https://worldseed.org/wp-content/uploads/2021/10/ISF-View-on-Intellectual-Property-2012-amended-2021.pdf (accessed on 16 January 2026).
  122. Van Eeuwijk, F.A.; Law, J.R. Statistical Aspects of Essential Derivation, with Illustrations Based on Lettuce and Barley. Euphytica 2004, 137, 129–137. [Google Scholar] [CrossRef]
  123. ISF Guidelines for Handling Disputes on Essential Derivation of Maize Lines. Available online: https://worldseed.org/wp-content/uploads/2015/10/ISF_Guidelines_Disputes_EDV_Maize_2014.pdf (accessed on 10 January 2026).
  124. Rousselle, Y.; Jones, E.; Charcosset, A.; Moreau, P.; Robbins, K.; Stich, B.; Knaak, C.; Flament, P.; Karaman, Z.; Martinant, J.P.; et al. Study on Essential Derivation in Maize: III. Selection and Evaluation of a Panel of Single Nucleotide Polymorphism Loci for Use in European and North American Germplasm. Crop Sci. 2015, 55, 1170–1180. [Google Scholar] [CrossRef]
  125. Yang, X.; Fang, Z.; Cao, Y.; Tian, L.; Peng, H.; Zhang, Y.; Xu, J.; Zhou, J.; Huo, H.; Qi, D.; et al. Development and Application of a Multiple Nucleotide Polymorphism (MNP)-Based Molecular Identification System for Pear Cultivars. BMC Plant Biol. 2026, 26, 15. [Google Scholar] [CrossRef]
  126. COGEM Update on Unauthorised Genetically Modified Garden Petunia Varieties. Available online: https://cogem.net/en/publication/update-on-unauthorised-genetically-modified-garden-petunia-varieties/ (accessed on 19 May 2025).
  127. Entani, T.; Takayama, S.; Iwano, M.; Shiba, H.; Che, F.S.; Isogai, A. Relationship between Polyploidy and Pollen Self-Incompatibility Phenotype in Petunia hybrida Vilm. Biosci. Biotechnol. Biochem. 1999, 63, 1882–1888. [Google Scholar] [CrossRef]
  128. Chen, Y.; Jiang, P.; Wilde, H.D. A Self-Pollinating Mutant of Petunia hybrida. Sci. Hortic. 2014, 177, 10–13. [Google Scholar] [CrossRef]
  129. Thurow, L.B.; Gasic, K.; Bassols Raseira, M.D.C.; Bonow, S.; Marques Castro, C. Genome-Wide SNP Discovery through Genotyping by Sequencing, Population Structure, and Linkage Disequilibrium in Brazilian Peach Breeding Germplasm. Tree Genet. Genomes 2020, 16, 10. [Google Scholar] [CrossRef]
  130. Ksouri, N.; Sánchez, G.; Font i Forcada, C.; Contreras-Moreira, B.; Gogorcena, Y. A Reproducible DdRAD-Seq Protocol Reveals Novel Genomic Association Signatures for Fruit-Related Traits in Peach. Plant Methods 2025, 21, 101. [Google Scholar] [CrossRef]
  131. Wang, Y.; Lv, H.; Xiang, X.; Yang, A.; Feng, Q.; Dai, P.; Li, Y.; Jiang, X.; Liu, G.; Zhang, X. Construction of a SNP Fingerprinting Database and Population Genetic Analysis of Cigar Tobacco Germplasm Resources in China. Front. Plant Sci. 2021, 12, 618133. [Google Scholar] [CrossRef] [PubMed]
  132. Varshney, R.K.; Thiel, T.; Sretenovic-Rajicic, T.; Baum, M.; Valkoun, J.; Guo, P.; Grando, S.; Ceccarelli, S.; Graner, A. Identification and Validation of a Core Set of Informative Genic SSR and SNP Markers for Assaying Functional Diversity in Barley. Mol. Breed. 2008, 22, 1–13. [Google Scholar] [CrossRef]
  133. Kuang, M.; Wei, S.J.; Wang, Y.Q.; Zhou, D.Y.; Ma, L.; Fang, D.; Yang, W.H.; Ma, Z.Y. Development of a Core Set of SNP Markers for the Identification of Upland Cotton Cultivars in China. J. Integr. Agric. 2016, 15, 954–962. [Google Scholar] [CrossRef]
  134. Yang, G.; Chen, S.; Chen, L.; Sun, K.; Huang, C.; Zhou, D.; Huang, Y.; Wang, J.; Liu, Y.; Wang, H.; et al. Development of a Core SNP Arrays Based on the KASP Method for Molecular Breeding of Rice. Rice 2019, 12, 21. [Google Scholar] [CrossRef]
  135. Meng, B.; Wang, S.; Li, W.X.; Guo, Z.; Tang, J. QTL Mapping of Fusarium Ear Rot Resistance Using Genotyping by Target Sequencing (GBTS) in Maize. J. Appl. Genet. 2025, 66, 787–796. [Google Scholar] [CrossRef]
  136. Yu, H.J.; Jeong, Y.M.; Lee, Y.J.; Yim, B.; Cho, A.; Mun, J.H. Marker Integration and Development of Fluidigm/KASP Assays for High-Throughput Genotyping of Radish. Hortic. Environ. Biotechnol. 2020, 61, 767–777. [Google Scholar] [CrossRef]
  137. Nagano, S.; Hirao, T.; Takashima, Y.; Matsushita, M.; Mishima, K.; Takahashi, M.; Iki, T.; Ishiguri, F.; Hiraoka, Y. SNP Genotyping with Target Amplicon Sequencing Using a Multiplexed Primer Panel and Its Application to Genomic Prediction in Japanese Cedar, Cryptomeria japonica (L.f.) D.Don. Forests 2020, 11, 898. [Google Scholar] [CrossRef]
  138. Guo, Z.; Wang, H.; Tao, J.; Ren, Y.; Xu, C.; Wu, K.; Zou, C.; Zhang, J.; Xu, Y. Development of Multiple SNP Marker Panels Affordable to Breeders through Genotyping by Target Sequencing (GBTS) in Maize. Mol. Breed. 2019, 39, 37. [Google Scholar] [CrossRef]
  139. Ayalew, H.; Tsang, P.W.; Chu, C.; Wang, J.; Liu, S.; Chen, C.; Ma, X.F. Comparison of TaqMan, KASP and RhAmp SNP Genotyping Platforms in Hexaploid Wheat. PLoS ONE 2019, 14, e0217222. [Google Scholar] [CrossRef]
  140. Pootakham, W.; Sonthirod, C.; Naktang, C.; Jomchai, N.; Sangsrakru, D.; Tangphatsornruang, S. Effects of Methylation-Sensitive Enzymes on the Enrichment of Genic SNPs and the Degree of Genome Complexity Reduction in a Two-Enzyme Genotyping-by-Sequencing (GBS) Approach: A Case Study in Oil Palm (Elaeis guineensis). Mol. Breed. 2016, 36, 154. [Google Scholar] [CrossRef]
  141. Brazier, T.; Glémin, S. Diversity and Determinants of Recombination Landscapes in Flowering Plants. PLoS Genet. 2022, 18, e1010141. [Google Scholar] [CrossRef]
  142. Zou, M.; Shabala, S.; Zhao, C.; Zhou, M. Molecular Mechanisms and Regulation of Recombination Frequency and Distribution in Plants. Theor. Appl. Genet. 2024, 137, 86. [Google Scholar] [CrossRef]
  143. Tiley, G.P.; Burleigh, G. The Relationship of Recombination Rate, Genome Structure, and Patterns of Molecular Evolution across Angiosperms. BMC Evol. Biol. 2015, 15, 194. [Google Scholar] [CrossRef]
  144. Park, M.; Jo, S.H.; Kwon, J.K.; Park, J.; Ahn, J.H.; Kim, S.; Lee, Y.H.; Yang, T.J.; Hur, C.G.; Kang, B.C.; et al. Comparative Analysis of Pepper and Tomato Reveals Euchromatin Expansion of Pepper Genome Caused by Differential Accumulation of Ty3/Gypsy-like Elements. BMC Genom. 2011, 12, 85. [Google Scholar] [CrossRef]
  145. Fuentes, R.R.; De Ridder, D.; Van Dijk, A.D.J.; Peters, S.A. Domestication Shapes Recombination Patterns in Tomato. Mol. Biol. Evol. 2022, 39, msab287. [Google Scholar] [CrossRef]
  146. Guyot, R.; Cheng, X.; Su, Y.; Cheng, Z.; Schlagenhauf, E.; Keller, B.; Ling, H.Q. Complex Organization and Evolution of the Tomato Pericentromeric Region at the FER Gene Locus. Plant Physiol. 2005, 138, 1205–1215. [Google Scholar] [CrossRef] [PubMed][Green Version]
  147. Mammadov, J.; Aggarwal, R.; Buyyarapu, R.; Kumpatla, S. SNP Markers and Their Impact on Plant Breeding. Int. J. Plant Genom. 2012, 2012, 728398. [Google Scholar] [CrossRef] [PubMed]
  148. Patrick, R.M.; Huang, X.Q.; Dudareva, N.; Li, Y. Dynamic Histone Acetylation in Floral Volatile Synthesis and Emission in Petunia Flowers. J. Exp. Bot. 2021, 72, 3704–3722. [Google Scholar] [CrossRef]
  149. Fu, J.; Huang, S.; Qian, J.; Qing, H.; Wan, Z.; Cheng, H.; Zhang, C. Genome-Wide Identification of Petunia HSF Genes and Potential Function of PhHSF19 in Benzenoid/Phenylpropanoid Biosynthesis. Int. J. Mol. Sci. 2022, 23, 2974. [Google Scholar] [CrossRef] [PubMed]
  150. Chen, J.; Liu, X.; Jiang, L.; Qing, H.; Qian, J.; Li, Z.; Zhang, C.; Fu, J. Genome-Wide Identification of Petunia C2H2 Zinc Finger Family Genes and Their Potential Roles in Floral Volatile Benzenoids/Phenylpropanoids Metabolism. Planta 2026, 263, 43. [Google Scholar] [CrossRef]
  151. Bednarczyk, D.; Skaliter, O.; Kerzner, S.; Masci, T.; Shklarman, E.; Shor, E.; Vainstein, A. The Homeotic Gene PhDEF Regulates Production of Volatiles in Petunia Flowers by Activating EOBI and EOBII. Plant Cell 2025, 37, koaf027. [Google Scholar] [CrossRef]
  152. Sasse, J.; Schlegel, M.; Borghi, L.; Ullrich, F.; Lee, M.; Liu, G.W.; Giner, J.L.; Kayser, O.; Bigler, L.; Martinoia, E.; et al. Petunia hybrida PDR2 Is Involved in Herbivore Defense by Controlling Steroidal Contents in Trichomes. Plant Cell Environ. 2016, 39, 2725–2739. [Google Scholar] [CrossRef]
  153. Amano, I.; Kitajima, S.; Suzuki, H.; Koeduka, T.; Shitan, N. Transcriptome Analysis of Petunia axillaris Flowers Reveals Genes Involved in Morphological Differentiation and Metabolite Transport. PLoS ONE 2018, 13, e0198936. [Google Scholar] [CrossRef]
  154. Shor, E.; Vainstein, A. Petunia PHYTOCHROME INTERACTING FACTOR 4/5 Transcriptionally Activates Key Regulators of Floral Scent. Plant Mol. Biol. 2024, 114, 66. [Google Scholar] [CrossRef]
Figure 1. Genetic Distance (GD) and Genetic Structure analysis results based on molecular profiles of 32 petunia clonal Lines (BLs) and 4 BenchMarks (BMs), obtained with 4349 SNPs from a ddRADseq approach. (A). Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrogram based on Nei’s GD analysis. Numbers on nodes represent bootstrap values and colors highlight different subpopulations. (B). Graphic representation of the structure analysis considering K = 2 and K = 3 results, where K is the most likely number of ancestral populations (Supplementary Figure S1). Colors indicate different putative ancestors and correspond to those of the most represented respective subpopulation.
Figure 1. Genetic Distance (GD) and Genetic Structure analysis results based on molecular profiles of 32 petunia clonal Lines (BLs) and 4 BenchMarks (BMs), obtained with 4349 SNPs from a ddRADseq approach. (A). Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrogram based on Nei’s GD analysis. Numbers on nodes represent bootstrap values and colors highlight different subpopulations. (B). Graphic representation of the structure analysis considering K = 2 and K = 3 results, where K is the most likely number of ancestral populations (Supplementary Figure S1). Colors indicate different putative ancestors and correspond to those of the most represented respective subpopulation.
Horticulturae 12 00160 g001
Figure 2. Principal Component Analysis (PCA) centroids based on the Genetic Distance (GD) analysis performed on molecular profiles of 32 petunia lines (grouped into 4 BLs) and 4 BenchMarks (BMs), obtained with 4349 SNPs using a ddRADseq approach. Colors and ellipses represent sister line groups. Axes explain the two components with the highest genetic variability. The line plot shows the variability percentage of every component (red line) and the threshold value of total variability (10%) used to determine component informativeness (blue line).
Figure 2. Principal Component Analysis (PCA) centroids based on the Genetic Distance (GD) analysis performed on molecular profiles of 32 petunia lines (grouped into 4 BLs) and 4 BenchMarks (BMs), obtained with 4349 SNPs using a ddRADseq approach. Colors and ellipses represent sister line groups. Axes explain the two components with the highest genetic variability. The line plot shows the variability percentage of every component (red line) and the threshold value of total variability (10%) used to determine component informativeness (blue line).
Horticulturae 12 00160 g002
Figure 3. Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrograms based on Genetic Distance (GD) analysis performed with Nei’s coefficient on ddRADseq SNP profiles of 32 petunia lines (grouped in 4 BL subpopulations) and 4 BenchMarks (BMs), considering different variant filtering. (A). MAF ≥ 0.1 (4349 SNPs). (B). MAF ≥ 0.3 (1051 SNPs). (C). MAF ≥ 0.45 and AvgPIC ≥ 0.3 (103 SNPs). (D). MAF ≥ 0.3 and AvgPIC ≥ 0.45 (123 SNPs). Numbers on branches represent bootstrap values and colors highlight different clusters, correlated to subpopulations. MAF: minimal Minor Allele Frequency; AvgPIC: average Polymorphic Information Content of the locus.
Figure 3. Unweighted Pair Group Method with Arithmetic Mean (UPGMA) dendrograms based on Genetic Distance (GD) analysis performed with Nei’s coefficient on ddRADseq SNP profiles of 32 petunia lines (grouped in 4 BL subpopulations) and 4 BenchMarks (BMs), considering different variant filtering. (A). MAF ≥ 0.1 (4349 SNPs). (B). MAF ≥ 0.3 (1051 SNPs). (C). MAF ≥ 0.45 and AvgPIC ≥ 0.3 (103 SNPs). (D). MAF ≥ 0.3 and AvgPIC ≥ 0.45 (123 SNPs). Numbers on branches represent bootstrap values and colors highlight different clusters, correlated to subpopulations. MAF: minimal Minor Allele Frequency; AvgPIC: average Polymorphic Information Content of the locus.
Horticulturae 12 00160 g003
Figure 4. (A). Physical map of petunia chromosomes with the distribution representation of 1975 SNPs located in CDSs from a ddRADseq approach applied to 36 petunia lines (red bands) and of annotated genes (blue bands), in non-overlapping 1 Mb windows. Darker shades represent higher densities; black bands indicate centromere position. (B,C). GO enrichment analysis of the annotated genes containing the CDS-SNPs for (B) Biological process and (C) Molecular function categories. False discovery rate (FDR) is represented by colors ranging from purple to red.
Figure 4. (A). Physical map of petunia chromosomes with the distribution representation of 1975 SNPs located in CDSs from a ddRADseq approach applied to 36 petunia lines (red bands) and of annotated genes (blue bands), in non-overlapping 1 Mb windows. Darker shades represent higher densities; black bands indicate centromere position. (B,C). GO enrichment analysis of the annotated genes containing the CDS-SNPs for (B) Biological process and (C) Molecular function categories. False discovery rate (FDR) is represented by colors ranging from purple to red.
Horticulturae 12 00160 g004
Table 1. Sequencing results, after raw read quality filtering and mapping, of ddRADseq libraries relative to the 32 petunia clonal lines–divided in four subpopulations (BL1-BL4)–and 4 distinct BenchMarks lines (BM1-BM4). Averages are weighted for the number of lines per subpopulation (N). Mn bases: average number of bases (in millions) sequenced per sample; ≥Q20 Mn Bases: average number of bases (in millions) sequenced per sample with a Phred quality score ≥ Q20; Mn Reads: average millions of reads per sample, MRL: Mean Read Length.
Table 1. Sequencing results, after raw read quality filtering and mapping, of ddRADseq libraries relative to the 32 petunia clonal lines–divided in four subpopulations (BL1-BL4)–and 4 distinct BenchMarks lines (BM1-BM4). Averages are weighted for the number of lines per subpopulation (N). Mn bases: average number of bases (in millions) sequenced per sample; ≥Q20 Mn Bases: average number of bases (in millions) sequenced per sample with a Phred quality score ≥ Q20; Mn Reads: average millions of reads per sample, MRL: Mean Read Length.
Pop-IDNMn Bases≥Q20 Mn BasesMn ReadsMRL (bp)
PeHy-BL14486.46416.443.34160
PeHy-BL27360.03304.842.39173
PeHy-BL39316.36271.881.99177
PeHy-BL412267.85230.081.68178
PeHy-BM11258.86226.061.25206
PeHy-BM21104.6091.090.50208
PeHy-BM31469.56399.953.31141
PeHy-BM41551.17468.703.91141
Tot3612,291.0110,528.1780.01
Avg 341.42292.452.22173
Table 2. Genetic Distance (GD) with Nei’s coefficient and fixation index (st) matrix based on a pairwise comparison among four sister line groups, comprehending a total of 32 clonal lines.
Table 2. Genetic Distance (GD) with Nei’s coefficient and fixation index (st) matrix based on a pairwise comparison among four sister line groups, comprehending a total of 32 clonal lines.
GDClusterFst
PeHy-BL2
0.07PeHy-BL10.23
0.110.10PeHy-BL40.230.22
0.190.270.29PeHy-BL30.500.450.34
PeHy-BL3PeHy-BL4PeHy-BL1PeHy-BL2 PeHy-BL2PeHy-BL1PeHy-BL4PeHy-BL3
Table 3. Genetic statistics calculated from ddRADseq molecular profiles relative to the 4 petunia sister line groups (PeHy-BL1-BL4), with number of individuals (N); average number of observed (na) and expected (ne) alleles per locus; observed (Ho%) and expected (Hs%) Heterozygosity; inbreeding coefficient (Fis); percentage of Polymorphic Loci (PL%) of the total loci in the population; percentage of Private Alleles (PA%) of the total PAs in the population.
Table 3. Genetic statistics calculated from ddRADseq molecular profiles relative to the 4 petunia sister line groups (PeHy-BL1-BL4), with number of individuals (N); average number of observed (na) and expected (ne) alleles per locus; observed (Ho%) and expected (Hs%) Heterozygosity; inbreeding coefficient (Fis); percentage of Polymorphic Loci (PL%) of the total loci in the population; percentage of Private Alleles (PA%) of the total PAs in the population.
Pop IDNnaneHo (%)Hs (%)FisPL (%)PA (%)
PeHy-BL141.361.2719.0515.02−0.2763.756.38
PeHy-BL271.541.3120.4816.14−0.2768.5414.89
PeHy-BL391.541.4626.2722.08−0.1973.1433.96
PeHy-BL4121.661.5533.5726.17−0.2881.8812.75
Avg81.531.5324.8419.85−0.2571.8317.00
StDev30.110.125.724.530.046.6910.28
Table 4. Analysis of Molecular Variance (AMOVA) results calculated from ddRADseq molecular profiles relative to 32 petunia clonal Lines (in 4 BLs) and 4 BMs. Degrees of freedom (Df); Sum of Squares (Sum Sq); Mean of Squares (Mean Sq) and Estimated Variance (Est. Var.) are reported.
Table 4. Analysis of Molecular Variance (AMOVA) results calculated from ddRADseq molecular profiles relative to 32 petunia clonal Lines (in 4 BLs) and 4 BMs. Degrees of freedom (Df); Sum of Squares (Sum Sq); Mean of Squares (Mean Sq) and Estimated Variance (Est. Var.) are reported.
DfSum SqMean SqEst. Var.Est. Var. (%)
Between Subpopulations716,100230049058.19
Within Subpopulations28985535235241.81
Total3525,955742842100
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Betto, A.; Scariolo, F.; Gabelli, G.; Riommi, D.; Farinati, S.; Vannozzi, A.; Palumbo, F.; Barcaccia, G. ddRADseq Applications for Petunia × hybrida Clonal Line Breeding: Genotyping and Variant Identification for Target-Specific Assays. Horticulturae 2026, 12, 160. https://doi.org/10.3390/horticulturae12020160

AMA Style

Betto A, Scariolo F, Gabelli G, Riommi D, Farinati S, Vannozzi A, Palumbo F, Barcaccia G. ddRADseq Applications for Petunia × hybrida Clonal Line Breeding: Genotyping and Variant Identification for Target-Specific Assays. Horticulturae. 2026; 12(2):160. https://doi.org/10.3390/horticulturae12020160

Chicago/Turabian Style

Betto, Angelo, Francesco Scariolo, Giovanni Gabelli, Damiano Riommi, Silvia Farinati, Alessandro Vannozzi, Fabio Palumbo, and Gianni Barcaccia. 2026. "ddRADseq Applications for Petunia × hybrida Clonal Line Breeding: Genotyping and Variant Identification for Target-Specific Assays" Horticulturae 12, no. 2: 160. https://doi.org/10.3390/horticulturae12020160

APA Style

Betto, A., Scariolo, F., Gabelli, G., Riommi, D., Farinati, S., Vannozzi, A., Palumbo, F., & Barcaccia, G. (2026). ddRADseq Applications for Petunia × hybrida Clonal Line Breeding: Genotyping and Variant Identification for Target-Specific Assays. Horticulturae, 12(2), 160. https://doi.org/10.3390/horticulturae12020160

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop