1. Introduction
Potato (
Solanum tuberosum L.), as the world’s fourth most important food crop, with a global annual production exceeding 383 million tons across approximately 16.8 million hectares in 2023, plays a critical role in ensuring food security and nutritional supply worldwide [
1]. However, the intensification of global climate change in recent years has posed severe threats to global food security [
2]. Additionally, most cultivated potatoes are autotetraploids (2n = 4x = 48) with a narrow genetic background. These factors contribute to challenges such as unstable yields and slow improvement of quality traits in potato production. Among these, tuber yield, tuber shape, and eye number are core agronomic traits determining commercial value. For instance, irregular tuber shapes increase processing losses, while excessive eyes directly affect tuber appearance and reduce market value. Therefore, deciphering the genetic basis of these traits and developing molecular marker-assisted breeding technologies are of great significance for advancing potato genetic improvement.
Compared with major cereal crops such as rice and maize, potato research lags significantly, primarily due to its tetraploid genetic characteristics. The tetraploid potato genome is highly heterozygous, with frequent recombination events between the four homologous chromosomes, leading to complex segregation patterns in progeny [
3]. Consequently, most genetic maps constructed to date are based on diploid materials, with only rare reports on tetraploid potato genetic maps [
4,
5,
6,
7,
8,
9,
10]. In nature, almost 75% of potato wild relatives are diploid [
11], originating from diverse habitats ranging from southwestern North America to central Chile and Argentina. These wild species possess high genetic diversity and adaptability to harsh environments [
12], providing a robust genetic resource base for potato breeding improvement. However, introgressing a new trait from diploid wild species into tetraploid potato cultivars can take 15–50 years [
13]. Diploid hybrid breeding represents a revolutionary breakthrough in potato breeding in recent years [
14]. Compared with traditional tetraploid breeding, diploid hybrid breeding offers distinct advantages: shorter fixation time for favorable alleles, feasibility of marker-assisted backcrossing, greater genetic variation for selection, and systematic utilization of heterosis [
15].
In summary, breeding and genetic analysis at the diploid level are pivotal directions for future research. Nevertheless, most diploid potatoes exhibit self-incompatibility and inbreeding depression, hindering the development of homozygous inbred lines through multiple generations of selfing, even for F
2 mapping populations derived from F
1 selfing [
16]. Recently, the Joint Academy of Potato Science of Yunnan Normal University successively cloned the
S-RNase gene [
17] and
Sli gene [
18], overcoming self-incompatibility and successfully creating a set of self-compatible diploid potato advanced inbred lines [
19], which provide material foundations for constructing secondary segregation populations. Furthermore, the completion of high-quality genome sequencing of diploid potatoes (e.g., DM8.1 and C88) [
20,
21] and advancements in resequencing technologies have further facilitated the construction of high-density genetic maps and fine QTL (Quantitative trait loci) mapping.
QTL mapping remains a traditional and effective approach for mining causal genes [
22], yet research on genetic map construction and QTL mapping in potatoes is severely delayed. Early genetic analyses predominantly relied on crosses of heterozygous diploid parents, resulting in complex genetic backgrounds that hindered the decomposition of polygenic traits into individual mendelian factors for chromosomal localization. Additionally, markers such as SSR (Simple sequence repeat), AFLP (Amplified fragment length polymorphism), and SCAR (Sequence-characterized amplified region) yielded low-density maps [
23,
24,
25,
26,
27], leading to overly broad QTL intervals that impeded gene cloning. With the development of high-throughput sequencing, SNP (Single nucleotide polymorphism) markers have been widely adopted for constructing high-density genetic maps in potatoes [
13,
28,
29,
30,
31,
32,
33]. However, only a few studies have utilized optimally selected parental materials. For example, Meijer et al. [
13] constructed an F
2 population of 272 lines using a homozygous diploid carrying the
Sli gene and a heterozygous diploid. Endelman and Jansky [
29] developed the first F
2 population from two homozygous diploids: DM (a doubled monoploid potato from anther culture) and M6 (an advanced S
7 inbred line with
Sli), which was later advanced to an 87-line RIL population [
34]. Most recently, Li et al. [
33] generated an F
2 population of 1064 individuals from two homozygous inbred lines (A6-26 and E4-63), performing whole-genome resequencing to construct a high-density genetic map for QTL mapping.
Yield is the most economically significant potato trait, determined by tuber number per plant and tuber size. Hurtado-Lopez et al. [
35] identified 4 QTLs for tuber number and 6 for yield using a backcross population. Manrique-Carpintero et al. [
30] detected 3 QTLs for total tuber yield, 2 for tuber number per plant, and 6 for average tuber weight in an F
1 population. Hara-Skrzypiec et al. [
26] reported QTLs for tuber weight on chromosomes 1, 4, 5, and 6 in an F
1 population. Using a diploid diversity panel of 246 breeding lines with genotyping-by-sequencing and GWAS (Genome-wide association study), Śliwka et al. [
36] identified 2 loci associated with tuber weight per plant and 26 with average tuber weight. However, no major yield-related genes have been cloned via QTL mapping to date.
Tuber shape and eye traits also critically influence quality appearance and processing suitability [
22]. Extensive QTL mapping has been conducted to identify shape-regulating genes [
25,
26,
28,
29,
37,
38], culminating in the cloning of
StOFP20 as a major causal gene in 2023 [
39]. Nevertheless, numerous shape-related genes remain uncloned, such as a novel locus on chromosome 6 (~58 Mb) reported by Huang et al. [
40]. Prashar et al. [
28] mapped a major QTL explaining 20% of phenotypic variation on chromosome 2 using a dense SNP map, while Lindqvist-Kreuze et al. [
25] identified secondary QTLs on chromosomes 5 and 12 alongside the major chromosome 10 locus. For eye depth, Fan et al. [
22] recently identified a major QTL
qEyd10.1 on chromosome 10 explaining 55.0% of phenotypic variation, refined to a 309.10 kb interval via recombination analysis. However, genetic studies on eye number remain limited, with no systematic reports of stable QTLs. Thus, mining stably expressed major QTLs through multi-environment trials combined with high-density genetic maps is crucial for deciphering these traits’ genetic mechanisms.
This study utilized an F
2 segregating population of 174 lines derived from a cross between two diploid potato materials, Y8 and IVP101. Y8 is a homozygous line carrying the
Sli gene, developed through hybridization of CIP151 (StGp Goniocalyx) and CIP65 (StGp Phureja) followed by 6 generations of selfing. IVP101 (StGp Phureja), a widely used inducer for ploidy reduction in potato wide crosses [
41], has a heterozygous genome. The two parents exhibit significant differences in multiple traits. We constructed a high-density genetic map using bin markers developed via whole-genome resequencing and phenotyped tuber yield, shape, and eye number across three environments to perform multi-environment joint QTL analysis for stable genetic loci. This study aims to: (1) explore resequencing-based high-density genetic map construction in an F
2 population derived from homozygous × heterozygous parents; (2) clarify the genetic basis of yield, tuber shape, and eye number in diploid potatoes; (3) enhance QTL mapping precision by integrating high-density genetic maps with multi-environment phenotypic data, laying foundations for dissecting complex trait genetic networks and accelerating potato breeding.
2. Materials and Methods
2.1. Plant Material
In this study, two diploid potato lines, IVP101 (male parent) and Y8 (female parent), were crossed to produce F
1 progeny, which were further selfed to generate an F
2 segregating population consisting of 174 lines. IVP101 (StGp Phureja) is a widely used inducer for ploidy reduction in potato wide crosses, characterized by a heterozygous genome, short growth period, concentrated flowering time, long tubers with purple skin and white flesh, numerous deep eyes with non-dormant tubers, purple self-incompatible flowers, seeds with embryo spots, and stems with abundant purple pigmentation. Y8 is an advanced generation inbred line developed by the Joint Academy of Potato Science of Yunnan Normal University through hybridization of CIP65 and CIP151 followed by 6 generations of selfing. It exhibits high genomic homozygosity, round tubers with yellow skin and yellow flesh, few shallow eyes, white self-compatible flowers, seeds without embryo spots, and green stems without purple pigmentation. F
1 seeds were obtained from the Y8 × IVP101 cross. Seeds were germinated with gibberellin and sown in a greenhouse, yielding 49 F
1 plants. We referenced the population construction method described by Meijer et al. [
13], two vigorous and easily selfed F
1 plants (designated IYY30 and IYP31) were selected. Selfed seeds from these two F
1 plants were randomly mixed and sown, ultimately generating an F
2 genetic segregating population with 174 lines.
2.2. Ploidy Analysis and Parentage Identification of F1 Plants
Healthy, disease-free leaves of moderate size (0.3–0.5 g) were collected. The leaves were placed into a 1.5 mL centrifuge tube, followed by the addition of 1 mL pre-chilled lysis buffer and three 3 mm steel beads. The mixture was ground at 50 Hz for 30 s using a tissue grinder and then incubated on ice for 30 min. The lysate was filtered through a 400-mesh filter into a new 1.5 mL centrifuge tube. The tube was centrifuged at 1500 rpm for 5 min at 4 °C in a pre-chilled centrifuge. After discarding the supernatant, the pellet (intact nuclear fraction) was collected. Next, 1 mL of pre-chilled lysis buffer containing 1% NP-40 was added, and the mixture was vigorously vortexed. Subsequently, 20 μL of pre-chilled propidium iodide (PI) dye was added, and the mixture was incubated at 4 °C in the dark for 15 min. After filtration through a 400-mesh nylon cell strainer, the filtrate was transferred to a light-protected centrifuge tube, and ploidy was determined using a flow cytometer (Agilent Technologies, NovoCyte 2000R, Santa Clara, CA, USA).
Sample DNA extraction and quality assessment were performed following the method described by Yang et al. [
42]. A total of 231 SSR molecular markers were designed based on reference genome sequences, and primers were synthesized by Sangon Biotech (Sangon Biotech, Shanghai, China). These 231 primer pairs were used to amplify DNA samples, with reactions conducted on a Veriti 384 PCR instrument (Applied Biosystems, Foster City, CA, USA). The PCR amplification program was as follows: initial denaturation at 95 °C for 5 min; followed by 10 cycles of denaturation at 95 °C for 30 s, touchdown annealing from 62 to 52 °C (30 s), and extension at 72 °C for 30 s; then 25 cycles of denaturation at 95 °C for 30 s, annealing at 52 °C for 30 s, and extension at 72 °C for 30 s; final extension at 72 °C for 20 min; and storage at 4 °C. After PCR, amplification products were analyzed by fluorescent capillary electrophoresis (ABI 3730xL DNA Analyzer, Applied Biosystems, Waltham, MA, USA). Results were processed using GeneMarker software (Vision 2.2.0, Developed by Soft Genetics LLC, State College, PA, USA) to determine the number of alleles, electropherograms, and genotypes for each sample.
2.3. Phenotyping for Yield, Tuber Shape, and Eye Number
The two parental lines (IVP101 and Y8), two F1 plants (IYY30 and IYP31), and 174 F2 lines were grown across three environments (E1–E3). E1 corresponded to March–July 2024, and E3 to September 2024–January 2025, both conducted in a greenhouse at the Songming Research Base of Yunnan Academy of Agricultural Sciences (N 25°21′, E 103°6′). E2 was conducted in a walk-in growth chamber at the Joint Academy of Potato Science of Yunnan Normal University from June to October 2024, with environmental conditions set as follows: photoperiod of 7:00–19:00 (light, 23 °C) and 19:00–7:00 (dark, 19 °C), and light intensity of 261 PPFD. For E1 and E2, uniform tissue culture seedlings were transplanted; for E3, uniformly sized and germinated seed tubers were sown. In E1 and E3, plants were grown in large pots (29 cm diameter × 23 cm height), with 5 plants per line; in E2, small pots (14 cm diameter × 13 cm height) were used, with 3 plants per line. Crop management and pest control followed local recommendations.
After plant maturation, eight phenotypic traits were investigated: (1) Tuber number per plant (TNP): Individual counts of all tubers harvested per pot. (2) Tuber fresh weight per plant (TFWP): Total weight of all tubers per pot, measured using an electronic balance. (3) Mean tuber weight (MTW): Five large, uniform tubers (labeled 1–5) were selected from each pot, weighed individually, and the mean was calculated. (4) Tuber length (TL): Measured on labeled tubers 1–5 using a vernier caliper. (5) Tuber width (TW): Measured on labeled tubers 1–5 using a vernier caliper. (6) Tuber length/width ratio (TLWR): Calculated as tuber length divided by tuber width. (7) Tuber eye number (TEN): Visually counted on labeled tubers 1–5. (8) Eye number-to-tuber volume ratio (ENTVR): Calculated as the number of eyes divided by tuber volume; volume of the five tubers was measured via water displacement using a graduated cylinder. All phenotypic data were collated using Microsoft Excel 16. Statistical analyses were performed using SAS (Statistical Analysis System, version 8.01, SAS Institute Inc., Cary, NC, USA). Best linear unbiased estimators (BLUEs) were calculated for root phenotypic traits in the F
2 population using QTL IciMapping v4.2 software [
43].
2.4. Sample Genomic Resequencing, Quality Control, and SNP Identification
After passing quality control, DNA samples of the two parental lines and 174 F2 lines were entrusted to Beijing Biomarker Technologies Co., Ltd. (Beijing, China) for library construction and whole-genome resequencing. Library construction was performed using the VAHTS™ Fg DNA Library Prep Kit for Illumina. Briefly, 10 ng of Qubit-quantified genomic DNA was used and supplemented with FEA Buffer (2 μL) and FEA Enzyme Mix (4 μL) in a total volume of 20 μL. This mixture was subjected to PCR thermal cycling for enzymatic fragmentation and end repair with the following cycling conditions: 4 °C for 15 s, followed by 37 °C for 8 min, 65 °C for 30 min, and a final hold at 4 °C (with a 75 °C heated lid). Fragmented products (20 μL) were then mixed with Rapid Ligation Buffer 3 (10 μL), DNA Adapter (2 μL), Rapid DNA Ligase (2 μL), and ddH2O (6 μL) in a 40 μL reaction system for ligation at 20 °C for 15 min (with the heated lid turned off). Ligation products were purified using 0.8× VAHTS™ DNA Clean Beads (Cat. No. N411-03), followed by size selection via adding 0.485× beads (supernatant collected) and 0.1× beads (pellet retained). The beads were washed twice with 80% ethanol, and DNA was eluted in 10 μL nuclease-free water; 8 μL of the eluate was used for PCR amplification. The PCR reaction mixture (20 μL) contained PCR Primer Mix 3 (2 μL), VAHTS HiFi Amplification Mix (10 μL), and size-selected product (8 μL). PCR products were purified using 0.6× beads to obtain the final library. Libraries were quality-controlled using a Qsep-400 (Qsep Inc., Taiwan), with a fragment peak at 430–530 bp, average fragment size of 420–580 bp, a normal distribution, and absence of unspecific peaks. Library concentration was quantified using a Qubit 3.0 Fluorometer (Life Technologies) (concentration ≥ 1 ng/μL). Adapter sequences were as follows: adapter3 = “AGATCGGAAGAGCACACGTCTGAACTCCAGTCAC”; adapter5 = “AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT”. Libraries were sequenced on a BGI DNBSEQ-T7 sequencer.
Raw Data were generated from the original sequencing output. To ensure the quality of bioinformatics analysis, Raw Reads were filtered to obtain Clean Reads for subsequent analyses. The main filtering steps were as follows: (1) removal of adapter sequences; (2) filtering out paired-end reads where the proportion of N (undetermined base calls) exceeded 10%; and (3) removal of low-quality reads (reads with >50% of bases having a quality score ≤ 10).
Detection of SNPs and small INDELs was conducted using the GATK [
45] toolkit: first, redundant reads were filtered with samtools to ensure accuracy. Then, GATK HaplotypeCaller (local haplotype assembly algorithm) was used to generate gVCF files for each sample, followed by population-level joint genotyping.
Variant results were rigorously filtered to ensure reliability: (1) SNPs within 5 bp of INDELs and adjacent INDELs within 10 bp were filtered using bcftools vcfutils.pl (varFilter -w 5 -W 10); (2) clusterSize 2 and clusterWindowSize 5 were set (no more than 2 variants within a 5 bp window); (3) GATK hard filtering parameters were applied: QUAL < 30 (Phred-scaled quality score for variant reliability), QD < 2.0 (ratio of variant quality to coverage depth), MQ < 40 (root mean square of mapping quality), and FS > 60.0 (strand bias estimated by Fisher’s exact test). Other parameters followed GATK official default values. The final set of variant sites was stored in VCF format.
2.5. Linkage Map Construction and QTL Analyses
To ensure marker quality, the final SNP set was filtered and screened as follows: (1) retaining markers that were homozygous and polymorphic in parents (aa × bb); (2) filtering out markers with a depth < 4×; (3) removing markers not mapped to chromosomes. After obtaining high-quality SNPs, a sliding window approach was applied across chromosomes using a 15-SNP window and 1-SNP step size. If ≥13 SNPs in a window were genotyped as “aa”, the window was assigned “aa”; if ≥13 SNPs were “bb”, it was assigned “bb”; otherwise, genotypes were imputed and corrected as “ab” [
46]. Following SNP imputation and correction, samples were aligned by chromosomal physical positions. A recombination breakpoint was defined as a genotype transition in any sample, and SNPs between breakpoints were clustered into Bins (non-recombinant intervals), generating unfiltered Bin markers. Further filtering excluded polymorphic Bin markers with length < 5 kb or severe segregation distortion (χ
2 test,
p < 0.001).
Filtered Bin markers were subjected to multi-round deduplication using HighMap software [
47]. Optimal recombination rates between markers were calculated, and genetic distances between adjacent Bins were estimated to construct a linkage map. QTL IciMapping v4.2 software [
43] was used for QTL analyses. Additive QTLs were calculated using the ICIM-ADD mapping method. The threshold for logarithm of odds (LOD) scores was set as 2.5.
3. Results
3.1. Feasibility Evaluation of Genetic Map Construction Using Resequencing Data from Two Parental Lines
SNPs that are homozygous and polymorphic (aa × bb) between parents can exhibit simple segregation patterns in F2 populations, making them the most ideal marker type for genetic map construction. Theoretically, two homozygous diploid parents can generate a large number of aa × bb-type SNP markers with high genome coverage. However, in potato, we constructed an F2 segregating population using one genomically homozygous (Y8) and one genomically heterozygous (IVP101) diploid for the first time, and whole-genome resequencing is planned to develop genome-wide SNP markers for genetic map construction.
To verify the feasibility and quality of map construction, genome resequencing was first performed on the two parents. Their clean data were aligned to the DM8.1 reference genome, yielding a total of 9,530,402 SNPs. After filtering out SNPs with parental missingness, non-polymorphism, or sequencing depth < 4×, 8,149,439 SNPs remained for genotyping (
Figure 1B). These markers were primarily classified into four types, with the nn × np type being the most abundant (5,601,059), while our target aa × bb type accounted for 1,860,910 markers (
Figure 1A). Markers were more densely distributed on chromosomes 1, 4, 9, and 10, and relatively fewer on chromosomes 2, 3, 5, 7, 8, 11, and 12 (
Figure 1C). To analyze their chromosomal distribution in detail, chromosome density plots were generated using a 1 Mb window size (
Figure 1D and
Table S1). SNP markers were present in all windows with uniform distribution, meeting the requirements for subsequent map construction.
3.2. Construction of F2 Genetic Segregating Population
F
1 seeds were obtained by crossing IVP101 (male parent) and Y8 (female parent). After randomly sowing a portion of the seeds, 49 F
1 plants were generated. To further construct an F
2 segregating population, two vigorously growing and highly self-fertile F
1 plants (designated IYY30 and IYP31) were selected for selfing, ultimately generating an F
2 genetic segregating population with 174 lines. Prior to this, given IVP101’s capacity to induce ploidy reduction, ploidy analysis of IYY30 and IYP31 was performed using a flow cytometer, with the tetraploid cultivar Cooperation-88 and the two parents (Y8 and IVP101) as controls. As shown in
Figure 2C, both IYY30 and IYP31 were diploid and had not undergone ploidy reduction influenced by the male parent IVP101.
Subsequently, preliminary screening for SSR marker polymorphism was conducted using DNA from IVP101 (male parent) and Y8 (female parent) as templates. From 231 primer pairs, two SSR markers with good polymorphism, Sot071 and Sot077, were identified (
Table S2). These markers were used for parentage identification of IYY30 and IYP31. Capillary electrophoresis results revealed that both IYY30 and IYP31 exhibited specific genetic loci from both parents (
Figure 2D), confirming they were true hybrids.
3.3. Resequencing and Genotyping of Samples
All samples were subjected to resequencing and data quality control filtering. The male parent IVP101 generated 45.71 Gb of clean data, the female parent Y8 yielded 46.59 Gb, and 174 progeny produced a total of 1693.04 Gb of clean data. All sequencing data achieved a Q30 score of over 85%. Alignment with the reference genome DM8.1 showed that the percentage of clean reads mapped to the reference genome exceeded 97% of all clean reads, indicating high alignment efficiency. The calculated genome sequencing depths for IVP101 and Y8 were 56× and 58×, respectively, with average genome coverages of 93.93% and 91.34% (at least 1× coverage). The average sequencing depth of F
2 progeny reached 12.58×, with coverage exceeding 87.97% (at least 1× coverage) (
Table 1). These results confirm high sequencing quality and uniform genome coverage distribution, which fully meets the quality requirements for subsequent studies such as genetic map construction.
SNP calling was performed on all samples using GATK, yielding a total of 11,073,260 SNP loci. Statistical analysis of homozygous and heterozygous loci revealed that the proportion of homozygous SNPs in Y8 was as high as 95.26%, whereas that in IVP101 was only 53.49% (
Table S3). These results were consistent with our expectations: Y8 is a highly homozygous inbred line, while IVP101 is a diploid with a highly heterozygous genome.
3.4. Bin Map Construction and Characteristics
Of the 11,073,260 SNPs, a total of 6,588,620 SNPs were detected between the two parents. After filtering based on thresholds, a total of 1,552,920 SNPs were retained. Using a 15-SNP window and 1-SNP step size, a sliding scan across chromosomes generated 63,595 unfiltered Bin markers. Further filtering removed Bin markers shorter than 5 kb and those with severe segregation distortion (χ
2 test,
p < 0.001), resulting in 5125 filtered Bin markers for subsequent linkage analysis (
Table 2).
After multiple rounds of redundancy removal for 5125 Bin markers using HighMap software, the optimal recombination rates between markers were calculated, and genetic distances between adjacent Bin markers were estimated, yielding the linear arrangement of Bin markers within linkage groups. A total of 4464 mapped Bin markers were ultimately obtained (
Table 3 and
Figure 3A). This map contained 12 linkage groups, corresponding exactly to the chromosome number of diploid potato. Chromosome 5 harbored the most markers (470), while chromosome 6 contained the fewest (231). The total map length of the 12 linkage groups was 1239.74 cM, with an average genetic distance of 0.28 cM between adjacent markers and an average physical distance of only 165.51 kb. Statistical analysis of genome-wide genetic distances between all markers showed that 99.57% of genetic gaps were smaller than 5 cM, with the largest gap (16.27 cM) located on chromosome 11.
The Spearman correlation coefficient was calculated for all the linkage groups. Its value near to 1 showed an improved collinearity between the physical and genetic maps. The order of most markers on every chromosome of the project is consistent with the genome, indicating that collinearity is good and the calculation accuracy of the genetic recombination rate is high (
Figure 3B).
3.5. Phenotypic Analysis of F2 Population and Parents
We investigated eight phenotypic traits (
Table S4) in parental lines IVP101 and Y8, two F
1 hybrids (IYY30 and IYP31), and 174 F
2 lines across three environments. These included three yield-related traits: Tuber Number per Plant (TNP), Tuber Fresh Weight per Plant (TFWP), and Mean Tuber Weight (MTW); three tuber-shape-related traits: Tuber Length (TL), Tuber Width (TW), and Tuber Length-to-Width Ratio (TLWR); and two eye-related traits: Tuber Eye Number (TEN) and Eye Number-to-Tuber Volume Ratio (ENTVR). Best linear unbiased estimates (BLUEs) were calculated for the eight traits based on phenotypic values across the three environments. The results revealed highly significant differences in all eight traits between the two parents and between the two F
1 hybrids (
Figure 2A), indicating substantial genetic differences between the two parents, which are beneficial for QTL identification. For the F
2 population, all eight traits exhibited wide variation (
Figure 2B), with high coefficients of variation observed for all traits except TLWR. Additionally, the phenotypic values of all eight investigated traits showed continuous unimodal distributions, consistent with quantitative traits controlled by multiple genes (
Table 4 and
Figure 4). We also analyzed broad-sense heritability across four environments, which ranged from 53.86% to 76.09% (
Table 4), indicating that these traits are primarily genetically controlled despite environmental influences.
Correlations between pairs of the eight traits were calculated using BLUE values, with results presented in
Table 5. TFWP, a direct indicator of yield, can be decomposed into two yield components: TNP and MTW. In this study, TFWP showed a correlation coefficient of 0.49 with TNP and 0.88 with MTW. Additionally, TL and TW effectively characterize tuber size; notably, pairwise correlation coefficients among TFWP, MTW, TL, and TW exceeded 0.85, indicating a greater contribution of MTW to yield. TLWR was uncorrelated or weakly correlated with other traits. TEN exhibited moderate positive correlations with TL, TW, and MTW, while TLWR showed strong negative correlations with TL, TW, and MTW.
3.6. QTL Mapping in the F2 Population
Using the inclusive composite interval mapping method, a total of 148 additive QTLs were detected with IciMapping v4.2 software based on the high-density genetic map (
Table S5). These included 39 in E1, 31 in E2, 36 in E3, and 42 based on BLUE values. QTLs were distributed across all chromosomes, with chromosome 9 harboring the fewest (1 QTL) and chromosome 12 the most (33 QTLs). The phenotypic variation explained (PVE) of all QTLs ranged from 0.16% to 30.45%. By trait category, 50 QTLs were identified for tuber yield traits (13 for TNP, 19 for TFWP, and 18 for MTW), 60 for tuber shape traits (23 for TL, 20 for TW, and 17 for TLWR), and 38 for tuber eye traits (17 for TEN and 21 for ENTVR). After merging QTLs by physical position, 89 unique genetic loci were obtained, comprising 32 for tuber yield, 33 for tuber shape, and 24 for tuber eye traits.
To further authenticate the correctness of these results, loci of the present study were compared with reported QTLs previously identified by the linkage or association mapping approach. We discovered seven loci that have been reported previously (
Table S5).
qTY-2 was detected by both TFWP and MTW. This locus was also significantly associated with mean tuber weight in another diploid GWAS study [
36].
qTY-5-4, identified by MTW in this study, was significantly associated with mean tuber weight in a diploid GWAS study [
36].
qTY-8-3 was detected by both TFWP and MTW in this study and was also significantly associated with mean tuber weight in another diploid GWAS study [
36].
StOFP20 [
39], the only cloned and validated tuber shape regulatory gene to date, was notably detected in our study as
qTS-10-2 via genetic mapping combined with TLWR phenotypic data. Furthermore,
qTS-5-2, detected by both TL and TLWR in this study, was identified by Meijer et al. [
13]. using a high-density genetic map of an F
2 segregating population, explaining 6.3% of phenotypic variation. In this study,
qTS-1-2 was detected by TW and co-localized with the tuber shape-associated significant SNP locus chr01_84614836 [
36]. For tuber eye traits, Endelman and Jansky [
29] identified a QTL at chr05: 47,991,614–49,804,489 using eye tubers (yes/no) as a trait and a high-density genetic map, which was also detected by TEN in our population (
qTE-5-4). Overall, our QTLs mapped using this high-density bin map are reliable.
3.7. Identification of Stable QTLs
QTL mapping was conducted for 8 traits across three environments, resulting in the identification of 89 loci (
Table S5). Some loci were detected by two or more traits; notably, 10 loci were mapped by traits in at least two environments. These QTLs were repeatedly detected across environments, exhibited stable expression, and thus had higher reliability, leading to their designation as stable QTLs. Three stable QTLs were associated with tuber yield:
qTY-1-3,
qTY-5-2, and
qTY-12-6. Four stable QTLs were associated with tuber shape:
qTS-2-1,
qTS-5-1,
qTS-8-3, and
qTS-12-3. Three stable QTLs were associated with tuber eye number:
qTE-10-2,
qTE-12-2, and
qTE-12-4 (
Table 6 and
Figure 5).
Notably, among the 10 stable QTLs, two exhibited pleiotropy. The locus on chromosome 5 (qTY-5-2 and qTS-5-1) co-regulates TL, TW, and MTW. The pleiotropic locus at the terminal region of chromosome 12 (qTY-12-6, qTS-12-3, and qTE-12-4) regulates all eight traits, with high phenotypic variation explained: a maximum of 25.26% for yield (TFWP-E3), 30.45% for tuber shape (TLWR-E2), and 23.90% for tuber eye traits (ENTVR-E2). These results confirm it as a major-effect locus with significant research and application value.
3.8. Validation of the Effects of the 10 Stable QTLs
To further clarify the effects of the ten stable QTLs, without considering interactions between loci, we summarized phenotypic differences among the three genotypes of each locus in the F
2 population (
Table 7). After segregation and recombination, each locus in the F
2 population exhibits homozygous paternal (II), maternal (YY), and heterozygous (IY) genotypes. Therefore, we first classified F
2 individuals into IVP101-type, Y8-type, and H-type based on marker genotypes of identified loci, then compared phenotypic differences in corresponding traits calculated using BLUE values.
Results showed that both parents contributed favorable alleles. For most QTLs, the average phenotypic values of all corresponding traits in F2 lines carrying favorable alleles were significantly (p < 0.05) higher than those carrying unfavorable alleles. For example, at the qTY-5-2, the IVP101-derived allele increased average tuber weight by ~2.30 g and total tuber weight per plant by 15.54 g compared to the Y8-derived allele. At qTS-5-1, the IVP101 allele increased tuber length and width by 5.28 mm and 2.89 mm, respectively, with a corresponding 0.09 increase in TLWR. At qTE-12-2, the Y8-derived allele reduced eye number by ~0.48 per cm3. Regarding the major-effect QTL regulating all eight traits (qTY-12-6, qTS-12-3, and qTE-12-4), the allele derived from parent Y8 effectively enhances yield, promotes rounder tuber shape, and reduces tuber eyes. Specifically, it increases mean tuber weight by ~1.90 g and tuber number per plant by ~6.40 tubers; simultaneously, it increases tuber length and width by 1.81 mm and 3.58 mm, respectively, while reducing tuber eyes by 1.27 per cm3.
Additionally, some loci exhibited dominant heterozygous advantage, where heterozygous genotypes showed significantly higher phenotypic values than both parental genotypes. For instance, qTS-8-3 heterozygotes displayed the longest tuber length and width compared to both parental genotypes.