Next Article in Journal
Melatonin Inhibits hIAPP Oligomerization by Preventing β-Sheet and Hydrogen Bond Formation of the Amyloidogenic Region Revealed by Replica-Exchange Molecular Dynamics Simulation
Next Article in Special Issue
Membrane Proteomic Profiling of Soybean Leaf and Root Tissues Uncovers Salt-Stress-Responsive Membrane Proteins
Previous Article in Journal
Editorial for the Special Issue “New Drugs for Breast Cancer Treatment”
Previous Article in Special Issue
Metatranscriptomic Analysis Reveals Rich Mycoviral Diversity in Three Major Fungal Pathogens of Rice
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Construction of a High-Density Recombination Bin-Based Genetic Map Facilitates High-Resolution Mapping of a Major QTL Underlying Anthocyanin Pigmentation in Eggplant

1
Key Laboratory of Genetics, Breeding and Multiple Utilization of Crops, Ministry of Education, College of Agriculture/College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou 350002, China
2
Fujian Provincial Key Laboratory of Crop Breeding by Design, Fujian Agriculture and Forestry University, Fuzhou 350002, China
3
Marine and Agricultural Biotechnology Laboratory, Fuzhou Institute of Oceanography, Minjiang University, Fuzhou 350108, China
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2022, 23(18), 10258; https://doi.org/10.3390/ijms231810258
Submission received: 1 August 2022 / Revised: 31 August 2022 / Accepted: 3 September 2022 / Published: 6 September 2022
(This article belongs to the Special Issue Power Up Plant Genetic Research with Genomic Data)

Abstract

:
High-density genetic maps can significantly improve the resolution of QTL mapping. We constructed a high-density recombination bin-based genetic map of eggplant based on 200 F2 plants from an interspecific cross (Solanum melongena × S. incanum) using the whole genome resequencing strategy. The map was 2022.8 cM long, covering near 99% of the eggplant genome. The map contained 3776 bins, with 3644 (96.5%) being effective (position non-redundant) ones, giving a nominal average distance of 0.54 cM and an effective average distance of 0.56 cM between adjacent bins, respectively. Using this map and 172 F2:3 lines, a major QTL with pleiotropic effects on two anthocyanin pigmentation-related traits, leaf vein color (LVC) and fruit pericarp color (FPC), was steadily detected in a bin interval of 2.28 cM (or 1.68 Mb) on chromosome E10 in two cropping seasons, explaining ~65% and 55% of the phenotypic variation in LVC and FPC, respectively. Genome-wide association analysis in this population validated the QTL and demonstrated the correctness of mapping two bins of chromosome E02 onto E10. Bioinformatics analysis suggested that a WDR protein gene inside the bin interval with reliable effective variation between the two parents could be a possible candidate gene of the QTL.

1. Introduction

During the development of quantitative genetics, one of the most important events was the advent of molecular marker technologies, which enables the construction of genetic maps for mapping quantitative trait loci (QTLs), dissecting the genetic basis of a quantitative trait into individual Mendelian factors. The resolution of QTL mapping is determined by many factors, such as the complexity and heritability of the trait, the type and size of the population, the marker density of the genetic map, and the power of the statistical method. High marker density is a precondition for high mapping resolution. The fast development of the next-generation sequencing (NGS) technologies has dramatically reduced the costs of high-throughput genotyping and made it practical to construct a high-density genetic map.
There can be two strategies of NGS-based genotyping for constructing a high-density genetic map. One is genotyping by partial genome sequencing (PGS), namely, genotyping individuals by sequencing a selected part of the genome. A few methods for PGS have been developed, such as restriction site-associated DNA sequencing (RAD-seq) [1,2], genotyping-by-sequencing (GBS) [3], and specific length amplified fragment sequencing (SLAF-seq) [4]. The merit of PGS is that it does not need a reference genome sequence and can reduce the cost, especially useful for the species with large genome size. So far, high-density genetic map construction has been mainly performed based on this strategy. It has been used in many plant species, such as soybean [5], cucumber [6], chickpea [7], durum wheat [8], rice [9,10], grape [11], Brassica rapa [12], wheat [13], and potato [14].
The other strategy is genotyping by whole genome resequencing (WGR). This strategy was first proposed by Huang et al. [15]. It roughly involves four steps: (1) resequencing the whole genome of each individual in the mapping population at a very low sequencing depth; (2) collectively examining genome-wide polymorphisms for genotype calling and recombination breakpoint determination using a sliding window approach; (3) dividing the genome into recombination bins; and (4) analyzing the genetic linkage among the bins. As WGR can identify all polymorphisms in a population in principle, it has the potential to uncover all recombination events in the population so as to maximize the density and the resolution of the genetic map. This is an advantage of WGR over PGS. Apart from in rice [15,16], this strategy has been used in some other species, such as soybean [17], maize [18], pepper [19], melon [20], peanut [21], and radish [22]. As reference genome sequence is available in more and more species and NGS becomes cheaper and cheaper, the WGR strategy will be used more widely in high-density genetic map construction.
Eggplant (Solanum melongena L.) is an important vegetable crop cultivated and consumed worldwide. To facilitate the genetic studies in eggplant, a number of genetic maps of eggplant have been constructed since the early of this century [23,24,25,26,27,28,29,30,31,32]. Most of the maps, however, were constructed based on traditional molecular markers and had a low marker density. Recently, two high-density genetic maps were constructed with the PGS strategy in eggplant. One was constructed based on an interspecific (S. melongena × S. linnaeanum) F2 population consisting of 121 individuals using the SLAF-seq approach [33]. The map comprises 2122 single nucleotide polymorphism (SNP) markers, with a total length of 1530.75 cM and an average distance of 0.72 cM between adjacent markers. The other map was constructed based on a population of 163 recombinant inbred lines (RILs) derived from a cross between two cultivated eggplant varieties using the GBS approach [34]. The map contains 7249 SNP markers, with a total length of 2169.23 cM and an average distance of 0.4 cM between adjacent markers. More recently, a bin-based genetic map (or termed bin map for short) was constructed based on a population of 100 F2 individuals developed from an interspecific cross (S. melongena × S. insanum) using the WGR strategy [35]. The map consists of 3918 bins, with a length of 1384.62 cM in total and a distance of 0.58 cM between adjacent bins on average. These maps have exhibited the merit of high marker density for increasing the resolution of gene/QTL mapping.
Eggplant is rich in anthocyanins, which are beneficial for human health [36]. So, increasing anthocyanin content is an important goal in eggplant breeding. For this purpose, it is necessary to understand the genetic basis and regulatory mechanism of anthocyanin accumulation in eggplant. In the past decade, especially in recent years, remarkable progress has been made in the molecular genetic research of anthocyanin accumulation in eggplant. A number of QTLs related to anthocyanin accumulation have been mapped on all eggplant chromosomes except chromosome E04 [23,34,37,38], and some anthocyanin metabolism-related genes have been identified using the approaches of reversed genetics, including transcriptomic or proteomic analysis [39,40,41] and homologous or candidate gene analysis [42,43,44,45]. However, as many genes/QTLs are involved in the control of anthocyanin accumulation, the genetic basis and regulatory mechanism underlying the anthocyanin metabolism in eggplant is still far from being clarified.
In this study, we constructed a high-density recombination bin-based genetic map based on an inter-specific F2 population and applied it to mapping QTLs for anthocyanin pigmentation in eggplant. We identified a major QTL displaying pleiotropic effects on the anthocyanin pigmentation in both leaf and fruit and demonstrated that the high-density map facilitated high-resolution mapping of the major QTL and prediction of its possible candidate gene.

2. Results

2.1. Genotyping Results

After sequencing, ~178.06 M and 138.30 M clean reads were obtained in the parental lines YZQ (S. melongena) and YQ (S. incanum), respectively, of which 98.69% and 98.83% could be mapped to the reference genome. The number of clean reads obtained in each of the 200 F2 plants was ~17.83M on average, varying 13.51 M–31.33 M, of which 99.12% (97.48–99.51%) could be mapped to the reference genome. The Q20 of each F2 plant varied 94.72–97.75%, with an average of 96.75%. These results indicated that the sequencing data were good in quality overall.
Based on the sequencing data, a set of 1,273,701 markers (SNPs and short InDels) meeting the three criteria (diallelic; homozygous in YZQ; minor allele frequency, MAF ≥ 0.3) were selected in the F2 population. The markers were not evenly distributed in the genome (Figure 1a). In most of the chromosomes, markers were mainly located close to the two ends. In chromosomes E01, E07 and E11, however, there was a long segment with higher marker density in the middle region, respectively. In addition, marker density was very low in the left terminal region of E02. The proportion of missing marker genotype data in each F2 plant was ~0.14 on average, varying 0.03–0.26. So, the number of informative markers in each F2 plant would not be smaller than 1.27 M × (1 − 0.26) = 0.94 M, which should be adequate for constructing a recombination bin-based genetic map.
To evaluate the quality of the F2 population, we analyzed the following parameters based on the marker data: (1) The MAFs of markers. The MAFs were largely distributed between 0.45 and 0.5 in the F2 population, peaking at ~0.4875 (Figure 1b), very close to the expected peak value 0.5. (2) The Proportion of Heterozygous marker Genotypes (PHG) in each F2 plant. The observed PHG ( p o ) was ~0.25 on average, varying 0.10–0.42. Theoretically, the expected PHG ( p e ) in an F2 individual is 0.5. However, because there is a probability ( p m ) that a heterozygous genotype is misjudged as a homozygous genotype depending on the sequencing depth, p o must be smaller than p e . It is easy to find that p m = 1 / 2 k 1 , where k is sequencing depth. In this study, the sequencing depth for each F2 plant was ~2×. Therefore, p m 0.5 on average. It can be predicted that the average p o = p e × 1 p m = 0.5 × 1 0.5 = 0.25 . This is consistent with the average PHG observed. (3) The genetic relationship between individuals [46]. The standardized relationship coefficient (or termed relatedness score) approximately followed a normal distribution in the F2 population (Figure 1c), suggesting that the genetic segregation in the F2 population was symmetric and unbiased overall. The above results together indicated that the F2 population was normal in structure, suitable for genetic mapping.

2.2. Recombination Bin-Based Genetic Map Constructed

The genetic map constructed based on the F2 population was 2022.8 cM in length, containing 14 linkage groups and 3776 bins (Table 1; Figure 2; Table S1). The bin length was 135.9 kb on average, varying from 10 to 16,490 kb, with ~82%, 87%, 90%, 94% and 96% smaller than or equal to 150, 200, 250, 300 and 400 kb, respectively. The total length of the bins was ~498.32 Mb, accounting for 46.44% of the eggplant genome sequence, indicating that about half of the genome sequence was not included as bins in the genetic map. There were 132 redundant bins in terms of the position in the genetic map (when there were multiple bins at a position in the map, they function only as one effective marker, so the extra bins were redundant). So, the number of effective bins (bin markers) in the map was 3776 − 132 = 3644. Each chromosome corresponded to a linkage group in the map except for E04, which was divided into three linkage groups (LG4a, 4b and 4c). Overall, the bin density was high throughout the map. The nominal average distance (between adjacent bins, including redundant bins) in each linkage group varied from 0.42 cM (LG4c) to 0.78 cM (LG4b), and was 0.54 cM in the whole map; whereas the effective average distance (redundant bins excluded) was close to the nominal one in each linkage group, varied from 0.42 cM (LG4c) to 0.98 cM (LG4b), and was 0.56 cM in the whole map (Table 1). The largest spacing between adjacent bins was 12.3 cM in LG7 (Figure 2). The physical span (from the beginning of the first bin to the end of the last bin) of each linkage group (or the sum of the three linkage groups of E04) was exactly or nearly equal to the full length of the corresponding chromosome except for LG2 (Table 1). In total, the genetic map physically spanned ~1.06 Gb, covering near 99% of the eggplant genome (Table 1). Therefore, the genetic map basically represented the whole eggplant genome.
The genetic map constructed based on the F2 population was 2022.8 cM in length, containing 14 linkage groups and 3776 bins (Table 1; Figure 2; Table S1). The bin length was 135.9 kb on average, varying from 10 to 16,490 kb, with ~82%, 87%, 90%, 94% and 96% smaller than or equal to 150, 200, 250, 300 and 400 kb, respectively. The total length of the bins was ~498.32 Mb, accounting for 46.44% of the eggplant genome sequence, indicating that about half of the genome sequence was not included as bins in the genetic map. There were 132 redundant bins in terms of the position in the genetic map (when there were multiple bins at a position in the map, they function only as one effective marker, so the extra bins were redundant). So, the number of effective bins (bin markers) in the map was 3776 − 132 = 3644. Each chromosome corresponded to a linkage group in the map except for E04, which was divided into three linkage groups (LG4a, 4b and 4c). Overall, the bin density was high throughout the map. The nominal average distance (between adjacent bins, including redundant bins) in each linkage group varied from 0.42 cM (LG4c) to 0.78 cM (LG4b), and was 0.54 cM in the whole map; whereas the effective average distance (redundant bins excluded) was close to the nominal one in each linkage group, varied from 0.42 cM (LG4c) to 0.98 cM (LG4b), and was 0.56 cM in the whole map (Table 1). The largest spacing between adjacent bins was 12.3 cM in LG7 (Figure 2). The physical span (from the beginning of the first bin to the end of the last bin) of each linkage group (or the sum of the three linkage groups of E04) was exactly or nearly equal to the full length of the corresponding chromosome except for LG2 (Table 1). In total, the genetic map physically spanned ~1.06 Gb, covering near 99% of the eggplant genome (Table 1). Therefore, the genetic map basically represented the whole eggplant genome.
According to the reference genome sequence, the order of the three linkage groups of E04 is LG4a-LG4b-LG4c (Table S1). However, the three linkage groups could not be linked accordingly because the estimated genetic distance between LG4a and LG4b and that between LG4b and LG4c were both very large (>90 cM). When there was no restriction on the order, LG4b still could not be linked to LG4a and LG4c. Hence, LG4b appeared to be genetically independent from LG4a and LG4c. Surprisingly, LG4a and LG4c could be linked in the absence of LG4b with a seemingly acceptable gap (25.8 cM) between them.
There were 45 bins that could not be mapped according to the physical order in the reference genome, but 13 of them were successfully mapped to the linkage groups of other chromosomes, including one from E12 to LG5; two from E02 to LG10; one from E04 and three from E05 to LG9; and one from E02, E05 and E09 each and three from E10 to LG11 (Table S2). The two bins of E02 (B02_129 and 130) mapped to LG10 were adjacent and therefore actually represented a continuous sequence in the reference genome. Similar situations also existed for the three bins of E05 (B05_207, 208 and 209) mapped to LG9 and the three bins of E10 (B10_194, 195 and 196) mapped to LG11 (Table S2). After mapped to LG10 and LG11, the two bins of E02 and the three bins of E10 were merged (as B02_129 and B10_194), respectively (Table S1).
By comparing the genetic map with the physical map (reference genome), it was found that each chromosome could be divided into three regions (I, II and III) with two obvious distinguishing features (Figure 3). One feature is bin density. In all the chromosomes except for E02, regions I and III had a very high bin density, while region II had much lower bin density (Figure 3). This was basically consistent with the marker density distribution in the genome (Figure 1a). In addition, this result also indicated that the sequences not contained as bins in the genetic map were mainly in the middle region (region II) of each chromosome. The other feature is Ratio of Genetic distance to Physical distance (RGP). The average RGPs (=total genetic length/total physical length) in regions I and III were high in general, while that in region II was very low (Table 2). This can be intuitively seen in Figure 3, where the overall slope of the genetic position-physical position curve appears to be very small in region II in each chromosome, suggesting that a very large physical distance is only equivalent to a small genetic distance in region II. Therefore, genetic recombination must occur mainly in regions I and III, but rare in region II. This explains why a high-density genetic map spanning the whole chromosome could be constructed although there were very few bins from region II.
Among the 12 chromosomes, three appeared to be special. The first was E02. Its first 10 Mb region was not covered by the genetic map (Figure 3). The reason was that the marker density in this region was very low (Figure 1a). It was unable to identify the region I in E02 (Figure 3). The marker density distribution also suggested that there was no region I in E02 (Figure 1a). Hence, E02 only contained two regions: region II with low average RGP and region III with high average RGP (Table 2). The second was E04. Its regions I and III were both broken into two parts. LG4b spanned the three regions, whereas LG4a and LG4c were only a part of region I and region III, respectively (Figure 3). The third was E07. The average RGP in the region II of E07 was much larger than those of other chromosomes (Table 2; Figure 3). In addition, although region II had a small average RGP in general, it contained some positions displaying obviously steep slopes (large RGPs) in some chromosomes, especially in E02, E04 and E07 (Figure 3). These positions must be the hotspots of genetic recombination.

2.3. QTLs Mapped and Candidate Gene Predicted

Based on the field experiments of 172 F2:3 lines in two different cropping seasons, a major QTL was mapped on E10 in each of the two traits, leaf vein color (LVC) and fruit pericarp color (FPC), in both seasons, named as qLVC10 and qFPC10b, respectively (Table 3; Figure S1). The positions and bin intervals of the major QTLs detected in different traits and different seasons were completely the same, all being 63 cM and B10_240-B10_249 (60.97–63.25 cM), respectively (Table 3; Table S1). The results suggested that these major QTLs were actually the same QTL with pleiotropic effects, and the mapping result was reliable and of high resolution (within a marker interval of 2.28 cM) for a primary mapping population of a median size. The major QTL had very high heritability, which could explain ~65% and 55% of the phenotypic variation in LVC and FPC, respectively (Table 3). It showed very large additive effects but negligible dominance effects in the two traits (Table 3). The allele from the cultivated parent YZQ increased the trait value, while the allele from the wild parent YQ acted negatively (Table 3). This was consistent with the phenotypes of the two parents, namely, the colors of leaf vein and fruit pericarp were purple in YZQ, but not purple (no anthocyanin pigmentation) in YQ, while those in F1 were medium between the two parents (Figure 4).
Apart from the major QTL, there were three and two minor QTLs detected in LVC and FPC, respectively (Table 3). All of the minor QTLs were detected only in the first season, and there were no common minor QTLs between the two traits. In addition, the LOD scores of all the minor QTLs except qFPC6 were only slightly over the threshold. Therefore, most of these minor QTLs detected might not be very reliable.
Based on the marker (SNP and short InDel) genotype data of the F2 population, we also performed genome-wide association study (GWAS) on the two anthocyanin pigmentation traits using the method of mixed linear model (MLM) implemented by the R program rMVP [47]. The GWAS results also indicated the existence of a major QTL in the region between 60 Mb and 75 Mb on chromosome E10 (Figure S2). In addition, some of the minor QTLs mapped by the linkage analysis (e.g., qFPC6) also showed obvious signal in the GWAS (Figure S2; Table 3). Therefore, the results of linkage analysis and GWAS were consistent. However, it is noticeable that there was an obvious steady –log10(P) peak detected at ~30 Mb on chromosome E02 in both traits and both cropping seasons (Figure S2), but no QTL mapped at that position (Figure S1). By careful examination, we found that this –log10(P) peak was just located at the two bins, B02_129 and B02_130, which were mapped to E10 in the genetic map, located inside the bin interval of the major QTL and completely linked with B10_249 (Tables S1 and S2; Table 3). This explains the cause of the –log10(P) peak on E02.
The bin interval of the major QTL (B10_240-B10_249) was 1.68 Mb wide in the reference genome (Table 3), harboring 103 genes according to the annotation (Table S3). In addition, it is interesting that the bin B02_129 from E02 happened to be mapped between B10_240 and B10_249, completely linked with (0 cM to) B10_249 (Table S1). The size of B02_129 was 400 kb, in which only four genes were annotated (Table S3). Among these annotated genes, only one (Smechr1001624) met the criteria for reliable effective variation between the wild parent YQ and the cultivated parent YZQ. A SNP (A/G) between the two parents was found at 64,566,331 bp inside Smechr1001624, making the codon AGT in YQ altered into GGT in YZQ and therefore leading to a change of the 743rd amino acid in the encoded protein from Serine in YQ into Glycine in YZQ. Smechr1001624 is predicted to encode a WD repeat-containing (WDR) protein (Table S3). WDR proteins are a very large family. It has been found that WDR proteins play a critical role in the regulation of anthocyanin biosynthesis in many plant species [40,48,49,50,51,52]. WDR physically interacts with MYB and bHLH to form a transcriptional complex, which participates the regulation of the flavonoid pigment pathway [50]. The above information suggests that Smechr1001624 could possibly be a candidate gene for the major QTL.

3. Discussion

In this study, we constructed a high-density recombination bin-based genetic map of eggplant using an interspecific (S. melongena × S. incanum) F2 population consisting of 200 individuals (Table 1). Only one recombination bin-based genetic map of eggplant has been reported before [35] (named as Qian’s map here). Qian’s map is also constructed based on an interspecific F2 population by means of low-depth resequencing (~1.4× for each individual on average), but the wild parent is a different species (S. insanum), and the population is smaller (only 100 individuals). Qian’s map is 1384.6 cM in length (~32% shorter than ours) but contains 3918 bins (~4% more than those in our map). However, most (2370/3918 = 60.5%) of the bins in Qian’s map are redundant; only 1548 are effective bins (~42% of the number in our map). Therefore, the effective average distance between adjacent bins in Qian’s map is ~0.90 cM, which is ~60% larger than that in our map. So, our map is much longer and much denser than Qian’s map. Since the most apparent difference between our study and that of Qian et al. is that the population we used is twice as large as that used by them, population size must be the major cause leading to the significant difference between our map and Qian’s map in total length and effective density, indicating the importance of population size for high-density genetic map construction. Theoretically, the number of recombinations occurring in a population is directly proportional to the population size, while the number of markers (mainly SNPs) revealed by sequencing in a population is usually large enough to identify every genetic recombination occurred. Therefore, population size is the key for achieving the high density of a genetic map. In principle, the population size should be as larger as possible as long as the budget allows.
The method of constructing recombination bin-based genetic map by low-depth resequencing was originally proposed for permanent (e.g., RIL) populations [15]. So, most of the bin maps reported were constructed using permanent populations [15,16,17,18,19,21]. Nevertheless, F2 populations have also been used for constructing bin maps [20,22,35], although it suffers a problem that the heterozygous genotypes of markers are often miscalled as homozygous genotypes under low-depth sequencing. A way to deal with this problem is to predict the most possible marker genotypes by integrating the information of multiple nearby markers. Following this principle, the method of Huang et al. [15] was still adopted for F2 populations after modification in those studies reported. In the present study, we developed a different method for the bin map construction based on F2 populations. The results demonstrated the feasibility of our method. Although the current approaches appear to be effective, the high proportion of misjudgment on heterozygous genotypes due to low sequencing depth may still bring about genotyping mistakes, especially impeding precise identification of the exchange breakpoints of chromosomes, so as to affect the quality of the map. It has mentioned in Section 2.1 that the probability of a heterozygous genotype being misjudged as a homozygous genotype is p m = 1 / 2 k 1 , where k is sequencing depth. According to this formula, it can be found that the probability would be <1/16 when sequencing depth > 5×. So, a little higher (than low) sequencing depth is beneficial to the bin map construction based on F2 populations. For example, Luo et al. constructed a bin map in radish using an average sequencing depth of 7.2× for each F2 individual [22]. However, increasing sequencing depth will increase the cost.
Our study revealed that the relationship of genetic position with physical position in each eggplant chromosome usually appeared as a curve of three slopes, namely, steep slope → gentle slope → steep slope; but in chromosome E02 the curve only contained two slopes, namely, gentle slope → steep slope (Figure 3). Similar phenomenon was observed in the study of Qian et al. [35]. In particular, E02 also displays a curve of two slopes in their studies, suggesting that the exceptive performance of E02 is not an accidental result but a characteristic of E02. The nonlinear relationship between genetic position and physical position suggests that it is inappropriate to use genome-wise average RGP but region-wise average RGP in the studies of map-based cloning of genes.
Chromosome E04 was split into three linkage groups in the bin map constructed in this study. Physically, the three linkage groups were continuous according to the reference genome (Table S1). Each linkage group looked normal and they together could form a continuous three-slope relationship curve between genetic position and physical position similar to those of other chromosomes (Figure 3). So, the breakage of E04 as three linkage groups did not seem to be caused by experimental errors or data analysis errors. The result was really puzzling. Perhaps, the breakpoints between LG4a and LG4b and between LG4b and LG4c were two recombination hotspots with very high recombination rates in this population. Anyway, the separated linkage groups of E04 may not greatly affect the application of the map to QTL mapping.
In this study, we mapped a major QTL for anthocyanin pigmentation on E10. A major QTL for anthocyanin pigmentation has also been mapped on E10 before [23,34,37], which is estimated to be located at about 94-95 Mb in E10 [34] in the reference genome of S. melongena 67/3 line [53]. According to alignment analysis, we found that this location approximately corresponded to 66-67 Mb in the reference genome of SME-HQ1315 [54], very close to location of the major QTL mapped in this study (Table 2). So, we suspect that the reported major QTL and the one mapped in this study might be the same.
We have seen in the results that the GWAS validated the existence of the major QTL on chromosome E10. Besides, it also proved the correctness of the result that the two bins of chromosome E02 (B02_129 and B02_130) according to the reference genome were mapped to E10 in this study. This demonstrated an additional usefulness of high-density bin-based genetic maps for correcting the mistakes of genome sequence assembly and discovering the false localization of a major QTL by GWAS due to the incorrect genome sequence assembly.

4. Materials and Methods

4.1. Plant Materials

An eggplant (S. melongena) cultivar Yanzhiqie (YZQ; accession ID: V06B0149) and a wild relative (S. incanum) germplasm Yeqie (YQ; accession ID: V06B0864) kindly provided by the Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences were used as parents. The two parents are distinct in many traits including anthocyanin pigmentation. YZQ shows purple color in leaf vein and fruit pericarp, while YQ does not (Figure 4). A YZQ plant (female parent) and a YQ plant (male parent) were crossed, and an F1 plant was selfed to produce an F2 population of 200 plants. Further, an F2:3 population of 172 lines were developed by selfing the F2 plants.

4.2. Detection of Marker Genotypes by Whole Genome Resequencing

Genomic DNA was extracted from young leaves of every F2 plant and the two parents. The DNA samples were sequenced based on pair-end libraries using either the BGISeq500 platform (for the F2 plants, conducted by BGI) or the Illumina NovaSeq platform (for the parents, conducted by Novagene). The sequencing depth for each F2 plant was ~2×, while those for parents YZQ and YQ were ~53× and 42×, respectively. Adapter removing and quality control of reads were performed using the software Fastp [55] (https://github.com/OpenGene/fastp, Chen S., Shenzhen, China). Clean reads (150 bp in length each) were mapped to the eggplant reference genome SME-HQ1315 [54] (http://eggplant-hq.cn, accessed on 10 June 2020) using the software BWA-MEM [56] (http://maq.sourceforge.net, Li H., Cambridge, UK) and then sorted using the software SAMtools [57] (http://samtools.sourceforge.net, Li H., Cambridge, UK). Variants (including SNPs and short InDels) and their genotypes in the parents and the F2 plants were identified using the software FreeBayes [58] (v2, https://github.com/freebayes/freebayes, Garrison E., Boston, MA, USA), and the variants meeting the following criteria were selected as molecular markers: (1) diallelic; (2) homozygous in the parent YZQ; and (3) the minor allele frequency (MAF) ≥ 0.3 in the F2 population. According to the marker genotype information of the two parents, the genotype of every marker in each F2 plant was determined, denoted as 0 (for YZQ genotype), 1 (for heterozygous genotype) or 2 (for YQ genotype).

4.3. Construction of Recombination Bin-Based Genetic Map

A main problem of using F2 population to construct recombination bin-based genetic map is that each marker has heterozygous genotype, which is easy to be misidentified as parental genotypes under low sequencing depth. For this reason, we designed a method different from that of Huang et al. [15] to determine recombination bins. We used a sliding window with fixed size (300 kb) to calculate a genotype index for judging the genotype of the window center in each F2 plant. The genotype index was defined as: GI = 0 × p 0 + 1 × p 1 + 2 × p 2 , where p 0 , p 1 and p 2 were the proportions of markers with the YZQ genotype, the heterozygous genotype and the YQ genotype within the window, respectively. Obviously, GI varied between 0 and 2. It would be close to 0 when the window was in a YZQ-genotype region, but close to 2 when in a YQ-genotype region. In a heterozygous region, as a heterozygous marker had an equal probability to be misidentified as either the YZQ genotype or the YQ genotype, the genotyping errors of different markers could be partially eliminated after averaging in the sliding window, making the GI value vary around 1. So, in principle, genomic positions with different genotypes could be distinguished according to their GI values. In order to use GI to discriminate genotype explicitly, we established the following function after trial to transform GI into integer: let GI = 0 (YZQ genotype) when GI ≤ 0.2; 1 (heterozygous genotype) when 0.2 < GI ≤ 1.8; and 2 (YQ genotype) when GI > 1.8. To get reliable or valid GI estimates, a minimum number of 10 markers was required in a window. The window moved with a step length of 10 kb from one end to the other on each chromosome. Thus, a GI estimate was obtained every 10 kb in the genome in each F2 plant.
Based on the integer GI data, co-segregations of adjacent positions in the F2 population were examined. A genomic segment containing a string of consecutive positions that completely co-segregated was defined as a recombination bin. So, every chromosome was partitioned into many bins. Each bin behaved as a separate unit for genetic recombination like a marker in the population. Using chi-square test, bins showing very serious deviation from the theoretical segregation ratio (1:2:1) in the F2 population (chi-square > 25, p-value < 105) were filtered out. Then, by fixing the order of the bins in each chromosome according to the reference genome sequence, the genetic distances between adjacent bins were calculated using the software QTL IciMapping v4.2 [59] (https://isbreedingen.caas.cn/software/qtllcimapping/294607.htm, accessed on 24 July 2019). For those bins that could not be mapped according to the expected order, each was taken as an inquiry bin to search for the target bin that was closely linked to it, using the option Rippling provided in the software. If a target bin was found, the inquiry bin was tried to be inserted into the map beside the target bin. After no more bins could be added into the map, the following procedure was conducted on each linkage group: (1) bin genotype errors detected by the linkage analysis were corrected according to the flanking bins; (2) bins that had the same genetic position and were physically consecutive were merged as a larger bin; and (3) linkage analysis was performed again. This procedure was repeated until no bin genotype errors could be found and no bins could be merged in the map.

4.4. Field Experiment and Phenotyping

Field experiments of the F2:3 population with the two parents as controls were carried out twice in the farm of Fujian Agriculture and Forestry University at Yangzhong, Youxi (E118.485841, N26.287161) in two different cropping seasons in 2020. In the first season, seeds were sown in growing trays after pregermination on 20 February, and seedlings were transplanted into the field on 5 April. In the second season, seeds were sown on 9 May and seedlings were transplanted on 12 July. In both seasons, the seedlings to be transplanted were all at the age of five full leaves plus one emerging leaf. The seedlings were planted in a regime of 80 cm between rows and 100 cm between plants, under a randomized complete block design with two replicates. Five seedlings were planted for each F2:3 line in each replicate.
For each plant, leaf vein color (LVC) was observed from the adaxial side of a mature leaf at the stage when the second inflorescence on the main stem began to flower, measured as 0 (green), 1 (light purple), or 2 (purple); fruit pericarp color (FPC) was investigated from the fruits produced by the second inflorescence on the main stem at the stage of horticultural maturity, measured as 0 (white), 1 (greenish white), 2 (light green), 3 (green), 4 (dark green), 5 (purple), 6 (brown), or 7 (dark brown).

4.5. QTL Analysis and Candidate Gene Prediction

Mapping of QTLs underlying the color traits in the F2:3 population was performed based on the bin genotype data and the genetic map using the software QTL IciMapping v4.2 [59]. The LOD threshold at the overall (genome-wise) significance level of 0.05 was estimated by permutation test with 1000 replicates. Genome-wide association study (GWAS) was conducted to verify the mapped QTLs using the marker (SNP and short InDel) data and the method of mixed linear model (MLM) implemented by the R program rMVP [47]. Using the genome sequencing data of the two parents, possible candidate genes of a major QTL were predicted by examining reliable and effective coding sequence variations between the two parents within the QTL’s bin interval based on the following criteria: (1) the polymorphic site was homozygous with sufficient sequencing depth (counts ≥ 6) in both parents; (2) the variation could cause protein change between the two parents; and (3) the genotype of the polymorphic site in YZQ (or YQ) was the same as (or different from) that in the reference genome of HQ-1315, which is similar to YZQ in anthocyanin pigmentation [54].

5. Conclusions

In this study, a high-density recombination bin-based genetic map of eggplant was constructed based on an F2 population derived from an interspecific cross (S. melongena × S. incanum) using the whole genome resequencing strategy and was then applied to QTL analysis for anthocyanin pigmentation in leaf vein and fruit pericarp. A major QTL with pleiotropic effects on the two traits was steadily mapped in a small region on chromosome E10 in two cropping seasons, and a possible candidate gene of the QTL was identified. In addition, some mistakes of genome sequence assembly in the reference genome were found and corrected in the genetic map construction. Our results demonstrated the merit of high-density genetic map for high-resolution QTL mapping as well as for discovering and correcting the mistakes in genome sequence assembly.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms231810258/s1.

Author Contributions

Conceptualization, W.W. and Y.Z.; methodology, W.G., W.W. and Y.Z.; software, W.T., J.J., and J.X.; formal analysis, W.G., C.K. and X.X.; investigation, W.G., C.K., M.Y. and C.D.; data curation, W.G., C.K., M.Y. and C.D.; writing—original draft preparation, W.G., W.W. and Y.Z.; writing—review and editing, W.W. and Y.Z.; supervision, W.W. and Y.Z.; funding acquisition, Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Natural Science Foundation of Fujian Province (CN) (2020I0009) and Special fund project for scientific and technological innovation of Fujian agriculture and Forestry University (CN) (CX2X2020048A).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank the Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences for their kind providing the eggplant seeds of two parents.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Miller, M.R.; Dunham, J.P.; Amores, A.; Cresko, W.A.; Johnson, E.A. Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers. Genome Res. 2007, 17, 240–248. [Google Scholar] [CrossRef] [PubMed]
  2. Baird, N.A.; Etter, P.D.; Atwood, T.S.; Currey, M.C.; Shiver, A.L.; Lewis, Z.A.; Selker, E.U.; Cresko, W.A.; Johnson, E.A. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 2008, 3, e3376. [Google Scholar] [CrossRef] [PubMed]
  3. Elshire, R.J.; Glaubitz, J.C.; Sun, Q.; Poland, J.A.; Kawamoto, K.; Buckler, E.S.; Mitchell, S.E. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 2011, 6, e19379. [Google Scholar] [CrossRef] [PubMed]
  4. Sun, X.W.; Liu, D.Y.; Zhang, X.F.; Li, W.B.; Liu, H.; Hong, W.G.; Jiang, C.B.; Guan, N.; Ma, C.X.; Zeng, H.P. SLAF-seq: An efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS ONE 2013, 8, e58700. [Google Scholar] [CrossRef] [PubMed]
  5. Qi, Z.; Huang, L.; Zhu, R.; Xin, D.; Liu, C.; Han, X.; Jiang, H.; Hong, W.; Hu, G.; Zheng, H.; et al. A High-density genetic map for soybean based on specific length amplified fragment sequencing. PLoS ONE 2014, 9, e104871. [Google Scholar] [CrossRef] [PubMed]
  6. Xu, X.; Xu, R.; Zhu, B.; Yu, T.; Qu, W.; Lu, L.; Xu, Q.; Qi, X.; Chen, X. A high-density genetic map of cucumber derived from specific length amplified fragment sequencing (SLAF-seq). Front. Plant Sci. 2015, 5, 768. [Google Scholar] [CrossRef]
  7. Kujur, A.; Upadhyaya, H.D.; Shree, T.; Bajaj, D.; Das, S.; Saxena, S.M.; Badoni, S.; Kumar, V.; Tripathi, S.; Gowda, C.L.L.; et al. Ultra-high density intra-specific genetic linkage maps accelerate identification of functionally relevant molecular tags governing important agronomic traits in chickpea. Sci. Rep. 2015, 5, 9468. [Google Scholar] [CrossRef]
  8. Holtz, Y.; Ardisson, M.; Ranwez, V.; Besnard, A.; Leroy, P.; Poux, G.; Roumet, P.; Viader, V.; Santoni, S.; David, J. Genotyping by sequencing using specific allelic capture to build a high-density genetic map of durum wheat. PLoS ONE 2016, 11, e0154609. [Google Scholar] [CrossRef]
  9. Leon, T.B.D.; Linscombe, S.; Subudhi, P.K. Molecular dissection of seedling salinity tolerance in rice (Oryza sativa L.) using a high-density GBS-based SNP linkage map. Rice 2016, 9, 52. [Google Scholar] [CrossRef]
  10. Wen, Y.; Fang, Y.; Hu, P.; Tan, Y.; Wang, Y.; Hou, L.; Deng, X.; Wu, H.; Zhu, L.; Zhu, L.; et al. Construction of a high-density genetic map based on SLAF markers and QTL analysis of leaf size in rice. Front. Plant Sci. 2020, 11, 1143. [Google Scholar] [CrossRef]
  11. Wang, J.; Su, K.; Guo, Y.; Xing, H.; Zhao, Y.; Liu, Z.; Li, K.; Guo, X. Construction of a high-density genetic map for grape using specific length amplified fragment (SLAF) sequencing. PLoS ONE 2017, 12, e0181728. [Google Scholar] [CrossRef]
  12. Li, G.H.; Chen, H.C.; Liu, J.L.; Luo, W.L.; Xie, D.S.; Luo, S.B.; Wu, T.Q.; Akram, W.; Zhong, Y.J. A high-density genetic map developed by specific-locus amplified fragment (SLAF) sequencing and identification of a locus controlling anthocyanin pigmentation in stalk of Zicaitai (Brassica rapa L. ssp. chinensis var. purpurea). BMC Genom. 2019, 21, 343. [Google Scholar] [CrossRef] [PubMed]
  13. Gutierrez-Gonzalez, J.J.; Mascher, M.; Poland, J.; Muehlbauer, G.J. Dense genotyping-by-sequencing linkage maps of two Synthetic W7984×Opata reference populations provide insights into wheat structural diversity. Sci. Rep. 2019, 9, 1793. [Google Scholar] [CrossRef] [PubMed]
  14. Yu, X.; Zhang, M.; Yu, Z.; Yang, D.; Li, J.; Wu, G.; Li, J. An SNP-based high-density genetic linkage map for tetraploid potato using specific length amplified fragment sequencing (SLAF-seq) technology. Agronomy 2020, 10, 114. [Google Scholar] [CrossRef]
  15. Huang, X.; Feng, Q.; Qian, Q.; Zhao, Q.; Wang, L.; Wang, A.; Guan, J.; Fan, D.; Weng, Q.; Huang, T.; et al. High-throughput genotyping by whole-genome resequencing. Genome Res. 2009, 19, 1068–1076. [Google Scholar] [CrossRef] [PubMed]
  16. Xie, W.; Feng, Q.; Yu, H.; Huang, X.; Zhao, Q.; Xing, Y.; Yu, S.; Han, B.; Zhang, Q. Parent-independent genotyping for constructing an ultrahigh-density linkage map based on population sequencing. Proc. Natl. Acad. Sci. USA 2010, 107, 10578–10583. [Google Scholar] [CrossRef] [PubMed]
  17. Xu, X.; Zeng, L.; Tao, Y.; Vuong, T.; Wan, J.; Boerma, R.; Noe, J.; Li, Z.; Finnerty, S.; Pathan, S.M.; et al. Pinpointing genes underlying the quantitative trait loci for root-knot nematode resistance in palaeopolyploid soybean by whole genome resequencing. Proc. Natl. Acad. Sci. USA 2013, 110, 13469–13474. [Google Scholar] [CrossRef]
  18. Liu, H.; Niu, Y.; Gonzalez-Portilla, P.J.; Zhou, H.; Wang, L.; Zou, T.; Qin, C.; Tai, S.; Jansen, C.; Shen, Y.; et al. An ultra-high-density map as a community resource for discerning the genetic basis of quantitative traits in maize. BMC Genom. 2015, 16, 1078. [Google Scholar] [CrossRef]
  19. Han, K.; Jeong, H.J.; Yang, H.B.; Kang, S.M.; Kwon, J.K.; Kim, S.; Choi, D.; Kang, B.C. An ultra-high-density bin map facilitates high-throughput QTL mapping of horticultural traits in pepper (Capsicum annuum). DNA Res. 2016, 23, 81–91. [Google Scholar] [CrossRef]
  20. Hu, Z.; Deng, G.; Mou, H.; Xu, Y.; Chen, L.; Yang, J.; Zhang, M. A re-sequencing-based ultra-dense genetic map reveals a gummy stem blight resistance-associated gene in Cucumis melo. DNA Res. 2018, 25, 1–10. [Google Scholar] [CrossRef] [Green Version]
  21. Agarwal, G.; Clevenger, J.; Kale, S.M.; Wang, H.; Pandey, M.K.; Choudhary, D.; Yan, M.; Wang, X.; Culbreath, A.K.; Holbrook, C.C.; et al. A recombination bin-map identified a major QTL for resistance to Tomato Spotted Wilt Virus in peanut (Arachis hypogaea). Sci. Rep. 2019, 9, 18246. [Google Scholar] [CrossRef] [PubMed]
  22. Luo, X.; Xu, L.; Wang, Y.; Dong, J.; Chen, Y.; Tang, M.; Fan, L.; Zhu, Y.; Liu, L. An ultra-high-density genetic map provides insights into genome synteny, recombination landscape and taproot skin colour in radish (Raphanus sativus L.). Plant Biotechnol. J. 2020, 18, 274–286. [Google Scholar] [CrossRef]
  23. Barchi, L.; Lanteri, S.; Portis, E.; Valè, G.; Volante, A.; Pulcin, L.; Ciriaci, T.; Acciarri, N.; Barbierato, V.; Toppino, L.; et al. A RAD tag derived marker based eggplant linkage map and the location of QTLs determining anthocyanin pigmentation. PLoS ONE 2012, 7, e43740. [Google Scholar] [CrossRef] [PubMed]
  24. Doganlar, S.; Frary, A.; Daynay, M.C.; Lester, R.N.; Tanksley, S.D. A comparative genetic linkage map of eggplant (Solanum melongena) and its implications for genome evolution in the Solanaceae. Genetics 2002, 161, 1697–1711. [Google Scholar] [CrossRef] [PubMed]
  25. Frary, A.; Doganlar, S.; Daunay, M.C.; Tanksley, S.D. QTL analysis of morphological traits in eggplant and implications for conservation of gene function during evolution of solanaceous species. Theor. Appl. Genet. 2003, 107, 359–370. [Google Scholar] [CrossRef]
  26. Frary, A.; Frary, A.; Daunay, M.C.; Huvenaars, K.; Mank, R.; Doğanlar, S. QTL hotspots in eggplant (Solanum melongena) detected with a high resolution map and CIM analysis. Euphytica 2014, 197, 211–228. [Google Scholar] [CrossRef]
  27. Fukuoka, H.; Miyatake, K.; Nunome, T.; Negoro, S.; Shirasawa, K.; Isobe, S.; Asamizu, E.; Yamaguchi, H.; Ohyama, A. Development of gene-based markers and construction of an integrated linkage map in eggplant by using Solanum orthologous (SOL) gene sets. Theor. Appl. Genet. 2012, 125, 47–56. [Google Scholar] [CrossRef]
  28. Gramazio, P.; Prohens, J.; Plazas, M.; Andújar, I.; Herraiz, F.J.; Castillo, E.; Knapp, S.; Meyer, R.S.; Vilanova, S. Location of chlorogenic acid biosynthesis pathway and polyphenol oxidase genes in a new interspecific anchored linkage map of eggplant. BMC Plant Biol. 2014, 14, 350. [Google Scholar] [CrossRef]
  29. Miyatake, K.; Saito, T.; Negoro, S.; Yamaguchi, H.; Nunome, T.; Ohyama, A.; Fukuoka, H. Development of selective markers linked to a major QTL for parthenocarpy in eggplant (Solanum melongena L.). Theor. Appl. Genet. 2012, 124, 1403–1413. [Google Scholar] [CrossRef]
  30. Nunome, T.; Ishiguro, K.; Yoshida, T.; Hirai, M. Mapping of fruit shape and color development traits in eggplant (Solanum melongena L.) based on RAPD and AFLP markers. Breed. Sci. 2001, 51, 19–26. [Google Scholar] [CrossRef] [Green Version]
  31. Nunome, T.; Negoro, S.; Kono, I.; Kanamori, H.; Miyatake, K.; Yamaguchi, H.; Ohyama, A.; Fukuoka, H. Development of SSR markers derived from SSR-enriched genomic library of eggplant (Solanum melongena L.). Theor. Appl. Genet. 2009, 119, 1143–1153. [Google Scholar] [CrossRef] [PubMed]
  32. Salgon, S.; Raynal, M.; Lebon, S.; Baptiste, J.M.; Daunay, M.C.; Dintinger, J.; Jourda, C. Genotyping by sequencing highlights a polygenic resistance to Ralstonia pseudosolanacearum in eggplant (Solanum melongena L.). Int. J. Mol. Sci. 2018, 19, 357. [Google Scholar] [CrossRef] [PubMed]
  33. Wei, Q.; Wang, W.; Hu, T.; Hu, H.; Wang, J.; Bao, C. Construction of a SNP-based genetic map using SLAF-seq and in eggplant. Front. Genet. 2020, 11, 178. [Google Scholar] [CrossRef] [PubMed]
  34. Toppino, L.; Barchi, L.; Mercati, F.; Acciarri, N.; Perrone, D.; Martina, M.; Gattolin, S.; Sala, T.; Fadda, S.; Mauceri, A.; et al. A new intra-specific and high-resolution genetic map of eggplant based on a RIL population, and location of QTLs related to plant anthocyanin pigmentation and seed vigour. Genes 2020, 11, 745. [Google Scholar] [CrossRef]
  35. Qian, Z.; Zhang, B.; Chen, H.; Lu, L.; Duan, M.; Zhou, J.; Cui, Y.; Li, D. Identification of quantitative trait loci controlling the development of prickles in eggplant by genome re-sequencing analysis. Front. Plant Sci. 2021, 12, 731079. [Google Scholar] [CrossRef]
  36. Mattioli, R.; Francioso, A.; Mosca, L.; Silva, P. Anthocyanins: A comprehensive review of their chemical properties and health effects on cardiovascular and neurodegenerative diseases. Molecules 2020, 25, 3809. [Google Scholar] [CrossRef]
  37. Cericola, F.; Portis, E.; Lanteri, S.; Toppino, L.; Barchi, L.; Acciarri, N.; Pulcini, L.; Sala, T.; Rotino, G.L. Linkage disequilibrium and genome-wide association analysis for anthocyanin pigmentation and fruit color in eggplant. BMC Genom. 2014, 15, 896. [Google Scholar] [CrossRef]
  38. Mangino, G.; Arrones, A.; Plazas, M.; Pook, T.; Prohens, J.; Gramazio, P.; Vilanova, S. Newly developed MAGIC population allows identification of strong associations and candidate genes for anthocyanin pigmentation in eggplant. Front. Plant Sci. 2022, 13, 847789. [Google Scholar] [CrossRef]
  39. Li, J.; Ren, L.; Gao, Z.; Jiang, M.; Liu, Y.; Zhou, L.; He, Y.; Chen, H. Combined transcriptomic and proteomic analysis constructs a new model for light-induced anthocyanin biosynthesis in eggplant (Solanum melongena L.). Plant Cell Environ. 2017, 40, 3069–3087. [Google Scholar] [CrossRef]
  40. Li, J.; He, Y.J.; Zhou, L.; Liu, Y.; Jiang, M.; Ren, L.; Chen, H. Transcriptome profiling of genes related to light-induced anthocyanin biosynthesis in eggplant (Solanum melongena L.) before purple color becomes evident. BMC Genom. 2018, 19, 201. [Google Scholar] [CrossRef] [Green Version]
  41. Zhang, S.; Zhang, A.; Wu, X.; Zhu, Z.; Yang, Z.; Zhu, Y.; Zha, D. Transcriptome analysis revealed expression of genes related to anthocyanin biosynthesis in eggplant (Solanum melongena L.) under high-temperature stress. BMC Plant Biol. 2019, 19, 387. [Google Scholar] [CrossRef] [PubMed]
  42. Moglia, A.; Florio, F.E.; Iacopino, S.; Guerrieri, A.; Milani, A.M.; Comino, C.; Barchi, L.; Marengo, A.; Cagliero, C.; Rubiolo, P.; et al. Identification of a new R3 MYB type repressor and functional characterization of the members of the MBW transcriptional complex involved in anthocyanin biosynthesis in eggplant (S. melongena L.). PLoS ONE 2020, 15, e0232986. [Google Scholar]
  43. Florio, F.E.; Gattolin, S.; Toppino, L.; Bassolino, L.; Fibiani, M.; Scalzo, R.L.; Rotino, G.L. A SmelAAT acyltransferase variant causes a major difference in eggplant (Solanum melongena L.) peel anthocyanin composition. Int. J. Mol. Sci. 2021, 22, 9174. [Google Scholar] [CrossRef] [PubMed]
  44. He, Y.; Li, D.; Li, S.; Liu, Y.; Chen, H. SmBICs inhibit anthocyanin biosynthesis in eggplant (Solanum melongena L.). Plant Cell Physiol. 2021, 62, 1001–1011. [Google Scholar] [CrossRef] [PubMed]
  45. Xi, H.; He, Y.; Chen, H. Functional characterization of SmbHLH13 in anthocyanin biosynthesis and flowering in eggplant. Hort. Plant J. 2021, 1, 73–80. [Google Scholar] [CrossRef]
  46. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
  47. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, S.; Li, X.; et al. rMVP: A memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genom. Proteom. Bioinform. 2021, 19, 619–628. [Google Scholar] [CrossRef]
  48. Dong, W.; Niu, L.; Gu, J.; Gao, F. Isolation of a WD40-repeat gene regulating anthocyanin biosynthesis in storage roots of purple-fleshed sweet potato. Acta Physiol. Plant. 2014, 36, 1123–1132. [Google Scholar] [CrossRef]
  49. Jia, N.; Wang, J.; Wang, Y.; Ye, W.; Liu, J.; Jiang, J.; Sun, J.; Yan, P.; Wang, P.; Wang, F.; et al. The light-induced WD40-repeat transcription factor DcTTG1 regulates anthocyanin biosynthesis in Dendrobium candidum. Front. Plant Sci. 2021, 12, 633333. [Google Scholar] [CrossRef]
  50. Lloyd, A.; Brockman, A.; Aguirre, L.; Campbell, A.; Bean, A.; Cantero, A.; Gonzalez, A. Advances in the MYB–bHLH–WD repeat (MBW) pigment regulatory model: Addition of a WRKY factor and co-option of an anthocyanin MYB for betalain regulation. Plant Cell Physiol. 2017, 58, 1431–1441. [Google Scholar] [CrossRef]
  51. Pang, Y.; Wenger, J.P.; Saathoff, K.; Peel, G.J.; Wen, J.; Huhman, D.; Allen, S.N.; Tang, Y.; Cheng, X.; Tadege, M.; et al. A WD40 repeat protein from Medicago truncatula is necessary for tissue-specific anthocyanin and proanthocyanidin biosynthesis but not for trichome development. Plant Physiol. 2009, 151, 1114–1129. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  52. Sompornpailin, K.; Makita, Y.; Yamazaki, M.; Saito, K. A WD-repeat-containing putative regulatory protein in anthocyanin biosynthesis in Perilla frutescens. Plant Mol. Biol. 2002, 50, 485–495. [Google Scholar] [CrossRef] [PubMed]
  53. Barchi, L.; Pietrella, M.; Venturini, L.; Minio, A.; Topino, L.; Acquadro, A.; Andolfo, G.; Aprea, G.; Avanzato, C.; Bassolino, L.; et al. A chromosome-anchored eggplant genome sequence reveals key events in Solanaceae evolution. Sci. Rep. 2019, 9, 11769. [Google Scholar] [CrossRef]
  54. Wei, Q.; Wang, J.; Wang, W.; Hu, T.; Hu, H.; Bao, C. A high-quality chromosome-level genome assembly reveals genetics for important traits in eggplant. Hortic. Res. 2020, 7, 153. [Google Scholar] [CrossRef] [PubMed]
  55. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  56. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
  57. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Drubin, R.; 1000 Genome Project Data Processing Subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef]
  58. Garrison, E.; Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv 2012, arXiv:1207.3907. [Google Scholar]
  59. Meng, L.; Li, H.; Zhang, L.; Wang, J. QTL IciMapping: Integrated software for genetic linkage map construction and quantitative trait locus mapping in biparental populations. Crop J. 2015, 3, 269–283. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Characteristics of the markers and the F2 population: (a) distribution of marker density (number of markers per Mb) in each chromosome. The color scale is on the right; (b) distribution of minor allele frequency (MAF) of the markers in the F2 population; and (c) distribution of relatedness score between F2 individuals.
Figure 1. Characteristics of the markers and the F2 population: (a) distribution of marker density (number of markers per Mb) in each chromosome. The color scale is on the right; (b) distribution of minor allele frequency (MAF) of the markers in the F2 population; and (c) distribution of relatedness score between F2 individuals.
Ijms 23 10258 g001
Figure 2. High-density recombination bin-based genetic map constructed in eggplant. The position of each bin is indicated by a horizontal line. The genetic distance scale in cM is shown on the left. Chromosome E04 was divided into three linkage groups (4a, 4b and 4c).
Figure 2. High-density recombination bin-based genetic map constructed in eggplant. The position of each bin is indicated by a horizontal line. The genetic distance scale in cM is shown on the left. Chromosome E04 was divided into three linkage groups (4a, 4b and 4c).
Ijms 23 10258 g002
Figure 3. Relationship of the genetic positions with the physical positions of the bins in each chromosome. Each circle indicates a bin. The physical position of a bin is represented by the center of the bin. The vertical dashed lines divide a chromosome into three regions. Arrows indicate the possible hotspots of genetic recombination in region II. For chromosome E04, the middle gray part is LG4b, while the left and right black parts are LG4a and LG4c, respectively. The blank arrowheads indicate the breakpoints between linkage groups.
Figure 3. Relationship of the genetic positions with the physical positions of the bins in each chromosome. Each circle indicates a bin. The physical position of a bin is represented by the center of the bin. The vertical dashed lines divide a chromosome into three regions. Arrows indicate the possible hotspots of genetic recombination in region II. For chromosome E04, the middle gray part is LG4b, while the left and right black parts are LG4a and LG4c, respectively. The blank arrowheads indicate the breakpoints between linkage groups.
Ijms 23 10258 g003
Figure 4. Leaves (a) and fruits (b) of YZQ, F1 and YQ. Bar = 3 cm in (a) and 5 cm in (b).
Figure 4. Leaves (a) and fruits (b) of YZQ, F1 and YQ. Bar = 3 cm in (a) and 5 cm in (b).
Ijms 23 10258 g004
Table 1. Information of the recombination bin-based genetic map constructed in eggplant.
Table 1. Information of the recombination bin-based genetic map constructed in eggplant.
LG aLength (cM)No. of Bins bSpacing (cM) cSpan (Mb) dFull (Mb) e% f
1239.4484 (474) 0.50 (0.51)106.63106.6499.99
2139.1298 (282)0.47 (0.50)64.6675.4285.73
3210.3375 (364)0.56 (0.58)96.7196.71100.00
4a+4b+bc160.8281 (259)0.58 (0.62)79.1680.2898.60
4a39.279 (78)0.50 (0.51)5.085.3395.31
4b80.4104 (83)0.78 (0.98)67.9267.92100.00
4c41.298 (98)0.42 (0.42)6.167.0387.62
5123.5212 (209)0.59 (0.59)80.7381.5998.95
6174.9368 (358)0.48 (0.49)89.3789.6899.65
7226.9338 (334)0.67 (0.68)106.41106.7999.64
8138.4249 (241)0.56 (0.58)86.6286.8399.76
9136.9314 (298)0.44 (0.46)89.1489.6499.44
10121.7244 (224)0.50 (0.55)84.1784.17100.00
11170.9348 (339)0.49 (0.51)101.22101.22100.00
12154.3265 (262)0.58 (0.59)73.9874.1799.74
Total2022.83776 (3644)0.54 (0.56)1058.801073.1498.66
Note: a. LG, linkage group, which is numbered in accordance with the corresponding chromosome. b. Number of bins or effective bins (in parenthesis). c. Average distance between adjacent bins or effective bins (in parenthesis). d. Physical span from the first bin to the last bin. e. Full physical length of the chromosome or chromosomal segment. f. Ratio of physical span to full physical length.
Table 2. Average ratio of genetic distance to physical distance (cM/Mb) in different regions and the whole span of the genetic map on each eggplant chromosome.
Table 2. Average ratio of genetic distance to physical distance (cM/Mb) in different regions and the whole span of the genetic map on each eggplant chromosome.
ChromosomeRegion IRegion IIRegion IIIWhole
E014.530.547.312.25
E02-0.654.212.15
E033.120.335.382.17
E045.900.325.902.03
E055.600.075.981.53
E065.800.122.841.96
E0710.291.275.122.13
E084.130.133.641.60
E095.030.227.311.54
E105.490.197.421.45
E115.890.357.681.69
E126.660.206.522.09
Note: For chromosome E04, the average RGPs in region I and region III were estimated based on LG4a+LG4b and LG4b+LG4c, respectively, while that in the whole chromosome was estimated based on LG4a+LG4b+LG4c.
Table 3. Mapped QTLs for leaf vein color (LVC) and fruit pericarp color (FPC) in eggplant.
Table 3. Mapped QTLs for leaf vein color (LVC) and fruit pericarp color (FPC) in eggplant.
QTLSeason aChr.Pos. (cM)Bin interval (kb) bLODTLOD cAdd dDom ePVE (%) f
qLVC41LG4a33B04_76-B04_77
(4070–4120)
4.144.03−0.1670.0003.39
qLVC101E1063B10_240-B10_24941.064.03−0.7100.08956.62
2E1063B10_240-B10_24948.274.17−0.7460.00172.64
1+2E1063B10_240-B10_249
(63,480–65,160)
−0.7280.04564.63
qLVC111E112B11_7-B11_8
(450–660)
4.314.030.1680.0553.46
qLVC121E1242B12_106-B12_108
(4110–4130)
4.344.170.0260.2403.50
qFPC61E0634B06_85-B06_88
(2590–2640)
6.874.05−0.376−0.0435.97
qFPC10a1E106B10_18-B10_19
(1190–1220)
4.464.05−0.302−0.0543.92
qFPC10b1E1063B10_240-B10_24937.774.05−1.1340.08551.72
2E1063B10_240-B10_24938.324.05−1.2260.04958.94
1+2E1063B10_240-B10_249
(63,480–65,160)
−1.1800.06755.33
Note: a. 1+2 means the estimates obtained in the two seasons were either averaged or combined. b. The data in the parenthesis indicate the physical interval (in kb) between the start point of the left bin and the end point of the right bin. c. LOD threshold at the overall significance level of 0.05 estimated by permutation tests. d. Additive effect. The negative sign indicates that the allele from the wild parent YQ decreased the trait value. e. Dominance effect. f. Percentage of phenotypic variance explained by the QTL.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Guan, W.; Ke, C.; Tang, W.; Jiang, J.; Xia, J.; Xie, X.; Yang, M.; Duan, C.; Wu, W.; Zheng, Y. Construction of a High-Density Recombination Bin-Based Genetic Map Facilitates High-Resolution Mapping of a Major QTL Underlying Anthocyanin Pigmentation in Eggplant. Int. J. Mol. Sci. 2022, 23, 10258. https://doi.org/10.3390/ijms231810258

AMA Style

Guan W, Ke C, Tang W, Jiang J, Xia J, Xie X, Yang M, Duan C, Wu W, Zheng Y. Construction of a High-Density Recombination Bin-Based Genetic Map Facilitates High-Resolution Mapping of a Major QTL Underlying Anthocyanin Pigmentation in Eggplant. International Journal of Molecular Sciences. 2022; 23(18):10258. https://doi.org/10.3390/ijms231810258

Chicago/Turabian Style

Guan, Wenxiang, Changjiao Ke, Weiqi Tang, Jialong Jiang, Jing Xia, Xiaofang Xie, Mei Yang, Chenfeng Duan, Weiren Wu, and Yan Zheng. 2022. "Construction of a High-Density Recombination Bin-Based Genetic Map Facilitates High-Resolution Mapping of a Major QTL Underlying Anthocyanin Pigmentation in Eggplant" International Journal of Molecular Sciences 23, no. 18: 10258. https://doi.org/10.3390/ijms231810258

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop