Next Article in Journal
Impact of Soil Management Practices on Soil Culturable Bacteriota and Species Diversity in Central European a Productive Vineyard under Warm and Dry Conditions
Next Article in Special Issue
Unified Assembly of Chloroplast Genomes: A Comparative Study of Grapes Representing Global Geographic Diversity
Previous Article in Journal
Resistances and Physiological Responses of Impatiens uliginosa to Copper Stress
Previous Article in Special Issue
Cytogenetics Study of Four Edible and Ornamental Zingiber Species (Zingiberaceae) from Thailand
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

NGS-Based Multi-Allelic InDel Genotyping and Fingerprinting Facilitate Genetic Discrimination in Grapevine (Vitis vinifera L.)

1
State Key Laboratory of Vegetable Biobreeding, Tianjin Academy of Agricultural Sciences, Tianjin 300192, China
2
College of Life Sciences, Nankai University, Tianjin 300071, China
3
National Key Laboratory of Crop Genetic Improvement, National Center of Oil Crop Improvement, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
4
Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou 450009, China
5
Tianjin Agricultural Development Service Center, Tianjin 300061, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Horticulturae 2024, 10(7), 752; https://doi.org/10.3390/horticulturae10070752
Submission received: 14 June 2024 / Revised: 9 July 2024 / Accepted: 12 July 2024 / Published: 16 July 2024
(This article belongs to the Special Issue Genetics and Molecular Breeding of Fruit Tree Species)

Abstract

:
Molecular markers play a crucial role in marker-assisted breeding and varietal identification. However, the application of insertion/deletion markers (InDels) in grapevines has been limited by the low throughput and separability of gel electrophoresis. To developed effective InDel markers for grapevines, this study reports a novel, effective and high-throughput pipeline for InDel marker development and identification. After rigorous filtering, 11 polymorphic multi-allelic InDel markers were selected. These markers were then used to perform genetic identification of 123 elite grape cultivars using agarose gel electrophoresis and next-generation sequencing (NGS). The polymorphism rate of the InDel markers identified by gels was 37.92%, while the NGS-based results demonstrated a higher polymorphism rate of 61.12%. Finally, the NGS-based fingerprints successfully distinguished 122 grape varieties (99.19%), surpassing the gels, which could distinguish 116 grape varieties (94.31%). Specifically, we constructed phylogenetic trees based on the genotyping results from both gels and NGS. The population structure revealed by the NGS-based markers displayed three primary clusters, consisting of the patterns of the evolutionary divergence and geographical origin of the grapevines. Our work provides an efficient workflow for multi-allelic InDel marker development and practical tools for the genetic discrimination of grape cultivars.

1. Introduction

The grapevine (V. vinifera L.) is one of the earliest domesticated fruit plants, characterized by the absence of genetic barriers within the genus [1]. Over the past thousands of years, grapevine introduction, breeding and intensive trade have blurred genetic relationships [2]. Moreover, the genetic diversity of grapes gradually decreases while domestication progresses [3,4,5]. Currently, synonyms and homonyms are prevalent in the grape market, which has negative effects on breeding activities and breeders’ rights [6]. Therefore, economical, efficient and accurate variety identification methods are urgently needed to improve breeding efficiency and protect the rights of grape breeders.
DNA molecular markers are crucial for grape varietal identification [7]. Among them, simple sequence repeat (SSR) and amplified fragment length polymorphism (AFLP) are widely used in grapevine identification. The use of 20 SSR markers helped identify three genetic populations among 1378 varieties [8]. The genetic variability in ‘Sangiovese’, ‘Sanforte’ and ‘Montepulciano’ grapes was analyzed using multiple molecular markers such as SSR and AFLP [9]. However, the application of SSR markers is hindered by the low throughput and intensive data processing [10].
With the development of sequencing technology, a large number of single nucleotide polymorphism (SNP) and InDel markers were identified. SNP and InDel have become the most promising markers in genetic research and molecular marker-assisted breeding due to their wide distribution in genomes and suitability for high-throughput genotyping [11,12]. Emanuelli et al. [13] employed SNP markers to verify European grapevine cultivars, while Wang et al. [14] used SNP markers to distinguish major grape cultivars in China. Generally, InDels are defined as short insertion or deletion of up to 50 nucleotides at a single locus. Compared to SNPs, InDel markers can be easily detected using gel electrophoresis. In recent years, InDel markers have been applied as fingerprints in various crops, such as rice [15], maize [16], peach [17], apple [18], tomato [19] and cucumber [20]. However, as of now, few InDel markers have been developed for grape-variety fingerprinting.
Multi-allelic InDels refer to variations caused by multiple different sizes of insertion or deletion at one allele within a population. Multi-allelic InDels are common in plants [21]. The frequent selfing and hybridization in plants increase the diversity of alleles, and the relatively high rate of genetic mutations promotes the formation of multi-allelic InDels [22]. Gel electrophoresis has a relatively low resolution and may not be sensitive enough to separate different fragments of multi-allelic InDels [23]. The NGS allows the accurate identification of multi-allelic InDels and facilitates data processing. However, the application of NGS is limited by the high costs and lack of professional bioinformatics analysis tools.
This study established a practical workflow for multi-allelic InDel selection and identification. A total of 11 polymorphic InDel primers were selected from a variant database constructed using the resequencing data of 499 grapevine lines. A total of 123 core germplasms were collected and genotyped using gel electrophoresis and NGS. The polymorphism rates and discriminability of different genotyping strategies highlight the advantages of NGS-based methods. The fingerprints constructed by multi-allelic InDels in this study provide valuable tools for the genetic discrimination of grapes.

2. Materials and Methods

2.1. Materials

Preliminary screening of InDel markers was conducted using 499 whole-genome resequencing data from the public databases (Table S1). In August 2022, 123 young grapevine leaf samples were collected at the Institute of Forestry and Pomology, Tianjin Academy of Agricultural Sciences (Table S2). These samples exhibit a diverse genetic background and a wide range of phenotypic variations. Healthy and young leaves were collected under cool and dry conditions in the early morning. The collected leaf samples were homogenized with a 4 mm steel ball in a Retsch MM 400 Mixer Mill after chilling in liquid nitrogen. The total genomic DNA of all samples was extracted using the Plant Genomic Extraction Kit (Tiangen Biotech, Beijing, China). Subsequently, DNA concentration and purity were assessed using a NanoDrop™ 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). Qualified DNA samples were then stored at −20 °C.

2.2. Screening of InDels Markers

Initially, grape resequencing data were downloaded to construct a variant database, followed by quality control of the raw data. Fastp software (v.0.23.4) was used for data quality assessment and preprocessing of the sequencing data [24]. Subsequently, the pre-processed sequencing data were aligned to the reference genome using the Burrows–Wheeler aligner (BWA, v.0.7.17) to generate bam files [25]. SAMtools (v.1.14) was utilized for sorting, indexing, filtering and statistical analysis of these files [26]. Following this, the Genome Analysis Toolkit (GATK, v.4.0.10.0) was employed to detect InDels, and programming was utilized to generate a final Variant Call Format (VCF) file, containing information about variations such as InDel locations, types, quality and their distribution across different samples [27].
High-quality polymorphic InDel sites were obtained by further filtering using the following criteria: (i) InDels are not on repeat regions in the genome, (ii) multi-allelic InDels, and (iii) the size of an InDel is more than 20 bp. Additionally, the length of the PCR product of the InDel was required to be within the range of 150–300 bp to enable separation by agarose gel electrophoresis. Genetic diversity parameters such as polymorphic information content (PIC), heterozygosity (Het), minor allele frequency (MAF) and gene diversity index (GDI) were calculated using the Python 3.12.4 script (https://github.com/Lvmingjie/indel-seq, accessed on 16 May 2024).

2.3. InDel Primers Design and Selection

Primer design was carried out using Primer3 software [28]. The 100 bp upstream and downstream sequences of each InDel were extracted. The primer design parameters were set as follows: (1) GC content less than 60%; (2) primer size between 18 bp and 22 bp; (3) melting temperature (Tm) between 55 °C and 62 °C. Primer sequences that amplify unique loci were selected using e-pcr software (v.2.3.12) [29]. Subsequently, the genotype distribution data were analyzed, and 24 primer pairs exhibiting an even genotype distribution were chosen for further experiments.

2.4. InDel Genotyping

A total of 24 pairs of synthesized primers were screened using PCR amplification and agarose gel electrophoresis (Table S3). Based on the gel results, the core markers were selected based on the following principles: multiple and clear bands, insertion or deletion sizes over 20 bp and tendency to be located on different chromosomes. Finally, eleven core primers were chosen for the subsequent amplification of DNA from 123 grape samples (Table S3). PCR amplification was conducted in a 25 μL reaction volume containing 2 μL DNA (50 ng/μL), 0.25 μL forward primer (10 μM), 0.25 μL reverse primer (10 μM), 12.5 μL 2x Taq Master Mix and 10 μL ddH2O. The amplification process was carried out using an Applied Biosystems Veriti Thermal Cycler with the following conditions: 98 °C for 5 min, 35 cycles at 95 °C for 20 s, 56 °C for 20 s, and 72 °C for 20 s, followed by a final extension at 72 °C for 10 min and a concluding step at 4 °C to halt the reaction. Subsequently, 5 μL of the PCR products were separated by 2% agarose gel electrophoresis, and band information was visualized under UV light. The bands on the agarose gel electrophoresis profiles were recorded sequentially from smallest to largest as “1” “2” “3”, with the absence of a band recorded as “0”. For example, a single band was noted as “1_1”, and double bands were noted as “1_2”. This information was used to conduct further genetic analysis.
Since gel electrophoresis was not sufficient to fully discriminate the genotyping results, sequencing was used for further genotyping in this study. Eleven adapter-equipped core primers and universal barcode primers were used for two rounds of amplification. The universal barcode primers and PCR system used in this experiment were based on the system described in the study by Liu et al. [30]. Products from the second round of amplification, originating from distinct individual plants, were then mixed in equal amounts and subjected to 150 bp paired-end NGS. The sequences of each target site were decoded using a custom script. To avoid and exclude sequencing errors, we set filtering requirements: total reads per well 1000; each genotype ≥ 100; Top1 percentage of genotypes per well ≥ 30%; Top2/Top1 percentage of genotypes per well ≥ 30%. This confirms the presence of the genotype in the sample, thereby improving the accuracy of the sequencing results.

2.5. Phylogenetic Analyses

The phylogenetic tree was constructed using FastTree (v2.1.11) [31] and illustrated with FigTree (v1.4.4) (http://tree.bio.ed.ac.uk/software/figtree/, accessed on 10 December 2023). Principal Component Analysis (PCA) was performed using GCTA (v1.940) software [32] and PCA diagrams were generated using the R 4.4.1 language (https://github.com/Lvmingjie/indel-seq/MM_InDel, accessed on 16 May 2024).

3. Results

3.1. Screening of InDels Markers

To select a set of stable and polymorphic multi-allelic InDel markers, we developed a rigorous filtering pipeline (Figure 1A). A total of 126,493,127 InDels were identified using 499 grape resequencing data (Figure 1B). Subsequently, 80 high-quality and highly polymorphic markers were retained based on the following criteria (Figure 1C; Table S2): (1) the number of InDel alleles ≥ 4; (2) InDel lengths between 20 bp and 100 bp; (3) the size of each of the two types ≥ 20 cp; (4) no repeats within 100 bp upstream and downstream of the InDel site; (5) missing rate ≤ 10%; and (6) maximum genotype frequency ≤ 50%. Based on the genotype frequencies observed in 499 samples, 24 InDels were selected, with uniformly distributed genotype frequencies across 19 chromosomes. Subsequently, based on the PCR and agarose gel electrophoresis results of 24 InDels, a total of 11 primers, which yielded clear and multiple amplified bands, were selected as the core markers (Figure 1D).

3.2. Genotyping of 123 Grape Lines Using Agarose Gel Electrophoresis

To investigate the genetic discriminability of the 11 core markers. We collected 123 elite grape cultivars with diverse ecotypes and phenotypes in China. PCR and agarose gel electrophoresis were used to analyze the 11 core multi-allelic InDels in the grape cultivars, resulting in a total of 1353 distinct bands on agarose gel electrophoresis, indicating an average polymorphism rate of 37.92% (Figure S1; Table S4). Based on the genotyping results, GDI, Het, MAF and PIC values were calculated for the 11 InDels (Figure 2). The PIC values of the InDels ranged from 0.305 to 0.566, with a mean of 0.407. Most (82%) fell within the range of 0.3 to 0.5. The Het values ranged from 0.359 to 0.611, with a mean of 0.488. The average MAF was 0.306, ranging from 0.210 to 0.411. The average GDI was 0.4882, ranging from 0.359 to 0.611. These results indicate that the 11 candidate multi-allelic InDels exhibited a high level of polymorphism, making them optimal markers for grape fingerprinting.

3.3. NGS-Based Genotyping of 123 Grape Lines

In addition to agarose gel electrophoresis, NGS was performed on all 123 grape samples for further analysis. The genotyping outcomes derived from sequencing can provide richer information on InDels (Table S5). NGS revealed a higher average polymorphism rate of 61.12%. Based on the genotyping results, GDI, Het, MAF and PIC values were calculated for the 11 InDels (Figure 2). The mean values for PIC, GDI and Het of the 11 markers were 0.860, 0.806 and 0.647, respectively, which were significantly higher than those revealed by the gels.

3.4. Discrimination and Fingerprints of 123 Grape Lines

The InDel fingerprints were constructed based on the gels and NGS. Based on the fingerprinting data, heatmaps were used to illustrate the unique genotypes of grapes through distinct colors (Figure 3A,B). Two sets of genotype data from 123 accessions were analyzed to assess the identification efficiency. The NGS-based fingerprints successfully discriminated 122 accessions, achieving an identification efficiency of 99.18% (Figure 3D). The only exception was observed for ‘Maple Leaf Grapes’ and ‘NO.8’, both of which are round leaf grapes. The results based on agarose gel electrophoresis only distinguished 116 grape varieties. These results proved that the NGS method is more efficient than agarose gel electrophoresis in the genetic discrimination of grape varieties (Figure 3C).

3.5. Phylogenetic Analyses of 123 Grapes

Phylogenetic trees of 123 grape varieties were constructed based on the genotyping data of gels and NGS. The NGS-based hierarchical cluster successfully classified the 123 grape samples into three distinct populations (Pop−1, Pop−2, Pop−3). Interestingly, in Pop−1(ID 1–19), 71% of the grapes were rootstock, such as ‘cr1’, ‘cr2’, and ‘5BB♀’. ‘SP137’, ‘ST0998’, and ‘Zhuosexiang16’ were rootstock. Pop−2 (ID 20–64) included 45 species, most of which were V. vinifera × V. labrusca hybrids, 18 were V. vinifera, such as ‘1103♀’ and ‘maple leaf grape’, and three interspecific hybrids within American populations. Among the 59 Pop−3 varieties (ID 65–123), 73% consisted of V. vinifera, while 24% were hybrids of V. vinifera and V. labrusca. Cultivars with relatively close relationships clustered together, such as ‘5BB’, ‘101−14’, ‘Hongfushi’ and ‘Izu_Nishiki’, ‘Tianshan’’ and ‘Shaoxing_1’ (Figure 4C). The clustering diagram indicates that multi-allelic InDel markers can significantly reflect the genetic variation within grapevine populations and exhibit strong varietal differentiation ability. Compared to the gel-based results, the clustering tree constructed based on NGS provides a more comprehensive and accurate differentiation of grape populations (Figure 4A). In addition, the dispersion in PCA subgroups of NGS-based genotyping was clearer than that of gel-based, which coincided with the results of the hierarchical clustering (Figure 4B,D).

4. Discussion

The grape, an important economic fruit crop with a storied history of cultivation and diverse varieties, is ranked as one of the world’s most extensively cultivated fruit crops [33]. The cultivation of V. vinifera extends across diverse geographical regions, facilitating frequent exchanges of cultivars and common interspecific hybridization, which complicates the taxonomy and identification of varieties [34]. InDel molecular markers, known for their polymorphism and genetic stability, are crucial tools for crop genetic improvement, varietal identification and functional genomics research. Their application spans population genetic analysis, molecular breeding and medical diagnostics, demonstrating their potential to augment genetic improvement and germplasm innovation across various species [35,36,37,38]. Despite the successful application of InDel markers in several crops and organisms, a dedicated InDel-based fingerprinting system for grapevines remains elusive. The establishment of an efficient and precise fingerprinting platform is imperative for the identification of grapevine varieties and population genetic analysis.
The rapid development of NGS has provided unprecedented opportunities for genomic research. Although the cost of NGS technology is continuously declining, the cost of high-throughput sequencing is still high, especially in large-scale sequencing projects. Multiplex sequencing is a practical approach to address the issue. By pooling the DNA or RNA of multiple samples, this approach significantly reduces experimental costs, reagent expenses and manpower resources. The Hi-TOM platform is an online platform for the sequencing of multiple samples and multiple target sites. The Hi-TOM strategy markedly conserves time and cost, proving to be more economical and expedient, especially when applied to a large number of samples or loci. To ensure the quality of multiplex sequencing, the demultiplexed method and InDel genotyping strategy are important. By fixing the bridge sequences and barcoding primers, the Hi-TOM tool has high reliability and sensitivity in tracking various mutations, especially complex chimeric mutations, frequently induced by genome editing. Consequently, multiplex sequencing based on the Hi-TOM sequencing platform was chosen for the identification of NGS-based indel marker in this study. To eliminate sequencing errors and enhance the identification accuracy of InDel markers, we developed a comprehensive filtration method for multi-allelic InDels based on several parameters, such as the read depth, heterozygosity, missing rate in population.
Multi-allelic InDels, characterized by the multiple insertions or deletions in single locus, exhibit higher polymorphism than traditional SNPs or InDels. However, the resolution of gel electrophoresis may not be able to identify all events of InDels. The high sensitivity of NGS enables the accurate detection of small insertion or deletion events, making it possible to identify more genotypes of an InDel. In this study, we developed a pipeline, including data quality control, genotype phasing and variety fingerprinting, for the analysis of NGS-based multi-allelic InDels. The NGS-based identification recalled all genotypes obtained from gel electrophoresis and demonstrated higher discrimination power.
Using 11 multi-allelic InDel markers, we performed genotyping of 123 grape cultivars, including Eurasian species, American species and European and American hybrids. The grapes exhibited a high degree of Het, with values ranging from 0.429 to 0.762, and a mean value of 0.647. These values were higher than those reported in previous studies [39]. Due to the polymorphic nature of InDel markers, their PIC values are relatively high, ranging from 0.398 to 0.981. The average PIC value in this study was 0.86, which is higher than the PIC value of 0.38 for SNP markers [40] and 0.33 for broccoli [41], indicating a higher rate of polymorphism of 11 multi-allelic InDels. The average GDI value was 0.806, which is higher than the value of 0.271 observed in winter wheat [42]. Among the InDel markers, 81.82% had GDI values ranging from 0.7 to 1, indicating the higher level of GDI of the core markers. The 11 NGS-based core markers used in this study could discriminate 122 grape varieties, resulting in an identification efficiency of 99.18%. The remaining unseparated ‘Maple Leaf grapes’ and ‘No. 8’ were round-leafed grapes. This study suggests the superiority of NGS over agarose gel electrophoresis in genetically discriminating grape varieties.

5. Conclusions

Molecular markers and fingerprints are crucial in the marker-assisted breeding and varietal identification of grapevines. In this study, we developed a novel, effective and high-throughput pipeline for multi-allelic-InDel selection and identification. Using this pipeline, we developed 11 high polymorphic multi-allelic InDel markers based on 499 resequencing data for the genetic discrimination of grapevines. We performed genetic identification of 123 grape cultivars using agarose gel electrophoresis and NGS. The NGS-based InDel markers processed a higher rate of polymorphism and showed a superior performance in the genetic discrimination of grapes compared to the gel approach. Our work provides a practical workflow for multi-allelic InDel marker development and valuable tools for the genetic discrimination and marker-assisted breeding in grapes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/horticulturae10070752/s1, Figure S1: Agarose gel electrophoresis results of 123 grape samples using 11 core multi-allelic InDel markers; Table S1: Grape resequencing data used in this study; Table S2: 123 grape cultivars used in this study; Table S3: Information for 80 high-quality multi-allelic InDel primers; Table S4: Genotyping results for 123 cultivars based on agarose gel electrophoresis; Table S5: Genotyping results for 123 cultivars based on NGS.

Author Contributions

Conceptualization, M.L. and J.H.; methodology, N.Z. and M.L.; software, G.J.; validation, G.J., M.L., Y.Y., H.Z. (Hong Zhang), Y.G., Q.W. and R.C.; formal analysis, G.J.; investigation, M.L.; resources, J.H., Q.J., J.J., H.Z. (He Zhang) and J.W.; data curation, G.J.; writing—original draft preparation, G.J.; writing—review and editing, G.J. and M.L.; visualization, G.J.; supervision, M.L.; project administration, N.Z.; funding acquisition, J.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Tianjin (23JCYBJC00770 to M.L., 22JCYBJC00190 to Y.Y.), the National Natural Science Foundation of China (32302579 to Y.Y.), the China Agriculture Research System of MOF and MARA (CARS-29-2 to J.H.), the Innovative Research of Seed Industry of Tianjin Academy of Agricultural Science (2022ZYCX014 to J.H.) and the Innovative Research and Experimental Projects for Young Researchers of Tianjin Academy of Agricultural Science (2021002 to N.Z.).

Data Availability Statement

The raw sequence data reported in this paper have been deposited in the Genome Sequence Archive in the National Genomics Data Center of China (GSA: CRA016408) that are publicly accessible at https://ngdc.cncb.ac.cn/gsa (accessed on 11 May 2024). The analysis scripts for this study are deposited in the online repository at https://github.com/Lvmingjie/indel-seq (accessed on 16 May 2024).

Conflicts of Interest

The authors declare that they have no conflicts of interest.

References

  1. Dong, Y.; Duan, S.; Xia, Q.; Liang, Z.; Dong, X.; Margaryan, K.; Musayev, M.; Goryslavets, S.; Zdunić, G.; Bert, P.-F.; et al. Dual domestications and origin of traits in grapevine evolution. Science 2023, 379, 892–901. [Google Scholar] [CrossRef] [PubMed]
  2. Grassi, F.; Gabriella, D.L. Back to the Origins: Background and Perspectives of Grapevine Domestication. Int. J. Mol. Sci. 2021, 22, 4518. [Google Scholar] [CrossRef] [PubMed]
  3. Sabir, A.; Tangolar, S.; Büyükalaca, S.; Kafkas, S. Ampelographic and molecular diversity among grapevine (Vitis spp.) cultivars. Czech J. Genet. Plant Breed. 2009, 45, 160–168. [Google Scholar]
  4. Zhou, Y.; Massonnet, M.; Sanjak, J.S.; Cantu, D.; Gaut, B.S. Evolutionary genomics ofgrape (Vitis vinifera ssp. vinifera) domestication. Proc. Natl. Acad. Sci. USA 2017, 114, 11715–11720. [Google Scholar]
  5. Xiao, H.; Liu, Z.; Wang, N.; Long, Q.; Cao, S.; Huang, G.; Liu, W.; Peng, Y.; Riaz, S.; Walker, A.M.; et al. Adaptive and maladaptive introgression in grapevine domestication. Proc. Natl. Acad. Sci. USA 2023, 120, e2222041120. [Google Scholar] [CrossRef]
  6. Martínez, M.C.; Boso, S.; Gago, P.; Muñoz-Organero, G.; De Andrés, M.T.; Gaforio, L.; Cabello, F.; Santiago, J.L. Value of two Spanish live grapevine collections in the resolution of synonyms, homonyms and naming errors. Aust. J. Grape Wine R. 2018, 24, 430–438. [Google Scholar] [CrossRef]
  7. Tessier, C.; David, J.; This, P.; Boursiquot, J.M.; Charrier, A. Optimization of the choice of molecular markers for varietal identification in Vitis vinifera L. Theor. Appl. Genet. 1999, 98, 171–177. [Google Scholar] [CrossRef]
  8. Shen, X.; Guo, W.; Zhu, X.; Yuan, Y.; Yu, J.Z.; Kohel, R.J.; Zhang, T. Molecular mapping of QTLs for fiber qualities in three diverse lines in Upland cotton using SSR markers. Mol. Breeding 2005, 15, 169–181. [Google Scholar] [CrossRef]
  9. Zombardo, A.; Meneghetti, S.; Morreale, G.; Calò, A.; Costacurta, A.; Storchi, P. Study of inter-and intra-varietal genetic variability in grapevine cultivars. Plants 2022, 11, 397. [Google Scholar] [CrossRef] [PubMed]
  10. Taheri, S.; Abdullah, T.L.; Yusop, M.R.; Hanafi, M.M.; Sahebi, M.; Azizi, P.; Shamshiri, R.R. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules. 2018, 23, 399. [Google Scholar] [CrossRef]
  11. Lv, Y.; Liu, Y.; Zhao, H. mInDel: A high-throughput and efficient pipeline for genome-wide InDel marker development. BMC Genom. 2016, 17, 290. [Google Scholar] [CrossRef]
  12. Rimbert, H.; Darrier, B.; Navarro, J.; Kitt, J.; Choulet, F.; Leveugle, M.; Duarte, J.; Rivière, N.; Eversole, K. High throughput SNP discovery and genotyping in hexaploid wheat. PLoS ONE 2018, 13, e0186329. [Google Scholar] [CrossRef]
  13. Emanuelli, F.; Lorenzi, S.; Grzeskowiak, L.; Catalano, V.; Stefanini, M.; Troggio, M.; Myles, S.; Martinez-Zapater, J.M.; Zyprian, E.; Moreira, F.M.; et al. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape. BMC Plant Biol. 2013, 13, 39. [Google Scholar] [CrossRef] [PubMed]
  14. Wang, F.; Fan, X.; Zhang, Y.; Sun, L.; Liu, C.; Jiang, J. Establishment and application of an SNP molecular identification system for grape cultivars. J. Integr. Agric. 2022, 21, 1044–1057. [Google Scholar] [CrossRef]
  15. Lü, Y.; Cui, X.; Li, R.; Huang, P.; Zong, J.; Yao, D.; Li, G.; Zhang, D.; Yuan, Z. Development of genome-wide insertion/deletion markers in rice based on graphic pipeline platform. J. Integr. Plant Biol. 2015, 57, 980–991. [Google Scholar] [CrossRef]
  16. Liu, Z.; Zhao, Y.; Zhang, Y.; Xu, L.; Zhou, L.; Yang, W.; Zhao, H.; Zhao, J.; Wang, F. Development of Omni InDel and supporting database for maize. Front. Plant Sci. 2023, 14, 1216505. [Google Scholar] [CrossRef]
  17. Guan, L.; Cao, K.; Li, Y.; Guo, J.; Xu, Q.; Wang, L. Detection and application of genome-wide variations in peach for association and genetic relationship analysis. BMC Genet. 2019, 20, 101. [Google Scholar] [CrossRef] [PubMed]
  18. Wang, X.; Shen, F.; Gao, Y.; Wang, K.; Chen, R.; Luo, J.; Yang, L.; Zhang, X.; Qiu, C.; Li, W.; et al. Application of genome-wide insertion/deletion markers on genetic structure analysis and identity signature of Malus accessions. BMC Plant Biol. 2000, 20, 540. [Google Scholar] [CrossRef]
  19. Liu, X.; Geng, X.; Zhang, H.; Shen, H.; Yang, W. Association and genetic identification of loci for four fruit traits in tomato using InDel markers. Front. Plant Sci. 2017, 8, 1269. [Google Scholar] [CrossRef] [PubMed]
  20. Shen, D.; Liu, B.; Qiu, Y.; Zhang, X.; Zhang, Z.; Wang, H.; Li, X.; Li, S. Development and application of cucumber InDel markers based on genome re-sequencing. Plant Genet. Res. 2013, 14, 278–283. [Google Scholar]
  21. Liang, S.; Lin, F.; Qian, Y.; Zhang, T.; Wu, Y.; Qi, Y.; Ren, S.; Ruan, L.; Zhao, H. A cost-effective barcode system for maize genetic discrimination based on bi-allelic InDel markers. Plant Methods 2020, 16, 101. [Google Scholar] [CrossRef] [PubMed]
  22. This, P.; Lacombe, T.; Thomas, M.R. Historical origins and genetic diversity of wine grapes. Trends Genet. 2006, 22, 511–519. [Google Scholar] [CrossRef]
  23. Chung, H.Y.; Won, S.Y.; Kim, Y.K.; Kim, J.S. Development of the chloroplast genome-based InDel markers in Niitaka (Pyrus pyrifolia) and its application. Plant Biotechnol. Rep. 2019, 13, 51–61. [Google Scholar] [CrossRef]
  24. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  25. Li, H.; Durbin, R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  26. Li, H.; Handsaker, B.; Wysoker, A.; Fennell, T.; Ruan, J.; Homer, N.; Marth, G.; Abecasis, G.; Durbin, R. The sequence alignment/map format and SAMtools. Bioinformatics 2009, 25, 2078–2079. [Google Scholar] [CrossRef] [PubMed]
  27. McKenna, A.; Hanna, M.; Banks, E.; Sivachenko, A.; Cibulskis, K.; Kernytsky, A.; Garimella, K.; Altshuler, D.; Gabriel, S.; Daly, M.; et al. The genome analysis toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010, 20, 1297–1303. [Google Scholar] [CrossRef]
  28. Rozen, S.; Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. In Bioinformatics Methods and Protocols; Humana Press: Totowa, NJ, USA, 1999; pp. 365–386. [Google Scholar]
  29. Schuler, G.D. Sequence mapping by electronic PCR. Genome Res. 1997, 7, 541–550. [Google Scholar] [CrossRef] [PubMed]
  30. Liu, Q.; Wang, C.; Jiao, X.; Zhang, H.; Song, L.; Li, Y.; Gao, C.; Wang, K. Hi-TOM: A platform for high-throughput tracking of mutations induced by CRISPR/Cas systems. Sci. China Life Sci. 2019, 62, 1–7. [Google Scholar] [CrossRef] [PubMed]
  31. Price, M.N.; Dehal, P.S.; Arkin, A.P. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS ONE 2010, 5, e9490. [Google Scholar] [CrossRef] [PubMed]
  32. Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
  33. Liang, Z.; Duan, S.; Sheng, J.; Zhu, S.; Ni, X.; Shao, J.; Liu, C.; Nick, P.; Du, F.; Fan, P.; et al. Whole-genome resequencing of 472 Vitis accessions for grapevine diversity and demographic history analyses. Nat. Commun. 2019, 10, 1190. [Google Scholar] [CrossRef]
  34. Myles, S.; Boyko, A.R.; Owens, C.L.; Brown, P.J.; Grassi, F.; Aradhya, M.K.; Prins, B.; Reynolds, A.; Chia, J.-M.; Ware, D.; et al. Genetic structure and domestication history of the grape. Proc. Natl. Acad. Sci. USA 2011, 108, 3530–3535. [Google Scholar] [CrossRef]
  35. Kvikstad, E.M.; Tyekucheva, S.; Chiaromonte, F.; Makova, K.D. A macaque’s-eye view of human insertions and deletions: Differences in mechanisms. PLoS Comput. Biol. 2007, 3, e176. [Google Scholar] [CrossRef] [PubMed]
  36. Jain, A.; Roorkiwal, M.; Kale, S.; Garg, V.; Yadala, R.; Varshney, R.K. InDel markers: An extended marker resource for molecular breeding in chickpea. PLoS ONE 2019, 14, e0213999. [Google Scholar] [CrossRef] [PubMed]
  37. Adedze, Y.M.N.; Lu, X.; Xia, Y.; Sun, Q.; Nchongboh, C.G.; Alam, M.A.; Liu, M.; Yang, X.; Zhang, W.; Deng, Z.; et al. Agarose-resolvable InDel markers based on whole genome re-sequencing in cucumber. Sci. Rep. 2021, 11, 3872. [Google Scholar] [CrossRef]
  38. Pan, G.; Li, Z.; Huang, S.; Tao, J.; Shi, Y.; Chen, A.; Li, J.; Tang, H.; Chang, L.; Deng, Y.; et al. Genome-wide development of insertion-deletion (InDel) markers for Cannabis and its uses in genetic structure analysis of Chinese germplasm and sex-linked marker identification. BMC Genom. 2021, 22, 595. [Google Scholar] [CrossRef] [PubMed]
  39. Lijavetzky, D.; Cabezas, J.; Ibáñez, A.; Rodríguez, V.; Martínez-Zapater, J.M. High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology. BMC Genom. 2007, 8, 424. [Google Scholar] [CrossRef] [PubMed]
  40. Yang, Y.; Lyu, M.; Liu, J.; Wu, J.; Wang, Q.; Xie, T.; Li, H.; Chen, R.; Sun, D.; Yang, Y.; et al. Construction of an SNP fingerprinting database and population genetic analysis of 329 cauliflower cultivars. BMC Plant Biol. 2022, 22, 522. [Google Scholar] [CrossRef]
  41. Shen, Y.; Wang, J.; Shaw, R.K.; Yu, H.; Sheng, X.; Zhao, Z.; Li, S.; Gu, H. Development of GBTS and KASP panels for genetic diversity, population structure, and fingerprinting of a large collection of broccoli (Brassica oleracea L. var. italica) in China. Front. Plant Sci. 2021, 12, 655254. [Google Scholar] [CrossRef]
  42. Eltaher, S.; Sallam, A.; Belamkar, V.; Emara, H.A.; Nower, A.A.; Salem, K.F.M.; Poland, J.; Baenziger, P.S. Genetic diversity and population structure of F3: 6 Nebraska winter wheat genotypes using genotyping-by-sequencing. Front. Genet. 2018, 9, 76. [Google Scholar] [CrossRef]
Figure 1. Development of multi-allelic InDel markers in grapevine. (A) Pipeline for grapevine fingerprint database construction based on multi-allelic InDel markers. (B) Gene (black line plot in the outer track) and InDel (red line plot in the inner track) density derived from 499 grape resequencing data across 19 chromosomes. (C) Physical location of 80 high-quality multi-allelic InDels. A total of 24 highly polymorphic primers were labeled in red style. (D) Agarose gel electrophoresis identification of 24 polymorphic InDels selected from the 80 high-quality markers. A total of 11 markers in the red boxes were selected for further analysis.
Figure 1. Development of multi-allelic InDel markers in grapevine. (A) Pipeline for grapevine fingerprint database construction based on multi-allelic InDel markers. (B) Gene (black line plot in the outer track) and InDel (red line plot in the inner track) density derived from 499 grape resequencing data across 19 chromosomes. (C) Physical location of 80 high-quality multi-allelic InDels. A total of 24 highly polymorphic primers were labeled in red style. (D) Agarose gel electrophoresis identification of 24 polymorphic InDels selected from the 80 high-quality markers. A total of 11 markers in the red boxes were selected for further analysis.
Horticulturae 10 00752 g001
Figure 2. Genetic information of 11 core multi-allelic InDels. (A) Polymorphic information content (PIC) of 11 core markers in 123 grape cultivars. (B) Gene diversity index (GDI) of 11 core markers in 123 grape cultivars. (C) Heterozygosity (Het) of 11 core markers in 123 grape cultivars. (D) Minor allele frequency (MAF) of 11 core markers in 123 grape cultivars.
Figure 2. Genetic information of 11 core multi-allelic InDels. (A) Polymorphic information content (PIC) of 11 core markers in 123 grape cultivars. (B) Gene diversity index (GDI) of 11 core markers in 123 grape cultivars. (C) Heterozygosity (Het) of 11 core markers in 123 grape cultivars. (D) Minor allele frequency (MAF) of 11 core markers in 123 grape cultivars.
Horticulturae 10 00752 g002
Figure 3. Fingerprinting of 123 grape cultivars. (A) The fingerprints of 123 grape cultivars based on agarose gel electrophoresis results. (B) The fingerprints of 123 grape cultivars based on NGS. (C) The number of genotypes identified by agarose gel electrophoresis and NGS. The y-axis was the genotype number of markers revealed by NGS and gel. (D) The discernibility of different combinations of 11 multi-allelic InDel markers for 123 grape accessions.
Figure 3. Fingerprinting of 123 grape cultivars. (A) The fingerprints of 123 grape cultivars based on agarose gel electrophoresis results. (B) The fingerprints of 123 grape cultivars based on NGS. (C) The number of genotypes identified by agarose gel electrophoresis and NGS. The y-axis was the genotype number of markers revealed by NGS and gel. (D) The discernibility of different combinations of 11 multi-allelic InDel markers for 123 grape accessions.
Horticulturae 10 00752 g003
Figure 4. Population structure analysis of 123 grape cultivars. (A) The phylogenetic tree of 123 grape cultivars based on agarose gel electrophoresis results. (B) PCA analysis based on agarose gel electrophoresis results. (C) The phylogenetic tree of 123 grape cultivars based on the NGS results. (D) PCA analysis based on the NGS results. All colors were marked according to the NGS results: Pop−1 (red), Pop−2 (green) and Pop−3 (purple).
Figure 4. Population structure analysis of 123 grape cultivars. (A) The phylogenetic tree of 123 grape cultivars based on agarose gel electrophoresis results. (B) PCA analysis based on agarose gel electrophoresis results. (C) The phylogenetic tree of 123 grape cultivars based on the NGS results. (D) PCA analysis based on the NGS results. All colors were marked according to the NGS results: Pop−1 (red), Pop−2 (green) and Pop−3 (purple).
Horticulturae 10 00752 g004
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jia, G.; Zhang, N.; Yang, Y.; Jin, Q.; Jiang, J.; Zhang, H.; Guo, Y.; Wang, Q.; Zhang, H.; Wu, J.; et al. NGS-Based Multi-Allelic InDel Genotyping and Fingerprinting Facilitate Genetic Discrimination in Grapevine (Vitis vinifera L.). Horticulturae 2024, 10, 752. https://doi.org/10.3390/horticulturae10070752

AMA Style

Jia G, Zhang N, Yang Y, Jin Q, Jiang J, Zhang H, Guo Y, Wang Q, Zhang H, Wu J, et al. NGS-Based Multi-Allelic InDel Genotyping and Fingerprinting Facilitate Genetic Discrimination in Grapevine (Vitis vinifera L.). Horticulturae. 2024; 10(7):752. https://doi.org/10.3390/horticulturae10070752

Chicago/Turabian Style

Jia, Guiying, Na Zhang, Yingxia Yang, Qingdong Jin, Jianfu Jiang, Hong Zhang, Yutong Guo, Qian Wang, He Zhang, Jianjin Wu, and et al. 2024. "NGS-Based Multi-Allelic InDel Genotyping and Fingerprinting Facilitate Genetic Discrimination in Grapevine (Vitis vinifera L.)" Horticulturae 10, no. 7: 752. https://doi.org/10.3390/horticulturae10070752

APA Style

Jia, G., Zhang, N., Yang, Y., Jin, Q., Jiang, J., Zhang, H., Guo, Y., Wang, Q., Zhang, H., Wu, J., Chen, R., Huang, J., & Lyu, M. (2024). NGS-Based Multi-Allelic InDel Genotyping and Fingerprinting Facilitate Genetic Discrimination in Grapevine (Vitis vinifera L.). Horticulturae, 10(7), 752. https://doi.org/10.3390/horticulturae10070752

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop