Genetic Diversity Analysis of Cotton Cultivars Using a 40K Liquid Chip in Northern Xinjiang
Abstract
1. Introduction
2. Results
2.1. The SNP Distribution Characteristics in the Northern Xinjiang Cotton Population
2.2. Genetic Diversity in Northern Xinjiang Cotton Cultivars
2.3. Genetic Basis of Northern Xinjiang Cotton Cultivar Improvement
3. Discussion
4. Materials and Methods
4.1. Materials
4.2. Genotyping and SNP Analysis
4.3. SNP Variant Annotation
4.4. Kinship Analysis
4.5. Transcriptome Analysis
4.6. Published Data Download
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Fang, L.; Wang, Q.; Hu, Y.; Jia, Y.H.; Chen, J.D.; Liu, B.L.; Zhang, Z.Y.; Guan, X.Y.; Chen, S.Q.; Zhou, B.L.; et al. Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits. Nat. Genet. 2017, 49, 1089–1098. [Google Scholar] [CrossRef]
- Wang, M.; Tu, L.; Lin, M.; Lin, Z.; Wang, P.; Yang, Q.; Ye, Z.; Shen, C.; Li, J.; Zhang, L.; et al. Asymmetric subgenome selection and cis-regulatory divergence during cotton domestication. Nat. Genet. 2017, 49, 579–587. [Google Scholar] [CrossRef]
- Ma, Z.Y.; He, S.P.; Wang, X.F.; Sun, J.L.; Zhang, Y.; Zhang, G.Y.; Wu, L.Q.; Li, Z.K.; Liu, Z.H.; Sun, G.F.; et al. Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield. Nat. Genet. 2018, 50, 803–813. [Google Scholar] [CrossRef] [PubMed]
- Percy, R.G.; Wendel, J.F. Allozyme evidence for the origin and diversification of Gossypium barbadense L. Theor. Appl. Genet. 1990, 79, 529–542. [Google Scholar] [CrossRef]
- Wendel, J.F.; Percy, R.G. Allozyme diversity and introgression in the Galapagos Islands endemic Gossypium darwinii and its relationship to continental G. barbadense. Biochem. Syst. Ecol. 1990, 18, 517–528. [Google Scholar] [CrossRef]
- Wendel, J.F.; Brubaker, C.L.; Percival, A.E. Genetic diversity in Gossypium hirsutum and the origin of upland cotton. Am. J. Bot. 1992, 79, 1291–1310. [Google Scholar] [CrossRef]
- Brubaker, C.L.; Wendel, J.F. Reevaluating the origin of domesticated cotton (Gossypium hirsutum; Malvaceae) using nuclear restriction fragment length polymorphisms (RFLPs). Am. J. Bot. 1994, 81, 1309–1326. [Google Scholar] [CrossRef]
- Hulse-Kemp, A.M.; Lemm, J.; Plieske, J.; Ashrafi, H.; Buyyarapu, R.; Fang, D.D.; Frelichowski, J.; Giband, M.; Hague, S.; Hinze, L.L.; et al. Development of a 63K SNP array for cotton and high-density mapping of intraspecific and interspecific populations of Gossypium spp. G3 (Bethesda) 2015, 5, 1187–1209. [Google Scholar] [CrossRef] [PubMed]
- Malik, W.; Ashraf, J.; Iqbal, M.Z.; Khan, A.A.; Qayyum, A.; Ali Abid, M.; Noor, E.; Ahmad, M.Q.; Abbasi, G.H. Molecular markers and cotton genetic improvement: Current status and future prospects. Sci. World J. 2014, 2014, 607091. [Google Scholar] [CrossRef]
- Jin, S.; Han, Z.; Hu, Y.; Si, Z.; Dai, F.; He, L.; Cheng, Y.; Li, Y.; Zhao, T.; Fang, L. Structural variation (SV)-based pan-genome and GWAS reveal the impacts of SVs on the speciation and diversification of allotetraploid cottons. Mol. Plant 2023, 16, 678–693. [Google Scholar] [CrossRef]
- Wang, S.; Chen, J.D.; Zhang, W.P.; Hu, Y.; Chang, L.J.; Fang, L.; Wang, Q.; Lv, F.N.; Wu, H.T.; Si, Z.F.; et al. Sequence-based ultra-dense genetic and physical maps reveal structural variations of allopolyploid cotton genomes. Genome Biol. 2015, 16, 108–125. [Google Scholar] [CrossRef]
- Zhou, Z.; Jiang, Y.; Wang, Z.; Gou, Z.; Lyu, J.; Li, W.; Yu, Y.; Shu, L.; Zhao, Y.; Ma, Y. Resequencing 302 wild and cultivated accessions identifies genes related to domestication and improvement in soybean. Nat. Biotechnol. 2015, 33, 408–414, Erratum in Nat. Biotechnol. 2016, 34, 441. [Google Scholar] [CrossRef]
- Huang, X.; Sang, T.; Zhao, Q.; Feng, Q.; Zhao, Y.; Li, C.; Zhu, C.; Lu, T.; Zhang, Z.; Li, M. Genome-wide association studies of 14 agronomic traits in rice landraces. Nat. Genet. 2010, 42, 961–967. [Google Scholar] [CrossRef]
- Zhang, J.; Yang, J.J.; Zhang, L.K.; Luo, J.; Zhao, H.; Zhang, J.; Wen, C.L. A new SNP genotyping technology Target SNP-seq and its application in genetic analysis of cucumber varieties. Sci. Rep. 2020, 10, 5623–5633, Correction in Sci. Rep. 2021, 11, 8010. [Google Scholar] [CrossRef] [PubMed]
- Guo, Z.F.; Yang, Q.; Huang, F.F.; Zheng, H.J.; Sang, Z.Q.; Xu, Y.F.; Zhang, C.; Wu, K.S.; Tao, J.J.; Prasanna, B.M.; et al. Development of high-resolution multiple-SNP arrays for genetic analyses and molecular breeding through genotyping by target sequencing and liquid chip. Plant Commun. 2021, 2, 100230. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.C.; Liu, S.L.; Zhang, Z.F.; Ni, L.B.; Chen, X.M.; Ge, Y.X.; Zhou, G.A.; Tian, Z.X. GenoBaits Soy40K: A highly flexible and low-cost SNP array for soybean studies. Sci. China Life Sci. 2020, 65, 359–362. [Google Scholar] [CrossRef]
- Cui, F.; Zhang, N.; Fan, X.; Zhang, W.; Zhao, C.; Yang, L.; Pan, R.; Chen, M.; Han, J.; Zhao, X. Utilization of a Wheat660K SNP array-derived high-density genetic map for high-resolution mapping of a major QTL for kernel number. Sci. Rep. 2017, 7, 3788–3799. [Google Scholar] [CrossRef]
- Liu, S.; Xiang, M.; Wang, X.; Li, J.; Cheng, X.; Li, H.; Singh, R.P.; Bhavani, S.; Huang, S.; Zheng, W.; et al. Development and application of the GenoBaits WheatSNP16K array to accelerate wheat genetic research and breeding. Plant Commun. 2025, 6, 101138. [Google Scholar] [CrossRef]
- Si, Z.; Jin, S.; Li, J.; Han, Z.; Li, Y.; Wu, X.; Ge, Y.; Fang, L.; Zhang, T.; Hu, Y. The design, validation, and utility of the “ZJU CottonSNP40K” liquid chip through genotyping by target sequencing. Ind. Crop. Prod. 2022, 188, 115629–115636. [Google Scholar]
- Chen, H.; Han, Z.; Ma, Q.; Dong, C.; Ning, X.; Li, J.; Lin, H.; Xu, S.; Li, Y.; Hu, Y.; et al. Identification of elite fiber quality loci in upland cotton based on the genotyping-by-target-sequencing technology. Front. Plant Sci. 2022, 13, 1027806. [Google Scholar] [CrossRef]
- Fan, M.; Wang, M.; Bai, M.-Y. Diverse roles of SERK family genes in plant growth, development and defense response. Sci. China Life Sci. 2016, 59, 889–896. [Google Scholar] [CrossRef]
- Lan, Z.; Song, Z.; Wang, Z.; Li, L.; Liu, Y.; Zhi, S.; Wang, R.; Wang, J.; Li, Q.; Bleckmann, A.; et al. Antagonistic RALF peptides control an intergeneric hybridization barrier on Brassicaceae stigmas. Cell 2023, 186, 4773–4787.e4712. [Google Scholar] [CrossRef]
- Zhang, Y.; Tian, H.; Chen, D.; Zhang, H.; Sun, M.; Chen, S.; Qin, Z.; Ding, Z.; Dai, S. Cysteine-rich receptor-like protein kinases: Emerging regulators of plant stress responses. Trends Plant Sci. 2023, 28, 776–794. [Google Scholar] [CrossRef]
- Zhang, T.Z.; Hu, Y.; Jiang, W.K.; Fang, L.; Guan, X.Y.; Chen, J.D.; Zhang, J.B.; Saski, C.A.; Scheffler, B.E.; Stelly, D.M.; et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat. Biotechnol. 2015, 33, 531–537. [Google Scholar] [CrossRef] [PubMed]
- He, S.P.; Sun, G.F.; Geng, X.L.; Gong, W.F.; Dai, P.H.; Jia, Y.H.; Shi, W.J.; Pan, Z.E.; Wang, J.D.; Wang, L.Y.; et al. The genomic basis of geographic differentiation and fiber improvement in cultivated cotton. Nat. Genet. 2021, 53, 916–924. [Google Scholar] [CrossRef]
- Li, X.; Wang, Y.; Cai, C.; Ji, J.; Han, F.; Zhang, L.; Chen, S.; Zhang, L.; Yang, Y.; Tang, Q. Large-scale gene expression alterations introduced by structural variation drive morphotype diversification in Brassica oleracea. Nat. Genet. 2024, 56, 517–529. [Google Scholar] [CrossRef] [PubMed]
- Zhang, C.; Shao, Z.; Kong, Y.; Du, H.; Li, W.; Yang, Z.; Li, X.; Ke, H.; Sun, Z.; Shao, J. High-quality genome of a modern soybean cultivar and resequencing of 547 accessions provide insights into the role of structural variation. Nat. Genet. 2024, 56, 2247–2258. [Google Scholar] [CrossRef]
- Guan, H.; Lu, Y.; Li, X.; Liu, B.; Li, Y.; Zhang, D.; Liu, X.; He, G.; Li, Y.; Wang, H.; et al. Development of a MaizeGerm50K array and application to maize genetic studies and breeding. Crop J. 2024, 12, 1686–1696. [Google Scholar] [CrossRef]
- Li, Z.; Wang, L.; Liu, Y.; Ma, X.; Zhang, A.; Luo, Z.; Yan, M.; Zhou, L.; Chen, L.; Luo, L.; et al. WDR6K, a designed SNP array for the research and improvement of rice drought-resistance. Plant Stress. 2025, 15, 100800. [Google Scholar] [CrossRef]
- Fang, L.; Gong, H.; Hu, Y.; Liu, C.; Zhou, B.; Huang, T.; Wang, Y.; Chen, S.; Fang, D.D.; Du, X.; et al. Genomic insights into divergence and dual domestication of cultivated allotetraploid cottons. Genome Biol. 2017, 18, 33–45. [Google Scholar] [CrossRef]
- Li, Y.; Si, Z.; Wang, G.; Shi, Z.; Chen, J.; Qi, G.; Jin, S.; Han, Z.; Gao, W.; Tian, Y. Genomic insights into the genetic basis of cotton breeding in China. Mol. Plant 2023, 16, 662–677. [Google Scholar] [CrossRef]
- Paterson, A.H.; Brubaker, C.L.; Wendel, J.F. A rapid method for extraction of cotton (Gossypium spp.) genomic DNA suitable for RFLP or PCR analysis. Plant Mol. Biol. Rep. 1993, 11, 122–127. [Google Scholar] [CrossRef]
- Hu, Y.; Chen, J.D.; Fang, L.; Zhang, Z.Y.; Ma, W.; Niu, Y.C.; Ju, L.Z.; Deng, J.Q.; Zhao, T.; Lian, J.M.; et al. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat. Genet. 2019, 51, 739–748. [Google Scholar] [CrossRef]
- Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef]
- Wang, K.; Li, M.Y.; Hakonarson, H. ANNOVAR: Functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010, 38, e164. [Google Scholar] [CrossRef]
- Felsenstein, J. PHYLIP (Phylogeny Inference Package), version 3.6.; Department of Genome Sciences, University of Washington: Seattle, WA, USA, 2005. [Google Scholar]
- Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
- Zheng, X.; Gogarten, S.M.; Lawrence, M.; Stilp, A.; Conomos, M.P.; Weir, B.S.; Laurie, C.; Levine, D. SeqArray—A storage-efficient high-performance data format for WGS variant calls. Bioinformatics 2017, 33, 2251–2257. [Google Scholar] [CrossRef] [PubMed]
- Kim, D.; Paggi, J.M.; Park, C.; Bennett, C.; Salzberg, S.L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019, 37, 907–915. [Google Scholar] [CrossRef] [PubMed]
- Pertea, M.; Pertea, G.M.; Antonescu, C.M.; Chang, T.-C.; Mendell, J.T.; Salzberg, S.L. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015, 33, 290–295. [Google Scholar] [CrossRef] [PubMed]
- Dai, F.; Chen, J.; Zhang, Z.; Liu, F.; Li, J.; Zhao, T.; Hu, Y.; Zhang, T.; Fang, L. COTTONOMICS: A comprehensive cotton multi-omics database. Database 2022, 2022, baac080. [Google Scholar] [CrossRef]
- Han, Z.G.; Chen, H.; Cao, Y.W.; He, L.; Si, Z.F.; Hu, Y.; Lin, H.; Ning, X.Z.; Li, J.L.; Ma, Q.; et al. Genomic insights into genetic improvement of upland cotton in the world’s largest growing region. Ind. Crop. Prod. 2022, 183, 114929–114938. [Google Scholar] [CrossRef]
- Ma, Z.; Zhang, Y.; Wu, L.; Zhang, G.; Sun, Z.; Li, Z.; Jiang, Y.; Ke, H.; Chen, B.; Liu, Z.; et al. High-quality genome assembly and resequencing of modern cotton cultivars provide resources for crop improvement. Nat. Genet. 2021, 53, 1385–1391. [Google Scholar] [CrossRef] [PubMed]



| Chr | Length (bp) | SNP Number | Chr | Length (bp) | SNP Number |
|---|---|---|---|---|---|
| A01 | 118,174,371 | 1610 | D01 | 64,698,102 | 1118 |
| A02 | 108,272,889 | 625 | D02 | 69,777,850 | 1003 |
| A03 | 111,586,618 | 792 | D03 | 53,896,199 | 785 |
| A04 | 87,703,368 | 547 | D04 | 56,935,404 | 555 |
| A05 | 110,845,161 | 1123 | D05 | 63,929,679 | 890 |
| A06 | 126,488,190 | 1323 | D06 | 65,459,843 | 1207 |
| A07 | 96,598,283 | 1625 | D07 | 58,417,686 | 837 |
| A08 | 125,056,055 | 2168 | D08 | 69,080,421 | 1052 |
| A09 | 83,216,487 | 985 | D09 | 52,000,373 | 959 |
| A10 | 115,096,118 | 947 | D10 | 66,881,427 | 877 |
| A11 | 121,376,521 | 1372 | D11 | 71,358,197 | 754 |
| A12 | 107,588,319 | 867 | D12 | 61,693,100 | 852 |
| A13 | 110,367,549 | 1238 | D13 | 64,447,585 | 741 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zheng, Z.; Wang, N.; Jin, S.; Ning, K.; Feng, G.; Gao, H.; Si, Z.; Zhang, T.; Ai, N. Genetic Diversity Analysis of Cotton Cultivars Using a 40K Liquid Chip in Northern Xinjiang. Int. J. Mol. Sci. 2026, 27, 545. https://doi.org/10.3390/ijms27010545
Zheng Z, Wang N, Jin S, Ning K, Feng G, Gao H, Si Z, Zhang T, Ai N. Genetic Diversity Analysis of Cotton Cultivars Using a 40K Liquid Chip in Northern Xinjiang. International Journal of Molecular Sciences. 2026; 27(1):545. https://doi.org/10.3390/ijms27010545
Chicago/Turabian StyleZheng, Zhihong, Ningshan Wang, Shangkun Jin, Kewei Ning, Guoli Feng, Haiqiang Gao, Zhanfeng Si, Tianzhen Zhang, and Nijiang Ai. 2026. "Genetic Diversity Analysis of Cotton Cultivars Using a 40K Liquid Chip in Northern Xinjiang" International Journal of Molecular Sciences 27, no. 1: 545. https://doi.org/10.3390/ijms27010545
APA StyleZheng, Z., Wang, N., Jin, S., Ning, K., Feng, G., Gao, H., Si, Z., Zhang, T., & Ai, N. (2026). Genetic Diversity Analysis of Cotton Cultivars Using a 40K Liquid Chip in Northern Xinjiang. International Journal of Molecular Sciences, 27(1), 545. https://doi.org/10.3390/ijms27010545
