Next Article in Journal
Overcoming Immunological Challenges to Helper-Dependent Adenoviral Vector-Mediated Long-Term CFTR Expression in Mouse Airways
Next Article in Special Issue
Characterizing Y-STRs in the Evaluation of Population Differentiation Using the Mean of Allele Frequency Difference between Populations
Previous Article in Journal
Characterization of the Common Japonica-Originated Genomic Regions in the High-Yielding Varieties Developed from Inter-Subspecific Crosses in Temperate Rice (Oryza sativa L.)
Previous Article in Special Issue
Ancestry Prediction Comparisons of Different AISNPs for Five Continental Populations and Population Structure Dissection of the Xinjiang Hui Group via a Self-Developed Panel
Open AccessArticle

Joint Genetic Analyses of Mitochondrial and Y-Chromosome Molecular Markers for a Population from Northwest China

by 1,2,3, 4, 1,2,3, 1,2,3, 1,2,3 and 1,2,3,5,*
Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi’an Jiaotong University, Xi’an 710004, China
Clinical Research Center of Shaanxi Province for Dental and Maxillofacial Diseases, College of Stomatology, Xi’an Jiaotong University, Xi’an 710004, China
College of Medicine & Forensics, Xi’an Jiaotong University Health Science Center, Xi’an 710061, China
Department of Epidemiology, University of Washington, Seattle, WA 98105, USA
Multi-Omics Innovative Research Center of Forensic Identification, Department of Forensic Genetics, School of Forensic Medicine, Southern Medical University, Guangzhou 510515, China
Author to whom correspondence should be addressed.
Genes 2020, 11(5), 564;
Received: 6 March 2020 / Revised: 9 May 2020 / Accepted: 11 May 2020 / Published: 18 May 2020
(This article belongs to the Special Issue Forensic Genetics and Genomics)


The genetic markers on mitochondria DNA (mtDNA) and Y-chromosome can be applied as a powerful tool in population genetics. We present a study to reveal the genetic background of Kyrgyz group, a Chinese ethnic group living in northwest China, and genetic polymorphisms of 60 loci on maternal inherited mtDNA and 24 loci on paternal inherited Y-chromosome short tandem repeats (Y-STRs) were investigated. The relationship between the two systems was tested, and the result indicated that they were statistically independent from each other. The genetic distances between Kyrgyz group and 11 reference populations for mtDNA, and 13 reference populations for Y-STRs were also calculated, respectively. The present results demonstrated that the Kyrgyz group was genetically closer to East Asian populations than European populations based on the mtDNA loci but the other way around for the Y-STRs. The genetic analyses could largely strengthen the understanding for the genetic background of the Kyrgyz group.
Keywords: mtDNA; Y-STRs; independence test mtDNA; Y-STRs; independence test

1. Introduction

As one of the populous countries in the world, China consists of 56 official ethnic groups, some of them, including the Uygur, Kazak, Uzbek, and Kyrgyz groups, live in northwest China [1]. On the basis of molecular genetic markers on human autosomes [2,3,4,5,6], the previous studies indicated that these ethnic groups had ancestral components of East Asian and European populations. There are more than 186,000 Kyrgyz individuals ( in China. Based on two types of autosomal markers, insertion/deletion (InDel) [3] and short tandem repeat (STR) loci [4], our previous studies revealed that the Kyrgyz group was genetically closer to other ethnic groups in northwest China. The current research was conducted on the Kyrgyz group using genetic markers of maternal inherited mitochondria DNA (mtDNA) and paternal inherited Y-chromosome STRs (Y-STRs) in order to reveal the genetic relationships between Kyrgyz group and different reference populations from the maternal and paternal perspective [7,8].
The marker system on mtDNA can be applied as a useful tool for individual identification, maternal lineage test, and ancestry inference in forensic applications [9,10,11]. Its rapid mutation rate, lacking recombination, maternal inheritance and high polymorphism, make mtDNA loci the reasonable marker system for analyzing maternal genetic relationships among different ethnic groups. Compared with nuclear DNA, mtDNA has great advantages in forensic applications for testing highly degraded, damaged, or trace amounts of biological samples [12] due to its high copy numbers per cell, which improves the success rate of mtDNA detection [11].
Another marker system used in this study is Y-STRs, which contribute to plenty of Y chromosomal haplotypes for forensic applications, historical researches, and genealogical investigation of paternal relatives [13,14]. Based on that, there is a large public database, named the Y-STR Haplotype Reference Database (YHRD,, which contains a large number of Y-STR haplotypes from different populations all over the world, can be used as reference population data.
The purpose of the current study was to further understand the historical background of the Kyrgyz group and its paternal and maternal genetic relationships with various populations. We also conducted an independence test between the two marker systems in this study based on the method, an extension of contingency chi-square tests for linkage disequilibrium [15], as suggested by Dr. Bruce Weir from the University of Washington.

2. Methods and Materials

2.1. Sample Collections

To detect the mtDNA loci and Y-STRs, we collected the blood samples of unrelated healthy individuals from Kyrgyz group in the Kizilsu Kirghiz Autonomous Prefecture, in the northwest of the People’s Republic of China. Nm (Nm = 138) samples were detected by the mtDNA marker system, Ny (Ny = 241) male samples were detected by the Y-STR marker system, and Nj (Nj = 88) samples were detected by both of the two marker systems. This study was performed in accordance with the humane and ethical research principles approved by the ethical committee of Xi’an Jiaotong University Health Science Center, China. All the reference population data for mtDNA and Y-STRs were collected from previously published researches.

2.2. Amplification and Genotyping

We detected 60 mtDNA loci using a commercial kit named Expressmarker mtDNA-SNP 60 (AGCU ScienTech Incorporation, Jiangsu, China), which contained 58 mtDNA single nucleotide polymorphisms (SNPs) including nt152, nt709, nt1541, nt1719, nt1811, nt2706, nt3010, nt3348, nt3970, nt4216, nt4491, nt4833, nt4883, nt5178, nt5417, nt5442, nt5460, nt6446, nt7028, nt7196, nt7600, nt8020, nt8414, nt8584, nt8684, nt8697, nt8701, nt8793, nt8794, nt8964, nt9123, nt9477, nt9545, nt9698, nt9824, nt10310, nt10397, nt10398, nt10400, nt10873, nt11215, nt11251, nt11719, nt12372, nt12705, nt12811, nt13104, nt13928, nt14569, nt14668, nt15043, nt15784, nt16126, nt16129, nt16311, nt16316, nt16319, nt16362), an InDel locus (DEL, the 9bp deletion of CCCCCTCTA sequence; NORM, non-deletion of 9bp), and a (CA)n locus (nt514-nt524). These 60 loci were separately amplified with three multiplex amplification panels in three independent 10ul volume systems according to the manufacturer’s instruction [16], analyzed using capillary electrophoresis platform performed by ABI Prism 3500xL Genetic Analyzer (Applied Biosystems, Foster City, CA, USA), and genotyped by GeneMapper® ID-X software v 1.3 (Applied Biosystems, Forster City, CA, USA).
Another commercial kit, Microreader 24Y Direct ID System (Microread Genetics Ltd., Jiangsu, China) was applied to detect 24 Y-STRs, containing 23 overlapped loci with PowerPlex Y23 System (DYS393, DYS570, DYS19, DYS392, DYS549, YGATAH4, DYS458, DYS481, DYS635, DYS448, DYS533, DYS456, DYS389I, DYS390, DYS389II, DYS438, DYS576, DYS391, DYS439, DYS437, DYS385a/b, and DYS643) and DYS460 loci. Multiplex amplification was conducted in a 10ul volume system as descried in the manufacturer’s instruction [17]. Twenty-four Y-STRs were analyzed and genotyped by the same platform and software as the detection of 60 mtDNA loci.
In this research, M4615 (Microread Genetics Ltd., Jiangsu, China) and 9947A (Promega, Madison, WI, USA) were used as control DNA in the genotyping of Y-STRs and mtDNA loci, respectively. And ddH2O was used as negative control.

2.3. Statistical Analyses

For mtDNA loci, the two forensic parameters, haplotype diversity (HDm) and nucleotide diversity (NDm), are calculated by DnaSP version 5.0 [18], respectively. Random matching probability (RMPm) and discrimination power (DPm) are calculated by the formula (1) below [19].
RMP m = P m h i 2
where P m h i indicates haplotype frequency of the i -th haplotype on mtDNA in a certain population.
Gene diversity (GDy), haplotype diversity (HDy), and matching probability (MPy) for Y-STRs are measured using the following formulas (2), (3) and (4) [19]:
GD y   = N y N y 1 ( 1 P y a i 2 )
HD y   = N y N y 1 ( 1 P y h i 2 )
MP y   = P y a i 2
where N y indicates the total number of individuals who are detected by the Y-STR marker system in a population, P y a i is the allele frequency of the i -th allele of locus “a” on Y chromosome, and P y h i is the haplotype frequency of the i -th haplotype on Y chromosome in a population.
After aligning with the revised Cambridge Reference Sequence (rCRS) [20], the genotyping results of mtDNA loci except (CA)n were classified into various haplogroups [8]. Meanwhile, the genetic distance, pairwise Fst values, for 58 mtDNA SNPs except 9bp and (CA)n among Kyrgyz group and other reference populations were estimated by Arlequin version 3.0 [21]. For Y-STRs, the Y-STR haplogroups were predicted by NevGen Genealogy Tools version 1.1 ( In addition, another value of genetic distance for 23 Y-STRs except DYS460, i.e. pairwise Rst values [22], were estimated online ( based on YHRD database. Plots of neighbor-joining (NJ) tree and multidimensional scaling analysis (MDS) were drawn by MEGA version 6.06 ( and PASW STATISTICS version 18.0 ( based on pairwise Fst values for 58 mtDNA SNPs and pairwise Rst values for 23 Y-STRs.
The test of independence between the two systems was conducted by R statistical software version 3.0.2 ( The 88 individuals detected by both systems were studied in this part. For each pair of individuals, the number of Y-STRs with the same genotypes was recorded as y , ranged from 0 to 24, and the number of mtDNA loci except 9bp and (CA)n with the same genotypes was recorded as m , which ranged from 0 to 58. There were 3828 distinct pairs of 88 individuals in total, therefore, we had the values of y i and m i , where i = 1 ,   2 ,   3828 . A Fisher’s exact test [23] by R function (fisher.test {stats}) for independence test between Y-STRs and mtDNA SNPs was performed based on the matrix formed by y i and m i .

3. Results

3.1. Forensic Parameters

Allele frequencies of 60 mtDNA loci in 138 Kyrgyz individuals are shown in Table S1. According to the results, most of the mtDNA SNPs (48 out of 58 SNPs) were detected as transition, two SNPs (nt5178 and nt7196) were identified as transversion, and eight SNPs (nt1541, nt3348, nt3970, nt6446, nt8697, nt8793, nt8964, and nt13928) were found to be no polymorphisms. In Table S2, the 60 mtDNA loci defined 55 haplotypes and 28 haplogroups except (CA)n for haplogroup analysis. The RMPm and DPm values calculated on the basis of the number of haplotypes were 0.0807 and 0.9193. In addition, the values of HDm and NDm were 0.9838 +/− 0.0035 and 0.1820 +/− 0.0929.
Allele frequencies and GDy and MPy values for 24 Y-STRs in 241 individuals are listed in Table S3. In total, 184 alleles on 24 Y-STRs were observed with their allele frequencies ranging from 0.0041 to 0.8631. The GDy values on 24 Y-STRs ranged from 0.2468 (DYS393) to 0.7653 (DYS389 II). A total of 152 haplotypes and 27 haplogroups were defined by the 24 Y-STRs, which contained 23 Y-STRs overlapped with PowerPlex Y23 System (Promega, Madison, USA), respectively (Table S4). Calculated from the haplotype frequencies, the HDy value was equal to 0.9863.

3.2. Haplogroup Analyses for 59 mtDNA loci

The haplogroup results for 59 mtDNA loci in 138 individuals were listed in Table S2. A pie chart of mtDNA haplogroup frequencies was shown in Figure 1a. The highest frequencies of mtDNA haplogroups were haplogroup C (14.49%) and haplogroup D4 (14.49%), followed by haplogroup H (13.77%) in Kyrgyz individuals. The largest haplogroup was observed as haplogroup D lineages, which included haplogroups D4 (14.49%), D4b (1.45%), D4e (1.45%), and D5 (2.90%), accounting for 20.29% in total.
According to the haplogroup geographic affiliations by Underhill and Kivisild [24], most of the mtDNA haplogroups in the 138 Kyrgyz individuals, which were more common in East Asian populations, were haplogroup D (20.29%) including haplogroups D4, D4e, D4b, and D5; haplogroup C (14.49%); haplogroup G (8.70%); haplogroup M (8.70%, except haplogroups C, D, G and Z) including haplogroups M*, M7b1a1, M7c2, M8a, M9, and M9a; haplogroup B (8.70%) including haplogroups B4’5, B4a, and B5; haplogroup A (6.52%); haplogroup N (2.17%, except haplogroups A, B, J, H, HV, U and W) including haplogroups N9 and N9a; and haplogroup Z (1.45%). The remaining haplogroups in the studied Kyrgyz group accounted for 30.43%, which were widely dispersed in European and South Asian populations in different ratios, and included haplogroup H (13.77%); haplogroup U (8.70%) including haplogroups U1, U2’3’4’7’, U5a’b, and U7; haplogroup HV (3.62%); haplogroup N (2.90%) including haplogroup N1a1; haplogroup J (0.72%); and haplogroup W (0.72%). A majority of the mtDNA haplogroups in the studied group was common in East and South Asian populations. These haplogroups, which were often observed in East Asian populations, accounted for over two thirds (69.57%) in the studied group, which implied critical maternal influence from East Asian populations.

3.3. Haplogroup Analyses for 23 Y-STRs

The Y-STR haplogroup distributions in 241 male individuals were listed in Table S4 and shown in Figure 1b. The frequency of the R1a haplogroup was the highest (48.13%), followed by the C2 M217 >> F1918 haplogroup (24.07%). The haplogroup R lineages had the dominant frequency with a total of 53.11%, consisting of haplogroups R1a (48.13%), R1b (0.83%), R1b M73 (3.32%), and R2 M479 (0.83%). The total frequency of haplogroup C lineages took the second place (26.56%), which included haplogroups C2 M217 misc (0.83%), C2 M217 > F1067 (0.83%), C2 M217 >> F1918 (24.07%), and C2b1a1b1 F3985 (0.83%), respectively.
The geographic affiliations of the Y-STR haplogroups for the paternal lineages in this study were different from those of the mtDNA haplogroups for the maternal lineages. The Y-STR haplogroup with the highest frequency in Kyrgyz individuals was haplogroup R1a (48.13%), which was mostly prevalent in European and South Asian populations, according to previously reported researches [25,26,27]. Haplogroup C-M217, mostly distributed in East Asian and American populations, also occurred frequently in the Kyrgyz group (with the frequency of 25.73%). The haplogroups J2, R1b, J1, G*, and I1 which were more common in European populations were 6.64%, 4.15%, 0.83%, 0.41%, and 0.41% in the studied Kyrgyz group, respectively [28], and the haplogroups J2 and J1 also occurred in South Asian and Sub-Saharan African populations, respectively [24]. Nevertheless, the Y-STR haplogroups that were mainly prevalent in East Asian populations presented at lower frequencies in Kyrgyz individuals, including the haplogroups D1 (3.32%, also in North Asian populations), N1 (3.32%), O2a (2.90%, also in South Asian populations) and O1 (0.41%). Compared with the maternal inheritance of mtDNA, the males of the Kyrgyz group kept more paternal ancestry component from European populations.

3.4. Population Genetic Differentiation Analyses for mtDNA

The pairwise Fst values for 58 mtDNA SNPs were calculated between the Kyrgyz group and 11 reference populations (East Asian populations: Beijing Han, Southern Han, Denver Han, Japanese [29], Xibe [30], and Kazak in northwest China [16]; European populations: Estonian [11], Hispanic, and Caucasian [31]; West Asian population: Iranian [32]; and African population: African American [31]). The pairwise Fst values were graphically represented by a heatmap in Figure 2a, which were indicated by gradient colors from blue (low values) to red (high values), ranging from 0.0007 (between Beijing Han and Sothern Han populations) to 0.4677 (between African American and Estonian populations). We also plotted the MDS (Figure 3a) and NJ tree (Figure 4a) based on the pairwise Fst values. In Figure 3a, the Kyrgyz and Kazak ethnic groups, located at the central area, were genetically close to each other and surrounded by other East Asian populations. There were three main branches on the NJ tree in the Figure 4a, in which the top branch had only one population (African American); the middle branch included Kazak, Kyrgyz, other East Asian populations, and a European population; and the bottom branch consisted of European and West Asian populations. In the middle branch, the Kyrgyz group was first clustered with Kazak ethnic group, and then with Xibe, Southern Han, Beijing Han, and Denver Han populations, and finally with Japanese and Hispanic populations.

3.5. Population Genetic Differentiation Analyses for Y-STRs

As shown in Figure 2b, a heatmap of the pairwise Rst values for 23 Y-STRs indicated the genetic relationships of Kyrgyz group and 13 reference populations. These 13 reference populations included East Asian populations: Beijing Han [33,34,35], Guangdong Han [36], Southern Han [37], Asian American [38], Japanese [39,40,41,42], and Qiemo Uygur and Xinjiang Uygur in northwest China [43,44,45]; European populations: Estonian [46], Italian [47,48,49,50,51,52], Norwegian [53,54], European American, and Hispanic American [38]; and African population: African American [38]. According to the Rst values, the Kyrgyz group had the closest distances to the northwest Chinese populations, Qiemo Uygur (Rst = 0.0831) and Xinjiang Uygur (Rst = 0.0840), followed by the European populations with Rst ranging from 0.1177 to 0.1639, and then by other populations from East Asia (the Rst values were in the range of 0.2199 to 0.2957). The Kyrgyz group was the farthest distant from the African Americans (Rst = 0.3591). The MDS and NJ tree were plotted based on Rst values, respectively. In the MDS plot (Figure 3b), the Kyrgyz group, located at the first quadrant, was closer to the European populations. The NJ tree (Figure 4b) showed that the African population formed the top branch by itself, the East Asian populations formed the middle branch, and the Kyrgyz, two Uygur ethnic groups, and European populations formed the bottom branch.

3.6. Independence Test for Two Marker Systems

The contingency table of the two genetic marker systems containing 3828 distinct pairs of 88 individuals was listed in Table S5. Fisher’s exact tests were conducted with simulated p-values based on 2000 replicates. There were no statistically significant correlations between the 58 mtDNA SNPs and 24 Y-STRs (p-value = 0.4168), indicating that the two systems were mutually independent from each other.

4. Discussion

In this study, we analyzed the genetic diversities based on the maternal inherited 60 mtDNA loci and the paternal inherited 24 Y-STRs for the Kyrgyz group in China. The haplogroup distributions for mtDNA and Y-STRs indicated the different genetic backgrounds of Kyrgyz group in maternal and paternal lineages. Also, we found that the results of the genetic relationships of the Kyrgyz group and reference populations were not completely equivalent between maternal and paternal inheritances.
Pairwise Fst or Rst values are fundamental measures to indicate the genetic distances between populations for mtDNA and Y-STRs, respectively. In our previous studies [3,4], we calculated pairwise Fst values among the Kyrgyz group and reference populations based on autosomal 30 InDels and 21 STRs to explore their genetic relationships in autosomal inheritance. Those results showed that Kyrgyz group had close genetic relationships to the Kazak and Uygur groups, who also lived in Xinjiang province. Then, the MDS plot for pairwise Fst values based on these autosomal InDels revealed that the Kyrgyz group was distributed in the middle of the plot and between East Asian and European populations [3].
In this study, we collected the genotyping data of populations worldwide as the reference populations to calculate pairwise Fst for mtDNA loci and pairwise Rst for Y-STRs. The two sets of values indicated that the Kyrgyz group had the closest genetic relationships with the ethnic groups in northwest China (the closest to the Kazak group on mtDNA markers and the closest to the Uygur group on Y chromosome markers). Beyond that, the pairwise Fst values for mtDNA SNPs revealed that the Kyrgyz group was genetically closer to East Asian populations, in contrast to the results from pairwise Rst values for Y-STRs, which showed that Kyrgyz group was genetically closer to European populations.
On the MDS plot for mtDNA (Figure 3a), the Kyrgyz and Kazak ethnic groups lied closely near East Asian populations. When it came to Y-STRs (Figure 3b), the Kyrgyz group was surrounded by European populations. We got similar results from the plots of the two NJ trees (Figure 4a,b). Compared with the paternal inheritance, the Kyrgyz group on the maternal inheritance was closer to East Asian populations, indicating some differences in their maternal and paternal evolutionary histories. Previously reported research indicated the similar pattern with the differences in maternal and paternal inheritances was also observed in the Kurdish groups by Nasidze et al. [55], which showed closer relationships with European populations than the Caucasian populations based on mtDNA but the opposite based on Y-chromosome markers.
The particular genetic structure of Kyrgyz group was related to its geographical location, historical background, and marriage customs. The Kyrgyz individuals collected by our study were from the Kizilsu Kirghiz Autonomous Prefecture in the northwest part of China, where the northern and western parts are bordered by Kyrgyzstan and Tajikistan, respectively. The area where the Kyrgyz group located was the route of the ancient Silk Road [56] and an important hub for exchanges between East Asia and Europe. This geographical location also creates an opportunity for genetic exchanges, which may be a reason to explain that the genetic structure of the Kyrgyz group is Eurasian in autosomal inheritance.
From the Han dynasty to the early Qing dynasty, the Kyrgyz people were active in the Yenisei River Basin [57]. In 1702, due to the invasion of Tsarist Russia and their fierce conflicted with the Kyrgyz people, the Junggar tribes, who ruled the area at that time, forced the Kyrgyz people to move to the Tianshan area, where they merged with a small number of Kyrgyz who had previously moved westward, and then the merged Kyrgyz people were distributed into the area surrounding Lake Issyk-Kul and the Chu and Taras river basins [58]. Subsequently, due to the oppression of the Junggar tribes, some of those Kyrgyz people moved to Tashkent, Fergana, and the surrounding areas in Central Asia, and others moved to the Pamirs, Kush Mountains, and Karakorum Mountains, forming the east–west “Brut” [59,60], which eventually evolved into the current distribution of the Kyrgyz group in China. During this long migration process, the Kyrgyz group intermarried with the surrounding ethnic groups, especially the ethnic groups from East Asia. Therefore, the ancestry components of East Asian populations were added to form the current genetic structure of the Chinese Kyrgyz group.
The Kyrgyz group adopts an outside-marriage system [61], which stipulates that individuals within seven generations of immediate family members or five generations of collateral relatives cannot be intermarried, nor can men and women who grew up drinking the same breast milk [62]. Before a couple from the Kyrgyz group gets engaged, their family and milking histories need to be known in great detail. Kyrgyz women cannot get married with men from other ethnic groups or men of different religions. However, Kyrgyz men are able to marry women from other ethnic groups, and women of different religions [63]. When a husband dies, the widow cannot take away their children and cannot change the tribe or ethnic group to which the children belong [58]. In the present study, approximate 70% of the mtDNA haplogroups in the Kyrgyz group distributed in East Asian populations based on maternal inheritance, whereas on account of Y-STR of paternal inheritance, the Kyrgyz group was closer to the European than East Asian populations. Previously reported research indicated that the Kyrgyz group had the mixture ancestry information components of East Asian and European populations in terms of autosomal genetic markers [3]. In the mitochondrial matrilineal genetic study, a very high proportion of ancestry component of the Kyrgyz group belonging to East Asian populations were mainly derived from the likely reason of intermarriage between Kyrgyz men and East Asian women according to Kyrgyz marriage custom. The Y chromosomal genetic markers strictly follow paternal inheritance, where little recombination occurs during the paternally genetic process. Thus, a large proportion of European ancestry component in Kyrgyz male individuals could still be detected in the Y-STR genetic system for paternal inheritance.
The Fisher’s exact tests indicated that the two genetic marker systems were mutually independent between mtDNA SNPs and Y-STRs. In fact, the Y chromosome DNA marker is inside the nucleus of the cell, whereas mtDNA is in the mitochondria outside the nucleus. Therefore, the two marker systems are generally considered as independent inheritances during the courses of maternal and paternal inheritances. As expected, there were no statistically significant correlations between the two marker systems in current study. Since independent inheritance is the premise for the joint application of two different marker systems, and mtDNA and Y chromosome genetic markers are always analyzed simultaneously in the field of population genetics [24,55], we presented a methodology to test for the independence between them. However, additional studies will be warranted to confirm or deny the present result.

5. Conclusions

The obtained mtDNA haplogroups in this study accounted for over two thirds of the studied Kyrgyz individuals, which were also common in East Asian populations, whereas the observed Y-STR halogroups that were mainly prevalent in East Asia populations accounted for only a small fraction of a total of the obtained Y-STR haplogroups. In population genetic relationship analyses, the Kyrgyz group was genetically closer to East Asian populations than European populations based on the mtDNA marker system. However, the analyses using the Y chromosome marker system provided different results, indicating the potential differences between the maternal and paternal inheritances. In future study, more genotyping data of different genetic markers from the Kyrgyz group are needed to further reveal the genetic background of the Chinese Kyrgyz group and its genetic relationships with other populations.

Supplementary Materials

The following are available online at, Table S1: The allele frequencies of 60 mtDNA loci in Kyrgyz group (n = 138), Table S2: Haplogroup frequencies and counts for 59 mtDNA loci except (CA)n in 138 Kyrgyz individuals, Table S3: The allele frequencies and forensic parameters of 24 Y-STRs in Kyrgyz group (n = 241), Table S4: Haplogroup frequencies and counts for 23 Y-STRs except DYS460 in 241 Kyrgyz males, Table S5: Contingency table for independence tests between 24 Y-STRs and 58 mtDNA SNPs (n = 88).

Author Contributions

Conceptualization, B.Z. and Y.G.; methodology, B.Z. and Y.G.; writing—original draft preparation, Y.G.; writing—review and editing, Z.X. and X.J.; visualization, W.C. and C.C.; funding acquisition, B.Z.; All authors have read and agreed to the published version of the manuscript.


This research was funded by the National Natural Science Foundation of China (NSFC, No. 81525015, 81930055).


Thanks for Bruce Weir (University of Washington) for kindly providing the method to calculate relationship between two systems.

Conflicts of Interest

The authors declare no conflict of interest.


  1. Minahan, J. Ethnic Groups of North, East, and Central Asia: An Encyclopedia; Ethnic Groups of the World; ABC-CLIO: Santa Barbara, CA, USA, 2014. [Google Scholar]
  2. Jin, X.; Cui, W.; Wei, Y.; Mu, Y.; Dong, Q.; Chen, C.; Guo, Y.; Kong, T.; Zhu, B.; Chen, J. Genetic Characteristics of 19 Strs in Chinese Uzbek Ethnic and Its Phylogenetic Relationships with Other 24 Populations. Int. J. Leg. Med. 2018, 132, 729–731. [Google Scholar] [CrossRef] [PubMed]
  3. Guo, Y.; Chen, C.; Jin, X.; Cui, W.; Wei, Y.; Wang, H.; Kong, T.; Mu, Y.; Zhu, B. Autosomal Dips for Population Genetic Structure and Differentiation Analyses of Chinese Xinjiang Kyrgyz Ethnic Group. Sci. Rep. 2018, 8, 11054. [Google Scholar] [CrossRef] [PubMed]
  4. Guo, Y.; Chen, C.; Xie, T.; Cui, W.; Meng, H.; Jin, X.; Zhu, B. Forensic Efficiency Estimate and Phylogenetic Analysis for Chinese Kyrgyz Ethnic Group Revealed by a Panel of 21 Short Tandem Repeats. R. Soc. Open Sci. 2018, 5, 172089. [Google Scholar] [CrossRef] [PubMed]
  5. He, G.L.; Wang, Z.; Wang, M.G.; Luo, T.; Liu, J.; Zhou, Y.; Gao, B.; Hou, Y.P. Forensic Ancestry Analysis in Two Chinese Minority Populations Using Massively Parallel Sequencing of 165 Ancestry-Informative Snps. Electrophoresis 2018, 39, 2732–2742. [Google Scholar] [CrossRef] [PubMed]
  6. Lan, Q.; Shen, C.; Jin, X.; Guo, Y.; Xie, T.; Chen, C.; Cui, W.; Fang, Y.; Yang, G.; Zhu, B. Distinguishing Three Distinct Biogeographic Regions with an in-House Developed 39-Aim-Indel Panel and Further Admixture Proportion Estimation for Uyghurs. Electrophoresis 2019, 40, 1525–1534. [Google Scholar] [CrossRef] [PubMed]
  7. Barbieri, C.; Hubner, A.; Macholdt, E.; Ni, S.; Lippold, S.; Schroder, R.; Mpoloka, S.W.; Purps, J.; Roewer, L.; Stoneking, M.; et al. Refining the Y Chromosome Phylogeny with Southern African Sequences. Hum. Genet. 2016, 135, 541–553. [Google Scholar] [CrossRef]
  8. van Oven, M.; Kayser, M. Updated Comprehensive Phylogenetic Tree of Global Human Mitochondrial DNA Variation. Hum. Mutat. 2009, 30, E386–E394. [Google Scholar] [CrossRef]
  9. King, T.E.; Fortes, G.G.; Balaresque, P.; Thomas, M.G.; Balding, D.; Maisano Delser, P.; Neumann, R.; Parson, W.; Knapp, M.; Walsh, S.; et al. Identification of the Remains of King Richard Iii. Nat. Commun. 2014, 5, 5631. [Google Scholar] [CrossRef]
  10. Kutanan, W.; Kampuansai, J.; Srikummool, M.; Brunelli, A.; Ghirotto, S.; Arias, L.; Macholdt, E.; Hubner, A.; Schroder, R.; Stoneking, M. Contrasting Paternal and Maternal Genetic Histories of Thai and Lao Populations. Mol. Biol. Evol. 2019, 36, 1490–1506. [Google Scholar] [CrossRef]
  11. Stoljarova, M.; King, J.L.; Takahashi, M.; Aaspollu, A.; Budowle, B. Whole Mitochondrial Genome Genetic Diversity in an Estonian Population Sample. Int. J. Leg. Med. 2016, 130, 67–71. [Google Scholar] [CrossRef]
  12. de Boer, H.H.; Blau, S.; Delabarde, T.; Hackman, L. The Role of Forensic Anthropology in Disaster Victim Identification (Dvi): Recent Developments and Future Prospects. Forensic Sci. Res. 2019, 4, 303–315. [Google Scholar] [CrossRef] [PubMed]
  13. Kayser, M. Forensic Use of Y-Chromosome DNA: A General Overview. Hum. Genet. 2017, 136, 621–635. [Google Scholar] [CrossRef] [PubMed]
  14. Purps, J.; Geppert, M.; Nagy, M.; Roewer, L. Validation of a Combined Autosomal/Y-Chromosomal Str Approach for Analyzing Typical Biological Stains in Sexual-Assault Cases. Forensic Sci. Int. Genet. 2015, 19, 238–242. [Google Scholar] [CrossRef] [PubMed]
  15. Weir, B.S. Inferences About Linkage Disequilibrium. Biometrics 1979, 35, 235–254. [Google Scholar] [CrossRef]
  16. Xie, T.; Hu, L.; Guo, Y.X.; Li, Y.C.; Chen, F.; Zhu, B.F. Genetic Polymorphism Analysis of Mitochondrial DNA from Chinese Xinjiang Kazak Ethnic Group by a Novel Mitochondrial DNA Genotyping Panel. Mol. Biol. Rep. 2019, 46, 17–25. [Google Scholar] [CrossRef]
  17. Liu, Y.; Guo, Y.; Jin, X.; Mei, S.; Xie, T.; Lan, Q.; Fang, Y.; Zhu, B. Developmental Validation Study of a 24-Plex Y-Str Direct Amplification System for Forensic Application. Int. J. Leg. Med. 2020, 134, 945–951. [Google Scholar] [CrossRef]
  18. Librado, P.; Rozas, J. Dnasp V5: A Software for Comprehensive Analysis of DNA Polymorphism Data. Bioinformatics 2009, 25, 1451–1452. [Google Scholar] [CrossRef]
  19. Nei, M. Molecular Evolutionary Genetics; Columbia University Press: New York, NY, USA, 1987. [Google Scholar]
  20. Andrews, R.M.; Kubacka, I.; Chinnery, P.F.; Lightowlers, R.N.; Turnbull, D.M.; Howell, N. Reanalysis and Revision of the Cambridge Reference Sequence for Human Mitochondrial DNA. Nat. Genet. 1999, 23, 147. [Google Scholar] [CrossRef] [PubMed]
  21. Excoffier, L.; Laval, G.; Schneider, S. Arlequin (Version 3.0): An Integrated Software Package for Population Genetics Data Analysis. Evol. Bioinform. Online 2007, 1, 47–50. [Google Scholar] [CrossRef] [PubMed]
  22. Excoffier, L.; Smouse, P.E.; Quattro, J.M. Analysis of Molecular Variance Inferred from Metric Distances among DNA Haplotypes-Application to Human Mitochondrial-DNA Restriction Data. Genetics 1992, 131, 479–491. [Google Scholar] [PubMed]
  23. Fisher, R.A. Statistical Methods for Research Workers, 14th ed.; Oliver and Boyd: Edinburgh, Scotland, 1970. [Google Scholar]
  24. Underhill, P.A.; Kivisild, T. Use of Y Chromosome and Mitochondrial DNA Population Structure in Tracing Human Migrations. Annu. Rev. Genet. 2007, 41, 539–564. [Google Scholar] [CrossRef] [PubMed]
  25. Haak, W.; Lazaridis, I.; Patterson, N.; Rohland, N.; Mallick, S.; Llamas, B.; Brandt, G.; Nordenfelt, S.; Harney, E.; Stewardson, K.; et al. Massive Migration from the Steppe Was a Source for Indo-European Languages in Europe. Nature 2015, 522, 207–211. [Google Scholar] [CrossRef] [PubMed]
  26. Allentoft, M.E.; Sikora, M.; Sjogren, K.G.; Rasmussen, S.; Rasmussen, M.; Stenderup, J.; Damgaard, P.B.; Schroeder, H.; Ahlstrom, T.; Vinner, L.; et al. Population Genomics of Bronze Age Eurasia. Nature 2015, 522, 167–172. [Google Scholar] [CrossRef] [PubMed]
  27. Underhill, P.A.; Poznik, G.D.; Rootsi, S.; Jarve, M.; Lin, A.A.; Wang, J.; Passarelli, B.; Kanbar, J.; Myres, N.M.; King, R.J.; et al. The Phylogenetic and Geographic Structure of Y-Chromosome Haplogroup R1a. Eur. J. Hum. Genet. 2015, 23, 124–131. [Google Scholar] [CrossRef] [PubMed]
  28. Myres, N.M.; Rootsi, S.; Lin, A.A.; Jarve, M.; King, R.J.; Kutuev, I.; Cabrera, V.M.; Khusnutdinova, E.K.; Pshenichnov, A.; Yunusbayev, B.; et al. A Major Y-Chromosome Haplogroup R1b Holocene Era Founder Effect in Central and Western Europe. Eur. J. Hum. Genet. 2011, 19, 95–101. [Google Scholar] [CrossRef]
  29. Zheng, H.X.; Yan, S.; Qin, Z.D.; Wang, Y.; Tan, J.Z.; Li, H.; Jin, L. Major Population Expansion of East Asians Began before Neolithic Time: Evidence of Mtdna Genomes. PLoS ONE 2011, 6, e25835. [Google Scholar] [CrossRef]
  30. Shen, C.M.; Hu, L.; Yang, C.H.; Yin, C.Y.; Li, Z.D.; Meng, H.T.; Guo, Y.X.; Mei, T.; Chen, F.; Zhu, B.F. Genetic Polymorphisms of 54 Mitochondrial DNA Snp Loci in Chinese Xibe Ethnic Minority Group. Sci. Rep. 2017, 7, 44407. [Google Scholar] [CrossRef]
  31. King, J.L.; Larue, B.L.; Novroski, N.M.; Stoljarova, M.; Seo, S.B.; Zeng, X.P.; Warshauer, D.H.; Davis, C.P.; Parson, W.; Sajantila, A.; et al. High-Quality and High-Throughput Massively Parallel Sequencing of the Human Mitochondrial Genome Using the Illumina Miseq. Forensic Sci. Int. Genet. 2014, 12, 128–135. [Google Scholar] [CrossRef]
  32. Derenko, M.; Malyarchuk, B.; Bahmanimehr, A.; Denisova, G.; Perkova, M.; Farjadian, S.; Yepiskoposyan, L. Complete Mitochondrial DNA Diversity in Iranians. PLoS ONE 2013, 8, e80673. [Google Scholar] [CrossRef]
  33. Kwak, K.D.; Jin, H.J.; Shin, D.J.; Kim, J.M.; Roewer, L.; Krawczak, M.; Tyler-Smith, C.; Kim, W. Y-Chromosomal Str Haplotypes and Their Applications to Forensic and Population Studies in East Asia. Int. J. Leg. Med. 2005, 119, 195–201. [Google Scholar] [CrossRef]
  34. Nothnagel, M.; Fan, G.Y.; Guo, F.; He, Y.F.; Hou, Y.P.; Hu, S.P.; Huang, J.; Jiang, X.H.; Kim, W.; Kim, K.; et al. Revisiting the Male Genetic Landscape of China: A Multi-Center Study of Almost 38,000 Y-Str Haplotypes. Hum. Genet. 2017, 136, 485–497. [Google Scholar] [CrossRef] [PubMed]
  35. Lang, M.; Liu, H.; Song, F.; Qiao, X.; Ye, Y.; Ren, H.; Li, J.; Huang, J.; Xie, M.; Chen, S.; et al. Forensic Characteristics and Genetic Analysis of Both 27 Y-Strs and 143 Y-Snps in Eastern Han Chinese Population. Sci. Int. Genet. 2019, 42, e13–e20. [Google Scholar] [CrossRef] [PubMed]
  36. Wang, Y.; Zhang, Y.J.; Zhang, C.C.; Li, R.; Yang, Y.; Ou, X.L.; Tong, D.Y.; Sun, H.Y. Genetic Polymorphisms and Mutation Rates of 27 Y-Chromosomal Strs in a Han Population from Guangdong Province, Southern China. Forensic Sci. Int. Genet. 2016, 21, 5–9. [Google Scholar] [CrossRef]
  37. Ou, X.; Wang, Y.; Liu, C.; Yang, D.; Zhang, C.; Deng, S.; Sun, H. Haplotype Analysis of the Polymorphic 40 Y-Str Markers in Chinese Populations. Forensic Sci. Int. Genet. 2015, 19, 255–262. [Google Scholar] [CrossRef] [PubMed]
  38. Coble, M.D.; Hill, C.R.; Butler, J.M. Haplotype Data for 23 Y-Chromosome Markers in Four U.S. Population Groups. Forensic Sci. Int. Genet. 2013, 7, e66–e68. [Google Scholar] [CrossRef] [PubMed]
  39. Watahiki, H.; Fujii, K.; Fukagawa, T.; Mita, Y.; Kitayama, T.; Mizuno, N. Polymorphisms and Microvariant Sequences in the Japanese Population for 25 Y-Str Markers and Their Relationships to Y-Chromosome Haplogroups. Forensic Sci. Int. Genet. 2019, 41, E1–E7. [Google Scholar] [CrossRef]
  40. Uchihi, R.; Yamamoto, T.; Usuda, K.; Yoshimoto, T.; Tanaka, M.; Tokunaga, S.; Kurihara, R.; Tokunaga, K.; Katsumata, Y. Haplotype Analysis with 14 Y-Str Loci Using 2 Multiplex Amplification and Typing Systems in 2 Regional Populations in Japan. Int. J. Leg. Med. 2003, 117, 34–38. [Google Scholar] [CrossRef]
  41. Lessig, R.; Willuweit, S.; Krawczak, M.; Wu, F.C.; Pu, C.E.; Kim, W.; Henke, L.; Henke, J.; Miranda, J.; Hidding, M.; et al. Asian Online Y-Str Haplotype Reference Database. Leg Med. 2003, 5 (Suppl. 1), S160–S163. [Google Scholar] [CrossRef]
  42. Mizuno, N.; Nakahara, H.; Sekiguchi, K.; Yoshida, K.; Nakano, M.; Kasai, K. 16 Y Chromosomal Str Haplotypes in Japanese. Forensic Sci. Int. 2008, 174, 71–76. [Google Scholar] [CrossRef]
  43. Bian, Y.N.; Zhang, S.H.; Zhou, W.; Zhao, Q.; Siqintuya; Zhu, R.X.; Wang, Z.; Gao, Y.Z.; Hong, J.; Lu, D.R.; et al. Analysis of Genetic Admixture in Uyghur Using the 26y-Str Loci System. Sci. Rep. 2016, 6, 1–9. [Google Scholar] [CrossRef]
  44. Shan, W.J.; Ablimit, A.; Zhou, W.J.; Zhang, F.C.; Ma, Z.H.; Zheng, X.F. Genetic Polymorphism of 17 Y Chromosomal Strs in Kazakh and Uighur Populations from Xinjiang, China. Int. J. Leg. Med. 2014, 128, 743–744. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, D.; Cao, G.; Xie, M.; Cui, X.; Xiao, L.; Tian, C.; Ye, Y. Y Chromosomal Str Haplotypes in Chinese Uyghur, Kazakh and Hui Ethnic Groups and Genetic Features of Dys448 Null Allele and Dys19 Duplicated Allele. Int. J. Leg. Med. 2019. [Google Scholar] [CrossRef] [PubMed]
  46. Lessig, R.; Edelmann, J. Population Data of Y-Chromosomal Strs in Lithuanian, Latvian and Estonian Males. Forensic Sci. Int. 2001, 120, 223–225. [Google Scholar] [CrossRef]
  47. Robino, C.; Ralf, A.; Pasino, S.; De Marchi, M.R.; Ballantyne, K.N.; Barbaro, A.; Bini, C.; Carnevali, E.; Casarino, L.; Di Gaetano, C.; et al. Development of an Italian Rm Y-Str Haplotype Database: Results of the 2013 Gefi Collaborative Exercise. Forensic Sci. Int. Genet. 2015, 15, 56–63. [Google Scholar] [CrossRef] [PubMed]
  48. Rapone, C.; D’Atanasio, E.; Agostino, A.; Mariano, M.; Papaluca, M.T.; Cruciani, F.; Berti, A. Forensic Genetic Value of a 27 Y-Str Loci Multiplex (Yfiler((R)) Plus Kit) in an Italian Population Sample. Forensic Sci. Int. Genet. 2016, 21, e1–e5. [Google Scholar] [CrossRef] [PubMed]
  49. Lacerenza, D.; Aneli, S.; Di Gaetano, C.; Critelli, R.; Piazza, A.; Matullo, G.; Culigioni, C.; Robledo, R.; Robino, C.; Calo, C. Investigation of Extended Y Chromosome Str Haplotypes in Sardinia. Forensic Sci. Int. Genet. 2017, 27, 172–174. [Google Scholar] [CrossRef]
  50. Sarno, S.; Tofanelli, S.; De Fanti, S.; Quagliariello, A.; Bortolini, E.; Ferri, G.; Anagnostou, P.; Brisighelli, F.; Capelli, C.; Tagarelli, G.; et al. Shared Language, Diverging Genetic Histories: High-Resolution Analysis of Y-Chromosome Variability in Calabrian and Sicilian Arbereshe. Eur. J. Hum. Genet. 2016, 24, 600–606. [Google Scholar] [CrossRef]
  51. Brisighelli, F.; Blanco-Verea, A.; Boschi, I.; Garagnani, P.; Pascali, V.L.; Carracedo, A.; Capelli, C.; Salas, A. Patterns of Y-Str Variation in Italy. Forensic Sci. Int. Genet. 2012, 6, 834–839. [Google Scholar] [CrossRef]
  52. Piglionica, M.; Baldassarra, S.L.; Giardina, E.; Stella, A.; D’Ovidio, F.D.; Frati, P.; Lenato, G.M.; Resta, N.; Dell’Erba, A. Population Data for 17 Y-Chromosome Strs in a Sample from Apulia (Southern Italy). Forensic Sci. Int. Genet. 2013, 7, e3–e4. [Google Scholar] [CrossRef]
  53. Dupuy, B.M.; Andreassen, R.; Flones, A.G.; Tomassen, K.; Egeland, T.; Brion, M.; Carracedo, A.; Olaisen, B. Y-Chromosome Variation in a Norwegian Population Sample. Forensic Sci. Int. 2001, 117, 163–173. [Google Scholar] [CrossRef]
  54. Dupuy, B.M.; Stenersen, M.; Lu, T.T.; Olaisen, B. Geographical Heterogeneity of Y-Chromosomal Lineages in Norway. Forensic Sci. Int. 2006, 164, 10–19. [Google Scholar] [CrossRef] [PubMed]
  55. Nasidze, I.; Quinque, D.; Ozturk, M.; Bendukidze, N.; Stoneking, M. Mtdna and Y-Chromosome Variation in Kurdish Groups. Ann. Hum. Genet. 2005, 69, 401–412. [Google Scholar] [CrossRef] [PubMed]
  56. Taylor, W.; Shnaider, S.; Abdykanova, A.; Fages, A.; Welker, F.; Irmer, F.; Seguin-Orlando, A.; Khan, N.; Douka, K.; Kolobova, K.; et al. Early Pastoral Economies along the Ancient Silk Road: Biomolecular Evidence from the Alay Valley, Kyrgyzstan. PLoS ONE 2018, 13, e0205646. [Google Scholar] [CrossRef] [PubMed]
  57. Kassymbekova, B. China’s Borderlands: The Faultline of Central Asia. Lab. Russ. Rev. Soc. Res. 2017, 9, 157–159. [Google Scholar]
  58. Du, R.Q.; An, W.E. Kyrgyz Ethnic Group; The Ethnic Publishing House: Beijing, China, 1991. [Google Scholar]
  59. He, J.H.; Zhang, G.H. Ethnography of Kizilsukirz Autonomous Prefecture; Xinjiang Kizilsu Kyrgyz Press: Xinjiang, China, 1992. [Google Scholar]
  60. Prior, D. The Sabdan Baatir Codex: Epic and the Writing of Northern Kirghiz History; Brill Press: Leiden, The Netherlands, 2013. [Google Scholar]
  61. Zerjal, T.; Wells, R.S.; Yuldasheva, N.; Ruzibakiev, R.; Tyler-Smith, C. A Genetic Landscape Reshaped by Recent Events: Y-Chromosomal Insights into Central Asia. Am. J. Hum. Genet. 2002, 71, 466–482. [Google Scholar] [CrossRef]
  62. Forde, C.D. Habitat, Economy and Society; a Geographical Introduction to Ethnology; Methuen & co. ltd.: London, UK, 1934. [Google Scholar]
  63. Chen, C.; Cyrnus, A. Kyrgyz Ethnic Group; Xinjiang Art Photography Press: Xinjiang, China, 1996. [Google Scholar]
Figure 1. The pie charts for the haplogroup frequencies of the Kyrgyz group, (a) based on 59 mtDNA loci in 138 individuals, haplogroup M in this plot excluded haplogroups C, D, G and Z; haplogroup N in this plot excluded haplogroups A, B, J, H, HV, U and W; (b) based on 23 Y-STRs in 241 male individuals.
Figure 1. The pie charts for the haplogroup frequencies of the Kyrgyz group, (a) based on 59 mtDNA loci in 138 individuals, haplogroup M in this plot excluded haplogroups C, D, G and Z; haplogroup N in this plot excluded haplogroups A, B, J, H, HV, U and W; (b) based on 23 Y-STRs in 241 male individuals.
Genes 11 00564 g001
Figure 2. The two heatmaps for genetic distances among the Kyrgyz group and the reference populations, (a) pairwise Fst values based on 58 mtDNA SNPs among 12 populations; (b) pairwise Rst values based on 23 Y-STRs among 14 populations.
Figure 2. The two heatmaps for genetic distances among the Kyrgyz group and the reference populations, (a) pairwise Fst values based on 58 mtDNA SNPs among 12 populations; (b) pairwise Rst values based on 23 Y-STRs among 14 populations.
Genes 11 00564 g002
Figure 3. The multidimensional scaling analysis (MDS) plots for Kyrgyz group and reference populations, (a) based on the same of 58 mtDNA SNPs; (b) based on the same of 23 Y-STRs.
Figure 3. The multidimensional scaling analysis (MDS) plots for Kyrgyz group and reference populations, (a) based on the same of 58 mtDNA SNPs; (b) based on the same of 23 Y-STRs.
Genes 11 00564 g003
Figure 4. The neighbor-joining (NJ) trees for the Kyrgyz group and reference populations, (a) based on the same of 58 mtDNA SNPs; (b) based on the same of 23 Y-STRs.
Figure 4. The neighbor-joining (NJ) trees for the Kyrgyz group and reference populations, (a) based on the same of 58 mtDNA SNPs; (b) based on the same of 23 Y-STRs.
Genes 11 00564 g004
Back to TopTop