Next Article in Journal
Comparative Transcriptome Analysis Reveals Key Genes and Pathways Involved in Prickle Development in Eggplant
Previous Article in Journal
Angiotensinogen Gene Missense Polymorphisms (rs699 and rs4762): The Association of End-Stage Renal Failure Risk with Type 2 Diabetes and Hypertension in Egyptians
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Genetic Diversity and Population Structure Analysis of Triticum aestivum L. Landrace Panel from Afghanistan

1
Department of Field Crops, Ege University, Bornova, Izmir 35100, Turkey
2
International Center for Agricultural Research in the Dry Areas (ICARDA), ICARDA-PreBreeding & Genebank Operations, Rabat 10000, Morocco
3
International Maize and Wheat Improvement Center (CIMMYT), Carretera México-Veracruz Km. 45, El Batán, Texcoco C.P. 56237, Mexico
4
IRTA (Institute for Food and Agricultural Research and Technology), 25198 Lleida, Spain
5
International Center for Agricultural Research in the Dry Areas (ICARDA), Turkey-ICARDA Regional Cereal Rust Research Center (RCRRC), Menemen, Izmir 35661, Turkey
6
Department of Plant Breeding and Genetics, University of Agriculture Faisalabad, Faisalabad 38000, Pakistan
*
Author to whom correspondence should be addressed.
Genes 2021, 12(3), 340; https://doi.org/10.3390/genes12030340
Submission received: 25 January 2021 / Revised: 18 February 2021 / Accepted: 20 February 2021 / Published: 25 February 2021
(This article belongs to the Section Plant Genetics and Genomics)

Abstract

:
Landraces are a potential source of genetic diversity and provide useful genetic resources to cope with the current and future challenges in crop breeding. Afghanistan is located close to the centre of origin of hexaploid wheat. Therefore, understanding the population structure and genetic diversity of Afghan wheat landraces is of enormous importance in breeding programmes for the development of high-yielding cultivars as well as broadening the genetic base of bread wheat. Here, a panel of 363 bread wheat landraces collected from seven north and north-eastern provinces of Afghanistan were evaluated for population structure and genetic diversity using single nucleotide polymorphic markers (SNPs). The genotyping-by-sequencing of studied landraces after quality control provided 4897 high-quality SNPs distributed across the genomes A (33.75%), B (38.73%), and D (27.50%). The population structure analysis was carried out by two methods using model-based STRUCTURE analysis and cluster-based discriminant analysis of principal components (DAPC). The analysis of molecular variance showed a higher proportion of variation within the sub-populations compared with the variation observed as a whole between sub-populations. STRUCTURE and DAPC analysis grouped the majority of the landraces from Badakhshan and Takhar together in one cluster and the landraces from Baghlan and Kunduz in a second cluster, which is in accordance with the micro-climatic conditions prevalent within the north-eastern agro-ecological zone. Genetic distance analysis was also studied to identify differences among the Afghan regions; the strongest correlation was observed for the Badakhshan and Takhar (0.003), whereas Samangan and Konarha (0.399) showed the highest genetic distance. The population structure and genetic diversity analysis highlighted the complex genetic variation present in the landraces which were highly correlated to the geographic origin and micro-climatic conditions within the agro-climatic zones of the landraces. The higher proportions of admixture could be attributed to historical unsupervised exchanges of seeds between the farmers of the central and north-eastern provinces of Afghanistan. The results of this study will provide useful information for genetic improvement in wheat and is essential for association mapping and genomic prediction studies to identify novel sources for resistance to abiotic and biotic stresses.

1. Introduction

Wheat (Triticum aestivum L.) is the primary source of calorie intake for more than one third of the global population and is ranked as the third most important crop in the world [1,2]. The common hexaploid bread wheat was domesticated around 8000 years ago in the Fertile Crescent [3]. There has been an overall increase in wheat production worldwide over the years; however, countries like Afghanistan have seen steady decline in wheat production, which is alarming in regions already struggling economically and politically for stability and to meet food demand [2]. Today, wheat plays a critical role in the fight against food security; however, in order to achieve high production goals, most of the breeding programmes have started utilizing naturally existing diversity deemed fit for a specific agro-climatic region or are simply relying on germplasm developed by private and international research organizations. This has led to narrowing down the genetic diversity from a broader gene pool. Such narrow genetic diversity poses a serious threat to sustainable production, which is crucial for the fast-growing global population and dramatically changing climatic conditions [4]. Wheat landraces are considered the only primary gene pool for bread wheat and for hexaploid bread wheat. The importance is more significant because of its hybrid origin and it does not share a direct link to an ancestor with similar genetic constituents [5]. Therefore, to widen the already available gene pool, landraces have been used in wheat breeding programmes and have achieved success in introducing disease resistance, quality, and other useful agronomic traits into elite lines and cultivars [5,6,7].
Landraces are defined as locally adapted distinct species with higher levels of tolerance to both abiotic and biotic stresses, resulting in moderate to high yield with low to zero input agricultural conditions [8,9]. Landraces are considered to have evolved as a combination of both natural and artificial selection over time and they adapt not only to the crops’ centres of origin but also to new geographic regions following introduction and transplantation. Landraces play an important role in modern plant breeding by transferring various key traits to cultivars and have been used in breeding programmes of several crops including maize, legumes, rice, and wheat [10,11,12,13,14,15].
Afghanistan lies close to the centre of origin of hexaploid wheat and enjoys rich topography, rich history, and a complex tribal system. It is considered the third largest centre of origin for many domesticated crops worldwide and has played crucial role in the domestication of many crops such as barley (Hordeum vulgare), peas (Pisum sativum), rye (Secale cereal), and chickpeas (Cicer arietinum) [16]. During the past years, several factors, including frequent armed conflicts, have resulted in the loss of all of the known germplasm collection [2]; in addition, only a few studies with small to moderate populations have investigated the population structure and genetic diversity of the Afghan wheat landraces [2,5,17,18,19,20]. The Afghan bread wheat population used in the present study is conserved and maintained in the gene bank of the International Centre of Agricultural Research in Dry Areas (ICARDA) and is composed of 363 accessions collected from seven north and north-eastern provinces of Afghanistan. These Afghan wheat accessions are considered as landraces based on their collection time, location, and information. However, so far, no genetic assay has been applied before to confirm the genetic basis of these wheat accessions.
The recent advances in the development of molecular markers and high-throughput systems have generated a wealth of information for a large number of plants [21,22,23]. Single nucleotide polymorphisms (SNPs) are the most common variations in the genome sequence [2,24], making them well suited for studies like genome wide association studies (GWAS) [25], genomic selection [26], population structure, and genetic diversity [22], requiring a large number of markers. Thus, application of SNPs markers enables rapid genotyping with low error rates, as they occur on the genome with high frequency [24]. The objective of the present study was to examine a large collection of Afghan bread wheat landraces to unravel the population structure and genetic diversity among the landraces and the sub-populations. Understanding the genetic diversity and population structure of these landraces will aid in the utilization of these bread wheat landraces in future breeding programmes in order to enhance the genetic base of hexaploid wheat and identify new genomic regions/sources of resistance and economically useful traits, and finally to deploy those regions in elite wheat breeding programmes.

2. Results

2.1. SNP Marker Distribution

A total of 363 bread wheat landraces collected from seven provinces of Afghanistan were genotyped using Diversity Arrays technology (DArT). One of the landraces from Kabul did not correspond to the recorded geographic location and therefore has been either recorded incorrectly or is a mixture (Figure 1). A set of 23,079 SNPs was generated and, after quality control and SNP filtering, a set of 4897 high-quality SNPs with known chromosomal positions on the 21 chromosomes of wheat were used in the final analysis. The highest number of markers were mapped on the B genome (1897), followed by the A genome (1653) and the D genome (1347). The average density of markers across the whole genome was 1.51 markers per kb (Table 1; Figure 2). The highest number of markers was found on chromosome 2B (378), followed by chromosome 2A (325) and chromosomes 3B and 5B (308). The lowest number of markers was found on chromosome 4B (115). The lowest marker density was observed on chromosome 4D (3.44 kb) among all the chromosomes, whereas the highest marker density was observed on chromosome 2D (0.84 kb). Average marker densities on the A, B, and D genomes were 1.54 kb, 1.53 kb, and 1.55 kb, respectively.

2.2. Population Stratification and Genetic Relationships

The optimal population stratification was measured using two approaches. The first approach was model-based as implemented in the STRUCTURE program. The optimum number of sub-populations was inferred based on the rate of change of likelihood (ΔK). The results indicated that the population structure was optimum at K = 2. In the plot of K against ΔK, there was a steep slope at K = 2 following the flattening of the curve (Figure 3.) Population structure showed higher proportions of admixture. However, the landraces with close geographic proximities like Badakhshan and Takhar were clustered together, whereas Baghlan and Kunduz showed more similarity and were grouped together. In the two sub-populations identified by the STRUCTURE algorithm, there was no clear differentiation amongst the landraces. The accessions from Kabul, Konarha, and Samangan were clustered in Sub-population 2, whereas the landraces from Badakhshan, Baghlan, Kunduz, and Takhar were distributed in both sub-populations (Table S2). Among the two sub-populations, 238 landraces were grouped in Sub-population 1, while 125 landraces were grouped in Sub-population 2. The fixation index (Fst) estimated by the STRUCTURE results suggested significant divergence within the two sub-populations (Table 2). An Fst value of 0.51 and 0.14 was obtained for Sub-population 1 and Sub-population 2, respectively.
The second approach was based on discriminant analysis of the principal components (DAPC) and the results showed at least three sub-populations (Figure 4). Hence, the landraces were grouped into three sub-populations with 175, 117, and 71 landraces in Sub-populations 1, 2, and 3 respectively. Sub-population 1 comprised landraces from Baghlan, Takhar, Kunduz, and Badakhshan. Landraces from Kabul, Konarha, and Samangan were grouped together in Sub-population 2, and Sub-population 3 contained 71 landraces; the majority of them were from Takhar (25) and Badakhshan (23). The DAPC analysis clustered the landraces from Badakhshan and Takhar together, those from Baghlan and Kunduz together, and those from Kabul and Konarha together (Figure 4).
The results of principal component analysis (PCA) is in accordance with the DAPC analysis and showed three clusters. The first, second, and third PCs explained 18.76%, 10.69%, and 4.34% of the total variation, respectively (Figure 5).
It is to be noted that both the STRUCTURE and DAPC analysis clustered the population with high levels of admixture, which was evident from the neighbour-joining based clustering and minimum spanning network (MSN) (Figure 6 and Figure 7). The neighbour-joining-based clustering and MSN showed the landraces clustered with high admixture and the clustering of the landraces tend to be based on their geographic origin, as the landraces from Badakhshan, Takhar, Baghlan, and Kunduz were mostly grouped together.

2.3. Genetic Differentiation among Sub-Populations

The analysis of molecular variance (AMOVA) based on the STRUCTURE results revealed that there was very small genetic variation (0.29%) between the two sub-populations. However, greater genetic variance was found within the two sub-populations (99.71%) (Table 2). The same results were identified with clustering based on DAPC and geographic origin, with genetic variance of 2.4% and 4.4%, respectively, between the sub-populations derived from DAPC and by geographic background. These results suggest that the landraces from Afghanistan shared a common ancestry and are highly admixed, with variation within the sub-populations from STRUCTURE, DAPC, and origin-based analysis were 99.71%, 97.53%, and 95.58%, respectively (Table 3). In order to explore the levels of diversity among the sub-populations, genetic distances among the sub-populations were also calculated (Table 4). The difference among the two sub-populations clustered by STRUCTURE reflected a genetic distance of 0.004, whereas a maximum genetic distance of 0.023 in the sub-populations from DAPC analysis was found between sub-population 1 and sub-population 2. The clustering with respect to the geographic origins of the landraces showed the maximum genetic distance of 0.399 between the landraces from Konarha and Samangan, followed by a genetic distance of 0.308 between landraces from Kabul and Konarha. The minimum genetic distance of 0.003 was estimated between the landraces from Badakhshan and Takhar.

2.4. Genetic Diversity across Sub-Populations

The mean values of different alleles (Na) and the number of effective alleles (Ne) of the two sub-populations identified by STRUCTURE were 1.999 and 1.591, respectively (Table 5), and the mean values for overall population for Shannon’s Index (I), the diversity index (He), and the unbiased diversity index (uHe) were 0.516, 0.345, and 0.346, respectively. Between the two sub-populations grouped by STRUCTURE, Sub-population 1 (I = 0.518, He = 0.346, and uHe = 0.348) showed higher diversity than Sub-population 2 (I = 0.514, He = 0.343, and uHe = 0.345). Sub-population 2 clustered by the DAPC approach showed higher diversity (I = 0.517, He = 0.345, and uHe = 0.347) compared with Sub-population 1 and Sub-population 3. The overall mean values of the genetic indices for the populations derived from the DAPC approach were Na = 1.994, Ne = 1.579, I = 0.506, He = 0.338, and uHe = 0.341. When the landraces were grouped based on their geographic origin, the landraces from Badakhshan province showed the highest genetic diversity (I = 0.509, He = 0.340, and uHe = 0.343), whereas the landraces from Kabul showed the least genetic diversity (I = 0.304, He = 0.0236, and uHe = 0.231). Since there was only one landrace accession from Konarha and Samangan, their genetic indices were not calculated and hence a comparison with other provinces was not possible.

2.5. Clustering via Geographic Origin

The two methods used in the study to assess the population structure clustered the landraces with close proximity to each other together, but with a high proportion of admixture. For example, in the DAPC method, 72 landraces from Badakshan province were grouped in Sub-population 1, while 24 and 23 landraces of the same origin were grouped in Sub-population 2 and Sub-population 3, respectively. Similarly, 40 landraces from Baghlan province were clustered in Sub-population 2, while the rest divided into Sub-population 1 (12) and Sub-population (3). The majority of the landraces from Takhar province were grouped into Sub-population 1 (77), the rest were grouped into Sub-population 3 (25) and Sub-population 2 (17). Six out of seven landraces collected from Kabul province were grouped into Sub-population 2, along with the two landraces from Konarha and Samangan province, whereas one landrace from Kabul province was grouped into Sub-population 3. A similar trend was observed when the whole panel was divided into two sub-populations using the STRUCTURE method, where the landraces from Badakhshan, Takhar, Kunduz, and Baghlan were assigned to both the sub-populations. However, the landraces from Kabul, Konarha, and Samangan were all grouped in Sub-population 2. In the overall trend, it was observed that the majority of the landraces from the neighbouring states were slightly favoured to group together than with the landraces from the provinces which are far away (Figure S1). However, based on the bar plot of the membership coefficient of the 363 landraces obtained from DAPC analysis, it was observed that the landraces from the north and north-eastern states of Afghanistan are highly admixed (Figure 8).

3. Discussion

The potential use of wheat landraces for both abiotic and biotic stress resistance has been given significant importance in many breeding programmes [27,28,29,30,31]. Genetic diversity analysis is a stepping stone towards the development of high-yielding germplasm in order to meet the food security requirements of the world’s increasing population [32]. The current study was planned to provide valuable information on the population structure and genetic diversity of 363 bread wheat landraces collected from seven provinces of Afghanistan, which could potentially be used to broaden the genetic base of Afghan wheat germplasm. It will also subsequently help in genome-wide association study (GWAS) in order to identify novel genomic regions associated with resistance against abiotic and biotic stresses.
In this study, a set of approximately 5000 genotyping-by-sequencing (GBS) derived high-quality SNPs were used, obtained from 363 bread wheat landraces. This research shows the usefulness of GBS-derived markers for population structure and genetic diversity studies. In the current study, the B genome had the highest number of SNPs, followed by the A and D genomes, which is in agreement with many previous studies [33,34]. Generally, in previous studies of wheat genetic diversity and population studies, the number of markers on the A and/or B genomes was many times higher than in the D genome [33,34,35,36]. However, in the present study, the number of markers on the D genome was slightly less than those on the A and B genome and, more interestingly, the D genome had the highest marker density per kb. This indicates that these landraces possess higher sequence diversity in the D genome compared with other sources. Greater sequence diversity in the D genome may indicate the presence of novel sources of genes that could be effectively utilized in elite wheat breeding programmes around the world [37]. Reduction in the D genome bottleneck will broaden the genetic base and enhance the rate of genetic gain and, as a result, will help in protecting wheat from ever-changing adverse climatic conditions and current limited genetic variation for the useful economical traits [4].

3.1. Population Structure

Population stratification analysis is useful for understanding genetic diversity and also facilitates GWAS analysis [38]. In the mapping studies, the presence of genetic structure within population can lead to false positives; therefore, special emphasis is given to carefully analysing the underlying population structure of any population to be used for marker–trait associations [39]. Therefore, analysis of population structure is considered the first step to conduct GWAS for true marker–trait associations [32,40]. In our study, the STRUCTURE results suggested the population could be clustered in two sub-populations. The clustering was mildly in accordance with the geographic background of the landraces. The two clusters were highly admixed and suggested that most of the variation is among the landraces within the clusters. Although two sub-populations may be reasonable for our panel, based on the population size and the variation in the number of landraces representing the seven provinces from which they were collected, we were still a bit sceptical about the results from STRUCTURE. It has been reported that the optimal k = 2 value sometimes misrepresents the true structure in the population and could possibly mean that either there is no definite population structure, or the STRUCTURE program failed to successfully identify the underlying genetic structure of the collection [41]. In addition to STRUCTURE, we used DAPC, PCA, neighbour-joining cluster analysis, and MSN analysis to truly understand the genetic structure of this population. Based on the results of these analyses, it was observed that there are at least three sub-populations in the population. The results of DAPC and PCA also contained admixture within the clusters, yet this meets our expectations in a better way, as it was observed that the landraces from the neighbouring provinces were mostly clustered together as compared with the two sub-populations derived by STRUCTURE. Based on the DAPC results, the landraces from Badakhshan and Takhar were grouped together, the landraces from Baghlan and Kunduz were grouped together, and the landraces from Kabul and Konarha were genetically closer to each other. The provinces of Badakhshan, Takhar, Baghlan, and Kunduz belong to one agro-ecological zone of Afghanistan (i.e., north-eastern), with similar overall climate conditions except for annual average precipitation, which is significantly higher in Badakhshan and Takhar compared with Baghlan and Kunduz, which may contribute to the genetic similarity observed in the study between the landraces from Badakhshan and Takhar and between the landraces from Baghlan and Kunduz (Table S3). Furthermore, the high levels of overall admixture between the landraces as revealed by all the methods used in the study may also be attributed to the free exchange of seeds between farmers of the north-eastern states of Afghanistan. This is in agreement with previous studies reporting high admixture and low genetic variation among the Afghan landraces [5,42,43]. A previous study of the population structure and genetic diversity of diverse 446 Afghan landraces collected from 17 provinces and different agro-climatic regions showed higher genetic diversity compared with our study [2]. However, this can be attributed to the diverse collection from distant provinces, as the genetic structure of the landraces from Badakshan, Takhar, and Baghlan was similar to our study, indicating that the higher diversity of Afghan landraces in the previous study is due to the fact that the landraces were collected from provinces far from each other, whereas in the present study, the landraces were primarily collected from adjacent provinces.

3.2. Genetic Differentiation of Populations

Genetic differentiation due to population structure is measured by Fst. An Fst value ranges from 0 to 1 and a value of 0.15 is considered significant in distinguishing the populations [44]. Thus, significant differentiation was observed in both sub-populations found by STRUCTURE. This is in accordance with the AMOVA results, where most of the variation (99.71%) was accounted for within sub-populations. Similarly, when the population was clustered into three sub-populations, as was indicated by DAPC analysis, the majority of the variation (97.53%) was observed within the population and only a small fraction (2.47%) of the variation was accounted for between the three sub-populations. When the geographic origin was used as a proxy for clustering the landraces, a similar trend was observed and 95.58% of the total variation was observed within populations. Our results are in agreement with the previous study, which also observed higher genetic variation within the landraces of the sub-populations [2]. The DAPC analysis clustered the population into three sub-populations, in which majority of the landraces from Badakhshan and Takhar provinces clustered together in Sub-population 1 and the majority of the landraces from Baghlan and Kunduz provinces were grouped into Sub-population 2. These results closely resemble existing micro-climatic conditions within the north-eastern agro-climatic zone of Afghanistan. Genetic distance analysis was used to identify the difference among the provinces; the strongest correlation was observed between landraces from Badakhshan and Takhar, again in agreement with the results observed by DAPC analysis. The highest genetic distances were observed in landraces from Konarha and Samangan (0.399), and Kabul and Konarha (0.308); however, even these distances are not significant, which implies that due to the narrow agro-climatic zone of the landraces collected from the seven provinces of Afghanistan, there are high proportions of admixture among them and the differences between the landraces is due to their geographic background. The same findings were observed in the previous studies, where landraces from Badakshan and Takhar showed higher levels of similarity and the landraces were differentiated on the basis of geographic origin and agro-climatic zones [2,5,30].

3.3. Genetic Diversity Indices

The allelic patterns and indices of genetic diversity shed light on the genetic diversity within sub-populations. Clustering based on STRUCTURE observed slightly higher genetic diversity in Sub-population 1 compared with Sub-population 2, which could be attributed to the size of Sub-population 1. In the case of three sub-populations, as grouped by DAPC analysis, Sub-population 2 showed the highest genetic diversity. Sub-population 2 comprised mainly landraces from Baghlan and Kunduz, identifying the highest genetic diversity amongst the landraces from these regions. The landraces from Baghlan also showed significant genetic diversity in an earlier study, thus confirming our results [5]. Interestingly, when the landraces were clustered based on their geographic background, the highest genetic diversity was observed in landraces from Badakhshan province. This may be attributed to the high number of accessions from Badakhshan. However, similar findings were reported in a former study, which indicated the highest genetic diversity among the landraces from Badakhshan province when compared with landraces from nine other provinces in Afghanistan [2]. Badakhshan was followed by Takhar and Baghlan provinces. Although the results suggested that, however, there is low genetic variation between the sub-populations of different provinces in north and north-eastern provinces of Afghanistan, there exists considerably high genetic diversity in landraces within the provinces and sub-populations. The understanding of diversity within the sub-populations of landrace populations from Afghanistan will improve future planning collecting missions to gather more diversity, optimize conservation strategies for landraces from Afghanistan, and efficiently provide valuable traits and alleles needed in wheat breeding programmes.

4. Materials and Methods

4.1. Plant Material

Three hundred and sixty-three Afghan bread wheat landrace accessions assembled from 7 north and north-eastern states of Afghanistan were used in the study. All landraces are conserved in the International Center for Agricultural Research in the Dry Areas (ICARDA) gene bank. The highest numbers of landraces were collected from Badakhshan and Takhar regions, with 119 accessions from each region. The rest of the accessions were collected from Baghlan (n = 66), Kabul (n = 7), Konarha (n = 1), Kunduz (n = 50), and Samangan (n = 1). Badakhshan, Baghlan, Takhar, and Kunduz belong to one agro-ecological zone (i.e., north-eastern), whereas Kabul, Samangan, and Konarha belong to three different agro-ecological zones, which are central, north and east, respectively. Details of the landraces used in the study are presented in the Table S1.

4.2. Genotyping

The landraces were genotyped by the high-throughput genotyping by sequencing (GBS) method using DArT technology [45] at the Genetic Analysis Service for Agriculture (SAGA) of the International Maize and Wheat Improvement Center (CIMMYT) in Mexico, supported by the Seed of Discovery Project, SeeD. Markers were filtered based on missing data >20%, minor allele frequency (MAF) < 5%, and other parameters such as call rate, polymorphic information content (PIC), and reproducibility. In the final set, only the markers with known chromosomal location and position were retained, which resulted in 4897 high-quality SNP markers to be used in the study.

4.3. Population Structure Analysis

In order to analyse the underlying population structure in the Afghan landraces, STRUCTURE software (v.2.3.4) was used, which implements model-based Bayesian cluster analysis [46]. The program was run for 10 independent replicates for each putative sub-population ranging from k = 2–10 under the admixture model of population structure and was assessed with a burn-in period of 50,000 and 50,000 Markov Chain Monte Carlo (MCMC) replications. To infer the optimal clusters/sub-populations, the best K value representing the optimum number of sub-populations was estimated as Delta KK) based on the change in the log probability of data between successive structure iterations using Structure Harvester [47]. For the optimal K value, membership coefficient matrices of three replicates from the STRUCTURE analysis were used in CLUMPP [48] to generate both individual and population Q-matrices, which after integration of the geographical information, were used to create bar plot using DISTRUCT program [48].
The second approach to analyse the population structure was based on the discriminant analysis of principal components (DAPC), which was implemented using an R package, “adegenet” [49]. DAPC uses K-means clustering of principal components to identify groups of individuals. It is run with different numbers of clusters, each of which gives rise to a statistical model and an associated likelihood. The best supported model, and therefore the number and nature of clusters, it is assessed using the Bayesian Information Criterion (BIC).

4.4. Genetic Diversity and Analysis of Molecular Variance

In order to analyse the genetic variation amongst the 363 bread wheat landraces from Afghanistan, various diversity parameters like the number of different alleles (Na), the number of effective alleles (Ne), Shannon’s index (I), the diversity index (He), the unbiased diversity index (uHe), and the percentage of polymorphic loci (PPL) were measured using GenAlex v6.503 [50]. The number of sub-populations defined by STRUCTURE and DAPC, and sub-populations based on the geographic origin of the landraces were used for AMOVA and the calculations of Nei’s genetic identity and genetic distance among populations using GenAlex v6.503 [50]. For the estimation of genetic relationship among landraces, Principal Component Analysis (PCA) was estimated using the R package “adegenet” [51] in R studio [52]. A neighbour-joining phylogenetic tree based on a simple matching dissimilarity coefficient without the assumption of an evolutionary hierarchy and a minimum spanning network (MSN) was constructed using the r package “poppr” [53].

5. Conclusions

This study provided a detailed view of the population structure and genetic diversity of 363 bread wheat landraces from seven provinces of Afghanistan. In this study, SNPs obtained from high-throughput GBS technology were used to explore population structure and genetic diversity of bread wheat landraces and suggests that these markers with high genome coverage are useful for revealing the genetic diversity and population structure. The population structure analysis revealed that the landraces originating from different geographic regions of north and north-eastern parts of Afghanistan were best divided into three distinct groups based mainly on geographical background and micro-climatic conditions within the agro-ecological zones of the landraces. However, high proportions of admixture were also observed, revealing the possibility of historical free seed exchange among farmers within these provinces. Further, it was observed that the genetic diversity was higher within the landraces compared with between the sub-populations, which is unique to these particular landraces and has been previously reported as well. The study also found out that the number of markers on the D genome, interestingly, was nearly similar to those on the A and B genomes, which is also unique to these landraces, as previous studies on bread wheat reported that the D genome had 2–5 times fewer markers than the A and B genomes. This indicated high sequence diversity present in the D genome in the Afghan landraces. Thus, this suggests that this diversity can be utilized in the elite wheat breeding programmes to widen the genetic base of the overall breeding populations. This knowledge of the genetic diversity and population structure of landraces from central and north-eastern provinces of Afghanistan will be important for future studies in the region, conferring resistance to abiotic and biotic stresses from GWAS to identify and isolate marker–trait associations for important economical traits.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/12/3/340/s1. Figure S1: STRUCTURE bar plot at K = 2 for 363 landraces used in the study. Table S1: List of bread wheat landraces used in the study, including their province of origin and longitude and latitude coordinates. Table S2: Proportion of membership of each predefined population in each of the two clusters of bread wheat landraces using the STRUCTURE program. Table S3: Average temperature and precipitation of provinces in the north-eastern and central agro-ecological zones of Afghanistan.

Author Contributions

Conceptualization, K.N.; methodology, M.M.T.; software, M.M.T., D.I., and S.M.; resources, M.d.S.L. and C.P.S.; writing—original draft preparation, M.M.T.; writing—review and editing, K.N., Z.K., and E.K.; funding acquisition, K.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Delivering Genetic Gain in Wheat project supported by the Bill and Melinda Gates Foundation (BMGF) and the UK Department for International Development (DFID), grant number OPP1133199.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bhatta, M.; Regassa, T.; Rose, D.J.; Baenziger, P.S.; Eskridge, K.M.; Santra, D.K.; Poudel, R. Genotype, Environment, Seeding Rate, and Top-Dressed Nitrogen Effects on End-Use Quality of Modern Nebraska Winter Wheat. J. Sci. Food Agric. 2017, 97, 5311–5318. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Manickavelu, A.; Jighly, A.; Ban, T. Molecular Evaluation of Orphan Afghan Common Wheat (Triticum Aestivum L.) Landraces Collected by Dr. Kihara Using Single Nucleotide Polymorphic Markers. BMC Plant Biol. 2014, 14, 320. [Google Scholar] [CrossRef] [Green Version]
  3. Dubcovsky, J.; Dvorak, J. Erratum: Genome Plasticity a Key Factor in the Success of Polyploid Wheat under Domestication (Science (1862)). Science 2007, 318, 393. [Google Scholar]
  4. Tehseen, M.M.; Tonk, F.A.; Tosun, M.; Amri, A.; Sansaloni, C.P.; Kurtulus, E.; Yazbek, M.; Al-Sham&aa, K.; Ozseven, I.; Safdar, L.B.; et al. Genome-Wide Association Study of Resistance to PstS2 and Warrior Races of Puccinia Striiformis f. Sp. Tritici (Stripe Rust) in Bread Wheat Landraces. Plant Genome 2020, e20066. [Google Scholar] [CrossRef]
  5. Terasawa, Y.; Kawahara, T.; Sasakuma, T.; Sasanuma, T. Evaluation of the Genetic Diversity of an Afghan Wheat Collection Based on Morphological Variation, HMW Glutenin Subunit Polymorphisms, and AFLP. Breed. Sci. 2009, 59, 361–371. [Google Scholar] [CrossRef]
  6. Rudd, J.C.; Horsley, R.D.; McKendry, A.L.; Elias, E.M. Host Plant Resistance Genes for Fusarium Head Blight: Sources, Mechanisms, and Utility in Conventional Breeding Systems. Crop Sci. 2001, 41, 620–627. [Google Scholar] [CrossRef] [Green Version]
  7. Nakamura, T.; Vrinten, P.; Saito, M.; Konda, M. Rapid Classification of Partial Waxy Wheats Using PCR-Based Markers. Genome 2002, 45, 1150–1156. [Google Scholar] [CrossRef]
  8. Zeven, A.C. Landraces: A Review of Definitions and Classifications. Euphytica 1998, 104, 127–139. [Google Scholar] [CrossRef]
  9. Endresen, D.T.F.; Street, K.; Mackay, M.; Bari, A.; Pauw, E.D. Predictive Association between Biotic Stress Traits and Eco-Geographic Data for Wheat and Barley Landraces. Crop Sci. 2011, 51, 2036–2055. [Google Scholar] [CrossRef]
  10. Reif, J.C.; Zhang, P.; Dreisigacker, S.; Warburton, M.L.; van Ginkel, M.; Hoisington, D.; Bohn, M.; Melchinger, A.E. Wheat Genetic Diversity Trends during Domestication and Breeding. Appl. Genet. 2005, 110, 859–864. [Google Scholar] [CrossRef] [PubMed]
  11. Isemura, T.; Noda, C.; Mori, S.; Yamashita, M.; Nakanishi, H.; Inoue, M.; Kamijima, O. Genetic Variation and Geographical Distribution of Azuki Bean (Vigna Angularis) Landraces Based on the Electrophoregram of Seed Storage Proteins. Breed. Sci. 2001, 51, 225–230. [Google Scholar] [CrossRef] [Green Version]
  12. Hargrove, T.R.; Cabanilla, V.L. The Impact of Semidwarf Varieties on Asian Rice-Breeding Programs. BioScience 1979, 29, 731–735. [Google Scholar] [CrossRef]
  13. Malvar, R.A.; Butrón, A.; Alvarez, A.; Ordás, B.; Soengas, P.; Revilla, P.; Ordás, A. Evaluation of the European Union Maize Landrace Core Collection for Resistance to Sesamia Nonagrioides (Lepidoptera: Noctuidae) and Ostrinia Nubilalis (Lepidoptera: Crambidae). J. Econ. Entomol. 2004, 97, 628–634. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, G.; Gu, C.; Wang, D. Molecular Mapping of Soybean Aphid Resistance Genes in PI 567541B. Appl. Genet. 2009, 118, 473–482. [Google Scholar] [CrossRef]
  15. Feldman, M.; Sears, E.R. The Wild Gene Resources of Wheat. Sci. Am. 1981, 244, 102–113. [Google Scholar] [CrossRef]
  16. Vavilov, N.I. Centers of Origin of Cultivated Plants. N. I. Vavilov Orig. Geogr. Cultiv. Plants 1926. [Google Scholar]
  17. Lagudah, E.S.; Flood, R.G.; Halloran, G.M. Variation in High Molecular Weight Glutenin Subunits in Landraces of Hexaploid Wheat from Afghanistan. Euphytica 1987, 36, 3–9. [Google Scholar] [CrossRef]
  18. Iwaki, K.; Haruna, S.; Niwa, T.; Kato, K. Adaptation and Ecological Differentiation in Wheat with Special Reference to Geographical Variation of Growth Habit and Vrn Genotype. Plant Breed. 2001, 120, 107–114. [Google Scholar] [CrossRef]
  19. Kazman, M.E. Evaluation of Afghan Land Races of Wheat for Multi-Purpose Breeding. In Proceedings of the 10th International Wheat Genetics Symposium, Paestum, Italy, 1–6 September 2003; SIM.I. pp. 587–589. [Google Scholar]
  20. Ghimire, S.K.; Akashi, Y.; Maitani, C.; Nakanishi, M.; Kato, K. Genetic Diversity and Geographical Differentiation in Asian Common Wheat (Triticum Aestivum L.), Revealed by the Analysis of Peroxidase and Esterase Isozymes. Breed. Sci. 2005, 55, 175–185. [Google Scholar] [CrossRef] [Green Version]
  21. Tuberosa, R.; Graner, A.; Varshney, R.K. Genomics of Plant Genetic Resources: An Introduction. Plant Genet. Resour. 2011, 9, 151–154. [Google Scholar] [CrossRef] [Green Version]
  22. Glaszmann, J.; Kilian, B.; Upadhyaya, H.; Varshney, R. Accessing Genetic Diversity for Crop Improvement. Curr. Opin. Plant Biol. 2010, 13, 167–173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Kilian, B.; Özkan, H.; Walther, A.; Kohl, J.; Dagan, T.; Salamini, F.; Martin, W. Molecular Diversity at 18 Loci in 321 Wild and 92 Domesticate Lines Reveal No Reduction of Nucleotide Diversity during Triticum Monococcum (Einkorn) Domestication: Implications for the Origin of Agriculture. Mol. Biol. Evol. 2007, 24, 2657–2668. [Google Scholar] [CrossRef] [Green Version]
  24. Rafalski, J.A. Novel Genetic Mapping Tools in Plants: SNPs and LD-Based Approaches. Plant Sci. 2002, 162, 329–333. [Google Scholar] [CrossRef]
  25. Myles, S.; Peiffer, J.; Brown, P.J.; Ersoz, E.S.; Zhang, Z.; Costich, D.E.; Buckler, E.S. Association Mapping: Critical Considerations Shift from Genotyping to Experimental Design. Plant Cell 2009, 21, 2194–2202. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Meuwissen, T.H.E.; Hayes, B.J.; Goddard, M.E. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps. Genetics 2001, 157, 1819–1829. [Google Scholar]
  27. Yao, F.; Zhang, X.; Ye, X.; Li, J.; Long, L.; Yu, C.; Li, J.; Wang, Y.; Wu, Y.; Wang, J.; et al. Characterization of Molecular Diversity and Genome-Wide Association Study of Stripe Rust Resistance at the Adult Plant Stage in Northern Chinese Wheat Landraces. BMC Genet. 2019, 20, 38. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Sehgal, D.; Dreisigacker, S.; Belen, S.; Küçüközdemir, Ü.; Mert, Z.; Özer, E.; Morgounov, A. Mining Centuries Old In Situ Conserved Turkish Wheat Landraces for Grain Yield and Stripe Rust Resistance Genes. Front. Genet. 2016, 7. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Bulli, P.; Zhang, J.; Chao, S.; Chen, X.; Pumphrey, M. Genetic Architecture of Resistance to Stripe Rust in a Global Winter Wheat Germplasm Collection. G3 GenesGenomesGenet 2016, 6, 2237–2253. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Manickavelu, A.; Joukhadar, R.; Jighly, A.; Lan, C.; Huerta-Espino, J.; Stanikzai, A.S.; Kilian, A.; Singh, R.P.; Ban, T. Genome Wide Association Mapping of Stripe Rust Resistance in Afghan Wheat Landraces. Plant Sci. 2016, 252, 222–229. [Google Scholar] [CrossRef] [Green Version]
  31. Maccaferri, M.; Zhang, J.; Bulli, P.; Abate, Z.; Chao, S.; Cantu, D.; Bossolini, E.; Chen, X.; Pumphrey, M.; Dubcovsky, J. A Genome-Wide Association Study of Resistance to Stripe Rust (Puccinia Striiformis f. Sp. Tritici) in a Worldwide Collection of Hexaploid Spring Wheat (Triticum Aestivum L.). G3 (Bethesda) 2015, 449–465. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Alemu, A.; Feyissa, T.; Letta, T.; Abeyo, B. Genetic Diversity and Population Structure Analysis Based on the High Density SNP Markers in Ethiopian Durum Wheat (Triticum Turgidum Ssp. Durum). BMC Genet. 2020, 21, 18. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Bhatta, M.; Morgounov, A.; Belamkar, V.; Poland, J.; Baenziger, P.S. Unlocking the Novel Genetic Diversity and Population Structure of Synthetic Hexaploid Wheat. BMC Genom. 2018, 19, 591. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Alipour, H.; Bihamta, M.R.; Mohammadi, V.; Peyghambari, S.A.; Bai, G.; Zhang, G. Genotyping-by-Sequencing (GBS) Revealed Molecular Genetic Diversity of Iranian Wheat Landraces and Cultivars. Front. Plant Sci. 2017, 8, 1293. [Google Scholar] [CrossRef]
  35. Allen, A.M.; Barker, G.L.A.; Wilkinson, P.; Burridge, A.; Winfield, M.; Coghill, J.; Uauy, C.; Griffiths, S.; Jack, P.; Berry, S.; et al. Discovery and Development of Exome-Based, Co-Dominant Single Nucleotide Polymorphism Markers in Hexaploid Wheat (Triticum Aestivum L.). Plant Biotechnol. J. 2013, 11, 279–295. [Google Scholar] [CrossRef] [Green Version]
  36. Iehisa, J.C.M.; Shimizu, A.; Sato, K.; Nishijima, R.; Sakaguchi, K.; Matsuda, R.; Nasuda, S.; Takumi, S. Genome-Wide Marker Development for the Wheat D Genome Based on Single Nucleotide Polymorphisms Identified from Transcripts in the Wild Wheat Progenitor Aegilops Tauschii. Appl. Genet. 2014, 127, 261–271. [Google Scholar] [CrossRef] [PubMed]
  37. Muleta, K.T.; Bulli, P.; Rynearson, S.; Chen, X.; Pumphrey, M. Loci Associated with Resistance to Stripe Rust (Puccinia Striiformis f. Sp. Tritici) in a Core Collection of Spring Wheat (Triticum Aestivum). PLoS ONE 2017, 12, e0179087. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Luo, Z.; Brock, J.; Dyer, J.M.; Kutchan, T.; Schachtman, D.; Augustin, M.; Ge, Y.; Fahlgren, N.; Abdel-Haleem, H. Genetic Diversity and Population Structure of a Camelina Sativa Spring Panel. Front. Plant Sci. 2019, 10, 184. [Google Scholar] [CrossRef] [Green Version]
  39. Eltaher, S.; Sallam, A.; Belamkar, V.; Emara, H.A.; Nower, A.A.; Salem, K.F.M.; Poland, J.; Baenziger, P.S. Genetic Diversity and Population Structure of F3:6 Nebraska Winter Wheat Genotypes Using Genotyping-By-Sequencing. Front. Genet. 2018, 9, 76. [Google Scholar] [CrossRef]
  40. Arora, S.; Singh, N.; Kaur, S.; Bains, N.S.; Uauy, C.; Poland, J.; Chhuneja, P. Genome-Wide Association Study of Grain Architecture in Wild Wheat Aegilops Tauschii. Front. Plant Sci. 2017, 8, 886. [Google Scholar] [CrossRef] [Green Version]
  41. Janes, J.K.; Miller, J.M.; Dupuis, J.R.; Malenfant, R.M.; Gorrell, J.C.; Cullingham, C.I.; Andrew, R.L. The K = 2 Conundrum. Mol. Ecol. 2017, 26, 3594–3602. [Google Scholar] [CrossRef] [Green Version]
  42. Gupta, V.; He, X.; Kumar, N.; Fuentes-Davila, G.; Sharma, R.K.; Dreisigacker, S.; Juliana, P.; Ataei, N.; Singh, P.K. Genome Wide Association Study of Karnal Bunt Resistance in a Wheat Germplasm Collection from Afghanistan. Int. J. Mol. Sci. 2019, 20, 3124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  43. Lee, S.; Choi, Y.-M.; Lee, M.-C.; Hyun, D.Y.; Oh, S.; Jung, Y. Geographical Comparison of Genetic Diversity in Asian Landrace Wheat (Triticum Aestivum L.) Germplasm Based on High-Molecular-Weight Glutenin Subunits. Genet. Resour. Crop. Evol. 2018, 65, 1591–1602. [Google Scholar] [CrossRef]
  44. Frankham, R.; Ballou, S.E.J.D.; Briscoe, D.A.; Ballou, J.D. Introduction to Conservation Genetics; Cambridge University Press: Cambridge, UK, 2002; ISBN 978-0-521-63985-9. [Google Scholar]
  45. Sansaloni, C.; Petroli, C.; Jaccoud, D.; Carling, J.; Detering, F.; Grattapaglia, D.; Kilian, A. Diversity Arrays Technology (DArT) and next-Generation Sequencing Combined: Genome-Wide, High Throughput, Highly Informative Genotyping for Molecular Breeding of Eucalyptus. In Proceedings of the BMC Proceedings; BioMed Central: London, UK, 2011; Volume 5, p. 54. [Google Scholar]
  46. Pritchard, J.K.; Stephens, M.; Donnelly, P. Inference of Population Structure Using Multilocus Genotype Data. Genetics 2000, 155, 945–959. [Google Scholar] [PubMed]
  47. Evanno, G.; Regnaut, S.; Goudet, J. Detecting the Number of Clusters of Individuals Using the Software STRUCTURE: A Simulation Study. Mol. Ecol. 2005, 14, 2611–2620. [Google Scholar] [CrossRef] [Green Version]
  48. Jakobsson, M.; Rosenberg, N.A. CLUMPP: A Cluster Matching and Permutation Program for Dealing with Label Switching and Multimodality in Analysis of Population Structure. Bioinformatics 2007, 23, 1801–1806. [Google Scholar] [CrossRef] [Green Version]
  49. Jombart, T.; Devillard, S.; Balloux, F. Discriminant Analysis of Principal Components: A New Method for the Analysis of Genetically Structured Populations. BMC Genet. 2010, 11, 94. [Google Scholar] [CrossRef] [Green Version]
  50. Peakall, R.; Smouse, P.E. GenAlEx 6.5: Genetic Analysis in Excel. Population Genetic Software for Teaching and Researchdan Update. Bioinformatics 2012, 28, 2537e2539. [Google Scholar]
  51. Jombart, T.; Ahmed, I. Adegenet 1.3-1: New Tools for the Analysis of Genome-Wide SNP Data. Bioinformatics 2011, 27, 3070–3071. [Google Scholar] [CrossRef] [Green Version]
  52. R Core Team. R: A Language and Environment for Statistical Computing; R Core Team: Vienna, Austria, 2013. [Google Scholar]
  53. Kamvar, Z.N.; Tabima, J.F.; Grünwald, N.J. Poppr: An R Package for Genetic Analysis of Populations with Clonal, Partially Clonal, and/or Sexual Reproduction. PeerJ 2014, 2, e281. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Distribution of the Afghan landraces according to their geographic origins.
Figure 1. Distribution of the Afghan landraces according to their geographic origins.
Genes 12 00340 g001
Figure 2. Distribution and the corresponding single nucleotide polymorphism (SNP) densities of 4897 SNPs across 21 chromosomes of bread wheat landraces from Afghanistan.
Figure 2. Distribution and the corresponding single nucleotide polymorphism (SNP) densities of 4897 SNPs across 21 chromosomes of bread wheat landraces from Afghanistan.
Genes 12 00340 g002
Figure 3. Population structure of the 363 Afghan bread wheat landraces.
Figure 3. Population structure of the 363 Afghan bread wheat landraces.
Genes 12 00340 g003
Figure 4. Inference of the sub-populations by discriminant analysis of the principal components (DAPC) analysis grouping landraces from different regions together.
Figure 4. Inference of the sub-populations by discriminant analysis of the principal components (DAPC) analysis grouping landraces from different regions together.
Genes 12 00340 g004
Figure 5. Principal component analysis (PCA) of 363 Afghan bread wheat landraces. colours are coded as the first three principal components (PCs) of the PCA recorded, respectively, on the red, green, and blue channel.
Figure 5. Principal component analysis (PCA) of 363 Afghan bread wheat landraces. colours are coded as the first three principal components (PCs) of the PCA recorded, respectively, on the red, green, and blue channel.
Genes 12 00340 g005
Figure 6. Neighbour-joining clustering of 363 Afghan bread wheat landraces.
Figure 6. Neighbour-joining clustering of 363 Afghan bread wheat landraces.
Genes 12 00340 g006
Figure 7. Minimum spanning network (MSN) of 363 Afghan bread wheat landraces.
Figure 7. Minimum spanning network (MSN) of 363 Afghan bread wheat landraces.
Genes 12 00340 g007
Figure 8. Estimated population membership probability of 363 bread wheat landraces from seven states of Afghanistan, where each bar represents a landrace.
Figure 8. Estimated population membership probability of 363 bread wheat landraces from seven states of Afghanistan, where each bar represents a landrace.
Genes 12 00340 g008
Table 1. Genomic distribution of 4897 SNPs mapped on 21 hexaploid wheat chromosomes.
Table 1. Genomic distribution of 4897 SNPs mapped on 21 hexaploid wheat chromosomes.
ChromosomeNo. of SNPsSNP PercentageStart PositionEnd PositionLength (kb)Density (kb)
1A2124.331181281,698280.51.33
1B2795.7604254,989254.40.91
1D1843.762442233,037230.61.27
2A3256.64833290,168289.30.89
2B3787.72593550,308549.71.46
2D2725.55262227,366227.10.84
3A2244.57105243,683243.61.09
3B3086.29179404,300404.11.31
3D1974.02798313,780313.01.59
4A1663.39282395,099394.82.38
4B1152.351793288,834287.02.51
4D1272.592043437,139435.13.44
5A2815.74689245,844245.20.87
5B3086.29173445,996445.81.45
5D1833.741657253,399251.71.38
6A1783.6353395,985395.92.22
6B2665.43404372,421372.01.40
6D1723.511168215,761214.61.25
7A2675.45333418,101417.81.57
7B2434.962096318,562316.51.31
7D2124.33888273,554272.71.29
Table 2. The STRUCTURE results of 363 Afghan landraces for the fixation index (Fst), average distances (expected heterozygosity/Het), gene flow (Nm), and the number of accessions assigned to each sub-population.
Table 2. The STRUCTURE results of 363 Afghan landraces for the fixation index (Fst), average distances (expected heterozygosity/Het), gene flow (Nm), and the number of accessions assigned to each sub-population.
PopulationInferred ClustersMean FstExp. HetNmNo. of Accessions
Pop10.6180.50910.23850.241238
Pop20.3820.14120.34011.52125
Table 3. Analysis of molecular variance (AMOVA) revealing genetic diversity in the Afghan landraces.
Table 3. Analysis of molecular variance (AMOVA) revealing genetic diversity in the Afghan landraces.
MethodSourcedfSSMSEst. Var.%
Model-Based (STRUCTURE)Among Pops1366.5567366.55670.6986970.29%
Within Pops36186625.44239.9597239.959799.71%
Total36286992 240.6584100.00%
Distance-Based (Cluster, DAPC)
Among Pops21832.573916.28636.0018692.47%
Within Pops36085159.43236.554236.55497.53%
Total36286992 242.5558100.00%
Based on Origin
Among Pops6377.973162.995520.9550264.41%
Within Pops3567356.36120.6639320.6639395.58%
Total3627734.33421.3655621.61896100%
df: degrees of freedom, SS: sum of squares, MS: mean square, Est. Var: estimated variance.
Table 4. Genetic distance among the populations by different methods.
Table 4. Genetic distance among the populations by different methods.
Method
Model-Based (STRUCTURE)Pop1Pop2
Pop1 0.996
Pop20.004
Distance-Based (Cluster, DAPC)
Pop1Pop2Pop3
Pop1 0.9770.991
Pop20.023 0.982
Pop30.0090.018
Based on Origin
BadakhshanBaghlanKabulKonarhaKunduzSamanganTakhar
Badakhshan 0.9750.8800.7730.9750.7830.997
Baghlan0.026 0.9070.7770.9930.8130.973
Kabul0.1280.098 0.7350.8960.7470.877
Konarha0.2570.2520.308 0.7680.6710.772
Kunduz0.0250.0070.1090.263 0.8120.976
Samangan0.2450.2070.2920.3990.208 0.780
Takhar0.0030.0270.1320.2590.0240.249
Nei’s genetic identity is above the diagonal and genetic distance is below the diagonal.
Table 5. Mean of different genetic parameters, including the number of different alleles (Na), the number of effective alleles (Ne), Shannon’s index (I), the diversity index (He), the unbiased diversity index (uHe), and the percentage of polymorphic loci (PPL) in each of the two subpopulations.
Table 5. Mean of different genetic parameters, including the number of different alleles (Na), the number of effective alleles (Ne), Shannon’s index (I), the diversity index (He), the unbiased diversity index (uHe), and the percentage of polymorphic loci (PPL) in each of the two subpopulations.
MethodPopNaNeIHeuHePPL
Model-Based (STRUCTURE)Pop11.9981.5940.5180.3460.34899.90%
Pop22.0001.5880.5140.3430.345100.00%
Mean1.9991.5910.5160.3450.34699.95%
Distance-Based (Cluster, DAPC *)
Pop11.9871.5670.4960.3310.33599.20%
Pop21.9971.5930.5170.3450.34799.80%
Pop31.9981.5780.5050.3370.34099.90%
Mean1.9941.5790.5060.3380.34199.63%
Based on Origin
Badakhshan1.9981.5840.5090.3400.34399.90%
Baghlan1.9841.5750.5030.3360.34198.90%
Kabul1.3121.3620.3040.2060.23154.90%
Konarha0.3581.0000.0000.0000.0000.00%
Kunduz1.9341.5560.4860.3240.33195.60%
Samangan0.3591.0000.0000.0000.0000.00%
Takhar1.9981.5780.5050.3370.34099.90%
Mean1.4201.3790.3300.2200.22764.17%
*—Discriminant Analysis of Principal Components.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tehseen, M.M.; Istipliler, D.; Kehel, Z.; Sansaloni, C.P.; da Silva Lopes, M.; Kurtulus, E.; Muazzam, S.; Nazari, K. Genetic Diversity and Population Structure Analysis of Triticum aestivum L. Landrace Panel from Afghanistan. Genes 2021, 12, 340. https://doi.org/10.3390/genes12030340

AMA Style

Tehseen MM, Istipliler D, Kehel Z, Sansaloni CP, da Silva Lopes M, Kurtulus E, Muazzam S, Nazari K. Genetic Diversity and Population Structure Analysis of Triticum aestivum L. Landrace Panel from Afghanistan. Genes. 2021; 12(3):340. https://doi.org/10.3390/genes12030340

Chicago/Turabian Style

Tehseen, Muhammad Massub, Deniz Istipliler, Zakaria Kehel, Carolina P. Sansaloni, Marta da Silva Lopes, Ezgi Kurtulus, Sana Muazzam, and Kumarse Nazari. 2021. "Genetic Diversity and Population Structure Analysis of Triticum aestivum L. Landrace Panel from Afghanistan" Genes 12, no. 3: 340. https://doi.org/10.3390/genes12030340

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop