Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines

Potapova, Nadezhda A.; Timoshchuk, Anna N.; Tiys, Evgeny S.; Vinichenko, Natalia A.; Leonova, Irina N.; Salina, Elena A.; Tsepilov, Yakov A.

doi:10.3390/plants12173019

Open AccessBrief Report

Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines

by

Nadezhda A. Potapova

^1,2,*,

Anna N. Timoshchuk

^3,4,

Evgeny S. Tiys

³,

Natalia A. Vinichenko

³,

Irina N. Leonova

¹

,

Elena A. Salina

¹ and

Yakov A. Tsepilov

^1,*

¹

Kurchatov Genomics Center, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia

²

Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, 127051 Moscow, Russia

³

Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, 630090 Novosibirsk, Russia

⁴

MSU Institute for Artificial Intelligence, Lomonosov Moscow State University, 119991 Moscow, Russia

^*

Authors to whom correspondence should be addressed.

Plants 2023, 12(17), 3019; https://doi.org/10.3390/plants12173019

Submission received: 25 July 2023 / Revised: 11 August 2023 / Accepted: 19 August 2023 / Published: 22 August 2023

(This article belongs to the Special Issue Selected Papers from the 7th International Scientific Conference “Plant Genetics, Genomics, Bioinformatics and Biotechnology” (PlantGen 2023))

Download

Browse Figures

Versions Notes

Abstract

:

Wheat is a cereal grain that plays an important role in the world’s food industry. The identification of the loci that change the concentration of elements in wheat seeds is an important challenge nowadays especially for genomic selection and breeding of novel varieties. In this study, we performed a multivariate genome-wide association study (GWAS) of the seven traits—concentrations of Zn, Mg, Mn, Ca, Cu, Fe, and K in grain—of the Russian collection of common wheat Triticum aestivum (N = 149 measured in two years in two different fields). We replicated one known locus associated with the concentration of Zn (IAAV1375). We identified four novel loci—BS00022069_51 (associated with concentrations of Ca and K), RFL_Contig6053_3082 (associated with concentrations of Fe and Mn), Kukri_rep_c70864_329 (associated with concentrations of all elements), and IAAV8416 (associated with concentrations of Fe and Mn)—three of them were located near the genes TraesCS6A02G375400, TraesCS7A02G094800, and TraesCS5B02G325400. Our result adds novel information on the loci involved in wheat grain element contents and may be further used in genomic selection.

Keywords:

wheat; Triticum aestivum; microelement; macroelement; GWAS; multivariate analysis

1. Introduction

Wheat is a cereal grain that plays an important role in the world’s food industry. Wheat lines have been studied using many genomic approaches, such as genome sequencing and resequencing [1,2,3]), transcriptome analysis [4,5,6], and SNP arrays [7,8,9], that have given one an opportunity to investigate many wheat traits, such as element concentrations, immune response, and drought resistance.

One of the important components of wheat seeds are micronutrients, such as calcium, zinc, and magnesium. Their concentrations are of special interest for genomic selection studies due to their role in grain food value. Micronutrient deficiency, also known as hidden hunger, is a result of food intake with low concentrations of micronutrients and vitamins [10]. It may lead to different diseases and even death. For that reason, an ability to increase concentrations of different micronutrients, including elements widely used in the food industry in wheat and its products, is an important option to affect the current situation with micronutrient deficiency in the world. Many attempts were made to uncover genomic loci affecting lower or higher concentrations of certain elements [11,12,13,14,15,16,17,18], such as Zn, Fe, Cu, Mn, and P.

The identification of the loci-changing element concentrations as well as genomic selection for concentrations is an important goal in agronomy nowadays. In this respect, special attention is to be paid to local varieties, which are adapted to particular climatic and geographical conditions of the place where they are grown. Russia ranks first in wheat exports in the world (FAOSTAT); therefore, studying Russian wheat lines and searching for the loci important for wheat quality leads to improving the food situation in the world. Until now, there has been only one study on spring Russian varieties [19] where spring wheat germplasm and concentrations of macro- and microelements and trace metals were analyzed.

Multivariate analysis allows to increase the power of analysis and to inspect different relations between traits, especially for polygenic traits when one locus may have an opposite effect on different traits of interest. For instance, a certain locus may positively affect the concentration of one element and negatively affect the concentration of another. This approach is widely used in animal breeding and human genetics but not widely used in plant genomics [20,21,22,23,24].

In this study, we performed a multivariate genome-wide association study on the dataset of the common Russian wheat lines Triticum aestivum. Seven traits that are the concentration levels of Zn, Mg, Mn, Ca, Cu, Fe, and K in grain were studied. As multivariate analysis allows for the investigation of combinations of traits, it becomes useful in breeding because it enables researchers to avoid the cases when one concentration increases and another decreases.

2. Material and Methods

2.1. Wheat Lines

A panel of 157 common spring wheat genotypes was used in this study. The panel consisted of 105 bread wheat varieties, 48 introgression lines (ILs), and 3 wheat relatives (Triticum dicoccum, Triticum timopheevii, and Triticum kiharae). Introgression lines were developed on the base of hybridization of spring bread wheat varieties with wild relatives (T. durum, T. dicoccum, T. dicoccoides, and T. timopheevii) and synthetic hexaploid wheat T. kiharae. More information on the origin of accessions including a list of plant material is available in [25].

2.2. Genotyping and Quality Control

Genomic DNA was isolated from 5–7 day-old seedlings as described in [26]. DNA purification for SNP genotyping was performed using the “Bio-Silica kit for DNA purification from reaction mixtures” according to the manufacturer’s protocol. DNA was then quantified using Qubit dsDNA BR Assay kits (Thermo Fisher Scientific, Waltham, MA, USA) on a Qubit 4 Fluorometer (Thermo Fisher Scientific). SNP genotyping was performed using the Illumina Infinium 15 K Wheat platform by TraitGenetics-Section of SGS Institute Fresenius GmbH (Gatersleben, Germany, www.traitgenetics.com (accessed on 1 August 2023)).

The dataset of 157 wheat lines, which included 13,007 SNP markers, was mapped on the wheat genome [27]. The genotypes were converted in the BED Plink format [28]. In total, 157 lines and 12794 SNPs were available for quality control and further analysis. We performed filtering of the lines and SNPs using the following filters: call rate of SNPs < 5% (--geno 0.05), minor allele frequency (MAF) < 1% (--maf 0.01), and call rate of lines < 5% (--mind 0.05). After filtering, 149 wheat lines and 11,405 SNPs remained.

The chromosomes 1A, 1B, 1D…7A, 7B, and 7D were labeled as 1, 2, 3, …, 19, 20, 21, respectively. The “Unknown” chromosome was labeled as 22. The distribution of SNPs along chromosomes (Supplementary Figure S1) shows that SNPs are distributed unequally, and many SNPs were located on the “Unknown” chromosome.

PCA analysis was performed using plink (v.1.90b6.26, [28]), and PCA plot was generated in R (version 2022.07.0, build 548) using ggplot2 and ggrepel libraries.

2.3. Phenotyping

Wheat genotypes were planted on the field of the Federal Research Center Institute of Cytology and Genetics of Siberian Branch of the Russian Academy of Sciences (Novosibirskaya Oblast, 54.9191° N, 82.9903° E) in 2018 and 2019. The experiment was conducted using randomized block design in two replicates on plots of 1 m width, 80 grains per row, and between-row spacing of 25 cm. Samples were sown in the second half of May and harvested in September in the phase of full grain ripeness. The soil of the experimental field consisted of leached chernozem; the fertile soil layer varied within 40–60 cm, and the humus content was 4.2%. The contents of nitrogen, phosphorus, and potassium were 0.34%, 0.30%, and 0.13%, respectively.

The growing season of 2018 was characterized by low temperatures in May (on average, 5 °C below normal) and high waterlogging in May–June. The weather conditions in 2019 were unstable due to uneven precipitation and temperature fluctuations in the second half of the growing season. Rainy weather was observed in May and July of 2019, and a slight drought was observed in June and August.

Randomly selected 20 spikes were harvested and bulk-threshed manually, and 10 g of grain from each genotype was used for evaluation of seven macro- and microelements (Zn, Mg, Mn, Ca, Cu, Fe, and K). The element contents were measured by atomic absorption spectrometry with flame atomization on a ContrAA 800 D instrument (Analytik Jena, Germany). Statistical processing of the results was performed using the Statistica v.10.0 software package.

We used data on the concentration of seven different elements in grain—Zn, Mg, Mn, Ca, Cu, Fe, and K—measured consequently in 2018 and 2019 for each wheat line. There were two replicas for each line and year for a total of 157 × 2 × 2 phenotypic measurements. For each element measurement in each year, we removed values outside the range of 3 IQR. Visualization and descriptive statistics were performed in R. Heritability (h²) was calculated using REML analysis implemented in GCTA (v1.94) [29]. Proportion of the explained variance for each of the elements was calculated using ANOVA in R. In this analysis, we considered the effect of the year or line on the element concentration separately.

2.4. GWAS Analysis

As for each wheat line the phenotypic information about two replicates for each year was available, we represented each line as a “twin pair”, where genotypes were the same while phenotypes were different related to each of the samples. Thus, we performed GWASs using 149 × 2 lines for each year. This approach gives an opportunity to increase the sample size and consequently the power of the study. GWAS analysis for each year and concentration level of the element was performed using mixed model approach implemented in GCTA (v1.94) [29]) fastGWA (fast MLM-based Genome-Wide Association) tool with the parameters --maf 0.01 (MAF ≥ 1%), --geno 0.05 (SNP call rate ≥ 95%), and --mind 0.05 (sample call rate ≥ 95%). Also, we used parameter --allow-no-sex (as there was no information about sex for samples).

We meta-analyzed the results for each concentration for two years to obtain a total sample size of 149 × 2 × 2 lines. Z-based sample-size-weighted meta-analysis was performed using the METAL (version 2011-03-25) [30].

As the next step, we combined different concentration levels into four multivariate traits and performed a multivariate GWAS. Multivariate traits clusters were formed based on the correlation matrix between traits (Supplementary Figure S2): (1) Ca and K; (2) Fe and Mn; (3) Mg and Zn; and (4) all elements. Beta and SE for beta used in the following multivariate analysis were calculated from METAL Z-scores according to the formula from [31]. Multivariate analysis was performed in R using the MultiABEL library. All genotyped SNPs were taken as a list of independent SNPs.

The p-value threshold for meta-analysis and multivariate analysis was set at 4.01 × 10⁻⁷. This value was obtained by the formula (0.05/(11,334 × (7 + 4)), where 11,334 is the number of SNPs used in the analysis, 7 is the number of univariate GWASs, and 4 is number of multivariate GWASs. Lambda values for each GWAS analysis were calculated in R. Variance of the elements explained by each significant SNP was estimated using p-value and number of genotypes involved in meta-analysis for traits. We used function qchisq (p-value, df = 1, and lower.tail = False) in R to obtain a chi-square value with df = 1 and then divided it by the number of genotypes. The result was multiplied by 100 to receive variance of the elements explained by SNP in percent.

For QQ plots, R library qqman (v.0.1.8) was used, and for Manhattan plots and phenotypic correlation matrix (with a correction for kinship, correlation method ward.D), we used R libraries RcolorBrewer (v.1.1.3), corrplot (v.0.92), qqman (v.0.1.8), and dplyr (v.1.1.1).

2.5. Functional Annotation

The gene nearest to a significant loci was determined with the EnsemblPlants genome browser [32] using track Sequence Genes. Detailed gene description was obtained from the UniProt database [33], and the level of certainty for protein annotation for all genes was labeled as “Predicted”.

2.6. Verification of Known Loci

We compared results from meta-analysis for each element and from clusters of multivariate analysis with already published results from [11,13]. First, we took all significant loci from the cited papers. Second, we verified whether these loci were in our analyzed dataset. Third, we considered whether the p-values for these loci obtained from the meta- and multivariate analyses were significant; i.e., p-value was below 0.05/35, where 35 is the number of significant loci from the papers.

3. Results

3.1. Concentration of Elements in Grain

We checked the distributions of the element concentration levels for each year (Supplementary Figures S3–S9). Some of the phenotypes had a skewed normal distribution. Up to 30 outliers were observed for each trait taking together two replicates for each year. These observations reveal the degree of diversity in the studied lines (Supplementary Table S1).

Heritability estimates (h²) were moderate almost for all traits (except Ca in 2018 and 2019, Mg in 2019, and Zn in 2019) with a good resemblance of estimates between years (Table 1). The proportion of the explained variance of element concentrations was high when we considered the differences between lines (Supplementary Table S2)—from 0.345 to 0.757. But when we considered the effect of the year, the explained variance was much lower—from 0.009 to 0.303.

3.2. Genetic Structure of the Studied Populations/Lines

The PCA plot of the 157 genotypes is presented on Figure 1. Fifteen different areas are illustrated by different colors, and every wheat line on the PCA is labeled by a number (described in the legend). As it can be seen, samples from the same geographical areas are located next to each other with a few exceptions. In general, there are two main clusters on the PCA: the first forming an oval in the middle of the plot and the second, more dispersed, in the upper left corner. There are samples genetically close to each other, for instance, lines 195-3 and 196-1 (numbers 38 and 39 on the PCA) or a cluster of lines 190 6-1, 213-1, Belorusskaya-80, and Festivalnaya (numbers 36, 44, 62, and 67, respectively). The first 10 components of the PCA compose a usual slope as it was observed in many different studies (Supplementary Figure S10).

3.3. GWAS of the Concentrations of Seven Elements

We performed a GWAS for each element and each year separately with the consequent meta-analysis of the GWAS for two years. In total, seven univariate GWAS results were obtained. Genomic control lambdas varied from 0.98 to 1.12 (Supplementary Table S3), with the GWAS for Mn having the biggest lambda. QQ plots showed deviations from the expected distribution for all the elements, except for Ca. Probably, this might be a result of the meta-analysis where an increased sample size led to an increased power of the analysis (Supplementary Figures S11–S17). The combined Manhattan plot for all seven concentration levels is presented in Supplementary Figure S18. One SNP (BS00022069_51) was significantly associated (p-value < 4.01 × 10⁻⁷) with the level of K.

As the next step, we performed the analysis of four multivariate traits. The joint results for the univariate and multivariate GWASs are presented in Table 2. The joint Manhattan plot for multivariate traits is presented in Figure 2. Genomic control lambdas varied from 0.97 to 1.10 (Supplementary Table S4). QQ plots showed deviations from the expected distribution for all multivariate traits (Supplementary Figures S19–S22).

Four SNPs were significantly associated with at least one multivariate trait, with one of the SNPs (BS00022069_51) significantly associated with K levels. For more details, see Table 2.

3.4. Functional Annotation of Discovered Loci

We used the EnsemblPlants genome browser [32] to investigate the nearest to the discovered top SNPs genes. We determined the nearest gene for three loci. The top SNPs of these loci were located within coding exons of these genes (Table 2).

3.5. Replication of Known Loci

We replicated one known published locus associated with the Zn concentration (IAAV1375, p-value in meta-analysis 0.001065, in multivariate analysis in cluster Mg and Zn—0.0029 and in cluster with all elements—0.0020) discovered in [11] (Supplementary Table S5). In the mentioned as well in the current study, the IAAV1375 genotype CC was associated with an increased Zn concentration in grain.

4. Discussion

In this study, we performed a multivariate genome-wide association study on Russian lines. We replicated one already known locus (IAAV1375) and discovered four novel loci that reached the stringent Bonferroni-corrected genome-wide significance. One of these loci was detected both in the meta- and multivariate analyses, while three others were detected in the multivariate analysis only.

The know replicated locus IAAV1375 is located on the chromosome 5A, position 502220915 inside the intron of TraesCS5A02G291300 protein-coding gene. According to [11], this gene is a probable UDP-arabinose 4-epimerase 1. We searched for orthologs of this gene in PantherDB [34] and found numerous orthologs belonging to the oxidoreductase PANTHER protein class in different plants. Furthermore, by utilizing the PANTHER Family and PANTHER Subfamily classifications, we obtained a more specific description: PINORESINOL REDUCTASE-RELATED and NMRA DOMAIN-CONTAINING PROTEIN. Unfortunately, no additional information was available for any of the plants with orthologs of this gene.

Interestingly, according to the novel wheat genome assembly (RefSeq v1.0), locus IAAV1375 is located on the same chromosome as locus BS00022069_51, which was found in our discovery analysis. These loci are relatively distant from each other (79.3 Mb). However, they exhibit a small but significant nonzero linkage disequilibrium (R² = 0.09, D’ = 0.51). We can speculate that this chromosomal region (chromosome 5A, positions 502 Mb–582 Mb) may contain a cluster of genes related to the regulation of element concentrations.

For all revealed loci, we determined the nearest genes, and, surprisingly, three of them were located in protein-coding genes. Moreover, the coordinates of top SNPs (see Table 2) were located within coding exons. A significant SNP from the RFL_Contig6053_3082 locus was located in the gene TraesCS6A02G375400. According to the UniProt, this gene belongs to ENTH domain-containing proteins playing a role in endocytosis and cytoskeletal machinery. Another SNP from the Kukri_rep_c70864_329 locus was located within the TraesCS7A02G094800 protein-coding gene. This gene belongs to the CRAL_TRIO_N domain-containing proteins that has a spectrum of various functions. The third SNP from the IAAV8416 locus was located within the protein-coding gene TraesCS5B02G325400, but there is no information about its function. The fourth SNP from the BS00022069_51 locus was located within the protein-coding gene TraesCS5A02G384200, but there was also no information about its function.

We used Z-score-based meta-analysis in our pipeline. Consequently, the resulting effect had no scale, and we could interstate it only in terms of the effect’s direction. For five discovered loci (BS00022069_51, RFL_Contig6053_3082, Kukri_rep_c70864_329, IAAV8416, and IAAV1375), we specified positive alleles associated with concentrations of elements. For the BS00022069_51—allele, C was positively associated with the K concentration; for the RFL_Contig6053_3082—allele, T was positively associated with concentrations of Fe and Mn; for the Kukri_rep_c70864_329—allele, T was positively associated with the concentration of Fe and allele C was positively associated with the concentration of Ca; for the IAAV8416 allele, T was positively associated with the concentration of Mn; and for the IAAV1375 allele, C was positively associated with the concentration of Zn.

Multivariate analysis indeed increases the power of analysis. This approach is widely used in animal and human genetics but not in plants [20,21,22,23,24]. The multivariate approach helps not only to increase the power of analysis but also to control the directionality of the effect for several traits to avoid the opposite selection.

To conclude, we performed a multivariate genome-wide association study of seven concentrations of elements and discovered four new loci and replicated one known.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants12173019/s1, Figure S1: Distribution of 11 405 SNPs along the chromosomes of 149 wheat lines, Figure S2: Correlated element clusters determined by multivariate analysis, Figure S3: Distribution of Zn concentration in wheat grains in 2018 and 2019, Figure S3: Distribution of Zn concentration in wheat grains in 2018 and 2019, Figure S4: Distribution of Mg concentration in wheat grains in 2018 and 2019, Figure S5: Distribution of Mn concentration in wheat grains in 2018 and 2019, Figure S6: Distribution of Ca concentration wheat grains in 2018 and 2019, Figure S7: Distribution of Cu concentration wheat grains in 2018 and 2019, Figure S8: Distribution of Fe concentration in wheat grains in 2018 and 2019, Figure S9: Distribution of K concentration in wheat grains in 2018 and 2019, Figure S10: Percentage of genetic variation explained by the first 10 principal components, Figure S11: Q-Q plot in meta-analysis for Fe, Figure S12: Q-Q plot in meta-analysis for K, Figure S13: Q-Q plot in meta-analysis for Mg, Figure S14: Q-Q plot in meta-analysis for Mg, Figure S15: Q-Q plot in meta-analysis for Zn, Figure S16: Q-Q plot in meta-analysis for Cu, Figure S17: Q-Q plot in meta-analysis for Ca, Figure S18: Manhattan plot for GWAS meta-analysis of wheat lines dataset, Figure S19: Q-Q plot for a Mg and Zn cluster determined in multivariate analysis, Figure S20: Q-Q plot for a Fe and Mn cluster determined in multivariate analysis, Figure S21: Q-Q plot for a Ca and K cluster determined in multivariate analysis, Figure S22: Q-Q plot for all the elements considered jointly in multivariate analysis. Table S1: Dispersion values of element concentration in studied samples, Table S2: Proportion of the explained variance of elements concentrations considering effect of year and line separately, Table S3: Lambda values from GWAS meta-analysis, Table S4: Lambda values from multivariate GWAS, Table S5: Replication of loci from [11,13]. In green is locus with significant p-value.

Author Contributions

Conceptualization, Y.A.T. and E.A.S.; methodology, Y.A.T., N.A.P., A.N.T. and E.S.T.; software, N.A.P., A.N.T. and E.S.T.; validation, N.A.P., A.N.T. and E.S.T.; formal analysis, N.A.P., A.N.T. and E.S.T.; investigation, Y.A.T., N.A.P., E.A.S., I.N.L. and N.A.V.; resources, Y.A.T. and I.N.L.; data curation, N.A.P. and I.N.L.; writing—original draft preparation, N.A.P. and Y.A.T.; writing—review and editing, N.A.P., Y.A.T., E.A.S. and I.N.L.; visualization, N.A.P.; supervision, E.A.S. and Y.A.T.; funding acquisition, E.A.S., I.N.L. and Y.A.T. All authors have read and agreed to the published version of the manuscript.

Funding

This study was done with financial support of the Russian Science Foundation (project 23-16-00041, https://rscf.ru/project/23-16-00041).

Data Availability Statement

Data available on request.

Acknowledgments

Multiplication of seed material and field trials were carried out at the Collective Use Center of Plant Reproduction as part of the implementation of the budget project FWNR-2022-0017.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Walkowiak, S.; Gao, L.; Monat, C.; Haberer, G.; Kassa, M.T.; Brinton, J.; Ramirez-Gonzalez, R.H.; Kolodziej, M.C.; Delorean, E.; Thambugala, D.; et al. Multiple Wheat Genomes Reveal Global Variation in Modern Breeding. Nature 2020, 588, 277–283. [Google Scholar] [CrossRef]
Athiyannan, N.; Abrouk, M.; Boshoff, W.H.P.; Cauet, S.; Rodde, N.; Kudrna, D.; Mohammed, N.; Bettgenhaeuser, J.; Botha, K.S.; Derman, S.S.; et al. Long-Read Genome Sequencing of Bread Wheat Facilitates Disease Resistance Gene Cloning. Nat. Genet. 2022, 54, 227–231. [Google Scholar] [CrossRef] [PubMed]
Kumar, J.; Mishra, A.; Kumar, A.; Kaur, G.; Sharma, H.; Kaur, S.; Sharma, S.; Devi, K.; Garg, M.; Pandey, A.K.; et al. Whole Genome Re-Sequencing of Indian Wheat Genotypes for Identification of Genomic Variants for Grain Iron and Zinc Content. Mol. Biol. Rep. 2022, 49, 7123–7133. [Google Scholar] [CrossRef] [PubMed]
Liu, Z.; Xin, M.; Qin, J.; Peng, H.; Ni, Z.; Yao, Y.; Sun, Q. Temporal Transcriptome Profiling Reveals Expression Partitioning of Homeologous Genes Contributing to Heat and Drought Acclimation in Wheat (Triticum aestivum L.). BMC Plant Biol. 2015, 15, 152. [Google Scholar] [CrossRef]
Poretti, M.; Sotiropoulos, A.G.; Graf, J.; Jung, E.; Bourras, S.; Krattinger, S.G.; Wicker, T. Comparative Transcriptome Analysis of Wheat Lines in the Field Reveals Multiple Essential Biochemical Pathways Suppressed by Obligate Pathogens. Front. Plant Sci. 2021, 12, 720462. [Google Scholar] [CrossRef] [PubMed]
Prasad, G.; Mittal, S.; Kumar, A.; Chauhan, D.; Sahu, T.K.; Kumar, S.; Singh, R.; Yadav, M.C.; Singh, A.K. Transcriptome Analysis of Bread Wheat Genotype KRL3-4 Provides a New Insight Into Regulatory Mechanisms Associated With Sodicity (High PH) Tolerance. Front. Genet. 2022, 12, 782366. [Google Scholar] [CrossRef]
Sun, C.; Dong, Z.; Zhao, L.; Ren, Y.; Zhang, N.; Chen, F. The Wheat 660K SNP Array Demonstrates Great Potential for Marker-assisted Selection in Polyploid Wheat. Plant Biotechnol. J. 2020, 18, 1354–1360. [Google Scholar] [CrossRef]
Rimbert, H.; Darrier, B.; Navarro, J.; Kitt, J.; Choulet, F.; Leveugle, M.; Duarte, J.; Rivière, N.; Eversole, K.; on behalf of The International Wheat Genome Sequencing Consortium; et al. High Throughput SNP Discovery and Genotyping in Hexaploid Wheat. PLoS ONE 2018, 13, e0186329. [Google Scholar] [CrossRef]
Muhu-Din Ahmed, H.G.; Sajjad, M.; Zeng, Y.; Iqbal, M.; Habibullah Khan, S.; Ullah, A.; Nadeem Akhtar, M. Genome-Wide Association Mapping through 90K SNP Array for Quality and Yield Attributes in Bread Wheat against Water-Deficit Conditions. Agriculture 2020, 10, 392. [Google Scholar] [CrossRef]
Liu, J.; Wu, B.; Singh, R.P.; Velu, G. QTL Mapping for Micronutrients Concentration and Yield Component Traits in a Hexaploid Wheat Mapping Population. J. Cereal Sci. 2019, 88, 57–64. [Google Scholar] [CrossRef]
Velu, G.; Singh, R.P.; Crespo-Herrera, L.; Juliana, P.; Dreisigacker, S.; Valluru, R.; Stangoulis, J.; Sohu, V.S.; Mavi, G.S.; Mishra, V.K.; et al. Genetic Dissection of Grain Zinc Concentration in Spring Wheat for Mainstreaming Biofortification in CIMMYT Wheat Breeding. Sci. Rep. 2018, 8, 13526. [Google Scholar] [CrossRef] [PubMed]
Cu, S.T.; Guild, G.; Nicolson, A.; Velu, G.; Singh, R.; Stangoulis, J. Genetic Dissection of Zinc, Iron, Copper, Manganese and Phosphorus in Wheat (Triticum aestivum L.) Grain and Rachis at Two Developmental Stages. Plant Sci. 2020, 291, 110338. [Google Scholar] [CrossRef] [PubMed]
Wang, W.; Guo, H.; Wu, C.; Yu, H.; Li, X.; Chen, G.; Tian, J.; Deng, Z. Identification of Novel Genomic Regions Associated with Nine Mineral Elements in Chinese Winter Wheat Grain. BMC Plant Biol. 2021, 21, 311. [Google Scholar] [CrossRef]
Krishnappa, G.; Rathan, N.D.; Sehgal, D.; Ahlawat, A.K.; Singh, S.K.; Singh, S.K.; Shukla, R.B.; Jaiswal, J.P.; Solanki, I.S.; Singh, G.P.; et al. Identification of Novel Genomic Regions for Biofortification Traits Using an SNP Marker-Enriched Linkage Map in Wheat (Triticum aestivum L.). Front. Nutr. 2021, 8, 669444. [Google Scholar] [CrossRef] [PubMed]
Tong, J.; Zhao, C.; Sun, M.; Fu, L.; Song, J.; Liu, D.; Zhang, Y.; Zheng, J.; Pu, Z.; Liu, L.; et al. High Resolution Genome Wide Association Studies Reveal Rich Genetic Architectures of Grain Zinc and Iron in Common Wheat (Triticum aestivum L.). Front. Plant Sci. 2022, 13, 840614. [Google Scholar] [CrossRef]
Rathan, N.D.; Krishna, H.; Ellur, R.K.; Sehgal, D.; Govindan, V.; Ahlawat, A.K.; Krishnappa, G.; Jaiswal, J.P.; Singh, J.B.; Sv, S.; et al. Genome-Wide Association Study Identifies Loci and Candidate Genes for Grain Micronutrients and Quality Traits in Wheat (Triticum aestivum L.). Sci. Rep. 2022, 12, 7037. [Google Scholar] [CrossRef]
Krishnappa, G.; Khan, H.; Krishna, H.; Kumar, S.; Mishra, C.N.; Parkash, O.; Devate, N.B.; Nepolean, T.; Rathan, N.D.; Mamrutha, H.M.; et al. Genetic Dissection of Grain Iron and Zinc, and Thousand Kernel Weight in Wheat (Triticum aestivum L.) Using Genome-Wide Association Study. Sci. Rep. 2022, 12, 12444. [Google Scholar] [CrossRef]
Wang, J.; Shi, X.; Zhou, Z.; Qin, M.; Wang, Y.; Li, W.; Yang, P.; Wu, Z.; Lei, Z. Genetic Dissection of Grain Iron Concentration in Hexaploid Wheat (Triticum aestivum L.) Using a Genome-Wide Association Analysis Method. PeerJ 2022, 10, e13625. [Google Scholar] [CrossRef]
Morgounov, A.; Li, H.; Shepelev, S.; Ali, M.; Flis, P.; Koksel, H.; Savin, T.; Shamanin, V. Genetic Characterization of Spring Wheat Germplasm for Macro-, Microelements and Trace Metals. Plants 2022, 11, 2173. [Google Scholar] [CrossRef]
Hatoum, A.S.; Colbert, S.M.C.; Johnson, E.C.; Huggett, S.B.; Deak, J.D.; Pathak, G.A.; Jennings, M.V.; Paul, S.E.; Karcher, N.R.; Hansen, I.; et al. Multivariate Genome-Wide Association Meta-Analysis of over 1 Million Subjects Identifies Loci Underlying Multiple Substance Use Disorders. Nat. Ment. Health 2023, 1, 210–223. [Google Scholar] [CrossRef]
Macri, M.; Luigi-Sierra, M.G.; Guan, D.; Delgado, J.V.; Alvarez, J.F.; Amills, M.; Martínez, A.M. Univariate and Multivariate Genome-wide Association Studies for Hematological Traits in Murciano-Granadina Goats. Anim. Genet. 2023, 54, 491–499. [Google Scholar] [CrossRef] [PubMed]
Ruotsalainen, S.E.; Partanen, J.J.; Cichonska, A.; Lin, J.; Benner, C.; Surakka, I.; FinnGen; Reeve, M.P.; Palta, P.; Salmi, M.; et al. An Expanded Analysis Framework for Multivariate GWAS Connects Inflammatory Biomarkers to Functional Variants and Disease. Eur. J. Hum. Genet. 2021, 29, 309–324. [Google Scholar] [CrossRef]
Udosen, B.; Soremekun, O.; Kamiza, A.; Machipisa, T.; Cheickna, C.; Omotuyi, O.; Soliman, M.; Wélé, M.; Nashiru, O.; Chikowore, T.; et al. Meta-Analysis and Multivariate GWAS Analyses in 77,850 Individuals of African Ancestry Identify Novel Variants Associated with Blood Pressure Traits. Int. J. Mol. Sci. 2023, 24, 2164. [Google Scholar] [CrossRef] [PubMed]
Zlobin, A.S.; Nikulin, P.S.; Volkova, N.A.; Zinovieva, N.A.; Iolchiev, B.S.; Bagirov, V.A.; Borodin, P.M.; Aksenovich, T.I.; Tsepilov, Y.A. Multivariate Analysis Identifies Eight Novel Loci Associated with Meat Productivity Traits in Sheep. Genes 2021, 12, 367. [Google Scholar] [CrossRef]
Leonova, I.N.; Skolotneva, E.S.; Orlova, E.A.; Orlovskaya, O.A.; Salina, E.A. Detection of Genomic Regions Associated with Resistance to Stem Rust in Russian Spring Wheat Varieties and Breeding Germplasm. Int. J. Mol. Sci. 2020, 21, 4706. [Google Scholar] [CrossRef] [PubMed]
Kiseleva, A.A.; Shcherban, A.B.; Leonova, I.N.; Frenkel, Z.; Salina, E.A. Identification of New Heading Date Determinants in Wheat 5B Chromosome. BMC Plant Biol. 2016, 16, 8. [Google Scholar] [CrossRef]
Wang, S.; Wong, D.; Forrest, K.; Allen, A.; Chao, S.; Huang, B.E.; Maccaferri, M.; Salvi, S.; Milner, S.G.; Cattivelli, L.; et al. Characterization of Polyploid Wheat Genomic Diversity Using a High-density 90 000 Single Nucleotide Polymorphism Array. Plant Biotechnol. J. 2014, 12, 787–796. [Google Scholar] [CrossRef]
Purcell, S.; Neale, B.; Todd-Brown, K.; Thomas, L.; Ferreira, M.A.R.; Bender, D.; Maller, J.; Sklar, P.; De Bakker, P.I.W.; Daly, M.J.; et al. PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. Am. J. Hum. Genet. 2007, 81, 559–575. [Google Scholar] [CrossRef]
Yang, J.; Lee, S.H.; Goddard, M.E.; Visscher, P.M. GCTA: A Tool for Genome-Wide Complex Trait Analysis. Am. J. Hum. Genet. 2011, 88, 76–82. [Google Scholar] [CrossRef]
Willer, C.J.; Li, Y.; Abecasis, G.R. METAL: Fast and Efficient Meta-Analysis of Genomewide Association Scans. Bioinformatics 2010, 26, 2190–2191. [Google Scholar] [CrossRef]
Zhu, Z.; Zhang, F.; Hu, H.; Bakshi, A.; Robinson, M.R.; Powell, J.E.; Montgomery, G.W.; Goddard, M.E.; Wray, N.R.; Visscher, P.M.; et al. Integration of Summary Data from GWAS and EQTL Studies Predicts Complex Trait Gene Targets. Nat. Genet. 2016, 48, 481–487. [Google Scholar] [CrossRef] [PubMed]
Cunningham, F.; Allen, J.E.; Allen, J.; Alvarez-Jarreta, J.; Amode, M.R.; Armean, I.M.; Austine-Orimoloye, O.; Azov, A.G.; Barnes, I.; Bennett, R.; et al. Ensembl 2022. Nucleic Acids Res. 2022, 50, D988–D995. [Google Scholar] [CrossRef] [PubMed]
The UniProt Consortium. UniProt: The Universal Protein Knowledgebase. Nucleic Acids Res. 2017, 45, D158–D169. [Google Scholar] [CrossRef] [PubMed]
Thomas, P.G.; Campbell, M.J.; Kejariwal, A.; Thomas, P.; Karlak, B.; Daverman, R.; Diemer, K.; Muruganujan, A.; Narechania, A. PANTHER: A Library of Protein Families and Subfamilies Indexed by Function. Genome Res. 2003, 13, 2129–2141. [Google Scholar] [CrossRef]

Figure 1. PCA plot describing population structure of the studied wheat lines. Colors correspond to different geographical areas of the origin, and numbers on the PCA correspond to the line names provided in the legend.

Figure 2. Manhattan plot for GWAS meta-analysis of the wheat line datasets. Four significant SNPs (p-value < 4.01 × 10⁻⁷) are located on the 5B, 6A, 7A, and Unknown chromosomes (for more details, see Table 2).

Table 1. Heritability (h²) and standard error (se, in parentheses) for each of the traits in studied wheat lines.

Element	2018	2019
Ca	0.66 (0.052)	0.51 (0.059)
Cu	0.80 (0.034)	0.80 (0.032)
Fe	0.87 (0.024)	0.75 (0.040)
K	0.72 (0.048)	0.838 (0.030)
Mg	0.74 (0.043)	0.55 (0.058)
Mn	0.83 (0.029)	0.79 (0.033)
Zn	0.83 (0.028)	0.67 (0.047)

Table 2. GWAS results for the meta-analysis and multivariate analysis (p-value < 4.01 × 10⁻⁷).

Marker Name	Chromosome	Positive/Negative Alleles	Position	The Most Significant p-Value	Variance Explained by SNP, %	Significant Traits	The Nearest Gene
BS00022069_51	Unknown *	A/C	582104154	5.40 × 10⁻⁹	5.75	Ca and K, K	TraesCS5A02G384200 **
RFL_Contig6053_3082	6A	C/T	597790267	3.05 × 10⁻⁸	5.22	Fe and Mn	TraesCS6A02G375400
Kukri_rep_c70864_329	7A	C/T **	57873213	3.76 × 10⁻⁸	5.18	All traits	TraesCS7A02G094800
IAAV8416	5B	T/C ***	509487915	4.13 × 10⁻⁸	5.22	Fe and Mn	TraesCS5B02G325400

* According to novel assembly RefSeq v1.0, this locus is located on 5A chromosome. ** For Ca, positive allele is T and negative is C. *** For Mn, positive allele is C and negative is T.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Potapova, N.A.; Timoshchuk, A.N.; Tiys, E.S.; Vinichenko, N.A.; Leonova, I.N.; Salina, E.A.; Tsepilov, Y.A. Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines. Plants 2023, 12, 3019. https://doi.org/10.3390/plants12173019

AMA Style

Potapova NA, Timoshchuk AN, Tiys ES, Vinichenko NA, Leonova IN, Salina EA, Tsepilov YA. Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines. Plants. 2023; 12(17):3019. https://doi.org/10.3390/plants12173019

Chicago/Turabian Style

Potapova, Nadezhda A., Anna N. Timoshchuk, Evgeny S. Tiys, Natalia A. Vinichenko, Irina N. Leonova, Elena A. Salina, and Yakov A. Tsepilov. 2023. "Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines" Plants 12, no. 17: 3019. https://doi.org/10.3390/plants12173019

APA Style

Potapova, N. A., Timoshchuk, A. N., Tiys, E. S., Vinichenko, N. A., Leonova, I. N., Salina, E. A., & Tsepilov, Y. A. (2023). Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines. Plants, 12(17), 3019. https://doi.org/10.3390/plants12173019

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multivariate Genome-Wide Association Study of Concentrations of Seven Elements in Seeds Reveals Four New Loci in Russian Wheat Lines

Abstract

1. Introduction

2. Material and Methods

2.1. Wheat Lines

2.2. Genotyping and Quality Control

2.3. Phenotyping

2.4. GWAS Analysis

2.5. Functional Annotation

2.6. Verification of Known Loci

3. Results

3.1. Concentration of Elements in Grain

3.2. Genetic Structure of the Studied Populations/Lines

3.3. GWAS of the Concentrations of Seven Elements

3.4. Functional Annotation of Discovered Loci

3.5. Replication of Known Loci

4. Discussion

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI