Identification of New QTLs for Dietary Fiber Content in Aegilops biuncialis

Grain dietary fiber content is an important health-promoting trait of bread wheat. A dominant dietary fiber component of wheat is the cell wall polysaccharide arabinoxylan and the goatgrass Aegilops biuncialis has high β-glucan content, which makes it an attractive gene source to develop wheat lines with modified fiber composition. In order to support introgression breeding, this work examined genetic variability in grain β-glucan, pentosan, and protein content in a collection of Ae. biuncialis. A large variation in grain protein and edible fiber content was revealed, reflecting the origin of Ae. biuncialis accessions from different eco-geographical habitats. Association analysis using DArTseq-derived SNPs identified 34 QTLs associated with β-glucan, pentosan, water-extractable pentosan, and protein content. Mapping the markers to draft chromosome assemblies of diploid progenitors of Ae. biuncialis underlined the role of genes on chromosomes 1Mb, 4Mb, and 5Mb in the formation of grain β-glucan content, while other QTLs on chromosome groups 3, 6, and 1 identified genes responsible for total- and water-extractable pentosan content. Functional annotation of the associated marker sequences identified fourteen genes, nine of which were identified in other monocots. The QTLs and genes identified in the present work are attractive targets for chromosome-mediated gene transfer to improve the health-promoting properties of wheat-derived foods.


Introduction
Bread wheat (Triticum aestivum L., 2n = 6x = 42; AABBDD) is grown on more than 220 million hectares worldwide and is one of the most important components of the human diet. Due to its major role in human nutrition, wheat is a major source of dietary fiber (DF). DF has numerous health-promoting effects, including lowering the risk of coronary heart disease, colorectal cancer, inflammatory bowel disease, breast cancer, tumor formation, and mineral-related abnormalities [1]. The major DF components of the wheat grain are the cell wall polysaccharides, arabinoxylan (AX) and (1-3)(1-4)-β-D-glucan (β-glucan, BG), which account for about 70 and 20%, respectively, of the total cell wall polysaccharides in the starchy endosperm (and hence in white flour) [2]. DF components of wheat also affect age of the entire genome and enable efficient detection of QTLs. DArTSeq platform has already been used for whole-genome profiling in Aegilops species, including Ae. taushii [42] and Ae. sharonensis [43].
Ivanizs et al. [18] characterized phenotypic variation in heading time and genetic diversity in a collection of Ae. biuncialis accessions with geographically diverse origins using Silico-DArT markers. More recently, the complete chromosome set of Ae. umbellulata and Ae. comosa [44] was flow-sorted and sequenced by Illumina technology to generate chromosome-specific de novo draft sequence assemblies for the U-and M-genomes [45]. The chromosome-specific genomic resources thus obtained allowed us to determine the chromosomal location of orthologous genes and to develop markers for Aegilops chromosomes [45].
Motivated by the need to discover accessions suitable for interspecific hybridization programs and to identify QTLs for targeted transfer of high fiber and/or protein content, this work focused on the evaluation of phenotypic variability of β-glucan, total-and waterextractable pentosan, and protein content in a collection of 86 genotypes of Ae. biuncialis. Using the allelic composition of DArTSeq derived SNP markers in combination with a sequence similarity search on the U-and M-chromosome assemblies, we characterized the Aegilops genotypes to find markers and genomic regions closely associated with the desired quality traits.

Characterization of Quality Traits
Grain samples from single plots were analyzed to determine the variation in dietary fiber components, β-glucan (BG) and arabinoxylan, measured as total-(TP) and waterextractable pentosans (WEP) as well as the protein content of Ae. biuncialis accessions and the control wheat line Mv9kr1. As three Aegilops accessions produced a low quantity of grains, the quality traits were only determined in 83 genotypes. The accessions were chosen to represent a wide geographical distribution of Ae. biuncialis, extending from the Balkans to the Caspian Sea. As a result, a large variation was found for all traits that were analyzed.
The BG content of the Aegilops collection was significantly higher than that of the Mv9kr1 wheat (Table 1, Figure S1), with the Aegilops accessions exhibiting an average 38.1 mg/g BG content, ranging between 22.7 mg/g and 54.9 mg/g, and Mv9kr1 wheat line exhibiting much lower value (9.44 mg/g). Although there was no difference in total pentosan content between the Aegilops accessions (40.11 mg/g) and wheat (40.17 mg/g), the Aegilops collection showed a wide phenotypic variation (29.60 mg/g-50.77 mg/g). While the average content of waterextractable pentosan in Aegilops genotypes (10.82 mg/g) did not differ considerably from that of Mv9kr1 wheat (10.79 mg/g), a wide range (6.83 mg/g-15.42 mg/g) was observed in Ae. biuncialis. In terms of protein content, Mv9kr1 wheat line (12.91%) was significantly outperformed by all Ae. biuncialis accessions (by a mean value of 26.61% and a range from 19.61% to 33.49%). In summary, all Ae. biuncialis genotypes had significantly higher grain BG and protein content than Mv9kr1 wheat, as well as a high variation in the pentosan fractions.
To better understand the effect of genotype and environmental factors on grain composition, the two-year dataset of the 83 Ae. biuncialis accessions were investigated. Each year, the frequency distribution of the traits was studied separately. Because all of the analyzed traits had a normal distribution, their polygenic nature was confirmed ( Figure S2). Although genotype (G) and environment (E) had a significant effect on β-glucan content (Table 2), the G factor explained a greater proportion (77%) of the variance (Figure 1). This was supported by its high heritability (0.93).  The TP content did not differ significantly between seasons, indicating that the environment had no significant effect on this trait (Table 2). On the other hand, genotype had a strong influence on the TP and WEP content, accounting for approximately one-third of the total variance, 37% and 43%, respectively ( Figure 1). Both pentosan fractions were found to have a high degree of heritability (0.54 and 0.61, respectively).
The analysis of protein content revealed that the G × E interaction determined 83% of the total variance (Figure 1), indicating that Aegilops genotypes responded differently to different environmental conditions. That is most likely the reason for the low heritability (0.27) values and the fact that G accounted for only 16% of the total variance. Nonetheless, for the genome-wide association analysis, all of the quality traits were used.

Population Structure
After excluding accessions with failed DNA amplification, monomorphism, missing data, and allele frequencies that differed from what was described in the methods section, a total of 2602 DArTSeq derived SNP tags were used for whole-genome profiling (the population analysis and the GWAS) in the set of 86 Ae. biuncialis accessions.
To classify genotypes into subpopulations based on genetic similarity, the relatedness of the Aegilops accessions was studied using the Bayesian approach implemented in STRUCTURE, which provided the ∆K plotted against the K numbers of the subgroups. The optimal number of subpopulations was determined using the STRUCTURE Harvester software. The maximum ∆K (686.70) occurred at K = 2 ( Figure 2), resulting in the identification of two groups in the Aegilops collection. Based on DArTSeq-derived SNP data, Ae. biuncialis collection was divided into two subpopulations: subpopulations 1 and 2 ( Figure 3).  The majority of Aegilops genotypes (63 accessions) were assigned to subpopulation 1, with a Q1 mean membership of 0.85. The remaining 23 accessions were assigned to subpopulation 2, with a Q2 mean of 0.96. Based on the genetic data of the SNP-DArT markers, an unrooted phylogenetic tree was constructed using the maximum likelihood method to group accessions into clades ( Figure 4). The accessions of Ae. biuncialis were classified into two main subpopulations, which corresponded to the grouping pattern obtained from the STRUCTURE analysis (Figures 3 and 4). To identify the genetic structure within the Ae. biuncialis population, principal coordinate analysis (PCoA) ( Figure 5) was performed on the dataset of SNP-DArT markers obtained from 86 Aegilops genotypes. The first three coordinates explained 14.5%, 7.1%, and 4.8% of the variation, respectively, accounting for a total of 26.4%. The collection of Ae. biuncialis could be divided into two major groups, which fitted well with the subpopulations revealed by the Bayesian statistical method.
The three statistical approaches yielded consistent results for the genetic diversity of Ae. biuncialis population, in which the Aegilops genotypes were grouped into two subpopulations based on their geographic distribution. The accessions of Ae. biuncialis in subpopulation 1 come from the Peloponnese, Asia Minor, Azerbaijan, and the Near East, while the 23 members of the other subpopulation originate from the Balkans and North Africa ( Figure 6).  . Geographic distribution of Ae. biuncialis accessions. Subpopulation 1 is represented by green symbols, while subpopulation 2 is represented by yellow symbols, as determined by three statistical methods. Circles represent accessions with known geographic locations. If only the country of origin is known, a square representing the capital is used (no information was available for six accessions).

Determination of Marker-Trait Associations
Genome-wide association study (GWAS) was conducted during two seasons using the MLM (Q+K) model, allowing the identification of 34 QTLs representing regions associated with β-glucan, total-pentosan, water-extractable pentosan, and protein content ( Table 3). As three accessions could not be characterized for quality traits, 83 accessions were analyzed by GWAS. Primary to the GWAS analysis, the estimation of heredity divergence within the intended germplasm was determined through linkage disequilibrium (LD) calculation. LD estimated using the squared-allele frequency correlations (r 2 ), and based on genome-wide DArTSeq derived SNP markers was low with a mean r 2 = 0.049. The information about the LD of DArTSeq derived SNP markers has been presented in Figure S3. A marker-trait association (MTA) was considered significant when one or more markers had a LOD value greater than three. QQ plot and Manhattan plot were generated for individual traits based on the MLM and reported in Figures S4 and S5, respectively. WEP content was found to be involved in most MTAs, followed by grain protein content, whereas BG and TP contents were involved in three MTAs. Some QTLs, such as QTL-1, 2, 4, 5, 7, 8, and 12, were linked to multiple traits, and the multiple associations involved protein and fiber traits. Table 3. Marker-trait associations, DArTSeq markers, marker locations on the chromosomes of Aegilops (UM) and wheat (ABD), and p values on chromosomal regions associated with dietary fiber trait components [β-glucan (BG), total-pentosan (TP) and water-extractable pentosan (WEP)], and grain protein content. GWAS analysis was performed for each individual year, considering all four quality traits estimated for both years as a covariate, only significant QTLs in both environments were reported.

Chromosomal Location and Functional Annotation of the QTLs
To determine the chromosomal location of the marker-trait associations, sequences of thirty-four SNP-DArT markers were aligned to the wheat pseudomolecules (Appels et al., 2018) and to the assemblies of chromosomes 1U-7U and 1M-7M, which were obtained by sequencing chromosomes flow-sorted from diploid Ae. umbellulata and Ae. comosa, respectively [45]. Using two best hits, we determined chromosomal positions of 32 markers in Aegilops U or M genomes and hexaploid wheat A, B, and D genomes. Twenty-five (78.1%) of the 32 markers were found on the same homoeologous group chromosomes in Aegilops as in wheat (Table 3).
A sequence similarity search was also performed using the sequences of SNP-DArT markers to predict genes associated with MTAs (Table S2). Wheat gene sequences were obtained from the GrainGenes database (https://wheat.pw.usda.gov, accessed on 30 October 2021) and used for the NCBI BLAST search. The analysis identified fourteen genes, nine of which encode putative ripening-related protein 6 associated with the QTL for WEP and protein; 1-deoxy-D-xylulose-5-phosphate synthase 1, glutamate receptor 2.8-like, endoglucanase 5-like, and flavonol synthase/flavanone 3-hydroxylase-like, all co-located with WEP QTL; glutathione S-transferase 3-like linked to β-glucan, WEP and protein loci (Table 3).

Discussion
Alien introgression breeding is a promising approach to modifying the dietary fiber composition of bread wheat. In this work, we used GWAS for the first time to identify QTL regions determining dietary fiber composition in the wild wheat relative Ae. biuncialis. Suitable crossing partners with superior compositional traits, trait-related genomic data, and molecular tools obtained from the diversity scan of dietary fibers in a DArTSeqcharacterized collection of Ae. biuncialis accessions will facilitate the chromosome-mediated improvement of bioactive components in wheat.

Grain Composition of Ae. biuncialis
Grain β-glucan, TP, WEP, and protein content in Ae. biuncialis accessions and Mv9kr1 wheat line as determined in the present study were similar to those estimated earlier when the effect of added Aegilops chromosomes on the grain composition of wheat was analyzed [31,33]. Diversity analysis of 43 wild and cultivated diploid, tetraploid and hexaploid wheat accessions demonstrated that β-glucan is the dominant form of cell wall polysaccharides in Aegilops species with U and M genomes.
We found a considerable level of variation in the compositional traits of the Aegilops collection, which enabled association analysis and mapping. The analysis of variance revealed that the G was the main factor controlling most of the studied traits. However, the G × E studies found that traits related to cell wall polysaccharides, particularly the β-glucan content, had a high level of heritability, making these traits suitable for identifying marker-trait associations via GWAS analysis. On the other hand, protein content had low heritability, implying that the marker-trait associations found for this parameter could be misleading and need to be supported by future experiments.
There have been numerous studies on the effects of G and E on the edible fiber content of common wheat [46][47][48][49], but only a limited amount of information on the variation of grain compositional traits in Aegilops species has been published. Marcotuli et al. [32] investigated grain BG content and kernel weight in a panel of Triticum and Aegilops species. Similar to our study, a two-year analysis revealed a dominant role of the genotype in the variation of BG content.
Our systematic analysis for grain compositional traits identified Ae. biuncialis accessions with high BG content. The accessions TA2074 and MvGB381 had the highest level of BG, about 500% higher than the BG level of the control wheat line, pointing to these Ae. biuncialis genotypes as crossing partners of choice for chromosome-mediated modification of dietary fiber composition in bread wheat.

Genetic Diversity of the Ae. biuncialis Collection
There are only a few reports on the genetic diversity of Ae. biuncialis. Based on AFLP markers, Ae. biuncialis accessions from the two regions of the Iberian Peninsula were clustered in two groups, which correlated with their geographic origin [50]. Ae. biuncialis genotypes from the eastern side of the Black Sea and the western coast of the Caspian Sea were classified into two subgroups using RAPD markers, which corresponded with the clusters from the two areas of Transcaucasia [51]. Genetic diversity of Ae. geniculata, another tetraploid U-and M-genome species closely related to Ae. biuncialis, was characterized by AFLP markers [52], where a complete set of 411 Ae. geniculata individuals were grouped into seven subpopulations with a high level of correlation with geographic distribution in the Mediterranean Basin.
Using the Bayesian approach and PCoA for the SNP-DArT marker data, the present study identified two subpopulations with different geographic distributions in the Ae. biuncialis collection: subpopulation 1 comprised 63 genotypes from the Peloponnese, Asia minor, Azerbaijan, and the Near East, while subpopulation 2 originated primarily from the Balkans and North Africa. An earlier diversity analysis of the same Ae. biuncialis collection made by 32,700 dominant Silico-DArT markers identified five groups (A-E), which correlated with their geographic origin: A (North Africa), B (Balkans and West Turkey), C (Balkans and Near East), D (Peloponnese, Asia Minor, and Crimea) and E (Near East and Azerbaijan) [18]. Although a different number of subpopulations was identified depending on the type of markers used (Silico-vs. SNP-DArTseq), the spatial pattern of genetic diversity of Ae. biuncialis population showed high similarity. Based on co-dominant SNP markers, the accessions assigned previously to clusters D and E were grouped into subpopulation 1, while the genotypes belonging to clusters A and B were classified as subpopulation 2 in the present study ( Figure 5). Genotypes of Ae. biuncialis originating from North Africa and the Balkans (former subpopulations A and B) were clustered into one group (subpopulation 2) in the present study, suggesting that these genotypes share a common phylogenetic origin. Subpopulation 1 included genotypes previously classified into clusters D (Peloponnese, Asia Minor, and Crimea) and E (Near East and Azerbaijan) also indicating a closer phylogenetic relationship. A majority of Aegilops accessions from former cluster D have a mixed genome shared in different proportions by subpopulations 1 and 2, indicating a significant admixture of their ancestral genotypes. Because genotypes in subpopulation 1 are mostly found in southern Greece and Turkey, while most of the accessions in subpopulation 2 are originating from the Balkans and the western part of Turkey, their natural distribution areas may overlap in the Aegean region and Asia Minor, allowing intraspecific hybridizations to occur. The analysis of chloroplast DNA could provide more information about the genealogical relationship of subpopulations and the intraspecific evolution of Ae. biuncialis [53].
Pairwise LD measured by r 2 based on DArTSeq derived SNP markers was low (mean r 2 = 0.049). This is expected as in self-pollinated species, such Arabidospis or barley, LD can extend up to 10-30 kb and 212 kb, respectively [54,55]. LD, in fact, results from the interplay of many factors, such as selection, which causes locus-specific bottlenecks and it is one of the factors that increase LD between selected alleles at linked loci [56]. LD estimates provided here suggest the suitability of the markers used to perform GWAS and the success of the analysis.

GWAS and Candidate Gene Identification
Using the allelic distribution of SNP markers in the collection of Ae. biuncialis, 34 marker-trait associations were found, identifying QTL regions for grain BG, TP, WEP, and protein content. The present study also demonstrated the utility of chromosome genomics in facilitating genome analysis of wild gene source species with poorly studied genomes [57]. Because the segregating genetic map for Ae. biuncialis was not available, we used a sequence similarity search against the assemblies of individual chromosomes from the U and M genomes of Ae. umbellulata and Ae. comosa, respectively [45], to locate the associated markers on chromosomes of Aegilops. However, the chromosomal position of the QTLs established could be affected by the purity of the flow-sorted M-genome chromosome fractions that were sequenced [44,45].
The present study identified five QTLs (QTL3, 6, 16, 23, and 34) on Aegilops chromosomes 1M and 4M, while the same markers were found on wheat group 1 chromosomes. Previous comparative studies of wheat and diploid Aegilops species based on the mapping of conserved orthologous genes using single-gene FISH [45] and COS markers [44,58] revealed high macrosynteny of M-genome chromosomes with wheat homoeologs. Identification of the associated markers on different homoeologous groups in Ae. comosa (4M) and wheat (group 1) could be due to cross-contamination of chromosome 4M fraction by 1M chromosomes during flow-sorting. This explanation is supported by the fact that the 4M and 1M chromosome populations were detected close to each other on the bivariate flow karyotype of Ae. comosa [44,45]. Thus, we concluded that the most likely chromosome location of these markers was on the M-genome chromosome, which corresponded to the wheat homoeologous group identified by the BLAST analysis.
We mapped three QTLs related to grain BG content on chromosomes 1M, 4M, and 5M. Although chromosome 4M was not represented in the set of addition lines previously investigated by Rakszegi et al. [31], their results indicated that Aegilops group 5 chromosomes significantly affects cell wall polysaccharide synthesis and can modify the grain BG, TP, and WEP content in wheat. Our results suggest that genes on chromosomes 1M and 4M may also be required for BG synthesis in Ae. biuncialis, and that a transfer of chromosome 5M alone is insufficient to increase the wheat BG content to that of Aegilops.
Two (QTLs 2 and 3) of the three QTLs associated with BG content were identified on the Aegilops chromosomes of 1M and 5M, which is consistent with previous studies made on durum wheat [62] indicating the group 1 and 5 chromosomes have also QTLs affecting BG synthesis. Moreover, two members of the cellulose synthase-like gene family (CslF9 and CslF7) involved in β-glucan biosynthesis were assigned to the same homoeologous group chromosomes (groups 1 and 5) in barley and hexaploid wheat [31,63,64]. The third marker-trait association (QTL 1) was localized on chromosome 4M. Because Csl gene family members have not been identified on group 4 chromosomes in barley or wheat, it is less likely that QTL 1 on the structurally similar chromosome 4M contains a Csl gene variant in Ae. biuncialis. On the other hand, functional annotation revealed that the marker associated with QTL 1 co-located with a glutathione S-transferase 3-like gene (GTS3L), which is involved in various metabolic pathways with a wide range of substrates, including α,β-unsaturated carbonyls [65]. As the same QTL was not only associated with the BG level but also protein and WEP content, we may conclude that QTL1 affects the metabolism of cell wall polysaccharides in general.
The water-extractable pentosan content was associated with the highest number of QTLs (24 of the 30) related to the cell wall polysaccharide synthesis. Among them, 11 QTLs affected positively the edible fiber content, with group 1 (2 QTLs), group 3 (4 QTLs), group 6 (4 QTLs), and group 4 (1 QTLs) chromosomes being over-represented. This is consistent with earlier findings in durum and bread wheat. Thus, a QTL strongly affecting the WEP content was found on chromosome 1B of hexaploid wheat [48], which represented 59% of its phenotypic variance [66]. Using genome-wide association and meta-QTL analyses on multiple mapping populations seven QTLs for grain dietary fiber content were identified on chromosomes 1B, 3A, 3D, 5B, 6B, 7A, and 7B in durum-and bread wheat [62,67]. The transcriptome analysis identified 73 candidate genes that control the edible fiber content of bread wheat. Three major genes responsible for grain fiber content were detected on chromosomes 1B, 3A, 3D, and 6B showing co-localization with seven previously identified QTLs [67].
Glycosyltransferase gene families (GT) play an important role in the regulation of AX biosynthesis [68], with homoeoalleles of the GT47 gene encoding a subunit of β-1,4 xylan synthase located on the group three chromosomes in hexaploid wheat [69,70], and two alleles of GT61 encoding α-1,3-arabinosyltransferase located on chromosomes 1B and 6A [71]. The presence of putative orthologs of wheat genes (GT47 and GT61) on the 1B, 3ABD, and 6A chromosomes correlated well with the current result that the majority of QTLs associated with WEP content were identified on the same homoeologous groups of Aegilops chromosomes.
Our BLAST analysis identified several genes in QTLs associated with pentosan content. Five QTLs on chromosomes 1M, 3M, and 6M co-located with candidate genes encoding enzymes directly involved in carbohydrate or pentosan metabolism: endoglucanase 5-like, which hydrolyzes substrates with (1->4)-beta-glucosyl linkages, such as xyloglucan [72]; soluble starch synthase, one of the enzymes in the starch biosynthetic network [73], glycoside hydrolase/deacetylase, catalyzing glycolytic cleavage of O-glycosidic bonds, particularly those between two glucose residues in some polysaccharides [74]; glycosyltransferase, a key enzyme involved in the production of diverse carbohydrate-containing structures, and cell surface glycans [75].
The association of QTLs related to WEP content and candidate genes influencing polysaccharide metabolism emphasizes the important role of genes on chromosomes 1M, 3M, and 6M in the biosynthesis of cell wall components in Ae. biuncialis.

Plant Material
A set of 86 Aegilops biuncialis Vis. (2n = 4x = 28, U b U b M b M b ) accessions, originating from a wide range of ecological habitats and representing the geographic distribution of the species, was previously genotyped using the DArTSeq platform [18]. The accessions were provided by various germplasm collections across the world, as previously detailed by Ivanizs et al. [18]. The Aegilops accessions were maintained and propagated by the Cereal Genebank in Martonvásár. Hexaploid winter wheat (Triticum aestivum L.) line Mv9kr1, which contains a recessive crossability gene kr1 [76], was used as a crossing partner in the introgression breeding programs at Martonvásár [77] and was also used in this work.

Field Trial
The diverse collection of Ae. biuncialis was grown during two growing seasons (2015-16 and 2016-17) in an experimental area (Breeders nursery, Martonvásár, Hungary, geographic coordinates: 47 • 19 39 N, 18 • 47 01 E) owned by the Agricultural Institute, Centre for Agricultural Research. Each genotype was sown in 6 × 3 m rows (50 grains per row, 0.15 m row spacing) in three replicates (i.e., 2 rows per replicate) using randomized complete blocks design in chernozem soil as previously described by Ivanizs et al. [18]. Ae. biuncialis accessions were harvested manually in each replicate, where the grain mixture of replicates 1 and 2 or replicates 2 and 3 were considered as two biological samples for each genotype.

Quantitative Determination of Grain Composition
Grain composition analysis was performed on grain samples from two growing seasons (2015- 16 and 2016-17). Since three genotypes could not be maintained in field trials, 83 Ae. biuncialis accessions were characterized for quality traits (protein, β-glucan, and pentosans) together with the wheat genotype Mv9kr1. For each genotype, measurements were performed on two biological and two technical replications. Whole grain flour was prepared from 4 grams of grain per sample using a Retsch Mixer Mill MM 400 ball mill (Retsch, Haan, Germany). Wholemeal samples were immediately refrigerated and stored at −20 • C before use. Crude protein content was determined by the Dumas method in accordance with ICC Standard Method No. 167 [78] using the Elementar Rapid N III instrument (Elementar Analysensysteme GmbH, Langenselbold, Germany). Arabinoxylan content, measured as total pentosans (TP) and water-extractable pentosans (WEP), was determined using the colorimetric method following Douglas [79] with minor modifications [80], as reported by Rakszegi et al. [33]. Total mixed-linkage β-glucan content was determined in wholemeal samples using a Megazyme kit (Megazyme, Bray, Ireland) (ICC Standard Method No. 166 [81]; AACC Method No. 32-23.01 [82] as described by Rakszegi et al. [33]. Raw data for edible fiber and protein content measurements are included in Table S1.

QTL and Candidate Gene Detection
Genomic DNA was extracted from young leaves of 86 accessions of Ae. biuncialis using a QuickGene-Mini80 device (FujiFilm, Osaka, Japan) and a QuickGene DNA tissue kit (FujiFilm) according to the manufacturer's instructions. DNA samples of the Ae. biuncialis accessions were genotyped utilizing the DArTseq™ 1.0 array platform by Diversity Arrays Technologies Pty. Ltd. (University of Canberra, Australia). A total of 77,376 SNP-DArT markers were obtained after genome complexity reduction and subsequent SNP calling. Before performing GWAS, markers with a minimum allele frequency of less than 10% and those with more than 5% missing data points were removed from the data matrix using GenAlEx [83,84]. After filtering marker data, a total of 2602 SNPs were obtained and used for population structure analysis.
To determine the population structure, three different approaches were used: a principal coordinate analysis (PCoA) was performed using GenAlEx [83,84]; Bayesian clustering program STRUCTURE version 2.3.4 [85][86][87][88] was used to determine the population struc-ture per se, using an admixture model with correlated allele frequencies, with the number of subpopulations (K) from 1 to 10, burn-in period of 10 × 1003 iterations followed by 10 × 1003 Markov chain Monte Carlo (MCMC) iterations; unrooted Bayesian tree was created with TASSEL 5.2.70 software [89] using the bootstrap method. TASSEL 5.2.70 [89] was also used to identify significant associations with the grain quality traits. GWAS was carried out using Mixed Linear Model (MLM), which accounts for the Q matrix and kinship matrix (K). MLM-(Q + K), correlation coefficient (R2), and marker effect of each SNP associated with β-glucan, pentosan, water-extractable pentosan, and protein content were estimated. To provide the adjusted p values, the false discovery rate (FDR) was calculated, using a threshold of <5%, with the q-value package [90] in the R version 3.1.1 [91].
Functional annotations were performed using translated sequences from known wheat genes. Gene Ontology information was obtained from the Universal Protein Resource (ftp://ftp.uniprot.org, accessed on 21 May 2021; UniProt release 2019_11) database, and protein domains were identified using the Hidden Markov Model (HMM)-based HMMER 3.0 software package (http://eddylab.org/software/hmmer/, accessed on 21 May 2021) [93] in Pfam 32.0 entries (ftp://ftp.ebi.ac.uk, accessed on 21 May 2021) [94]. The results of BLASTn search against the chromosome sequences of wheat, Ae. comosa, and Ae. umbellulata, the protein collection extracted from the database (PFAM), and gene ontology information (GO) are summarized in Table S2. Finally, using the SNP sequences BLASTed against the monocot sequences annotated in NCBI, putative candidate genes associated with BG, TP, WEP, and protein content were identified. To identify Aegilops sequences, each "gene" or "gene family" name was searched for in the CAZy database [95]. To characterize variation in expression data in different experiments, all retrieved protein sequences were BLASTed against the PlantGDB database (http://www.plantgdb.org/cgi-bin/prj/PLEXdb/ProbeMatch.pl, accessed on 21 May 2021).

Statistical Analyses
Grain quality trait parameters were measured with two replications in two parallel samples per accession. The measurement was repeated two more times when the difference between the results of duplicated analyses exceeded 10%. For statistical evaluation, Linear Mixed Model analysis (using the restricted likelihood algorithm, REML) was applied for each compositional quality trait using SPSS 16.0 software (SPSS Inc., Chicago, IL, USA) as described by Virk et al. [96]. For all genotypes (G), two growing periods were considered as environments (E), with the E and the G being fixed factors. Multiple measurements of each genotype (two parallel samples with two replications) were regarded as the random factor (R). For each quality parameter, genotypic variance, repeatability, and variance of G × E interaction were calculated. The broad-sense heritability was evaluated based on the proportion of genotypic to phenotypic variance (h2 = VG/VP) using the following formula: VP = VG + (VG × E/2) + (Verror/2*4), where VP is phenotypic variance, VG is genotypic variance, VG × E is the variance of G × E interaction, and Verror e is variance error.

Conclusions
A genome-wide association approach was employed for the first time in Ae. Biuncialis, and revealed large genetic variation in grain edible fiber and protein content. BG, TP, and WEP content showed a high level of inheritance, making them promising traits to study a relationship between genotype and phenotype. Our GWAS analysis suggested that loci on chromosomes 1M b , 4M b , and 5M b affect grain BG content, whereas chromosome groups 3, 6, and 1 contain a majority of QTLs responsible for TP and WEP content. Aegilops orthologs for genes found in these QTLs may be transferred from selected accessions and pyramided in bread wheat to achieve improved grain quality. The present study will facilitate more efficient use of Ae. biuncialis in pre-breeding programs to develop wheat varieties with specially modified fiber content and improved health benefits.  COST is a funding agency for research and innovation networks. COST Actions help connect research initiatives across Europe and enable scientists to grow their ideas by sharing them with their peers-thus boosting their research, career and innovation.

Conflicts of Interest:
The authors declare no conflict of interest.