Next Article in Journal
Bacillus thuringiensis Effect on the Vegetative Development of Cotton Plants and the Biocontrol of Spodoptera frugiperda
Next Article in Special Issue
Conversion and Validation of Uniplex SNP Markers for Selection of Resistance to Cassava Mosaic Disease in Cassava Breeding Programs
Previous Article in Journal
Dose Expression for Pesticide Application in Citrus: Influence of Canopy Size and Sprayer
Previous Article in Special Issue
Physiological and Biochemical Basis of Faba Bean Breeding for Drought Adaptation—A Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Differential Gene Expression Analysis of Wheat Breeding Lines Reveal Molecular Insights in Yellow Rust Resistance under Field Conditions

1
Department of Plant Breeding, Swedish University of Agricultural Sciences, 230 53 Alnarp, Sweden
2
National Institute of Animal Biotechnology, Hyderabad 500032, India
3
Department of Microbiology, School of Life Sciences, Central University of Rajasthan, Bandarsindri, Kishangarh, Ajmer, Rajasthan 305801, India
4
Lantmännen Lantbruk, 268 31 Svalöv, Sweden
*
Author to whom correspondence should be addressed.
Agronomy 2020, 10(12), 1888; https://doi.org/10.3390/agronomy10121888
Submission received: 14 September 2020 / Revised: 3 November 2020 / Accepted: 23 November 2020 / Published: 29 November 2020
(This article belongs to the Special Issue Translational Research in Plant Breeding)

Abstract

:
The evolution of pathogens in the changing climate raises new challenges for wheat production. Yellow rust is one of the major wheat diseases worldwide, leading to an increased use of fungicides to prevent significant yield losses. The enhancement of the resistance potential of wheat cultivars is a necessary and environmentally friendly solution for sustainable wheat production. In this study, we aimed to identify the differentially expressed genes induced upon yellow rust infection in the field. Reference and de novo based transcriptome analysis was performed among the resistant and susceptible lines of a bi-parental population to study the global transcriptome changes in contrasting wheat genotypes. Based on the analysis, the de novo transcriptome analysis approach was found to be more supportive for field studies. Expression profiles, gene ontology, KEGG pathway analysis and enrichment studies indicated the relation between differentially expressed genes of wheat and yellow rust infection. The h0igh expression of genes related to non-race specific resistance along with pathogen-specific resistance might be a reason for the better resistance ability of a resistant wheat genotype in the field. The targeted metagenomic analysis of wheat samples revealed that Puccinia striiformis tritici was the most dominant pathogen along with other pathogens on the collected leaf material and validating the disease scoring carried out in the field and transcriptomics analyses.

1. Introduction

Yellow (stripe) rust (YR) in wheat (Triticum aestivum L.) is caused by the biotrophic pathogen Puccinia striiformis f. sp. tritici (Pst) and is distinguished by the yellow pustules that occur on leaves. It is a major disease of wheat worldwide and if left uncontrolled, may cause 100% yield losses in highly susceptible wheat cultivars [1]. The disease is airborne and spread through urediniospores which can disperse over long distances by wind [2]. New evolving races are regularly identified in north-western Europe and have become a major concern in recent years [3]. Genetic resistance to yellow rust in wheat at the seedling stage is mainly through the major resistance (R) genes while at the adult-plant stage, the genetic resistance can be either from R genes or adult-plant resistance genes (APR) or both. R genes recognize and neutralize specific pathogen effectors in a gene-for-gene interaction resulting in effector-triggered immunity (ETI) in plants leading to complete disease resistance [4]. R genes are easy to incorporate in breeding programs, and they can provide high-level resistance to YR. However, they are not durable due to the evolution of new virulent races [5]. Several historic epidemics were also observed in the past due to the breakdown of resistance to YR [6]. On the contrary, adult-plant stage resistance is due to the additive effects of several genes leading to partial quantitative resistance. Adult plant quantitative resistance is more durable due to its complex genetic nature and varieties with high and durable resistance can be developed by incorporating both R and APR genes [7]. Resistance genes for yellow rust are designated as Yr, and several of these are responsive mainly at the seedling stage while some are effective at the adult plant stage [7]. To date, 82 Yr genes have been identified, and around 25 of them are related to APR or high temperature adult plant resistance, whereas the rest provide all stage resistance (ASR) [8]. Several of the ASR genes (Yr5, Yr15, Yr53, Yr61, Yr65 and Yr69) are still effective and can be used in breeding for YR resistance [9]. Although these Yr genes have been genetically mapped to wheat chromosomes, only a small number of them have been isolated to date [9]. More than 140 QTL (Quantitative trait locus) have been identified with partial quantitative resistance to yellow rust and are distributed throughout the wheat genome [10]. A daunting task is to identify the candidate resistance genes within these QTL regions.
The transcriptome profiling and quantification of the differential expression of genes and proteins play a key role in the elucidation of regulatory pathways and gene-networks through wide transcripts coverage, high sensitivity, allele-specific differential expression and novel transcript identification [10,11,12,13,14]. Transcriptome analysis has facilitated the fine mapping of the Yr genes and in understanding the underlying regulatory mechanisms in several studies. Coram et al. (2010) identified 28 transcripts commonly induced upon race-specific resistance conferred by eight different Yr genes [15]. Hulbert et al. (2007) identified 59 putative rust-induced transcripts expressed in the flag leaves of the spring wheat Thatcher-Lr34/Yr18 isoline [16]. Chen et al. (2013) identified 102 and 113 rust responsive wheat genes associated with Yr5 and Yr39 genes, respectively [17]. Comparative transcriptomics revealed distinct differences in the responsive genes upon yellow rust and powdery mildew infection [18,19]. Transcriptomics studies have a high potential for innovative and exploratory studies towards novel insights into molecular mechanisms. However, a large number of genes identified in large scale omics studies impose a hurdle to identify candidate genes for functional validation. A map-based cloning approach has been successful at cloning key Yr genes, and induced mutagenesis identified Lr34/Yr18 to be an ABC transporter [20]. Map-based cloning in 4500 F2 plants followed by chemical mutagenesis and the screening of 1536 M2 lines led to the identification of a gene, Yr36, with a kinase and START domain [21]. Yr10 was identified as a CC-NBS-LRR domain-containing protein from a mapping population of 874 BC2F3 individuals [22]. In this work, we evaluated the possibility to identify candidate genes for YR resistance at the adult plant stage in a winter wheat bi-parental population through transcriptomics approach and also performed the metagenomics analysis to identify major wheat pathogens in the field trial.

2. Materials and Methods

2.1. Field Experiments and Sampling

A winter wheat bi-parental population of 109 lines obtained from the cross (Nimbus/3/SW, 2081221/2/SW2-7/Kranich) was sown in the field in Svalöv, Sweden, in the autumn of 2013. Scoring for yellow rust resistance was done at the booting stage (Zadoks stage 41–49) in the spring of 2014 with a scale of 1 (high resistance) to 9 (highly susceptible). On the basis of scoring, leaf materials from resistant and susceptible lines were collected from penultimate leaves pooled from three plants from each breeding line of the segregating population at the booting stage (Zadoks stage 41–49). Leaves were flash-frozen in liquid nitrogen and stored at −80 °C until further processing. Total RNA was extracted with the RNeasy Plant Mini kit (Qiagen, Hilden, Germany) including DNase treatment (RNase-free DNase set, Qiagen Inc., Santa Clarita, CA, USA).

2.2. RNAseq Library Preparation Sequencing and Quality Control

The concentration and quality of the RNA were estimated in ExperionTM Automated Electrophoresis System (Bio-Rad Laboratories, Hercules, CA, USA) and RIN values above seven were used to construct separate cDNA libraries. Paired-end sequencing was performed for 25 wheat lines on Illumina HiSeq 2500 instrument at the SciLifeLab (Stockholm, Sweden) with 300 base-pairs (bps) average fragment length. The FastQC program was used to analyse the quality of the raw sequencing reads [23]. Adapter sequences and low-quality reads were removed using trimmomatic [24].

2.3. De Novo and Reference Transcriptome Assembly and Validation

All the clean reads were mapped to the 13 known wheat pathogens (Zymoseptoria tritici (Desmazières) Quaedvlieg & Crous, Pyrenophora tritici-repentis (Diedicke) Drechsler, Bipolaris sorokiniana (Ito & Kuribayashi) Dastur, Puccinia graminis Persoon, Puccinia striiformis Westendorp, Puccinia triticina Eriksson, Phaeosphaeria nodorum (Berkeley) Quaedvlieg, Verkley & Crous, Pyrenophora tritici (Diedicke) Drechsler, Fusarium culmorum (W.G.Smith) Saccardo, Fusarium graminearum Schwabe, Fusarium oxysporum Schlechtendal, Fusarium verticil (Saccardo) Nirenberg and Magnaporthe oryzae Cavar) and un-aligned reads derived from individual lines were pooled to construct reference and de novo assemblies. De novo assembly was generated by using the Trinity assembler (version 2.5.1) [25]. Trinity pipeline was followed for the detection of differentially expressed genes. To facilitate a more in-depth comparison of the two different genotypes, reference-based transcriptome assembly was also generated through the alignment of all clean reads from each line to the reference wheat genome by using HiSat2 and Stingtie software [26]. The BUSCO software (version 2.4) was used to evaluate the quality of the two transcriptome assemblies [27]. Generated transcripts were clustered on 100% sequence identity through CD-HIT software [28]. Quality check of samples were also performed through expression abundance estimation of lines, PCA and MDS plot and YR field scoring of lines were also compared. High scoring (susceptible) and low scoring (resistant) lines were used as replicate for differential gene expression analysis.

2.4. Identification of Differentially Expressed Genes (DEGs), Annotation and Gene Ontology (GO) Analysis

The de novo and reference assembled transcriptome was then used as a reference to map the individual reads using the Bowtie2 program [29]. The transcript abundance, raw, transcript per million (TPM) and fragments per kilobase (kb) of the transcript sequence per million mapped reads (FPKM) was measured by using RSEM version 1.1.1131 for each sequenced line [30]. The DESeq2 package was employed to identify of differentially expressed genes (DEGs) from raw read counts at a false discovery rate (FDR) of 0.05 [31]. The heatmap of DEGs was generated through Pheatmap R package by using euclidean distance and hierarchical clustering algorithm [32]. The annotation of the DEGs was performed using the BLAST search program. Initially, BLASTx was performed with an e-value threshold of 1e-10 against wheat coding sequences (CDS) and Uniprot database. The Gene Ontology (GO) classification of DEGs in the genotypes were generated using the WEGO program [33]. The GO enrichment analysis was performed through AgriGO webserver by using hypergeometric statistical test methods and Yekutieli multiple test correction at a p-value of 0.05 [34], and visualization was done through REVIGO [35]. A KEGG pathway analysis was performed with the KOBAS program [36].

2.5. Metagenomic Analysis of Selected Wheat Pathogens

Targeted metagenomics analysis was performed to explore wheat-associated pathogen species in the samples. The trimmed RNA libraries were mapped to 13 selected wheat pathogens with Bowtie2 mapping software. The statistical significance of mapped reads were evaluated for pathogens with respect to resistant and susceptible groups.

3. Results

3.1. RNAseq Data Analyses

The resistant and susceptible lines were sequenced and analysed by the reference and de novo transcriptome assembly. A total number of 307 million read pairs (2 × 150 bps) were generated from the 20 lines. After quality control, 207 million high-quality reads of average length 126 bps were used in the construction of transcriptome assemblies. Library size, gene expression in lines, principal component analysis (PCA) plot and multidimensional scaling (MDS) plots of the samples are given in Figure 1. All the high quality wheat reads were pooled for the construction of reference and de novo assemblies and generated assemblies were clustered at 100% sequence identity. After clustering, 219,435 (N50 of 1835 bps) and 511,926 (N50 of 1420 bps) contigs were found for reference and de novo assemblies respectively. The longest contig sizes were 22,462 and 15,387 bps, respectively (Table 1). In BUSCO analysis, reference and de novo transcriptome assemblies achieved the 94.7% and 78.5% completeness, respectively. The percentage of partially complete BUSCO ranged from 99.1 to 94.0%, while the percentage of missing BUSCO was 0.9 and 6.0%, respectively (Table 2). The wheat lines, disease scoring, read count per line, high quality read, and mapping percentage are given in Supplementary Table S1.

3.2. DEGs Identified by De Novo and Reference-Based Methods

To quantify the transcriptomic variations in a sequenced line, we aligned the clean reads from each sample against the wheat reference and de novo transcriptome assemblies by using Bowtie2 with default parameters and RSEM software was used to quantify the transcript abundance to compare the expression level within and between different samples. The DEGs identified in the reference and de novo assembled transcriptome by DESeq2 software in pairwise comparisons between the resistance and susceptible lines were 141 and 8680, respectively, with a false discovery rate (FDR) < 0.05 (Table S2). To find wheat genes in the reference and de novo based analysis, identified DEGs were searched against wheat CDS sequences through BLAST similarity search, including up and downregulated transcripts (Figure 2). To determine the sample relations, differential expression data from the DESeq2 program were used to generate heat maps. Resistant and susceptible lines were grouped together in a different order with respect to reference and de novo transcriptome. However, most of the susceptible and resistant lines were grouped in the same clusters in both transcriptomes (Figure 3A,B). The expression heatmap of differentially expressed genes was clearly indicating a fewer number of upregulated genes in resistant lines than susceptible lines in their respective assemblies.

3.3. Differentially Expressed Genes Annotation and GO Enrichment

A BLASTX similarity search was performed for all the identified DEGs against the Uniprot database. Approximately 53% of the DEGs had BLASTX hits. The majority of the top BLASTX hit species in the reference-based DEGs belonged to Arabidopsis thaliana, Oryza sativa subsp. Japonica, and were followed by other cereal crops. In the de novo based analysis, DEGs were identified in approximately 17% of plant species, mainly: Arabidopsis thaliana, Oryza sativa janonica, Zea mays, Triticum aestivium, and Nicotiana benthamiana; 35% Schizosaccharomyces prombe and cerevisiae; 20% of fungal species, such as Puccinia gramine, Ustilago maydis, Cryptococcus neoformans and neoformans sp., Dictyostelium discoideum, Neurospora crassa, Candida glabrata and albicans sp., Neosartorya fischeri, Pneumocystis carinii, Solanum bulbocastanum, Trichosporon cutaneum, Agaricus bisporus, Aspergillus niger, flavus, terreus and oryzae, Chaetomium thermophilum and globosum sp., Pseudomonas aeruginosa, Eremothecium gossypii, Debaryomyces hansenii, Phanerochaete chrysosporium, Uromyces fabae, Laccaria bicolor, Kluyveromyces sp., Bacillus subtilis, Yarrowia lipolytica, Emericella nidulans. All the reference and de novo assembled transcripts were annotated against wheat genome and wheat annotated differentially expressed transcripts were considered for further downstream analysis. Wheat annotation for transcripts can be found in Tables S3 and S4.
The Gene Ontology (GO) classification of DEGs’ was represented in three main GO categories, i.e., the cellular component, molecular function and biological process in a histogram (Figure 4). In the GO analysis, the cellular component, cell and cell part (80%), organelle (55%) and membrane (46%) were highly represented compared to other components. Binding (64%) and catalytic activity (61%) were most represented among the molecular functions. Cellular process (65%), metabolic process (56%) and response to stimulus (36%) were the most dominant subcategories of biological processing. Cellular biosynthetic process, metabolic process, i.e., macromolecule, organic substance, nitrogen compound, aromatic compound metabolic process, and organic cyclic compound metabolic process were highly enriched in GO analysis. The significantly enriched GO term of the biological process can be further subcategorized into response to stress, response to stimulus, signal, methylation, multicellular organismal processing, multi-organism process and reproductive process which can be seen clearly in Figure 5A, generated by REVIGO. In Figure 5A (Table S5), the X and Y coordinates were based on the multidimensional scaling of a matrix of the GO terms’ semantic similarities whereas bubbles closeness on the plot were reflecting their closeness in the GO graph structure. It is clear that response to stress, stimulus, signalling and molecular localization-related terms were among the enriched GO terms. The KEGG database in KOBAS webserver was used to explore the networks of enriched pathways and gene products (Table S6). An enriched pathway among differentially expressed genes was provided as a scatter diagram with a degree of enrichment by the rich factor, p-adjust, and the number of genes enriched in a pathway (Figure 5B). The richness factor represents the ratio of the quantity of genes belonging to the pathway among differentially expressed genes to the total number of genes belonging to the pathway among all annotated genes. The lower the p-value, the higher the significance of metabolic pathways, whereas the size of bubbles represents the number of enriched genes. Metabolic pathways, the biosynthesis of secondary metabolites, peroxisome, glycerolipid metabolism, the mRNA surveillance pathway, plant–pathogen interaction, cutin, suberine and wax biosynthesis, glycerophospholipid metabolism and ubiquitin-mediated proteolysis have appeared as highly enriched pathways. To explore global transcriptome expression, RNAseq data of wheat resistant and susceptible lines were mapped on wheat genome. In general, genes in susceptible lines have a higher average read depth per million base pairs than resistant lines and an approximately similar trend has also been found at the genomic location of differentially expressed genes (Figure 6). However, higher read depth in resistant lines for differentially expressed genes was also found on wheat genome 1A, 2A, 3A, 3D and 5A, compared to susceptible lines which might be helpful to understand the molecular mechanism of resistance among the resistance lines.
To explore the field pathogenic factor association with the gene expression of wheat lines, targeted metagenomic analysis was performed by read mapping of resistant and susceptible line samples to the 13 known wheat pathogens. The highest number of reads was mapped to P. striiformis genome in the susceptible group (Figure 7). Significant differences in the read counts in the susceptible and resistant groups were also found in between those of P. graminis and P. triticiana, however, none of the wheat samples had more than 0.5% of total reads belonging to either of these two pathogens. RNA sequences from the pathogens B. sorokiniana, F. culmorum and Z. tritici were also identified with read counts of at least 1% of total reads in at least one sample. However, the read counts from these pathogens were not significantly different between the resistant and susceptible groups.

4. Discussion

In this study, resistant, susceptible and control wheat lines were sequenced. Control lines, common filed trial lines were used to check disease pressure in field whereas resistant and susceptible lines were selected on the basis of field scoring. To compare the transcriptomic variations among wheat lines exposed to yellow rust disease in the field, leaf tissue samples were collected and used for the RNA-seq analysis. On average, approximately 69% of reads were mapped to the wheat reference genome, which indicates the sequencing of other field-related genetic resources such as wheat pathogens. We performed the quality check of sequenced samples through gene expression data. The library size of the sequenced wheat lines was observed to be variable. However, the logCPM normalisation plot (Figure 1B) and gene expression level distributions both showed that the wheat lines had similar expression patterns and variation ranges suggesting that the sequenced line and sequencing data were comparable and suitable for downstream transcriptome analysis.
To know the effect of different biological and ecological factors of wheat field on wheat leaf transcriptomes, we performed the PCA analysis of wheat lines through TMM normalized expression counts for the wheat genome (Figure 1C). The first PCA component accounted for 71.01% of the total variability, which in this case corresponds to the reference genome specific variance. The second PCA component accounted for 12.76% of the overall variance which highlighted how the difference between the wheat lines might have been caused by other field factors. For a better understanding of the variation among the transcriptome of wheat lines, the MDS plot was used to check variations among samples (Figure 1D). The MDS plot showed that the transcriptome of control lines were very close to susceptible lines transcriptome. In order to identify the candidate genes of wheat among the resistant and susceptible ones, the samples from the control lines were removed from further downstream analysis. Moreover, susceptible and control lines transcriptome resemblance were also supported by field scoring. Although it was clear from the PCA and MDS plots that wheat lines had a varying expression in the field, it is challenging to make direct comparisons of field factors for transcriptomic variation among wheat lines.
Reference-based transcriptome analysis has been considered more effective than a de novo approach when the reference genome of organism is available [37]. However, very few studies have compared the two strategies [38,39] to identify differentially expressed genes, and no similar comparison has been found for field diseases. Thus, it is more important and interesting for field disease studies to determine whether the de novo assembly can detect the same genes, isoforms and the molecular responses as a reference genome-based analysis, and what else it was captured in transcriptome. In the present transcriptome analysis of wheat, we compared both strategies. The trinity de novo assembly has approximately twice the number of contigs than the reference assembly, which may be due to allelic variation among wheat genotypes, the lack of strand-specific information of genotypes or the sequencing of other unknown eukaryotic field environmental materials. Several genomes and transcriptome assembly studies have used BUSCO for evaluating the quality of assemblies. BUSCO detected the presence of 1347 extremely conserved core eukaryotic genes (CEGs) and their coverage in transcriptome assemblies for the evaluation of the completeness of the assembly. The BUSCO analysis showed that both assemblies were very close to complete in terms of gene content, together with the identified fragments of core-genes. Overall, both assemblies captured high percentages of ultra-conserved core eukaryotic genes (reference: 99.1%, de novo: 93.0%). However, the number of complete and single copy BUSCO were found more in de novo assemblies (Table 2). In the transcriptome analysis of wheat lines, the expression abundance and differential expression were different in the de novo and the reference-based analyses. However, there is a larger number of differentially expressed transcripts observed in the susceptible lines compared to the resistant lines (Figure 2 and Figure 3) in both assemblies. De novo and reference-based analysis indicated that the genes expressed in the resistant and susceptible lines might play key roles in their differential resistance abilities. This might be due to the presence of several external factors associated with susceptible lines in the field.
To understand the global trend of genomic variation among wheat lines exposed to different biological and ecological factors in the field, we identified and analysed DEGs from both resistant and susceptible lines in our study through generated reference-based and de novo transcriptome assemblies. The number of identified DEGs in the reference and de novo assembled transcriptome was very low in comparison to the total assembled transcripts, 141 and 8680, respectively, in the resistance and susceptible groups. Gene expression is the primary source to know concerning genetic variation among a population generated during a plant breeding process, and gene expression difference in the population could contribute and represent some of the observed differences in plant phenotype such as susceptibility to diseases. However, a low number of differentially expressed genes might indicate the least genomic variation among resistance and susceptible lines [40] and directly indicate the expression of key genes involved in disease resistance. To identify the potential genes involved in resistance, we investigated upregulated DEGs generated by both transcriptome assemblies. A BLAST similarity search was performed against wheat CDS sequences with the value 1e-10 to determine the commonly expressed genes and unique genes expressed between assemblies. A large proportion of transcript sequences failed to find homologous sequences in the wheat genome. With the used criteria, 126,669 and 107,651 transcript sequences were annotated in de novo and reference assemblies, respectively, in the similarity search of wheat CDS sequences (Tables S3 and S4). A large proportion of transcripts sequences generated from wheat lines were failed to find homologous sequences in wheat genome from both the assemblies. Huge number of un-annotating transcripts might be due to inclusion of lot of non-coding RNA content which are missing in wheat gene model or mixing of reads from other eukaryotic organisms.
Approximately 41% of sequences were found to be common between the de novo and reference assemblies, and approximately 29% of sequences were unique in assemblies which might indicate the difference generated by the used assembly approaches, tools and the inclusion of genomic material of other organisms in the wheat transcriptome. Furthermore, both methods identified many similar candidate genes putatively involved in resistant and susceptible genotypes. Thus, this also demonstrated the potential of the de novo method for the capturing of key genes even in the absence of a reference genome which can be useful for researchers working with field diseases with orphan species. In order to access expression difference in a wheat genotype, a list of upregulated differentially expressed genes was generated by both assemblies (Table 3). Differentially expressed genes from reference and de novo assemblies were annotated by the wheat genome and Uniprot database through BLAST similarity search. It is clearly visible in Table 3 that most of the differentially expressed transcripts related to the disease were identified through de novo transcriptome analysis approach. Cysteine-rich receptor-like protein kinases (CRKs) are most highly differentially expressed in resistant lines among all potential candidates. CRKs are transmembrane, and are involved in a wide range of receptor-like protein kinase-dependent signalling networks including pathogen detection [41]. Seven different isoforms of CRKs; six as CRK6 and one as CRK7 (Table 3) were expressed among resistant lines on different genome locations. The Level of expression varies with respect to the genomic location. However, all the CRKs possess a secretory signal. CRKs have shown a significant role in disease resistance such as NPR1 and NH1 mediated immunity in Arabidopsis thaliana and Oryza sativa subsp. Japonica against bacterial blight pathogen Xanthomomas oryzae pv. oryzae. Absence or low CRKs expression can make plants more susceptible to infection [42]. Diacylglycerol kinase 5 (DAG5) is another gene among other significant DEGs. DAG5 phosphorylates diacylglycerol (DAG) to produce a signalling molecule of phosphatidic acid as a second messenger. Diacylglycerol kinases are key signalling enzymes which are involved in phosphorylating diacylglycerol (DAG) to yield phosphatidic acid (PA). The biosynthesis of PA plays a crucial role in a eukaryotic metabolic and signalling process such as the glycerolipid metabolic process, intracellular signal transduction source, lipid phosphorylation, or the protein kinase C-activating G protein-coupled receptor signalling pathway. Phosphatidic acid is highly required for plant development, abiotic stress and pathogen attack. The presence of diacylglycerol kinase 5 (DAG5) among highly significant DEGs may be an indicator of high resistance among resistant lines [43].
Plant disease resistance in genes encodes two main classes of nucleotide-binding site leucine-rich repeat (NBS-LRR) proteins; TIR-domain-containing (TNL) and CC-domain-containing (CNL). TNLs and CNLs regulate plant resistance through different downstream pathways by inducing a series of defence responses, such as the activation of an oxidative burst, mitogen-associated protein kinase cascade, the induction of pathogenesis-related genes, and the hypersensitive response. In our study, four RGA isoforms; one for RGA2 and three for RGA4, were found among significant DEGs. All four RGAs were found at different genomic locations (Table 3), but all isoforms were having a relatively low but consistent gene expression. RGA isoforms are having an RX-like_CC motif (IPR038005), NB-ARC, LRR domains and belong to the CNL family. The consistent expression of the CNL type of RGAs indicates the “non-race specific” disease resistance and also the activated alternative non-race specific resistance induced by disease resistance proteins RPP8 and RPP13 which indicate a high presence of non-host-specific pathogens in the growing field [44]. LRR receptor-like serine/threonine-protein kinases such as FLS2, Y3475, leaf rust and rust resistance kinases determine the specific perception of pathogen-associated molecular patterns and initiate the innate immune MAP kinase signalling for enhanced resistance against pathogens. The expression of receptor-like cytoplasmic kinase (RLCK185) is clear evidence of the innate immunity triggered by fungal chitin signalling pathways through MAP kinases. Chitin recognition by CERK1 receptor triggers the MAP kinase (MPK3 and MPK6) cascade to search for a host protein that can interact with effector proteins and participates in the activation of defence genes during response to microbial peptidoglycans and chitin [45]. A plasma membrane-associated G-type lectin S-receptor-like serine/threonine-protein kinase (CES101) was also expressed among the significantly expressed DEGs and its involvement was suggested in innate immune response, protein phosphorylation and response to fungus which might have role in the recognition of the lectin-associated molecular pattern of fungus [46]. All RGAs were expressed in the resistant genotypes. Perhaps, the functionality of these RGAs was suppressed by pathogens or lost molecular function due to several genomic modifications in the susceptible genotype. The presence of the RGA4 isoform under an unclassified genome needs to be re-investigated in order to be used as a marker for plant breeding purposes [47]. The expression of protein-like TIFY 6b, known as a repressor of jasmonate responses, is suggesting that plants may be reducing the developmental and metabolic processes for better resource utilization under a biotic stress such as salt tolerance, dehydration and wounding [48]. In general, susceptible cultivars were shown to have a higher gene expression level than the resistant cultivars at the global transcriptome level, which might be due to the higher production and resources allocation against infection and diseases (Figure 6).
The mechanism of cell wall communication to the cytoplasm is not very well known for plant resistance. Wall-associated kinases (WAKs) have the potential to provide clues for the cell wall and the cytoplasm crosstalk. It has been found that WAKs were expressed at organ junctions of shoot, root and leaf in response to wall disturbances, and the expression of an antisense WAK gene in leaves reduced the WAK protein levels and exhibited a loss of cell expansion. The presence of WAKs on the cell wall is providing the evidence that the receptor-like kinase may have a significant role in the control of cell expansion, morphogenesis and development [49]. In our study, five transcripts were found to be related to wall-associated receptor kinases viz. WAK1, WAK3, WAK5 and WAK9. WAK1 and WAK3 were found on chromosomes 5A and 2B on reverse strands, respectively. The higher expression of WAK1 might show the reduced cell expansion during infection and disease. WAK5 and WAK9 were found on chromosomes 6B and 6D in the forward strand, respectively, which might indicate their involvement in the control of cell expansion, morphogenesis and development under normal circumstances.
The differential gene expression data analysis of wheat lines has increased our understanding of plant defence mechanism in the field and shown evidence for a pathogen-specific and broad-spectrum disease resistance mechanism. Metagenomics analysis of the RNA libraries of samples has also provided evidence for a pathogen-specific disease resistance mechanism along with broad-spectrum disease resistance (Figure 7). In metagenomics analysis, P. striiformis was found as the most dominant pathogen among susceptible wheat lines which indicates the susceptibility of wheat lines for this pathogen. The significant differential expression of genes involved in non-race-specific disease resistance among resistant wheat lines provides the evidence for the activation of PAMP-triggered innate immunity (PTI). The presence of other fungal pathogens such as B. sorokiniana, F. culmorum and Z. tritici were supporting the cause of the high activation of innate immunity in the field.

5. Conclusions

In the present study, transcriptome analysis was conducted between a resistant and susceptible wheat genotype with a different level of resistance in the field, and several differentially expressed genes were identified through reference and de novo transcriptome assembly. In the transcriptome analysis, the key challenge was the handling of cross-species eukaryotic molecular content of field samples, especially during field infection. In our comparative transcriptome study, the de novo approach was found to be more explorative than the reference assembly process due to high dependency on the reference genome, and the gene expression of similar genes from cross eukaryotic species like fungus has a high chance of influencing the gene expression quantification process due to short reads. In this study, many genes related to plant defence were upregulated in resistant wheat lines in the field. A significant number of genes involved in non-race specific resistance were overrepresented in resistant lines, which might be a reason for the good resistance ability of resistant wheat lines. Expression ofcysteine-rich receptor-like protein kinases, CNL type kinases and wall-associated kinases (WAKs) are suggesting their role in determining broad-spectrum plant defence in the field. GO enrichment and pathway analysis further confirmed that PTI triggered the innate immunity-related genes which were overrepresented in the wheat resistant lines. Potential candidate genes found in this study might provide a basis for future functional host–pathogen genomics research for field wheat. Molecular techniques such as RNA interference can be further used to understand the role of these genes in the field plant resistance.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4395/10/12/1888/s1, Table S1: Wheat lines, disease scoring, read count per line, high quality reads and mapping percentage; Table S2: Number of differentially expressed genes among lines with respect to reference and de novo transcriptome; Table S3: Reference transcriptome annotation for wheat genome; Table S4: De novo transcriptome annotation for wheat genome; Table S5: GO enrichment analysis of DEGs; Table S6: Pathway enrichment analysis of DEGs.

Author Contributions

S.K.K.: performed the data analysis and wrote the first draft of manuscripts; R.R.V. and F.O.: were responsible for experimental design, implementation, sampling and RNA extractions and manuscript writing; N.P.: helped the manuscript preparation and writing; T.H.: conducted the field trials; A.C. conceived the study, planned the experiment, and secured the funding for this research project. All authors have read and agreed to the published version of the manuscript.

Funding

This project was funded by the Lantmännen Research Foundation (2016F010), the Crafoord Foundation (20161025), SLU Grogrund (SLU.ltv.2019.1.1.1-623) and the Royal Physiographic Society of Lund.

Acknowledgments

We would like to thank Salvatore Caruso for assistance with the laboratory work. PlantLink, Plant Breeding Platform at SLU, and NIAB, India, are acknowledged for the bioinformatics support.

Conflicts of Interest

Authors have declared that they have no competing interest.

References

  1. Chen, X.M. Challenges and solutions for stripe rust control in the United States. Aust. J. Agric. Res. 2007, 58, 648–655. [Google Scholar] [CrossRef]
  2. Brown, J.K.M.; Hovmøller, M.S. Aerial Dispersal of Pathogens on the Global and Continental Scales and Its Impact on Plant Disease. Science 2002, 297, 537–541. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Hovmøller, M.S.; Walter, S.; Bayles, R.A.; Hubbard, A.; Flath, K.; Sommerfeldt, N.; Leconte, M.; Czembor, P.; Rodriguez-Algaba, J.; Thach, T.; et al. Replacement of the European wheat yellow rust population by new races from the centre of diversity in the near-Himalayan region. Plant Pathol. 2016, 65, 402–411. [Google Scholar] [CrossRef] [Green Version]
  4. Jones, J.D.G.; Dangl, J.L. The plant immune system. Nature 2006, 444, 323–329. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Schwessinger, B. Fundamental wheat stripe rust research in the 21st century. New Phytol. 2017, 213, 1625–1631. [Google Scholar] [CrossRef]
  6. Bux, H.; Ashraf, M.; Chen, X. Expression of high-temperature adult-plant (HTAP) resistance against stripe rust (Puccinia striiformis f. sp. tritici) in Pakistan wheat landraces. Can. J. Plant Pathol. 2012, 34, 68–74. [Google Scholar] [CrossRef]
  7. Chen, W.; Wellings, C.; Chen, X.; Kang, Z.; Liu, T. Wheat stripe (yellow) rust caused by Puccinia striiformis f. sp. tritici. Mol. Plant Pathol. 2014, 15, 433–446. [Google Scholar] [CrossRef]
  8. Jamil, S.; Shahzad, R.; Ahmad, S.; Fatima, R.; Zahid, R.; Anwar, M.; Iqbal, M.Z.; Wang, X. Role of Genetics, Genomics, and Breeding Approaches to Combat Stripe Rust of Wheat. Front. Nutr. 2020, 7, 173. [Google Scholar] [CrossRef]
  9. Xu, L.S.; Wang, M.N.; Cheng, P.; Kang, Z.S.; Hulbert, S.H.; Chen, X.M. Molecular mapping of Yr53, a new gene for stripe rust resistance in durum wheat accession PI 480148 and its transfer to common wheat. Theor. Appl. Genet. 2013, 126, 523–533. [Google Scholar] [CrossRef]
  10. Rosewarne, G.M.; Herrera-Foessel, S.A.; Singh, R.P.; Huerta-Espino, J.; Lan, C.X.; He, Z.H. Quantitative trait loci of stripe rust resistance in wheat. Theor. Appl. Genet. 2013, 126, 2427–2449. [Google Scholar] [CrossRef] [Green Version]
  11. Lindlöf, A.; Chawade, A.; Sikora, P.; Olsson, O. Comparative Transcriptomics of Sijung and Jumli Marshi Rice during Early Chilling Stress Imply Multiple Protective Mechanisms. PLoS ONE 2015, 10, e0125385. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Yaari, G.; Bolen, C.R.; Thakar, J.; Kleinstein, S.H. Quantitative set analysis for gene expression: A method to quantify gene set differential expression including gene-gene correlations. Nucleic Acids Res. 2013, 41, e170. [Google Scholar] [CrossRef] [PubMed]
  13. Chawade, A.; Alexandersson, E.; Bengtsson, T.; Andreasson, E.; Levander, F. Targeted Proteomics Approach for Precision Plant Breeding. J. Proteome Res. 2016, 15, 638–646. [Google Scholar] [CrossRef] [Green Version]
  14. Chawade, A.; Armoniené, R.; Berg, G.; Brazauskas, G.; Frostgård, G.; Geleta, M.; Gorash, A.; Henriksson, T.; Himanen, K.; Ingver, A.; et al. A transnational and holistic breeding approach is needed for sustainable wheat production in the Baltic Sea region. Physiol. Plant. 2018, 164, 442–451. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Coram, T.E.; Huang, X.; Zhan, G.; Settles, M.L.; Chen, X. Meta-analysis of transcripts associated with race-specific resistance to stripe rust in wheat demonstrates common induction of blue copper-binding protein, heat-stress transcription factor, pathogen-induced WIR1A protein, and ent-kaurene synthase transcripts. Funct. Integr. Genom. 2010, 10, 383–392. [Google Scholar]
  16. Hulbert, S.H.; Bai, J.; Fellers, J.P.; Pacheco, M.G.; Bowden, R.L. Gene Expression Patterns in Near Isogenic Lines for Wheat Rust Resistance Gene Lr34/Yr18. Phytopathology 2007, 97, 1083–1093. [Google Scholar] [CrossRef] [Green Version]
  17. Chen, X.; Coram, T.; Huang, X.; Wang, M.; Dolezal, A. Understanding Molecular Mechanisms of Durable and Non-durable Resistance to Stripe Rust in Wheat Using a Transcriptomics Approach. Curr. Genom. 2013, 14, 111–126. [Google Scholar] [CrossRef] [PubMed]
  18. Zhang, H.; Yang, Y.; Wang, C.; Liu, M.; Li, H.; Fu, Y.; Wang, Y.; Nie, Y.; Liu, X.; Ji, W. Large-scale transcriptome comparison reveals distinct gene activations in wheat responding to stripe rust and powdery mildew. BMC Genom. 2014, 15, 898. [Google Scholar] [CrossRef] [Green Version]
  19. Hao, Y.; Wang, T.; Wang, K.; Wang, X.; Fu, Y.; Huang, L.; Kang, Z. Transcriptome Analysis Provides Insights into the Mechanisms Underlying Wheat Plant Resistance to Stripe Rust at the Adult Plant Stage. PLoS ONE 2016, 11, e0150717. [Google Scholar] [CrossRef]
  20. Krattinger, S.G.; Lagudah, E.S.; Spielmeyer, W.; Singh, R.P.; Huerta-Espino, J.; McFadden, H.; Bossolini, E.; Selter, L.L.; Keller, B. A Putative ABC Transporter Confers Durable Resistance to Multiple Fungal Pathogens in Wheat. Science 2009, 323, 1360. [Google Scholar] [CrossRef] [Green Version]
  21. Fu, D.; Uauy, C.; Distelfeld, A.; Blechl, A.; Epstein, L.; Chen, X.; Sela, H.; Fahima, T.; Dubcovsky, J. A kinase-START gene confers temperature-dependent resistance to wheat stripe rust. Science 2009, 323, 1357–1360. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Liu, W.; Frick, M.; Huel, R.; Nykiforuk, C.L.; Wang, X.; Gaudet, D.A.; Eudes, F.; Conner, R.L.; Kuzyk, A.; Chen, Q.; et al. The Stripe Rust Resistance Gene Yr10 Encodes an Evolutionary-Conserved and Unique CC–NBS–LRR Sequence in Wheat. Mol. Plant 2014, 7, 1740–1755. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc (accessed on 1 November 2019).
  24. Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Best Practices for De Novo Transcriptome Assembly with Trinity. Available online: https://informatics.fas.harvard.edu/best-practices-for-de-novo-transcriptome-assembly-with-trinity.html (accessed on 1 November 2019).
  26. Pertea, M.; Kim, D.; Pertea, G.M.; Leek, J.T.; Salzberg, S.L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 2016, 11, 1650. [Google Scholar] [CrossRef] [PubMed]
  27. Simão, F.A.; Waterhouse, R.M.; Ioannidis, P.; Kriventseva, E.V.; Zdobnov, E.M. BUSCO: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 2015, 31, 3210–3212. [Google Scholar] [CrossRef] [Green Version]
  28. Fu, L.; Niu, B.; Zhu, Z.; Wu, S.; Li, W. CD-HIT: Accelerated for clustering the next generation sequencing data. Bioinformatics 2012, 28, 3150–3152. [Google Scholar] [CrossRef]
  29. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 2012, 9, 357. [Google Scholar] [CrossRef] [Green Version]
  30. Li, B.; Dewey, C.N. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinform. 2011, 12, 323. [Google Scholar] [CrossRef] [Green Version]
  31. DESeq2 Package. Available online: http://www.bioconductor.org/packages/release/bioc/html/DESeq2.html (accessed on 1 December 2019).
  32. Pheatmap: A Function to Draw Clustered Heatmaps. Available online: https://www.rdocumentation.org/packages/pheatmap/versions/1.0.12/topics/pheatmap (accessed on 1 December 2019).
  33. Ye, J.; Fang, L.; Zheng, H.; Zhang, Y.; Chen, J.; Zhang, Z.; Wang, J.; Li, S.; Li, R.; Bolund, L.; et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34, W293–W297. [Google Scholar] [CrossRef]
  34. Du, Z.; Zhou, X.; Ling, Y.; Zhang, Z.; Su, Z. Agrigo: A GO analysis toolkit for the agricultural community. Nucleic Acids Res. 2010, 38, W64–W70. [Google Scholar] [CrossRef] [Green Version]
  35. Supek, F.; Bošnjak, M.; Škunca, N.; Šmuc, T. REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms. PLoS ONE 2011, 6, e21800. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Xie, C.; Mao, X.; Huang, J.; Ding, Y.; Wu, J.; Dong, S.; Kong, L.; Gao, G.; Li, C.-Y.; Wei, L. KOBAS 2.0: A web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 2011, 39, W316–W322. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Martin, J.A.; Wang, Z. Next-generation transcriptome assembly. Nat. Rev. Genet. 2011, 12, 671–682. [Google Scholar] [CrossRef] [PubMed]
  38. Kovi, M.R.; Abdelhalim, M.; Kunapareddy, A.; Ergon, Å.; Tronsmo, A.M.; Brurberg, M.B.; Hofgaard, I.S.; Asp, T.; Rognli, O.A. Global transcriptome changes in perennial ryegrass during early infection by pink snow mould. Sci. Rep. 2016, 6, 1–15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Ward, J.A.; Ponnala, L.; Weber, C.A. Strategies for transcriptome analysis in nonmodel plants. Am. J. Bot. 2012, 99, 267–276. [Google Scholar] [CrossRef]
  40. Zhang, W.; Duan, S.; Kistner, E.O.; Bleibel, W.K.; Huang, R.S.; Clark, T.A.; Chen, T.X.; Schweitzer, A.C.; Blume, J.E.; Cox, N.J.; et al. Evaluation of genetic variation contributing to differences in gene expression between populations. Am. J. Hum. Genet. 2008, 82, 631–640. [Google Scholar] [CrossRef] [Green Version]
  41. Kimura, S.; Hunter, K.; Vaahtera, L.; Tran, H.C.; Citterico, M.; Vaattovaara, A.; Rokka, A.; Stolze, S.C.; Harzen, A.; Meißner, L.; et al. CRK2 and C-terminal Phosphorylation of NADPH Oxidase RBOHD Regulate Reactive Oxygen Species Production in Arabidopsis. Plant Cell 2020, 32, 1063. [Google Scholar] [CrossRef] [Green Version]
  42. Chern, M.; Xu, Q.; Bart, R.S.; Bai, W.; Ruan, D.; Sze-To, W.H.; Canlas, P.E.; Jain, R.; Chen, X.; Ronald, P.C. A Genetic Screen Identifies a Requirement for Cysteine-Rich–Receptor-Like Kinases in Rice NH1 (OsNPR1)-Mediated Immunity. PLoS Genet. 2016, 12, e1006049. [Google Scholar]
  43. Escobar-Sepúlveda, H.F.; Trejo-Téllez, L.I.; Pérez-Rodríguez, P.; Hidalgo-Contreras, J.V.; Gómez-Merino, F.C. Diacylglycerol Kinases Are Widespread in Higher Plants and Display Inducible Gene Expression in Response to Beneficial Elements, Metal, and Metalloid Ions. Front. Plant Sci. 2017, 8, 129. [Google Scholar] [CrossRef] [Green Version]
  44. McHale, L.; Tan, X.; Koehl, P.; Michelmore, R.W. Plant NBS-LRR proteins: Adaptable guards. Genome Biol. 2006, 7, 1–11. [Google Scholar] [CrossRef] [Green Version]
  45. Yamaguchi, K.; Yamada, K.; Ishikawa, K.; Yoshimura, S.; Hayashi, N.; Uchihashi, K.; Ishihama, N.; Kishi-Kaboshi, M.; Takahashi, A.; Tsuge, S.; et al. A Receptor-like Cytoplasmic Kinase Targeted by a Plant Pathogen Effector Is Directly Phosphorylated by the Chitin Receptor and Mediates Rice Immunity. Cell Host Microbe 2013, 13, 347–357. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Tang, J.; Lin, G.; Langdon, W.Y.; Tao, L.; Zhang, J. Regulation of C-Type Lectin Receptor-Mediated Antifungal Immunity. Front. Immunol. 2018, 9, 123. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Chen, X.-R.; Brurberg, M.B.; Elameen, A.; Klemsdal, S.S.; Martinussen, I. Expression of resistance gene analogs in woodland strawberry (Fragaria vesca) during infection with Phytophthora cactorum. Mol. Genet. Genom. 2016, 291, 1967–1978. [Google Scholar] [CrossRef] [Green Version]
  48. Ye, H.; Du, H.; Tang, N.; Li, X.; Xiong, L. Identification and expression profiling analysis of TIFY family genes involved in stress and phytohormone responses in rice. Plant Mol. Biol. 2009, 71, 291–305. [Google Scholar] [CrossRef] [PubMed]
  49. Wagner, T.A.; Kohorn, B.D. Wall-associated kinases are expressed throughout plant development and are required for cell expansion. Plant Cell 2001, 13, 303–318. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Quality evaluation of the samples: (A) library size of the sequenced wheat lines; (B) boxplot of logCPM expression values across the samples; (C) principal component analysis (PCA); and (D) multidimensional scaling (MDS) plot of the Trimmed Means of M-values (TMM) normalized expression values of samples. Resistance, susceptible and control lines are represented through red, green and blue colours, respectively. Control lines (cont): blue, resistant lines (Res): red, susceptible lines (Sus): green.
Figure 1. Quality evaluation of the samples: (A) library size of the sequenced wheat lines; (B) boxplot of logCPM expression values across the samples; (C) principal component analysis (PCA); and (D) multidimensional scaling (MDS) plot of the Trimmed Means of M-values (TMM) normalized expression values of samples. Resistance, susceptible and control lines are represented through red, green and blue colours, respectively. Control lines (cont): blue, resistant lines (Res): red, susceptible lines (Sus): green.
Agronomy 10 01888 g001
Figure 2. Summary of differential gene expression and comparison: (A) the number of differentially expressed transcripts identified using reference-based assembly and de novo based assembly; (B) Venn diagrams of the number of differentially expressed transcripts for pairwise comparisons at a false discovery rate (FDR) of <0.05.
Figure 2. Summary of differential gene expression and comparison: (A) the number of differentially expressed transcripts identified using reference-based assembly and de novo based assembly; (B) Venn diagrams of the number of differentially expressed transcripts for pairwise comparisons at a false discovery rate (FDR) of <0.05.
Agronomy 10 01888 g002
Figure 3. Heat maps of the expression pattern of differentially expressed genes by using (A) 141 reference and (B) 8680 de novo assemblies. Red and cyan represent the resistant and susceptible lines, respectively.
Figure 3. Heat maps of the expression pattern of differentially expressed genes by using (A) 141 reference and (B) 8680 de novo assemblies. Red and cyan represent the resistant and susceptible lines, respectively.
Agronomy 10 01888 g003
Figure 4. Gene ontology classifications of the differentially expressed genes by the WEGO tool.
Figure 4. Gene ontology classifications of the differentially expressed genes by the WEGO tool.
Agronomy 10 01888 g004
Figure 5. Gene ontology and pathways: (A) description of enriched GO terms associated with differentially expressed genes through a scatterplot in a two-dimensional space derived by applying multidimensional scaling to a matrix of the GO terms’ semantic similarities. The bubble colour indicates significance (−log10 p-value) and size indicates the frequency of the GO terms in the underlying gene ontology annotation database, such as a larger circle in blue which represents the most significant enriched term. Red colour represent higher p-values (B) The KEGG enrichment of differentially expressed genes as a scatter diagram with a degree of enrichment by the richness factor, p-adjust, and the number of genes enriched in a pathway. The number of enriched DEGs in the pathway is indicated by the circle area, and the circle colour represents the ranges of the p-adjust.
Figure 5. Gene ontology and pathways: (A) description of enriched GO terms associated with differentially expressed genes through a scatterplot in a two-dimensional space derived by applying multidimensional scaling to a matrix of the GO terms’ semantic similarities. The bubble colour indicates significance (−log10 p-value) and size indicates the frequency of the GO terms in the underlying gene ontology annotation database, such as a larger circle in blue which represents the most significant enriched term. Red colour represent higher p-values (B) The KEGG enrichment of differentially expressed genes as a scatter diagram with a degree of enrichment by the richness factor, p-adjust, and the number of genes enriched in a pathway. The number of enriched DEGs in the pathway is indicated by the circle area, and the circle colour represents the ranges of the p-adjust.
Agronomy 10 01888 g005
Figure 6. Circos visualization of the RNAseq data at the wheat genome-wide level. From the outer to inner ring; karyotype of the wheat genome; average read depth per million base pair of wheat genome mapping of the resistant (red) and susceptible (green) line; the genomic location of differentially expressed genes for the resistant (red) and susceptible (green) line. Height of bars in outer and inner ring shows the average read depth per million base pair with respect to the genomic location.
Figure 6. Circos visualization of the RNAseq data at the wheat genome-wide level. From the outer to inner ring; karyotype of the wheat genome; average read depth per million base pair of wheat genome mapping of the resistant (red) and susceptible (green) line; the genomic location of differentially expressed genes for the resistant (red) and susceptible (green) line. Height of bars in outer and inner ring shows the average read depth per million base pair with respect to the genomic location.
Agronomy 10 01888 g006
Figure 7. Metagenomic analysis of the selected wheat pathogens: (A) the percentage of pathogen reads that are mapped to each species (*** p < 0.001; ** p < 0.01); (B) the mapping percentage of reads for the pathogenic fungal genome.
Figure 7. Metagenomic analysis of the selected wheat pathogens: (A) the percentage of pathogen reads that are mapped to each species (*** p < 0.001; ** p < 0.01); (B) the mapping percentage of reads for the pathogenic fungal genome.
Agronomy 10 01888 g007
Table 1. Assembly metric of the reference and de novo transcriptome assemblies.
Table 1. Assembly metric of the reference and de novo transcriptome assemblies.
TranscriptomeContigsN50Avg. LengthAssembly SizeLongest Contig SizeContig Size >10 KBContig Size >1 KB
Reference219,43518351477324,000,55122,46258140,664
De novo511,9261420953488,058,76215,38730175,517
Table 2. Results of the BUSCO analysis for the transcriptome assemblies’ validation.
Table 2. Results of the BUSCO analysis for the transcriptome assemblies’ validation.
BUSCO DescriptionReferenceDe Novo
Complete BUSCOs (S+D)1302 (94.7%)1079 (78.5%)
Complete and single-copy BUSCOs (S)69 (5.0%)229 (16.7%)
Complete and duplicated BUSCOs (D)1233 (D: 89.7%)850 (D: 61.8%)
Fragmented BUSCOs (F)61 (4.4%)213 (15.5%)
Missing BUSCOs (M)12 (0.9%)83 (6.0%)
Table 3. List of upregulated differentially expressed genes (p-value < 0.05) in de novo and reference assemblies that can be considered as potential candidate genes involved in resistance. The table is sorted according to p-values. FC: fold change. IWGSC: International Wheat Genome Sequencing Consortium
Table 3. List of upregulated differentially expressed genes (p-value < 0.05) in de novo and reference assemblies that can be considered as potential candidate genes involved in resistance. The table is sorted according to p-values. FC: fold change. IWGSC: International Wheat Genome Sequencing Consortium
Reference/De Novolog2FCAnnotationUniprotIdsIWGSCSignal
TranscriptChromosome Location
TRINITY_DN51078_c2_g4_i2805.981Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ2D:189356990:189359834:1Y
TRINITY_DN46223_c0_g1_i2598.303Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ5A:546234176:546238163:−1Y
TRINITY_DN46223_c0_g1_i3182.836Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ5A:546238637:546241834:1Y
TRINITY_DN46223_c0_g1_i1752.416Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ5A:546238637:546241834:1Y
TRINITY_DN38661_c1_g1_i138.481Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ2B:245784381:245787768:1Y
TRINITY_DN45544_c0_g1_i2416.282Cysteine-rich receptor-like protein kinaseCRK6_ORYSJ2D:20432763:20439635:1Y
TRINITY_DN37687_c0_g2_i511.964Cysteine-rich receptor-like protein kinaseCRK7_ARATH5A:546306021:546309203:−1Y
TRINITY_DN49928_c0_g1_i1811.545Diacylglycerol kinase 5 OS = Arabidopsis thalianaDGK5_ARATH2A:748963953:748967651:−1N
TRINITY_DN35364_c0_g1_i32.971Disease resistance protein RGA2 OS = SolanumRGA2_SOLBU1D:455752931:455755683:1N
TRINITY_DN44057_c1_g5_i17.574Disease resistance protein RPP8 OS = ArabidopsisRPP8_ARATH6B:662028638:662032091:−1N
TRINITY_DN39548_c1_g3_i211.392Leaf rust 10 disease-resistance locus receptor-likeLRL28_ARATH3B:29721933:29735756:1Y
TRINITY_DN35553_c1_g1_i15.456Leaf rust 10 disease-resistance locus receptor-likeLRL28_ARATH6D:2047133:2050237:−1Y
TRINITY_DN38577_c0_g1_i112.655LRR receptor-like serine/threonine-proteinFLS2_ARATH1D:466386615:466388390:−1Y
TRINITY_DN38835_c1_g4_i134.145LRR receptor-like serine/threonine-proteinFLS2_ORYSJ6D:380568621:380572398:1Y
TraesCS2B02G608600.159.366Probable LRR receptor-like serine/threonine-proteinY3475_ARATH2B:788840706:788842030:−1N
TRINITY_DN41813_c2_g5_i328.015Probable LRR receptor-like serine/threonine-proteinY3475_ARATH2D:646488160:646491914:1N
TRINITY_DN51069_c0_g2_i82.348Protein TIFY 6b OS = Oryza sativa subsp.TIF6B_ORYSJ5B:369635031:369638011:−1N
TRINITY_DN35330_c0_g3_i83.364Putative disease resistance protein RGA4RGA4_SOLBUUn:47532810:47545586:1N
TRINITY_DN51664_c0_g1_i113.269Putative disease resistance protein RGA4RGA4_SOLBUUn:95706705:95715329:1N
TRINITY_DN35364_c0_g4_i13.209Putative disease resistance protein RGA4RGA4_SOLBUUn:234394428:234397967:1N
TRINITY_DN51887_c1_g1_i14.127Putative disease resistance RPP13-likeR13L2_ARATH7D:11663563:11672717:1N
TRINITY_DN52456_c0_g2_i158.426Rust resistance kinase Lr10 OS = TriticumLRK10_WHEAT1A:9359231:9363721:1N
TraesCS6B02G091700.14.309S-(+)-linalool synthase, chloroplastic OS = OryzaLINS_ORYSJ6B:67408283:67411671:−1N
TRINITY_DN51301_c1_g1_i1011.432Vesicle-associated protein 1-1 OS = ArabidopsisVAP11_ARATH7B:717630687:717634406:−1N
TRINITY_DN46935_c4_g2_i75.042Vesicle-associated protein 1-1 OS = ArabidopsisVAP11_ARATH7B:717785488:717796197:1N
TRINITY_DN52084_c1_g1_i819.697Wall-associated receptor kinase 1 OS = ArabidopsisWAK1_ARATH5A:464158592:464170977:−1Y
TRINITY_DN50322_c1_g1_i115.429Wall-associated receptor kinase 3 OS = ArabidopsisWAK3_ARATH2B:657850507:657854763:−1Y
TRINITY_DN39541_c5_g2_i211.016Wall-associated receptor kinase 5 OS = ArabidopsisWAK5_ARATH6D:467856066:467860898:1Y
TRINITY_DN38381_c2_g3_i65.209Wall-associated receptor kinase 5 OS = ArabidopsisWAK5_ARATH6B:713458733:713462914:1Y
TRINITY_DN52648_c2_g2_i17.132Wall-associated receptor kinase-like 9WAKLH_ARATH6D:467856066:467860898:1Y
TRINITY_DN51423_c2_g1_i412.889Receptor-like cytoplasmic kinase 185 OS = OryzaRK185_ORYSJ2B:104817628:104821546:−1N
TRINITY_DN50690_c0_g4_i15.46G-type lectin S-receptor-like serine/threonine-proteinY1130_ARATH2D:642197592:642202812:1N
TRINITY_DN35935_c1_g1_i35.033G-type lectin S-receptor-like serine/threonine-proteinCE101_ARATH5B:690469413:690490270:−1N
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kumar Kushwaha, S.; Vetukuri, R.R.; Odilbekov, F.; Pareek, N.; Henriksson, T.; Chawade, A. Differential Gene Expression Analysis of Wheat Breeding Lines Reveal Molecular Insights in Yellow Rust Resistance under Field Conditions. Agronomy 2020, 10, 1888. https://doi.org/10.3390/agronomy10121888

AMA Style

Kumar Kushwaha S, Vetukuri RR, Odilbekov F, Pareek N, Henriksson T, Chawade A. Differential Gene Expression Analysis of Wheat Breeding Lines Reveal Molecular Insights in Yellow Rust Resistance under Field Conditions. Agronomy. 2020; 10(12):1888. https://doi.org/10.3390/agronomy10121888

Chicago/Turabian Style

Kumar Kushwaha, Sandeep, Ramesh R. Vetukuri, Firuz Odilbekov, Nidhi Pareek, Tina Henriksson, and Aakash Chawade. 2020. "Differential Gene Expression Analysis of Wheat Breeding Lines Reveal Molecular Insights in Yellow Rust Resistance under Field Conditions" Agronomy 10, no. 12: 1888. https://doi.org/10.3390/agronomy10121888

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop