Next Article in Journal
The Anaerobic Fungi: Challenges and Opportunities for Industrial Lignocellulosic Biofuel Production
Next Article in Special Issue
Variability, Functions and Interactions of Plant Virus Movement Proteins: What Do We Know So Far?
Previous Article in Journal
Leishmania Protein Kinases: Important Regulators of the Parasite Life Cycle and Molecular Targets for Treating Leishmaniasis
Previous Article in Special Issue
Persistent Southern Tomato Virus (STV) Interacts with Cucumber Mosaic and/or Pepino Mosaic Virus in Mixed- Infections Modifying Plant Symptoms, Viral Titer and Small RNA Accumulation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Arabidopsis thaliana Genes Associated with Cucumber mosaic virus Virulence and Their Link to Virus Seed Transmission

1
Unidad de Fisiología Vegetal, Departamento Ciencias Farmacéuticas y de la Salud, Facultad de Farmacia, Universidad San Pablo-CEU Universities, Boadilla del Monte, 28003 Madrid, Spain
2
Servicio de Reumatología, Hospital Universitario de la Princesa, Instituto de Investigación Sanitaria (IIS-IP), 28006 Madrid, Spain
3
Centro de Biotecnología y Genómica de Plantas UPM-INIA and Departamento de Biotecnología-Biología Vegetal, E.T.S. Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, 28045 Madrid, Spain
*
Author to whom correspondence should be addressed.
Microorganisms 2021, 9(4), 692; https://doi.org/10.3390/microorganisms9040692
Submission received: 5 March 2021 / Revised: 22 March 2021 / Accepted: 23 March 2021 / Published: 27 March 2021
(This article belongs to the Special Issue Plant Viruses: From Ecology to Control)

Abstract

:
Virulence, the effect of pathogen infection on progeny production, is a major determinant of host and pathogen fitness as it affects host fecundity and pathogen transmission. In plant–virus interactions, ample evidence indicates that virulence is genetically controlled by both partners. However, the host genetic determinants are poorly understood. Through a genome-wide association study (GWAS) of 154 Arabidopsis thaliana genotypes infected by Cucumber mosaic virus (CMV), we identified eight host genes associated with virulence, most of them involved in response to biotic stresses and in cell wall biogenesis in plant reproductive structures. Given that virulence is a main determinant of the efficiency of plant virus seed transmission, we explored the link between this trait and the genetic regulation of virulence. Our results suggest that the same functions that control virulence are also important for CMV transmission through seeds. In sum, this work provides evidence of a novel role for some previously known plant defense genes and for the cell wall metabolism in plant virus interactions.

1. Introduction

Viruses are major plant pathogens due to the detrimental effect of their infections on the host (i.e., to their virulence) [1]. Indeed, they have great impact on agronomic production worldwide, being the second most important cause of economic losses in crops only behind fungi [2] and accounting for the largest fraction of plant emerging diseases [3]. Hence, understanding the genetic basis of virulence in plant–virus interactions is central in minimizing the damage of virus epidemics.
Although virulence can be intuitively viewed as a pathogen-controlled trait, a large body of evidence indicates that in plant–virus interactions it is modulated by both host and pathogen genetic determinants [4,5,6,7]. Plant virus genes and even single mutations determining virulence have been extensively characterized, e.g., [8,9]. In parallel, plant genetic determinants have been also studied although at a lesser extent [8,10,11]. Interestingly, most of the works on the “host side” addressed the question from a plant pathology perspective. In this context, virulence is often defined either as the virus capacity to gain entrance to the plant or as the virus ability to induce symptoms, generally in the vegetative structures [12]. On the one hand, the use of the former definition of virulence led mostly to the identification of plant genes conferring resistance/immunity to virus infection, such as those related to the plant hypersensitive response, RNA silencing machinery or systemic acquired resistance [13,14,15]. On the other hand, host genes regulating virus cell-to-cell and long-distance movement, and plant photosynthesis and development [5,11,16] have been identified as virulence determinants when symptom severity was used as a proxy. Notably, most of these studies quantified virulence as a qualitative trait (susceptibility vs. immunity and mild vs. severe symptoms), which may limit the power of the approach to identify genetic determinants by reducing the information contained in qualitative variables as compared with analyses based on quantitative ones [17]. Moreover, virulence is also manifested in other plant traits, such as their ability to reproduce and survive, which are often not considered in the above-mentioned studies [18] but are relevant in agricultural settings and central in other ecological contexts in which virus infections are also commonplace.
Accumulating data indicates that, besides being of agronomic importance, plant viruses are also major ecological agents in wild ecosystems. Viral infections can drastically reduce the number of individuals in wild plant populations by decreasing the competitive or reproductive abilities of infected individuals [3,19,20]. As a consequence, wild plants evolved quantitative resistance and tolerance in response to infection [21,22,23], which suggests that viruses shape the genetic composition of the host population. Indeed, there is evidence that virus infections may act as a selective force for wild plant populations [23]. In wild ecosystems, the main impact of virus infections on the host population is through their detrimental effect on plant fitness [1]. Thus, analyses of virulence from an ecological or evolutionary perspective generally used the effect of infection on plant progeny as a proxy, but very little is known on the host genes regulating the effect of virus infection on plant fitness [18,24]. Certainly, the development of plant symptoms in vegetative structures may potentially impact plant fitness and it is likely that both traits have genetic associations [10]. However, the effect of infection on plant growth and on plant reproduction are not necessarily linked, e.g., [20,25,26]. Hence, the host genetic determinants of virulence as the effect of virus infection on plant progeny production are still poorly understood.
Under this definition, virulence has a direct impact not only on the host fitness but also on the pathogen’s. For instance, for plant viruses that are horizontally transmitted through insect vectors or by contact, it affects the number of available hosts [1]. More importantly, seed production is a major fitness component for vertically transmitted plant viruses, which accounts for at least 25% of all known species [27]. Indeed, we have recently shown that virulence is one of the main determinants of Cucumber mosaic virus (CMV) and Turnip mosaic virus (TuMV) seed transmission rate in Arabidopsis thaliana [28]. These results suggest that these two traits might have at least partially overlapping genetic regulations, although such link has not been explored yet.
To address these questions, we utilized the interaction between Cucumber mosaic virus (CMV, Bromoviridae) and Arabidopsis thaliana (from here on “Arabidopsis”, Brassicaceae) as model. CMV is the plant RNA virus with the broadest host range, infecting about 1200 species in more than 100 plant families, including Arabidopsis [29]. This virus is horizontally transmitted by aphids and in Arabidopsis is also seed transmitted with an efficiency that depends on the host per virus genotype x genotype interaction, indicating that the host is involved in controlling this trait [28]. CMV is commonly found in wild populations of Arabidopsis at up to 80% prevalence [21], and therefore the Arabidopsis–CMV interaction is relevant in nature. Indeed, recent work strongly suggests that CMV infection selects for defenses in Arabidopsis populations of the Iberian Peninsula [23]. This geographic region has been shown to contain the largest Arabidopsis genetic diversity in Eurasia due to its role as refugia during the last glaciations [30,31]. Accordingly, substantial genetic variation has been described for relevant adaptive traits including diversity for responses to pathogens [23,32,33,34]. For instance, the infection of ten Iberian Arabidopsis genotypes, representing the variation of the species in this region, with different CMV and TuMV strains showed that virulence (quantitatively measured as the effect of infection on viable seed production) is also controlled by both virus and host genetic determinants [4,23,26].
In this work, we aim at characterizing the Arabidopsis genetic determinants associated with the effect of CMV infection on plant progeny production as a measure of virulence, and to explore the link between the generic control of virulence and of CMV seed transmission rate. To do so, we performed a genome-wide association study (GWAS) with 154 Arabidopsis genotypes from the Iberian Peninsula, for which annotated genomes are publicly available [35] and using Arabidopsis seed production in infected and non-infected plants as the quantitative relevant trait. We further investigated whether the identified genetic determinants of virulence play also a role in modulating CMV seed transmission rate.

2. Materials and Methods

2.1. Plant Material

We used 165 Arabidopsis genotypes from the Iberian Peninsula (Figure 1 and Table S1). Genotypes were collected from different populations and selected to cover the genetic and environmental diversity of the species in the region [36,37]. This collection spanned 800 km × 700 km, populations being spaced 384.9 ± 3.7 km on the average (from 20.2 to 1038.1 km). Altitudes ranged from 123 to 1670 m above sea level. Each sample was genetically different based on previous single nucleotide polymorphism (SNP) genotyping and genome sequences [38,39]. Similar sets of Arabidopsis genotypes have been previously shown to be powerful for fine mapping of genomic regions associated with natural variation of quantitative traits, such as plant life history traits, including seed weight [40], seed dormancy [41], and flowering time [39] both under field and greenhouse conditions.
All genotypes used in this study were propagated by selfing during two generations by the single seed descent procedure, in a glasshouse supplemented with lamps to pro- vide a long-day photoperiod. This allowed for the reduction of residual heterozygosity that might contain some wild individuals but also removed any potential maternal and grand-mother effects. Seeds were stratified (darkness, 4 °C) for 7 days before germination at 25/20 °C day/night, 16 h light in a greenhouse. Ten-day-old seedlings were transferred to 4 °C, 8 h light, for vernalization for 8 weeks. After vernalization, plants were transplanted to 0.43 L pots containing a 3:1 peat-vermiculite mix and returned to the greenhouse, where they were kept at 25/20 °C day/night and 16 h light (intensity, 120 to 150 mol s/m2) until the end of the experiment.

2.2. Virus Isolate and Inoculation

Fny-CMV (GenBank accession numbers NC_002034, NC_002035 and NC_001440) was derived from biologically active clones [42] by in vitro transcription with T7 RNA polymerase (New England Biolabs, Ipswich, MA, USA), and transcripts were used to infect Nicotiana benthamiana plants for virus multiplication. Six Arabidopsis plants per genotype were mechanically inoculated with N. benthamiana CMV-infected tissue ground in a solution containing 0.1 M Na2HPO4, 0.5 M NaH2PO4, and 0.02% DIECA (0.01 M phosphate buffer (pH 7.0), 0.2% sodium diethyldithiocarbamate), and four plants per genotype were mock-inoculated with inoculation buffer. Inoculations were carried out when plants were at developmental stages 1.05 to 1.06 [43]. After inoculation, all individuals were randomized in the greenhouse. The efficiency of inoculations was determined by detecting virus presence in all plants. To do so, 15 days post-inoculation three disks with a diameter of 4 mm were collected from different systemically infected rosette leaves. From these plant samples, total RNA extracts were obtained using TRIzol® reagent (Life Technologies, Carlsbad, CA, USA), and 10 ng of total RNA was added to a Brilliant III Ultra-Fast SYBR green qRT-PCR master mix (Agilent Technologies, Santa Clara, CA, USA) according to the manufacturer’s recommendations. Specific primers were used to amplify a 154-nt fragment of the CMV MP gene [28]. Each plant sample was assayed in duplicate on a LightCycler 480 II real-time PCR system (Roche, Indianapolis, IN, USA). The rate of inoculation success was 98.8%. A similar procedure was used to confirm CMV absence in mock-inoculated plants.

2.3. Quantification of Virulence

Seeds were harvested at complete plant senescence, and the total seed weight per plant (SW) was obtained for infected and mock-inoculated plants. Virulence (V) was estimated as 1 minus the ratio of the total seed weight of each infected plant (SWi) to the averaged total seed weight of mock inoculated (SWm) plants from the same genotype.
Seed viability was measured as the germination percentage of 200 seeds per plant. Germination assays were carried out at least 60 days after harvesting to avoid differences on seed dormancy. Relative differences in seed viability between infected and control plants were used to correct V values such that only viable seed production was reflected.

2.4. Efficiency of Virus Seed Transmission

The efficiency of CMV seed transmission was estimated as the percentage of infected seeds that gave rise to infected progeny in grow-out tests. One-hundred seeds per replicate were washed in a 10% bleach solution. Then, seeds were placed into Petri dishes containing Murashige–Skoog medium, stratified for 5 days at 4 °C, and germinated in a growth chamber at 22 °C, under 16 h of light (intensity, 120 to 150 mol s/m2). Following [28], seedlings at 15 days poststratification were pooled in groups of 5 for a total of 20 groups per replicate. These groups were tested for the presence of CMV via qRT-PCR as described above. The percentage of virus-infected seeds (ST) was estimated using a Poisson distribution as: p = 1 − (1 − y/n)1/k, where p is the probability of virus transmission by a single seed, y is the number of positive samples, n is the total number of samples assayed (n = 20), and k is the number of seedlings per sample (k = 5).

2.5. Data Treatment and Statistical Analyses

In the 165 Arabidopsis genotypes, we analyzed the average variance in V across plants of the same genotype. Within-genotype variance in this trait larger than two-fold the average variance in the whole set of genotypes was considered as indicative of an unreliable estimate of V. Nine Arabidopsis genotypes showed such inflated variance and were eliminated from the analysis and two more were deleted due to low number of replicates because of inoculation failure, and thus that the final dataset used for the GWAS contained 154 genotypes. We explored the phenotypic variation of V across Arabidopsis genotypes using the following general linear mixed model (GLMM): V = μ + Genotype + ε, where μ is the overall mean of the phenotypic data, “genotype” corresponds to the genetic differences among the selected Arabidopsis genotypes, and ε is the residual error term. Normality of the residuals was achieved through (V + 2)5.36 transformation (Kolmogorov–Smirnov test p-value = 0.143). The factor “genotype” was treated as a random factor.
Broad-sense heritability was estimated as h2b = VG/(VG + VE), where VG is the among-genotypes variance component and VE is the residual variance. Variance components were determined using GLMMs by the Restricted Maximum Likelihood (REML) method [44] as implemented in the R-library lme4 [45]. Statistical analyses were conducted using R version 3.6.0 [46].

2.6. Genome-Wide Association Study (GWAS)

The 154 Arabidopsis genotypes have been genotyped for 4,932,457 million single nucleotide polymorphisms (SNPs) evenly spaced across the genomes [35]. Following [39], only SNPs present in at least 55% of the genotypes and with minor allele relative frequency (MAF) > 0.03 were considered in this study, resulting in a total of 2,071,858 SNPs from which 88.6% were present in all accessions. GWAS was run for virulence using the FarmCPU (fixed and random model circulating probability unification) as implemented in the R package GAPIT v3 [47]. FarmCPU minimizes false positives by accounting for linkage disequilibrium between SNPs, which reduces model overfitting as compared to other methods [48]. To determine the optimal number of principal components to include, forward model selection using the Bayesian information criterion (BIC) was conducted. The Manhattan plot was constructed with R library rMVP [49]. Following previous studies [50,51,52], a threshold of −log10(p) ≥ 4 was considered to identify SNPs associated with natural variation of the trait measured in this study. Within this set of SNPs, we conservatively focused on those below the false discovery rate (FDR) threshold in order to minimize false positives/negatives. For our data set, FDR = 1 × 10−8. The functional annotation of SNPs was carried out using SnpEff v.4.1 [53] and the TAIR database v.10 [54]. Gene ontology (GO) annotation enrichment was tested with PANTHER v16.0 using the binomial test [55] and visualized using SimRel for semantic similarity measure with a threshold of C = 0.7, as implemented in REVIGO [56].

2.7. Bayesian Sparse Linear Mixed Model (BSLMM)

BSLMM as implemented in the GEMMA (genome-wide efficient mixed model association) software was used to infer the genetic architecture of CMV virulence in Arabidopsis [57], which allows testing whether the analyzed trait is determined by many loci of small effect (polygenic) or rather determined by a few loci of large effector (oligogenic). Monte Carlo Markov chains (MCMCs) were run for 10 million generations (recording every ten steps), with 10% discarded as burn-in. MAF cut-off was set at 3% and a normalized kinship matrix, more appropriate if SNPs with large effects have low MAF, was included. Normalized kinship matrix was estimated with Tassel 5.2.70 software [58]. The proportion of phenotypic variance explained by the available genotypes (PVE) was used as an estimator of the heritability of a given phenotypic trait. PVE is a flexible Bayesian equivalent of the narrow-sense heritability (h2). BSLMM also estimated the proportion of genetic variance explained by sparse effects (PGE); that is, the proportion of variance explained by loci with large effects.

2.8. Selection of SNPs Linking CMV Virulence and Seed Transmission Rate

A Random Forest (RF) analysis was made to determine the SNPs associated with both CMV virulence and seed transmission rate. In this analysis, we used seed transmission rate estimates of 35 Arabidopsis genotypes with extreme virulence phenotypes as the response variable, and as the input, only the SNPs associated with CMV virulence as determined by GWAS (−log10(p) ≥ 4). Arabidopsis genetic group [39] and CMV virulence values were included as covariates (randomForest R package [59]). The model was run for 2000 trees and mtry = 2, as estimated with the trainControl function from the caret R package [60]. Significant association of each SNP with the response variable was estimated based on the empirical null distribution of SNPs with no importance in this variable following [61], as implemented in the r2VIM R package [62]. The relative importance of each SNP in CMV seed transmission rate was quantified as % increase in mean square error (MSE), that is, the increase in the error made by the RF in predicting the trait when the SNP is removed from the analysis.

3. Results

3.1. Natural Variation for CMV Virulence in Arabidopsis Genotypes

Overall, CMV-infected plants developed symptoms ranging from mild mosaic to leaf necrosis (Figure S1). Virus infection reduced plant progeny production, showing medium–high virulence in Arabidopsis genotypes (V = 0.637 ± 0.035; that is, infected plants produced on average 64% less seeds than mock-inoculated ones). However, virulence greatly varied across Arabidopsis genotypes, from plant sterilization (V = 1 in 13 genotypes) to overcompensation of the effect of infection on plant progeny production (negative values of V in 16 genotypes, with minimum values of −1.869 in genotype Lam-0) (Figure 2 and Table S1). This ample variation did not depend on the Arabidopsis genetic group (F = 1.801; p = 0.150) but did vary according to the plant genotype (F = 12.619; p = 1 × 10−4).
The broad-sense heritability of CMV virulence in Arabidopsis showed moderate–high values: h2b = 0.68. Therefore, there was significant genetic variation among the studied Arabidopsis genotypes for the analyzed trait, such that the utilized population allowed a meaningful GWAS.

3.2. Genetic Architecture of Virulence in Arabidopsis

The 154 Arabidopsis genotypes accounted for 2.07 million SNPs, which were included in a BSLMM to evaluate how much of the variance in CMV virulence was explained by these SNPs. A moderate percentage of the phenotypic variance was explained by the genotyped SNPs [PVE: 51%; 95% ETPI (equal-tail probability intervals): 15–76%]. The strength of association of a SNP with CMV virulence above or equal to 0.25 was considered an indication of a large effect SNP that contributed the most to the phenotype. These large effect variants explained 58% (PGE: 58%; 95% ETPI: 32–96%) of the variance in CMV virulence. The number of SNPs with a large effect size was low (mean: 10; 95% ETPI: 4–18). Thus, the BSLMM indicated that Arabidopsis genetic architecture of CMV virulence involved few SNPs with detectable large effects.
In parallel with the BSLMM, we performed a GWAS. The results indicated that 223 SNPs were significantly associated with CMV virulence in Arabidopsis (−log10(p) ≥ 4), with effect size varying from 0.08 to 0.42 (Table S2). To provide global insights into the biological processes associated with CMV virulence, we performed a GO term enrichment analysis based on the 223 detected SNPs (Figure 3). We identified 37 enriched GO terms (p < 1 × 10−3), which were reduced to 30 after eliminating redundant terms (Table S3). Similarity analyses of these 30 GO terms indicated that the largest clusters grouped functions related to response to stress (eight GO terms), cell wall biogenesis and cellular metabolisms (four GO terms each) and glucosinolate catabolism (three GO terms). GO terms in these four categories accounted for 63% of all enriched terms (Figure 3). Most other terms corresponded to general housekeeping functions (Table S3). One exception was SNP enrichment in genes related to seed coat development, a function highly related to progeny production (Figure 3).
Only eight of the 223 SPNs passed the FDR threshold (FDR-adjusted p-value < 3 × 10−3 and −log10(p) ≥ 8) (Figure 4), representing most of the SNPs with the largest effects in virulence (effect size > 0.20) (Table S2). With few exceptions (nine SNPs, four of them affecting the same gene), the rest of the 215 SNPs had effects lower than 0.20. Therefore, only a reduced number of SNPs with large effects size were strongly associated with CMV virulence in Arabidopsis, in agreement with BSLMM.
Altogether, our analyses indicate that Arabidopsis genetic determinants of CMV virulence are mainly involved in functional categories associated with stress response and cell wall biogenesis. Of the identified SNPs, a few have detectable large effects and are strongly associated with the analyzed trait, whereas a larger number of SNPs are less strongly associated with virulence, most of them with smaller effect size.

3.3. Arabidopsis Genetic Determinants of CMV Virulence Identified by GWAS

We focused on the eight SNPs that passed the FDR threshold (Figure 4). MAF values for these SNPs varied between 0.08 and 0.11. For all of them, the minor variant had a negative effect on CMV virulence (Table 1).
The two Arabidopsis SNPs with the strongest association with CMV virulence were located at genes encoding a GDP-d-mannose 4,6-dehydratase (GMD1) and a peptate lyase-like protein (PLL18) (FDR < 6.53 × 10−7). Both genes are involved in cell wall metabolism [63,64]. Of the remaining six SNPs strongly associated with virulence, three were involved in response to biotic and abiotic stresses: heat shock protein 20-like (HSP20-like), the zinc finger protein ZAT8 and the late upregulated in response to Hyaloperonospora parasitica protein 1 (LURP1) [65,66,67,68,69]. Another two were related to different biological processes such as flowering time (ROTUNDIFOLIA like 13 protein, RTFL13) [70] and DNA methylation (ORTHRUS-like protein, ORTHL) [71], and one had unknown function (AT1G53635). To confirm the association of these SNPs with CMV virulence, we performed General Linear Models (GLMs) comparing CMV virulence in Arabidopsis genotypes with different variants in the eight detected SNPs. In all cases, analyses indicated significant differences (F1,142–152 ≥ 8.74; p < 3.6 × 10−3), supporting the role of these SNPs in the analyzed trait, with minor variants always showing a reduced virulence (Figure 5).
In summary, the Arabidopsis SNPs strongly associated with CMV virulence are located in genes mostly involved in cell wall biogenesis and in stress response in agreement with GO term enrichment analyses, and minor variants of the SNPs in these genes generally reduce CMV virulence.

3.4. Link between Arabidopsis Genetic Determinants of CMV Virulence and Seed Transmission

To analyze the potential genetic links between CMV virulence and seed transmission rate in the Arabidopsis genome, we selected 35 plant genotypes representing extreme virulence phenotypes in which we estimated the per cent of CMV seed transmission (ST) (Table S1). ST varied from absence of infected seeds (genotypes Cdc-3, Iso-4 and Lum-0) to 100% of infected seeds (genotypes Alm-0, Amu-0, Ang-0 and Aul-0). Then, we performed a RF analysis including the 223 SNPs associated with CMV virulence as predictors of virus seed transmission rate. The RF detected only seven of these SNPs as significantly associated with this trait (p < 0.041) (Table 2). None of the eight SNPs strongly associated with virulence was so with virus seed transmission rate.
GLMs comparing CMV seed transmission rate in Arabidopsis genotypes with different variants in the seven SNPs indicated significant differences only for the two with the highest % MSE (F1,34 ≥ 8.74; p < 0.033) (Figure 6). The two SNPs associated with both CMV virulence and seed transmission rate were located in genes encoding the calcineurin B-like (CBL)-interacting protein kinase 2 (CIPK2) and the MOS4-ASSOCIATED COMPLEX SUBUNIT 5C protein (MAC5C). The former is a calcium regulated protein [72], and the latter has been reported as being involved in cell wall metabolism [73].
Hence, Arabidopsis genes involved in both CMV virulence and seed transmission rate control stress responses and cell wall metabolism.

4. Discussion

Most analyses of the host genetic determinants of plant virus virulence have identified genes involved in symptom development and/or plant susceptibility [8,11]. In contrast, genes associated with the effect of virus infection on plant progeny production are poorly characterized [74]. Identifying this genetic control is of great relevance to understand plant–virus interactions from an agronomic and ecological perspective: first, seed production is a key trait for wild plant population dynamics and a trait of agronomic importance [75,76]. Second, it modulates the chances for virus horizontal transmission (i.e., determines the number of available susceptible individuals) [1], and is central for viruses that disperse through seeds [27,77]. Here, through a GWAS including 154 plant genotypes, we identified Arabidopsis genes involved in the effect of CMV infection on seed production and we explored the potential link of these genes with the efficiency of virus seed transmission.
We identified SNPs in eight plant genes strongly associated with the effect of CMV infection on seed production. Three of these genes (HSP20-like, ZAT8 and LURP1) were related to the response to biotic and abiotic stresses. The first two genes have been shown to be overexpressed in response to heat and salinity [78,79]. However, there is also evidence of their role in plant response to pathogens. For instance, many HSPs have been reported to alter their expression patterns upon infection by multiple pathogens [80]. Indeed, in N. benthamiana and rice HSP20 expression has been shown to change upon infection by Impatiens necrotic spot virus and Rice stripe virus, respectively, facilitating virus replication and within-host movement [68,81]. Similarly, Arabidopsis transcription factor ZAT8 knock-out mutants have been reported to have higher susceptibility to Pseudomonas syringae [67], and in the same host other transcription factors of the ZAT family are responsive to infection by viruses, including CMV [82]. The third gene, LURP1, is also regulated in response to biotic stresses, being required for basal defense to Hyaloperonospora arabidopsidis [69]. Interestingly, our results are not the first evidence of a link between defenses to H. arabidopsidis and CMV. Another well-characterized resistance gene to this oomycete (RPP8) is allelic to RCY-1, which confers resistance to the isolate Y of CMV [83]. These results and ours suggest that H. arabidopsidis and CMV infection may trigger common defense pathways. In nature, Arabidopsis plants are commonly infected by both CMV and H. arabidopsidis, both pathogens imposing selective pressures on the host [21,23,84], such that common defense pathways would be selectively advantageous while minimizing energy investment at the same time. The evolutionary processes leading to this common solution is an interesting avenue of future research. It should be noted that the expression of HSP20-like, ZAT8 and LURP1 has been mostly studied in vegetative plant tissues, and their effect has been generally linked to reduced pathogen load and/or lower effect of infection on plant growth. The association of these three genes with the effect of infection on seed production might be explained if: (i) their differential expression in vegetative tissues affects plant reproduction, or (ii) these genes are also differentially regulated in plant reproductive organs. In any case, our results provide tentative evidence of a novel function for these genes in plant–virus interactions.
Perhaps more interestingly, the SNPs with the strongest association, and the largest effect, with virulence were located in genes related to the cell wall metabolism (GMD1 and PLL18). Current information on changes in the composition, structure and function of plant cell walls in response to infection has been largely restricted to non-viral pathogens [85,86]. The scant work on plant–virus interactions focused on the association between cell wall modifications and virus cell-to-cell and, to a lesser extent, systemic movement. For instance, pectin methylesterases and hydroxyproline-rich glycoproteins are needed for efficient Tobacco mosaic virus and Potato virus Y movement in tobacco and potato, respectively [87,88,89]. It could be argued that GMD1 and PLL18 work in a similar way: certain variants of these proteins might facilitate virus cell-to-cell and systemic colonization from entry points at plant leaves and therefore promote the invasion of reproductive structures. However, none of the two proteins encoded by these genes are significantly expressed in Arabidopsis leaves. Rather, their expression is largely restricted to flowers and pollen grains as they are mainly involved in the synthesis and remodeling of cell walls in these organs [63,64]. Hence, we hypothesize that CMV infection alters the functioning of these proteins, leading to the malformation of cell walls in flowers and/or pollen, which may affect plant fertility and therefore seed production. Arabidopsis genotypes with variants in these genes less prone to CMV effects would suffer less from infection. Further functional analyses of these GMD1 and PLL18 upon virus infection will allow testing this hypothesis. Regardless the mechanism involved, our results reveal a new role of the cell wall in plant–virus interactions as a regulator of the effect of infection on plant fitness. The identification of four large effect size SNPs that did not pass the FDR threshold but were significantly associated with CMV virulence within a gene encoding the prolyl 4-hydroxylase 11 (P4H11) (Table S2), which is also involved in cell wall biogenesis [90], and the results of our GO term enrichment, further support this conclusion.
The other two Arabidopsis genes strongly associated with CMV virulence were RTFL13 and ORTHL. RTFL13 has been shown to be a regulator of flowering time that is expressed during cambium formation in Arabidopsis inflorescences [70]. There is ample evidence of the link between flowering time and the effect of virus infection on seed production [25,26,91]. Although the genetic bases are unknown, flowering genes are thought to control this process [18]. In agreement, RTFL13 would be a candidate gene to be involved in such control. As for ORTHL, an epigenetic regulator of embryogenesis [92], any effect of virus infection on embryo development would directly impact plant fitness. Our results suggest that such effects would occur through alterations of the epigenetic regulation of plant reproduction. Indeed, virus infections have been repeatedly shown to alter plant epigenetics (including DNA methylation) [93,94]. Hence, the identification of these two genes as potential regulators of CMV virulence support the idea that at least part of the host control of this trait occurs at the plant reproductive stage, and independently of the virus effects on vegetative structures.
Although none of the Arabidopsis SNPs strongly associated with CMV virulence were also modulating virus seed transmission rate, our RF analysis identified two SNPs that were linked to both traits: CIPK2 and MAC5C. CIPK2 is a protein kinase associated to calcineurin B-like protein (CBLs), which is a group of calcium sensors. In plants, calcium fluxes are important for sensing a variety of stimuli, including abiotic conditions and pathogens, and in these processes both CBLs and CIPKs are involved [72,95]. For instance, tobacco CIPK2 has been shown to be involved in tolerance to drought [72]. Although in Arabidopsis the role of CIPK2 is not known, other CIPKs (such as CIPK6) have been demonstrated to induce plant resistance to pathogens when silenced. Thus, it could be hypothesized that CIPK2 acts in a similar way in response to CMV. Interestingly, CIPK2 is highly expressed in flowers [96], which may explain its common role in virulence and seed transmission: the differential expression of CIPK2 may induce local resistance to the virus in flowers, minimizing the effects of infection in seed production and preventing seed invasion. Similarly, MAC5C, part of the secondary cell wall regulation network [73], is expressed during seed germination [97]. In infected seeds, MAC5C may contribute to minimize the effect of CMV presence on secondary wall metabolism, promoting seed viability and favoring virus seed transmission. As part of the MAC complex, MAC5C also has a role in plant resistance to pathogens [98]. The specific mechanism by which MAC5C is involved in plant defenses is not well characterized [99]. However, this work was restricted to analysis in plant vegetative structures. Similarly to CIPK2, MAC5C could also influence virulence and seed transmission rate by restricting virus invasion of reproductive organs. It is worth mentioning that here we did not quantify CMV load in the reproductive plant organs, which would have contributed to understand whether certain variants of these two genes prevent virus invasion. On the other hand, we show that variants with lower virulence had higher seed transmission rate, suggesting a negative correlation between both traits, which is in agreement with our previous results [28], strengthening the evidence of a role of these genes as multi-trait regulators.
In summary, this work identifies genes related to stress response and cell wall metabolism as potential regulators of the effect of virus infection on plant progeny production and of virus seed transmission rate. Although functional analyses are needed to validate these results, our GWAS provides tentative evidence of novel roles for these plant genes in plant–virus interactions, opening a new avenue of future research.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/microorganisms9040692/s1, Figure S1: Symptoms induced by CMV infection in Arabidopsis. Upper line shows mock-inoculated plants and lower line CMV-infected plants of Arabidopsis genotypes Cem-0 (A and E), Gra-0 (B and F), Ses-0 (C and G) and Vas-0 (D and H). Table S1: List of Arabidopsis genotypes utilized in the GWAS and average virulence (V) and seed transmission rate (ST) values per genotype; Table S2: SNPs in the Arabidopsis genome associated with CMV virulence; Table S3: GO enriched terms in the 223 host SNPs associated with CMV virulence in Arabidopsis thaliana.

Author Contributions

Conceptualization, I.P.; formal analysis, N.M.; investigation, A.C. and M.G.-V.; data curation, I.P. and N.M.; writing—original draft preparation, I.P.; writing—review and editing, N.M., A.C. and E.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Plan Nacional I + D + i, Ministerio de Economía y Competitividad (Agencia Nacional de Investigación), Spain [PID2019-109579RB-I00] to IP, by Ministerio de Economía y Competitividad (Instituto de Salud Carlos III) [PIE13/00041], and by the “Severo Ochoa Program for Centres of Excellence in R&D” from the Agencia Estatal de Investigación of Spain, grant SEV-2016-0672 (2017–2021) to the CBGP. A.C. was supported by a Formación de Personal Investigador contract from MINECO (BES-2017-080783).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available as supplementary material.

Acknowledgments

Marisa López-Herranz provided excellent technical support.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Read, A.F. The evolution of virulence. Trends Microbiol. 1994, 2, 73–76. [Google Scholar] [CrossRef]
  2. Oerke, E.-C. Crop losses to pests. J. Agric. Sci. 2006, 144, 31–43. [Google Scholar] [CrossRef]
  3. Anderson, P.K.; Cunningham, A.A.; Patel, N.G.; Morales, F.J.; Epstein, P.R.; Daszak, P. Emerging infectious diseases of plants: Pathogen pollution, climate change and agrotechnology drivers. Trends Ecol. Evol. 2004, 19, 535–544. [Google Scholar] [CrossRef] [PubMed]
  4. Pagán, I.; Alonso-Blanco, C.; García-Arenal, F. The relationship of within-host multiplication and virulence in a plant-virus system. PLoS ONE 2007, 2, e786. [Google Scholar] [CrossRef] [Green Version]
  5. Pallás, V.; García, J.A. How do plant viruses induce disease? Interactions and interference with host components. J. Gen. Virol. 2011, 92, 2691–2705. [Google Scholar] [CrossRef]
  6. Paudel, D.B.; Sanfaçon, H. Exploring the diversity of mechanisms associated with plant tolerance to virus infection. Front. Plant Sci. 2018, 9, 1575. [Google Scholar] [CrossRef] [PubMed]
  7. Shukla, A.; Pagán, I.; García-Arenal, F. Effective tolerance based on resource reallocation is a virus-specific defence in Arabidopsis thaliana. Mol. Plant Pathol. 2018, 19, 1454–1465. [Google Scholar] [CrossRef] [Green Version]
  8. García, J.A.; Pallás, V. Viral factors involved in plant pathogenesis. Curr. Opt. Virol. 2015, 11, 21–30. [Google Scholar] [CrossRef] [PubMed]
  9. Palukaitis, P. Determinants of pathogenesis. In Cucumber Mosaic Virus; Palukaitis, P., García-Arenal, F., Eds.; APS Press: Washington, DC, USA, 2019; pp. 145–154. [Google Scholar]
  10. Ascencio-Ibáñez, J.T.; Sozzani, R.; Lee, T.J.; Chu, T.M.; Wolfinger, R.D.; Cella, R.; Hanley-Bowdoin, L. Global analysis of Arabidopsis gene expression uncovers a complex array of changes impacting pathogen response and cell cycle during geminivirus infection. Plant Physiol. 2008, 148, 436–454. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  11. García-Ruíz, H. Host factors against plant viruses. Mol. Plant Pathol. 2019, 20, 1588–1601. [Google Scholar] [CrossRef] [PubMed]
  12. Dieckmann, U.; Metz, J.A.; Sabelis, M.W.; Sigmund, K. Adaptive dynamics of infectious diseases. In Pursuit of Virulence Management; Cambridge University Press: Cambridge, UK, 2002. [Google Scholar]
  13. Kang, B.-C.; Yeam, I.; Jahn, M.M. Genetics of plant virus resistance. Annu. Rev. Phytopathol. 2005, 43, 581–621. [Google Scholar] [CrossRef] [Green Version]
  14. de Ronde, D.; Butterbach, P.; Kormelink, R. Dominant resistance against plant viruses. Front. Plant Sci. 2014, 5, 307. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Faoro, F.; Gozzo, F. Is modulating virus virulence by induced systemic resistance realistic? Plant Sci. 2015, 234, 1–13. [Google Scholar] [CrossRef] [PubMed]
  16. Carr, J.P.; Murphy, A.M. Host responses: Susceptibility. In Cucumber Mosaic Virus; Palukaitis, P., García-Arenal, F., Eds.; APS Press: Washington, DC, USA, 2019; pp. 47–58. [Google Scholar]
  17. Drummond, D.A.; Raval, A.; Wilke, C.O. A single determinant dominates the rate of yeast protein evolution. Mol. Biol. Evol. 2006, 23, 327–3337. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  18. Pagán, I.; García-Arenal, F. Tolerance to plant pathogens: Theory and experimental evidence. Int. J. Mol. Sci. 2018, 19, 810. [Google Scholar] [CrossRef] [Green Version]
  19. Malmstrom, C.M.; McCullough, A.J.; Johnson, H.A.; Newton, L.A.; Borer, E.T. Invasive annual grasses indirectly increase virus incidence in California native perennial bunchgrasses. Oecologia 2005, 145, 153–164. [Google Scholar] [CrossRef]
  20. Vijayan, V.; López-González, S.; Sánchez, F.; Ponz, F.; Pagán, I. Virulence evolution of a sterilizing plant virus: Tuning multiplication and resource exploitation. Virus Evol. 2017, 3, vex033. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Pagán, I.; Fraile, A.; Fernández-Fueyo, E.; Montes, N.; Alonso-Blanco, C.; García-Arenal, F. Arabidopsis thaliana as a model for the study of plant–virus co-evolution. Philos. Trans. R. Soc. Lond. B 2010, 365, 1983–1995. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Moreno-Pérez, M.G.; Pagán, I.; Aragón-Caballero, L.; Cáceres, F.; Fraile, A.; García-Arenal, F. Ecological and genetic determinants of Pepino mosaic virus emergence. J. Virol. 2014, 88, 3359–43368. [Google Scholar] [CrossRef] [Green Version]
  23. Montes, N.; Alonso-Blanco, C.; García-Arenal, F. Cucumber mosaic virus infection as a potential selective pressure on Arabidopsis thaliana populations. PLoS Pathog. 2019, 15, e1007810. [Google Scholar] [CrossRef] [Green Version]
  24. Pagán, I.; García-Arenal, F. Tolerance of plants to plant pathogens: A unifying view. Annu Rev. Plant Pathol. 2020, 18, 9.1–9.20. [Google Scholar] [CrossRef]
  25. Pagán, I.; Alonso-Blanco, C.; García-Arenal, F. Host responses in life-history traits and tolerance to virus infection in Arabidopsis thaliana. PLoS Pathog. 2008, 4, e1000124. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Montes, N.; Vijayan, V.; Pagán, I. Trade-offs between host tolerances to different pathogens in plant-virus interactions. Virus Evol. 2020, 6, veaa019. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Sastry, K.S. Seed-Borne Plant Virus Diseases; Springer: New Delhi, India, 2013. [Google Scholar]
  28. Cobos, A.; Montes, N.; López-Herranz, M.; Gil-Valle, M.; Pagán, I. Within-host multiplication and speed of colonization as infection traits associated with plant virus vertical transmission. J. Virol. 2019, 93, e01078-19. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Palukaitis, P.; García-Arenal, F. Cucumber Mosaic Virus; APS Press: Washington, DC, USA, 2019. [Google Scholar]
  30. Brennan, A.C.; Méndez-Vigo, B.; Haddioui, A.; Martínez-Zapater, J.M.; Picó, F.X.; Alonso-Blanco, C. The genetic structure of Arabidopsis thaliana in the south-western Mediterranean range reveals a shared history between North Africa and southern Europe. BMC Plant Biol. 2014, 14, 17. [Google Scholar] [CrossRef] [Green Version]
  31. Lee, C.R.; Svardal, H.; Farlow, A.; Exposito-Alonso, M.; Ding, W.; Novikova, P.; Alonso-Blanco, C.; Weigel, D.; Nordborg, M. On the post-glacial spread of human commensal Arabidopsis thaliana. Nat. Commun. 2017, 8, 14458. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Jorgensen, T.H.; Emerson, B.C. Functional variation in a disease resistance gene in populations of Arabidopsis thaliana. Mol. Ecol. 2008, 17, 4912–4923. [Google Scholar] [CrossRef]
  33. Huard-Chauveau, C.; Perchepied, L.; Debieu, M.; Rivas, S.; Kroj, T.; Kars, I.; Bergelson, J.; Roux, F.; Roby, D. An atypical kinase under balancing selection confers broad-spectrum disease resistance in Arabidopsis. PLoS Genet. 2013, 9, e1003766. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Karasov, T.L.; Kniskern, J.M.; Gao, L.; DeYoung, B.J.; Ding, J.; Dubiella, U.; Lastra, R.O.; Nallu, S.; Roux, F.; Innes, R.W.; et al. The long-term maintenance of a resistance polymorphism through diffuse interactions. Nature 2014, 512, 436–440. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. The 1001 Genomes Consortium. 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell 2016, 166, 481–491. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Picó, F.X.; Méndez-Vigo, B.; Martinez-Zapater, J.M.; Alonso-Blanco, C. Natural genetic variation of Arabidopsis thaliana is geographically structured in the Iberian Peninsula. Genetics 2008, 180, 1009–1021. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Méndez-Vigo, B.; Picó, F.X.; Ramiro, M.; Martinez-Zapater, J.M.; Alonso-Blanco, C. Altitudinal and climatic adaptation is mediated by flowering traits and FRI, FLC, and PHYC genes in Arabidopsis. Plant Physiol. 2011, 157, 1942–1955. [Google Scholar] [CrossRef] [Green Version]
  38. Gomaa, N.H.; Montesinos-Navarro, A.; Alonso-Blanco, C.; Picó, F.X. Temporal variation in genetic diversity and effective population size of Mediterranean and subalpine Arabidopsis thaliana populations. Mol. Ecol. 2001, 20, 3540–3554. [Google Scholar] [CrossRef] [PubMed]
  39. Tabas-Madrid, D.; Méndez-Vigo, B.; Arteaga, N.; Marcer, A.; Pascual-Montano, A.; Weigel, D.; Picó, F.X.; Alonso-Blanco, C. Genome-wide signatures of flowering adaptation to climate temperature: Regional analyses in a highly diverse native range of Arabidopsis thaliana. Plant Cell Environ. 2018, 41, 1806–1820. [Google Scholar] [CrossRef] [PubMed]
  40. Manzano-Piedras, E.; Marcer, A.; Alonso-Blanco, C.; Picó, F.X. Deciphering the adjustment between environment and life history in annuals: Lessons from a geographically-explicit approach in Arabidopsis thaliana. PLoS ONE 2014, 9, e87836. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  41. Vidigal, D.S.; Marques, A.C.; Willems, L.A.; Buijs, G.; Méndez-Vigo, B.; Hilhorst, H.W.; Bentsink, L.; Picó, F.X.; Alonso-Blanco, C. Altitudinal and climatic associations of seed dormancy and flowering traits evidence adaptation of annual life cycle timing in Arabidopsis thaliana. Plant Cell Environ. 2016, 39, 1737–1748. [Google Scholar] [CrossRef] [Green Version]
  42. Rizzo, T.M.; Palukaitis, P. Construction of full-length cDNA clones of cucumber mosaic virus RNAs 1, 2 and 3: Generation of infectious RNA transcripts. Mol. Gen. Genet. 1990, 222, 249–256. [Google Scholar] [CrossRef]
  43. Boyes, D.C.; Zayed, A.M.; Ascenzi, R.; McCaskill, A.J.; Hoffman, N.E.; Davis, K.R.; Görlach, J. Growth stage–based phenotypic analysis of Arabidopsis: A model for high throughput functional genomics in plants. Plant Cell 2001, 13, 1499–1510. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Lynch, M.; Walsh, B. Genetics and Analysis of Quantitative Traits; Sinauer Associates Inc.: Sunderland, MA, USA, 1998. [Google Scholar]
  45. Bates, D.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models using lme4. J. Stat. Soft. 2015, 67, 1–48. [Google Scholar] [CrossRef]
  46. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018. [Google Scholar]
  47. Tang, Y.; Liu, X.; Wang, J.; Li, M.; Wang, Q.; Tian, F.; Su, Z.; Pan, Y.; Liu, D.; Lipka, A.E.; et al. GAPIT Version 2: An enhanced integrated tool for genomic association and prediction. Plant Genome 2016, 9, 2. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Liu, X.; Huang, M.; Fan, B.; Buckler, E.S.; Zhang, Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016, 12, e1005767. [Google Scholar] [CrossRef]
  49. Yin, L.; Zhang, H.; Tang, Z.; Xu, J.; Yin, D.; Zhang, Z.; Yuan, X.; Zhu, M.; Zhao, Z.; Li, X.; et al. rMVP: Memory-Efficient, Visualize-Enhanced, Parallel-Accelerated GWAS Tool. R Package Version 1.0.4. 2020. Available online: https://CRAN.R-project.org/package=rMVP (accessed on 26 February 2021).
  50. Van Rooijen, R.; Aarts, M.G.M.; Harbinson, J. Natural genetic variation for acclimation of photosynthetic light use efficiency to growth irradiance in Arabidopsis. Plant Physiol. 2015, 167, 1412–1484. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  51. Thoen, M.P.M.; Olivas, N.H.D.; Kloth, K.J.; Coolen, S.; Huang, P.-P.; Aarts, M.G.M.; Bac-Molenaar, J.A.; Bakker, J.; Bouwmeester, H.J.; Broekgaarden, C.; et al. Genetic architecture of plant stress resistance: Multi-trait genome-wide association mapping. New Phytol. 2017, 213, 1346–1362. [Google Scholar] [CrossRef] [Green Version]
  52. Rubio, B.; Cosson, P.; Caballero, M.; Revers, F.; Bergelson, J.; Roux, F.; Schurdi-Levraud, V. Genome-wide association study reveals new loci involved in Arabidopsis thaliana and Turnip mosaic virus (TuMV) interactions in the field. New Phytol. 2019, 221, 2026–2038. [Google Scholar] [CrossRef] [PubMed]
  53. Cingolani, P.; Platts, A.; Wangle, L.; Coon, M.; Nguyen, T.; Wang, L.; Land, S.J.; Lu, X.; Ruden, D.M. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly 2012, 6, 80–92. [Google Scholar] [CrossRef] [Green Version]
  54. Berardini, T.Z.; Reiser, L.; Li, D.; Mezheritsky, Y.; Muller, R.; Strait, E.; Huala, E. The arabidopsis information resource: Making and mining the “gold standard” annotated reference plant genome. Genesis 2015, 53, 474–485. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Mi, H.; Ebert, D.; Muruganujan, A.; Mills, C.; Albou, L.-P.; Mushayamaha, T.; Thomas, P.D. PANTHER version 16: A revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucl. Acids Res. 2021, 49, D394–D403. [Google Scholar] [CrossRef]
  56. Supek, F.; Bosnjak, M.; Skunca, N.; Smuc, T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE 2011, 6, e21800. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Zhou, X.; Carbonetto, P.; Stephens, M. Polygenic modeling with Bayesian Sparse Linear Mixed Models. PLoS Genet. 2013, 9, e1003264. [Google Scholar] [CrossRef] [Green Version]
  58. Bradbury, P.J.; Zhang, Z.; Kroon, D.E.; Casstevens, T.M.; Ramdoss, Y.; Buckler, E.S. TASSEL: Software for association mapping of complex traits in diverse samples. Bioinformatics 2007, 23, 2633–2635. [Google Scholar] [CrossRef] [PubMed]
  59. Liaw, A.; Wiener, M. Classification and Regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
  60. Kuhn, M. Caret: Classification and Regression Training. R Package Version 6.0-86. 2020. Available online: https://CRAN.R-project.org/package=caret (accessed on 26 February 2021).
  61. Janitza, S.; Celik, E.; Boulesteix, A. A computationally fast variable importance test for random forests for high-dimensional data. Adv. Data Anal. Classif. 2018, 12, 885–915. [Google Scholar] [CrossRef]
  62. Szymczak, S.; Holzinger, E.; Dasgupta, A.; Malley, J.D.; Molloy, A.M.; Mills, J.L.; Brody, L.C.; Stambolian, D.; Bailey-Wilson, J.E. r2VIM: A new variable selection method for random forests in genome-wide association studies. BioData Min. 2016, 9, 1–15. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Bonin, C.P.; Freshour, G.; Hahn, M.G.; Vanzin, G.F.; Reiter, W.-D. The GMD1 and GMD2 genes of Arabidopsis encode isoforms of GDP-D-mannose 4,6-dehydratase with cell type-specific expression patterns. Plant Physiol. 2003, 132, 883–892. [Google Scholar] [CrossRef] [Green Version]
  64. Sun, L.; van Nocker, S. Analysis of promoter activity of members of the PECTATE LYASE-LIKE (PLL) gene family in cell separation in Arabidopsis. BMC Plant Biol. 2010, 10, 152. [Google Scholar] [CrossRef] [Green Version]
  65. Ciftci-Yilmaza, S.; Mittler, R. The zinc finger network of plants. Cell Mol. Life Sci. 2008, 65, 1150–1160. [Google Scholar] [CrossRef]
  66. Waters, E.R.; Vierling, E. Plant small heat shock proteins—Evolutionary and functional diversity. New Phytol. 2020, 227, 24–37. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Yin, M. Genetic Dissection of Nitric Oxide Signalling Network in Plant Defence Response. Ph.D. Thesis, The University of Edinburgh, Edinburgh, UK, 2014. Available online: http://hdl.handle.net/1842/10462 (accessed on 26 February 2021).
  68. Li, J.; Xiang, C.-Y.; Yang, J.; Chen, P.-P.; Zhang, H.-M. Interaction of HSP20 with a viral RdRp changes its sub-cellular localization and distribution pattern in plants. Sci. Rep. 2015, 5, 14016. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  69. Knoth, C.; Eulgem, T. The oomycete response gene LURP1 is required for defense against Hyaloperonospora parasitica in Arabidopsis thaliana. Plant J. 2008, 55, 53–64. [Google Scholar] [CrossRef] [PubMed]
  70. Davin, N.; Edger, P.P.; Hefer, C.A.; Mizrachi, E.; Schuetz, M.; Smets, E.; Myburg, A.A.; Douglas, C.J.; Schranz, M.E.; Lens, F. Functional network analysis of genes differentially expressed during xylogenesis in soc1ful woody Arabidopsis plants. Plant J. 2016, 86, 376–390. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  71. Kraft, E.; Bostick, M.; Jacobsen, S.E.; Callis, J. ORTH/VIM proteins that regulate DNA methylation are functional ubiquitin E3 ligases. Plant J. 2008, 56, 704–715. [Google Scholar] [CrossRef] [Green Version]
  72. Wang, Y.; Sun, T.; Li, T.; Wang, M.; Yang, G.; He, G. A CBL-interacting protein kinase TaCIPK2 confers drought tolerance in transgenic tobacco plants through regulating the stomatal movement. PLoS ONE 2016, 11, e0167962. [Google Scholar] [CrossRef] [PubMed]
  73. Taylor-Teeples, M.; Lin, L.; de Lucas, M.; Turco, G.; Toal, T.W.; Gaudinier, A.; Young, N.F.; Trabucco, G.M.; Veling, M.T.; Lamothe, R.; et al. An Arabidopsis gene regulatory network for secondary cell wall synthesis. Nature 2015, 517, 571–575. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  74. Bartoli, C.; Roux, F. Genome-Wide Association Studies in plant pathosystems: Toward an ecological genomics approach. Front. Plant Sci. 2017, 8, 763. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  75. Madden, L.V.; Jeger, M.J.; van den Bosch, F. A theoretical assessment of the effects of vector-virus transmission mechanism on plant virus disease epidemics. Phytopathology 2000, 90, 576–594. [Google Scholar] [CrossRef] [Green Version]
  76. Hily, J.M.; García, A.; Moreno, A.; Plaza, M.; Wilkinson, M.D.; Fereres, A.; Fraile, A.; García-Arenal, F. The relationship between host lifespan and pathogen reservoir potential: An analysis in the system Arabidopsis thaliana-Cucumber mosaic virus. PLoS Pathog. 2014, 10, e1004492. [Google Scholar] [CrossRef]
  77. Pagán, I. Movement between plants: Vertical transmission. In Cucumber Mosaic Virus; Palukaitis, P., García-Arenal, F., Eds.; APS Press: Washington, DC, USA, 2019; pp. 185–198. [Google Scholar]
  78. Yin, M.; Wang, Y.; Zhang, L.; Li, J.; Quan, W.; Yang, L.; Wang, Q.; Chan, Z. The Arabidopsis Cys2/His2 zinc finger transcription factor ZAT18 is a positive regulator of plant tolerance to drought stress. J. Exp. Bot. 2017, 68, 2991–3005. [Google Scholar] [CrossRef] [PubMed]
  79. Sewelam, N.; Kazan, K.; Hüdig, M.; Maurino, V.G.; Schenk, P.M. The AtHSP17.4C1 gene expression is mediated by diverse signals that link biotic and abiotic stress factors with ROS and can be a useful molecular marker for oxidative stress. Int. J. Mol. Sci. 2019, 20, 3201. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  80. Haq, S.; Khan, A.; Ali, M.; Khattak, A.M.; Gai, W.-X.; Zhang, H.-X.; Wei, A.-M.; Gong, Z.-H. Heat shock proteins: Dynamic biomolecules to counter plant biotic and abiotic stresses. Int. J. Mol. Sci. 2019, 20, 5321. [Google Scholar] [CrossRef] [Green Version]
  81. Senthil, G.; Liu, H.; Puram, V.G.; Clark, A.; Stromberg, A.; Goodin, M.M. Specific and common changes in Nicotiana benthamiana gene expression in response to infection by enveloped viruses. J. Gen. Virol. 2005, 86, 2615–2625. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  82. Mitsuya, Y.; Takahashi, Y.; Berberich, T.; Miyazaki, A.; Matsumura, H.; Takahashi, H.; Terauchi, R.; Kusano, T. Spermine signaling plays a significant role in the defense response of Arabidopsis thaliana to cucumber mosaic virus. J. Plant Physiol. 2009, 166, 626–643. [Google Scholar] [CrossRef] [PubMed]
  83. Takahashi, H.; Miller, J.; Nozaki, Y.; Takeda, M.; Shah, J.; Hase, S.; Ikegami, M.; Ehara, Y.; Dinesh-Kumar, S.P.; Sukamto. RCY1, an Arabidopsis thaliana RPP8/HRT family resistance gene, conferring resistance to cucumber mosaic virus requires salicylic acid, ethylene and a novel signal transduction mechanism. Plant J. 2002, 32, 655–667. [Google Scholar] [CrossRef] [PubMed]
  84. Herlihy, J.; Ludwig, N.R.; van den Ackerveken, G.; McDowell, J.M. Oomycetes used in Arabidopsis research. In The Arabidopsis Book; BioOne: Washington, DC, USA, 2019; pp. 1–26. [Google Scholar]
  85. Bellincampi, D.; Cervone, F.; Lionetti, V. Plant cell wall dynamics and wall-related susceptibility in plant–pathogen interactions. Front. Plant Sci. 2014, 5, 228. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  86. Bethke, G.; Thao, A.; Xiong, G.; Li, B.; Soltis, N.E.; Hatsugai, N.; Hillmer, R.A.; Katagiri, F.; Kliebenstein, D.J.; Pauly, M.; et al. Pectin biosynthesis is critical for cell wall integrity and immunity in Arabidopsis thaliana. Plant Cell 2016, 28, 537–556. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  87. Chen, M.-H.; Sheng, J.; Hind, G.; Handa, A.K.; Citovsky, V. Interaction between the tobacco mosaic virus movement protein and host cell pectin methylesterases is required for viral cell-to-cell movement. EMBO J. 2000, 5, 913–920. [Google Scholar] [CrossRef] [PubMed]
  88. Lionetti, V.; Raiola, A.; Cervone, F.; Bellincampi, D. Transgenic expression of pectin methylesterase inhibitors limits tobamovirus spread in tobacco and Arabidopsis. Mol. Plant. Pathol. 2014, 15, 265–274. [Google Scholar] [CrossRef]
  89. Otulak-Kozieł, K.; Kozieł, E.; Lockhart, B.E.L. Plant cell wall dynamics incompatible and incompatible potato response to infection caused by Potato Virus Y (PVYNTN). Int. J. Mol. Sci. 2018, 19, 862. [Google Scholar] [CrossRef] [Green Version]
  90. Vlad, F.; Spano, T.; Vlad, D.; Bou Daher, F.; Ouelhadj, A.; Kalaitzis, P. Arabidopsis prolyl 4-hydroxylases are differentially expressed in response to hypoxia, anoxia and mechanical wounding. Physiol. Plant. 2007, 130, 471–483. [Google Scholar] [CrossRef]
  91. Hily, J.M.; Poulicard, N.; Mora, M.A.; Pagán, I.; García-Arenal, F. Environment and host genotype determine the outcome of a plant-virus interaction: From antagonism to mutualism. New Phytol. 2016, 209, 812–822. [Google Scholar] [CrossRef] [PubMed]
  92. Liu, S.; Yu, Y.; Ruan, Y.; Meyer, D.; Wolff, M.; Xu, L.; Wang, N.; Steinmetz, A.; Shen, W.-H. Plant SET- and RING-associated domain proteins in heterochromatinization. Plant J. 2007, 52, 914–926. [Google Scholar] [CrossRef] [PubMed]
  93. Baulcome, D.C.; Dean, C. Epigenetic regulation in plant responses to the environment. Cold Spring Harb. Perspect. Biol. 2014, 6, a019471. [Google Scholar] [CrossRef] [PubMed]
  94. Cao, M.; Du, P.; Wang, X.; Yu, Y.Q.; Qiu, Y.H.; Li, W.; Gal-On, A.; Zhou, C.; Li, Y.; Ding, S.W. Virus infection triggers widespread silencing of host genes by a distinct class of endogenous siRNAs in Arabidopsis. Proc. Natl. Acad. Sci. USA 2014, 111, 14613. [Google Scholar] [CrossRef] [Green Version]
  95. Sardar, A.; Nandi, A.K.; Chattopadhyay, D. CBL-interacting protein kinase 6 negatively regulates immune response to Pseudomonas syringae in Arabidopsis. J. Exp. Bot. 2017, 68, 3573–3584. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  96. Klepikova, A.V.; Kasianov, A.S.; Gerasimov, E.S.; Logacheva, M.D.; Penin, A.A. A high-resolution map of the Arabidopsis thaliana developmental transcriptome based on RNA-seq profiling. Plant J. 2016, 88, 1058–1070. [Google Scholar] [CrossRef] [PubMed]
  97. Dekkers, B.J.; Pearce, S.; van Bolderen-Veldkamp, R.P.; Marshall, A.; Widera, P.; Gilbert, J.; Drost, H.G.; Bassel, G.W.; Müller, K.; King, J.R.; et al. Transcriptional dynamics of two seed compartments with opposing roles in Arabidopsis seed germination. Plant. Physiol. 2013, 163, 205–215. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  98. Jia, T.; Zhang, B.; You, C.; Zhang, Y.; Zeng, L.; Li, S.; Johnson, K.C.M.; Yu, B.; Li, X.; Chen, X. The Arabidopsis MOS4-Associated Complex promotes microRNA biogenesis and precursor messenger RNA splicing. Plant Cell 2017, 29, 2626–2643. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  99. Monaghan, J.; Xu, F.; Xu, S.; Zhang, Y.; Li, X. Two putative RNA-binding proteins function with unequal genetic redundancy in the MOS4-associated complex. Plant. Physiol. 2010, 154, 1783–1793. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Geographic distribution of Arabidopsis thaliana genotypes analyzed in this study. Circles indicate population locations. Colors indicate genetic groups as defined in [39] (group 1: purple, group 2: blue; group 3: red; group 4: green). Grey dots indicate genotypes eliminated from the analysis.
Figure 1. Geographic distribution of Arabidopsis thaliana genotypes analyzed in this study. Circles indicate population locations. Colors indicate genetic groups as defined in [39] (group 1: purple, group 2: blue; group 3: red; group 4: green). Grey dots indicate genotypes eliminated from the analysis.
Microorganisms 09 00692 g001
Figure 2. Frequency distributions of Cucumber mosaic virus (CMV) virulence in 154 Arabidopsis thaliana genotypes.
Figure 2. Frequency distributions of Cucumber mosaic virus (CMV) virulence in 154 Arabidopsis thaliana genotypes.
Microorganisms 09 00692 g002
Figure 3. Scatterplot of enriched gene ontology (GO) terms for Arabidopsis determinants of CMV virulence. Color scale indicates significance of the term enrichment (−log10(p). Dot diameter indicates fold enrichment.
Figure 3. Scatterplot of enriched gene ontology (GO) terms for Arabidopsis determinants of CMV virulence. Color scale indicates significance of the term enrichment (−log10(p). Dot diameter indicates fold enrichment.
Microorganisms 09 00692 g003
Figure 4. Manhattan plot illustrating the Arabidopsis thaliana genomic regions associated with CMV virulence. Dashed line indicates false discovery rate (FDR) threshold. Color scale represents single nucleotide polymorphism (SNP) density.
Figure 4. Manhattan plot illustrating the Arabidopsis thaliana genomic regions associated with CMV virulence. Dashed line indicates false discovery rate (FDR) threshold. Color scale represents single nucleotide polymorphism (SNP) density.
Microorganisms 09 00692 g004
Figure 5. Distribution of CMV virulence across Arabidopsis genotypes harboring major (blue) and minor (red) variants of the eight SNPs strongly associated with this trait. Each SNP is indicated by the chromosome (C) and the position in which it is located, and the corresponding gene is shown in grey.
Figure 5. Distribution of CMV virulence across Arabidopsis genotypes harboring major (blue) and minor (red) variants of the eight SNPs strongly associated with this trait. Each SNP is indicated by the chromosome (C) and the position in which it is located, and the corresponding gene is shown in grey.
Microorganisms 09 00692 g005
Figure 6. Distribution of CMV virulence and seed transmission rate across Arabidopsis genotypes harboring major (blue) and minor (red) variants of the two SNPs associated with both traits. Letters indicate the nucleotide at each SNP for the major and minor variants. Seed transmission rate is shown as ST/100 to fit in the same scale as virulence. Each SNP is indicated by the chromosome (C) and the position in which it is located, and the corresponding gene is shown in grey.
Figure 6. Distribution of CMV virulence and seed transmission rate across Arabidopsis genotypes harboring major (blue) and minor (red) variants of the two SNPs associated with both traits. Letters indicate the nucleotide at each SNP for the major and minor variants. Seed transmission rate is shown as ST/100 to fit in the same scale as virulence. Each SNP is indicated by the chromosome (C) and the position in which it is located, and the corresponding gene is shown in grey.
Microorganisms 09 00692 g006
Table 1. SNPs significantly associated with virulence according to FDR threshold.
Table 1. SNPs significantly associated with virulence according to FDR threshold.
SNP aProteinDescriptionFDREffect
C5_26478732GMD1Cell wall metabolism8.92 × 10−11−0.42
C3_10139431PLL18Cell wall metabolism6.53 × 10−7−0.40
C4_11603821HSP20-likeAbiotic stress response1.81 × 10−7−0.22
C3_16927277ZAT8Biotic and abiotic stress response3.96 × 10−6−0.30
C3_8500254RTFL13Flowering time2.05 × 10−5−0.23
C1_20023793AT1G53635Unknown9.81 × 10−4−0.23
C4_5465990ORTHLDNA methylation1.56 × 10−3−0.21
C2_6211877LURP1Biotic stress response2.51 × 10−3−0.22
a Each SNP is named by the chromosome (C) in which it is located and its position in the chromosome.
Table 2. SNPs significantly associated with CMV seed transmission rate according to Random Forest (RF). CIPK2: calcineurin B-like (CBL)-interacting protein kinase 2; JAL4: Jacalin-related lectin 4; MAC5C: MOS4-ASSOCIATED COMPLEX SUBUNIT 5C protein. MEG: Maternally expressed gene.
Table 2. SNPs significantly associated with CMV seed transmission rate according to Random Forest (RF). CIPK2: calcineurin B-like (CBL)-interacting protein kinase 2; JAL4: Jacalin-related lectin 4; MAC5C: MOS4-ASSOCIATED COMPLEX SUBUNIT 5C protein. MEG: Maternally expressed gene.
SNP aProteinDescriptionp-Value% Inc. MSE b
C5_2202898CIPK2Response to abiotic stress0.0128.11
C5_2198824MAC5CCell wall metabolism0.0122.77
C5_11777493AT5G31963Unknown2 × 10−160.83
C2_7038732MEGEmbryogenesis2 × 10−160.73
C3_15811438AT3G44030Unknown0.0410.72
C1_12253373JAL4Response to biotic stress0.0410.53
C1_16528990AT1G43750Unknown0.0330.49
a Each SNP is named by the chromosome (C) in which it is located and its position in the chromosome. b Per cent of increase in the mean square error (MSE).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Montes, N.; Cobos, A.; Gil-Valle, M.; Caro, E.; Pagán, I. Arabidopsis thaliana Genes Associated with Cucumber mosaic virus Virulence and Their Link to Virus Seed Transmission. Microorganisms 2021, 9, 692. https://doi.org/10.3390/microorganisms9040692

AMA Style

Montes N, Cobos A, Gil-Valle M, Caro E, Pagán I. Arabidopsis thaliana Genes Associated with Cucumber mosaic virus Virulence and Their Link to Virus Seed Transmission. Microorganisms. 2021; 9(4):692. https://doi.org/10.3390/microorganisms9040692

Chicago/Turabian Style

Montes, Nuria, Alberto Cobos, Miriam Gil-Valle, Elena Caro, and Israel Pagán. 2021. "Arabidopsis thaliana Genes Associated with Cucumber mosaic virus Virulence and Their Link to Virus Seed Transmission" Microorganisms 9, no. 4: 692. https://doi.org/10.3390/microorganisms9040692

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop