Next Article in Journal
Insight into the Microbiota of Orthopteran in Relation to Gut Compartmentalisation
Previous Article in Journal
Mechanisms of Impact of Alnus ferdinandi-coburgii Odor Substances on Host Location of Tomicus yunnanensis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Shotgun Metagenome Analysis of Two Schizaphis graminum Biotypes over Time With and Without Carried Cereal Yellow Dwarf Virus

by
Yan M. Crane
1,2,
Charles F. Crane
1,3,
Subhashree Subramanyam
1,2 and
Brandon J. Schemerhorn
1,2,*
1
Crop Production and Pest Control Research Unit, United States Department of Agriculture-Agricultural Research Service (USDA-ARS), West Lafayette, IN 47907, USA
2
Department of Entomology, Purdue University, West Lafayette, IN 47907, USA
3
Department of Botany and Plant Pathology, Purdue University, West Lafayette, IN 47907, USA
*
Author to whom correspondence should be addressed.
Insects 2025, 16(6), 554; https://doi.org/10.3390/insects16060554
Submission received: 17 April 2025 / Revised: 14 May 2025 / Accepted: 21 May 2025 / Published: 23 May 2025
(This article belongs to the Section Insect Behavior and Pathology)

Simple Summary

The greenbug aphid is one of the leading pests of wheat and sorghum in the United States. Messenger RNA was used to inventory the microorganisms in two forms of greenbugs, each with and without a wheat virus, over 20 days. Many of the most common bacteria were similar to those in human feces, while an important aphid-specific bacterium was not as dominant as usually seen in aphids. The overall microbial population declined by 50% from day 5 to day 20, led by a decline in the typical fecal organisms. The greenbug genotype and wheat virus affected the microorganisms less than the collection date. This study adds to basic knowledge about microbes in aphids and the methods used to census these microbes.

Abstract

The greenbug aphid (Schizaphis graminum (Rondani)) is a major pest of wheat and an important vector of wheat viruses. An RNA-seq study was conducted to investigate the microbial effects of two greenbug genotypes, the presence or absence of cereal yellow dwarf virus, and the condition of the wheat host over a 20-day time course of unrestricted greenbug feeding. Messenger RNA reads were mapped to ca. 47,000 bacterial, 1218 archaeal, 14,165 viral, 571 fungal, and 94 protozoan reference or representative genomes, plus greenbug itself and its wheat host. Taxon counts were analyzed with QIIME2 and DESeq2. Distinct early (days 1 through 10) and late (days 15 and 20) communities differed in the abundance of typical enteric genera (Shigella, Escherichia, Citrobacter), which declined in the late community, while the ratio of microbial to greenbug read counts declined 50% and diversity measures increased. The nearly universal aphid endosymbiont, Buchnera aphidicola, accounted for less than 25% of the read counts in both communities. There were 302 differentially expressed (populated) genera with respect to early and late dates, while 25 genera differed between the greenbug genotypes and nine differed between carrier and virus-free greenbugs. The late community was likely responding to starvation as the wheat host succumbed to aphid feeding. Our results add to basic knowledge about aphid microbiomes and offer an attractive alternative method to assess insect microbiomes.

1. Introduction

Phytophagous insects cohabit with a multitude of microorganisms (the microbiome) that potentially influence their nutrition [1], longevity, reproduction [2], behavior [3], and gene expression [4,5]. The insect can interact with its microbiome in various ways, including, among others, diet, immune and stress responses [6,7,8,9], specific signaling pathways [4], and anatomical adaptations [8], which in turn depend upon insect genetics. The insect also interacts with its host plant individually and as a population. Feeding withdraws nutrients from the plant, disrupts plant tissues, opens the way for plant pathogens to invade [10], and in some cases introduces pathogens [11,12,13], toxins [14], or effectors of hypersensitive response into the plant [15]. Thus, the quality and quantity of the insects’ diet can respond to elapsed time by way of increased insect population density, sustained withdrawal of nutrients, development of symptomatic disease, and ultimately death of the host plant.
Because of their small body size, RNA-seq experiments in insects often sample whole individuals or groups of individuals. In this case, the transcriptome includes transcripts from the microbiome and potentially the immediate environment. With deep sequencing of the target transcriptome, there are enough microbial reads to survey the composition and community structure of the microbiome. Since these transcripts come from throughout the genome, the Silva and UNITE databases of hypervariable rRNA regions are useless for assigning them to organisms, and new methods to count transcripts in whole genomes must be developed. Our previous RNA-seq study of gene expression in greenbug aphids (Schizaphis graminum (Rondani)) [16] generated 110 million microbial reads over a time course that included the decline of the host plant under feeding pressure. Here we describe how we counted these reads to whole genomes and how the microbiome responded to greenbug biotype and carried cereal yellow dwarf virus over the time course. We compare the resulting taxonomic abundances to the results of 16S rDNA amplicon sequencing in related aphid species. This appears to be the first report of a microbiome from the greenbug itself.
Greenbug remains a major pest of small grains, especially wheat and sorghum [17], although it is also a generalist pest of grasses [17]. Annual economic loss to greenbug probably exceeded USD 250,000,000 in 1999 [18]. Greenbug outbreaks have decreased since 2000 (https://entomology.k-state.edu/extension/crop-protection/sorghum/greenbug.html, accessed on 7 May 2025), and current estimates of yield loss specific to greenbug are unavailable. Greenbug saliva has been considered toxic to the plant [19], and greenbug is an important vector of yellow-dwarf viruses [20] that cause disease in grasses. The main vectored viruses are barley yellow dwarf virus (BYDV-PAV) and cereal yellow dwarf virus (CYDV-RPV and CYDV-RPS), which collectively were the eighth leading cause of wheat yield reduction in 29 US states and two Canadian provinces in 2024 (https://cropprotectionnetwork.org/publications/wheat-disease-loss-estimates-from-the-united-states-and-ontario-canada-2024, accessed on 7 May 2025), causing about 10% of the loss produced by the leading disease, stripe rust. Historically, these viruses have been considered as closely related luteoviruses, but as more fully sequenced genomes have become available, their perceived relationship has become more tenuous [21]. Currently, they are classified in separate genera (Polerovirus and Luteovirus) with different arrangements of open reading frames. CYDV is more closely related to other poleroviruses that do not infect grasses [22]. For example, CYDV strains formed a sister clade to potato leaf roll virus with 97% bootstrap support in a maximum likelihood tree of Polerovirus based on the translated amino acid sequence of open reading frame two [22].
Amplicon-based studies have dealt with microbiomes in related aphid species, but apparently not S. graminum itself; thus, there is a gap in basic knowledge to fill about this important wheat and sorghum pest. Related genera include Rhopalosiphum, Aphis, Protaphis, and more distantly Acyrtosiphon, Myzus, Sitobion, and Diuraphis [23]. Buchnera aphidicola, the obligate intracellular endosymbiont of almost all aphids, comprised 84.4% of total reads in S. avenae [24], at least 98.5% of total reads in six samples of R. padi [24], 100% of total reads in D. noxia [25], 53% to 94% in five samples of A. gossypii [26], 86.8% and 99.9% in two color morphs of A. gossypii [27], 70% to 98% in six collections of A. gossypii [28], and 95% to 98% in M. persicae [28]. The percentage of Buchnera among 16S amplicons seems not to have been recently reported in Acyrthosiphon, but the titer of secondary endosymbiont Hamiltonella defensa declines reversibly at low temperature [29]. Buchnera was about 100 times as abundant as Arsenophonus or Wolbachia in A. glycines as measured by quantitative PCR [30]. Fakhour et al. [25] also reported the secondary symbionts Regiella insecticola and Hamiltonella defensa in 77.8% of collections of S. avenae and free-living genera Erwinia, Pantoea, Pseudomonas, Acinetobacter, and Staphylococcus in 28.3% of collections of R. padi. Ma et al. [26] also found Serratia to account for 24% of reads from one sample of A. gossypii. Gallo-Franco et al. [28] found 17.5% of the reads of one collection of A. gossypii to come from Pseudomonas. In contrast to the preponderance of Buchnera in the preceding examples, He et al. [31] did not exclude rare taxa and found 1607 genera of bacteria and 37 archaeal taxa in M. persicae fed Chinese cabbage, eggplant, pepper, or tobacco, with the greatest diversity of genera on pepper. Buchnera comprised only 12.5% to 25.5% of the reads from the solanaceous-fed aphids and was outnumbered by Pseudomonas (62% and 69%) with feeding on eggplant or tobacco, and this excess of Pseudomonas was upheld with qPCR with genus-specific primers.
During data analysis, it was found that 20 of the 95 samples in our RNA-seq study also contained Rhopalosiphum padi virus (RhPV, not to be confused with CYDV-RPV). RhPV is pathogenic to aphids and reduces their lifespan and fecundity [32]. RhPV can be transmitted to and from host plants and is mobile in host plants [32] without causing visible plant disease. Thus, there was also an opportunity to investigate if RhPV independently affected the greenbug microbiome.
This study was designed to test the effects, if any, of greenbug genotype, CYDV carrier status, time, and incidentally RhPV, on the identity, relative abundance, and diversity metrics of microorganisms in or on the greenbugs. By following the greenbug population through the growth and decline of its host wheat plants, we explored the response of the microbiome to conditions not usually considered in microbiome studies. It also provides an indirect opportunity to compare shotgun with amplicon-based methods to analyze microbiomes, given that amplicon-based analyses have been performed in related aphid species.

2. Materials and Methods

2.1. Experimental Execution

Reads were reanalyzed from a previous RNA-seq study of greenbug gene expression in relation to biotype, harbored CYDV-RPV, and time post-infestation [16]. The experimental layout, which is diagrammed in Figure 1, was the same as in Crane et al. [16]. The populations of biotypes B and H descended parthenogenetically from a single individual of each biotype and therefore were genetically uniform. Half of the aphids of each biotype were allowed to acquire CYDV-RPV from infected wheat cv. “Newton” plants, while the other half remained CYDV-RPV-free. A sequence specific to CYDV-RPV was amplifiable only from infected wheat, whereas nine other wheat viruses were not detected [16]. Then the aphids were moved to separate sets of CYDV-RPV-free wheat cv. “Newton” plants that were placed in separate growth chambers, and their population was allowed to increase over 20 days with sampling at days 0, 1, 2, 3, 5, 10, 15, and 20. Three groups of aphids were sampled from separate wheat plants (biological replicates) per combination of biotype, viral carrier status, and timepoint. Each individual wheat plant was sampled only once. Over this time course, the wheat plants with viruliferous aphids developed yellow-dwarf symptoms, indicating successful infection. The CYDV-RPV-specific sequence could be amplified only from the viruliferous aphids [16]. Aphids were flash-frozen by dropping an infested leaf segment into liquid nitrogen, then collecting and counting the aphids. RNA was extracted from each group of aphids with the Qiagen RNeasy Mini kit (Germantown, MD, USA) and stored at −80 °C. Library preparation and Illumina sequencing were performed by Novogene USA (Sacramento, CA, USA). Sequence was obtained for 95 of 96 samples; the failed sample was one replicate of biotype B without CYDV at day 15. The resulting reads are available at the NCBI sequence read archive under project PRJNA981508.

2.2. Bioinformatic Processing

As part of the previous study of greenbug gene expression [16], demultiplexed reads from Novogene USA (Sacramento, CA, USA) were processed, as depicted in Figure 2. Read trimming, quality filtering, and removal of greenbug rRNA, were performed with HTStream ([33]; https://github.com/s4hts/HTStream, accessed on 7 May 2025; Bioinformatics Core, University of California, Davis, CA, USA) without the “superdeduper” step, which otherwise would have removed duplicate reads. This was justified because the library preparation did not involve PCR amplification, so duplicate reads were not expected. HTStream used NCBI Genbank aphid accessions AH003128.2, AB369153.1, AB369137.1, and S50426.1 as rRNA reference sequences. Reads with entropy below 0.7 were removed with BBduk ([34]; https://sourceforge.net/projects/bbmap/, accessed on 7 May 2025; U.S. Department of Energy Joint Genome Institute, Walnut Creek, CA, USA); such reads often included long microsatellites, polyA tails, or long tracts of single nucleotides indicative of low-quality sequencing. Empty or singleton reads left from previous processing were removed with fastq-pair [35]. From here onward, the analysis followed the general strategy of RNAseq, except that whole genomes took the place of gene sequences and taxa took the place of gene names. Reference or representative genomes of ca. 47,000 bacterial, 1218 archaeal, 14,165 viral, 571 fungal, and 94 protozoan taxa, plus greenbug (GCA_020882235.1) and wheat cv. “Chinese Spring” (GCF_018294505.1) were downloaded from NCBI Genomes, split into 63 subsets, and indexed with bwa index [36]. The slowness of bwa indexing even in a high-performing cluster environment necessitated this much splitting of the genomic database. Each read was mapped to each of 63 database chunks with bwa mem [36]. A perl script, maxscore1015.pl (https://zenodo.org/records/13750628; accessed on 11 September 2024), was written to identify the highest scoring hit for each read among the .sam outputs of bwa mem, and the read was credited to that genome. Another perl script, grouptaxa1211.pl (https://zenodo.org/records/13750628; accessed on 11 September 2024), was written to join an NCBI taxonomic identifier to each accession and group accession counts by taxon. Read counts that mapped to wheat and greenbug were ignored. Microbial read counts were structured exactly like gene read counts, and thus DESeq2 [37] was used to detect differential “expression” (population if all taxa were equally transcribed). Read counts at each taxonomic level from genus to phylum were also transformed to biom format by biom convert [38] and imported into a QIIME2 feature table for display and statistical analysis with functions “diversity core-metrics”, “diversity beta-group-significance” [39], “diversity adonis” [40], and “taxa barplot”.

3. Results

3.1. Overview

Over the entire study, 118,059,680 reads (1.233% of total) mapped to wheat, 9,350,093,175 (97.619%) to greenbug, and 110,008,016 (1.149%) to microbes. There were 16,474 to 10,239,120 microbial reads per sample. The ratio of total microbial counts to total greenbug counts peaked at day 5 and declined 50% by day 20 (Figure 3, where the consecutive timepoints are numbered 0 through 7). The microbial reads matched 3833 genera, 836 families, 367 orders, 153 classes, and 71 phyla. Bacteria accounted for 46 of the 50 most abundant genera. While the 30 most abundant genera accounted for 70 to 90% of the reads among samples, the counts indicated a diverse microbiome with representatives from all five kingdoms searched (bacteria, archaea, viruses, fungi, and protozoa). There were 380 to 1672 taxa encountered per sample; the 1672 came from the sample with 10,239,120 microbial reads, but the 380 came from a sample with the third lowest total read count. Over all the replicates, Pearson’s correlation coefficient was 0.735 for taxon count to read count. Rarefaction to 100,000 reads retained 89 of 95 samples, and the counts of observed genera did not reach saturation within that interval when samples were split by biotype, carrier status, presence of RhPV, or relative time (Figure 4). The rarefaction curves were nearly superimposed for splits by biotype, carrier status, and presence of RhPV; they were separated only for splits of samples by relative time. A plot of counts of genera versus total counts of reads (Figure 5) revealed wide variation with continued increase in the number of genera into the millions of reads.

3.2. Diversity

For all genera, including Buchnera, taxonomic diversity within samples was estimated with Pielou evenness and Shannon entropy after rarefaction to the lowest read count (16,470). The distribution of diversity estimates was displayed as boxplots (Figure 6A–J) with the Kruskal–Wallis p-value for pairwise comparisons involving early versus late time (days 0 through 10 versus days 15 and 20), biotypes, CYDV carrier status, RhPV infection, and RhPV infection with more than 100 RhPV counts in the sample (range 127 to 789,356). Only late versus early collection dates differed significantly in either index (Figure 6A,F). Both indices increased as overall microbial abundance decreased on days 15 and 20. The observed evenness values indicated low to moderate within-community diversity with a few abundant genera. The observed Shannon values indicated high to very high within-community diversity because of the large number of rare genera encountered. Increasing the rarefaction limit from 16,470 to 100,000 negligibly affected the evenness or Shannon values, which approached a limit with only 2000 to 4000 reads.
For all genera, including Buchnera, taxonomic diversity between samples was estimated with the Bray–Curtis dissimilarity after rarefaction to 16,470 reads. The first three principal components were plotted (Figure 7) with color-coding for collection dates, biotype, CYDV carrier status, and RhPV infection status. All four panels display the same plot, but only panel A (early versus late collection) shows separation of colors into disjunct groups with a single exception for one sample from day 5. This result also supports a fundamental difference between microbial communities on days 15 and 20 versus earlier samples. Increasing the rarefaction limit from 16,470 to 100,000 eliminated six spots but otherwise negligibly affected the principal components plot. Application of PERMANOVA in Adonis [39] to Bray–Curtis distances also showed that only early versus late time affected the communities significantly, although the interactions of biotype with time and viral carrier status with time were almost significant at p = 0.05 (Table 1).

3.3. Generic Composition

The 30 most abundant genera are listed in Figure 8. Supplemental Figures S1–S4 list the top 30 families, orders, classes, and phyla. In Figure 8, “other” is the sum of all other genera, and the size of each bar is proportional to the counts in each combination of biotype, CYDV-carrier status, and collection date. Aside from Buchnera, the most abundant genera are typical fecal bacteria that have been reported in vertebrates as well as other insects: Shigella, Desulfovibrio, Escherichia, Citrobacter, Enterobacter, and Acinetobacter. Pseudomonas and Ralstonia have also been found in mammalian feces. In contrast, Microcystis and Lamprocystis are photosynthetic, although the latter is anaerobic. A third photosynthetic alga, Nannochloropsis, was far less abundant. Figure 8 indicates two distinct communities that differ in the abundance of Shigella and Escherichia, which decreased on days 15 and 20. These two communities correspond to the disjunct Bray–Curtis clusters in Figure 7.
Taxa varied greatly in counts among samples, even replicates, reflecting seemingly random outbreaks of particular organisms. One measure of such outbreaks is the ratio of the maximum count in any one sample to the total count over all samples. This measure is given in Table 2 for 58 genera that were represented by at least 100,000 reads. The most unevenly encountered genera were the fungi Mucor and Letharia. For Mucor, 98.5% (117,316) of counts came from one replicate of greenbug biotype B harboring CYDV-RPV at 5 days, and no other sample exceeded 250 counts. For Letharia, 97.9% (753,389) of counts came from one replicate of greenbug biotype B without harbored CYDV-RPV at 0 days, and five other samples had between 1000 and 5000 counts. Bacterial genera Fibrisoma (95.9%), Variovorax (88.7%), Pantoea (80.9%), and Serratia (74.8%) also had more than 70% of their total counts from one sample. At the other extreme, one fungus (Gilbertella, 4.3%) and eight bacteria (Burkholderia (3.5%) through Shigella (5.3%) and Escherichia (5.0%)) had no more than 6% of their total counts in any one sample. Buchnera was intermediate in stability, with 21% of its total counts in its greatest sample. Count ratios of Buchnera to Schizaphis varied 35-fold from 0.0002 to 0.0069 among samples. In comparison, the count ratios of Shigella to Schizaphis varied 290-fold from 0.0000522 to 0.015.
Stability is the opposite of outbreaking, and a set of sufficiently stably abundant genera could constitute a core microbiome if it exists. One measure of stability is S = min(ni/Ni)/max(ni/Ni), where ni is the count of the taxon in sample i and Ni is the total read count of sample i. S lies in the interval [0, 1], with higher values denoting greater stability. Supplemental Table S1 lists values of S in descending order for 58 genera that have at least 100,000 summed counts over all samples. Even the most stable genera have a ten-fold variation among samples, and therefore, a core microbiome seems not to exist. However, 50 of these genera have non-zero counts in every sample, as reflected in S and in the presence across all columns in Figure 8.
Eleven genera of Enterobacteriaceae had at least 10,000 total counts. To examine the similarity of their response to that of Shigella, the fraction of each one’s counts to total microbial counts was calculated for each sample. Then each fraction was divided by the maximum value of that fraction for that taxon. The mean of such fractions of maximum fractions was then calculated over days 0 through 10 and for days 15 and 20. Table 3 gives the results. Shigella and Escherichia had the biggest drop, but Citrobacter, Enterobacter, Klebsiella, Raoultella, and Cedecea also dropped greatly on days 15 and 20. In contrast, Salmonella, Kluyvera, Blochmannia, and Leclercia increased on days 15 and 20. Of note, the four most abundant genera of Enterobacteriaceae all dropped on days 15 and 20.
Manzano-Marin et al. [41] mentioned eight genera of secondary bacterial symbionts from aphids at large. We detected all of these, although only Erwinia, Serratia, Arsenophonus, Sodalis, and Wolbachia had more than 10,000 counts. Fukatsuia, Hamiltonella, and Regiella had fewer than 20 counts over the entire study. Many species in Erwinia, Serratia, and Sodalis are free-living and not symbiotic.
Oddly, CYDV-RPV was not detected among reads, even though the wheat plants with viruliferous greenbugs developed yellow dwarf disease. The most abundant virus by counts was Rhopalosiphum padi virus, an aphid pathogen that was detected in 20 samples, of which 18 had CYDV-viruliferous greenbugs that accounted for 99.996% of the RhPV counts. The RhPV-positive samples were split 9:11 between biotypes B and H. The other most abundant insect virus was Diolcogaster facetosa bracovirus (ranked 797th out of all counts). Four bacteriophages that affect Enterobacteriaceae were also among the top ten viruses: Shigella phage SfIV (685th), Escherichia phage 500465-1 (831st), Enterobacteria phage DE3 (883rd), and Enterobacteria phage P7 (984th). Overall, there were counts for 301 phage accessions. Many appeared only as a few hundred counts in one sample.
Raw counts of genera were subjected to DESeq2 [37] to identify genera with significant response to time, biotype, or harbored CYDV-RPV. The time comparisons included each timepoint to day 0 and also comparisons between early and late (days 15 and 20) relative time. The results are summarized in Table 4 and appear in full in Supplemental Tables S2–S9. Even though biotype and CYDV did not affect communities overall, individual genera were up- or down-expressed (populated) with both. Some genera responded significantly in more than one comparison and therefore appear more than once in Table 4. In the comparison of timepoints to day 0, there were 299 significantly differentially populated genera, of which 182 were down-populated at late times and 117 were up-populated at later times. In the comparison of late to early relative timepoints (days 15 and 20 versus all previous days), there were 302 significantly differentially populated genera, of which 173 were down-populated and 129 were up-populated. Shigella, Escherichia, and Citrobacter were the most abundant down-populated genera on days 15 and 20, while Aquabacterium was a prominent up-populated genus. In the comparison of biotypes, there were 25 significantly differentially populated genera, of which 24 were down-populated in biotype H. The most abundant of these down-populated genera was Pseudomonas. In the comparison of CYDV-carrier with CYDV-free greenbugs, there were nine significantly differentially populated genera, of which six Wwere down-populated in CYDV-carriers. Microvirga was the most abundant down-populated genus, while Chryseobacterium was the most abundant up-populated genus.

4. Discussion

Our findings contrast with most rRNA amplicon-based studies in aphids, where Buchnera accounts for 70–99% of counts. In our study, Buchnera ranked after Shigella and Desulfovibrio and was just one of several dominant genera. A plausible reason for this discrepancy is the high ploidy of Buchnera bacteroids, which have 60 to 200 or more circular genomes per cell [42]. However, Buchnera genomes have lost numerous genes during the course of evolution [43], so there are fewer genes being transcribed. PCR amplification of 16S rDNA reflects the copy number of the rRNA genes, which have been retained. We found a high diversity of taxa, and sampling even millions of reads did not fully exhaust the roster of taxa, as evidenced by the rarefaction curves in Figure 4 and their endpoints in Figure 5. The greenbugs and their environment contained a large number of infinitesimally rare, possibly dispersing taxa. The high diversity that we found is most similar to the findings of He et al. [31] in Myzus persicae fed on Chinese cabbage, eggplant, tobacco, and pepper. There, Pseudomonas outnumbered Buchnera on two diets, and Ralstonia, Burkholderia, Acinetobacter, and Allobaculum were prominent. All of these genera except Allobaculum were among the 30 most abundant genera in greenbugs. However, the Myzus microbiome lacked the abundant Enterobacteriaceae (Shigella, Escherichia, Citrobacter, Klebsiella) found in greenbugs. Sharawi [44] cultured Escherichia coli and single species of Serratia, Bacillus, Micrococcus, and Staphylococcus from Myzus persicae, indicating the presence of these genera in Myzus. All of these bacterial genera were abundant, with more than 85,000 counts in greenbugs. The great preponderance of Buchnera in reported aphid microbiomes also implies that the aphid gut lumen is relatively free of bacteria, upholding the finding by Douglas [45] that the gut lumen of Myzus persicae was devoid of microorganisms when embedded, sectioned, and examined with light microscopy. However, unconsolidated material can be lost from deparaffinized sections during washing and staining, so confirmation with plastic thick (1 µm) sections (or scanning electron microscopy after freeze-fracturing) is warranted. The transit time of ingested sap through the gut is important in relation to the doubling time of the gut microorganisms. If the transit time is short, then microbial populations might be reduced, although this does not preclude high diversity.
The collection procedure was intended to prevent injury to the greenbugs prior to freezing, so as to avoid wound responses in their transcriptome. Thus, leaf segments bearing greenbugs were frozen, and then the greenbugs were removed. The leaf segment was frozen concomitantly and became brittle, so the greenbug samples were contaminated with wheat and its resident microbiome. This is evident from the percentages of wheat (1.233%) and microbial (1.149%) counts among total counts. There is no spatial information in the counts themselves to indicate which organisms were in the gut, on the exoskeleton, in excreted honeydew, or on/in the leaf. The microbiome that we found was a superset of greenbug and wheat foliar microbiomes and thus was more diverse than microbiomes reported from washed aphids. However, the preponderance of Enterobacteriaceae and Desulfovibrio indicates that our greenbug microbiome was mostly from the gut, since these enteric taxa are not abundant in foliar microbiomes. Cernava et al. [46] found Enterobacteriaceae to comprise a minor part of the internal leaf microbiome of Eruca sativa, where most read counts from genomic DNA mapped to Pseudomonadales, Burkholderiales, Actinobacteria, and Rhizobiales. However, enteric genera including Shigella, Escherichia, Salmonella, and Citrobacter were encountered when the 16S V3-V8 region was amplified with enterobacterial primers Entero-F234 and Entero-R1423. Buchnera and Candidatus Hamiltonella were also amplified, indicating unacknowledged entrainment of aphids with the sampled leaves. In wheat, Mao et al. [47] found prominent unclassified Alcaligenaceae along with Lactiplantibacillus, Lactobacillus, Streptococcus, and Pseudomonas. Pantoea was the only genus mentioned from the Enterobacterales.
Our study was designed to investigate the relative effects of greenbug genotype, acquired CYDV-RPV, and time since population establishment, on the diversity and identity of microbes in or on the greenbugs. Greenbug biotypes are defined by their ability to colonize wheat lines with different aphid resistance genes [48]; there is no assurance that different biotypes are genetically close to or distant from one another, and there is no assurance of genetic uniformity within a biotype collected from different locations. However, our study took advantage of the parthenogenetic reproduction of greenbugs to assure genetic uniformity within its chosen biotypes. One weakness in our study was that the presence of CYDV-RPV in the host plants was not confirmed by primer-specific PCR late in the time course. That presence had been confirmed in a previous generation of wheat before the viruliferous greenbugs were placed on healthy wheat [16], and the plants that received viruliferous greenbugs appeared to develop yellow-dwarf disease, but the absence of detected CYDV in the collected greenbugs is concerning. Possibly, the library preparation procedure selected against CYDV, which lacks a polyA tail [49], and allowed recovery of RhPV, which has a polyA tail [50]. The time course spanned the lifespan of wheat under heavy greenbug feeding. The host plants were dying by day 20, and the enteric-depleted community of days 15 and 20 likely resulted from greenbug starvation. This would be especially likely if the previously abundant Enterobacteriaceae had a higher nutritional requirement than the other genera that succeeded them, and it is consistent with the overall microbial population decline on days 15 and 20. However, the differential “expression” of 25 genera between biotypes and nine genera after CYDV acquisition implies that subtle effects might exist for both factors.
The presence of RhPV was unexpected, as was its strong correlation to CYDV acquisition (99.996% of RhPV counts were in the greenbugs that carried CYDV). According to the NCBI taxonomy browser, CYDV and RhPV belong to different orders within the Pisoniviricetes, but their genomes have no homology by blastn alignment at an e-value of 1 × 10−8. Instead, three hypotheses are plausible: (1) that RhPV was present among the founders of both CYDV-acquiring subpopulations, perhaps having been transmitted via the plant from the aphids that started the CYDV infection in the wheat, since RhPV is mobile within wheat plants [32]; (2) that a diet of CYDV-infected wheat weakens innate immunity to RhPV; and (3) that acquisition of CYDV in itself makes greenbugs more susceptible to RhPV, perhaps by suppressing innate immunity to other viruses. This correlation merits experimental investigation.
Bioinformatic methodology might also have increased the apparent diversity in our study. Some studies explicitly eliminated rare taxa from the analysis of diversity. Most microbiome studies are based on amplification of hypervariable regions in 16S-23S rRNA genes or ITS sequences in 18S rRNA, with merging of overlapping paired-end reads, removal of PCR chimeras, and clustering or denoising to consensus or most abundant similar sequences over some narrow range (e.g., 3%; see [26], and the documentation of QIIME2) of sequence variability. That strategy combats amplification and sequencing error, but it also implicitly assumes that closely related taxa are not present in the community and that amplification is equally effective for all variants. Extensive databases (Silva, UNITE, GreenGenes) exist for such amplicons, many of which come from uncultured organisms. Having mRNA in hand, with no expectation of chimeric reads in the absence of PCR amplification, we chose instead to align each forward or reverse read to all publicly available whole genomes of viruses, bacteria, archaea, fungi, and protists, letting competition among closest hits determine the taxonomic assignment of each read. It is therefore possible that reads derived from conserved genes mapped randomly to multiple taxa, with ties broken in favor of the first of the identical hits. However, genome assemblies are expensive compared to amplicon sequencing and, therefore, might be biased to medically or agriculturally important organisms. The open question is how frequently erroneous reads more closely map to some related organism, which would inflate the number of hit taxa. Contamination from the laboratory or extraction kits can lead to amplification from common environmental bacteria, such as Ralstonia, Afipia, Pseudomonas, Acinetobacter, and Burkholderia, in blank samples subjected to ca. 30 cycles of PCR [51], but such contamination is swamped out with more than 107 cells in a sample [51] and is not expected to be discernible in unamplified RNA-seq samples. Therefore, “kitome” contamination probably did not affect the apparent presence of genera in our study.
The presence of more than 10,000 counts each of Erwinia, Serratia, Arsenophonus, Sodalis, and Wolbachia suggests that one or more of these genera might be a secondary symbiont in greenbugs, since these genera contain secondary symbiotic taxa in at least six lineages of aphids [41]. However, Schizaphis is not in any of these lineages, and only six of the Serratia reads came from Serratia symbiotica. In contrast, Ma et al. [26] also detected Serratia, Arsenophonus, and Wolbachia in Aphis gossypii, which, like greenbugs, is in the tribe Aphidini. Proving endosymbiosis would require demonstrating the presence of the bacterial genome within bacteroids in bacteriocytes. Even if an intracellular relationship does not exist, these and other bacteria might supply B-vitamins from the gut lumen. Another hemolymph-resident bacterium, Spiroplasma [52], was scarcely detected with a total of 47 counts. Therefore, Spiroplasma is unlikely to be a greenbug symbiont.
RNA is an imperfect proxy for cell counts. Transcription rates and mRNA stability surely vary among genes and taxa, and RNA counts per cell depend on cell volume, which is greater in eukaryotes than in bacteria. There is also a contrast between multicellular, filamentous fungi and unicellular bacteria and archaea. Cell or organism counts based on amplicon reads can be skewed by ploidy variation, as mentioned above. Sequence complexity (entropy) can also affect the range of matching sequences in reference genomes. We had to filter out low-entropy reads (long microsatellites, low-quality runs of single nucleotides) with bbduk because such reads were leading non-specifically to implausible hits (e.g., corals) for greenbugs in a growth chamber. Nevertheless, one of our top 30-ranking genera, Letharia, was implausibly present even after entropy filtration. Letharia is a lichenized ascomycete that grows on tree branches in the Pacific Northwest of the USA [53]. It is possibly a contaminant that was somehow introduced after sample collection.
In summary, taxon counts obtained from microbial mRNA in greenbugs differed markedly from counts obtained with 16S rDNA amplicons from related aphid species, indicating a much more diverse microbiome with reduced dominance of Buchnera aphidicola. It is not obvious why this should be so, although the high ploidy of Buchnera probably has a role. It will be difficult to understand the biological significance of this difference until results are obtained with both methods from the same aphid samples. The mRNA method revealed a population collapse of enteric bacteria as the host plants succumbed to aphid feeding. This collapse was more evident because the ratio of microbial to greenbug read counts was available, which would not have been the case with amplicon sequencing.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/insects16060554/s1, Figure S1. Barplotted relative frequencies of counts of the top 30 microbial families plus “Other” as the sum of all other families. Samples are encoded B or H for greenbug biotype, V if viruliferous, and 0 to 20 for days after infestation. Late depletion of Enterobacteriaceae is still evident. Figure S2. Barplotted relative frequencies of counts of the top 30 microbial orders plus “Other” as the sum of all other orders. Samples are encoded B or H for greenbug biotype, V if viruliferous, and 0 to 20 for days after infestation. The Shigella-depleted community is less distinct as more genera are grouped under Enterobacteriales. Figure S3. Barplotted relative frequencies of counts of the top 30 microbial classes plus “Other” as the sum of all other classes. Samples are encoded B or H for greenbug biotype, V if viruliferous, and 0 to 20 for days after infestation. The Shigella-depleted community is even less distinct. Various fungal classes appear when orders are grouped into bacterial classes. Figure S4. Barplotted relative frequencies of counts of the top 30 microbial phyla plus “Other” as the sum of all other phyla. Samples are encoded B or H for greenbug biotype, V if viruliferous, and 0 to 20 for days after infestation. There is a nearly continuous downward gradation in frequency of Pseudomonadota at later timepoints. Table S1. Stability and total counts of 58 abundant genera ranked by stability index S. Table S2. DESeq2 results for comparison by time, arranged by BH-adjusted p-value. Table S3. DESeq2 results for comparison by time, arranged by log2 fold change. Table S4. DESeq2 results for comparison by biotype, arranged by BH-adjusted p-value. Table S5. DESeq2 results for comparison by biotype, arranged by log2 fold change. Table S6. DESeq2 results for comparison by viral carrier status, arranged by BH-adjusted p-value. Table S7. DESeq2 results for comparison by viral carrier status, arranged by log2 fold change. Table S8. DESeq2 results for comparison by early versus late relative time, arranged by BH-adjusted p-value. Table S9. DESeq2 results for comparison of early versus late relative time, arranged by log2 fold change.

Author Contributions

Conceptualization, B.J.S. and Y.M.C.; methodology, Y.M.C. and C.F.C.; software, C.F.C.; formal analysis, C.F.C.; investigation, Y.M.C.; resources, Y.M.C.; data curation, C.F.C. and Y.M.C.; writing—original draft preparation, C.F.C. and Y.M.C.; writing—review and editing, Y.M.C., C.F.C., S.S. and B.J.S.; visualization, C.F.C. and Y.M.C.; supervision, B.J.S.; project administration, B.J.S.; funding acquisition, B.J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by USDA-ARS Research Project 5020-21000-000-D.

Data Availability Statement

The Illumina reads reside in NCBI SRA under bioproject PRJNA981508 (www.ncbi.nlm.nih.gov/bioproject/PRJNA981508, URL first accessed 8 June 2023). There are 94 runs from SRR25033616 through SRR25033709. Read counts and scripts have been deposited at https://doi.org/10.5061/dryad.z08kprrp1.

Acknowledgments

We thank Jean Galbraith and Steve Scofield (both of the USDA-ARS) for providing the aphids carrying CYDV-RPV and advice about virus inoculation. We are grateful to Sue Cambron and Lucy Springmeyer (both of the USDA-ARS) for their assistance with wheat cultivation, greenbug rearing, and RNA sampling.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper. Mention of a commercial or proprietary product does not constitute an endorsement by the US Federal Government.

References

  1. McMullen, J.G., II; Peters-Schulze, G.; Cai, J.; Patterson, A.D.; Douglas, A.E. How gut microbiome interactions affect nutritional traits of Drosophila melanogaster. J. Exp. Biol. 2020, 223, jeb227843. [Google Scholar] [CrossRef] [PubMed]
  2. Voirol, L.R.P.; Frago, E.; Kaltenpoth, M.; Hilker, M.; Fatouros, N.E. Bacterial symbionts in lepidoptera: Their diversity, transmission, and impact on the host. Front. Microbiol. 2018, 9, 556. [Google Scholar] [CrossRef]
  3. Singh, S.; Singh, A.; Baweja, V.; Roy, A.; Chakraborty, A.; Singh, I.K. Molecular rationale of insect-microbes symbiosis—From insect behaviour to mechanism. Microorganisms 2021, 9, 2422. [Google Scholar] [CrossRef] [PubMed]
  4. Bai, S.; Yao, Z.; Raza, M.F.; Cai, Z.; Zhang, H. Regulatory mechanisms of microbial homeostasis in insect gut. Insect Sci. 2021, 28, 286–301. [Google Scholar] [CrossRef]
  5. Negroni, M.A.; Segers, F.H.I.D.; Vogelweith, F.; Foitzik, S. Immune challenge reduces gut microbial diversity and triggers fertility-dependent gene expression changes in a social insect. BMC Genom. 2020, 21, 816. [Google Scholar] [CrossRef]
  6. Haider, K.; Sufian, M.; Abbas, D.; Kabir, K.; Ali, M.S.; Kausar, Y.; Ghafar, M.A. The role of gut microbiota in shaping immune responses in Tephritidae fruit fly and prospective implications for management. Neotrop. Entomol. 2025, 54, 34. [Google Scholar] [CrossRef]
  7. Korsa, A.; Lo, L.K.; Gandhi, S.; Bang, C.; Kurtz, J. Oral immune priming treatment alters microbiome composition in the red flour beetle Tribolium castaneum. Front. Microbiol. 2022, 13, 793143. [Google Scholar] [CrossRef]
  8. Mondal, S.; Somani, J.; Roy, S.; Babu, A.; Pandey, A.K. Insect microbial symbionts: Ecology, interactions, and biological significance. Microorganisms 2023, 11, 2665. [Google Scholar] [CrossRef]
  9. Pang, X.; Xiao, X.; Liu, Y.; Zhang, R.; Liu, J.; Liu, Q.; Wang, P.; Cheng, G. Mosquito C-type lectins maintain gut microbiome homeostasis. Nat. Microbiol. 2016, 1, 16023. [Google Scholar] [CrossRef]
  10. Gossner, M.M.; Beenken, L.; Arend, K.; Begerow, D.; Persoh, D. Insect herbivory facilitates the establishment of an invasive plant pathogen. ISME Commun. 2021, 1, 6. [Google Scholar] [CrossRef]
  11. Limayem, A.; Martin, E.M.; Shankar, S. Study on the citrus greening disease: Current challenges and novel therapies. Microb. Pathog. 2024, 192, 106688. [Google Scholar] [CrossRef] [PubMed]
  12. Nancarrow, N.; Aftab, M.; Hollaway, G.; Rodoni, B.; Trebicki, P. Yield losses caused by barley yellow dwarf virus-PAV infection in wheat and barley: A three-year field study in south-eastern Australia. Microorganisms 2021, 9, 645. [Google Scholar] [CrossRef] [PubMed]
  13. Sisterton, M.S.; Burbank, L.P.; Krugner, R.; Haviland, D.; Stenger, D.C. Xylella fastidiosa and glassy-winged sharpshooter population dynamics in the southern San Joaquin Valley of California. Plant Dis. 2020, 104, 2994–3001. [Google Scholar] [CrossRef] [PubMed]
  14. Nicholson, S.J.; Hartson, S.D.; Puterka, G.J. Proteomic analysis of secreted saliva from Russian wheat aphid (Diuraphis noxia Kurd.) biotypes that differ in virulence to wheat. J. Proteomics 2012, 75, 2252–2268. [Google Scholar] [CrossRef]
  15. Nicolis, V.F.; Burger, N.F.V.; Botha, A.-M. Whole-body transcriptome mining for candidate effectors from Diuraphis noxia. BMC Genom. 2022, 23, 493. [Google Scholar] [CrossRef]
  16. Crane, Y.M.; Crane, C.F.; Schemerhorn, B.J. Differential gene expression between viruliferous and non-viruliferous Schizaphis graminum (Rondani). PLoS ONE 2023, 8, 32. [Google Scholar] [CrossRef]
  17. Royer, T.A.; Pendleton, B.B.; Elliott, N.C.; Giles, K.L. Greenbug (Hemiptera: Aphididae) biology, ecology, and management in wheat and sorghum. J. Integr. Pest Manag. 2015, 6, 19. [Google Scholar] [CrossRef]
  18. Webster, J.A.; Kenkel, P. Benefits of managing small-grain pests with plant resistance. In Economic, Environmental, and Social Benefits of Resistance in Field Crops; Wiseman, B.R., Webster, J.A., Eds.; Entomological Society of America: Lanham, MD, USA, 1999; pp. 87–114. [Google Scholar]
  19. Pendleton, B.B.; Copeland, A.L.; Michels, G.J., Jr. Effect of biotype and temperature on fitness of greenbug (Hemiptera: Aphididae) on sorghum. J. Econ. Entomol. 2009, 102, 1624–1627. [Google Scholar] [CrossRef]
  20. Burrows, M.E.; Caillaud, M.C.; Smith, D.M.; Gray, S.M. Biometrical genetic analysis of luteovirus transmission in the aphid Schizaphis graminum. Heredity 2007, 98, 106–113. [Google Scholar] [CrossRef]
  21. Mayo, M.A.; D’Arcy, C.J. Family Luteoviridae: A reclassification of luteoviruses. In The Luteoviridae; Smith, H.G., Barker, H., Eds.; CABI Publishing: Wallingford, UK, 1999; pp. 15–22. [Google Scholar]
  22. Lotos, L.; Efthimiou, K.; Maliogka, V.I.; Katis, N.I. Generic detection of poleroviruses using an RT-PCR assay targeting the RdRp coding sequence. J. Virol. Methods 2014, 198, 1–11. [Google Scholar] [CrossRef]
  23. Jousselin, E.; d’Acier, A.C.; Clamens, A.-L.; Galan, M.; Cruaud, C.; Barbe, V.; Manzano-Marín, A. Discordance between mitochondrial, nuclear, and symbiont genomes in aphid phylogenetics: Who is telling the truth? Zool. J. Linn. Soc. 2024, 201, zlae098. [Google Scholar] [CrossRef]
  24. Zepeda-Paulo, F.; Ortiz-Martinez, S.; Silva, A.X.; Lavandero, B. Low bacterial community diversity in two introduced aphid pests revealed with 16S rRNA amplicon sequencing. Peer J. 2018, 6, e4725. [Google Scholar] [CrossRef] [PubMed]
  25. Fakhour, S.; Ambroise, J.; Renoz, F.; Foray, V.; Gala, J.-L.; Hance, T. A large-scale field study of bacterial communities in cereal aphid populations across Morocco. FEMS Microbiol. Ecol. 2018, 94, fiy003. [Google Scholar] [CrossRef] [PubMed]
  26. Ma, Y.-J.; He, H.-P.; Zhao, H.-M.; Xian, Y.-D.; Guo, H.; Liu, B.; Xue, K. Microbiome diversity of cotton aphids (Aphis gossypii) is associated with host alternation. Sci. Rep. 2021, 11, 5260. [Google Scholar] [CrossRef]
  27. Guo, S.-K.; Gong, Y.-J.; Chen, J.-C.; Shi, P.; Cao, L.-J.; Yang, Q.; Hoffmann, A.A.; Wei, S.-J. Increased density of endosymbiotic Buchnera related to pesticide resistance in yellow morph of melon aphid. J. Pest Sci. 2020, 93, 1281–1294. [Google Scholar] [CrossRef]
  28. Gallo-Franco, J.J.; Duque-Gamboa, D.N.; Toro-Perea, N. Bacterial communities of Aphis gossypii and Myzus persicae (Hemiptera: Aphididae) from pepper crops (Capsicum sp.). Sci. Rep. 2019, 9, 5766. [Google Scholar] [CrossRef]
  29. Smith, A.H.; O’connor, M.P.; Deal, B.; Kotzer, C.; Lee, A.; Wagner, B.; Joffe, J.; Woloszynek, S.; Oliver, K.M.; Russell, J.A. Does getting defensive get you anywhere?—Seasonal balancing selection, temperature, and parasitoids shape real-world, protective endosymbiont dynamics in the pea aphid. Mol. Ecol. 2021, 30, 2449–2472. [Google Scholar] [CrossRef]
  30. Enders, L.S.; Miller, N.J. Stress-induced changes in abundance differ among obligate and facultative endosymbionts of the soybean aphid. Ecol. Evol. 2016, 6, 818–829. [Google Scholar] [CrossRef]
  31. He, B.; Chen, X.; Yang, H.; Cernava, T. Microbiome structure of the aphid Myzus persicae (Sulzer) is shaped by different Solanaceae plant diets. Front. Microbiol. 2021, 12, 667257. [Google Scholar] [CrossRef]
  32. Ban, L.; Didon, A.; Jonsson, L.M.V.; Glinwood, R.; Delp, G. An improved detection method for the Rhopalosiphum padi virus (RhPV) allows monitoring of its presence in aphids and movement within plants. J. Virol. Methods 2007, 142, 136–142. [Google Scholar] [CrossRef]
  33. Putri, G.H.; Anders, S.; Pyl, P.T.; Pimanda, J.E.; Zanini, F. Analysing high-throughput sequencing data in Python with HTSeq 2.0. Bioinformatics 2022, 38, 2943–2945. [Google Scholar] [CrossRef] [PubMed]
  34. Bushnell, B. BBMap: A fast, accurate, splice-aware aligner. In Lawrence Berkeley Nat. Lab. LBNL Rep; LBL Publications: Berkeley, CA, USA, 2014; LBNL-7065E; Available online: https://escholarship.org/uc/item/1h3515gn (accessed on 15 January 2024).
  35. Edwards, J.A.; Edwards, R.A. fastq-pair: Efficient synchronization of paired-end fastq files. bioRxiv 2019, 552885. [Google Scholar] [CrossRef]
  36. Li, H.; Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009, 25, 1754–1760. [Google Scholar] [CrossRef] [PubMed]
  37. Love, M.I.; Huber, W.; Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef]
  38. McDonald, D.; Clemente, J.C.; Kuczynski, J.; Rideout, J.R.; Stombaugh, J.; Wendel, D.; Wilke, A.; Huse, S.; Hufnagle, J.; Meyer, F.; et al. The Biological Observation Matrix (BIOM) format or: How I learned to stop worrying and love the ome-ome. GigaScience 2012, 1, 7. [Google Scholar] [CrossRef]
  39. Anderson, M.J. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 2001, 26, 32–46. [Google Scholar] [CrossRef]
  40. Oksanen, J.; Simpson, G.L.; Blanchet, F.G.; Kindt, R.; Legendre, P.; Minchin, P.R.; O’hara, R.B.; Solymos, P.; Stevens, M.H.H.; Wagner, H.; et al. Vegan: Community Ecology Package. 2018. Available online: https://CRAN.R-project.org/package=vegan (accessed on 10 May 2025).
  41. Manzano-Marin, A.; Coeur d’Acier, A.; Clamens, A.L.; Cruaud, C.; Barbe, V.; Jousselin, E. Co-obligate symbioses have repeatedly evolved across aphids, but partner identity and nutritional contributions vary across lineages. Peer Community J. 2023, 3, e46. [Google Scholar] [CrossRef]
  42. Komaki, K.; Ishikawa, H. Genomic copy number of intracellular bacterial symbionts of aphids varies in response to developmental stage and morph of their host. Insect Biochem. Mol. Biol. 2000, 30, 253–258. [Google Scholar] [CrossRef]
  43. Silva, F.J.; Latorre, A.; Moya, A. Genome size reduction through multiple events of gene disintegration in Buchnera APS. Trends Genet. 2001, 17, 615–618. [Google Scholar] [CrossRef]
  44. Sharawi, S.E. Morphological and molecular identification of novel green peach aphids (Myzus persicae) and their microbiome diversity in Taif Governorate. Indian J. Anim. Res. 2023, 57, 1177–1185. [Google Scholar] [CrossRef]
  45. Douglas, A.E. On the source of sterols in the green peach aphid, Myzus persicae, reared on holidic diet. J. Insect Physiol. 1988, 34, 403–408. [Google Scholar] [CrossRef]
  46. Cernava, T.; Erlacher, A.; Soh, J.; Sensen, C.W.; Grube, M.; Berg, G. Enterobacteriaceae dominate the core microbiome and contribute to the resistome of arugula (Eruca sativa Mill.). Microbiome 2019, 7, 13. [Google Scholar] [CrossRef] [PubMed]
  47. Mao, L.; Mhaske, P.; Xiao, J.; Tang, L.; Qian, L.; Zheng, Y. Phyllosphere bacterial communities in wheat and faba bean intercropping: Structural and functional mechanisms in response to stripe rust infection. Phytobiomes J. 2025, 2471–2906. [Google Scholar] [CrossRef]
  48. Michels, G.J., Jr. Graminaceous North American host plants of the greenbug with notes on biotypes. SW Entomol. 1986, 11, 55–66. [Google Scholar]
  49. Olmedo-Velarde, A.; Wilson, J.R.; Stallone, M.; DeBlasio, S.L.; Chappie, J.S.; Heck, M. Potato leafroll virus molecular interactions with plants and aphids: Gaining a new tactical advantage on an old foe. Physiol. Mol. Plant Pathol. 2023, 125, 102015. [Google Scholar] [CrossRef]
  50. Valles, S.M.; Chen, Y.; Firth, A.E.; Guérin, D.A.M.; Hashimoto, Y.; Herrero, S.; de Miranda, J.R.; Ryabov, E. ICTV virus taxonomy profile: Dicistroviridae. J. Gen. Virol. 2017, 98, 355–356. [Google Scholar] [CrossRef]
  51. Salter, S.J.; Cox, M.J.; Turek, E.M.; Calus, S.T.; Cookson, W.O.; Moffatt, M.F.; Turner, P.; Parkhill, J.; Loman, N.J.; Walker, A.W. Reagent and laboratory contamination can critically impact sequence-based microbiome analyses. BMC Biol. 2014, 12, 87. [Google Scholar] [CrossRef]
  52. Blow, F.; Douglas, A.E. The hemolymph microbiome of insects. J. Insect Physiol. 2019, 115, 33–39. [Google Scholar] [CrossRef]
  53. Ament-Velásquez, S.L.; Tuovinen, V.; Bergström, L.; Spribille, T.; Vanderpool, D.; Nascimbene, J.; Yamamoto, Y.; Thor, G.; Johannesson, H. The plot thickens: Haploid and triploid-like thalli, hybridization, and biased mating type ratios in Letharia. Front. Fungal Biol. 2021, 2, 656386. [Google Scholar] [CrossRef]
Figure 1. Experimental layout. Two greenbug biotypes, each with and without acquired CYDV, were sampled over eight timepoints from infestation until 20 days later.
Figure 1. Experimental layout. Two greenbug biotypes, each with and without acquired CYDV, were sampled over eight timepoints from infestation until 20 days later.
Insects 16 00554 g001
Figure 2. Flowchart of the informatic processing after demultiplexing and initial quality control of reads.
Figure 2. Flowchart of the informatic processing after demultiplexing and initial quality control of reads.
Insects 16 00554 g002
Figure 3. Relationship of total-microbe:greenbug counts ratio to time. Timepoints are encoded as 0, start; 1, one day post infestation; 2, 2 days; 3, 3 days; 4, 5 days; 5, 10 days; 6, 15 days; 7, 20 days post infestation. Combinations of biotype and CYDV-carrier status are encoded as B and BV for biotype B without and with carried CYDV, and H and HV for biotype H without and with carried CYDV. Total microbial counts declined markedly on days 15 and 20, primarily from loss of Shigella and Escherichia counts. A fitted Loess curve appears in blue.
Figure 3. Relationship of total-microbe:greenbug counts ratio to time. Timepoints are encoded as 0, start; 1, one day post infestation; 2, 2 days; 3, 3 days; 4, 5 days; 5, 10 days; 6, 15 days; 7, 20 days post infestation. Combinations of biotype and CYDV-carrier status are encoded as B and BV for biotype B without and with carried CYDV, and H and HV for biotype H without and with carried CYDV. Total microbial counts declined markedly on days 15 and 20, primarily from loss of Shigella and Escherichia counts. A fitted Loess curve appears in blue.
Insects 16 00554 g003
Figure 4. Rarefaction boxplot of counts of genera versus subsample size for 89 samples with at least 100,000 microbial reads. The darker curve is for all samples taken from days 0 through 10. The lighter curve is for days 15 and 20.
Figure 4. Rarefaction boxplot of counts of genera versus subsample size for 89 samples with at least 100,000 microbial reads. The darker curve is for all samples taken from days 0 through 10. The lighter curve is for days 15 and 20.
Insects 16 00554 g004
Figure 5. Counts of detected genera versus total read counts per sample.
Figure 5. Counts of detected genera versus total read counts per sample.
Insects 16 00554 g005
Figure 6. Within-community diversity, as measured by Pielou evenness in panels (AE) and Shannon entropy in panels (FJ). Panels (A,F) compare collections at days 15 and 20 post-infestation to all earlier collections. Panels (B,G) compare biotypes B and H. Panels (C,H) compare CYDV-free and CYDV-carrier conditions. Panels (D,I) compare the presence with absence of RhPV counts, while panels (E,J) compare samples with fewer than 100 Rhopalosiphum padi virus counts to samples with more than 100.
Figure 6. Within-community diversity, as measured by Pielou evenness in panels (AE) and Shannon entropy in panels (FJ). Panels (A,F) compare collections at days 15 and 20 post-infestation to all earlier collections. Panels (B,G) compare biotypes B and H. Panels (C,H) compare CYDV-free and CYDV-carrier conditions. Panels (D,I) compare the presence with absence of RhPV counts, while panels (E,J) compare samples with fewer than 100 Rhopalosiphum padi virus counts to samples with more than 100.
Insects 16 00554 g006
Figure 7. Principal components plot of Bray–Curtis dissimilarities among all individual replicates. Collection dates are encoded in color in panel (A). Biotypes are coded in color in panel (B), while CYDV carrier status is coded in panel (C) and RhPV presence is coded in panel (D). The colors are thoroughly interspersed in panels (BD).
Figure 7. Principal components plot of Bray–Curtis dissimilarities among all individual replicates. Collection dates are encoded in color in panel (A). Biotypes are coded in color in panel (B), while CYDV carrier status is coded in panel (C) and RhPV presence is coded in panel (D). The colors are thoroughly interspersed in panels (BD).
Insects 16 00554 g007
Figure 8. Barplotted relative frequencies of the top 30 microbial genera as defined by total counts over all timepoints and treatments. Samples are encoded B or H for greenbug biotype, V if carrying CYDV-RPV, and 0 to 20 for days post-infestation. “Other” is the sum of counts of all other genera.
Figure 8. Barplotted relative frequencies of the top 30 microbial genera as defined by total counts over all timepoints and treatments. Samples are encoded B or H for greenbug biotype, V if carrying CYDV-RPV, and 0 to 20 for days post-infestation. “Other” is the sum of counts of all other genera.
Insects 16 00554 g008
Table 1. Significance of Bray–Curtis distances among communities in response to time, biotype, and harbored CYDV-RPV.
Table 1. Significance of Bray–Curtis distances among communities in response to time, biotype, and harbored CYDV-RPV.
Factor *r2p-Value
rel_time0.4590.001
biotype0.0070.567
viral_status0.0060.690
biotype:rel_time0.0140.067
viral_status:rel_time0.0140.068
biotype:viral_status0.0100.408
* The value of rel_time (relative time) is early (days 0 through 10) or late (days 15 and 20). Viral status is harboring CYDV-RPV or free of CYDV-RPV. Colons indicate interactions. The r2 is a correlation coefficient.
Table 2. Maximum fraction of counts in a single sample for genera with at least 100,000 reads.
Table 2. Maximum fraction of counts in a single sample for genera with at least 100,000 reads.
GeneraMaximum Fraction of Counts
Mucor0.9846
Letharia0.9785
Fibrisoma0.9589
Variovorax0.8873
Pantoea0.8093
Serratia0.7483
Rhopalosiphum padi virus0.5483
Vibrio0.4438
Malassezia0.3932
Legionella0.3622
Anaerococcus0.3426
Corynebacterium0.3395
Novosphingobium0.3354
Alkalispirochaeta0.3174
Micrococcus0.3162
Desulfovibrio0.2465
Erwinia0.2308
Kocuria0.2276
Nocardioides0.2204
Buchnera0.2060
Pelomonas0.1734
Arthrobacter0.1578
Microcystis0.1565
Streptococcus0.1545
Staphylococcus0.1502
Propionibacterium0.1388
Klebsiella0.1315
Caulobacter0.1308
Brevundimonas0.1246
Clostridium0.0996
Sphingomonas0.0869
Bradyrhizobium0.0799
Microbacterium0.0796
Pseudomonas0.0687
Acidovorax0.0679
Providencia0.0610
Betaproteobacterium FWI20.0608
Rheinheimera0.0602
Streptomyces0.0590
Flavobacterium0.0585
Actinomadura0.0559
Afipia0.0552
Stenotrophomonas0.0552
Marinobacterium0.0545
Enterobacter0.0532
Shigella0.0530
Aquabacterium0.0506
Lamprocystis0.0506
Ralstonia0.0504
Escherichia0.0497
Methylobacterium0.0480
Pusillimonas0.0465
Terrisporobacter0.0460
Acinetobacter0.0446
Citrobacter0.0445
Gilbertella0.0425
Bacillus0.0384
Burkholderia0.0350
Table 3. Response of Enterobacteriaceae to time.
Table 3. Response of Enterobacteriaceae to time.
GenusTotal CountMean Days 0–10Mean Days 15–20
Shigella36,995,7900.68590.0661
Escherichia10,750,1230.68250.0711
Citrobacter2,335,0590.49830.0670
Enterobacter1,621,4810.46260.0742
Klebsiella264,3310.55740.1801
Salmonella72,4420.26270.3421
Kluyvera41,5710.18270.3337
Raoultella12,3060.44370.0908
Blochmannia11,8250.03020.0770
Leclercia11,2690.09540.3715
Cedecea10,2690.26550.0082
Table 4. Genera that responded most strongly to varied factors a.
Table 4. Genera that responded most strongly to varied factors a.
TaxonVariableDirectionBase_MeanLog2_Fold_ChangeAdj_p-Value
Shigellatimedown210,548.6−4.578.59 × 10−136
Escherichiatimedown61,277.55−4.52.19 × 10−124
Citrobactertimedown14,367.69−4.193.17 × 10−74
Shigellarel_timedown210,548.6−4.61.31 × 10−146
Escherichiarel_timedown61,277.55−4.482.38 × 10−134
Citrobacterrel_timedown14,367.69−4.071.99 × 10−82
Lethariabiotypedown146.68−3.691.73 × 10−7
Aequitasiabiotypedown109.78−6.261.18 × 10−5
Salipaludibacillusbiotypedown74.46−2.711.86 × 10−5
Microvirgavirusdown228.96−2.465.66 × 10−6
Salipaludibacillusvirusdown74.46−2.614.20 × 10−5
Lethariavirusdown101.13−2.576.54 × 10−4
AquabacteriumtimeUp1382.082.143.32 × 10−68
HerbaspirillumtimeUp591.392.051.74 × 10−55
DelftiatimeUp511.921.546.12 × 10−51
Aquabacteriumrel_timeUp1382.082.39.23 × 10−73
Herbaspirillumrel_timeUp591.391.851.20 × 10−57
Delftiarel_timeUp511.921.759.20 × 10−55
MicrovirgabiotypeUp140.131.572.03 × 10−3
MassiliabiotypeUp397.050.912.09 × 10−2
PorphyromonasbiotypeUp55.791.093.33 × 10−2
BlochmanniavirusUp14.4851.12 × 10−12
IzhakiellavirusUp17.355.941.83 × 10−11
ChryseobacteriumvirusUp293.10.912.07 × 10−4
a Time is time in days, while rel_time is early (days 0–10) or late (days 15 and 20). Down means lower counts with later time, lower counts with biotype H, and lower counts with harbored CYDV-RPV. The listed genera are the three most extreme for each variable and direction of fold-change.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Crane, Y.M.; Crane, C.F.; Subramanyam, S.; Schemerhorn, B.J. Shotgun Metagenome Analysis of Two Schizaphis graminum Biotypes over Time With and Without Carried Cereal Yellow Dwarf Virus. Insects 2025, 16, 554. https://doi.org/10.3390/insects16060554

AMA Style

Crane YM, Crane CF, Subramanyam S, Schemerhorn BJ. Shotgun Metagenome Analysis of Two Schizaphis graminum Biotypes over Time With and Without Carried Cereal Yellow Dwarf Virus. Insects. 2025; 16(6):554. https://doi.org/10.3390/insects16060554

Chicago/Turabian Style

Crane, Yan M., Charles F. Crane, Subhashree Subramanyam, and Brandon J. Schemerhorn. 2025. "Shotgun Metagenome Analysis of Two Schizaphis graminum Biotypes over Time With and Without Carried Cereal Yellow Dwarf Virus" Insects 16, no. 6: 554. https://doi.org/10.3390/insects16060554

APA Style

Crane, Y. M., Crane, C. F., Subramanyam, S., & Schemerhorn, B. J. (2025). Shotgun Metagenome Analysis of Two Schizaphis graminum Biotypes over Time With and Without Carried Cereal Yellow Dwarf Virus. Insects, 16(6), 554. https://doi.org/10.3390/insects16060554

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop