1. Introduction
Insects live in various environments and take diverse lifestyles. As a result, they are under the threat of microorganism infection in nature. However, they also establish a mutual relationship with some microorganisms. During the adaptation of insects to different environments and to different pathogenic pressures, microorganisms establish different relationships with insects [
1,
2,
3,
4,
5]. Even the related microorganism species have different effects on insects [
6]. In previous studies, we found that some microorganisms contaminated the honeydew secreted by the females of the Chinese white wax scale insect (
Ericerus pela) subsequently invade the host and then kill them. We found the lethal microorganisms were mainly
Cladosporium genus fungi, and we analyzed the changes of microorganism diversity after infection. However, we found some of the endosymbiont fungus of
E. pela was closely related to the lethal fungus [
7]. Why do the related fungi species have two absolutely opposite functions to the same host?
E. pela is a traditional resource insect of China and has been bred for more than 1000 years in China. The males secrete white wax and form wax layer covering on their bodies. Females are unable to secrete white wax, instead they form a hard chitin cuticle outside the body to protect themselves [
8,
9,
10]. The females secrete honeydew continually during incubation period, and the honeydew finally forms a huge ball, which adheres to the back of the female adults. The incubation period lasts for more than half a year. The female adults are immobile because of the rudimentary legs. The honeydew is vulnerable to be contaminated by microorganisms especially in the hot and humid regions [
7,
11]. It has been reported that the honeydew of scale insects and aphids is a source of nourishment to some microorganisms [
12]. For
E. pela, the honeydew they secrete is a good culture medium for
Cladosporium genus fungi [
7].
Cladosporium has been reported to be a special genus. They have tremendous adaptability to variety of environments and host. The species of this genus inhabited in a broad range of environments, coving from terrestrial to marine environments. Their hosts range from plants to animals and humans [
13,
14,
15].
Cladosporium was reported to infect mammals and human and cause illness [
16,
17]. They were also reported to cause plant blossom blight and leaf spot [
18,
19], because they produce a wide range of nature compounds with different bioactive compounds. Some
Cladosporium were used as biocontrol fungi of pests [
13,
20,
21]. Two studies also showed that,
C. cladosporioides infection of the host
Cirsium arvense increased
Cassida rubiginosa feeding plant leaves [
22]. However,
Cladosporium sp. infection of cauliflower lead to plant resistance to
Spodoptera litura [
23]. There has been no report about the symbiotic relationship between the
Cladosporium genus and insect host previously.
It has been well recognized that endosymbionts form mutual relationships with host in many insects, especially in the plant phloem-sucking insects (mainly hemipterous insects). They provide essential amino acids and vitamins, which are deficient in the phloem sap, and act as nutrition partners with the insect host. Except for providing nutrition, endosymbiont bacteria also play defensive roles for the host [
24,
25,
26,
27]. In aphids, some bacteria are able to produce some antibiotics or toxins, and these compounds protect the hosts from microbial pathogens, parasites, and predators [
24,
28]. Scale insects are important parts of plant-sucking insects, the endosymbiont fungi are thought to have similar functions with symbiont bacteria in aphids.
There has been convincing evidence indicated the endosymbionts are evolved repeatedly from free-living lineages. The weak pathogen of some symbionts and special location distribution in host imply their environmental origination [
24,
25]. It was postulated that the endosymbiosis of scale insects is a legacy of the association with microorganisms. The ancestors of scale insect were considered to live under the leaf litters of plants or in the soil. They feed on the root of plants, even the degenerated plants, fungi, and bacteria. The lifestyle leads to the association of a wide range of microorganisms. Maybe they established a certain endosymbiont system before their habitat on the aerial plant parts [
29]. However, how the environmental fungi are captured by scale insects and what functions are lost during the formation of a mutual relationship with the host is not clear.
The symbionts lost essential biological functions accompanied by the formation of association. Their genome remained with evolutionary trajectories related to the trend of function loss and transition. The genomes of free-living microorganisms, obligate mutualisms, and facultative mutualisms have different genome sizes and characters. Generally, the obligate microorganisms have a drastically reduced genome. While facultative mutualisms have moderately reduced genomes and have some pathogen features and free-living abilities. The facultative symbionts are thought to be the transition stage from free-living pathogens to mutualisms [
24,
26,
28]. Some of the endogenous microorganisms of
E. pela could be cultured in vitro and have relatives living freely in the environment. Comparing endogenous microorganisms that have free-living ability with their relatives in nature will provide the opportunities to understand the genetic information related to functional changes that lead to adaptation to insect lifestyle in vivo.
To understand the genomic basis that determines the function adaptation of the two Cladosporium species, we cultured the fungi from the honeydew produced by E. pela female adults in this study, and at the same time we also cultured the endogenous fungi of E. pela. The comparative analyses of genomic character of Cladosporium isolated from honeydew and endogenous Cladosporium will provide key information to address the question of why the two Cladosporium species have drastically different functions to E. pela through the view of their adaptation to the host-restricted lifestyle during association evolution between the microorganism and host.
2. Materials and Methods
2.1. Fungi Cultivation
The honeydew of the infected E. pela by Cladosporium sp. (pathogen) was collected and washed quickly with 75% ethanol in a 1.5 mL centrifuge tube. Then, the honeydew was dissolved in different concentrations with sterilized water. First, 50 μL of each concentration suspension was spread on the PDA medium, and cultured at 28 °C for about three days. The single colony was streak cultured on a new plate, and after several days the single colony was transferred on new plate. These steps were repeated several times until pure fungus clone was obtained. The cultured fungus was picked up for genome sequencing.
To isolate the endosymbionts of E. pela, the eggs were used to isolate endosymbionts. The eggs produced by healthy individuals were washed with 75% ethanol for surface disinfection, and then washed with sterile water. The eggs were homogenized after adding about 100 μL PBS in a 1.5 mL centrifuge tube with a pestle. After short centrifugation, the supernatant was transferred into a new centrifuge tube and dissolved in different concentration. The fungi were cultured in the same way as the honeydew.
2.2. DNA Isolation
The colony from one plate was collected by using a cell lifter. The fungus was grided after adding liquid nitrogen. It was incubated at 65 °C for 1 h after adding a CTAB lysis buffer. The supernatant was transferred to a new tube after centrifugation at 10,000 rpm for 5 min. The same volume of mixture of tris-saturated phenol/chloroform/isopentanol (25:24:1) was added to the supernatant, overturned, and blended sufficiently. The mixture was centrifugated at 10,000 rpm for 10 min, and the supernatant was transferred to a new tube and two-thirds of the volume of isopropanol was added. The DNA was precipitated after incubatation at −20 °C for more than 2 h and centrifuged at 12,000 rpm for 15 min. DNA was washed with 75% alcohol.
The internal transcribed spacer (ITS) gene fragment sequence was amplified by using the ITS4 and ITS5 primer pair. Single positive recombinant colonies were sequenced. A nucleotide sequences blastn search showed that the ITS sequence from honeydew fungus DNA has 99.63% similarity with C. sphaerospermum, and the ITS sequence from egg fungus DNA has 99.83% similarity with C. cladosporioides. The fungus isolated from honeydew was named as Cladosporium sp. (pathogen), and the fungus isolated from egg was named as Cladosporium sp. (endogensis).
2.3. Library Construction and Sequence Filter
The fungus genomic DNA was disrupted by using the ultrasonic DNA disrupter (Covaris, Woburn, MA, USA) to obtain short DNA fragments for library construction. For the 20 kb PacBio sequence library construction, the DNA was disrupted to fragments about 17 kb. After ExoVII digestion, damage repair, end repair, adapter conjunction, enzyme digestion, and fragment selection, the dumbbell-shaped fragment library was obtained. The library quality was validated by Qubit and Agilent 2100. The library was sequenced by PacBio Sequel.
For DNBSEQ library construction, about 1 μg of genomic DNA was disrupted to the fragments of 200–400 bp. The end of the cDNA was repaired, and the A base was added to the end and then the cDNA was connected with the adapter. The conjunction product was amplified by PCR then purified by magnetic bead. The cDNA was melted and became single strings. The single string was cyclized into a circle, and the linear DNA was digested. The circle production was sequenced by DNBSEQ.
2.4. Genome Assembly
The subreads were extracted from polymerase reads. The adapters were removed and the subreads with length shorter than 1000 bp were removed.
The subreads were corrected by using the FalconConsensus software [
30] or mixed to be corrected by the software Proovread [
31]. Highly reliable corrected reads were obtained. The Canu software [
32] was used for sequence assembly based on the corrected-reads. The short sequences from BGI sequencing were used for single nucleotide correction of the genome. The reliable assemble sequences were obtained. The SSPACE_Basic_v2.0 software [
33] was used to assemble scaffold base on the assemble sequence. The holes in the scaffolds were mended by the pbjelly2 software [
34].
2.5. Gene Prediction
The assembled genome sequences were aligned with transposon sequence database to identify the repeat sequences. The RepeatMasker software and RepeatProteinMasker software and the corresponding databases were used. For the Denovo prediction, a database was constructed based on its sequences by using the buildXDFDatabase software, then the transposon model was constructed by using the RepeatModeler software according to the database. Transposons were found using the Repeatmasker software according to the model (the software mentioned above were a softwarebag [
35]). Finally, the Tandem Repeat Finder software [
36] was used to predict tandem repeat sequences.
Non-code RNA was predicted by aligned with rRNA database or predicted by using the RNAmmer software [
37]. tRNA regions and tRNA second construction were predicted by using the tRNAscan software [
38]. sRNA was predicted by aligning with Rfam database [
39] by using the Infernal software [
40].
Gene predictions were performed based on the genewise prediction [
41], Augustus prediction [
42], and GeneMark-ES prediction [
43].
2.6. Gene Functional Annotation
Gene annotation was performed by amino acid sequences aligned with the database, which included Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), Cluster of Orthologous Groups of proteins (COG), Swiss-Prot, Trembl, Non-Redundant Protein Sequence Database (NR), and EggNOG.
The proteins were also aligned with the database of P450, Carbohydrate-Active enZYmes Database (CAZY), Eukaryotic Protein Kinases and Protein Phosphatases (EKPD), the Comprehensive Antibiotic Research Database (CARD), dbCAN (a web server and database for automated carbohydrate-active enzyme annotation), Transporter Classification Database (TCDB), Pathogen–Host Interactions (PHI), and PHOSPHATASE to understand their possible function related to fungal infection.
2.7. Core Gene and Specific Gene Analyses
Five sequenced fungi from
Cladosporium genus (
Table 1) and
Cladosporium sp. (pathogen) and
Cladosporium sp. (endogensis) were selected for Pan gene analyses. All the predicted proteins from the seven fungal genomes were selected for CD-HIT (v4.6.6) cluster analyses [
44,
45]. The final gene sets were used as Pan gene sets. The clusters that contains proteins from all the seven fungi were thought to be core gene sets. The proteins that cannot cluster with other sequences were thought to be specific gene sets. The Pan gene sets that exclude core and specific gene sets were thought to be dispensable gene sets.
2.8. Phylogenetic Tree Construction
Proteins of the seven fungi were searched by BLAST, and then we removed the redundant proteins using solar software (Version 0.9.6). The aligned gene families were TreeFam clustered using Hcluster_sg software (
https://github.com/douglasgscofield/hcluster. accessed on 3 August 2020). The Muscle software [
46,
47] was used to perform mutule protein sequence alignment of the clustered gene family. The protein alignment results were transferred to coding sequence amino acid mutule sequence alignment results. The TreeBeST software [
48] was used to construct phylogenetic tree based on the Muscle [
46,
47] mutule sequence alignment results. A gene family-based tree was constructed by the NJ method, and a Core-pan gene-based tree was constructed by the PHYML method.
ITS, tef-1α, and act sequences [
49] from 68 species (including four outgroups) were aligned separately using MUSCLE and manually adjusted for alignment results. The phylogenetic tree was construct using MEGA (V10.1.7) under the Maximum Likelihood method. The General Time Reversible (GTR) model was used in this tree (Bootstrap = 1000). The rates among sites were set to Gamma Distributed with Invariant Sites (G+I). Other parameters were kept as default values.
2.9. Synteny Analyses
The aimed genome sequence of the two fungi were ordered according to the reference fungal genome based on the alignment result of the MUMmer software [
50]. The sequences were scaled down according to the three genome sequences length and the X- and
Y-axis in the two dimensions of the synteny figure were constructed, and the up and down axis in the linear of the synteny figure were also constructed. The aimed fungal protein sets P1 were aligned with reference fungal protein sequence sets P2 by using BLASTp alignment, and the alternate alignment was also performed. The best alignment pair of each protein in the database was selected as the best hit. The protein pairs with consistent result in the two alignments were reserved. The consistent value of the protein pair was the average of the two alignments. The protein pairs were marked in the figure according to their location.
2.10. Secondary Metabolite Biosynthetic Gene Clusters Prediction
Secondary metabolite biosynthetic gene clusters in the
Cladosporium sp. (pathogen) and
Cladosporium sp. (endogensis) genomes were predicted using the antiSMASH fungal version [
51] online (
https://fungismash.secondarymetabolites.org/ accessed on 12 Agust 2021).
2.11. Cytotoxicity Assay of the Two Cladosporium Species
The fungal colony on the medium dishes was transferred into the liquid medium and shake cultured at 28 °C until the colony filled the liquid medium. Then, several fungal colonies were transferred to new liquid medium and cultured for 7 days. Fungus colonies were absorbed water with filter paper. The Cladosporium sp. (pathogen) and Cladosporium sp. (endogensis) colonies with the same weight were mashed by liquid nitrogen. After sonic disruption, shaking (1% dimethyl sulfoxide, DMSO), and centrifugation, the supernatant was used for cytotoxicity assay.
Aedes albopictus C6/36 cells at the logarithmic growth stage were inoculated in a 96-well plate. The cells were precultured in wells for 24 h, and then 10 µL of solution was added into each well. Each 10 µL of solution contained a gradient volume of dissolved materials of fungi or DMSO as follows: 0 µL, 0.1 µL, 1 µL, 2 µL, 4 µL, 6µL, 8 µL, and 10 µL. Five replicates were set for each solution. After incubation at 28 °C for 24 h, the liquid medium in each well was discarded and 100 µL of fresh medium and then 10 µL of CCK8 (Proteintech Group, Rosemont, IL, USA) were added to each well. After incubation for 1–2 h, the OD value was detected at 450 nm by the microplate reader (Thermo Fisher 3001, Waltham, MA, USA). The cell viability of Aedes albopictus C6/36 cells was calculated according to the formula of the manual (Proteintech Group, Rosemont, IL, USA). Three biological replications were performed. The cell viability results were analyzed using the DPS software. The least significant difference (LSD) test was used and analyses were carried out at the p = 0.01 level.
4. Discussion
In this study, we sequenced the genome of the two
Cladosporium. Genome annotation, pan-genome analyses, and secondary metabolite biosynthetic gene cluster prediction were performed. Specific gene analyses found that their species-specific genes were enriched in different pathways. Obviously,
Cladosporium sp. (endogensis)-specific genes were enriched in the pathways related to amino acid metabolism, carbohydrate metabolism, and energy metabolism. These pathways are important to provide nutrition for the host. These pathways were not found in the enrichment result of
Cladosporium sp. (pathogen) (
Figure S4, Tables S8 and S9). The function differences of
Cladosporium sp. (pathogen) and
Cladosporium sp. (endogensis) were thought to result from the differences of their genome characters.
It has been revealed that some bacteria symbionts, such as
Buchnera aphidicola,
Candidatus Vallotia, and
Candidatus Profftia, are able to provide vital vitamins and necessary amino acids for aphids [
28,
52]. It was shown that bacteria symbionts
Carsonella_DC and
Candidatus Profftella in psyllids also provide vitamins and amino acids for the host [
26]. The endosymbionts are indispensable for insect hosts, especially for plant-sucking insects, such as aphids, psyllids, whiteflies, and plant hoppers. In this study, we found
Cladosporium sp. (endogensis) and
Cladosporium sp. (pathogen) had pathways related to the biosynthesis of amino acids and vitamins (
Tables S5 and S6). These genes were important for
Cladosporium sp. (endogensis) to establish a mutual relationship with
E. pela.
CAZy and dbCAN annotation showed that
Cladosporium sp. (endogensis) had more carbohydrate esterases (CE) and glycoside hydrolases (GH) than
Cladosporium sp. (pathogen) (
Table S7). CE and GH were important to deacetylate plant polysaccharides. It is the character of phytopathogenic fungi that they need these enzymes to degenerate polysaccharides and break down plant cell walls to invade plants [
53,
54]. For sap-sucking insects, they need to pierce the plant phloem of the branch or leaf. In addition, phloem saps have high sugar contents. These food resources tend to result in nutrition deficiency and metabolism burden for insect feeders. The CEs and GHs of
Cladosporium sp. (endogensis) were thought to play an important role in the phloem sap-sucking lifestyle of
E. pela.
Compared to
Cladosporium sp. (pathogen),
Cladosporium sp. (endogensis) had more genes related to loss or reduction of pathogenicity or virulence (
Table S7). “Loss of pathogenicity” and “reduced virulence” represented reduced pathogenicity and virulence against the host [
55]. It was inferred that, during evolution, some microorganisms such as the ancestor of
Cladosporium sp. (endogensis) had the ability to degenerate plant polysaccharides to have endosymbiosis character and act as nutrition partners of
E. pela. The initial
Cladosporium in
E. pela could contaminate the plant phloem sap by the sucking the mouthpart and circulate horizontally in
E. pela, which could accelerate the spread and colony in the population.
Synteny analyses showed that
Cladosporium sp. (pathogen) and
Cladosporium sp. (endogensis) had a significant synteny with
C. phlei when compared with other
Cladosporium fungi.
C. phlei is a phytopathogenic fungus that causes purple eyespot disease, which is a common disease in the timothy plant. In addition,
C. phlei is intensively studied because it produces the fungal pigment, phleichrome, which is used in photodynamic therapy. This fungal perylenequinone is produced by polyketide synthases (PKS) [
56]. Phleichrome has antifungal activity against the fungus
Epichloe typhina [
57]. In this study, secondary metabolite biosynthetic gene cluster analyses showed that
Cladosporium sp. (pathogen) and
Cladosporium sp. (endogensis) had gene clusters responsible for the production of some polyketide. However, the cytotoxicity assay showed that
Cladosporium sp. (endogensis) had no cytotoxicity, while
Cladosporium sp. (pathogen) showed cytotoxicity even at a low volume. The results were consistent with the endosymbiosis character of
Cladosporium sp. (endogensis) and the pathogen character of
Cladosporium sp. (pathogen). In addition, according to the mutual gene phylogenetic analyses,
Cladosporium sp. (endogensis) is closely related to
C. cladosporioides.
C. cladosporioides was identified in the gut of adult beetle
Brachypeplus glaber [
58]. It was shown that
Cladosporium spp. accounted for 12.0% of the fungal species of the red flour beetle
Tribolium castaneum [
59]. The existence of
Cladosporium in the insect implied their symbiotic role in the host.
However, it is different for
Cladosporium sp. (pathogen). It is an entomopathogen of
E. pela [
7]. According to the specific gene analysis,
Cladosporium sp. (pathogen)-specific genes were enriched in GO terms related to the biosynthesis process of a variety of compounds, such as asperfuranone, emericellamide, fumagillin, o-orsellinic acid, and sterigmatocystin. On the other hand, these terms were absent in the
Cladosporium sp. (endogensis)-specific gene GO enrichments (
Figure S4, Table S9). The genes related to these terms were thought to play an important role in the infection process of
Cladosporium sp. (pathogen) as they act as a lethal pathogen.
Cladosporium fungi distribute in a variety of environments and infect a wide range of hosts [
13].
Cladosporium fungi are a pathogen for animals and plants. They cause human subcutaneous infection, brain abscess, pulmonary hemorrhage symptoms, and congestion, as well as phaeohyphomycosis. They also cause plant blossom blight and leaf spot [
13,
16,
18]. As an important resource insect, the distribution areas of
E. pela cover a wide range of climate types from hot and humid climates to cold and dry climates [
60,
61]. The honeydew produced by females is prone to be contaminated by a variety of pathogens including
Cladosporium sp. (pathogen) [
7].
E. pela has been cultured in China for more than 1000 years; however, this period is still very short compare with the evolution process of scale insects.
E. pela was subjected to the infection of
Cladosporium sp. (pathogen) during the cultivation and popularization of
E. pela in some regions.
Cladosporium sp. (pathogen) is a new pathogen of
E. pela, and it can be inferred from the reports that the honeydew of scale insects and aphids is a source of nourishment to some microorganisms, especially the Capnodiales order fungi [
62]. Thus,
Cladosporium sp. (pathogen) is a new invader of
E. pela in some regions during cultivation, which they did not encounter before.
Genome sequence analyses showed that
Cladosporium sp. (endogensis) fungi have a larger genome size than
Cladosporium sp. (pathogen), and the genome size is also larger than the other
Cladosporium species that have been genome sequenced. It was different with genomic analyses of pathogenic bacteria and bacterial symbiosis in aphids. In aphids, the three free-living
Serratia strains that are pathogenic to aphid hosts have a larger genome size than the facultative or obligate
Serratia strains. It was thought that the transition of
Serratia strains from a free-living stage to a host-restricted stage, comparing the reduction of genome size, mainly results from the losses of metabolic pathways or genes related to metabolism [
24]. However, in plants, some biotrophic pathogens have increased their genome size. Spanu (2012) holds that the increase or decrease in genome size was thought to be driven especially by the diversification and creation of effector genes [
63]. It was thought that
Cladosporium sp. (endogensis) fungi increased their genome size and formed a new genomic character to facilitate the mutual relationship with the
E. pela host. While
Cladosporium sp. (pathogen), acting as a pathogenic fugus, maintained the character of invader and lived free in nature.