Investigating Common Pathogenic Mechanisms between Homo sapiens and Different Strains of Candida albicans for Drug Design: Systems Biology Approach via Two-Sided NGS Data Identification

Candida albicans (C. albicans) is the most prevalent fungal species. Although it is a healthy microbiota, genetic and epigenetic alterations in host and pathogen, and microenvironment changes would lead to thrush, vaginal yeast infection, and even hematogenously disseminated infection. Despite the fact that cytotoxicity is well-characterized, few studies discuss the genome-wide genetic and epigenetic molecular mechanisms between host and C. albicans. The aim of this study is to identify drug targets and design a multiple-molecule drug to prevent the infection from C. albicans. To investigate the common and specific pathogenic mechanisms in human oral epithelial OKF6/TERT-2 cells during the C. albicans infection in different strains, systems modeling and big databases mining were used to construct candidate host–pathogen genetic and epigenetic interspecies network (GEIN). System identification and system order detection are applied on two-sided next generation sequencing (NGS) data to build real host–pathogen cross-talk GEINs. Core host–pathogen cross-talk networks (HPCNs) are extracted by principal network projection (PNP) method. By comparing with core HPCNs in different strains of C. albicans, common pathogenic mechanisms were investigated and several drug targets were suggested as follows: orf19.5034 (YBP1) with the ability of anti-ROS; orf19.939 (NAM7), orf19.2087 (SAS2), orf19.1093 (FLO8) and orf19.1854 (HHF22) with high correlation to the hyphae growth and pathogen protein interaction; orf19.5585 (SAP5), orf19.5542 (SAP6) and orf19.4519 (SUV3) with the cause of biofilm formation. Eventually, five corresponding compounds—Tunicamycin, Terbinafine, Cerulenin, Tetracycline and Tetrandrine—with three known drugs could be considered as a potential multiple-molecule drug for therapeutic treatment of C. albicans.


Introduction
Candida albicans (C. albicans) is considered as a prevalent fungus. Nonetheless, it will become an opportunistic pathogenic fungus based on the host's condition [1]. This pathogen is a Gram-positive fungus but adapts to anaerobic and aerobic condition. C. albicans exists in the oral, vaginal mucosa and gastrointestinal tract of many organisms [1], especially in oral epithelial cell. The relevant diseases include pseudomembranous candidiasis and denture-associated erythematous candidiasis. The former pseudomembranous candidiasis is also known as thrush mainly infecting infants and immunocompromised patients. The latter denture-associated erythematous candidiasis takes place among elderly due to poor cleaning of oral prosthetic devices in the hospital. Additionally, acceptance of cytotoxic chemotherapy will lead to severe immunosuppression and promote mucosal damage which makes C. albicans overgrow in the oral epithelial cell to cause oral candidiasis. Meanwhile, a delicate balance clearly exists between the potentially damaging effects of Candida virulence factors and the nature of the immune response elicited by the host [2]. Furthermore, under the limitation of resource in the infected host, there is a competition between C. albicans and hosts striving for resource to support their essential functions.
SC5314 and WO-1 are two common strains of C. albicans employed in laboratory for clinical research. Compared to SC5314, WO-1 in the white cell transforms to opaque cell with high frequency [3]. In addition, the sequence of C. albicans SC5314 is transcribed by previous studies so that C. albicans SC5314 is frequently used as a wild-type control derived from common laboratory [4]. Although previous studies did not indicate why C. albicans would be separated into different strains, strains SC5314 and WO1 are estimated to be separated from each other by a divergence time of one million years [5]. Both strains of C. albicans might exist in human body experiencing constant evolution to adapt for host microenvironment. The OKF6/TERT-2 cell line applied to be a model of the human oral epithelial cell is acquired from human oral keratinocytes. Previous studies usually used TR146 cell to do experiment and employed it for pathogen infection [6]. However, we could not regard TR146 as human oral keratinocytes or true model. Along with the advancement of biological technology, OKF6/TERT-2 cell line is a 3D system which resembles the commercially available system based on the cell line TR146 [7]. The cell line is made up of a multiple layer epithelial structure which is similar to the cells in native oral mucosa. Therefore, it is a better representative of the normal submucosa and true human mucosa. In the immune system, epithelial cells become the first defense line to antagonize bacterial infection. Nonetheless, under C. albicans infection condition, this monolayer of cell surface can be destructed by the pathogen's hyphae or cell surface proteins, allowing C. albicans to enter oral mucosa and motivate oral mucosal immune cells such as macrophages, neutrophils and dendritic cells. Moreover, cell surface proteins of C. albicans can degrade host cell surface protein to enter the cell so that the whole C. albicans will invade the host cell. C. albicans infection often arises after the disturbance of normal oral microbiome following immunocompromised patients including the HIV-infected patients or the broad-spectrum antibiotic treatment. After the decrease of immune system or the interference of the oral microbiota, C. albicans can form colonization on oral epithelial cells by hyphal growth, grow hyphae to penetrate cell and yield pathogenic factors to degrade the barrier.

The Identified GEINs under the Infection of C. albicans SC5314 and C. albicans WO-1
The real GEINs of C. albicans SC5314 infection in OKF6/TERT-2 cells of two replicates are shown in Figure S3A,B in Supplementary Materials, respectively, by applying network visualizing software Cytoscape [19]. Similarly, the real GEINs of C. albicans WO-1 infection in OKF6/TERT-2 cells of two replicates are shown in Figure S3C,D in Supplementary Materials, respectively. The number of identified nodes and edges are shown in Tables 1 and 2, respectively. For two replicates of C. albicans SC5314, the number of nodes of replicate 1 is higher than replicate 2; and the identified edges in Table 2 show significant differences between two replicates. By contrast, for two replicates of C. albicans WO-1, the number of nodes of replicate 1 is similar to replicate 2; and the identified edges in Table 2 show little differences between two replicates. Compared to C. albicans WO-1, C. albicans SC5314 easily produces the individual difference in infection progression. Overall, C. albicans WO-1 is more stable than C. albicans SC5314 in infection progression.
To further find gene functions in OKF6/TERT-2 cells during the infection of two strains of C. albicans according to their functional groups, we exhibited the specific host cellular functions and analyzed the functional abundance of related pathways of the conserved host target-genes among 2 replicates on the basis of gene ontology (GO) terms by applying the DAVID analysis [20]. In Table 3, the infection progression of C. albicans SC5314 was characterized by the redistribution of epithelial cell barrier, cell shape and cell adhesion so that they can activate oxidation response to inflammation response and metal-binding, which could act as an important character in the struggle for nutrients and metal material between host and pathogen. While considering the infection progression of C. albicans WO-1, it is similar to C. albicans SC5314. However, in addition to finding host gene functions, pathogen gene functions were also found. Based on Candida Genome Database, specific pathogen functions with a plenty of conserved pathogen target-genes among 2 replicates are found by applying Gene Ontology Term Finder. In Table 4, gene functions of C. albicans SC5314 were characterized by epithelial cell barrier so that C. albicans SC5314 can make morphological transition due to structural molecule activity and molecular function regulation. Besides, C. albicans SC5314 can invade oral epithelial cell via the induced endocytosis and active penetration which are applied by hydrolase activity, protein binding and structural molecule activity. While considering gene functions of C. albicans WO-1, it is the same as C. albicans SC5314. Table 1. Information about the numbers of nodes of candidate GEINs and real GEINs by the proposed system identification method in the infection of C. albicans SC5314 and C. albicans WO-1 of two replicates.  Since the real GEINs are very complicated, it is still hard to investigate common and specific infection pathogenic mechanism from these networks directly. We thereby used PNP approach (see Section 1.5 in Supplementary Materials) to find the corresponding core HPCNs in Figures S4 and S5 (see Supplementary Materials) from real GEINs of OKF6/TERT-2 cells during the infection of C. albicans WO-1 and SC5314, respectively. By this approach, we can further investigate common and specific pathogenic mechanisms of the C. albicans infection in respect of KEGG pathways. Table 3. The specific and common host cellular functions and functional abundance analysis of related pathways of the conserved host target-genes between 2 replicates in the infection of C. albicans SC5314 and WO-1 on the basis of GO terms by applying the DAVID analysis.

Category
Term p-Value The C. albicans SC5314 infection is characterized by the transformation of host cell shape, and this will result in the activities of GTPases because pathogen proteins adhere to host cell surface. However, while C. albicans SC5314 executes endocytosis function, cytoskeleton and cytoplasm change cell type in the host cells so that it will produce DNA replication and nucleic acid binding to activate gene expression. Additionally, the metal ion-binding ability plays a crucial role for human and pathogen because of its function in finding metallic nutrients. By this process, host cell also produces ROS-related molecule although the ion is toxic. Applying the ROS-related molecule also leads to eliminating C. albicans SC5314. Similarly, the C. albicans WO-1 infection is characterized by the transformation of host cell shape, and this will result in the activities of GTPases because pathogen proteins adhere to host cell surface.
However, while C. albicans WO-1 executes endocytosis function, cytoskeleton and cytoplasm change cell type in the host cells so that it will produce DNA replication and nucleic acid binding to activate gene expression. Additionally, the metal ion-binding ability plays a crucial role for human and pathogen because of its function in finding metallic nutrients. By this process, host cell also produces ROS-related molecule although the ion is toxic. Applying the ROS-related molecule also leads to eliminating C. albicans WO-1. The C. albicans SC5314 pathogenic function is considered by morphological transformation including structural molecule activity, molecular functional regulation, hydrolase activity and transferase activity. It is noted that C. albicans is influenced by transferase activity so that it will promote hyphal growth. Moreover, hydrolase activity is thought to facilitate penetration into host cells. Similar gene ontology terms can be observed in C. albicans WO-1. Employing PNP approach to real GEINs in Figure S3A,B in Supplementary Materials, we could evaluate the projection value of each node by the Equation (40) in Supplementary Materials for the building of core HPCNs. Host proteins with top 5000 projection values and pathogen protein with top 1500 projection values based on intra-species ranking in all two replicates and their connected genes/miRNAs/lncRNAs were selected as core HPCN of two replicates. Since the recognized real GEINs in Figure S3A,B in Supplementary Materials, respectively, are part of two replicates, which are biological replicates from the same cell line. The recognized differential regulations and interactions can be considered as the adaptability of cells while confronting stimulus and stress at different replicates. For more intact information, the combinations of these interactions and regulations in two replicates are viewed as real GEIN as shown in Figure S3E. Next, we extracted core nodes from the real GEIN in Figure S3E by PNP approach (see Section 1.5 in the Supplementary Materials) to construct core HPCN as shown in Figure S4 during the infection of C. albicans SC5314. Likewise, core HPCN of C. albicans WO-1 which is shown in Figure S5 was constructed by the same procedure.
Comparing Figures S4 and S5 in Supplementary Materials, the number of core proteins in the core HPCN during the infection of C. albicans WO-1 is higher than that in the core HPCN during the infection of C. albicans SC5314. The C. albicans SC5314 infection exists a number of individual differences in host cells. These differences lead to various projection values causing relatively diverse signaling pathways in two replicates. In contrast, the C. albicans WO-1 infection holds consistent high projection values causing similar signaling pathways in two replicates. Moreover, only the pathogen proteins with top 1500 projection values and host proteins with top 5000 projection values were selected as core network nodes of core HPCN in the infection of C. albicans SC5314. The same criteria apply to choose core network nodes of core HPCN in the infection of C. albicans WO-1.
To further investigate common and specific progression genetic and epigenetic mechanisms between C. albicans SC5314 and C. albicans WO-1 from Figures S4 and S5 in Supplementary Materials, based on the projection value of each element in the core HPCNs, we then construct core host-pathogen signaling pathways for two strains of C. albicans. For core host signaling pathways, we select core membrane proteins including core receptors, core TFs, core proteins, and their regulatory miRNAs and lncRNAs. On the other hand, we construct core pathogen signaling pathways by selecting cell wall core proteins, signal transmission core pathogen proteins and core pathogen TFs. In order to make signal transduction pathways complete, we not only have to consider elements mentioned above but also the role of epigenetic modifications such as acetylation, methylation, ubiquitination and phosphorylation. These epigenetic modifications decided by basal levels χ H i and χ P j in the host and pathogen PPIN in Equations (1) and (2) shown in Supplementary Materials, respectively. Here, basal levels denote unknown interactions which have not been mentioned in stochastic dynamic interactive equations. When a PPI basal level exceeds a threshold, the core proteins with an overtaking threshold of basal level in infection progression were speculated that these core proteins may be affected by epigenetic modification such as acetylation, methylation, ubiquitination and phosphorylation. Moreover, the genes with an overtaking threshold of basal level in infection progression were speculated that these core genes may be influenced by DNA methylation. The core cross-talk pathways of each strain in infection progression are described in the following and shown in Figures 1 and 2. The core cross-talk pathways extracted and rearranged based on KEGG pathways from the core HPCN in Figure S4 during the C. albicans SC5314 infection. The upper layer is the pathogen core pathways and the lower layer signifies the host core pathways during the C. albicans SC5314 infection. The grey lines represent the protein-protein interaction; the red arrow lines are transcriptional regulation; the green dot lines denote the protein translation; the black dash lines indicate the protein secretion; the blue lines with circle endpoint represent miRNA repression and the circles with purple frame and arrow lines represent the production activity and response of reactive oxygen species (ROS). The pathogenic factor orf19.1816 (ALS3) of C. albicans SC5314 triggers to induce endocytosis. The TFs orf19.1623 (CAP1) and orf19.5034 (YBP1) are pathogenic factor of C. albicans SC5314 to react via ROS of host production. Moreover, OKF6/TERT-2 cells apply autophagy and immune response to recruiting immune cells such as macrophages and neutrophils to eliminate C. albicans SC5314. Finally, the endoplasmic reticulum stress reflects on the accumulated cellular stress and host cell extrusion so that host cells will produce severe inflammatory response and cause apoptosis process. In addition, pathogenic factors orf19.5585 (SAP5) and orf19.5542 (SAP6) also generate inflammation response and apoptosis process of the host cell. The core cross-talk pathways extracted and rearranged based on KEGG pathways from the core HPCN in Figure S4 during the C. albicans SC5314 infection. The upper layer is the pathogen core pathways and the lower layer signifies the host core pathways during the C. albicans SC5314 infection. The grey lines represent the protein-protein interaction; the red arrow lines are transcriptional regulation; the green dot lines denote the protein translation; the black dash lines indicate the protein secretion; the blue lines with circle endpoint represent miRNA repression and the circles with purple frame and arrow lines represent the production activity and response of reactive oxygen species (ROS). The pathogenic factor orf19.1816 (ALS3) of C. albicans SC5314 triggers to induce endocytosis. The TFs orf19.1623 (CAP1) and orf19.5034 (YBP1) are pathogenic factor of C. albicans SC5314 to react via ROS of host production. Moreover, OKF6/TERT-2 cells apply autophagy and immune response to recruiting immune cells such as macrophages and neutrophils to eliminate C. albicans SC5314. Finally, the endoplasmic reticulum stress reflects on the accumulated cellular stress and host cell extrusion so that host cells will produce severe inflammatory response and cause apoptosis process. In addition, pathogenic factors orf19.5585 (SAP5) and orf19.5542 (SAP6) also generate inflammation response and apoptosis process of the host cell.  Figure 2. The core cross-talk pathways extracted and rearranged based on KEGG pathways from the core HPCN in Figure S5 during the C. albicans WO-1 infection. The upper layer is the pathogen core pathways and the lower layer signifies the host core pathways during the C. albicans WO-1 infection. The grey lines represent the protein-protein interaction; the red arrow lines are transcriptional regulation; the green dot lines denote the protein translation; the black dash lines indicate the protein secretion; the blue lines with circle endpoint represent miRNA repression and the circles with purple frame and arrow lines represent the production activity and response of ROS. The pathogenic factor CAWG_02005 (ALS3) of C. albicans WO-1 triggers to induce endocytosis. The TFs CAWG_02548 (CAP1) and CAWG_00057 (YBP1) pathogenic factor of C. albicans WO-1 will react via ROS of host production. Moreover, OKF6/TERT-2 cells apply autophagy and immune response to recruiting immune cells such as macrophages and neutrophils to eliminate C. albicans WO-1. Finally, the endoplasmic reticulum stress reflects on the accumulated cellular stress and host cell extrusion so that host cells will produce severe inflammatory response and cause apoptosis process. In addition, pathogenic factors CAWG_05098 (SAP6) and CAWG_05066 (SAP5) also lead to inflammation response and apoptosis process of the host cell. The core cross-talk pathways extracted and rearranged based on KEGG pathways from the core HPCN in Figure S5 during the C. albicans WO-1 infection. The upper layer is the pathogen core pathways and the lower layer signifies the host core pathways during the C. albicans WO-1 infection. The grey lines represent the protein-protein interaction; the red arrow lines are transcriptional regulation; the green dot lines denote the protein translation; the black dash lines indicate the protein secretion; the blue lines with circle endpoint represent miRNA repression and the circles with purple frame and arrow lines represent the production activity and response of ROS. The pathogenic factor CAWG_02005 (ALS3) of C. albicans WO-1 triggers to induce endocytosis. The TFs CAWG_02548 (CAP1) and CAWG_00057 (YBP1) pathogenic factor of C. albicans WO-1 will react via ROS of host production. Moreover, OKF6/TERT-2 cells apply autophagy and immune response to recruiting immune cells such as macrophages and neutrophils to eliminate C. albicans WO-1. Finally, the endoplasmic reticulum stress reflects on the accumulated cellular stress and host cell extrusion so that host cells will produce severe inflammatory response and cause apoptosis process. In addition, pathogenic factors CAWG_05098 (SAP6) and CAWG_05066 (SAP5) also lead to inflammation response and apoptosis process of the host cell.
In response to pathogen infection, the receptors such as EGFR, ERBB2, CDH1, HSP90B1, and TJAP1 at the host membrane interact with pathogen cell wall proteins to start inducing endocytosis.
Apart from the epigenetic regulations in core pathways mentioned above, we also discovered epigenetic regulations of miRNAs, including miR-30B, miR31, miR3941, miR143HG, miR1972-2 and miR548D2. MiR-30B inhibits AVEN during C. albicans SC5314 infection to reduce autophagy and apoptosis [39]. UCN2 and CCDC22 are silenced by miR31 and miR1972-2 to decrease inflammatory response, respectively. In addition to inflammatory response, UCN2 also decreases innate immune response by miR31. However, IL1B was stimulated to initiate apoptosis and inflammatory response, and miR3941 also inhibits IL1B to reduce tissue necrosis. Moreover, we found miR548D2 inhibits orf19.3292 to strength the effect of ROS which was first produced by host cell. Besides, orf19.7292 is silenced by miR143HG to reduce hyphae growth of pathogen. Finally, there is one gene UBC with an overtaking threshold basal level, indicating that this might have been resulting in DNA methylation via infection progression [40].

Analysis of Core Interspecies Pathways to Investigate Host/Pathogen Cross-talk Common and Specific Pathogenic Mechanisms during C. albicans WO-1 Infection
As shown in the core cross-talk pathways of Figure 2, our results indicate most interactions and regulations are the same as Figure 1. Accordingly, we only discuss different interactions and regulations in C. albicans WO-1 infection from C. albicans SC5314 infection. For pathogen interactions, the CAWG_01529-induced (orf19.2575) methylation of CAWG_00472 (orf19.581) simultaneously influences upstream proteins CAWG_3130 (PRA1) and CAWG_01080 (orf19.2995) and downstream protein CAWG_00418 (WOR1). Nonetheless, CAWG_00418 (WOR1) is subjected to not only protein CAWG_04472 (orf19.581) but also microenvironment filled with CO 2 so that CAWG_00418 (WOR1) can transform white cell to opaque cell. Furthermore, CAWG_02083 (EFG1) is also the main regulation of white-to-opaque switch of gene CAWG_00418 (WOR1) so that yeast cell of C. albicans WO-1 can transform white cell to opaque cell [41]. Other pathogen interactions signify that CAWG_04469 (orf19.578) can not trigger any TFs by interaction between CAWG_05375 (FAR1) and CAWG_00299 (orf19.666) but interact with CAWG_00581 (CDC42) directly. Moreover, CAWG_04844 (ASH2) does not bind to CAWG_01970 (GCN5) due to reducing epigenetic modification of CAWG_01970 (GCN5). Finally, because of the acetylation of CAWG_00969 (HHF1), CAWG_04836 (PSD2) could directly regulate CAWG_02542-induced methylation and be indirectly mediated by acetylation to increase interactions with CAWG_04444 (NAM7). On the other hand, in the pathogen regulations, it is noted that CAWG_00682 (CPH1) does not regulate DNA damage-related gene CAWG_00299 (orf19.666); TF CAWG_02766 (TEC1) does not regulate DNA damage-related gene CAWG_01979 (orf19.666). Instead, the TF CAWG_02766 (TEC1) modulates cellular functions such as hyphae growth, biofilm formation and white cell pheromone response. Due to the regulatory function of TF CAWG_02766 (TEC1), our result may imply that C. albicans WO-1 could transform opaque cell in the infection progression to reduce the regulation of CAWG_01979 (HHF22).
On the host side, similarly, we only discuss different interactions and regulations in C. albicans WO-1 infection from Figure 2. For host interactions, MRPL50 via C. albicans WO-1 infection directly triggers TF FOXA1 but not protein MYC (also knowns C-Myc) in C. albicans SC5314 infection. Moreover, EED does not simultaneously trigger two TFs GATA1 and YBX1 from our result. Instead, TF YBX1 is activated by protein AVEN. Due to different epigenetic modifications of MYC, MYC cannot trigger protein VCAM1 but further increase downstream protein interactions. For example, AVEN triggers MAPK14 (also known as p38, PRKM14) via VCAM1. Furthermore, receptor EGFR with NAALADL2-induced acetylation and OTUD3-induced ubiquitination activates less proteins such as only AVEN. Compared to C. albicans SC5314 infection, EGFR triggers two proteins BPHL and AVEN. Therefore, the activity of EGFR is limited to epigenetic modification in our result. Moreover, BPHL binds to GRB2 under C. albicans WO-1 infection via UBC. We can infer that UBC affected by NAALADL2-induced acetylation and PPP4R4-induced phosphorylation activates different interactions. By contrast, the receptor HSP90B1 cannot transmit stimulation signals to TF GATA1 due to GAPDHP24-induced phosphorylation. Therefore, we can infer that phosphorylation will produce inhibition. Finally, the HGSNAT-induced acetylation and USP41-induced ubiquitination of receptor ARRB2 activate downstream proteins such as LIPE, DHX9 and RAB12. Conversely, in C. albicans SC5314 infection, ARRB2 with the same epigenetic modification triggers different proteins such as SSR4 and C18orf8. It is noted that different epigenetic proteins could cause regulations of TF to strengthen the corresponding cellular response. Subsequently, when HSP90B1 with GAPDHP24-induced phosphorylation further triggers the regulation of TF ETS1, HSP90B1 with ubiquitination will not activate TF ETS1 in the infection of C. albicans SC5314. We may speculate that ubiquitination modification may degrade a receptor to reduce its regulation of TF. In addition, CCDC22 receives the upstream signal from receptor CDH1 influenced by MTAP-induced methylation and USP47-induced ubiquitination and results in repression of different miRNAs such as mir210.
In conclusion, distinct epigenetic modifications will lead to different host cellular responses from the exhibition of our results. The details will be discussed in the next section.

Discussion
From Figures 1 and 2, we identified the host-pathogen core cross-talk pathways during the infection of C. albicans SC5314 and C. albicans WO-1 by systems biology methods (refer to Supplementary Materials), respectively. Overall, there is a little difference between pathogenic mechanisms of two strains of C. albicans such as white-to-opaque switch. One study indicated C. albicans WO-1 could change white cell to opaque cell at high frequency compared to C. albicans SC5314 [42]. We found even exist this phenomenon, there are common pathogenic mechanisms between C. albicans SC5314 and WO-1. In addition, C. albicans usually maintain in white cell in normal microenvironment. Perhaps, it is the difficult environment forcing C. albicans to switch in opaque cell. Previous study suggests that opaque cells keep their own cell type in certain conditions including carbon dioxide, anaerobic growth and acidic (pH < 7) making them proceed sexual reproduction [43].
In other words, we also could imply that C. albicans WO-1 may transform opaque cell to white cell immediately due to external microenvironment. Moreover, the MTLa locus of C. albicans WO-1 is absent in all other strains of C. albicans. These would lead to mutation and genetic diversity occurring easily to C. albicans SC5314 [40].
According to Tables 1 and 2, C. albicans WO-1 is more stable than C. albicans SC5314. Hence, pathogen proteins within top 1500 projection values and host proteins with top 5000 projection values in PNP projection method (see Equation (40) in Supplementary Material) can be analyzed based on KEGG pathways to discover common pathogenic mechanism. In Figures 1 and 2, we extracted the specific core cross-talk pathways to investigate C. albicans infection mechanism within different strains, respectively. Discussing from outside to inside of the host cell, we infer that C. albicans becomes harmful from harmless on the oral epithelial cell. In the following subsections, we will extract three figures from Figures 1 and 2 to discuss the common and specific pathogenic mechanism of two strains of C. albicans. At last, based on their common pathogenic mechanism, we could propose common drugs to treat the infection of different strains of C. albicans.

Defensive Mechanism of OKF6/TERT-2 Cell and the Offensive Mechanism of Different Strains of C. albicans at Host Cell Surface
As shown in Figure 3A, C. albicans SC5314 is commensal on human oral epithelial cell. However, pathogen cell surface protein orf19.1816 (ALS3) binds to receptors CDH1 (also known as E-cadherin), ERBB2 (also known as HER2), EGFR and HSP90B1 so that these receptors will be degraded and induced in endocytosis. In the infection progresses, C. albicans SC5314 will invade a host cell through endocytosis and begin invasion. Receptor CDH1 with the NAT10-induced acetylation activates downstream TFs JUN (also known as c-Jun) and FOS (also known as c-Fos) through signaling protein CCDC22, which is involved in trafficking between the trans-Golgi network and vesicles in the cell periphery. Therefore, TF FOS affected by UBFD1-induced ubiquitination negatively regulates gene MMP12, which is positively regulated by TF JUN to result in the degradation of extracellular matrix (ECM). TF JUN affected by USP12PX-induced ubiquitination and PRMT1-induced methylation positively regulates gene PPARD so that host cells will produce ROSs to eliminate C. albicans SC5314. Moreover, HSP90B1 with the USP11-induced ubiquitination binds to proteins TMEM205, which is related to chemotherapeutic agent, to trigger TF JUN. Previous studies indicate that after accepting chemotherapeutic agent, C. albicans will invade host cell and recover itself in immune-compromised patients [44]. Next, receptor EGFR activates TF Jun via BPHL to regulate biological oxidations and hydrolase activity. Finally, the receptor ERBB2 triggers TFs YBX1 and GATA1 through a sequence of signaling protein RER1 to regulate ER protein, HIST1H4B, which is affected by the NEURL4-induced ubiquitination and PLA2G12B-induced phosphorylation-involved histone binding, SSR4, GRB2 influencing cell death, UCN2, SH2D1A related to immune regulatory interactions, RAB12 involved in autophagy modulation, and EED. TF YBX1 with the SHMT1-induced methylation and USP24-induced ubiquitination negatively regulates innate immune-related gene UCN2. TF GATA1 positively regulates gene AVEN which can eliminate C. albicans SC5314 by autophagy. However, because pathogen is not considered as a danger signal, miR-30B represses gene AVEN to reduce autophagy function. Then, because gene MMP12 regulation results in ECM degradation such as collagen, vitronectin, laminin and fibronectin, C. albicans SC5314 starves to nutrient or carbon sources.  albicans SC5314 infects OKF6/TERT-2 cells, host cells will produce immune response, autophagy and ROS production to defend against pathogen. Moreover, C. albicans SC5314 begins to grow hyphae and adhere to host cell surface proteins so that it increases invasion for starving nutrient source. The induction of endocytosis is also beneficial to C. albicans SC5314 invasion. Similarly, in (B), when C. albicans WO-1 infects OKF6/TERT-2 cells, host cells will produce immune response, autophagy and ROS production to defend against pathogen. Moreover, C. albicans WO-1 begins to grow hyphae and adhere to host cell surface proteins so that it increases invasion for starving nutrient source. The induction of endocytosis is also beneficial to C. albicans WO-1 invasion.
As shown in Figure 3B, C. albicans WO-1 is commensal on human oral epithelial cell. However, pathogen cell surface protein CAWG_2005 (ALS3) binds to receptors CDH1 (also known as E-cadherin), ERBB2 (also known as HER2), HSP90B1 and EGFR so that these receptors will be degraded and induced in endocytosis. As infection progresses, C. albicans WO-1 will invade a host cell through endocytosis and begin invasion. Compared to C. albicans SC5314, the receptor CDH1 with the MTAP-induced methylation and USP47-induced ubiquitination activates to downstream TFs JUN (also known as c-Jun) and FOS (also known as c-Fos) through CCDC22, which is involved in trafficking between the trans-Golgi network and vesicles in the cell periphery. Therefore, TF FOS with the NAA16-induced acetylation and USP31-induced ubiquitination positively regulates gene MMP12 to result in the degradation of extracellular matrix (ECM). TF JUN with the PPP3CA-induced phosphorylation and USP34-induced methylation positively regulates gene PPARD to produce ROSs to eliminate C. albicans WO-1. Moreover, receptor HSP90B1 affected by the GAPDHP24-induced phosphorylation binds to proteins TMEM205, which is related to chemotherapeutic agent, to trigger TF JUN. Previous studies indicate that after accepting chemotherapeutic, C. albicans WO-1 will invade host cell again and recover itself in immune-compromised patients [44]. In addition, the receptor ERBB2 triggers TF GATA1 through a sequence of signaling proteins RER1 to regulate ER protein, HIST1H4B which is influenced by the UBE2J2-induced ubiquitination and SSH2-induced phosphorylation-involved histone binding, SSR4, GRB2 which could influence cell death, UCN2, SH2D1A which is related to immune regulatory interactions, RAB12 involved in autophagy modulation, and EED. Finally, comparing to C. albicans SC5314, the receptor EGFR with the NAALADL2-induced acetylation and OTUD3-induced ubiquitination then activates downstream protein AVEN to modulate TF YBX1. The TF YBX1, which is influenced by the OTUD3-induced ubiquitination and TPMTP2-induced methylation, positively regulates innate immune-related gene UCN2. TF GATA1 negatively regulates gene AVEN to eliminate C. albicans WO-1 by autophagy. However, MiR-30B represses gene AVEN to reduce autophagy function. Then, due to gene MMP12 being regulated to result in ECM degradation such as collagen, vitronectin, laminin and fibronectin, C. albicans WO-1 starves to nutrient or carbon sources.
In conclusion, C. albicans SC5314 and WO-1 infect host cell at the beginning, then immune system of host cell can defense it. Although hyphae of C. albicans can destruct macrophage, host cells combine other cellular mechanisms to resist it such as autophagy, ROS production and immune response. At this moment, C. albicans SC5314 and WO-1 could be considered as a commensal pathogen of adhesion stage at the host cell. Furthermore, in different strains, gene expressions of MMP12 and PPARD under the infection of C. albicans SC5314 are higher than C. albicans WO-1. By contrast, gene expression of AVEN under the infection of C. albicans SC5314 is lower than C. albicans WO-1. From experiment data, gene PPARD has a significant change of expression in the infection progression of both strains. As a result, the degradation intensity of C. albicans SC5314 is stronger than that of C. albicans WO-1. Relatively, ROS production in the infection of C. albicans SC5314 is also more powerful than C. albicans WO-1. Hence, C. albicans WO-1 is easily eliminated by regulating the expressing of host gene AVEN. In addition, the expression levels of CDH1 and ERBB2 under the infection of C. albicans SC5314 are higher than C. albicans WO-1. In this situation, the infection of C. albicans SC5314 has induced a relatively strong endocytosis and epigenetic modification. For the infection of C. albicans WO-1, it will bring about more protein folding of host cell by higher expression of HSP90B1 with phosphorylation. Namely, compared to C. albicans SC5314, host cell with infection of C. albicans WO-1 leads to a more misfolded protein formation.

OKF6/TERT-2 Cell Confronts Different Strains of C. albicans by Strong ROS and Microenvironment Response
As shown in Figure 4A, due to the degradation of extracellular matrix, C. albicans proceeds to invasion stage, and host cell membrane begins to be destructed by the hyphal of C. albicans SC5314. More and more ROSs are generated by host cell. In Figure 4A, pathogen cell surface protein orf19.1816 (ALS3) binds to the host cell receptor EGFR so that the receptor EGFR will be degraded and induced in the ROS production of the host cell. After receiving the corresponding signal, the receptor EGFR triggers TFs NFKB1 (subunit of NF-κB) and GATA1 to regulate biological oxidations and hydrolase activity through a sequence of signaling proteins BPHL, UBC with the OTUB1-induced ubiquitination and INPP4B-induced phosphorylation, LIPE involved in lipid metabolism and bound to DHX9 and MAPK1 (also known as p38, ERK, PRKM1), DHX9 which could mediate TLR4 signal and NF-kB activation, MAPK1 which participates with EGFR-related signaling pathway to regulate cell survival and differentiation, UCN2 mentioned above, SH2D1A referred to previous section, RAB12 mentioned above, EED and C18orf18. The TF GATA1 positively regulates an immune-related gene IL20 to recruit more macrophage and pro-inflammatory cytokine. Another TF NFKB1 negatively regulates fungus infection-related gene DEFB4A. Since C. albicans is a Gram-positive fungus, gene DEFB4A negatively regulated by TF NFKB1 results in resisting pathogen invasion and inflammation. In addition, some of C. albicans SC5314 still stay at cell surface because they would like to form colony morphology by hyphal growth and yeast cell. However, pathogen membrane protein orf19.1059 (HHF1), which is related to colony morphology, interacts with the host cell receptor IL15RA. After the receptor receives the signaling, IL15RA binds to MYC. MYC with NAALADL1-induced acetylation and PDPR-induced phosphorylation activates downstream signaling protein AVEN for triggering the TF ETS1. The TF ETS1 positively regulates genes BPHL mentioned previously and CCDC22 involved in the activation of pro-inflammatory NF-κB signaling. Therefore, genes CCDC22 and DEFB4A will generate inflammatory in response to the regulation of TFs ETS1 and NFKB1, respectively. It is noted that inflammatory response in the invasion phase will not be over reaction due to the repression of miR1979-2 on gene CCDC22.
As shown in Figure 4B, due to the degradation of extracellular matrix, C. albicans proceeds to the invasion stage, and the host cell membrane begins to be destructed by the hyphal of C. albicans WO-1. More and more ROSs are generated by the host cell. In Figure 4B, pathogen cell surface protein CAWG_2005 (ALS3) binds no longer to EGFR but to host cell receptor ERBB2 making receptor ERBB2 degrade and induce in ROS production. After receiving the corresponding signal, the receptor ERBB2 triggers TFs NFKB1 (subunit of NF-κB) and GATA1 through a sequence of signaling proteins, RER1, UBC with the NAALADL2-induced acetylation and PPP4R4-induced phosphorylation, BPHL regulating biological oxidations and hydrolase activity and the binding to GRB2, LIPE involving in lipid metabolism and binding to DHX9 and MAPK1 (also known as p38, ERK, PRKM1), DHX9 mediating TLR4 signal and NF-kB activation, MAPK1 participating in the EGFR-related signaling pathway and regulation of cell survival and differentiation, UCN2 accepting signals from GRB2 and MAPK1, SH2D1A referring to the previous section, RAB12 having been mentioned above, EED and C18orf18. Compared to C. albicans SC5314, receptor ERBB2 instead of receptor EGFR activates BPHL-related pathway to strengthen ROS production via UBC adjusted by epigenetic modifications.
The TF GATA1 positively regulates the immune-related gene IL20 to recruit more macrophage and pro-inflammatory cytokine. Another TF NFKB1 negatively regulates fungus infections-related gene DEFB4A. Since C. albicans are Gram-positive fungus, gene DEFB4A negatively regulated by TF NFKB1 can result in resisting pathogen invasion and inflammation finally. In addition, some of C. albicans WO-1 still stay cell surface because they would like to form colony morphology by hyphal growth. However, pathogen membrane protein CAWG_00969 (HHF1), which is related to colony morphology, interacts with the host cell receptor IL15RA. After the receptor receives the signaling, IL15RA binds to signaling protein MYC with the ELP6-induced acetylation and MBD5-induced methylation to activate downstream protein AVEN and interact with RAB12 for triggering TFs ETS1 and GATA1. TF ETS1 positively regulates gene BPHL which has mentioned previously and negatively regulates CCDC22, involved in the activation of pro-inflammatory NF-κB signaling. Therefore, genes CCDC22 and DEFB4A will generate inflammatory response through the regulation of TFs ETS1 and NFKB1, respectively. Nonetheless, the inflammatory response in this phase will not be over reaction due to the repression of miR210 on gene CCDC22. Compared to C. albicans SC5314, MYC regulated by methylation instead of phosphorylation can activate RAB12 for triggering downstream TFs. The red arrow lines represent transcriptional regulation; the grey solid lines signify the protein-protein interaction; the green dot lines indicate the protein translation; the blue lines with circle endpoint represent miRNA repression. The circles with purple frame and arrow lines represent the production activity and response of ROS. In (A), because more C. albicans SC5314 invade gradually, host cell continues ROS production and increases stress on C. albicans SC5314. Therefore, C. albicans SC5314 needs to execute DNA damage response and resist ROS. Next, C. albicans SC5314 performs hyphae growth function to form biofilm continually. Due to hyphal growth, host cells are oppressed to cause inflammation response and cellular stress. At this time, OKF6/TERT-2 cells stay at an unbalance status. In (B), because more C. albicans WO-1 invades gradually, host cell continues ROS production and increases stress. Therefore, C. albicans WO-1 need to execute DNA damage response and resist ROS. Next, C. albicans WO-1 performs hyphae growth function to form biofilm continually. Due to hyphal growth, host cells are oppressed to cause inflammation response and cellular stress. At this time, OKF6/TERT-2 cells stay at an unbalance status. Eventually, C. albicans WO-1 senses carbon dioxide to transform white cell.
Afterwards, pathogen starts to antagonize the immune system of the host cell. After orf19.1059 (HHF1) interacting with the host cell receptor IL15RA, it will trigger TF orf19.1623 (CAP1) through a sequence of downstream proteins, orf19.3954 (PSD2) with the orf19.1631-induced methylation, orf19.1631 (ERG6) affected by orf19.3964-induced methylation and bound to orf19.3964 directly, orf19.6082 influenced by orf19.169-induced (CHO2) methylation, orf19.3964 (ASH2) with the orf19.1631-induced methylation and orf19.705-induced acetylation, orf19.705 (GCN5) and orf19.5034 The red arrow lines represent transcriptional regulation; the grey solid lines signify the protein-protein interaction; the green dot lines indicate the protein translation; the blue lines with circle endpoint represent miRNA repression. The circles with purple frame and arrow lines represent the production activity and response of ROS. In (A), because more C. albicans SC5314 invade gradually, host cell continues ROS production and increases stress on C. albicans SC5314. Therefore, C. albicans SC5314 needs to execute DNA damage response and resist ROS. Next, C. albicans SC5314 performs hyphae growth function to form biofilm continually. Due to hyphal growth, host cells are oppressed to cause inflammation response and cellular stress. At this time, OKF6/TERT-2 cells stay at an unbalance status. In (B), because more C. albicans WO-1 invades gradually, host cell continues ROS production and increases stress. Therefore, C. albicans WO-1 need to execute DNA damage response and resist ROS. Next, C. albicans WO-1 performs hyphae growth function to form biofilm continually. Due to hyphal growth, host cells are oppressed to cause inflammation response and cellular stress. At this time, OKF6/TERT-2 cells stay at an unbalance status. Eventually, C. albicans WO-1 senses carbon dioxide to transform white cell.
In conclusion, in the invasion phase, the common pathogenic mechanism of C. albicans SC5314 and C. albicans WO-1 is hyphae growth and elongation to further form biofilm. In addition, C. albican SC5314 and C. albicans WO-1 can resist ROS and immune cells such as macrophages and neutrophils. However, host cell stays at an unstable condition between acidic and alkaline pH. Because of lipid metabolism and ROS response, lipid depletion causes CO 2 production. Additionally, overreaction of ROS also produces CO 2 and hydrogen ion generating acidic substance. With these materials, host cell becomes acidic gradually. Under this microenvironment, C. albicans WO-1 exploits these materials to switch white cell. However, acidic substance production represses hyphal growth rather than to activate. Overall, host cell has a need to be balance between acidic and neutral pH. Along with more and more hyphae to oppress the host cell and penetrate cytoplasm, host cell generates inflammatory response gradually to recruit more cytokines to eliminate C. albican. At this moment, C. albicans still would be eliminated because of immature biofilm. Moreover, according to the experimental data, gene expressions of BPHL and DEFB4A of C. albicans SC5314 are higher than C. albicans WO-1. By contrast, gene expression of IL20 of C. albicans SC5314 is lower than C. albicans WO-1. In addition, gene IL20 has a significant change of expression in the initial infection. Therefore, the innate immune response also plays an important role in this stage so that C. albicans has to be perished. However, under C. albicans SC5314, host cell is easier to resist pathogen by the host gene expression. On the contrary, C. albicans WO-1 is difficult to be antagonized so that it will invade host cell rapidly. Additionally, the expression of IL15RA in the C. albicans WO-1 condition is higher than C. albicans SC5314 from the experimental data. It suggests that C. albicans WO-1 simply binds to IL15RA so that C. albicans WO-1 quickly forms cell colony. As a result, C. albicans WO-1 is not easier to be wiped out. Although host cell is anxious to rub out C. albicans, it will stay at an unbalance state such as pH changing and CO 2 production. Hence, pathogen protein orf19.3111 (PRA1) will induce hyphal growth even in different strains. From the experimented data, orf19.3111 (PRA1) has a significant change of expression in the infection.

Released Pathogenic Factor and Accumulated Cellular Response Result in Apoptosis and Inflammatory Response Further Leading to Necrosis
As shown in Figure 5A, due to numerous invasions of C. albicans SC5314 and over ROS production, cellular stress becomes larger gradually. Because of overfull ROS, host cell leads to producing a more inflammatory response. From pathogen cell surface protein orf19.1816 (ALS3) binding to these host receptors EGFR, TJAP1 and HSP90B1, these receptors will finally cause apoptosis and inflammatory response. Firstly, the receptor EGFR triggers TF ETS1 via signaling protein AVEN which is modulated by MYC with the NAALADL1-induced acetylation and PDPR-induced phosphorylation. TF ETS1 positively regulates gene CCDC22. Secondly, the receptor TJAP1 (also knowns as TJP4) activates TFs ETS1, JUN and GATA1 through a sequence of signaling proteins, MAPK6 (also known as ERK3), TNFAIP8L1, MRPL50, which is related to the maintenance of cell organelle, MYC, which is affected by the NAALADL1-induced acetylation and PDPR-induced phosphorylation and can activate downstream three proteins such as VCAM1, AVEN and GPR89A interacting with VCAM1 and reducing intracellular pH, HMGN1P4 with the KDM4A-induced methylation, MAPK14, CTSH, and EED. The TF JUN with the USP12PX-induced ubiquitination and PRMT1-induced methylation negatively regulates gene PPARD to influence ROS production and apoptosis and gene MMP12 to influence ECM degradation. The TF GATA1 positively regulates the inflammation-related gene TNFAIP8L1 and apoptosis-related gene AVEN, respectively. Eventually, the receptor HSP90B1 with USP11-induced ubiquitination interacts with TF FOXA1 directly. The TF FOXA1 negatively regulates apoptosis-related gene SERPINF1. In addition, another pathway also activates TF GATA1 because of ARRB2. When the pathogen membrane protein orf19.578 binds to the receptor ARRB2, which is affected by the HDAC1-induced acetylation and HERC5-induced ubiquitination, ARRB2 then activates TF GATA1 via the signaling pathway including SSR4, GRB2 which could induce cell death, UCN2, SH2D1A, RAB12 and VCAM1-activating EED. Not only these receptors but also complement system of receptor C3 (also known as C3a and C3b) will induce inflammatory response. Since C. albicans SC5314 releases proteases orf19.5542 (SAP6) and orf19.5585 (SAP5), C3 will be degraded by orf19.5542 (SAP6) and trigger IL1B (also known as IL-1β). IL1B is affected by the PPP2CA-induced phosphorylation and bound to both orf19.5542 (SAP6) and orf19.5585 (SAP5) directly resulting in inflammatory response and cell apoptosis.  Figure 5. C. albicans could release pathogenic factor and the accumulated cellular stress in host cell could result in apoptosis and inflammatory response leading to necrosis. The red arrow lines represent transcriptional regulation; the grey solid lines signify the protein-protein interaction; the green dot lines indicate the protein translation; the blue lines with circle endpoint represent miRNA repression. In (A), C. albicans SC5314 brings about numerous invasions, and host cells generate more and more inflammatory response via hyphal elongation. Following this, hyphae generate oppression on host cell so that host cell triggers apoptosis function of host cell. C. albicans SC5314 releases pathogenic factor to further involve in inflammatory response and apoptosis. Due to ER stress on the whole host cell, it influences the risk of the survival of host cell. Nonetheless, C. albicans SC5314 will form biofilm easily and enforce sexual reproduction so that it will output more yeast cells in the host cell. Next, yeast cells can colonize or invade other host cells. In (B), C. albicans WO-1 brings about numerous invasions, and host cells generate more and more inflammatory response via hyphal elongation. Following this, hyphae generate oppression on host cell so that host cell triggers apoptosis function of the host cell. C. albicans releases pathogenic factor to further involve in inflammatory Figure 5. C. albicans could release pathogenic factor and the accumulated cellular stress in host cell could result in apoptosis and inflammatory response leading to necrosis. The red arrow lines represent transcriptional regulation; the grey solid lines signify the protein-protein interaction; the green dot lines indicate the protein translation; the blue lines with circle endpoint represent miRNA repression. In (A), C. albicans SC5314 brings about numerous invasions, and host cells generate more and more inflammatory response via hyphal elongation. Following this, hyphae generate oppression on host cell so that host cell triggers apoptosis function of host cell. C. albicans SC5314 releases pathogenic factor to further involve in inflammatory response and apoptosis. Due to ER stress on the whole host cell, it influences the risk of the survival of host cell. Nonetheless, C. albicans SC5314 will form biofilm easily and enforce sexual reproduction so that it will output more yeast cells in the host cell. Next, yeast cells can colonize or invade other host cells. In (B), C. albicans WO-1 brings about numerous invasions, and host cells generate more and more inflammatory response via hyphal elongation. Following this, hyphae generate oppression on host cell so that host cell triggers apoptosis function of the host cell. C. albicans releases pathogenic factor to further involve in inflammatory response and apoptosis. Due to ER stress on the whole host cell, it influences the risk of the survival of host cell. Nonetheless, C. albicans WO-1 will form biofilm easily and enforce sexual reproduction so that it will output more yeast cells in the host cell. Next, yeast cells can colonize or invade other host cells. Finally, the whole microenvironment can produce more another cell type such as opaque cell because the whole host cell stays at anaerobic or acidic condition via the last response. Therefore, C. albicans WO-1 yields more opaque cells.
Through these cellular functions especially biofilm formation, C. albicans SC5314 finally causes apoptosis and cell necrosis. By the hyphal of C. albicans SC5314, it will actively penetrate and destroy mitochondrial or nuclear. Following the process, hyphae will cross each other and then form biofilm. No matter where biofilm forms outside or inside the host cell membrane, biofilm will lead to cell necrosis and stress response because host cell covered by C. albicans SC5314 stays at the hypoxia and acidic condition. Therefore, C. albicans SC5314 prefers sexual reproduction and not be subject to drug control. Previous study mentioned that biofilm of C. albicans could antagonize drug so that it cannot be eliminated and yields drug resistance [46]. Similarly, ROS production will not eliminate C. albicans because previous study indicated that biofilm formation could against it easily [47]. Therefore, oxidative stress via generating ROS will be harmful to the host cell. Not only producing anti-drug pathogen but also switching white cell to opaque cell generate more yeast cell by forming biofilm. Consequently, the host cell accumulates much cell stress causing cell apoptosis. Moreover, due to destruction of the host cell, inflammatory response will cause necrosis by hyphae.
As shown in Figure 5B, under numerous invasions of C. albicans WO-1 and over ROS production, cellular stress becomes larger gradually. Because of the overfull ROS, host cell leads to producing a more inflammatory response. As the pathogen cell surface protein CAWG_02005 (ALS3) binds to host cell membrane receptors EGFR, TJAP1, HSP90B1 and ARRB2, these host receptors will finally cause apoptosis and inflammatory response. Firstly, receptor EGFR with the NAALADL2-induced acetylation and OTUD3-induced ubiquitination triggers TF ETS1 via signaling protein AVEN that is also modulated by MYC, which is influenced by the ELP6-induced acetylation and MBD5-induced methylation. TF ETS1 negatively regulates genes CCDC22 and HSP90B1. By contrast, receptor HSP90B1 will bring about folding protein and immune response under the C. albicans WO-1 infection. Secondly, the receptor TJAP1 (also knowns as TJP4) activates TF FOXA1 through a sequence of signaling proteins, MAPK6 (also known as ERK3), TNFAIP8L1, and MRPL50 related to the maintenance of cell organelle. The TF FOXA1 positively regulates the autophagy-related gene VMP1. However, compared to C. albicans SC5314, autophagy function will increase cellular stress at this phage. Eventually, the receptor HSP90B1, which is affected by the GAPDHP24-induced phosphorylation, triggers TF JUN through the signaling protein TMEM205. The TF JUN with the PPP3CA-induced phosphorylation and USP34-induced methylation positively regulates the apoptosis-related gene PPARD. Finally, as CAWG_04469 (orf19.578) binds to the receptor ARRB2, ARRB2 with the HGSNAT-induced acetylation and USP41-induced ubiquitination can activate TF GATA1 through proteins RAB12 involved in autophagy modulation, and EED in the signal pathway. Compared to C. albicans SC5314, receptor ARRB2 triggers TF GATA1 with less signaling proteins since it will be influenced by stronger epigenetic modification. The TF GATA1 negatively regulates the apoptosis-related gene AVEN and inflammatory-related gene TNFAIP8L1, respectively. In addition, another pathway also activates TFs JUN, FOXA1 and GATA1. MYC with epigenetic modifications activates TFs JUN, FOXA1 and GATA1 through signaling proteins GPR89A activating TFs FOXA1 and connecting to VCAM1, which activates TF GATA1 via binding EED, HMGN1P4 with the USP25-induced ubiquitination, MAPK14 and CTSH. In Figure 5B, MYC with methylation activates to interact with RAB12 under the infection of C. albicans WO-1 which is different from C. albicans SC5314. Not only these receptors but also complement system of receptor C3 (also known as C3a and C3b) will induce an inflammatory response. Since C. albicans WO-1 releases proteases CAWG_05066 (SAP5) and CAWG_05098 (SAP6), C3 will be degraded by CAWG_05098 (SAP6) and triggers IL1B (also known as IL-1β). IL1B is influenced by the PPP1R15A-induced phosphorylation and bound to both CAWG_05066 (SAP5) and CAWG_05098 (SAP6) resulting in inflammatory response and cell apoptosis.
Afterwards, as a result of over inflammatory reaction and cellular stress, miRNAs have to repress inflammatory response. The only difference about miRNAs in the infection of C. albicans WO-1 is characterized by mir210 silencing CCDC22. At this moment, host cell decreases apoptosis but C. albicans WO-1 invades continuously. CAWG_02005 (ALS3) binds to TF CAWG_02083 (EFG1) directly so that C. albicans WO-1 carries out cellular functions quickly such as biofilm formation and endocytosis. The TF CAWG_02083 (EFG1) positively regulates the endocytosis-related gene CAWG_02005 (ALS3), biofilm-related gene CAWG_01979 (HHF22), and white-to-opaque switch main gene CAWG_00418 (WOR1). Additionally, receptor CAWG_03451 (HWP1) modulates TFs CAWG_00682 (CPH1), CAWG_02083 (EFG1) and CAWG_02766 (TEC1), which are mediated by proteins CAWG_04849 (SFL2) and CAWG_01914 (SFL1). Then, TF CAWG_00682 (CPH1) positively regulates the hyphal growth-related gene CAWG_01560 (RSR1), hydrolytic activity and biofilm-related genes CAWG_05098 (SAP6) and CAWG_05066 (SAP5) but negatively regulates the endocytosis-related gene CAWG_04469 (orf19.578). By contrast, TF CAWG_00682 (CPH1) does not regulate the DNA damage-related gene CAWG_00299 (orf19.666) under the infection of C. albicans WO-1. We infer that the reason why CAWG_00682 (CPH1) being inactivated is indirectly caused by cell cycle proteins such as CAWG_05375 (FAR1) and CAWG_03794 (NDT80). Since these proteins may execute functions in the white cell, C. albicans WO-1 transforms opaque cells to further reduce protein expression. The TF CAWG_02766 (TEC1) positively regulates the DNA damage-related gene CAWG_00299 (orf19.666) continuously. Compared to C. albicans SC5314, TF CAWG_02766 (TEC1) does not regulate gene CAWG_1979 (HHF12). Perhaps, white cells of C. albicans WO-1 mostly transform to opaque cells so that TF CAWG_02766 (TEC1) could reduce the modulation of CAWG_1979 (HHF12). Hence, C. albicans WO-1 can invade or damage host cell by different types of C. albicans largely. However, compared to C. albicans SC5314, CAWG_04469 (orf19.578) does not trigger any TF. It strengthens downstream proteins to execute hyphal growth function indirectly. By interacting with CAWG_04469 (orf19.578), CAWG_03378 (SEC15) enhances the expression of gene CAWG_01560 (RSR1). CAWG_00581 (CDC42) via CAWG_04469 (orf19.578) and CAWG_05375 (FAR1) also increases the gene expression of CAWG_01560 (RSR1) to elongate and grow hyphae. Based on these cellular functions especially biofilm formation, C. albicans WO-1 finally causes apoptosis and cell necrosis. By the hyphae form of C. albicans WO-1, it will actively penetrate and destroy mitochondria or nuclear of the host cell. Following infection, hyphae will cross-talk each other and then form biofilm. No matter whether biofilm forms outside or inside the host cell, biofilm will cause cell necrosis because host cell covered by C. albicans WO-1 stays at the hypoxia and acidic condition. According to the research in Reference [3], C. albicans WO-1 could transform white cell to opaque easily. Therefore, compared to C. albicans SC5314, C. albicans WO-1 prefers sexual reproduction and will not be an easy subject of drug control. A previous study mentioned that the biofilm of C. albicans can antagonize drug so that it does not be eliminated and yields drug resistance [46]. Likely, mature biofilm will against ROS so that C. albicans can survive [47]. Therefore, oxidative stress via generating ROS will be harmful to the host cell. Not only producing anti-drug pathogen but also switching white cell to opaque cell can generate different cell types simply. In addition, while host membrane is covered by the biofilm formation of C. albicans, more cell stress and inflammatory response emerges, caused by pathogenic factor and hyphae. Ultimately, the host cell accumulates much cell stress resulting in cell apoptosis. Moreover, due to damage to the host cell, inflammatory response will cause necrosis by hyphae.
To sum up, in the host cell damage phase, C. albicans SC5314 and C. albicans WO-1 all ultimately lead to cell apoptosis and necrosis. C. albicans SC5314 is still hard to switch from white cell to opaque cell. Nevertheless, C. albicans WO-1 switches white cell to opaque cell quickly. Promoted by the microenvironment continuously, C. albicans SC5314 still has opportunity to switch to opaque cell. Moreover, C. albicans WO-1 in the opaque cell can run sexual reproduction so that it will strengthen genetic diversity and own stability. The stability of C. albicans WO-1 in infection progression could be described in Tables 1 and 2. Moreover, according to the experimental data, gene expressions of AVEN and TNFAIP8L1 under the infection of C. albicans SC5314 are lower than C. albicans WO-1. By contrast, the receptor expression of EGFR under the infection of C. albicans SC5314 is higher than C. albicans WO-1. In addition, protein expressions of MAPK6 and MAPK14 in C. albicans SC5314 are higher than C. albicans WO-1. Therefore, the inflammatory response and apoptosis caused by C. albicans WO-1 are stronger than C. albicans SC5314. However, under the infection of C. albicans SC5314, host cell is easier to resist pathogen by combining with the expression of the above mentioned proteins. On the contrary, C. albicans WO-1 is hard to eliminate, such that host cell needs more immune or inflammatory reaction. Furthermore, different strains of C. albicans could induce inflammation by releasing the pathogenic factor. According to the experimental data, orf19.5542 (SAP6) has a significant change of expression in the infection progression at different strains. Pathogen protein orf19.5542 (SAP6) is along with the expression of receptor C3 to induce inflammatory response indirectly. It is noted that the expression level of receptor C3 affected by C. albicans WO-1 is higher than C. albicans SC5314. Finally, it can be considered that C. albicans WO-1 will generate a stronger inflammatory response and apoptosis.
Based on the discussion above, we summarize the differences in genetic and epigenetic pathogenic mechanisms during infection progression between two strains of C. albicans in Figure 6. Compared to C. albicans WO-1, C. albicans SC5314 holds the following differences for the infection mechanisms: (1) TF orf19.4433 (CPH1) regulates gene orf19.666 to cause DNA damage response; (2) TF Jun with methylation and ubiquitination regulates gene MMP12 to bring about ECM degradation; (3) TF FOS only with ubiquitination regulates gene MMP12 to have ECM degradation; (4) TF FOXA1 regulates gene SERPINF1 instead of VMP1 to result in apoptosis; (5) miR1979-2 rather than miR210 represses gene CCDC22 to do inflammatory response. Compared to C. albicans SC5314, C. albicans WO-1 takes the following differences for the infection mechanisms: (1) TF CAWG_02083 (EFG1) regulates gene CAWG_00418 (WOR1) to cause a white-opaque switch; (2) FOS with ubiquitination and methylation regulates gene MMP12 to contribute to ECM degradation.

Prediction of Drug Target Proteins and the Multiple-Molecule Drug Design for the Infection of Different Strains of C. albicans
Recently, the major drugs employed to treat C. albicans infection include Amphotericin B, Fluconazole and Caspofungin [48]. However, C. albicans will generate drug resistance by forming biofilm. In addition, C. albicans was kept balance between host defense and fungus. As long as the balance was destroyed, C. albicans will invade host cell and finally produce biofilm. Following this, C. albicans will cause diseases such as thrush and denture-associated erythematous. It can be seen that the treatment of C. albicans has to prevent hyphae growth and biofilm production. Nonetheless, C. albicans easily leads to re-infection after accepting treatment and staying immunocompromised on chemotherapy and HIV-infected patients. Furthermore, current treatments and drugs such as Amphotericin B, Fluconazole and Caspofungin have side effects. Therefore, there is a need to find other drugs to reduce re-infection and side effects. In the meantime, since C. albicans WO-1 has high frequency in white-opaque switching, we have to find potential drug targets for the treatment of C. albicans WO-1.  Figure 6. Summarizing the common and specific epigenetic and genetic pathogenic mechanisms in the infection of different strains of C. albicans. The figure summarizes the common and specific genetic and epigenetic pathogenic mechanisms in different strains of C. albicans. The green rectangular block denotes the differential regulations and functions between different strains of C. albicans.

Prediction of Drug Target Proteins and the Multiple-Molecule Drug Design for the Infection of Different Strains of C. albicans
Recently, the major drugs employed to treat C. albicans infection include Amphotericin B, Fluconazole and Caspofungin [48]. However, C. albicans will generate drug resistance by forming biofilm. In addition, C. albicans was kept balance between host defense and fungus. As long as the balance was destroyed, C. albicans will invade host cell and finally produce biofilm. Following this, C. albicans will cause diseases such as thrush and denture-associated erythematous. It can be seen that the treatment of C. albicans has to prevent hyphae growth and biofilm production. Nonetheless, C. albicans easily leads to re-infection after accepting treatment and staying immunocompromised on chemotherapy and HIV-infected patients. Furthermore, current treatments and drugs such as Amphotericin B, Fluconazole and Caspofungin have side effects. Therefore, there is a need to find other drugs to reduce re-infection and side effects. In the meantime, since C. albicans WO-1 has high According to our result, the infection of different strains of C. albicans with OKF6/TERT-2 cell could be used to investigate common pathogenic mechanism to predict drug targets for the design of multiple-molecule drug. We consider the important roles of orf19.1816 (CAWG_02005 in C. albicans WO-1 i.e., ALS3), orf19.610 (CAWG_02083 in C. albicans WO-1 i.e., EFG1), orf19.1321 (CAWG_03451 in C. albicans WO-1 i.e., HWP1), orf19.4433 (CAWG_00682 in C. albicans WO-1 i.e., CPH1), orf19.1623 (CAWG_02548 in C. albicans WO-1 i.e., CAP1) and orf19.723 (CAWG_01948 in C. albicans WO-1 i.e., BCR1). In these pathogen proteins, their functions include hyphae growth, endocytosis, and biofilm formation. Therefore, we can see that these pathogen TFs and pathogen cell surface proteins play a very important role in the pathogenic mechanism during the infection of C. albicans. Therefore, Amphotericin B, Fluconazole and Caspofungin mentioned above are still feasible drug treatments. Next, we will find other pathogen proteins as drug targets to discover new drugs according to their pathogen functions and roles in the pathogenic mechanism of C. albicans infection. Based on our results, the following proteins play important roles in the hyphal growth and biofilm formation: orf19.2614 (CAWG_01560 in C. albicans WO-1 i.e., RSR1), orf19.7292 (CAWG_02173 in C. albicans WO-1 i.e., ARP2), orf19.4519 (CAWG_04191 in C. albicans WO-1 i.e., SUV3), orf19.1854 (CAWG_01979 in C. albicans WO-1 i.e., HHF22), orf19.5542 (CAWG_05098 in C. albicans WO-1 i.e., SAP6) and orf19.5585 (CAWG_05066 in C. albicans WO-1 i.e., SAP5). Moreover, we also investigate other proteins involved in defense mechanism such as ROS response. Pathogen proteins about anti-ROS are orf19.1623 (CAP1), orf19.5034 (CAWG_00057 in C. albicans WO-1 i.e., YBP1) and orf19.3292 (CAWG_01270 in C. albicans WO-1). However, C. albicans in the infection progression exploits pathogen proteins such as orf19.7247 (CAWG_00020 in C. albicans WO-1 i.e., Rim101) which could exploit orf19.5585 (SAP5) for the degradation of host cell surface proteins. Eventually, the following pathogen proteins interact simultaneously with many proteins involving morphological transformation such as GTPase activity and the influence of microenvironment so that they can be considered as significant proteins to trigger TFs indirectly such as orf19.2087 (CAWG_03824 in C. albicans WO-1 i.e., SAS2), orf19.666 (CAWG_00299 in C. albicans WO-1), orf19.1093 (CAWG_04944 in C. albicans WO-1 i.e., FLO8) and orf19.939 (CAWG_04444 in C. albicans WO-1 i.e., NAM7). All the pathogen proteins mentioned above could be considered as potential common drug targets for therapeutic treatment of the infection of different strains of C. albicans.
After suggesting these common-molecule drug targets, we explored drug databases and did literature reviews to design a multiple-molecule drug targeting different strains of C. albicans simultaneously. One study has demonstrated that orf19.7247 (RIM101) could induce hyphae growth and degradation of host cell receptor [49]. The other study showed that drug targets of C. albicans were inferred by sequence homolog between C. albicans and S. cerevisiae. Here, we can find five drugs to eliminate C. albicans for new potential therapy as follows. First, Tunicamycin could repress a pathogen protein orf19.7247 functioning to reduce the hyphae growth induction and coordination of pathogen proteins for the degradation of host cell protein CDH1 [50]. Second, research shows that Terbinafine can inhibit the activity of orf19.5034 and its anti-ROS ability toward the stability of pathogen TF orf19.1623 (CAP1) to eliminate pathogen initially via the ROS production [25,50]. Moreover, Terbinafine also represses the activity of orf19.1854 (HHF22) which is related to hyphae growth and biofilm formation functions. However, Terbinafine directly or indirectly influences orf19.939 (NAM7) and orf19.2087 (SAS2) so that orf19.939 and orf19.2087 could reduce the chance of triggering TFs [50]. Third, Cerulenin affects the expression levels of pathogen proteins orf19.939 and orf19.4519 (SUV3) to decrease the biofilm formation [50]. Fourth, Tetracycline could inhibit orf19.5585 (SAP5) and orf19.5542 (SAP6) so that C. albicans would not form biofilm to release pathogenic factor [51]. Perhaps, aspartic proteinase inhibitors could be also employed for orf19.5585 and orf19.5542 [52]. Ciclopirox olamine is also broad-spectrum antibiotics to target orf19.939, orf19.1321 (HWP1), orf19.5585, and orf19.5542 [53]. Last, Tetrandrine can play an important roles in inhibiting orf19.1816 (ALS3), orf19.610 (EFG1) and orf19.5908 (TEC1) to reduce the regulatory ability of pathogen TFs [54]. Nevertheless, previous study showed a prolonged use of broad-spectrum antibiotics could lead to an impaired immune response [55]. Therefore, we do not consider the broad-spectrum antibiotics because of an immunocompromised response causing a re-infection. Other pathogen proteins are also applied to azole compounds drug, especially Fluconazole. Nonetheless, orf19.1623 (CAP1), orf19.390 (CDC42) and orf19.578 are important human-homologs of CAPZA1, CDC42 and GRTP1, respectively. The repression of these proteins may cause unpredictable dysfunction of host cell, especially the GTPase activity and actin growth. Therefore, orf19.5034 (YBP1) is considered as a better drug target instead of orf19.1623(CAP1). Eventually, Tunicamycin, Terbinafine, Cerulenin, Tetracycline and Tetrandrine combine with the three known drugs including Fluconazole, Amphotericin B, and Caspofungin as a potential multiple-molecule drug in Figure 7. This provides an alternative way for the therapeutic treatment of two strains of C. albicans based on the predicted drug targets.
because of an immunocompromised response causing a re-infection. Other pathogen proteins are also applied to azole compounds drug, especially Fluconazole. Nonetheless, orf19.1623 (CAP1), orf19.390 (CDC42) and orf19.578 are important human-homologs of CAPZA1, CDC42 and GRTP1, respectively. The repression of these proteins may cause unpredictable dysfunction of host cell, especially the GTPase activity and actin growth. Therefore, orf19.5034 (YBP1) is considered as a better drug target instead of orf19.1623(CAP1). Eventually, Tunicamycin, Terbinafine, Cerulenin, Tetracycline and Tetrandrine combine with the three known drugs including Fluconazole, Amphotericin B, and Caspofungin as a potential multiple-molecule drug in Figure 7. This provides an alternative way for the therapeutic treatment of two strains of C. albicans based on the predicted drug targets.  . Tetracycline can inhibit orf19.5585 (SAP5) and orf19.5542 (SAP6) of C. albicans from forming biofilm and releasing pathogenic factor. Cerulenin can affect the expression level of pathogen proteins orf19.939 (NAM7) and orf19.4519 (SUV3). Tunicamycin can repress pathogen orf19.7247 (RIM101) to reduce its ability of coordinating pathogen proteins for the degradation of host cell protein CDH1. Tetrandrine can inhibit orf19.1816 (ALS3), orf19.610 (EFG1) and orf19.5908 (TEC1) to reduce the ability of regulation functions of pathogen TFs. Moreover, other pathogen proteins are also applied to kinds of azole. So, these drugs are combined as multiple-molecule drugs to perish both strains of C. albicans simultaneously.
orf19.2614 (RSR1) and orf19.7292 (ARP2) are involved in hyphae growth and considered as important virulence factors. Moreover, orf19.666 participates in many pathogen protein interactions and DNA responses. Until now, we cannot find drugs for them. Additionally, no drug is explored for pathogen protein orf19.4884 (WOR1) regulating white-opaque switch at C. albicans WO-1. Therefore, we recommend that these pathogen proteins are potential drug targets for further drug design. In conclusion, our results show that orf19.2614 (RSR1), orf19.666, orf19.7292 (ARP2) and orf19.4884 (WOR1) will be significant drug targets for the design of a new common multiple-molecule drug to efficiently eliminate both C. albicans SC5314 and C. albicans WO-1.

Conclusions
The pathogenic mechanisms involved in C. albicans infection and resistance mechanism from host cells are complicated. For host cells, the hyphal growth and hydrolase-triggering virulence factors have been extensively investigated. However, few studies concentrate on the cross-talk mechanism between the human cell and C. albicans. In this study, based on big data mining, system identification, system order detection, and principal network projection methods, we investigated genetic and epigenetic interspecies networks between host OKF6/TERT-2 cells and C. albicans by two-sided NGS data during C. albicans infection. Moreover, our results distinguished the common and specific pathogenic mechanism caused by different signaling pathways to show pathogen offense and host defense in systematic viewpoint for C. albicans SC5314 and WO-1. The infection progression associated with essential epigenetic modifications in two strains of C. albicans such as ARRB2, CDH1, HSP90B1, EGFR and ERBB2, and host miRNA repressing on pathogens genes are mostly identified in core HPCNs by our systems biology approach. Furthermore, core HPCNs which obtain from applying PNP method to real GEINs are all in respect of KEGG pathways. These results indicate that epigenetic modification plays a significant role in the pathogenic mechanism for C. albicans SC5314 and WO-1. In the past, there are few studies discussing about the defensive mechanism in the human cell infected by C. albicans and the offensive mechanism of C. albicans in the viewpoint of pathogen epigenetic modification. A Previous study discovered epigenetic modification of C. albicans which did not infect the host cell [56]. Additionally, only few studies demonstrated how C. albicans interacts with host cell surface proteins without indicating downstream signaling proteins and target genes to generate the corresponding cellular responses. Our results show that epigenetic regulations play an important role in the common and specific pathogenic mechanism of C. albicans, especially C. albicans WO-1, transforming cell type rapidly during the infection process. This will offer a new direction for drug targets and systems drug designs. At the same time, with the recognition of core HPCNs, our understandings toward C. albicans infection will increase benefiting for drug discovery. In the future, the four pathogen proteins orf19.2614 (RSR1), orf19.666, orf19.7292 (ARP2), and orf19.4884 (WOR1) could be potential drug targets for the design of a new common multiple-molecule drug to eliminate both C. albicans SC5314 and C. albicans WO-1 efficiently.

5.1.
Overview of the Construction of GEINs and Core HPCNs in OKF6/TERT-2 Cells Line during the Infection of C. albicans SC5314 and C. albicans WO-1 To investigate the common and specific pathogenic mechanisms between C. albicans SC5314 and C. albicans WO-1 during the infection of human oral epithelial OKF6/TERT-2 cells, we identified the cross-talk GEINs and extracted core HPCNs between human and C. albicans SC5314 as well as C. albicans WO-1, respectively. In Figure 8, a flowchart is given to construct core HPCNs in the state of infection by C. albicans SC5314 and C. albicans WO-1 via big data mining, dynamic model construction and network identification for investigating pathogenic mechanisms and inferring potential drug targets.  Figure 8. The flow chart of the systems biology method applied to construct genetic and epigenetic interspecies networks (GEINs) for extracting core HPCNs to discover the common and specific pathogenic mechanisms during the infection of C. albicans SC5314 and C. albicans WO-1 for drug targets and potential common-molecule drugs. The grey blocks indicate the big data for constructing candidate GEIN and the blue blocks represent the input information including NGS data and the surveyed literature for drug design in this study; the blocks with grey frame denote systems biology approach utilized to construct real cross-talk GEINs in the infection of C. albicans SC5314 and C. albicans WO-1, and then extract the core HPCNs of two replicates via PNP method; the white rounded rectangular blocks are our corresponding results about a common multi-molecule drug.
In Figure 8, we construct GEINs and HPCNs under the following steps: (1) Big data mining and data preprocessing of host/pathogen gene/miRNA expression; (2) construction of candidate GEIN, which consists of candidate host/pathogen intra-species protein-protein interaction networks (PPINs), candidate inter-species PPIN between host and pathogen, candidate host/pathogen Figure 8. The flow chart of the systems biology method applied to construct genetic and epigenetic interspecies networks (GEINs) for extracting core HPCNs to discover the common and specific pathogenic mechanisms during the infection of C. albicans SC5314 and C. albicans WO-1 for drug targets and potential common-molecule drugs. The grey blocks indicate the big data for constructing candidate GEIN and the blue blocks represent the input information including NGS data and the surveyed literature for drug design in this study; the blocks with grey frame denote systems biology approach utilized to construct real cross-talk GEINs in the infection of C. albicans SC5314 and C. albicans WO-1, and then extract the core HPCNs of two replicates via PNP method; the white rounded rectangular blocks are our corresponding results about a common multi-molecule drug.
In Figure 8, we construct GEINs and HPCNs under the following steps: (1) Big data mining and data preprocessing of host/pathogen gene/miRNA expression; (2) construction of candidate GEIN, which consists of candidate host/pathogen intra-species protein-protein interaction networks (PPINs), candidate inter-species PPIN between host and pathogen, candidate host/pathogen gene/miRNA regulation networks (GRNs), candidate miRNA regulation networks of host-miRNAs on host/pathogen-genes and candidate lncRNAs regulation networks of host-lncRNAs on host-genes/pathogen-genes; (3) The network identifying process for detecting the real interspecies GEINs via system identification method and system order detection scheme, which is the Akaike's Information Criterion (AIC) to prune false positives in the candidate interspecies GEINs by using the two-sided genome-wide NGS data of OKF6/TERT-2 cells and C. albicans during infection (see Sections 1.2-1.4 in Supplementary Materials); (4) The extraction of the core HPCNs by applying the principal network projection (PNP) method on the real interspecies GEINs (see Section 1.5 in Supplementary Materials). Therefore, we can project core HPCNs to KEGG pathways to obtain core cross-talk pathways and compare them to investigate the crucial common and specific pathogenic mechanisms contributing to the infection progression of C. albicans SC5314 and C. albicans WO-1, respectively. Ultimately, we infer the common network biomarkers as the potential multiple drug targets for a common multiple-drug design.

NGS Data Preprocessing for Human and Pathogen
To identify the cross-talk activities between host and pathogen during the infection of C. albicans, two strains of C. albicans, SC5314 and WO-1, have to infect human cell, respectively. C. albicans WO-1 changes its own morphology easily and can transform white cell to opaque cell. Therefore, it is difficult to control and extremely sensitive to anaerobic environment. In the previous studies, the full catalog of transcriptional changes is not completely owing to the limitations of microarrays, which lead to a limited dynamic range and poor sensitivity to analyze low-abundance transcripts. Up to now, NGS data have been obtained from the research in Reference [57], which are used to investigate the transcription profiles of both immortalized oral epithelial cells (OKF6/TERT-2 cell line) and C. albicans. It is the only available dataset providing sufficient information for constructing the candidate host-pathogen GEIN.
The dual NGS data studied by Liu et al. has two parts which discuss about endothelial cells and epithelial cells, respectively [57]. Here, we focus on the epithelial cells (the OKF6/TERT-2 cell line) infected by yeast-phase organisms of two different clinical isolates of C. albicans, SC53144 and WO-1.
The first part of NGS data includes the mRNA/miRNA expression profiles of OKF6/TERT-2 cell line at 90, 300, 480 min post-infection with C. albicans SC5314 and WO-1. For OKF6/TERT-2 cells, the medium was cultured in Dulbecco's modified Eagle's medium (DMEM) without serum at 37 • C before the cell was infected. The second part includes the mRNA expression profiles of two biological replicates of C. albicans SC5314 and WO-1 in OKF6/TERT-2 cells at 90, 300, 480 min post-infection (GEO accession number GSE56093; https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi? acc=GSE56093). Moreover, the strain SC5314 including two replicates was originally recovered from a patient with generalized candidiasis. The other strain WO-1 including two replicates was isolated from the blood and lungs of a patient suffering from systemic candidiasis. RNA sequencing (RNA-Seq), next-generation sequencing (NGS) with llumina HiSeq 2000 platform, was used on host and two strains of C. albicans to provide valuable dataset for better understanding of common pathogenic mechanisms. Meanwhile, in order to avoid overfitting problem in constructing real host-pathogen GEINs which could be found in detail in Section 1.3 of the Supplementary Materials, we use cubic spline method for data interpolation within the experimental time intervals and data extrapolation after 8 h post-infection. Hence, we have two-sided NGS data from 0 min to 12 h.

Construction of Candidate GEINs by the Inference of Putative Interspecies and Intraspecies PPINs and
GRNs for C. albicans SC5314 and WO-1 The candidate GEINs are constructed as shown in Figure 8 through big data mining from numerous databases which contain many experimental data and bioinformatics predictions. The host candidate PPIN required of protein-protein interaction (PPI) information was obtained from MINT [58,59], DIP [60], BIND [61], IntAct [59] and BioGRID [62]. The host candidate GRN required of transcription factors (TFs)/lncRNAs and their downstream-regulated genes information was obtained Funding: