Next Article in Journal
A Comprehensive Review on Weight Loss Associated with Anti-Diabetic Medications
Previous Article in Journal
Androgen-Induced Cardiovascular Risk in Polycystic Ovary Syndrome: The Role of T Lymphocytes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells

1
Department of Mathematics, School of Sciences, Shanghai University, Shanghai 200444, China
2
School of Life Sciences, Shanghai University, Shanghai 200444, China
3
College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
4
Key Laboratory of Stem Cell Biology, Shanghai Jiao Tong University School of Medicine (SJTUSM) & Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS), Shanghai 200030, China
5
Department of Computer Science, Guangdong AIB Polytechnic College, Guangzhou 510507, China
6
Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
7
CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Life 2023, 13(4), 1011; https://doi.org/10.3390/life13041011
Submission received: 10 March 2023 / Revised: 2 April 2023 / Accepted: 8 April 2023 / Published: 14 April 2023
(This article belongs to the Section Epidemiology)

Abstract

:
Corona Virus Disease 2019 (COVID-19) not only causes respiratory system damage, but also imposes strain on the cardiovascular system. Vascular endothelial cells and cardiomyocytes play an important role in cardiac function. The aberrant expression of genes in vascular endothelial cells and cardiomyocytes can lead to cardiovascular diseases. In this study, we sought to explain the influence of respiratory syndrome coronavirus 2 (SARS-CoV-2) infection on the gene expression levels of vascular endothelial cells and cardiomyocytes. We designed an advanced machine learning-based workflow to analyze the gene expression profile data of vascular endothelial cells and cardiomyocytes from patients with COVID-19 and healthy controls. An incremental feature selection method with a decision tree was used in building efficient classifiers and summarizing quantitative classification genes and rules. Some key genes, such as MALAT1, MT-CO1, and CD36, were extracted, which exert important effects on cardiac function, from the gene expression matrix of 104,182 cardiomyocytes, including 12,007 cells from patients with COVID-19 and 92,175 cells from healthy controls, and 22,438 vascular endothelial cells, including 10,812 cells from patients with COVID-19 and 11,626 cells from healthy controls. The findings reported in this study may provide insights into the effect of COVID-19 on cardiac cells and further explain the pathogenesis of COVID-19, and they may facilitate the identification of potential therapeutic targets.

1. Introduction

Corona Virus Disease 2019 (COVID-19), caused by respiratory syndrome coronavirus 2 (SARS-CoV-2) [1,2,3], was first declared a pandemic on 11 March 2020 due to its quick global spread [4,5]. As of 23 July 2022, over 569 million cases have been reported worldwide, with over 6.3 million deaths [6]. Fever, sore throat, dry cough, and pneumonia symptoms are the common clinical manifestations of COVID-19 [7], and the majority of patients with COVID-19 initially exhibit upper respiratory symptoms resembling those of the common flu [8]. However, acute respiratory distress syndrome (ARDS), multiorgan failure, and possibly death have been reported in a small percentage of people with SARS-CoV-2 infection [9]. ARDS is the most common complication in patients with COVID-19, especially elderly patients with underlying diseases, such as diabetes [10].
However, COVID-19 not only harms the respiratory system, but also the cardiovascular system [11]. Approximately 20–30% of patients with COVID-19 have presented with cardiovascular problems linked to poor outcomes [12,13]. Specifically, COVID-19 may increase the risk of cardiovascular diseases, such as myocarditis, arrhythmia, atherosclerosis, and coronary artery disease [14,15,16,17]. For example, viral myocarditis is a type of heart disease possibly caused by COVID-19. Three potential pathogenic mechanisms are widely recognized: (a) SARS-CoV-2 directly damages cardiomyocytes through ACE2 receptors expressed in the heart [18], (b) COVID-19-induced cytokine storm leads to cardiomyocyte hypoxia and apoptosis [19,20], and (c) COVID-19 causes interferon-mediated systemic inflammation that damages cardiomyocytes [13,21]. Some drugs used to treat COVID-19 can cause heart damage, especially antivirals [15]. Therefore, experts in the field are particularly interested in cardiovascular damage caused by SARS-CoV-2 infection and the potential mechanisms associated with cardiac damage.
Vascular endothelial cells and cardiomyocytes are closely related to heart function and cardiovascular diseases. Vascular endothelial cells form the cell lining of arteries, veins, and capillaries [22]; actively control vasoconstriction; and are crucial for controlling immunological response, inflammation, and angiogenesis [23]. Cardiomyocytes are responsible for generating contractile force in the heart and can control the rhythmic beating of the heart [24]. Dysfunction of vascular endothelial cells can heighten the likelihood of developing cardiovascular diseases, such as atherosclerosis [25] and heart failure [26], and can also indirectly result in myocardial dysfunction due to impaired blood flow [27]. Cardiomyocyte dysfunction is associated with several heart diseases, such as cardiac hypertrophy [28], heart failure [29,30], and myocarditis [31].
In this study, we investigated the effect of SARS-CoV-2 infection on gene expression in vascular endothelial cells and cardiomyocytes. The gene expression profiles were retrieved from the GEM database for the cardiomyocytes and vascular endothelial cells of patients with COVID-19 and healthy controls. To build quantitative guidelines for precise prediction, many machine learning algorithms were employed to constitute a computational workflow. The purpose was to identify key genes differentially expressed in the heart after SARS-CoV-2 infection. The features and rules discovered can facilitate the prediction and diagnosis of COVID-19 cardiovascular-related sequelae and aid in understanding the potential pathogenic mechanisms of COVID-19-induced heart damage.

2. Materials and Methods

The flow of the machine learning-based workflow designed in this study is shown in Figure 1. Two types of heart cell samples were grouped according to COVID-19 status. The genes that were used to represent cells were ranked according to importance using four methods. Finally, by combining the incremental feature selection (IFS) [32] method and decision tree (DT) [33], some key genes and quantitative classification rules were obtained. This section describes the methods used in each segment.

2.1. Data

The current study integrated data from the GEM database [34], which contains the gene expression profiles of cardiac cells from patient with COVID-19 versus healthy controls. The database was downloaded from https://singlecell.broadinstitute.org/single_cell/study/SCP1216 (accessed on 23 May 2022). Cardiomyocytes and vascular endothelial cells in the heart were analyzed because they are closely related to heart function. Cells were grouped according to the cell type, and we obtained two sets of data. The first set contained 104,182 cardiomyocyte samples, including 12,007 COVID-19 cardiomyocytes and 92,175 healthy cardiomyocytes. The second set contained 22,438 vascular endothelial cells, of which 10,812 were COVID-19 vascular endothelial cells and 11,626 were healthy vascular endothelial cells. Each cell was represented by 29,071 gene expression features. In each cell type, COVID-19 and healthy control were deemed labels, which were combined with features to constitute a classification system. Essential features can be obtained by investigating such classification system.

2.2. Feature Selection Methods

Each sample contained a large number of gene expression features, but only a small fraction was associated with COVID-19. We analyzed these genes using the following four feature selection methods: least absolute shrinkage and selection operator (LASSO) [35], light gradient boosting machine (LightGBM) [36], Monte Carlo feature selection (MCFS) [37], and random forest (RF) [38], and we ranked them according to their association with COVID-19.

2.2.1. Least Absolute Shrinkage and Selection Operator

In the LASSO algorithm, a first-order penalty function was constructed using the L1 paradigm, where independent variables are the genetic features we input. By applying penalties to the variables with low correlation and small predictive contribution, the corresponding coefficients are reduced to zero, thereby eliminating these unimportant features. Such operation can reduce the data dimension and prevent overfitting. By observing the optimized function, the input features can be sorted according to the absolute values of the coefficients of the features.

2.2.2. Light Gradient Boosting Machine

LightGBM improves the gradient boosting DT algorithm. In a large dataset, it can merge some mutually exclusive features and eliminate those with small gradients, thus achieving data dimensionality reduction and improving efficiency. It uses a leaf-wise strategy when constructing a tree, so it extends branches with high efficiency each time during feature evaluation. The more a feature is involved in building the tree, the more it contributes to the prediction. Thus, features can be ranked according to their occurring times in the constructed DTs.

2.2.3. Monte Carlo Feature Selection

The MCFS algorithm constructs a number of independent DTs. The algorithm randomly selects some features used for constructing the nodes of trees many times. For each feature set, the algorithm randomly selects training data many times. Eventually, p feature subsets are obtained, and for each feature subset, training data are randomly constructed t times, and DTs are constructed. Thus, p × t trees are obtained. The relative importance score ( R I ) can symbolize the importance of a feature g.
R I g = τ = 1 p × t ω A C C u n g τ I G ( n g τ ) n o . i n   n g τ n o . i n   τ v ,
In the formula, ω A C C is the weighted accuracy of the tree τ under consideration, n g τ is a node of the DT whose information gain is denoted as I G ( n g τ ) , n o . i n   n g τ denotes the sample size of n g τ , and n o . i n   τ is the number of samples in the root of τ ; u and v are two positive reals weighting the ω A C C and the ratio n o . i n   n g τ / n o . i n   τ , respectively. In terms of decreasing order of RI scores, features can be ranked in a list.

2.2.4. Random Forest

Permutation feature importance was first introduced for RFs by Breiman [39] in 2001 and was later applied to other models by Fisher et al. [38]. The rationale is easy to explain. If a feature is more important, it leads to a greater increase in prediction error when it is randomized. The importance of features can be measured by the increase in prediction error after feature permutation. If permuting its value does not increase the prediction error, the feature is not important. Based on the increment in prediction error, features can be ranked in a list.

2.3. Incremental Feature Selection

Using the above methods, four ranked feature lists were obtained. However, it is still challenging to select which features to participate in classification. Given a classification algorithm, its performance under the selected features should provide good performance. This study used the IFS method [32] to determine such features. It constructs a series of feature subsets from a given feature list. The first subset contains the first 10 features, and then each new subset includes an increment of ten following features in the list. Each of these subsets is used to train the downstream classifier based on a given classification algorithm. The performance of all classifiers is subsequently evaluated using 10-fold cross-validation. Based on the performance of the classifiers, the classifier with best performance can be obtained and the best feature subset used in this classifier is picked up for further investigation.

2.4. Synthetic Minority Oversampling Technique

Looking at the data set, we found a disparity in the number of samples in each category. For example, the number of healthy cardiomyocytes was 7.7 times that of COVID-19 cardiomyocytes. The direct use of these data would result in preferences for majority classes. This study used the synthetic minority oversampling technique (SMOTE) [40] to address this issue. For a minority class, the method randomly selects a sample, say α, and looks for its k closest neighbors of the same class according to the Euclidean distance. Sample b is randomly selected among these k neighbors, and a point is randomly identified on the line segment connecting α and b. This point is taken as a new sample of the minority class. This process is repeated to generate enough new samples for the minority class so that the number of samples of each class in the dataset is balanced.

2.5. Decision Tree

The operation of the IFS method requires a classification algorithm. Here, we selected the DT algorithm [33], which constructs a tree structure with a trunk and leaves. The trunk represents the process of classifying the samples and has several nodes, which contain tests on features. Samples are assigned to different trunks according to the tests. Eventually, the samples reach the leaves, which represent the categories. The DT algorithm is a classical white-box algorithm; that is, its decision process is transparent and can suggest meaningful clues to facilitate the elucidation of the effect of COVID-19 on the expression levels of genes. These clues are included in a group of quantitative classification rules, which can be extracted from the constructed trees.

2.6. Performance Evaluation

In IFS method, many classifiers were constructed and evaluated using 10-fold cross-validation [41]. The F 1   m e a s u r e was taken as the key metric in measuring the performance of the classifiers [42,43,44,45,46,47,48]. The calculation procedure was as follows:
P r e c i s i o n = T P T P + F P ,
R e c a l l = T P T P + F N ,
F 1   m e a s u r e = 2 × ( R e c a l l × P r e c i s i o n ) R e c a l l + P r e c i s i o n ,
where T P is true positive, F P is false positive, and F N is false negative. Classifier performance increases with F 1   m e a s u r e .
We further used accuracy (ACC) and Matthew correlation coefficient (MCC) [49] for reference. MCC indicates the agreement between predicted and observed labels and is balanced when classes have different sample sizes. The performance of a classifier increases with MCC and ACC.

3. Results

The gene expression profiles of cardiomyocytes and cardiac vascular endothelial cells from patients with COVID-19 and healthy population were analyzed in this study, as shown in Figure 1. The results obtained in each step are presented in this section.

3.1. Results of Feature Ranking and Incremental Feature Selection

Each heart cell contained 29,071 gene expression signatures, which were analyzed using LASSO, LightGBM, MCFS, and RF methods. Four ranked feature (gene) lists were obtained, which are provided in Table S1. As only a small number of genes would be significantly associated with COVID-19, top 2000 genes in each list were picked up for subsequent analysis. Using the IFS method and the DT algorithm, 200 feature subsets were constructed from each list, thereby inducing 200 DT classifiers. All classifiers were assessed by 10-fold cross-validation. Table S2 provides the evaluation results of all classifiers. To clearly display the performance of classifiers under different feature subsets, IFS curves were plotted using F1 measure as the vertical coordinate and the number of genes (features) as the horizontal coordinate. The results are shown in Figure 2 and Figure 3.
On the cardiomyocyte dataset, four IFS curves on four feature lists are illustrated in Figure 2. It can be observed that the DT classifier using the first 20 genes in the list yielded by LightGBM showed the best performance, with an F1 measure of 0.983. In addition, it provided high ACC and MCC (0.996 and 0.980, respectively, see Table 1). The best DT classifiers on other three feature lists also showed excellent performance. The highest F1 measures were 0.975 (LASSO), 0.975 (MCFS), and 0.976 (RF) when first 20, 1500, and 340 genes were used in the corresponding lists, respectively. Their detailed performance is listed in Table 1.
On the dataset of vascular endothelial cells, four IFS curves are shown in Figure 3. The DT classifier using the first 120 genes in the list yielded by MCFS showed the highest performance (F1 measure of 0.949) and the best ACC and MCC values (0.951 and 0.902, respectively) were obtained on the same set of genes, which are listed in Table 1. Similarly, the best DT classifiers on the lists yielded by LASSO, LightGBM, and RF also provided high performance, which generated the F1 measure values of 0.923, 0.948, and 0.945, respectively, by using the first 60, 80, and 60 genes in the corresponding lists.
According to the above results, the best DT classifiers on each feature list all gave high performance, meaning that they can be useful tools to identify COVID-19 patients.
For each cell type, important genes were extracted from each feature list, with which DT classifier can yield the best performance. The number of genes is listed in Table 1. However, lots of such genes were extracted from some feature lists. For example, for cardiomyocytes, 1500 genes were accessed from the list yielded by MCFS. In this case, it was not easy to conduct the further investigation. In view of this, the most important genes should be extracted from these genes. After checking the IFS results on two cell types, we selected the top 60 genes from the list yielded by MCFS and top 60 genes from the list yielded by RF for cardiomyocytes, and we selected the top 20 genes from the list yielded by LightGBM and top 20 genes from the list yielded by MCFS for vascular endothelial cells. The F1-measure values of DT classifiers under above features are marked in Figure 2 and Figure 3. Compared with the F1-measure values of the best DT classifiers on the same feature list, they were a little lower. However, the numbers of used features were sharply reduced. Thus, these genes were relatively more important than rest genes. For the important genes selected from other feature lists, it was not necessary to make further selection as only a few genes were picked up. Accordingly, the top 20, 20, 60, and 60 genes in the lists yielded by LASSO, LightGBM, MCFS, and RF for cardiomyocytes, respectively, were selected to find the intersection and a Wayne diagram was plotted, as shown in Figure 4. Similarly, the most important genes (the first 60, 20, 20, and 60 genes of the lists) for vascular endothelial cells were analyzed. The results of the Wayne diagram are shown in Figure 5. As expected, we searched the key genes commonly indicated by these methods and shown to be closely associated with COVID-19. Detailed intersection results are shown in Table S3.

3.2. Classification Rules

The DT algorithm is a classical white-box algorithm. It can explicitly show the process of classification and provide interpretable classification clues. These rules may facilitate the analysis of the significance of genes, which may be differentially expressed in patients with COVID-19. By using the best DT classifiers on four feature lists, we summarized four groups of quantitative classification rules for cardiomyocytes and cardiovascular endothelial cells, respectively. The detailed classification rules are provided in Table S4. On one hand, these rules can be used to distinguish patients with COVID-19 from healthy controls. On the other hand, they can clearly display the different expression patterns in cardiomyocytes or cardiovascular endothelial cells between healthy controls and patients with COVID-19. Some rules will be discussed in detail below.

4. Discussion

We identified a set of potential signature genes that reveal differential expression associated with COVID-19 in cardiomyocytes and vascular endothelial cells. These genes can be useful in understanding how SARS-CoV-2 infection affects gene expression in cardiac cells. Confirming these genes can enhance the understanding of the pathogenesis of COVID-19-induced cardiovascular diseases, aiding in the clinical diagnosis and treatment of related clinical diseases. Several features and quantitative rules we identified are related to COVID-19-caused cardiovascular diseases according to newly published papers.

4.1. Analysis of Gene Features in Cardiac Cells for COVID-19

Based on the machine learning-based workflow, a set of significant genes were identified, which may be differentially expressed in vascular endothelial cells and cardiomyocytes. The genes facilitated the differentiation of patients with COVID-19 from healthy populations. Under pathological conditions, several top genes are involved in distinct biological activities in cardiomyocytes and vascular endothelial cells. Here, we analyzed the first five genes listed in Table 2 for cardiomyocytes and vascular endothelial cells.
The possible impact of the differential expression of identified top features in vascular endothelial cells and cardiomyocytes on the hearts of patients with COVID-19 is discussed below. This discussion may explain the vulnerability of patients with COVID-19 to cardiovascular diseases and may offer direction for clinical diagnosis and treatment.

4.1.1. Qualitative Features in Vascular Endothelial Cells

The first identified feature gene is MALAT1 (ENSG00000251562), which has a significant impact on the heart given that it positively regulates the proliferation of cardiomyocytes [50] and plays an important role in the regulation of cardiovascular inflammation [51]. Low levels of MALAT1 lncRNA were discovered in patients with severe COVID-19 [52,53]. Another study suggested that MALAT1 depletion is responsible for the sepsis inflammatory response by inhibiting the expression of IL-6 and TNF-α and the NF-κB signaling pathway by upregulating miR-150-5p [54]. COVID-19 is associated with proinflammatory cytokine release [55], suggesting that SARS-CoV-2 infection alters the expression of MALAT1. Therefore, MALAT1 can be considered a potential feature for identifying patients with COVID-19. In vascular endothelial cells, the differential expression of MALAT1 is associated with several cardiovascular diseases, and MALAT1 is associated with the inflammation and apoptosis of vascular endothelial cells [56], risk of coronary heart disease [57,58], and deep vein thrombosis [59].
The next identified features were ID1 (ENSG00000125968) and ID3 (ENSG00000117318). They belong to the inhibitor of DNA binding (ID) family. IDs are required for the formation of the heart [60,61] and skeletal myogenesis [62] and play an important role in angiogenesis [63]. Thus, a potential relationship between IDs and the heart was revealed. A recent study discovered ID1 differential expression in COVID-19 retest-positive patients [64], demonstrating the potential influence of COVID-19 on ID expression. The beneficial function of IDs in reducing viral replication has been shown in some articles [65,66]. The expression of IDs in vascular endothelial cells may have a potential role in increasing the risk of some cardiovascular diseases. ID3 expression is involved in the protective process of coronary artery disease [67] and atherosclerosis [60]. Moreover, ID1 plays an important role in repair after a vascular injury [68].
The next identified gene was MT-CO1 (ENSG00000198804), which is a cytochrome c oxidase gene and a mitochondrial marker that plays an important role in mitochondrial aerobic respiration [69]. Given that the heart’s driving function depends on ATP produced by aerobic respiration, MT-CO1 is essential for the heart. Several studies have demonstrated the effect of COVID-19 on MT-CO1 expression. In 2022, researchers found low mRNA levels of MT-CO1 in patients with COVID-19 [70], and another article noted that MT-CO1 was downregulated in patients with COVID-19 and recovered individuals [71]. The differential expression of MT-CO1 in vascular endothelial cells due to COVID-19 may contribute to a range of cardiovascular diseases. The inhibition of MT-CO1 expression may lead to inefficient electron transfer, resulting in the production of reactive oxygen species [72]. The resulting damage to the vascular endothelium has detrimental effects on cardiac function. In addition, MT-CO1 downregulation produces mitochondrial oxidative stress, which may increase the risk of atherosclerosis and coronary artery disease [73]. Thus, the higher risk of cardiovascular disease in patients with COVID-19 is partially explained by the differential expression of the MT-CO1 gene in vascular endothelial cells [74].
The last identified feature was EGFL7 (ENSG00000172889). EGFL7 gene is associated with angiogenesis and is highly expressed in the developing neonatal vasculature [75]. It is upregulated in adults with a vascular injury [76,77]. In 2020, Leng et al. [78] discovered that EGFL7 is downregulated in patients with COVID-19, suggesting that EGFL7 can be identified as a valid feature for distinguishing patients with COVID-19. The negative consequences that may be caused by EGFL7 gene differential expression in vascular endothelial cells in the heart have been explored. One study discovered that increasing EGFL7 expression enhances neoangiogenesis within plaques and promotes the development of atherosclerosis [79]. Moreover, EGFL7 knockdown induces cardiac dysfunction and fibrosis [80], and EGFL7 genetic deletion causes severe vascular defects [81].

4.1.2. Qualitative Features in Cardiomyocytes

The first identified feature gene was MALAT1 (ENSG00000251562). We have discussed the basic function of the MALAT1 gene and its differential expression in patients with COVID-19. In contrast to the previous discussion, the COVID-19-induced differential expression of MALAT1 in cardiomyocytes produced different negative effects on patients’ hearts. Increased MALAT1 expression enhances cardiomyocyte apoptosis [50,82,83], and MALAT1 is highly expressed in patients with acute myocardial infarction [84,85]. However, the low expression of MALAT1 may promote myocardial hypertrophy [86].
The next identified gene was CD36 (ENSG00000135218). It plays an important role in lipid metabolism, especially in the uptake of long-chain fatty acids, which are the primary source of myocardial energy supply, which explains why CD36 is so closely related to heart function [87]. CD36 is differentially expressed in patients with COVID-19 and recovered individuals [88]. Moreover, human primary monocytes infected with SARS-CoV-2 alter the expression of genes related to lipid uptake, such as CD36 [89]. All these results suggest that COVID-19 may alter CD36 gene expression. Alterations in CD36 gene expression may cause some adverse effects on the heart and leads to the disruption of fatty acids or lipid metabolism, resulting in a range of chronic diseases, such as cardiac hypertrophy, heart failure, and cardiac ischemia/reperfusion [90]. The myocardial uptake of long-chain fatty acids is significantly reduced, and glucose utilization is increased in CD36-deficient patients [91]. A shift in energy supply accelerates the progression of myocardial hypertrophy to heart failure [92].
The next identified feature was LARGE1 (ENSG00000133424), which is highly abundant in the heart [93] and involved in the regulation of cellular homeostasis in myocytes [94]. Although no direct evidence has shown that LARGE1 gene expression is altered in patients with COVID-19, two studies on the Lassa fever virus, which also contains spike proteins, found LARGE1 gene overexpression [95,96]. Thus, LARGE1 expression is likely to change as a result of SARS-CoV-2 infection. LARGE1 overexpression exacerbates muscular dystrophy [97,98]. Given that this disease affects cardiac function and even leads to heart failure [99], the altered expression of LARGE1 in cardiac myocytes caused by COVID-19 may have an adverse effect on the heart.
The next identified gene was RYR2 (ENSG00000198626), which is highly expressed in the heart, is involved in action potential regulation in atrial myocytes [100], and positively regulates atrial contractility [101]. These functions indicate the important role of RYR2 in the heart. Indirect evidence has demonstrated the association between COVID-19 and RYR2 expression. COVID-19 may cause hypoxia in patients [102], and RYR2 expression can be significantly reduced upon hypoxic exposure [103]. RYR2 channels exhibit increased activity in a patient’s brain [104], and alterations in RYR2 expression may cause a range of negative effects on the heart. Moreover, pathological Ca2+ channel remodeling and heart failure progression are linked to RYR2 [105,106]. Additionally, catecholamine-sensitive ventricular tachycardia is caused by RYR2 dysfunction [107], and low RYR2 expression likely causes a comparable range of negative effects.
The last identified feature was PLCG2(ENSG00000197943), which is a SARS-CoV-2 infection-related gene associated with immune response [108]. SARS-CoV-2 infection may lead to viral myocarditis, which may be associated with the expression of PLCG2. The expression of PLCG2 transcripts is upregulated in club cells after SARS-CoV-2 infection [109], and PLCG2 is highly expressed in the kidneys of patients with COVID-19 [108]. Cardiac injury due to COVID-19-induced differential expression of PLCG2 in cardiomyocytes may be associated with myocarditis. Given that PLCG2 is involved in multiple immune responses [110,111], its high expression may promote inflammation leading to myocarditis.

4.2. Analysis of Decision Rules in Cardiac Cells for COVID-19

As described above, we identified a validated set of genes that may help in qualitatively distinguishing cardiac gene expression samples from patients with COVID-19 from that in uninfected populations by using cardiac gene expression. The ability of some genes in a sample categorization at the transcriptome level can be confirmed by recent studies. In addition, quantitative rules were also established based on the computational workflow, and some representative rules for cardiomyocytes and vascular endothelial cells were selected for in-depth discussion, as listed in Table 3. The rules allowed us to accurately identify patients with COVID-19 according to changes in gene expression in cardiac cells.
We then predicted the potential impact of SARS-CoV-2 infection on patients’ hearts and the likelihood of developing cardiovascular diseases according to parameters differentially expressed in each rule. The pathogenicity analysis of the parameters in the rule is more specific than that in the previous section and is strictly based on the “upregulation” or “downregulation” indicated in the rule.

4.2.1. Quantitative Rules in Vascular Endothelial Cells

The first rule (Rule 0) involves four parameters that help in differentiating a population infected with SARS-CoV-2 and reflecting on the impact of COVID-19 on heart function. The first parameter MALAT1 (ENSG00000251562) was downregulated in this rule for the identification of patients with COVID-19. In 2020, a study found that MALAT1 was downregulated in the bronchoalveolar lavage fluids of patients with mild and severe SARS-CoV-2 infections [112]. Another study in 2021 found similar results [113]. MALAT1 can be considered a reliable parameter for distinguishing patients with COVID-19 despite the fact that MALAT1 expression levels varied with the severity of COVID-19 and the type of cells tested [52,114]. MALAT1 is associated with vascular endothelial cell homeostasis [115], and reduced MALAT1 expression may impede vascular repair after SARS-CoV-2 infection. MT-CO1 (ENSG00000198804), as a second parameter, was downregulated in vascular endothelial cells after SARS-CoV-2 infection in this rule. In 2022, researchers found reduced mRNA levels of MT-CO1 in patients with COVID-19 [70], validating this parameter. Another study from 2021 discovered that after SARS-CoV-2 infection, the expression of mitochondrial genes, including the MT-CO1 gene, was downregulated [116]. Reduced MT-CO1 expression in this rule may increase the incidence of coronary artery disease and contributes to poor outcomes in patients with coronary artery disease [117,118]. The next parameter in the vascular endothelium was HIF3A (ENSG00000124440), which was shown to be downregulated in this rule. In 2021, HIF3A was found to be downregulated within the frontal cortices of patients with COVID-19 [119]. HIF3A is induced by hypoxia [120] and downregulated in the inflammatory states [121], partially explaining the validity of the parameters. However, studies on the potential damage caused by low HIF3A expression to the heart are limited. A 2021 publication found that silencing HIF3A reduces apoptosis and has a cardioprotective effect [122]. The last parameter in Rule 0 was SNHG7(ENSG00000233016), which was predicted to be downregulated in the vascular endothelium of patients with COVID-19. In 2021, researchers found that SNHG7 lncRNA was upregulated in SARS-CoV-2-infected cells and tissues of patients with COVID-19 [123]. This result is contrary to the prediction of Rule 0. The testing of other tissues, such as the lung tissues, which were more exposed to different quantities of SARS-CoV-2 than the heart, may be the cause of the discrepancy between the results. A recent publication suggested that SNHG7 provides protection against atherosclerosis development [124], suggesting that patients with COVID-19 are more likely to develop atherosclerosis or more severe atherosclerosis. SNHG7 was downregulated in unstable plaques of patients with coronary artery disease [125], suggesting the potential role of its low expression in the regulation of coronary artery disease.
The second rule (Rule 1), which helped us identify healthy people without SARS-CoV-2 infection, contained four parameters. The first parameter was MALAT1 (ENSG00000251562), whose expression in vascular endothelial cells was consistently high in healthy populations. The downregulation of MALAT1 in patients with COVID-19 and the potential effects on cardiac health have been discussed. The second parameter, ID1 (ENSG00000125968), showed consistently high levels in vascular endothelial cells in this rule and was used in distinguishing individuals who were not infected by SARS-CoV-2. The knockdown of ID1 was found to be beneficial for the replication and transcription of multiple viruses [65,66], partially demonstrating that ID1 helps in distinguishing uninfected populations. Endocytic activation and angiogenesis are hampered by ID1 gene inhibition [126], and high levels of ID1 expression regulate the severity of inflamed tissue injury [127]. Therefore, lowering its expression following SARS-CoV-2 infection may reduce protection against heart damage. The next parameter was PDLIM5 (ENSG00000163110), which had low levels in the vascular endothelial cells of the SARS-CoV-2-uninfected population. Given the crucial function of PDLIM5 in repair when the heart is damaged [128,129], low levels of PDLIM5 in vascular endothelial cells in healthy populations seem understandable. Given that PDLIM5 is a pro-atherosclerotic gene [130], SARS-CoV-2 infection that results in increased PDLIM5 expression levels may accelerate atherosclerosis. The next parameter, PHACTR1 (ENSG00000112137), had low levels in vascular endothelial cells in this rule. It was used in distinguishing uninfected individuals from patients with COVID-19. According to a 2018 study, PHACTR1 interacts with MRTF-A to mediate inflammation, which may help to explain why uninfected people continue to have low levels of PHACTR1. The COVID-19-mediated overexpression of PHACTR1 may cause a range of cardiovascular diseases, including coronary artery disease [131,132,133] and acute myocardial infarction [133].

4.2.2. Quantitative Rules in Cardiomyocytes

The first rule (Rule 0), with two parameters, facilitated the identification of patients with COVID-19 according to the differential expression of these parameters in cardiomyocytes. The first parameter was MALAT1 (ENSG00000251562), which showed low expression levels in the cardiomyocytes of patients with COVID-19. Low MALAT1 expression in cardiomyocytes can cause damage to the hearts of COVID-19 patients. Chen et al. [86] found that MALAT1 knockout exacerbates angiotensin II-induced cardiac hypertrophy, suggesting that patients with COVID-19 are susceptible to myocardial hypertrophy. As for the second parameter, TTN (ENSG00000155657), which was shown downregulated in this rule, was used in distinguishing patients with COVID-19. Kanduc et al. [134] found that titin, a protein expressed by the TTN gene, shares 29 pentapeptides with the echinocandin of SARS-CoV-2. This similarity may cause immune cross-reactivity and thus affect TTN gene expression. Another publication stated that TTN is downregulated after IL-6 treatment [55], implying a corresponding downregulation in patients with COVID-19. TTN expression and myocardial function are tightly associated [55,135], and cardiomyocytes with TTN gene knockdown show weaker and disordered muscle nodes [136], which may decrease the heart’s ability to contract.
As for the second rule (Rule 1), it contained three parameters for the identification of populations without SARS-CoV-2 infection. The first parameter was EMC10 (ENSG00000161671), which had low levels in the cardiomyocytes of uninfected individuals. With regard to this gene’s potential impact on the heart, SARS-CoV-2 infection may result in the overexpression of EMC10, a gene primarily involved in cardiac repair [134]. As a result, we considered it a potential adaptive repair response. NEB (ENSG00000183091), which was demonstrated in this rule to retain low expression levels in the cardiomyocytes of uninfected people, was the parameter used in excluding SARS-CoV-2-infected people. SARS-CoV-2 infection affects NEB gene expression and induces an inflammatory response [137], contributing to the exclusion of healthy individuals. Patients with COVID-19 have higher levels of NEB expression in their cardiomyocytes. Actin filament length is regulated by NEB [138], suggesting that NEB overexpression may cause myonodal dysfunction, which in turn results in impaired heart contraction. The last parameter in this rule was PTGDS (ENSG00000107317), a gene whose specific expression level in cardiac myocytes helps in distinguishing healthy individuals without SARS-CoV-2 infection. In this rule, the expression of PTGDS should be maintained at a high level. In 2021, Haslbauer et al. [139] found that PTGDS gene expression was downregulated in patients with COVID-19, suggesting the dysregulation of arachidonic acid metabolism. The heart may be affected by COVID-19-induced decreased expression of PTGDS, which is involved in the production of prostaglandin D2 necessary for the survival of cardiomyocytes [140]. Therefore, increased cardiomyocyte apoptosis due to decreased PTGDS expression can harm the heart. Furthermore, Zhao et al. [141] revealed that reduced PTGDS expression may be a biomarker of myocardial infarction, indicating that COVID-19 patients may have a higher risk of experiencing this condition.
Overall, as we have already indicated, previous studies have validated the acquired rules and supported the top quantitative rules.

4.3. Limitations of This Study

In the current study, some essential information (essential genes and classification rules) was extracted from a large gene expression profile on two types of heart cells for COVID-19 patients and healthy controls. Some genes and rules can be confirmed to be related to COVID-19-caused cardiovascular diseases by looking up recent publications. However, solid evidence (through wet experiments) was not provided in this study. Furthermore, lots of genes and rules were output by the machine learning-based workflow. Only a few of them were discussed. Some essential findings may be hidden in the undiscussed part. It is hopeful that the following studies can identify more essential genes and rules.

5. Conclusions

To investigate the effect of COVID-19 on cardiac cells, a machine learning-based workflow was designed in this study, which analyzed the gene expression data of two heat cell types from patients with COVID-19 and healthy controls, including cardiomyocytes and cardiac vascular endothelial cells. Some essential genes related to COVID-19 in these cells were accessed, which can be latent biomarkers for identifying COVID-19 patients. On the other hand, effective classification rules were established, indicating the special expression patterns in these cells for COVID-19 patients. Furthermore, some efficient DT classifiers were built in this study, which can be useful tools to diagnose COVID-19 patients. These findings provided a reference for uncovering the mechanism by which COVID-19 damages the heart and suggests some possible therapeutic targets. Unlike traditional drug target studies, which investigate the association between a drug and genes, we believe the drug should target certain cells and the genes within these cell types. The single cell method will revolutionize the drug discovery.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life13041011/s1, Table S1: Feature lists obtained by using LASSO, LightGBM, MCFS, and RF; Table S2: IFS results of decision tree on different feature lists for two cell types; Table S3: Intersection of important genes identified by LASSO, LightGBM, MCFS, and RF for the two cell types. The features identified by 4, 3, 2, and 1 method(s) are shown; Table S4: Classification rules generated by the optimal DT classifiers on different feature lists for two cell types.

Author Contributions

Conceptualization, Z.Z., T.H. and Y.C.; methodology, Y.X., Q.M., L.C. and K.F.; validation, T.H.; formal analysis, J.R. and W.G.; data curation, T.H.; writing—original draft preparation, Y.X., Q.M. and J.R.; writing—review and editing, T.H. and Y.C.; supervision, Z.Z. and Y.C.; funding acquisition, T.H. and Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China [2022YFF1203202], Strategic Priority Research Program of Chinese Academy of Sciences [XDA26040304, XDB38050200], the Fund of the Key Laboratory of Tissue Microenvironment and Tumor of Chinese Academy of Sciences [202002], Shandong Provincial Natural Science Foundation [ZR2022MC072].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are openly available in GEM database at https://singlecell.broadinstitute.org/single_cell/study/SCP1216 (accessed on 23 May 2022), reference number [34].

Conflicts of Interest

The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Long, B.; Carius, B.M.; Chavez, S.; Liang, S.Y.; Brady, W.J.; Koyfman, A.; Gottlieb, M. Clinical update on COVID-19 for the emergency clinician: Presentation and evaluation. Am. J. Emerg. Med. 2022, 54, 46–57. [Google Scholar] [CrossRef] [PubMed]
  2. Coronavirus Disease (COVID-19)—World Health Organization. Available online: https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed on 20 September 2022).
  3. Adil, M.T.; Rahman, R.; Whitelaw, D.; Jain, V.; Al-Taan, O.; Rashid, F.; Munasinghe, A.; Jambulingam, P. SARS-CoV-2 and the pandemic of COVID-19. Postgrad. Med. J. 2021, 97, 110–116. [Google Scholar] [CrossRef] [PubMed]
  4. Chavez, S.; Long, B.; Koyfman, A.; Liang, S.Y. Coronavirus disease (COVID-19): A primer for emergency physicians. Am. J. Emerg. Med. 2021, 44, 220–229. [Google Scholar] [CrossRef] [PubMed]
  5. Raman, B.; Bluemke, D.A.; Luscher, T.F.; Neubauer, S. Long covid: Post-acute sequelae of COVID-19 with a cardiovascular focus. Eur. Heart J. 2022, 43, 1157–1172. [Google Scholar] [CrossRef]
  6. COVID-19 Map. Available online: https://coronavirus.jhu.edu/map.html (accessed on 20 September 2022).
  7. Parasher, A. COVID-19: Current understanding of its pathophysiology, clinical presentation and treatment. Postgrad. Med. J. 2021, 97, 312–320. [Google Scholar] [CrossRef]
  8. Thakur, V.; Ratho, R.K. Omicron (b.1.1.529): A new SARS-CoV-2 variant of concern mounting worldwide fear. J. Med. Virol. 2022, 94, 1821–1824. [Google Scholar] [CrossRef] [PubMed]
  9. Song, C.; Li, Z.; Li, C.; Huang, M.; Liu, J.; Fang, Q.; Cao, Z.; Zhang, L.; Gao, P.; Nie, W.; et al. SARS-CoV-2: The monster causes COVID-19. Front. Cell. Infect. Microbiol. 2022, 12, 835750. [Google Scholar] [CrossRef]
  10. Li, X.; Ma, X. Acute respiratory failure in COVID-19: Is it “typical” ards? Crit. Care 2020, 24, 198. [Google Scholar] [CrossRef]
  11. Wang, D.; Hu, B.; Hu, C.; Zhu, F.; Liu, X.; Zhang, J.; Wang, B.; Xiang, H.; Cheng, Z.; Xiong, Y.; et al. Clinical characteristics of 138 hospitalized patients with 2019 novel coronavirus-infected pneumonia in Wuhan, China. JAMA 2020, 323, 1061–1069. [Google Scholar] [CrossRef]
  12. Cizgici, A.Y.; Zencirkiran Agus, H.; Yildiz, M. COVID-19 myopericarditis: It should be kept in mind in today’s conditions. Am. J. Emerg. Med. 2020, 38, e1545–e1547. [Google Scholar] [CrossRef]
  13. Sawalha, K.; Abozenah, M.; Kadado, A.J.; Battisha, A.; Al-Akchar, M.; Salerno, C.; Hernandez-Montfort, J.; Islam, A.M. Systematic review of COVID-19 related myocarditis: Insights on management and outcome. Cardiovasc. Revasc. Med. 2021, 23, 107–113. [Google Scholar] [CrossRef] [PubMed]
  14. Liu, J.; Deswal, A.; Khalid, U. COVID-19 myocarditis and long-term heart failure sequelae. Curr. Opin. Cardiol. 2021, 36, 234–240. [Google Scholar] [CrossRef] [PubMed]
  15. Tajbakhsh, A.; Gheibi Hayat, S.M.; Taghizadeh, H.; Akbari, A.; Inabadi, M.; Savardashtaki, A.; Johnston, T.P.; Sahebkar, A. COVID-19 and cardiac injury: Clinical manifestations, biomarkers, mechanisms, diagnosis, treatment, and follow up. Expert Rev. Anti Infect. Ther. 2021, 19, 345–357. [Google Scholar] [CrossRef] [PubMed]
  16. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in wuhan, china: A descriptive study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Siddiq, M.M.; Chan, A.T.; Miorin, L.; Yadaw, A.S.; Beaumont, K.G.; Kehrer, T.; Cupic, A.; White, K.M.; Tolentino, R.E.; Hu, B.; et al. Functional effects of cardiomyocyte injury in COVID-19. J. Virol. 2022, 96, e0106321. [Google Scholar] [CrossRef]
  18. Chen, L.; Li, X.; Chen, M.; Feng, Y.; Xiong, C. The ace2 expression in human heart indicates new potential mechanism of heart injury among patients infected with SARS-CoV-2. Cardiovasc. Res. 2020, 116, 1097–1100. [Google Scholar] [CrossRef] [Green Version]
  19. Zheng, Y.Y.; Ma, Y.T.; Zhang, J.Y.; Xie, X. COVID-19 and the cardiovascular system. Nat. Rev. Cardiol. 2020, 17, 259–260. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Wei, Z.Y.; Qian, H.Y. Myocardial injury in patients with COVID-19 pneumonia. Zhonghua Xin Xue Guan Bing Za Zhi 2020, 48, E006. [Google Scholar]
  21. Channappanavar, R.; Fehr, A.R.; Zheng, J.; Wohlford-Lenane, C.; Abrahante, J.E.; Mack, M.; Sompallae, R.; McCray, P.B., Jr.; Meyerholz, D.K.; Perlman, S. Ifn-i response timing relative to virus replication determines mers coronavirus infection outcomes. J. Clin. Investig. 2019, 129, 3625–3639. [Google Scholar] [CrossRef]
  22. Monahan-Earley, R.; Dvorak, A.M.; Aird, W.C. Evolutionary origins of the blood vascular system and endothelium. J. Thromb. Haemost. 2013, 11 (Suppl. S1), 46–66. [Google Scholar] [CrossRef] [Green Version]
  23. Kruger-Genge, A.; Blocki, A.; Franke, R.P.; Jung, F. Vascular endothelial cell biology: An update. Int. J. Mol. Sci. 2019, 20, 4411. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Sutanto, H.; Lyon, A.; Lumens, J.; Schotten, U.; Dobrev, D.; Heijman, J. Cardiomyocyte calcium handling in health and disease: Insights from in vitro and in silico studies. Prog. Biophys. Mol. Biol. 2020, 157, 54–75. [Google Scholar] [CrossRef] [PubMed]
  25. Xu, S.; Ilyas, I.; Little, P.J.; Li, H.; Kamato, D.; Zheng, X.; Luo, S.; Li, Z.; Liu, P.; Han, J. Endothelial dysfunction in atherosclerotic cardiovascular diseases and beyond: From mechanism to pharmacotherapies. Pharmacol. Rev. 2021, 73, 924–967. [Google Scholar] [CrossRef]
  26. Sun, B.; Meng, M.; Wei, J.; Wang, S. Long noncoding rna pvt1 contributes to vascular endothelial cell proliferation via inhibition of mir-190a-5p in diagnostic biomarker evaluation of chronic heart failure. Exp. Ther. Med. 2020, 19, 3348–3354. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Park, J.H.; Shin, H.H.; Rhyu, H.S.; Kim, S.H.; Jeon, E.S.; Lim, B.K. Vascular endothelial integrity affects the severity of enterovirus-mediated cardiomyopathy. Int. J. Mol. Sci. 2021, 22, 3053. [Google Scholar] [CrossRef]
  28. Nakamura, M.; Sadoshima, J. Mechanisms of physiological and pathological cardiac hypertrophy. Nat. Rev. Cardiol. 2018, 15, 387–407. [Google Scholar] [CrossRef] [PubMed]
  29. Gao, G.; Chen, W.; Yan, M.; Liu, J.; Luo, H.; Wang, C.; Yang, P. Rapamycin regulates the balance between cardiomyocyte apoptosis and autophagy in chronic heart failure by inhibiting mtor signaling. Int. J. Mol. Med. 2020, 45, 195–209. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Oeing, C.U.; Pepin, M.E.; Saul, K.B.; Agircan, A.S.; Assenov, Y.; Merkel, T.S.; Sedaghat-Hamedani, F.; Weis, T.; Meder, B.; Guan, K. Indirect epigenetic testing identifies a diagnostic signature of cardiomyocyte DNA methylation in heart failure. Basic Res. Cardiol. 2023, 118, 9. [Google Scholar] [CrossRef]
  31. Caforio, A.L.; Baritussio, A.; Basso, C.; Marcolongo, R. Clinically suspected and biopsy-proven myocarditis temporally associated with SARS-CoV-2 infection. Annu. Rev. Med. 2022, 73, 149–166. [Google Scholar] [CrossRef]
  32. Liu, H.; Setiono, R. Incremental feature selection. Appl. Intell. 1998, 9, 217–230. [Google Scholar] [CrossRef]
  33. Safavian, S.R.; Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man Cybern. 1991, 21, 660–674. [Google Scholar] [CrossRef] [Green Version]
  34. Delorey, T.M.; Ziegler, C.G.K.; Heimberg, G.; Normand, R.; Yang, Y.; Segerstolpe, Å.; Abbondanza, D.; Fleming, S.J.; Subramanian, A.; Montoro, D.T.; et al. COVID-19 tissue atlases reveal SARS-CoV-2 pathology and cellular targets. Nature 2021, 595, 107–113. [Google Scholar] [CrossRef] [PubMed]
  35. Ranstam, J.; Cook, J. Lasso regression. J. Br. Surg. 2018, 105, 1348. [Google Scholar] [CrossRef]
  36. Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 2017, 30, 3146–3154. [Google Scholar]
  37. Dramiński, M.; Koronacki, J. Rmcfs: An r package for monte carlo feature selection and interdependency discovery. J. Stat. Softw. 2018, 85, 1–28. [Google Scholar] [CrossRef] [Green Version]
  38. Fisher, A.; Rudin, C.; Dominici, F. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. J. Mach. Learn. Res. 2019, 20, 1–81. [Google Scholar]
  39. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  40. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. Smote: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  41. Kohavi, R. A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. Int. Jt. Conf. Artif. Intell. 1995, 14, 1137–1145. [Google Scholar]
  42. Powers, D. Evaluation: From precision, recall and f-measure to roc., informedness, markedness & correlation. J. Mach. Learn. Technol. 2011, 2, 37–63. [Google Scholar]
  43. Huang, F.; Fu, M.; Li, J.; Chen, L.; Feng, K.; Huang, T.; Cai, Y.-D. Analysis and prediction of protein stability based on interaction network, gene ontology, and kegg pathway enrichment scores. BBA-Proteins Proteom 2023, 1871, 140889. [Google Scholar] [CrossRef] [PubMed]
  44. Huang, F.; Ma, Q.; Ren, J.; Li, J.; Wang, F.; Huang, T.; Cai, Y.-D. Identification of smoking associated transcriptome aberration in blood with machine learning methods. BioMed Res. Int. 2023, 2023, 5333361. [Google Scholar] [CrossRef] [PubMed]
  45. Wu, C.; Chen, L. A model with deep analysis on a large drug network for drug classification. Math. Biosci. Eng. 2023, 20, 383–401. [Google Scholar] [CrossRef]
  46. Wang, H.; Chen, L. Pmptce-hnea: Predicting metabolic pathway types of chemicals and enzymes with a heterogeneous network embedding algorithm. Curr. Bioinform. 2023. [Google Scholar] [CrossRef]
  47. Tang, S.; Chen, L. Iatc-nfmlp: Identifying classes of anatomical therapeutic chemicals based on drug networks, fingerprints and multilayer perceptron. Curr. Bioinform. 2022, 17, 814–824. [Google Scholar]
  48. Ren, J.; Zhang, Y.; Guo, W.; Feng, K.; Yuan, Y.; Huang, T.; Cai, Y.-D. Identification of genes associated with the impairment of olfactory and gustatory functions in COVID-19 via machine-learning methods. Life 2023, 13, 798. [Google Scholar] [CrossRef] [PubMed]
  49. Matthews, B.W. Comparison of the predicted and observed secondary structure of t4 phage lysozyme. Biochim. Biophys. Acta 1975, 405, 442–451. [Google Scholar] [CrossRef]
  50. Li, L.; Wang, Q.; Yuan, Z.; Chen, A.; Liu, Z.; Wang, Z.; Li, H. Lncrna-malat1 promotes cpc proliferation and migration in hypoxia by up-regulation of jmjd6 via sponging mir-125. Biochem. Biophys. Res. Commun. 2018, 499, 711–718. [Google Scholar] [CrossRef]
  51. Martens, C.R.; Bansal, S.S.; Accornero, F. Cardiovascular inflammation: Rna takes the lead. J. Mol. Cell. Cardiol. 2019, 129, 247–256. [Google Scholar] [CrossRef]
  52. Abbasi-Kolli, M.; Sadri Nahand, J.; Kiani, S.J.; Khanaliha, K.; Khatami, A.; Taghizadieh, M.; Torkamani, A.R.; Babakhaniyan, K.; Bokharaei-Salim, F. The expression patterns of malat-1, neat-1, thril, and mir-155-5p in the acute to the post-acute phase of COVID-19 disease. Braz. J. Infect. Dis. 2022, 26, 102354. [Google Scholar] [CrossRef]
  53. Devadoss, D.; Acharya, A.; Manevski, M.; Pandey, K.; Thurman, M.; Nair, M.; Borchert, G.M.; Mirsaeidi, M.; Byrareddy, N.; Chand, H.S. Severe COVID-19 patients and a 3d airway tissue model of SARS-CoV-2 infection express high levels of airway mucins and associated immunomodulatory long noncoding rnas. In TP105 Basic Mechanisms of Lung Infections: From SARS-CoV-2 to Influenza; American Thoracic Society: New York, NY, USA, 2021; p. A4183. [Google Scholar]
  54. Wei, S.; Liu, Q. Long Noncoding rna Malat1 Modulates Sepsis-Induced Cardiac Inflammation through the mir-150-5p/nf-κb axis. Int. J. Clin. Exp. Pathol. 2019, 12, 3311. [Google Scholar]
  55. Cantu, N.; Vyavahare, S.; Kumar, S.; Chen, J.; Kolhe, R.; Isales, C.M.; Hamrick, M.; Fulzele, S. Synergistic effects of multiple factors involved in COVID-19-dependent muscle loss. Aging Dis. 2022, 13, 344–352. [Google Scholar]
  56. Huang, K.; Yu, X.; Yu, Y.; Zhang, L.; Cen, Y.; Chu, J. Long noncoding rna malat1 promotes high glucose-induced inflammation and apoptosis of vascular endothelial cells by regulating mir-361-3p/socs3 axis. Int. J. Clin. Exp. Pathol. 2020, 13, 1243–1252. [Google Scholar] [PubMed]
  57. Wang, G.; Li, Y.; Peng, Y.; Tang, J.; Li, H. Association of polymorphisms in malat1 with risk of coronary atherosclerotic heart disease in a chinese population. Lipids Health Dis. 2018, 17, 75. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Lv, F.; Liu, L.; Feng, Q.; Yang, X. Long non-coding rna malat1 and its target microrna-125b associate with disease risk, severity, and major adverse cardiovascular event of coronary heart disease. J. Clin. Lab. Anal. 2021, 35, e23593. [Google Scholar] [CrossRef] [PubMed]
  59. Wang, H.; Lin, S.; Yang, Y.; Zhao, M.; Li, X.; Zhang, L. Significant role of long non-coding rna malat1 in deep vein thrombosis via the regulation of vascular endothelial cell physiology through the microrna-383-5p/bcl2l11 axis. Bioengineered 2022, 13, 13728–13738. [Google Scholar] [CrossRef]
  60. Hu, W.; Xin, Y.; Hu, J.; Sun, Y.; Zhao, Y. Inhibitor of DNA binding in heart development and cardiovascular diseases. Cell. Commun. Signal. 2019, 17, 51. [Google Scholar] [CrossRef] [Green Version]
  61. Cunningham, T.J.; Yu, M.S.; McKeithan, W.L.; Spiering, S.; Carrette, F.; Huang, C.T.; Bushway, P.J.; Tierney, M.; Albini, S.; Giacca, M.; et al. Id genes are essential for early heart formation. Genes Dev. 2017, 31, 1325–1338. [Google Scholar] [CrossRef] [Green Version]
  62. Kong, D.; He, M.; Yang, L.; Zhou, R.; Yan, Y.-Q.; Liang, Y.; Teng, C.-B. Mir-17 and mir-19 cooperatively promote skeletal muscle cell differentiation. Cell. Mol. Life Sci. 2019, 76, 5041–5054. [Google Scholar] [CrossRef] [Green Version]
  63. Qiu, J.; Li, Y.; Wang, B.; Sun, X.; Qian, D.; Ying, Y.; Zhou, J. The role and research progress of inhibitor of differentiation 1 in atherosclerosis. DNA Cell Biol. 2022, 41, 71–79. [Google Scholar] [CrossRef]
  64. Wang, D.; Wang, D.; Huang, M.; Zheng, X.; Shen, Y.; Fu, B.; Zhao, H.; Chen, X.; Peng, P.; Zhu, Q.; et al. Transcriptomic characteristics and impaired immune function of patients who retest positive for SARS-CoV-2 rna. J. Mol. Cell Biol. 2021, 13, 748–759. [Google Scholar] [CrossRef] [PubMed]
  65. Wei, J.; Shi, Y.; Zou, C.; Zhang, H.; Peng, H.; Wang, S.; Xia, L.; Yang, Y.; Zhang, X.; Liu, J.; et al. Cellular id1 inhibits hepatitis b virus transcription by interacting with the novel covalently closed circular DNA-binding protein e2f4. Int. J. Biol. Sci. 2022, 18, 65–81. [Google Scholar] [CrossRef] [PubMed]
  66. Luo, Y.; Wang, G.; Ren, T.; Zhang, T.; Chen, H.; Li, Y.; Yin, X.; Zhang, Z.; Sun, Y. Screening of host genes regulated by id1 and id3 proteins during foot-and-mouth disease virus infection. Virus Res. 2021, 306, 198597. [Google Scholar] [CrossRef]
  67. Pattarabanjird, T.; Cress, C.; Nguyen, A.; Taylor, A.; Bekiranov, S.; McNamara, C. A machine learning model utilizing a novel snp shows enhanced prediction of coronary artery disease severity. Genes 2020, 11, 1446. [Google Scholar] [CrossRef]
  68. Valanti, E.-K.; Dalakoura-Karagkouni, K.; Fotakis, P.; Vafiadaki, E.; Mantzoros, C.S.; Chroni, A.; Zannis, V.; Kardassis, D.; Sanoudou, D. Reconstituted hdl-apoe3 promotes endothelial cell migration through id1 and its downstream kinases erk1/2, akt and p38 mapk. Metab. Clin. Exp. 2022, 127, 154954. [Google Scholar] [CrossRef] [PubMed]
  69. Baccarelli, A.A.; Byun, H.M. Platelet mitochondrial DNA methylation: A potential new marker of cardiovascular disease. Clin. Epigenet. 2015, 7, 44. [Google Scholar] [CrossRef] [Green Version]
  70. Guarnieri, J.W.; Dybas, J.M.; Fazelinia, H.; Kim, M.S.; Frere, J.; Zhang, Y.; Albrecht, Y.S.; Murdock, D.G.; Angelin, A.; Singh, L.N.; et al. Targeted down regulation of core mitochondrial genes during SARS-CoV-2 infection. bioRxiv 2022, 2022.2002.2019.481089. [Google Scholar]
  71. Hoque, M.N.; Khan, M.A.; Hossain, M.A.; Hasan, M.I.; Rahman, M.H.; Soliman, M.E.; Araf, Y.; Zheng, C.; Islam, T. Differential gene expression profiling reveals potential biomarkers and pharmacological compounds against SARS-CoV-2: Insights from machine learning and bioinformatics approaches. bioRxiv 2022, 2022, 3875. [Google Scholar] [CrossRef]
  72. Guo, C.; Wang, J.; Jing, L.; Ma, R.; Liu, X.; Gao, L.; Cao, L.; Duan, J.; Zhou, X.; Li, Y.; et al. Mitochondrial dysfunction, perturbations of mitochondrial dynamics and biogenesis involved in endothelial injury induced by silica nanoparticles. Environ. Pollut. 2018, 236, 926–936. [Google Scholar] [CrossRef]
  73. Vermorken, A.J.M.; Zhu, J.; Holvoet, P.; Cui, Y. The marker of cobalamin deficiency, plasma methylmalonic acid, may help identifying lysosomal iron trapping in patients. Its possible utility for heart failure. Redox Biol. 2021, 44, 102011. [Google Scholar] [CrossRef] [PubMed]
  74. Topol, E.J. COVID-19 can affect the heart. Science 2020, 370, 408–409. [Google Scholar] [CrossRef] [PubMed]
  75. Zhong, D.; Jia-wei, Z.; Yan-an, W. Current research progress of egfl7 in angiogenesis regulation. China J. Oral. Maxillofac. Surg. 2019, 17, 377. [Google Scholar]
  76. Heissig, B.; Salama, Y.; Takahashi, S.; Okumura, K.; Hattori, K. The multifaceted roles of egfl7 in cancer and drug resistance. Cancers 2021, 13, 1014. [Google Scholar] [CrossRef]
  77. Masoud, A.G.; Lin, J.; Azad, A.K.; Farhan, M.A.; Fischer, C.; Zhu, L.F.; Zhang, H.; Sis, B.; Kassiri, Z.; Moore, R.B. Apelin directs endothelial cell differentiation and vascular repair following immune-mediated injury. J. Clin. Investig. 2020, 130, 94–107. [Google Scholar] [CrossRef]
  78. Leng, L.; Cao, R.; Ma, J.; Mou, D.; Zhu, Y.; Li, W.; Lv, L.; Gao, D.; Zhang, S.; Gong, F.; et al. Pathological features of COVID-19-associated lung injury: A preliminary proteomics report based on clinical samples. Signal. Transduct. Target. Ther. 2020, 5, 240. [Google Scholar] [CrossRef]
  79. Sezer Zhmurov, C.; Timirci-Kahraman, O.; Amadou, F.Z.; Fazliogullari, O.; Basaran, C.; Catal, T.; Zeybek, U.; Bermek, H. Expression of egfl7 and mirna-126-5p in symptomatic carotid artery disease. Genet. Test. Mol. Biomark. 2016, 20, 125–129. [Google Scholar] [CrossRef]
  80. Li, L.; Zhao, Y.; Hu, Y.; Wang, X.; Jin, Q.; Zhao, Y. Recombinant egfl7 mitigated pressure overload-induced cardiac remodeling by blocking pi3k γ /akt/ nfκb signaling in macrophages. Front. Pharmacol. 2022, 13, 858118. [Google Scholar] [CrossRef] [PubMed]
  81. Small, E.M.; Frost, R.J.; Olson, E.N. Micrornas add a new dimension to cardiovascular disease. Circulation 2010, 121, 1022–1032. [Google Scholar] [CrossRef] [Green Version]
  82. Zhang, M.; Gu, H.; Xu, W.; Zhou, X. Down-regulation of lncrna malat1 reduces cardiomyocyte apoptosis and improves left ventricular function in diabetic rats. Int. J. Cardiol. 2016, 203, 214–216. [Google Scholar] [CrossRef] [PubMed]
  83. Hu, H.; Wu, J.; Yu, X.; Zhou, J.; Yu, H.; Ma, L. Long non-coding rna malat1 enhances the apoptosis of cardiomyocytes through autophagy inhibition by regulating tsc2-mtor signaling. Biol. Res. 2019, 52, 58. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  84. Vausort, M.; Wagner, D.R.; Devaux, Y. Long noncoding rnas in patients with acute myocardial infarction. Circ. Res. 2014, 115, 668–677. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  85. Puthanveetil, P.; Chen, S.; Feng, B.; Gautam, A.; Chakrabarti, S. Long non-coding rna malat1 regulates hyperglycaemia induced inflammatory process in the endothelial cells. J. Cell. Mol. Med. 2015, 19, 1418–1425. [Google Scholar] [CrossRef] [PubMed]
  86. Chen, F.; Li, W.; Zhang, D.; Fu, Y.; Yuan, W.; Luo, G.; Liu, F.; Luo, J. Malat1 regulates hypertrophy of cardiomyocytes by modulating the mir-181a/hmgb2 pathway. Eur. J. Histochem. 2022, 66, 3426. [Google Scholar] [CrossRef]
  87. Stanley, W.C.; Recchia, F.A.; Lopaschuk, G.D. Myocardial substrate metabolism in the normal and failing heart. Physiol. Rev. 2005, 85, 1093–1129. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  88. McCafferty, C.; Van Den Helm, S.; Letunica, N.; Attard, C.; Karlaftis, V.; Cai, T.; Praporski, S.; Swaney, E.; Burgner, D.; Neeland, M.; et al. Increased platelet activation in SARS-CoV-2 infected non-hospitalised children and adults, and their household contacts. Br. J. Haematol. 2021, 195, 90–94. [Google Scholar] [CrossRef] [PubMed]
  89. Dias, S.S.G.; Soares, V.C.; Ferreira, A.C.; Sacramento, C.Q.; Fintelman-Rodrigues, N.; Temerozo, J.R.; Teixeira, L.; Nunes da Silva, M.A.; Barreto, E.; Mattos, M.; et al. Lipid droplets fuel SARS-CoV-2 replication and production of inflammatory mediators. PLoS Pathog. 2020, 16, e1009127. [Google Scholar] [CrossRef]
  90. Glatz, J.F.C.; Wang, F.; Nabben, M.; Luiken, J.J.F.P. Cd36 as a target for metabolic modulation therapy in cardiac disease. Expert Opin. Ther. Targets 2021, 25, 393–400. [Google Scholar] [CrossRef]
  91. Zhang, X.; Fan, J.; Li, H.; Chen, C.; Wang, Y. Cd36 signaling in diabetic cardiomyopathy. Aging Dis. 2021, 12, 826–840. [Google Scholar] [CrossRef]
  92. Shu, H.; Peng, Y.; Hang, W.; Zhou, N.; Wang, D.W. Trimetazidine in heart failure. Front. Pharmacol. 2021, 11, 569132. [Google Scholar] [CrossRef]
  93. Bigotti, M.G.; Brancaccio, A. High degree of conservation of the enzymes synthesizing the laminin-binding glycoepitope of α-dystroglycan. Open Biol. 2021, 11, 210104. [Google Scholar] [CrossRef]
  94. Ribeiro, A.F.; Souza, L.S.; Almeida, C.F.; Ishiba, R.; Fernandes, S.A.; Guerrieri, D.A.; Santos, A.L.; Onofre-Oliveira, P.C.; Vainzof, M. Muscle satellite cells and impaired late stage regeneration in different murine models for muscular dystrophies. Sci. Rep. 2019, 9, 11842. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  95. Joseph, S.; Schnicker, N.J.; Xu, Z.; Yang, T.; Hopkins, J.; Watkins, M.; Chakravarthy, S.; Davulcu, O.; Anderson, M.E.; Venzke, D.; et al. Structure and mechanism of large1 matriglycan polymerase. bioRxiv 2022, 12, 491222. [Google Scholar]
  96. Katz, M.; Weinstein, J.; Eilon-Ashkenazy, M.; Gehring, K.; Cohen-Dvashi, H.; Elad, N.; Fleishman, S.J.; Diskin, R. Structure and receptor recognition by the lassa virus spike complex. Nature 2022, 603, 174–179. [Google Scholar] [CrossRef]
  97. Saito, F.; Kanagawa, M.; Ikeda, M.; Hagiwara, H.; Masaki, T.; Ohkuma, H.; Katanosaka, Y.; Shimizu, T.; Sonoo, M.; Toda, T.; et al. Overexpression of large suppresses muscle regeneration via down-regulation of insulin-like growth factor 1 and aggravates muscular dystrophy in mice. Hum. Mol. Genet. 2014, 23, 4543–4558. [Google Scholar] [CrossRef] [Green Version]
  98. Kanagawa, M. Dystroglycanopathy: From elucidation of molecular and pathological mechanisms to development of treatment methods. Int. J. Mol. Sci. 2021, 22, 13162. [Google Scholar] [CrossRef] [PubMed]
  99. D’Amario, D.; Amodeo, A.; Adorisio, R.; Tiziano, F.D.; Leone, A.M.; Perri, G.; Bruno, P.; Massetti, M.; Ferlini, A.; Pane, M.; et al. A current approach to heart failure in duchenne muscular dystrophy. Heart 2017, 103, 1770–1779. [Google Scholar] [CrossRef]
  100. Hiess, F.; Detampel, P.; Nolla-Colomer, C.; Vallmitjana, A.; Ganguly, A.; Amrein, M.; Ter Keurs, H.E.; Benítez, R.; Hove-Madsen, L.; Chen, S.W. Dynamic and irregular distribution of ryr2 clusters in the periphery of live ventricular myocytes. Biophys. J. 2018, 114, 343–354. [Google Scholar] [CrossRef] [Green Version]
  101. Ferrantini, C.; Coppini, R.; Scellini, B.; Ferrara, C.; Pioner, J.M.; Mazzoni, L.; Priori, S.; Cerbai, E.; Tesi, C.; Poggesi, C. R4496c ryr2 mutation impairs atrial and ventricular contractility. J. General. Physiol. 2016, 147, 39–52. [Google Scholar] [CrossRef] [Green Version]
  102. Gallo Marin, B.; Aghagoli, G.; Lavine, K.; Yang, L.; Siff, E.J.; Chiang, S.S.; Salazar-Mather, T.P.; Dumenco, L.; Savaria, M.C.; Aung, S.N.; et al. Predictors of COVID-19 severity: A literature review. Rev. Med. Virol. 2021, 31, 1–10. [Google Scholar] [CrossRef]
  103. Wei, J.; Xu, H.; Shi, L.; Tong, J.; Zhang, J. Trimetazidine protects cardiomyocytes against hypoxia-induced injury through ameliorates calcium homeostasis. Chem. -Biol. Interact. 2015, 236, 47–56. [Google Scholar] [CrossRef]
  104. Reiken, S.; Sittenfeld, L.; Dridi, H.; Liu, Y.; Liu, X.; Marks, A.R. Alzheimer’s-like signaling in brains of COVID-19 patients. Alzheimers Dement. 2022, 18, 955–965. [Google Scholar] [CrossRef]
  105. Zeng, Z.; Huang, N.; Zhang, Y.; Wang, Y.; Su, Y.; Zhang, H.; An, Y. Ctcf inhibits endoplasmic reticulum stress and apoptosis in cardiomyocytes by upregulating ryr2 via inhibiting s100a1. Life Sci. 2020, 242, 117158. [Google Scholar] [CrossRef] [PubMed]
  106. Kato, T.; Yamamoto, T.; Nakamura, Y.; Nanno, T.; Fukui, G.; Sufu, Y.; Hamada, Y.; Maeda, T.; Nishimura, S.; Ishiguchi, H.; et al. Correction of impaired calmodulin binding to ryr2 as a novel therapy for lethal arrhythmia in the pressure-overloaded heart failure. Heart Rhythm. 2017, 14, 120–127. [Google Scholar] [CrossRef] [PubMed]
  107. Acimovic, I.; Refaat, M.M.; Moreau, A.; Salykin, A.; Reiken, S.; Sleiman, Y.; Souidi, M.; Pribyl, J.; Kajava, A.V.; Richard, S.; et al. Post-translational modifications and diastolic calcium leak associated to the novel ryr2-d3638a mutation lead to cpvt in patient-specific hipsc-derived cardiomyocytes. J. Clin. Med. 2018, 7, 423. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  108. Jansen, J.; Reimer, K.C.; Nagai, J.S.; Varghese, F.S.; Overheul, G.J.; de Beer, M.; Roverts, R.; Daviran, D.; Fermin, L.A.S.; Willemsen, B.; et al. SARS-CoV-2 infects the human kidney and drives fibrosis in kidney organoids. Cell Stem Cell 2022, 29, 217–231.e8. [Google Scholar] [CrossRef] [PubMed]
  109. Li, S.; Zhao, F.; Ye, J.; Li, K.; Wang, Q.; Du, Z.; Yue, Q.; Wang, S.; Wu, Q.; Chen, H. Cellular metabolic basis of altered immunity in the lungs of patients with COVID-19. Med. Microbiol. Immunol. 2022, 211, 49–69. [Google Scholar] [CrossRef]
  110. Iliev, I.D.; Cadwell, K. Effects of intestinal fungi and viruses on immune responses and inflammatory bowel diseases. Gastroenterology 2021, 160, 1050–1066. [Google Scholar] [CrossRef]
  111. Liang, X. Investigating the protective contribution of plcg2 p522r variant in microglia-mediated immune pathways in alzheimer’s disease. Alzheimer’s Dement. 2022, 18, e062813. [Google Scholar] [CrossRef]
  112. Shaath, H.; Vishnubalaji, R.; Elkord, E.; Alajez, N.M. Single-cell transcriptome analysis highlights a role for neutrophils and inflammatory macrophages in the pathogenesis of severe COVID-19. Cells 2020, 9, 2374. [Google Scholar] [CrossRef]
  113. Yang, Q.; Lin, F.; Wang, Y.; Zeng, M.; Luo, M. Long noncoding rnas as emerging regulators of COVID-19. Front. Immunol. 2021, 12, 700184. [Google Scholar] [CrossRef]
  114. Huang, K.; Wang, C.; Vagts, C.; Raguveer, V.; Finn, P.W.; Perkins, D.L. Long non-coding rnas (lncrnas) neat1 and malat1 are differentially expressed in severe COVID-19 patients: An integrated single cell analysis. medRxiv 2021, 2021, 21254445. [Google Scholar] [CrossRef] [PubMed]
  115. Cremer, S.; Michalik, K.M.; Fischer, A.; Pfisterer, L.; Jae, N.; Winter, C.; Boon, R.A.; Muhly-Reinholz, M.; John, D.; Uchida, S.; et al. Hematopoietic deficiency of the long noncoding rna malat1 promotes atherosclerosis and plaque inflammation. Circulation 2019, 139, 1320–1334. [Google Scholar] [CrossRef] [PubMed]
  116. McDonald, J.T.; Enguita, F.J.; Taylor, D.; Griffin, R.J.; Priebe, W.; Emmett, M.R.; Sajadi, M.M.; Harris, A.D.; Clement, J.; Dybas, J.M.; et al. Role of mir-2392 in driving SARS-CoV-2 infection. Cell. Rep. 2021, 37, 109839. [Google Scholar] [CrossRef] [PubMed]
  117. Ait-Aissa, K.; Blaszak, S.C.; Beutner, G.; Tsaih, S.W.; Morgan, G.; Santos, J.H.; Flister, M.J.; Joyce, D.L.; Camara, A.K.S.; Gutterman, D.D.; et al. Mitochondrial oxidative phosphorylation defect in the heart of subjects with coronary artery disease. Sci. Rep. 2019, 9, 7623. [Google Scholar] [CrossRef] [Green Version]
  118. Holvoet, P.; Vanhaverbeke, M.; Bloch, K.; Baatsen, P.; Sinnaeve, P.; Janssens, S. Low mt-co1 in monocytes and microvesicles is associated with outcome in patients with coronary artery disease. J. Am. Heart Assoc. 2016, 5, e004207. [Google Scholar] [CrossRef] [PubMed]
  119. Gagliardi, S.; Poloni, E.T.; Pandini, C.; Garofalo, M.; Dragoni, F.; Medici, V.; Davin, A.; Visona, S.D.; Moretti, M.; Sproviero, D.; et al. Detection of SARS-CoV-2 genome and whole transcriptome sequencing in frontal cortex of COVID-19 patients. Brain Behav. Immun. 2021, 97, 13–21. [Google Scholar] [CrossRef]
  120. Wilkins, S.E.; Abboud, M.I.; Hancock, R.L.; Schofield, C.J. Targeting protein-protein interactions in the hif system. ChemMedChem 2016, 11, 773–786. [Google Scholar] [CrossRef] [Green Version]
  121. Ma, S.; Sun, S.; Li, J.; Fan, Y.; Qu, J.; Sun, L.; Wang, S.; Zhang, Y.; Yang, S.; Liu, Z.; et al. Single-cell transcriptomic atlas of primate cardiopulmonary aging. Cell Res. 2021, 31, 415–432. [Google Scholar] [CrossRef]
  122. Aryankalayil, M.J.; Martello, S.; Bylicky, M.A.; Chopra, S.; May, J.M.; Shankardass, A.; MacMillan, L.; Sun, L.; Sanjak, J.; Vanpouille-Box, C.; et al. Analysis of lncrna-mirna-mrna expression pattern in heart tissue after total body radiation in a mouse model. J. Transl. Med. 2021, 19, 336. [Google Scholar] [CrossRef]
  123. Saha, C.; Laha, S.; Chatterjee, R.; Bhattacharyya, N.P. Co-regulation of protein coding genes by transcription factor and long non-coding rna in SARS-CoV-2 infected cells: An in silico analysis. Non-Coding RNA 2021, 7, 74. [Google Scholar] [CrossRef]
  124. Zheng, J.; Tan, Q.; Chen, H.; Chen, K.; Wang, H.; Chen, Z.; Xi, Y.; Yin, H.; Lai, K.; Liu, Y. Lncrnasnhg7003 inhibits the proliferation, migration and invasion of vascular smooth muscle cells by targeting the mir13065p/sirt7 signaling pathway. Int. J. Mol. Med. 2021, 47, 741–750. [Google Scholar] [CrossRef] [PubMed]
  125. Pan, Z.; Fan, Z.; Ma, J.; Liu, H.; Shen, L.; He, B.; Zhang, M. Profiling and functional characterization of circulation lncrnas that are associated with coronary atherosclerotic plaque stability. Am. J. Transl. Res. 2019, 11, 3801–3815. [Google Scholar] [PubMed]
  126. Kyei, F.; Asante, D.-B.; Edekor, J.A.M.; Sarpong, E.; Gavor, E.; Konja, D. Down-regulation of id1 and id3 genes affects growth and survival of human umbilical vein endothelial cells (huvecs). J. Appl. Biol. Biotechnol. 2016, 4, 023–029. [Google Scholar]
  127. Zhang, N.; Subbaramaiah, K.; Yantiss, R.K.; Zhou, X.K.; Chin, Y.; Scherl, E.J.; Bosworth, B.P.; Benezra, R.; Dannenberg, A.J. Id1 expression in endothelial cells of the colon is required for normal response to injury. Am. J. Pathol. 2015, 185, 2983–2993. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  128. Nasser, M.I.; Masood, M.; Adlat, S.; Gang, D.; Zhu, S.; Li, G.; Li, N.; Chen, J.; Zhu, P. Mesenchymal stem cell-derived exosome microrna as therapy for cardiac ischemic injury. Biomed. Pharmacother. 2021, 143, 112118. [Google Scholar] [CrossRef]
  129. Huang, X.; Qu, R.; Ouyang, J.; Zhong, S.; Dai, J. An overview of the cytoskeleton-associated role of pdlim5. Front. Physiol. 2020, 11, 975. [Google Scholar] [CrossRef]
  130. Huang, J.; Cai, C.; Zheng, T.; Wu, X.; Wang, D.; Zhang, K.; Xu, B.; Yan, R.; Gong, H.; Zhang, J.; et al. Endothelial scaffolding protein enh (enigma homolog protein) promotes phlpp2 (pleckstrin homology domain and leucine-rich repeat protein phosphatase 2)-mediated dephosphorylation of akt1 and enos (endothelial no synthase) promoting vascular remodeling. Arterioscler. Thromb. Vasc. Biol. 2020, 40, 1705–1721. [Google Scholar] [CrossRef]
  131. Green, I.E.; Williams, S.R.; Sale, M.M.; Keene, K.L.; Worrall, B.B.; Southerland, A.M. Differential expression of phactr1 in atheromatous versus normal carotid artery tissue. J. Clin. Neurosci. 2020, 74, 265–267. [Google Scholar] [CrossRef]
  132. Zhang, Z.; Jiang, F.; Zeng, L.; Wang, X.; Tu, S. Phactr1 regulates oxidative stress and inflammation to coronary artery endothelial cells via interaction with nf-kappab/p65. Atherosclerosis 2018, 278, 180–189. [Google Scholar] [CrossRef]
  133. Adlam, D.; Olson, T.M.; Combaret, N.; Kovacic, J.C.; Iismaa, S.E.; Al-Hussaini, A.; O’Byrne, M.M.; Bouajila, S.; Georges, A.; Mishra, K.; et al. Association of the phactr1/edn1 genetic locus with spontaneous coronary artery dissection. J. Am. Coll. Cardiol. 2019, 73, 58–66. [Google Scholar] [CrossRef]
  134. Kanduc, D. Anti-SARS-CoV-2 immune response and sudden death: Titin as a link. Adv. Stud. Biol. 2021, 13, 37–44. [Google Scholar] [CrossRef]
  135. Itoh-Satoh, M.; Hayashi, T.; Nishi, H.; Koga, Y.; Arimura, T.; Koyanagi, T.; Takahashi, M.; Hohda, S.; Ueda, K.; Nouchi, T.; et al. Titin mutations as the molecular basis for dilated cardiomyopathy. Biochem. Biophys. Res. Commun. 2002, 291, 385–393. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  136. Zaunbrecher, R.J.; Abel, A.N.; Beussman, K.; Leonard, A.; von Frieling-Salewsky, M.; Fields, P.A.; Pabon, L.; Reinecke, H.; Yang, X.; Macadangdang, J.; et al. Cronos titin is expressed in human cardiomyocytes and necessary for normal sarcomere function. Circulation 2019, 140, 1647–1660. [Google Scholar] [CrossRef]
  137. Zolfaghari Emameh, R.; Nosrati, H.; Eftekhari, M.; Falak, R.; Khoshmirsafa, M. Expansion of single cell transcriptomics data of sars-cov infection in human bronchial epithelial cells to COVID-19. Biol. Proced. Online 2020, 22, 16. [Google Scholar] [CrossRef]
  138. Wang, Z.; Grange, M.; Pospich, S.; Wagner, T.; Kho, A.L.; Gautel, M.; Raunser, S. Structures from intact myofibrils reveal mechanism of thin filament regulation through nebulin. Science 2022, 375, eabn1934. [Google Scholar] [CrossRef] [PubMed]
  139. Haslbauer, J.D.; Zinner, C.; Stalder, A.K.; Schneeberger, J.; Menter, T.; Bassetti, S.; Mertz, K.D.; Went, P.; Matter, M.S.; Tzankov, A. Vascular damage, thromboinflammation, plasmablast activation, t-cell dysregulation and pathological histiocytic response in pulmonary draining lymph nodes of COVID-19. Front. Immunol. 2021, 12, 763098. [Google Scholar] [CrossRef]
  140. Oakley, R.H.; Cidlowski, J.A. Glucocorticoid signaling in the heart: A cardiomyocyte perspective. J. Steroid Biochem. Mol. Biol. 2015, 153, 27–34. [Google Scholar] [CrossRef] [Green Version]
  141. Zhao, Q.; Wu, K.; Li, N.; Li, Z.; Jin, F. Identification of potentially relevant genes for myocardial infarction using rna sequencing data analysis. Exp. Ther. Med. 2018, 15, 1456–1464. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Flow chart of the entire analysis process. Gene profile data for 104,182 cardiomyocytes and 22,438 vascular endothelial cells were analyzed. Each cell contained 29,071 gene expression levels. Genes were ordered according to the degree of correlation with COVID-19 using four feature selection methods. The obtained four ordered gene lists were fed into the incremental feature selection (IFS) method, where decision tree (DT) was employed as the classification algorithm. Finally, the best feature subsets and classification rules were extracted according to the IFS results using DT.
Figure 1. Flow chart of the entire analysis process. Gene profile data for 104,182 cardiomyocytes and 22,438 vascular endothelial cells were analyzed. Each cell contained 29,071 gene expression levels. Genes were ordered according to the degree of correlation with COVID-19 using four feature selection methods. The obtained four ordered gene lists were fed into the incremental feature selection (IFS) method, where decision tree (DT) was employed as the classification algorithm. Finally, the best feature subsets and classification rules were extracted according to the IFS results using DT.
Life 13 01011 g001
Figure 2. IFS curves for evaluating the performance of the decision tree under different numbers of top features in four lists. Four feature lists were generated by four feature selection methods, corresponding to four IFS curves (cardiomyocytes). (A) IFS curves based on the feature list yielded by LASSO. (B) IFS curves based on the feature list yielded by LightGBM. (C) IFS curves based on the feature list yielded by MCFS. (D) IFS curves based on the feature list yielded by RF.
Figure 2. IFS curves for evaluating the performance of the decision tree under different numbers of top features in four lists. Four feature lists were generated by four feature selection methods, corresponding to four IFS curves (cardiomyocytes). (A) IFS curves based on the feature list yielded by LASSO. (B) IFS curves based on the feature list yielded by LightGBM. (C) IFS curves based on the feature list yielded by MCFS. (D) IFS curves based on the feature list yielded by RF.
Life 13 01011 g002
Figure 3. IFS curves for evaluating the performance of the decision tree under different numbers of top features in four lists. Four feature lists were generated by four feature selection methods, corresponding to four IFS curves (vascular endothelial cells). (A) IFS curves based on the feature list yielded by LASSO. (B) IFS curves based on the feature list yielded by LightGBM. (C) IFS curves based on the feature list yielded by MCFS. (D) IFS curves based on the feature list yielded by RF.
Figure 3. IFS curves for evaluating the performance of the decision tree under different numbers of top features in four lists. Four feature lists were generated by four feature selection methods, corresponding to four IFS curves (vascular endothelial cells). (A) IFS curves based on the feature list yielded by LASSO. (B) IFS curves based on the feature list yielded by LightGBM. (C) IFS curves based on the feature list yielded by MCFS. (D) IFS curves based on the feature list yielded by RF.
Life 13 01011 g003
Figure 4. Venn diagram of four subsets of important genes identified by LASSO, LightGBM, MCFS, and RF (cardiomyocytes). Overlapping circles indicate the number of genes identified by multiple feature selection methods.
Figure 4. Venn diagram of four subsets of important genes identified by LASSO, LightGBM, MCFS, and RF (cardiomyocytes). Overlapping circles indicate the number of genes identified by multiple feature selection methods.
Life 13 01011 g004
Figure 5. Venn diagram of four subsets of important genes identified by LASSO, LightGBM, MCFS, and RF (vascular endothelial cells). Overlapping circles indicate the number of genes identified by multiple feature selection methods.
Figure 5. Venn diagram of four subsets of important genes identified by LASSO, LightGBM, MCFS, and RF (vascular endothelial cells). Overlapping circles indicate the number of genes identified by multiple feature selection methods.
Life 13 01011 g005
Table 1. Performance of the best classifiers constructed for cardiomyocytes and vascular endothelial cells using four lists.
Table 1. Performance of the best classifiers constructed for cardiomyocytes and vascular endothelial cells using four lists.
Cell TypeFeature Selection MethodNumber of FeaturesF1 MeasureMatthew Correlation CoefficientAccuracy
CardiomyocytesLeast absolute shrinkage and selection operator200.9750.9720.994
Light gradient boosting machine200.9830.9800.996
Monte Carlo feature selection15000.9750.9720.994
Random forest3400.9760.9730.994
Vascular endothelial cellsLeast absolute shrinkage and selection operator600.9230.8510.925
Light gradient boosting machine800.9480.9000.950
Monte Carlo feature selection1200.9490.9020.951
Random forest600.9450.8950.947
Table 2. Top five genes identified by the computational workflow in vascular endothelial cells and cardiomyocytes.
Table 2. Top five genes identified by the computational workflow in vascular endothelial cells and cardiomyocytes.
Cell TypeEnsembl IDGene SymbolDescription
Vascular endothelial cellsENSG00000251562MALAT1metastasis associated lung adenocarcinoma transcript 1
ENSG00000125968ID1inhibitor of DNA binding 1
ENSG00000117318ID3inhibitor of DNA binding 3
ENSG00000198804MT-CO1mitochondrially encoded cytochrome c oxidase I
ENSG00000172889EGFL7EGF like domain multiple 7
CardiomyocytesENSG00000251562MALAT1metastasis associated lung adenocarcinoma transcript 1
ENSG00000135218CD36CD36 molecule
ENSG00000133424LARGE1LARGE xylosyl- and glucuronyltransferase 1
ENSG00000198626RYR2ryanodine receptor 2
ENSG00000197943PLCG2phospholipase C gamma 2
Table 3. Representative rules generated in cardiac cells to identify patients with COVID-19.
Table 3. Representative rules generated in cardiac cells to identify patients with COVID-19.
Cell TypeRulesParametersPredicted Class
Vascular endothelial cellsRule 0ENSG00000251562 (MALAT1) ≤ 5.9107COVID-19 patients
ENSG00000198804 (MT-CO1) ≤ 1.6645
ENSG00000124440 (HIF3A) ≤ 2.2516
ENSG00000233016 (SNHG7) ≤ 1.9428
Rule 1ENSG00000251562 (MALAT1) > 6.0103People without SARS-CoV-2 infection
ENSG00000125968 (ID1) > 2.7774
ENSG00000163110 (PDLIM5) ≤ 0.4622
ENSG00000112137 (PHACTR1) ≤ 1.4398
CardiomyocytesRule 0ENSG00000251562 (MALAT1) ≤ 5.0148COVID-19 patients
ENSG00000155657 (TTN) ≤ 5.0027
Rule 1ENSG00000161671 (EMC10) ≤ 3.9595People without SARS-CoV-2 infection
ENSG00000183091 (NEB) ≤ 3.9691
ENSG00000107317 (PTGDS) > 0.6654
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xu, Y.; Ma, Q.; Ren, J.; Chen, L.; Guo, W.; Feng, K.; Zeng, Z.; Huang, T.; Cai, Y. Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells. Life 2023, 13, 1011. https://doi.org/10.3390/life13041011

AMA Style

Xu Y, Ma Q, Ren J, Chen L, Guo W, Feng K, Zeng Z, Huang T, Cai Y. Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells. Life. 2023; 13(4):1011. https://doi.org/10.3390/life13041011

Chicago/Turabian Style

Xu, Yaochen, Qinglan Ma, Jingxin Ren, Lei Chen, Wei Guo, Kaiyan Feng, Zhenbing Zeng, Tao Huang, and Yudong Cai. 2023. "Using Machine Learning Methods in Identifying Genes Associated with COVID-19 in Cardiomyocytes and Cardiac Vascular Endothelial Cells" Life 13, no. 4: 1011. https://doi.org/10.3390/life13041011

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop