SGAEMDA: Predicting miRNA-Disease Associations Based on Stacked Graph Autoencoder

MicroRNA (miRNA)-disease association (MDA) prediction is critical for disease prevention, diagnosis, and treatment. Traditional MDA wet experiments, on the other hand, are inefficient and costly.Therefore, we proposed a multi-layer collaborative unsupervised training base model called SGAEMDA (Stacked Graph Autoencoder-Based Prediction of Potential miRNA-Disease Associations). First, from the original miRNA and disease data, we defined two types of initial features: similarity features and association features. Second, stacked graph autoencoder is then used to learn unsupervised low-dimensional representations of meaningful higher-order similarity features, and we concatenate the association features with the learned low-dimensional representations to obtain the final miRNA-disease pair features. Finally, we used a multilayer perceptron (MLP) to predict scores for unknown miRNA-disease associations. SGAEMDA achieved a mean area under the ROC curve of 0.9585 and 0.9516 in 5-fold and 10-fold cross-validation, which is significantly higher than the other baseline methods. Furthermore, case studies have shown that SGAEMDA can accurately predict candidate miRNAs for brain, breast, colon, and kidney neoplasms.


Introduction
MicroRNA (miRNA) is a single-stranded small molecule RNA with a length of about 19-25 nucleotides that is encoded by endogenous genes [1,2]. MiRNAs are linked to and play a crucial part in many vital human body processes, such as cell proliferation, differentiation, immunity, and metabolism [3]. As a result, miRNAs have received increased attention, particularly in the field of associations between miRNAs and complex human diseases. Overexpression and downregulation of miRNA expression in humans have been linked to a variety of complex diseases, according to research [4,5]. Upregulation of miR-17-5p expression, for example, has a greater effect on pancreatic cancer cell proliferation and significantly increases the number of invading cells [6]. When compared to normal breast tissue, abnormal expression of miRNAs such as mir-125b, mir-145, mir-21, and mir-155 causes human breast cancer [7]. Cressatti et al. [8] discovered that miR-153 and miR-223 could be used as biomarkers for Parkinson's disease (PD) diagnosis through paired regulation of α-synuclein. MiR-34, miR-124a, -146, miR-187, miR-199a-5p, miR-203, miR-210, and miR-383 dysregulation all have a negative impact on pancreatic β-cell viability and function, which leads to uncontrolled proliferation of insulin-secreting cells and the development of diabetes [9,10]. In conclusion, miRNAs have been shown to be inextricably linked to the emergence of many human complex diseases, making the prediction of potential miRNA-disease association (MDA) a promising area of research. It can help researchers comprehend the pathological mechanisms of complex diseases, which can be beneficial in both the treatment and diagnosis of complex diseases.
Traditional biological wet experiments, such as anchored polymerase chain reaction and reverse transcription polymerase chain reaction, were used in the early years to identify the relationship between miRNAs and diseases, but they all have drawbacks such as complicated experiments, long time periods, and high costs [11][12][13]. Several studies in the field of bioinformatics have been developed in recent years, such as drug-drug interactions [14], drug-target interactions [15], lncRNA-disease association prediction [16], and lncRNA-miRNA interaction [17]. Each of these studies has added to our understanding of computational approaches for predicting miRNA-disease connections. Many superior computational methods for predicting potential miRNA-disease associations have been proposed as more biological data sets have been collected, which not only saves significant money and time but also provides researchers with a new perspective to further validate the predicted potential associations. These MDA prediction computational approaches can be roughly categorized into three categories [18]: machine learning-based prediction models, deep learning-based prediction models, and matrix transformation-based prediction models.
Machine learning has been widely applied in all areas, and numerous machine learning models for predicting MDA have produced positive results. As there are not enough known miRNA-disease connections, existing prediction models perform poorly, Zhou et al. [19] presented a new model combining gradient boosting decision tree and logistic regression (GBDT-LR) to rank miRNA candidates for diseases. The model can extract features and then score them using logistic regression. Peng et al. [20] proposed a new prediction model called Ensemble of Kernel Ridge Regression-based MiRNA-Disease Association prediction (EKRRMDA), which used KRR to build two classifiers in miRNA space and disease space, respectively, and combined them with ensemble learning to improve model prediction accuracy. Liu et al. [21] created a computational model for the SMALF by learning potential features from the original miRNA-disease association matrix and then predicting unknown miRNA-disease associations using XGBoost. Tang et al. [22] developed an ensemble learning method (PMDFI) based on higher-order feature interactions to predict potential miRNA-disease associations. It uses stacked autoencoders to learn higher-order features from the similarity matrix and then uses an integrated model combining multiple random forests with logistic regression to predict an association. Liu et al. [23] proposed an autoencoder-based deep forest ensemble learning model (DFELMDA), which was further validated through case studies of colon, breast, and lung tumors with varying disease types. Both PMDFI and DFELMDA use automatic encoders, but as they do not consider graph structure information, they cannot learn the miRNA and disease feature representation well. Although machine learning-based methods have demonstrated good performance, they typically require domain knowledge to build sample features.
With the advent of Deep Learning, many methods of end-to-end computing have been developed, and this novel prediction method predicts better than earlier traditional machine learning methods. Xuan et al. [24] developed CNNMDA, a deep learning method that uses two convolutional neural networks to efficiently learn the potential relationship between miRNAs and diseases (CNN). Li et al. [25] created a GAEMDA model that takes miRNA and disease similarity as feature information, aggregates it using a graph neural networkbased encoder to generate a low-dimensional representation of the nodes, and finally predicts it using a bilinear decoder. Zhou et al. [26] proposed a deep self-coding multicore learning approach (DAEMKL) the following year, which uses multicore learning to build miRNA-disease heterogeneous networks and then uses regression models to learn their feature representations. Li et al. [27] designed a computational framework based on graph attention network fusion of multi-source information (GATMDA). It utilized the graph attention network to aggregate information from neighbors with different weights to extract nonlinear features of diseases and miRNAs, and then predicted MDA by efficiently fusing linear and nonlinear features of diseases and miRNAs through a random forest algorithm. Han et al. [28] proposed that LAGCN build a heterogeneous network by integrating miRNA similarity, disease similarity, and miRNA-disease association information, and then use the attention mechanism to synthesize multiple CNNs to learn miRNA and disease embedding. Although deep learning-based methods can learn feature representations automatically and improve model prediction performance to some extent, they require a large number of training samples and do not incorporate graph structure information, making it difficult to capture neighborhood information in the network.
Furthermore, in recent years, several MDA prediction algorithms based on matrix transformation have appeared. Yu et al. [29] proposed a prediction model based on matrix completion and label propagation (MCLPMDA). It used matrix completion to reconstruct a new miRNA and disease similarity matrix based on the miRNA-disease association matrix, and then used the label propagation algorithm to predict MDA. Gao et al. [30] proposed the Nearest Profile-based Collaborative Matrix Factorization (NPCMF) algorithm, which uses L2,1-norm to complete the unknown association, using miRNA and disease nearest neighbor information to construct similarity functions and thus find new MDAs. Chen et al. [31] proposed the neighborhood constraint matrix completion algorithm (NCMCMDA), which combined neighborhood constraints with matrix completion for assisted prediction before transforming the prediction task into an optimization problem that could be solved by a rapid iterative algorithm. Yin et al. [32] created a new computational model called Logistic Weighted Profile-based Collaborative Matrix Factorization by combining two methods, weighted profile and collaborative matrix factorization (LWPCMF). The findings show that LWPCMF can accurately predict potential MDA. Although the matrix transformation-based method overcomes the problem of feature representation using vectors in high-dimensional space, its results are highly dependent on the initial solution selection, and it often fails to converge, which is time-consuming.
Although the models presented above predicted MDA well, they do have certain limitations. In recent years, autoencoders have been widely used in various fields [33,34] to efficiently learn the feature representation of miRNAs and diseases without losing the graph structure topology information, we propose a stacked graph autoencoder-based miRNAdisease association prediction algorithm (SGAEMDA), as shown in Figure 1. All miRNA features were then concatenated with disease features as miRNA-disease pair features. We employed 5-fold and 10-fold cross-validation to evaluate the prediction performance of our method. As a consequence, the AUCs of SGAEMDA in 5-fold and 10-fold cross-validation were 0.9585 and 0.9616, respectively, much higher than the other baseline methods. In addition, to demonstrate SGAEMDA's performance, we conducted case studies on brain neoplasms, breast neoplasms, colon neoplasms, and kidney neoplasms. According to the findings, the bulk of our predicted possible miRNA-disease associations were verified by the dbDEMC and miRCancer databases. This paper's significant contributions are summarized as follows.
(1) We integrated both association information and similarity information to construct the initial features and could better learn the potential information in miRNA-disease pairs. (2) We propose a stacked graph autoencoder prediction framework. Unlike previous stacked autoencoders, which used layer-by-layer training, the stacked graph autoencoder uses multi-layer collaborative unsupervised training. It is capable of effectively extracting potential, deep, and unknown feature information from the similarity network to compensate for the shortcomings of previous models' prediction results, which are biased toward miRNAs and diseases with known associations. (3) We use a multilayer perceptron (MLP) for prediction of the final results, which has high fault tolerance and can learn feature information from miRNA-disease pairs rapidly and efficiently to improve model prediction performance.

Datasets for MDA Prediction
The Human miRNA-disease association dataset we used was downloaded from the HMDDv2.0 database [35]. It contains 5430 known associations of 383 complex diseases and 495 miRNAs, and the rest are unknown associations. In the follow-up experiments, we used a binary adjacency matrix A with n m rows and n d columns to storage all known and unknown associations. Where n m and n d are the number of miRNAs and diseases in this dataset, respectively. Specifically, this binary association matrix A is defined as follows: Wang et al. [36] proposed a method to measure miRNA functional similarity and a method to construct miRNA functional similarity networks based on the hypothesis that functionally similar miRNAs are often associated with similar diseases. The functional similarity information of miRNAs can be obtained from http://www.cuilab.cn/files/images/ cuilab/misim.zip (accessed on 23 May 2022). Then, based on the obtained information, we built the miRNA functional similarity matrix MFS with n d rows and n d columns. Where MFS(m i , m j ) denotes the functional similarity score between miRNA m i and miRNA m j .

Disease Semantic Similarity
Based on a previous study [37], disease semantic similarity was obtained based on statistics disease ontology information. Specifically, all disease semantic similarities can be calculated using medical subject headings (MeSH), where each disease d i can be described by several directed acyclic graphs (DAGs). The directed acyclic graph can be defined as where d i denotes a specific disease, T(d i ) denotes the set containing the disease node d i and all its ancestor nodes, and E(d i ) denotes the set of corresponding edges. According to the constructed directed acyclic graph of disease d i , we can calculate the semantic contribution value of disease d k to disease d i as follows: where δ is the semantic contribution decay factor and based on a previous study, we set δ to 0.5. We can then calculate the semantic value of the disease d i .
Based on the assumption that the more the overlapping parts of the DAGs of two diseases are, the more similar they are. We can calculate the disease semantic similarity between diseases d i and d j , and define it as follows: where DS 1 is for storing the semantic similarity of the first kind of diseases. However, the above calculation method has a disadvantage in that it does not account for the different contributions of two diseases in the same layer of the DAG, and the disease with a low frequency should contribute more than the disease with a high frequency. As a result, we developed a second semantic similarity model. Specifically, we can calculate the semantic contribution value of disease d k to disease d i as follows: the number of DAGs including d k the number of diseases .
Likewise, we can obtain the semantic value of disease d i : Based on the previously mentioned assumptions, we can calculate the second kind of disease semantic similarity between diseases d i and d j , which is defined as follows: where DS 2 is for storing the second kind of disease semantic similarity. To obtain the sound disease semantic similarity, we combined the two types of disease semantic similarity to obtain the final disease semantic similarity, and the final disease semantic similarity between diseases d i and d j can be calculated according to the following equation:

Gaussian Interaction Profile Kernel Similarity of miRNAs and Diseases
Inspired by past studies [38], based on the hypothesis that functionally similar miRNAs may be associated with phenotypically similar diseases. We used Gaussian spectral kernel similarity to calculate the similarity between each pair of miRNAs and between each pair of diseases, which in turn complements the similarity information of miRNAs and diseases. Specifically, the Gaussian interaction profile kernel similarity between miRNAs m i and m j was calculated as follows: where the parameter γ m controls the kernel bandwidth, which can be obtained based on the hyperparameter γ m normalized by the average number of interactions for each miRNA. According to previous studies, γ m is set to 1. For diseases, similar to miRNAs, the Gaussian interaction profile kernel similarity between diseases d i and d j is calculated as follows: where,γ d is set to 1.

Integration of miRNAs and Diseases Similarity
Considering that some miRNAs have no function similarity to each other, and similarly, some diseases have no semantic similarity to each other, this can lead to a large number of sparse values in the miRNA function similarity matrix and disease semantic similarity matrix. To solve the above problem, we define the integrated similarity between miRNA m i and m j and the integrated similarity between diseases d i and d j by integrating the Gaussian interaction profile kernel similarity obtained from prior calculations as follows:

SGAEMDA
To predict the potential association of miRNAs with diseases, we propose the stacked graph autoencoder miRNA-disease association prediction model (SGAEMDA) in this study. To successfully extract potential information in the similarity network and forecast miRNA-disease associations, the model integrates a graph convolutional network-based autoencoder with a multilayer perceptron. SGAEMDA is typically comprised of the following steps: (1) Construct initial features. (2) Pre-train stacked graph autoencoder to extract miRNA and disease similarity potential features. (3) Concatenate potential features and association features. (4) Predict miRNA-disease.
(1) Construct initial features We construct the initial features of miRNAs and diseases from two different perspectives: association information and similarity information. First, for the miRNA-disease association matrix A, each row can be regarded as the association feature of miRNA and each column as the association feature of disease. For the miRNA integrative similarity matrix S m and the disease integrative similarity matrix S d , each row of S m can be regarded as the similarity feature of miRNA, and each row of S d can be regarded as the similarity feature of disease. Specifically, the two initial feature vectors of miRNAs and diseases are shown as follows: where ∈ {1, 2}, when = 1, F 1 φ denotes the association feature of miRNA or disease, and when = 2, F 2 φ denotes the functional similarity feature of miRNA or semantic similarity feature of disease. φ ∈ {m, d}, φ = m represents miRNA features and φ = d represents disease features, and n m1 , n d1 , n m2 , n d2 are the number of columns and rows of A, the number of columns of S m , and the number of columns of S d , i.e., 383, 495, 495, and 383, respectively.
(2) Pre-train stacked graph autoencoder Referring to a previous study [39], graph autoencoder can learn the low-dimensional feature representation of graph nodes to find the appropriate embedding. Since the information in the similarity features of miRNAs and diseases is high-dimensional, this could affect the prediction accuracy of the prediction model. We propose the stacked graph autoencoder to extract the low-dimensional similarity potential features from it, which has a stronger feature extraction ability than the traditional graph autoencoder. The graph autoencoder is particularly suitable for datasets with large numbers of unlabeled data and small numbers of labeled data due to its unsupervised training method. Specifically, the encoder and decoder for each layer of the autoencoder are defined as follows: and where A, Y, W denote the adjacency matrix, feature matrix of the node, and the learnable parameter matrix. Therefore, the feature representation of miRNA, Z l m can be learned by the above encoder-decoder structure as follows: and where l denotes the number of layers of the graph autoencoder, l = 1, 2 . . . L, Z l m denotes the low-dimensional feature representation learned by the lth layer of the graph autoencoder, when 1 = 1, Z 0 m , i.e., F 2 m , X l m denotes the miRNA feature representation reconstructed by the lth layer of the autoencoder, and A m denotes the Laplace-normalized miRNA adjacency matrix. The formula is as follows: where D m is the degree matrix of miRNA-integrated similarity matrix S m . Similarly, we learn the low-dimensional feature representation Z l d of the disease by the stacked graph autoencoder of the same architecture as follows: and where Z l d denotes the low-dimensional feature representation learned by the lth layer graph autoencoder, when l = 1, Z 0 d , i.e., F 2 d , X l d denotes the disease feature representation reconstructed by the lth autoencoder, and A d denotes the Laplace-normalized adjacency matrix of the disease. The formula is as follows: In this study, SGAE is constructed by stacking three graph autoencoders, i.e., L = 3. Specifically, the feature representation generated by the first graph autoencoder is taken as input to the second autoencoder, which generates another feature representation of lower dimensionality, and so on, until L graph autoencoders are constructed. Multiple graph autoencoders are trained collaboratively based on the reconstruction loss function to generate the final low-dimensional similarity feature representations of miRNA and disease, Z L m and Z L d , with the following equations: (

3) Concatenate potential features and association features
We set the final embedding dimension to 64 in pre-training, and the training obtained a low-dimensional similarity representation of all miRNAs and diseases, denoted as Z L m , Z L d , respectively. To include more potential information in the feature representations of miRNAs and diseases, we concatenated Z L m and Z L d with the association feature F 1 m of miRNAs and the association feature F 1 d of diseases, respectively, and finally obtained a 447-dimensional miRNA embedding and a 559-dimensional disease embedding, as follows: and where V m denotes the final embedding of miRNA and V d denotes the final embedding of disease.
(4) Predict miRNA-disease association by multilayer perceptron After obtaining the embedding of miRNAs and diseases, we concatenate the embedding V m i for each miRNA and V d j for each disease to form our complete dataset X, where X ∈ R (495 * 383)×(447+559) , as follows: where X ij denotes the characteristics of miRNA-disease pairs of miRNA m i and disease d j .
Then, we used a multilayer perceptron (MLP) to score the final miRNA-disease association for prediction, as follows: where l ∈ [1, 2] denotes the number of layers of the hidden layer, X l denotes the output of the lth hidden layer, and W l , b l are the learnable parameter matrix and bias of the lth hidden layer, respectively.ŷ ij is the prediction score of the final miRNA-disease pair. Finally, the model is trained by minimizing the error of the Binary Cross-Entropy Loss function: where (i, j) denotes the pair for miRNA m i and disease d j .y + and y − subtables denote the positive and negative sample sets. N denotes the number of all miRNA-disease pairs in the positive and negative sample sets.

Experiment Details
In our experiments, the SGAEMDA model is implemented based on the pytorch framework and the scikit-learn framework. The Adam optimizer is adopted to minimize the loss function both during the pre-training process and the MLP training process. Due to the significant imbalance of positive and negative samples in the database of HMDDv2.0, the number of known miRNA-disease associations is 5430 (positive samples), and the rest of the 184,155 pairs are unknown associations (negative samples), and the number of negative samples is about 34 times the positive samples. In order to have good robustness of our model, we randomly selected negative samples equal to the positive samples for MLP training, and randomly selected 10 times in the subsequent experiments to ensure the reliability of our experiments. Our source code of HSSG is available online: https://github.com/Lynn0424/SGAEMDA (accessed on 5 December 2022).

Evaluation Metrics
The area under the receiver operating characteristic curve (AUC) and area under precision-recall curve (AUPR) were our main metrics to evaluate the overall model performance. In classification problems, AUC is an essential method to evaluate the overall performance of a model, and for unbalanced data sets, AUPR can evaluate the model better than AUC. In order to be more comprehensive in evaluating the performance of the SGAEMDA model, we also used several common evaluation metrics such as accuracy (Acc), precision (Pre), recall (Rec), and F1-score. Several metrics are calculated as follows: where TP, TN, FP, FN denote true positive, false negative, false positive, and true negative, respectively.

Prediction of miRNA-Disease Association Based on SGAEMDA
To obtain reliable experimental results of the model, we performed 5-fold crossvalidation and 10-fold cross-validation to evaluate the model performance of SGAEMDA. In 5-fold CV (10-fold CV), all the training samples are randomly divided into 5 (10) subsets of approximately the same number, 4 (9) of them are chosen for training and the remaining 1 is chosen for testing, and the process is repeated until all the subsets have been used for the test set, and finally the obtained results are averaged as the final result. Figures 2 and 3 show the ROC curves and PR curves for the 5-fold CV and 10-fold CV and the area under their curves. It can be seen that our model has an AUC above 0.95 for both 5-fold CV and 10-fold CV, indicating the effectiveness of the model in predicting the potential miRNA-disease association and implying that the model performance is not affected by the amount of training data and test data in cross-validation. Table 1

Effect of Similarity Feature Dimensions
To further illustrate the effect of the final dimensionality of the similarity features on the model prediction performance, we set the dimensionality of the similarity features learned by the stacked graph autoencoder to 16,32,64,128,256 for comparison experiments, and calculate their AUC and AUPR, respectively. The experimental results are shown in Figure 4, and both AUC and AUPR reach the highest value when the dimension is 64. Therefore, we set the final learned similarity feature dimension to 64. In addition, we can infer that if the dimension is too small, it cannot fully learn the similarity information; while if the dimension is too large, there may be original redundant and noisy information, leading to lower model performance.

Effect of Stacked Graph Autoencoder Pre-Training
In SGAEMDA, to verify the validity of our proposed stacked graph autoencoder for miRNA-disease potential association prediction. We designed three groups of experiments. The first one uses only the potential similarity features Z L m and Z L d obtained by pre-training and uses them directly as the final embedding of miRNAs and diseases for prediction, denoted as only-pre-training. The second group is a direct concatenation of the original similarity features F 2 m and F 2 d and association features F 1 m and F 1 d for prediction without using stacked graph autoencoder, denoted as non-pre-training. The third group uses only the original association features F 1 m and F 1 d to predict the potential association, which is denoted as only-original feature. The fourth group of experiments uses pre-trained features Z L m and Z L d and association features F 1 m and F 1 d , i.e., the SGAEMDA model. Figure 5 and Table 2 show the prediction results of the four models. We can see that the SGAEMDA model is only slightly lower than the only-original feature model in Recall, but reaches the highest value in all the rest of the metrics. AUC and AUPR are more reflective of the overall performance of the model, so integrating the features learned by stacked autoencoder and association features can enable the model to achieve better performance.

Comparison of Different Classifier Models
In the SGAEMDA model, we used a multilayer perceptron (MLP) classifier to predict the potential miRNA-disease association. To confirm the reasonability of our adopted MLP, we used cross-validation with the same dataset for comparison with four common classifier models, which are random forest (RF), support vector machine (SVM), K-nearest neighbor (KNN), and XGBoost algorithm. We refer to the Liu et al. [21] proposed model to select the best parameters for different classifiers. In the RF algorithm, we set the maximum depth of the tree to 10, the maximum features to 100, and the rest of the parameters to default values. In the SVM algorithm, we use the RBF kernel and set C to 50. In the XGBoost algorithm, we set the number of trees to 1000, the learning rate to 0.1, and the rest of the parameters to their default values. For the KNN classifier, we performed a parameter sensitivity analysis and finally set the K value to 4, the p-value to 2, and the rest of the parameters to their default values. Table 3 shows the prediction performance of these classifiers. It can be seen that SGAEMDA achieves the highest results in four of the five evaluated metrics, and only in the accuracy rate it is 2.07% lower than the KNN classifier. However, in terms of potential association prediction, AUC and AUPR are more likely to show the overall model performance. Therefore, we selected MLP as our final classifier.

Comparisons with Existing SOTA Methods
To further prove the predictive performance of our proposed SGAEMDA model, we compare it with nine state-of-the-art existing computational models, namely LAGCN [28], GBDT-LR [19], EKRRMDA [20], MCLPMDA [29], GAEMDA [25], PMDFI [22], SMALF [21], DAEMKL [26], and DFELMDA [23]. Since the AUC values provide a comprehensive measure of the overall predictive performance of the models, we selected the AUC as a metric to evaluate the performance of these models (all AUC values were selected from their papers by taking their best values). In addition, the above models are all evaluated based on HMDDv2.0 on the five-fold cross-validation basis. Table 4 shows the comparative results of the models. From the table, we see that SGAEMDA achieved the highest AUC value among the 10 models, which is 3.3% higher than the second-best model (DFELMDA). In conclusion, SGAEMDA has very good results in predicting potential miRNA-disease associations.

Case Studies
We selected four neoplastic diseases as case studies: brain neoplasms (Table 5), breast neoplasms (Table 6), colon neoplasms (Table 7), and kidney neoplasms (Table 8). Specifically, there are 5430 known miRNA-disease associations in the HMDDv2.0 database, while the remaining 184,155 associations are unknown. The known associations obtained from the database were used as the training set for SGAEMDA, and then we prioritized the candidate miRNAs for several neoplasms based on the prediction scores and selected the top 20 candidate miRNAs. We verified the predicted experimental results one by one by using the dbDEMCv3.0 database [40] and the miRCancer database [41] as validation sets.
Brain neoplasms are defined as a neoplasm growing in the cranial cavity, also known as brain cancer and intracranial neoplasm. They are generally divided into two categories: primary and secondary [42]. Statistics show that the incidence of brain neoplasm has been increasing in recent years, and brain neoplasm accounts for about 5% of the total body neoplasms, other malignant neoplasms in the body have a 20-30% probability to metastasize into the skull, once the neoplasm occupies a certain space in the skull, regardless of benign or malignant neoplasm, it will endanger the life of patients. According to statistics, the incidence of brain neoplasms has been increasing in recent years. Brain neoplasms account for about 5% of the whole-body neoplasms, and all other malignant tumors in the body have a 20-30% chance of metastasizing to the skull. Therefore, a research priority was given to investigate miRNAs that may be associated with brain cancer. The results are shown in Table 5. Among the top 20 miRNAs associated with brain cancer, 19 of them are confirmed by dbDEMC or miRCancer. It is estimated that breast neoplasms account for 7-10% of all malignant tumors in the body. Its incidence is generally associated with genetics and is higher in women between 40-60 years of age [43,44]. Thus, the discovery of potential miRNAs associated with breast neoplasms provides direction for the treatment and diagnosis of breast neoplasms. The results are shown in Table 6 and all 20 of the predicted miRNAs associated with breast cancer are confirmed. Colon cancer, also known as colorectal cancer, is a malignant neoplasm of the gastrointestinal tract that occurs in the colon area. The incidence of colon neoplasms is statistically second only to gastric and esophageal cancers [45]. As shown in Table 7, it can be seen that 19 of the top 20 miRNAs predicted to be potentially associated with colon cancer are confirmed. Kidney neoplasms have a high incidence in western countries [46]. In addition, about 95% of renal neoplasms are malignant, the pathology of kidney tumors is more complex, and it is more challenging to treat kidney tumors. Table 8 shows that 19 of the top 20 miRNAs were validated by the database. In addition, to further validate the performance of our model, we downloaded miR-NAseq data for BRCA (breast invasive carcinoma) and COADREAD (colorectal cancer) from the TCGA database. Based on the downloaded data, we compared the differential expression between the top 10 miRNA paracancer sample groups that we predicted. The results of differential expression are shown in Figure 6.

Discussion
In the past, many studies have shown that aberrant miRNA expression is often associated with many biological processes as well as the occurrence of complex diseases in humans with considerable impact. Thus, predicting potential miRNA-disease associations can help medical professionals provide molecular insight into the pathogenesis of various complex diseases and thus develop relevant new drugs. In this paper, we propose the SGAEMDA model, a novel model based on a stacked graph autoencoder. Unlike previous stacked-autoencoders, SGAE is not trained layer-by-layer but in collaboration with each layer, which makes up for the drawback of weak coding ability due to greedy training of previous stacked-autoencoders. It can extract potential feature representations from miRNA similarity networks and disease similarity networks at a deeper level. The extracted features are concatenated with the corresponding association features and then MLP is used to predict the association between miRNA and diseases. After experiments, it is shown that the highest AUC value of SGAEMDA, which reached 0.9585 under the 5-fold and 10-fold cross-validation. is much higher than the other baseline methods. The case study analysis experimentally confirmed that our model can effectively predict the potential miRNA-disease association. However, our work still has some areas for improvement: (1) The model is not trained end-to-end, and our model may be lower in robustness. (2) The data used in the experiments are fewer and unable to extract more information about miRNAs and diseases from more perspectives.
In future studies, we will fuse more miRNA and disease similarity information to further improve the performance of our prediction models. Moreover, we will utilize a scheme similar to the EGES model [47] to allow embedding to cover more miRANs and diseases, thus addressing the cold-start problem in genetic disease association prediction.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript:

MDA
MiRNA-disease association MLP Multilayer perceptron CV Cross-validation DAG Directed acyclic graph GCN Graph convolutional networks AUC Area under curve AUPR Area under precision-recall