DGNMDA: Dual Heterogeneous Graph Neural Network Encoder for miRNA-Disease Association Prediction

Lu, Daying; Zhang, Qi; Zheng, Chunhou; Li, Jian; Yin, Zhe

doi:10.3390/bioengineering11111132


by Daying Lu ¹,*, Qi Zhang ¹, Chunhou Zheng ¹,², Jian Li ¹ and Zhe Yin ¹

¹ School of Cyber Science and Engineering, Qufu Normal University, Qufu 273165, China

² Artificial Intelligence Academy, Anhui University, Hefei 230039, China

* Author to whom correspondence should be addressed.

Bioengineering 2024, 11(11), 1132; https://doi.org/10.3390/bioengineering11111132

Submission received: 30 September 2024 / Revised: 3 November 2024 / Accepted: 9 November 2024 / Published: 10 November 2024

(This article belongs to the Special Issue Computational Genomics for Disease Prediction)


Abstract

In recent years, numerous studies have highlighted the pivotal importance of miRNAs in personalized healthcare. miRNAs hold significant potential in disease diagnosis, prognosis assessment, and therapeutic target discovery, and they are expected to enable precise disease subtyping and risk prediction, thereby advancing the development of precision medicine. Graph neural networks (GNNs), a class of deep learning architectures tailored for graph data analysis, have greatly facilitated the advancement of miRNA-disease association prediction algorithms. However, current methods often make incomplete use of network node information: they emphasize global information while neglecting the importance of local information. Effectively harnessing both local and global information remains a pressing challenge. To tackle this challenge, we propose a model named DGNMDA. First, we construct various miRNA and disease similarity networks based on authoritative databases. Subsequently, we design a dual heterogeneous graph neural network encoder capable of efficiently learning feature information between adjacent nodes and similarity information across the entire graph. Additionally, we develop a specialized fine-grained multi-layer feature interaction gating mechanism that integrates the outputs of the two encoders to identify novel associations between miRNAs and diseases. We evaluate our model using 5-fold cross-validation and real-world disease case studies, based on the HMDD v3.2 dataset. Our method demonstrates superior performance compared to existing approaches in various tasks, confirming the effectiveness and potential of DGNMDA as a robust method for predicting miRNA-disease associations.

Keywords:

miRNA-disease association prediction; graph convolutional transformer; feature interaction gating; GCAN

1. Introduction

miRNAs, small non-coding RNAs, are key regulators of post-transcriptional gene expression. Despite their short length of 19 to 24 nucleotides, miRNAs have a broad targeting ability: they can modulate the expression of a vast array of mRNAs, especially those of genes participating in interrelated biological pathways, which allows them to precisely adjust gene expression and orchestrate intricate cellular processes [1]. In 2002, Gregory J. Hannon and his research team [2] first reported the aberrant expression of two human miRNAs, let-7 [3] and lin-4 [4], in human lung cancer cells [5], laying the foundation for further exploration of the link between miRNAs and cancer [6], as well as other diseases. Subsequently, research on miRNAs as disease biomarkers and potential therapeutic targets gradually deepened. Traditional methods for predicting miRNA-disease associations relied on statistical analysis, the use of known biomarkers, and biological experimental validation. However, these methods often faced challenges such as difficulties in data acquisition, limited prediction accuracy, and applicability to only specific types of diseases, making them difficult to generalize. With advancements in technology and the continuous development of bioinformatics, researchers have developed more comprehensive modern methods aimed at achieving in-depth analysis of disease pathogenesis and accurate prediction of miRNA-disease associations. Current approaches mainly fall into two classes: those based on network analysis and those employing machine learning techniques.

Methods relying on network analysis typically build heterogeneous miRNA-disease networks by assessing the commonalities between miRNAs and diseases, along with their association matrices. They utilize the similarity information within the network to uncover and infer novel miRNA-disease associations [7]. Xie et al. [8] integrated data from various biological perspectives to develop a comprehensive miRNA-disease network, enhancing the accuracy and reliability of predictions. Yu et al. [9] employed a combination of tensor decomposition and label propagation techniques. They used tensor decomposition to extract crucial biological information, simplifying the analysis of intricate interactions between miRNAs and diseases. Wang et al. [10] introduced an attention-based graph convolution approach to identify disease-related miRNAs. The precision of network-based methods heavily depends on calculating the similarity between miRNAs and diseases, with the quality and completeness of prior knowledge directly impacting prediction performance. When prior knowledge is limited or unreliable, the accuracy of prediction results may be affected.

Machine learning-based methods use trained models to predict unknown data and identify potential miRNA-disease links. For example, Chen et al. [11] utilized existing miRNA-disease association data and improved prediction accuracy by incorporating neighborhood constraints into the matrix completion process. Zhou et al. [12] employed gradient boosting decision trees to extract features and then used logistic regression for feature scoring and classification, enabling the prediction of potential links between miRNAs and various diseases. Xuan et al. [13] developed dual convolutional neural networks to analyze miRNA and disease data, identifying the intricate relationships and patterns inherent in these datasets. In summary, machine learning-based association prediction models have demonstrated remarkable efficiency and significantly reduce time and financial costs. However, their prediction results rely heavily on the quality of the extracted features. Therefore, extracting high-quality, information-rich features from raw data is crucial for improving prediction performance. Future research should concentrate on developing advanced feature engineering techniques to further enhance the performance of these models in predicting miRNA-disease associations.

The rapid development of graph neural networks (GNNs) [14] and transformers [15,16,17,18] has led to various GNN models, such as Graph Convolutional Networks (GCNs) [19], Graph Attention Networks (GATs) [20], and Graph Autoencoders (GAEs) [21]. Tang et al. [22] introduced MMGCN, a deep learning approach that employs GCN encoders to derive miRNA and disease representations from various similarity perspectives and then refines these features using multi-channel attention to fill in the missing entries in the miRNA-disease association matrix. HGANMDA [23], a hierarchical graph attention network, predicts miRNA-disease associations by leveraging attention mechanisms at both node and semantic levels to capture the significance of adjacent nodes and various meta-paths. Zhou et al. [24] introduced a method based on a multi-molecule heterogeneous graph transformer, integrating biological entity relationships of eight major biomolecules to construct a comprehensive heterogeneous biological entity graph and using a heterogeneous transformer for miRNA prediction. Despite the remarkable performance of these models in prediction tasks, they often make limited use of network node information: they emphasize global information while overlooking the importance of local information. Efficiently considering both local and global information remains a pressing challenge. Moreover, when leveraging heterogeneous network data, existing models consider only the relationship attributes between node pairs, which poses an additional challenge. Inspired by L. H. Torres et al. [25], we propose DGNMDA, a dual heterogeneous graph neural network encoder designed to predict miRNA-disease associations.

Initially, we build miRNA and disease homogeneous networks, as well as a miRNA-disease heterogeneous similarity network, leveraging various similarities among miRNAs and diseases. These networks comprehensively characterize the complex relationships and interactions between miRNAs and diseases. Then, we innovatively design a dual heterogeneous graph neural network encoder, consisting of a Graph Convolutional Transformer and a Graph Convolutional Attention Network (GCAN) encoder. The Graph Convolutional Transformer encoder captures both local structural information and global dependencies of nodes by introducing graph convolutional layers and self-attention mechanisms. The GCAN encoder adaptively aggregates information from neighboring nodes through an attention mechanism, effectively learning the local structural information of nodes. Through the collaborative work of these two encoders, our approach enables us to thoroughly capture local and global characteristics, obtaining high-quality node-embedding encodings. To further improve the quality of feature representations and prediction performance, we design a fine-grained multi-layer gating mechanism. This mechanism adaptively fuses and refines feature representations from the Graph Convolutional Transformer encoder and GCAN encoder at different granularity levels through gating units. This layer-wise progressive feature fusion approach effectively combines the strengths of both encoders, generating more refined and discriminative feature representations. Finally, we input the fused feature embeddings into a multi-layer perceptron (MLP) to predict the association scores between miRNAs and diseases. The overall workflow of our method is shown in Figure 1.

In this paper, we propose DGNMDA, a dual heterogeneous graph neural network encoder specifically designed for predicting miRNA-disease associations. Our method makes contributions in the following aspects:

Combining local structural information and global dependencies: We design a dual heterogeneous graph neural network encoder that integrates a Graph Convolutional Transformer and a Graph Convolutional Attention Network (GCAN). This architecture not only captures the global dependencies of nodes but also effectively learns local structural information, generating more comprehensive node embedding encodings.
Adaptive fusion of multi-level features: We introduce a fine-grained feature interaction gating mechanism to gradually fuse and refine feature representations from the two encoders at different levels. This adaptive fusion mechanism allows the model to dynamically adjust feature combinations based on task requirements, improving the flexibility and prediction performance of the model.
Improving prediction performance: Through experimental validation on the miRNA-disease association prediction task, our DGNMDA method demonstrates significant performance improvements on multiple benchmark datasets. Our results demonstrate the efficacy and advantages of our method, offering novel insights and resources for future studies in this domain.

2. Materials and Methods

2.1. Experimental Data

HMDD, a comprehensive database, offers a robust basis for investigating miRNA-human disease associations. This database integrates extensively validated miRNA-disease links, covering a wide range of laboratory research findings and clinical research discoveries. In this study, for an unbiased model comparison, we employ the HMDD v3.2 benchmark dataset, comprising 12,446 established links among 853 miRNAs and 591 diseases. We label these associations as positive instances. To tackle the uneven data distribution, we perform undersampling on the negative samples. Specifically, we equalize the dataset by randomly choosing an equivalent count of 0-labeled instances. This strategy ensures that the model is exposed to an equal proportion of positive and negative instances throughout training, consequently enhancing its resilience and generalizability. For a thorough assessment of our method’s efficacy, we carefully design data splitting and model evaluation strategies. First, we split the data into training and testing subsets in an 8:2 proportion to assess the model’s generalization ability. During the training phase, we utilize 5-fold CV to optimize the model’s hyperparameters and structural settings. By doing so, we can comprehensively evaluate the model’s performance on different data subsets and select the best-performing model configuration for the final independent testing, ensuring reliable prediction results in real-world applications.
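The sampling and splitting procedure described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the toy association matrix, the random seed, and the helper names `balanced_samples` and `k_fold_indices` are all hypothetical, and a small random matrix stands in for the 853 × 591 HMDD v3.2 matrix.

```python
import numpy as np

def balanced_samples(assoc, rng):
    """Return all positive pairs plus an equal number of random 0-labelled pairs."""
    pos = np.argwhere(assoc == 1)
    neg = np.argwhere(assoc == 0)
    neg = neg[rng.choice(len(neg), size=len(pos), replace=False)]
    pairs = np.vstack([pos, neg])
    labels = np.concatenate([np.ones(len(pos), dtype=int),
                             np.zeros(len(pos), dtype=int)])
    return pairs, labels

def k_fold_indices(n_samples, k, rng):
    """Shuffle sample indices and split them into k nearly equal folds."""
    return np.array_split(rng.permutation(n_samples), k)

rng = np.random.default_rng(0)
# Toy stand-in for the miRNA x disease association matrix (assumption).
assoc = (rng.random((30, 20)) < 0.1).astype(int)
pairs, labels = balanced_samples(assoc, rng)

# 8:2 split into training and testing subsets.
order = rng.permutation(len(labels))
n_train = int(0.8 * len(order))
train_idx, test_idx = order[:n_train], order[n_train:]

# 5-fold cross-validation inside the training subset.
folds = k_fold_indices(len(train_idx), 5, rng)
for f, val in enumerate(folds):
    fit = np.concatenate([folds[j] for j in range(5) if j != f])
    fit_pairs, val_pairs = pairs[train_idx[fit]], pairs[train_idx[val]]
    # A model would be trained on fit_pairs and validated on val_pairs here.
```

Sampling negatives without replacement keeps the two classes exactly balanced, which matches the equal-proportion requirement stated in the text.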

2.2. Building miRNA-Disease Similarity Networks

To effectively represent miRNA and disease characteristics, we build separate homogeneous similarity networks and a combined heterogeneous network. When constructing the homogeneous similarity networks, we leverage the miRNA functional and GIP kernel similarity matrices to quantify miRNA similarities, while analogous techniques are used to build the disease homogeneous similarity network. For the heterogeneous affinity network, we incorporate validated miRNA-disease interaction data into a binary adjacency matrix of size M × N, with M and N denoting the miRNA and disease counts, respectively. This adjacency matrix enables us to obtain a miRNA-disease bipartite graph structure, in which miRNAs and diseases are represented as distinct node categories, and their known associations are represented as edges connecting the two node types. Finally, we derive the matrix form of the heterogeneous miRNA-Disease network, as shown below:

$$G = \begin{bmatrix} 0 & G_{md} \\ G_{dm}^{T} & 0 \end{bmatrix} \in \mathbb{R}^{(M+N) \times (M+N)} \tag{1}$$

where $G_{md}$ represents the miRNA-disease association sub-matrix, and $G_{dm}^{T}$ represents its transpose.
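As a small illustration of Equation (1), the block matrix can be assembled from the association sub-matrix as sketched below. The function name `build_heterogeneous_adjacency` and the 2 × 3 toy matrix are hypothetical and only serve to show the layout.

```python
import numpy as np

def build_heterogeneous_adjacency(g_md):
    """Assemble the (M+N) x (M+N) block matrix of Eq. (1) from an M x N matrix."""
    m, n = g_md.shape
    top = np.hstack([np.zeros((m, m)), g_md])       # miRNA rows
    bottom = np.hstack([g_md.T, np.zeros((n, n))])  # disease rows
    return np.vstack([top, bottom])

# Toy example: 2 miRNAs x 3 diseases (values are illustrative only).
g_md = np.array([[1, 0, 1],
                 [0, 1, 0]])
G = build_heterogeneous_adjacency(g_md)
```

miRNA nodes occupy the first M indices and disease nodes the remaining N, so the known association between miRNA 0 and disease 0 appears at `G[0, 2]` and, symmetrically, at `G[2, 0]`.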

2.3. Graph Convolutional Attention Network (GCAN) Encoder

Graph Convolutional Networks (GCNs) are a neural network architecture tailored for analyzing graph-structured data. GCNs incorporate the topological and attribute information of nodes by performing local convolutional operations on the graph to extract latent node representations. The core mechanism of GCNs utilizes the graph’s adjacency matrix to guide the propagation of information between nodes, modeling the local neighborhood structure. In each network layer, GCNs aggregate and transform the features of neighboring nodes, a process that can be viewed as spectral filtering of graph signals. As the network depth increases, the model gradually expands its receptive field, capturing broader structural information. This hierarchical feature extraction approach enables GCNs to simultaneously encode both microscopic local structures and macroscopic global patterns, ultimately generating node embeddings rich in graph topological semantics. Inspired by previous work, we employ a Graph Convolutional Attention Network (GCAN) for encoding. Our GCAN model first utilizes a traditional GCN to perform initial encoding on the input similarity network, providing rich local structural information and better feature initialization for the subsequent attention mechanism. Then, we input the encoded features into the attention-based layer. This design allows the model to adaptively assign importance weights to different neighboring nodes, capturing long-range dependencies between nodes while preserving the local topological structure. The detailed procedure is outlined below:

First, we employ GCN to encode the node features. For node $v_i$, the following equation describes the feature update process:

$$h_{v_i}^{(l)} = \sigma\left(\sum_{j \in N(i) \cup \{i\}} \frac{1}{\sqrt{|N(i)|\,|N(j)|}} W^{(l)} h_{v_j}^{(l-1)}\right) \tag{2}$$

where $h_{v_i}^{(l)}$ denotes the feature representation of node $v_i$ at the $l$-th layer, $N(i)$ represents its neighbor set, $W^{(l)}$ is the learnable weight matrix of the $l$-th layer, and $\sigma$ is the ReLU activation function.
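The update in Equation (2) can be written in matrix form, as in the minimal NumPy sketch below. Two choices here are assumptions rather than details from the text: node degrees are counted on the graph with self-loops added (the common convention, which also avoids division by zero for isolated nodes), and features are stored as rows, so the weight matrix multiplies from the right. The toy graph and random weights are illustrative.

```python
import numpy as np

def normalized_adjacency(adj):
    """Symmetrically normalize the adjacency matrix after adding self-loops."""
    a_hat = adj + np.eye(adj.shape[0])
    deg = a_hat.sum(axis=1)  # degree including the self-loop (assumption)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(adj_norm, h, w):
    """One GCN layer: aggregate normalized neighbor features, transform, ReLU."""
    return np.maximum(adj_norm @ h @ w, 0.0)

rng = np.random.default_rng(0)
# Toy star-shaped graph with 4 nodes; node 1 is the hub.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 1],
                [0, 1, 0, 0],
                [0, 1, 0, 0]], dtype=float)
h0 = rng.normal(size=(4, 8))   # input node features
w1 = rng.normal(size=(8, 5))   # stands in for the learnable W^(l)
adj_norm = normalized_adjacency(adj)
h1 = gcn_layer(adj_norm, h0, w1)
```

Stacking several such layers widens the receptive field by one hop per layer, which is the hierarchical behavior described above.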

To distinguish the importance of different neighboring nodes, we introduce an attention mechanism. For each node pair $(v_i, v_j)$, we calculate the attention coefficient $e_{ij}$, normalize the coefficients to obtain the final attention weights $\alpha_{ij}$, and use these weights to perform a weighted aggregation of the neighboring nodes’ features:

$$e_{ij} = \mathrm{LeakyReLU}\left(a^{T}\left[W_h h_{v_i} \,\Vert\, W_h h_{v_j}\right]\right) \tag{3}$$

$$\alpha_{ij} = \frac{\exp(e_{ij})}{\sum_{k \in N(i)} \exp(e_{ik})} \tag{4}$$

$$z_{v_i} = \sigma\left(\sum_{j \in N(i)} \alpha_{ij} W_h h_{v_j}\right) \tag{5}$$

where $a$ is the learnable attention weight vector, $W_h$ is the shared feature transformation matrix, and $\Vert$ denotes vector concatenation. $z_{v_i}$ represents the new feature representation of node $v_i$ after attention-weighted aggregation.
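Equations (3) to (5) can be sketched for a single attention head as follows. This is an illustrative sketch under assumptions: every node is assumed to have at least one neighbor, the LeakyReLU slope of 0.2 and the ReLU used for $\sigma$ are choices made here, and the toy graph and random parameters are hypothetical.

```python
import numpy as np

def leaky_relu(x, slope=0.2):
    return np.where(x > 0, x, slope * x)

def attention_layer(adj, h, w_h, a):
    """Single-head graph attention: Eq. (3) scores, Eq. (4) softmax, Eq. (5) aggregation."""
    wh = h @ w_h                                   # transformed features, shape (n, d_out)
    d_out = wh.shape[1]
    # a^T [Wh_i || Wh_j] separates into a term for node i plus a term for node j.
    e = leaky_relu((wh @ a[:d_out])[:, None] + (wh @ a[d_out:])[None, :])
    e = np.where(adj > 0, e, -np.inf)              # keep only neighbors N(i)
    e = e - e.max(axis=1, keepdims=True)           # numerical stability
    alpha = np.exp(e)
    alpha = alpha / alpha.sum(axis=1, keepdims=True)
    z = np.maximum(alpha @ wh, 0.0)                # sigma taken as ReLU (assumption)
    return z, alpha

rng = np.random.default_rng(0)
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 1],
                [0, 1, 0, 0],
                [0, 1, 0, 0]], dtype=float)
h = rng.normal(size=(4, 6))
w_h = rng.normal(size=(6, 3))
a = rng.normal(size=6)                             # length 2 * d_out
z, alpha = attention_layer(adj, h, w_h, a)
```

Masking non-neighbors with negative infinity before the softmax guarantees that each row of `alpha` sums to one over the neighbor set only.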

To bolster the model’s expressiveness and robustness, we employ multi-head attention, computing separate attention heads concurrently and concatenating their outputs. Meanwhile, to mitigate the issue of vanishing gradients in deep networks while retaining useful information from each layer, we introduce an improved skip connection method. This method preserves the advantages of traditional skip connections while considering the issues of noise accumulation and feature importance, and has relatively low computational complexity, which is particularly important for processing large-scale graph data. Specifically, we fuse the outputs of multiple GCAN layers through the improved skip connections.

$$h_{v_i}^{att} = \Big\Vert_{q=1}^{Q} \sigma\left(\sum_{j \in N(i)} \alpha_{ij}^{q} W_h^{q} h_{v_j}\right) \tag{6}$$

$$H_{AG} = h_{v_i}^{(L)} + \sum_{l=1}^{L-1} \alpha_l h_{v_i}^{(l)} \tag{7}$$

where $\alpha_{ij}^{q}$ and $W_h^{q}$ denote the attention weights and feature transformation matrix of the $q$-th of the $Q$ attention heads, respectively, $L$ denotes the number of GCAN layers, and $\alpha_l$ represents the weight of the $l$-th layer, which is obtained through learning.
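The weighted skip connection of Equation (7) amounts to adding scaled copies of the earlier layer outputs to the last one. The sketch below assumes that all layer outputs share the same shape and uses fixed numbers in place of the learned layer weights; the helper name `fuse_layers` is hypothetical.

```python
import numpy as np

def fuse_layers(layer_outputs, layer_weights):
    """Eq. (7): last layer output plus a weighted sum of the earlier layers.

    layer_outputs: list of L arrays with identical shape (n, d)
    layer_weights: L - 1 scalars, one per earlier layer (learned in the paper)
    """
    fused = layer_outputs[-1].copy()
    for alpha_l, h_l in zip(layer_weights, layer_outputs[:-1]):
        fused += alpha_l * h_l
    return fused

# Toy outputs of three layers, filled with constants to make the result easy to check.
outputs = [np.full((2, 3), 1.0), np.full((2, 3), 2.0), np.full((2, 3), 3.0)]
h_ag = fuse_layers(outputs, [0.5, 0.25])  # every entry is 3 + 0.5 * 1 + 0.25 * 2
```

Because the fusion is a plain weighted sum, it adds only a handful of scalar parameters, which is consistent with the low computational cost noted in the text.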

2.4. Graph Convolutional Transformer Encoder

Here, we adopt a Graph Convolutional Transformer for information extraction. Unlike traditional Transformers, the Graph Convolutional Transformer introduces graph convolutional layers to model the local structural information of nodes while leveraging the self-attention mechanism of Transformers to capture global relationships. The encoder consists of two main components: graph convolutional layers, which aggregate the local features of adjacent nodes, and self-attention layers, which capture the global dependencies between nodes. Through multiple layers of graph convolution and self-attention operations, the encoder can generate high-quality node representations. This combination is better suited to graph-structured data and enables our model to harness both the local and the global structure of the graph, improving its performance on graph-related tasks.

First, we obtain the input feature matrix $F_s \in \mathbb{R}^{(M+N) \times k}$, with $M$ and $N$ denoting the miRNA and disease node counts, respectively, and $k$ denoting the dimension of the input features.

$$F_s = [M_s; D_s] \tag{8}$$

In the equation, $M_s$ and $D_s$ denote the miRNA and disease nodes’ similarity matrices, respectively.

In the Graph Convolutional Transformer (GCT), we introduce graph convolutional layers to encode nodes’ local structural patterns by aggregating the neighborhood information, generating more expressive node representations.

$$X_a^{(0)} = F_s \tag{9}$$

$$X_a^{(e)} = \sigma\left(\hat{A} X_a^{(e-1)} W^{(e)}\right) \tag{10}$$

Here, $W^{(e)}$ represents the trainable weights of the $e$-th graph convolutional layer, while $\sigma$ denotes the ReLU function.

We introduce the multi-head self-attention layer to model global dependencies between nodes. Nevertheless, in graph convolutional transformers, node representations may converge with increasing encoder depth. The transformer’s inherent self-attention allows nodes to assimilate features from their counterparts, leading to similar feature representations at deep levels [26]. To address this challenge and improve the transformer’s capacity for modeling local relationships, we incorporate a Gaussian bias term into its self-attention layer.

The Gaussian bias term is intended to strengthen the transformer’s ability to extract local structural information. Because it is added to the attention scores before the softmax, it scales down the attention weights of distant node pairs and thereby encourages nodes to allocate more attention to important nodes that are closer in distance [27,28]. This bias mechanism can help the transformer better capture local patterns and short-range dependencies in the miRNA-disease association graph.

$$Q^{(n)} = X^{(L)} W_Q^{(n)}, \quad K^{(n)} = X^{(L)} W_K^{(n)}, \quad V^{(n)} = X^{(L)} W_V^{(n)} \tag{11}$$

$$A_r^{(n)} = \mathrm{softmax}\left(\frac{Q^{(n)} \left(K^{(n)}\right)^{T}}{\sqrt{d}} + \left(-\left|\omega k_{i,j}^{2} + b\right|\right)\right) V^{(n)} \tag{12}$$

$$A_{GT} = \mathrm{Concat}\left(A_r^{(1)}, A_r^{(2)}, \dots, A_r^{(N)}\right) W_O \tag{13}$$

In the equations, $X^{(L)}$ represents the final graph convolutional layer’s output, and $W_Q^{(n)}$, $W_K^{(n)}$, and $W_V^{(n)}$ denote the learnable weight matrices for the $n$-th attention head. The variable $d$ represents the dimension of the attention heads, $N$ denotes the number of attention heads, $W_O$ represents the output weight matrix, $k_{i,j}$ indicates the distance between nodes $i$ and $j$, and $\omega$ and $b$ serve as learnable scalar parameters. $\omega$ controls the scale of the Gaussian bias term, helping the model adapt to different distance metrics, while $b$ serves to penalize the weight of a node’s self-attention, preventing nodes from focusing excessively on themselves.
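The distance-biased attention of Equations (11) to (13) can be sketched in NumPy as below. Several points are assumptions made for illustration: the distance matrix is supplied by the caller (here a hop distance on a toy path graph), $\omega$ and $b$ are fixed numbers rather than learned parameters, and all weights are random.

```python
import numpy as np

def softmax(x):
    x = x - x.max(axis=-1, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=-1, keepdims=True)

def gaussian_biased_head(x, w_q, w_k, w_v, dist, omega, b):
    """One attention head with the bias -|omega * k_ij^2 + b| added to the scores (Eq. 12)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v            # Eq. (11)
    d = q.shape[1]
    bias = -np.abs(omega * dist ** 2 + b)
    weights = softmax(q @ k.T / np.sqrt(d) + bias)
    return weights @ v, weights

def multi_head(x, heads, w_o, dist, omega, b):
    """Concatenate the head outputs and project them with W_O (Eq. 13)."""
    outs = [gaussian_biased_head(x, wq, wk, wv, dist, omega, b)[0]
            for wq, wk, wv in heads]
    return np.concatenate(outs, axis=1) @ w_o

rng = np.random.default_rng(0)
n, d_model, d_head, n_heads = 4, 8, 4, 2
x = rng.normal(size=(n, d_model))
# Hop distance on a path graph 0-1-2-3 (illustrative distance metric).
dist = np.abs(np.arange(n)[:, None] - np.arange(n)[None, :]).astype(float)
heads = [tuple(rng.normal(size=(d_model, d_head)) for _ in range(3))
         for _ in range(n_heads)]
w_o = rng.normal(size=(n_heads * d_head, d_model))
a_gt = multi_head(x, heads, w_o, dist, omega=1.0, b=-1.0)
```

With `omega=1.0` and `b=-1.0`, the bias for node 0 is -1 for itself, 0 for its direct neighbor, -3 at distance 2, and -8 at distance 3, so the direct neighbor is favored and self-attention is mildly penalized, matching the roles of the two parameters described above.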

To enhance the model’s stability and generalizability, we introduce residual connections and layer normalization. Residual connections facilitate the smooth flow of gradients, while layer normalization accelerates convergence and mitigates the issue of vanishing gradients.

$$\tilde{A} = \mathrm{LayerNorm}\left(A_r + X^{(L)}\right) \tag{14}$$

$$H_{TR} = \mathrm{LayerNorm}\left(\mathrm{FFN}(A_r) + A_r\right) \tag{15}$$

In the equation, FFN represents the feed-forward neural network layer, which consists of two linear transformations and a non-linear activation function.
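The residual and normalization steps of Equations (14) and (15) can be sketched as follows. This sketch makes two assumptions that go beyond the text: the feed-forward network is applied to the normalized residual $\tilde{A}$, following the usual Transformer layout, whereas Equation (15) as printed is written in terms of $A_r$; and the layer normalization omits the learnable gain and bias. All weights are random placeholders.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    """Normalize each row to zero mean and unit variance (no learnable gain or bias)."""
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def ffn(x, w1, b1, w2, b2):
    """Two linear transformations with a ReLU in between."""
    return np.maximum(x @ w1 + b1, 0.0) @ w2 + b2

def encoder_output(attn_out, x_conv, w1, b1, w2, b2):
    a_tilde = layer_norm(attn_out + x_conv)                    # Eq. (14)
    return layer_norm(ffn(a_tilde, w1, b1, w2, b2) + a_tilde)  # Eq. (15), on a_tilde (assumption)

rng = np.random.default_rng(0)
attn_out = rng.normal(size=(5, 8))   # stands in for the attention output
x_conv = rng.normal(size=(5, 8))     # stands in for X^(L)
w1, b1 = rng.normal(size=(8, 16)), np.zeros(16)
w2, b2 = rng.normal(size=(16, 8)), np.zeros(8)
h_tr = encoder_output(attn_out, x_conv, w1, b1, w2, b2)
```

After the final normalization every node embedding has approximately zero mean and unit variance across its feature dimensions, which keeps the scale of the two encoder outputs comparable before fusion.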

2.5. Fine-Grained Multi-Layer Feature Interaction Gating

To fuse the embeddings generated by the Graph Convolutional Attention Network (GCAN) and Graph Convolutional Transformer encoders, we propose a fine-grained multi-layer feature interaction gating mechanism. This mechanism allows for the gradual integration and refinement of feature representations from the GCAN encoder and Transformer encoder at different levels. We introduce residual connections and feature interaction units tailored to our model. The residual connections add the mean of the input feature-embedding matrices to the fused feature-embedding matrix, promoting gradient flow and feature reuse. The feature interaction units capture the interaction information between different features through non-linear transformations, enhancing the feature representation capability. To mitigate the overfitting problem in deep networks, we apply dropout with a probability of 0.5 to regularize the model.

Specifically, we first compute the gating weight matrix $G^{(1)} \in \mathbb{R}^{N \times D}$ for the first layer, which regulates the fusion ratio of each node across each feature dimension. Then, we use $G^{(1)}$ to fuse the two feature-embedding matrices $H_{AG}$ and $H_{TR}$, obtaining the fused feature-embedding matrix $H_{fused}^{(1)} \in \mathbb{R}^{N \times D}$ for the first layer:

$$G^{(1)} = \sigma\left(W_g^{(1)} [H_{AG}; H_{TR}] + b_g^{(1)}\right) \tag{16}$$

$$H_{fused}^{(1)} = G^{(1)} \odot H_{AG} + \left(1 - G^{(1)}\right) \odot H_{TR} \tag{17}$$

In the equations, $W_g^{(1)} \in \mathbb{R}^{2D \times D}$ represents the learnable weight matrix, $b_g^{(1)} \in \mathbb{R}^{D}$ denotes the learnable bias vector, and $\sigma$ denotes the Sigmoid function. The $[\,;\,]$ symbol indicates the concatenation operation along the feature dimension, while $\odot$ represents the element-wise multiplication operation.
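The gate of Equations (16) and (17) can be sketched as a per-element convex combination of the two encoder outputs. The sketch stores node embeddings as rows, so the concatenated features multiply the $2D \times D$ weight matrix from the left; the random inputs and weights are placeholders.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gated_fusion(h_ag, h_tr, w_g, b_g):
    """Eq. (16): gate from concatenated features; Eq. (17): element-wise blend."""
    gate = sigmoid(np.concatenate([h_ag, h_tr], axis=1) @ w_g + b_g)
    fused = gate * h_ag + (1.0 - gate) * h_tr
    return fused, gate

rng = np.random.default_rng(0)
n_nodes, dim = 5, 4
h_ag = rng.normal(size=(n_nodes, dim))   # stands in for the GCAN encoder output
h_tr = rng.normal(size=(n_nodes, dim))   # stands in for the Transformer encoder output
w_g = rng.normal(size=(2 * dim, dim))
b_g = np.zeros(dim)
h_fused, gate = gated_fusion(h_ag, h_tr, w_g, b_g)
```

Since every gate value lies strictly between 0 and 1, each fused entry lies between the corresponding entries of the two inputs; a zero weight matrix and zero bias would reduce the gate to 0.5 everywhere and the fusion to a plain average.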

To facilitate gradient flow and feature reuse, we introduce a residual connection that adds the mean of the input feature-embedding matrices to the fused feature-embedding matrix. We also incorporate a feature interaction unit $F^{(1)}$ to capture the interaction information between different features through non-linear transformations. Furthermore, we apply dropout regularization to the output feature-embedding matrix $H_{out}^{(1)}$ of the first layer, randomly setting a portion of the elements to zero to reduce overfitting.

$$H_{res}^{(1)} = H_{fused}^{(1)} + (H_{AG} + H_{TR}) / 2 \tag{18}$$

$$F^{(1)} = \tanh\left(W_f^{(1)} [H_{AG}; H_{TR}] + b_f^{(1)}\right) \tag{19}$$

$$H_{out}^{(1)} = H_{res}^{(1)} + F^{(1)} \tag{20}$$

$$H_{out}^{(1)} = \mathrm{Dropout}\left(H_{out}^{(1)}, p\right) \tag{21}$$

In the equations, $W_f^{(1)}$ represents the learnable weight matrix, $b_f^{(1)}$ denotes the learnable bias vector, and $\tanh$ is the hyperbolic tangent function.
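Equations (18) to (21) can be sketched for one gating layer as below. The use of inverted dropout (rescaling the kept entries by 1 / (1 - p) during training) is an assumption, since the text only states that a portion of the elements is set to zero; the function name and the constant toy inputs are hypothetical.

```python
import numpy as np

def gating_layer_output(h_fused, h_ag, h_tr, w_f, b_f, p, rng, training=True):
    """Residual mean, tanh interaction unit, and dropout for one gating layer."""
    h_res = h_fused + (h_ag + h_tr) / 2.0                                    # Eq. (18)
    interaction = np.tanh(np.concatenate([h_ag, h_tr], axis=1) @ w_f + b_f)  # Eq. (19)
    h_out = h_res + interaction                                              # Eq. (20)
    if training and p > 0:                                                   # Eq. (21)
        keep = rng.random(h_out.shape) >= p
        h_out = h_out * keep / (1.0 - p)   # inverted dropout (assumption)
    return h_out

rng = np.random.default_rng(0)
ones = np.ones((2, 3))
# Constant toy inputs: fused = 0, H_AG = 1, H_TR = 3, zero interaction weights.
h_eval = gating_layer_output(0 * ones, ones, 3 * ones,
                             np.zeros((6, 3)), np.zeros(3),
                             p=0.5, rng=rng, training=False)
h_train = gating_layer_output(0 * ones, ones, 3 * ones,
                              np.zeros((6, 3)), np.zeros(3),
                              p=0.5, rng=rng, training=True)
```

In evaluation mode every entry of `h_eval` equals 2 (the mean of 1 and 3, with a zero interaction term); in training mode each entry of `h_train` is either dropped to 0 or rescaled to 4.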

Finally, we recursively apply the multi-layer gating mechanism, with each layer having its own gating weight matrix, residual connection, and feature interaction unit. This process yields the final fused feature-embedding matrix $H_{out}^{(d)}$, with $d$ denoting the gating mechanism’s total layer count.

$$H_{fused} = H_{out}^{(d)} \tag{22}$$

The model is trained by minimizing the cross-entropy loss, which measures the difference between the true labels and the predicted probabilities, thereby enhancing prediction accuracy.

$$P_f = -\frac{1}{N} \sum_{v=1}^{N} \left[y_v \log(p_v) + (1 - y_v) \log(1 - p_v)\right] \tag{23}$$

In the equation, $y_v$ represents the true label of sample $v$, $p_v$ denotes the model’s predicted probability for that sample, and $N$ denotes the sample count.
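Equation (23) can be computed directly, as in the short sketch below. The clipping constant `eps` is an addition made here to keep the logarithm finite when a prediction is exactly 0 or 1; it is not part of the equation, and the example labels and predictions are illustrative.

```python
import numpy as np

def binary_cross_entropy(y_true, p_pred, eps=1e-12):
    """Mean binary cross-entropy between 0/1 labels and predicted probabilities."""
    p = np.clip(p_pred, eps, 1.0 - eps)   # numerical safeguard (assumption)
    return float(-np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p)))

y = np.array([1, 0, 1, 0])
uninformed = binary_cross_entropy(y, np.full(4, 0.5))                   # equals ln 2
confident = binary_cross_entropy(y, np.array([0.99, 0.01, 0.99, 0.01]))
```

A model that always predicts 0.5 incurs a loss of ln 2 (about 0.693), while confident correct predictions drive the loss toward zero.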

3. Results and Discussion

3.1. Comparative Analysis with State-Of-The-Art Methods

To comprehensively evaluate the performance of our proposed prediction method, we chose five state-of-the-art techniques as baselines: NIMGSA [29], AGAEMD [30], HGANMDA [23], MMGCN [22], and AMHMDA [31]. These approaches serve as benchmarks to assess the effectiveness and efficiency of our model in inferring miRNA-disease links.

NIMCGCN [29]: employs GCNs to derive features from similarity graphs and integrates a neural inductive matrix completion model to generate a complete miRNA-disease association matrix.
AGAEMD [30]: considers the attention distribution between nodes in the heterogeneous network and dynamically refines the miRNA functional resemblance profile.
HGANMDA [23]: leverages attention mechanisms at both node and semantic levels to capture the significance of adjacent nodes and meta-paths, reconstructing the associations between miRNAs and diseases.
MMGCN [22]: combines GCNs and multi-channel attention mechanisms to extract feature information, adaptively capturing the importance of different features.
AMHMDA [31]: harnesses GCNs to derive multi-faceted node features from various similarity networks, forming a hypergraph, which is then fused via attention to enable miRNA-disease association inference.

We employed 5-fold cross-validation on the HMDD v3.2 dataset to rigorously evaluate DGNMDA’s performance. Figure 2 shows that the AUC values for the five-fold cross-validation models are 0.9472, 0.9415, 0.9487, 0.9430, and 0.9473, while the AUPRC values are 0.9438, 0.9440, 0.9489, 0.9422, and 0.9469. These results, along with the data presented in Table 1, highlight the remarkable consistency in our model’s performance.
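The consistency across folds can be quantified from the per-fold values quoted above. The snippet below only summarizes those ten numbers; the choice of the sample standard deviation (`ddof=1`) is made here and is not stated in the text.

```python
import numpy as np

# Per-fold values as listed in the text.
auc = np.array([0.9472, 0.9415, 0.9487, 0.9430, 0.9473])
auprc = np.array([0.9438, 0.9440, 0.9489, 0.9422, 0.9469])

auc_mean, auc_std = auc.mean(), auc.std(ddof=1)
auprc_mean, auprc_std = auprc.mean(), auprc.std(ddof=1)

print(f"AUC   mean {auc_mean:.4f}, std {auc_std:.4f}")
print(f"AUPRC mean {auprc_mean:.4f}, std {auprc_std:.4f}")
```

The listed values average to roughly 0.9455 for AUC and 0.9452 for AUPRC, and the spread between the best and worst fold is below 0.01 for both metrics.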

Through comparative analysis, we have identified two main aspects that limit the performance of existing prediction models. Firstly, current models overlook the higher-order connection patterns and global structural information within the miRNA-disease association network. Secondly, during the feature representation learning process, existing models fail to adequately consider the complex higher-order structure of heterogeneous biological networks. Methods such as NIMGSA, although capable of capturing local node features to a certain extent, are constrained by their neural network encoders and struggle to comprehensively characterize complex biological networks. In contrast, the DGNMDA model demonstrates significant advantages in capturing both global and local information. By employing a graph convolutional Transformer encoder, DGNMDA can simultaneously consider the local structure and global dependencies of nodes, enabling a more comprehensive understanding of the topological properties of miRNA-disease association networks and the discovery of key patterns in higher-order connections. Moreover, DGNMDA incorporates a graph convolutional attention encoder, which adaptively aggregates neighboring node information through an attention mechanism, further enhancing the expressive capability of local features. Furthermore, DGNMDA’s dual heterogeneous graph encoder skillfully captures higher-order structural data within heterogeneous biological networks and precisely localizes complex biological signals at multiple granularity levels, generating more accurate and biologically meaningful feature representations. As a result, the accuracy of miRNA-disease association prediction is significantly improved.

3.2. Ablation Study

To comprehensively assess the contributions and importance of various components in the DGNMDA model, we conducted a series of ablation experiments. By designing different model variants and comparing their performance on the HMDD v3.2 dataset, we gained a deep understanding of the roles and efficacy of key components, including the graph convolutional Transformer encoder, graph convolutional attention encoder, and feature interaction gating. We constructed four DGNMDA variants: DGN-A, which removed the self-attention mechanism in the graph convolutional Transformer encoder, retaining only the graph convolutional layers; DGN-B, which replaced the graph convolutional attention encoder with a traditional graph attention encoder; DGN-C, which eliminated the feature interaction gating and directly concatenated the encoder outputs; and DGN-D, which solely employed a conventional Transformer encoder. The experimental results in Table 2 demonstrate that the complete DGNMDA model achieves the best performance across all evaluation metrics. Compared to other variants, DGNMDA exhibits the highest AUC, AUPR, and F1 scores. The ablation experiments reveal the importance and complementarity of different components in the DGNMDA model. Our proposed DGNMDA model can more effectively capture local and global information, while the feature interaction gating adaptively fuses and refines the feature representations from the encoders. These experimental results strongly validate the effectiveness of the DGNMDA model.

3.3. Comparison of Single-Source and Multi-Source Features

Our model takes features from multiple sources as input. Integrating multi-source feature information enhances the precision and stability of miRNA-disease link inference, benefiting from the complementary evidence provided by different data sources. By jointly considering miRNA functional similarity, disease semantic similarity, and miRNA-disease association networks, we can more comprehensively characterize the complex biological interactions among miRNAs and diseases and reveal potential regulatory mechanisms. Moreover, integrating multi-source data helps mitigate the sparsity issue encountered when using single data sources and enhances the generalizability of the prediction model. Furthermore, multi-source feature integration can facilitate the discovery of novel association patterns, expanding insights into miRNA functions in diseases and providing important clues for subsequent experimental validation and clinical applications. Therefore, we validated the importance of multi-source information. Table 3 and Figure 3 reveal that the model integrating multi-source information achieves the best performance.

3.4. Case Study

Cancer is a multifaceted disorder resulting from the interaction of genetic and environmental factors. miRNAs play crucial roles in cancer progression by regulating oncogene and tumor suppressor gene expression, cell proliferation and apoptosis, invasion, and metastasis [32,33,34,35,36]. Lymphoma is a serious malignant tumor that, if not promptly diagnosed and treated, can adversely affect patients’ health and quality of life in multiple respects. Research reveals a strong correlation between the expression of specific miRNAs and lymphoma patient outcomes [37,38,39,40]. For instance, elevated miR-21 and miR-155 levels frequently indicate an unfavorable outcome, whereas increased miR-34a expression suggests a better prognosis [41]. These miRNAs have the potential to become prognostic biomarkers that guide risk stratification and treatment decisions in lymphoma patients. Lung cancer, especially NSCLC, is a primary contributor to global cancer mortality [42]. Even after potentially curative treatment, early-stage NSCLC patients face a recurrence rate of up to 40% within 5 years. miRNAs also play significant roles in lung cancer. For example, studies demonstrate that miR-155 and let-7 miRNAs can forecast lung adenocarcinoma outcomes [43,44]. Breast cancer, a prevalent malignancy among women, contributes significantly to female cancer mortality. Research indicates a significant decrease in the levels of specific miRNAs, such as miR-126 and miR-10b, in breast cancer patients [45].

To further validate the capability of DGNMDA in predicting miRNA-disease associations, we conducted case studies on three types of cancer: lymphoma, lung cancer, and breast cancer. In the first case study, we focused on lymphoma and lung cancer, using the known miRNA-disease associations from HMDD v3.2 as the training set and the miRNAs with no known association to these cancers as the candidate (test) set. We selected the top 20 candidate miRNAs for each cancer by prediction score and verified them against the dbDEMC 3.0 database [46]. All of the top 20 predicted miRNAs for each cancer were confirmed in the database (Table 4, Figure 4), supporting DGNMDA’s reliability in predicting novel cancer-related miRNAs.

The second case study assessed DGNMDA’s predictive ability when no association information is available, treating breast cancer as a newly emerged disease. We removed all known associations between breast cancer and miRNAs before training, so that the model had to rank miRNAs for this disease without any direct evidence. As before, we verified the 20 miRNAs with the highest prediction scores, and all of them were confirmed in the database (Table 4, Figure 4).

In summary, these two case studies show that DGNMDA can identify potential disease-related miRNAs both for cancers with known associations and for diseases without any. These findings support the reliability and practicality of our model and may offer new insights into the role of miRNAs in cancer development.
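The ranking step used in both case studies can be sketched as follows. This is a minimal illustration and not the authors’ code: it assumes the trained model has already produced a score matrix, and the names `top_k_candidates`, `fraction_confirmed`, `scores`, `known`, and `confirmed` are hypothetical. The `confirmed` set stands in for a manual lookup in dbDEMC; no database API is implied.

```python
import numpy as np


def top_k_candidates(scores, known, disease_idx, k=20, new_disease=False):
    """Rank candidate miRNAs for one disease by predicted score.

    scores: (n_mirna, n_disease) array of predicted association scores.
    known:  (n_mirna, n_disease) binary array of known associations.
    new_disease: if True, the disease is treated as having no known
        associations, so every miRNA is a candidate (second case study).
    """
    col = scores[:, disease_idx].astype(float)  # astype returns a copy
    if not new_disease:
        # Exclude miRNAs already known to be associated with the disease.
        col[known[:, disease_idx] == 1] = -np.inf
    order = np.argsort(-col, kind="stable")
    order = order[np.isfinite(col[order])]  # drop the excluded miRNAs
    return order[:k].tolist()


def fraction_confirmed(candidates, confirmed):
    """Share of ranked candidates found in an external evidence set."""
    return sum(c in confirmed for c in candidates) / len(candidates)


# Toy data: 4 miRNAs x 2 diseases (illustrative values, not from the paper).
scores = np.array([[0.9, 0.1], [0.8, 0.7], [0.3, 0.95], [0.6, 0.2]])
known = np.array([[1, 0], [0, 0], [0, 1], [0, 0]])

first_setting = top_k_candidates(scores, known, disease_idx=0, k=2)
second_setting = top_k_candidates(scores, known, disease_idx=0, k=2, new_disease=True)
print(first_setting, second_setting)
print(fraction_confirmed(first_setting, confirmed={1, 2}))
```

In the second setting the sketch only changes which miRNAs are eligible; removing the disease’s associations from the training data, as described above, happens before the scores are computed.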

3.5. Parameter Analysis

Proper hyperparameter tuning enables deep learning models to learn complex miRNA-disease association patterns and to infer novel associations. We performed multiple experiments to identify the hyperparameter settings that maximize model efficacy and generalizability. Based on the experimental results, we selected the following values: a feature-embedding dimension of 512, 4 heads in the multi-head attention mechanism, 2 convolutional layers in the graph convolutional Transformer, and 2 layers in the multi-layer gating mechanism. We used a dropout rate of 0.5 to mitigate overfitting and improve DGNMDA’s ability to generalize to unseen data.
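For readers reimplementing the model, the selected values can be collected in a single configuration object. The sketch below is an assumption about how one might organize them; the class name `DGNMDAConfig` and its field names are hypothetical. The divisibility check reflects the usual multi-head attention convention of splitting the embedding evenly across heads, which the text does not state explicitly.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class DGNMDAConfig:
    """Hyperparameter values reported in Section 3.5 (field names are illustrative)."""

    embed_dim: int = 512          # feature-embedding dimension
    num_heads: int = 4            # heads in the multi-head attention mechanism
    num_gcn_layers: int = 2       # graph convolutional layers in the Transformer encoder
    num_gating_layers: int = 2    # layers in the multi-layer gating mechanism
    dropout: float = 0.5          # dropout rate

    def __post_init__(self):
        # Assumption: the embedding is split evenly across attention heads.
        if self.embed_dim % self.num_heads != 0:
            raise ValueError("embed_dim must be divisible by num_heads")
        if not 0.0 <= self.dropout < 1.0:
            raise ValueError("dropout must be in [0, 1)")

    @property
    def head_dim(self) -> int:
        """Per-head dimension under the even-split assumption."""
        return self.embed_dim // self.num_heads


cfg = DGNMDAConfig()
print(asdict(cfg), cfg.head_dim)
```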

3.5.1. Impact of Feature-Embedding Dimension

In deep learning, the choice of feature-embedding dimension significantly influences model performance. Higher dimensions give the model stronger feature-capturing capability but also increase computational complexity and memory consumption, and excessively high dimensions may produce an overly complex model that is difficult to train and generalizes poorly. Selecting the embedding dimension therefore requires balancing expressive power against computational efficiency. To find a suitable dimension, we evaluated the model across a range of embedding dimensions. The results, shown in Figure 5, indicate that a feature-embedding dimension of 512 yields better performance than the other dimensions tested.

3.5.2. Experiments on the Number of Multi-Layer Gating Layers

The number of gating layers is a key factor affecting model performance. Too few gating layers may limit the model’s expressive power and hinder a comprehensive representation of miRNA-disease associations. Too many gating layers, although they enable finer-grained feature fusion, may significantly increase computational complexity and introduce a risk of overfitting. Selecting the number of gating layers therefore requires balancing expressive power, computational efficiency, and generalization performance, guided by experimental validation and domain knowledge. Figure 6 shows the experimental findings, which indicate that two gating layers achieve the best model performance.

3.5.3. Impact of Graph Convolutional Layers and Attention Heads

Selecting the number of graph convolutional layers and attention heads requires balancing the model’s expressive power, computational efficiency, and generalization performance. Too few layers and heads may lead to underfitting and limited expressive power, making it difficult to capture the high-order dependencies and diverse interactions between nodes in the graph. Too many layers and heads may introduce a risk of overfitting and significantly increase computational complexity, harming both generalization and training efficiency. The configuration must therefore be chosen for the specific task and dataset through experimental validation and domain knowledge. Figure 7 presents the results for the configurations tested; the final model uses two graph convolutional layers and four attention heads (Section 3.5).
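A parameter sweep of the kind described in this section can be sketched as a plain grid search. The code below is a generic illustration, not the authors’ procedure: `grid_search` is a hypothetical helper, the candidate values in `grid` are assumptions (the text does not list the values tried), and `toy_score` is a placeholder that stands in for training and cross-validating the model. Its outputs are not experimental results.

```python
from itertools import product


def grid_search(evaluate, grid):
    """Evaluate every combination in `grid`.

    evaluate: callable taking the grid keys as keyword arguments and
        returning a score where higher is better (for example, mean AUC
        over cross-validation folds).
    Returns (best_params, best_score, all_results).
    """
    names = list(grid)
    results = []
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append((params, evaluate(**params)))
    best_params, best_score = max(results, key=lambda r: r[1])
    return best_params, best_score, results


def toy_score(num_gcn_layers, num_heads):
    """Placeholder objective for demonstration only; NOT a trained model."""
    return 1.0 / (1 + abs(num_gcn_layers - 2) + abs(num_heads - 4))


# Assumed candidate values, chosen only to make the example concrete.
grid = {"num_gcn_layers": [1, 2, 3], "num_heads": [2, 4, 8]}
best_params, best_score, results = grid_search(toy_score, grid)
print(best_params, best_score, len(results))
```

In practice `evaluate` would wrap model training and validation, and the same helper could sweep the embedding dimension or the number of gating layers by changing the keys of `grid`.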

4. Conclusions

A growing body of experimental evidence reveals significant changes in miRNA expression under various disease conditions, suggesting their pivotal regulatory functions in disease onset and advancement. Motivated by this, our study aims to investigate the prediction of associations between miRNAs and diseases, which is crucial for understanding the molecular mechanisms of diseases and developing new therapeutic strategies. We propose an innovative DGNMDA model that employs a dual heterogeneous graph neural network encoder and incorporates a graph convolutional Transformer and a Graph Convolutional Attention Network (GCAN) encoder. This model can simultaneously capture local structural features and global dependencies, generating more comprehensive node embeddings. Moreover, we design a fine-grained feature interaction gating mechanism to adaptively integrate multi-level features, enhancing the model’s flexibility. To evaluate the model’s performance, we first determine the optimal hyperparameter combination through a series of experiments and then compare DGNMDA with five state-of-the-art models. Our model outperforms the alternatives in terms of AUC and AUPRC metrics. Furthermore, we validate the model’s practicality by assessing its effectiveness in predicting associations for three prevalent and lethal malignancies: lymphoma, lung cancer, and breast cancer. These findings suggest that DGNMDA has the potential to enhance miRNA-disease association prediction precision, deepening our understanding of disease pathogenesis and treatment strategies.

Despite the promising results achieved by DGNMDA, there are still some limitations and challenges. First, the model primarily relies on known association data and similarity measures when constructing similarity networks, and the quality and completeness of this information may affect model performance. In the future, we need to explore more data sources and similarity calculation methods to obtain more comprehensive and reliable similarity information. Second, balancing computational efficiency and prediction performance in real-world applications remains a noteworthy issue. Optimizing model architecture and training strategies to reduce computational overhead while maintaining performance will be a crucial aspect of our further explorations. Additionally, our model currently focuses on binary association predictions between miRNAs and diseases, while the complexity of biological systems extends far beyond this. Incorporating other types of biological entities and constructing more comprehensive multi-entity heterogeneous networks may reveal deeper biological mechanisms. Finally, although our model performs exceptionally well on benchmark datasets, more validation and refinement are needed for practical applications. Comparing model predictions with wet-lab experimental results and iteratively optimizing the model based on expert knowledge will help improve the model’s interpretability and credibility. Simultaneously, developing user-friendly visualization tools to facilitate the use and interpretation of model results by biologists and medical researchers is a crucial step in promoting the model’s application in real-world scenarios.

Author Contributions

D.L. and Q.Z.—conceptualization and drafting of the initial manuscript; C.Z.—data preparation and figure review; J.L. and Z.Y.—suggestions for the initial draft. All authors have read and agreed to the published version of the manuscript.

Funding

This investigation was supported by the National Natural Science Foundation of China (61532002, 61601261) and the Shandong Provincial Higher Educational Science and Technology Program (Grant No. J17KA062).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The miRNA-disease association data used in this study were obtained from the publicly available HMDD v3.2 database, accessed on 28 February 2024, through the website http://www.cuilab.cn/hmdd.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Lu, T.X.; Rothenberg, M.E. MicroRNA. J. Allergy Clin. Immunol. 2018, 141, 1202–1207.
2. Lee, R.C.; Ambros, V. An extensive class of small RNAs in Caenorhabditis elegans. Science 2001, 294, 862–864.
3. Roush, S.; Slack, F.J. The let-7 family of microRNAs. Trends Cell Biol. 2008, 18, 505–516.
4. Wightman, B.; Ha, I.; Ruvkun, G. Posttranscriptional regulation of the heterochronic gene lin-14 by lin-4 mediates temporal pattern formation in C. elegans. Cell 1993, 75, 855–862.
5. Mendell, J.T.; Olson, E.N. MicroRNAs in stress signaling and human disease. Cell 2012, 148, 1172–1187.
6. Zhou, Q.; Cui, F.; Lei, C.; Ma, S.; Huang, J.; Wang, X.; Qian, H.; Zhang, D.; Yang, Y. ATG7-mediated autophagy involves in miR-138-5p regulated self-renewal and invasion of lung cancer stem-like cells derived from A549 cells. Anti-Cancer Drugs 2021, 32, 376–385.
7. Chen, X.; Liu, M.X.; Yan, G.Y. RWRMDA: Predicting novel human microRNA–disease associations. Mol. BioSyst. 2012, 8, 2792–2798.
8. Xie, X.; Wang, Y.; Sheng, N.; Zhang, S.; Cao, Y.; Fu, Y. Predicting miRNA-disease associations based on multi-view information fusion. Front. Genet. 2022, 13, 979815.
9. Yu, N.; Liu, Z.P.; Gao, R. Predicting multiple types of MicroRNA-disease associations based on tensor factorization and label propagation. Comput. Biol. Med. 2022, 146, 105558.
10. Wang, W.; Chen, H. Predicting miRNA-disease associations based on graph attention networks and dual Laplacian regularized least squares. Brief. Bioinform. 2022, 23, bbac292.
11. Chen, X.; Sun, L.G.; Zhao, Y. NCMCMDA: miRNA–disease association prediction through neighborhood constraint matrix completion. Brief. Bioinform. 2021, 22, 485–496.
12. Zhou, S.; Wang, S.; Wu, Q.; Azim, R.; Li, W. Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression. Comput. Biol. Chem. 2020, 85, 107200.
13. Xuan, P.; Dong, Y.; Guo, Y.; Zhang, T.; Liu, Y. Dual convolutional neural network based method for predicting disease-related miRNAs. Int. J. Mol. Sci. 2018, 19, 3732.
14. Scarselli, F.; Gori, M.; Tsoi, A.C.; Hagenbuchner, M.; Monfardini, G. The graph neural network model. IEEE Trans. Neural Netw. 2008, 20, 61–80.
15. Lu, D.; Li, J.; Zheng, C.; Liu, J.; Zhang, Q. HGTMDA: A Hypergraph Learning Approach with Improved GCN-Transformer for miRNA–Disease Association Prediction. Bioengineering 2024, 11, 680.
16. Zhang, R.; Wang, Z.; Wang, X.; Meng, Z.; Cui, W. Mhtan-dti: Metapath-based hierarchical transformer and attention network for drug–target interaction prediction. Brief. Bioinform. 2023, 24, bbad079.
17. Li, Y.; Guo, Z.; Wang, K.; Gao, X.; Wang, G. End-to-end interpretable disease–gene association prediction. Brief. Bioinform. 2023, 24, bbad118.
18. Gu, P.; Wu, T.; Zou, M.; Pan, Y.; Guo, J.; Xiahou, J.; Peng, X.; Li, H.; Ma, J.; Zhang, L. Multi-head self-attention model for classification of temporal lobe epilepsy subtypes. Front. Physiol. 2020, 11, 604764.
19. Kipf, T.N.; Welling, M. Semi-supervised classification with graph convolutional networks. arXiv 2016, arXiv:1609.02907.
20. Veličković, P.; Cucurull, G.; Casanova, A.; Romero, A.; Liò, P.; Bengio, Y. Graph attention networks. arXiv 2017, arXiv:1710.10903.
21. Wu, Z.; Pan, S.; Chen, F.; Long, G.; Zhang, C.; Yu, P.S. A comprehensive survey on graph neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 4–24.
22. Tang, X.; Luo, J.; Shen, C.; Lai, Z. Multi-view multichannel attention graph convolutional network for miRNA–disease association prediction. Brief. Bioinform. 2021, 22, bbab174.
23. Li, Z.; Zhong, T.; Huang, D.; You, Z.H.; Nie, R. Hierarchical graph attention network for miRNA-disease association prediction. Mol. Ther. 2022, 30, 1775–1786.
24. Zou, H.; Ji, B.; Zhang, M.; Liu, F.; Xie, X.; Peng, S. MHGTMDA: Molecular heterogeneous graph transformer based on biological entity graph for miRNA-disease associations prediction. Mol. Ther.-Nucleic Acids 2024, 23, 102139.
25. Torres, L.H.; Ribeiro, B.; Arrais, J.P. Few-shot learning with transformers via graph embeddings for molecular property prediction. Expert Syst. Appl. 2023, 225, 120005.
26. Zheng, M.; Gao, P.; Zhang, R.; Li, K.; Wang, X.; Li, H.; Dong, H. End-to-end object detection with adaptive clustering transformer. arXiv 2020, arXiv:2011.09315.
27. Zhang, C.; Zhao, Y.; Wang, J. Transformer-based dynamic fusion clustering network. Knowl.-Based Syst. 2022, 258, 109984.
28. Guo, M.; Zhang, Y.; Liu, T. Gaussian transformer: A lightweight approach for natural language inference. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 6489–6496.
29. Li, J.; Zhang, S.; Liu, T.; Ning, C.; Zhang, Z.; Zhou, W. Neural inductive matrix completion with graph convolutional networks for miRNA-disease association prediction. Bioinformatics 2020, 36, 2538–2546.
30. Zhang, H.; Fang, J.; Sun, Y.; Xie, G.; Lin, Z.; Gu, G. Predicting miRNA-disease associations via node-level attention graph auto-encoder. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022, 20, 1308–1318.
31. Ning, Q.; Zhao, Y.; Gao, J.; Chen, C.; Li, X.; Li, T.; Yin, M. AMHMDA: Attention aware multi-view similarity networks and hypergraph learning for miRNA–disease associations identification. Brief. Bioinform. 2023, 24, bbad094.
32. Peng, Y.; Croce, C.M. The role of MicroRNAs in human cancer. Signal Transduct. Target. Ther. 2016, 1, 1–9.
33. Costinean, S.; Zanesi, N.; Pekarsky, Y.; Tili, E.; Volinia, S.; Heerema, N.; Croce, C.M. Pre-B cell proliferation and lymphoblastic leukemia/high-grade lymphoma in Eμ-miR155 transgenic mice. Proc. Natl. Acad. Sci. USA 2006, 103, 7024–7029.
34. Croce, C.M.; Calin, G.A. miRNAs, cancer, and stem cell division. Cell 2005, 122, 6–7.
35. Esquela-Kerscher, A.; Slack, F.J. Oncomirs—microRNAs with a role in cancer. Nat. Rev. Cancer 2006, 6, 259–269.
36. Johnson, S.M.; Grosshans, H.; Shingara, J.; Byrom, M.; Jarvis, R.; Cheng, A.; Labourier, E.; Reinert, K.L.; Brown, D.; Slack, F.J. RAS is regulated by the let-7 microRNA family. Cell 2005, 120, 635–647.
37. Ventura, A.; Young, A.G.; Winslow, M.M.; Lintault, L.; Meissner, A.; Erkeland, S.J.; Newman, J.; Bronson, R.T.; Crowley, D.; Stone, J.R.; et al. Targeted deletion reveals essential and overlapping functions of the miR-17~92 family of miRNA clusters. Cell 2008, 132, 875–886.
38. Tilly, H.; Da Silva, M.G.; Vitolo, U.; Jack, A.; Meignan, M.; Lopez-Guillermo, A.; Walewski, J.; André, M.; Johnson, P.; Pfreundschuh, M.; et al. Diffuse large B-cell lymphoma (DLBCL): ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. Ann. Oncol. 2015, 26, v116–v125.
39. Smith, A.; Crouch, S.; Lax, S.; Li, J.; Painter, D.; Howell, D.; Patmore, R.; Jack, A.; Roman, E. Lymphoma incidence, survival and prevalence 2004–2014: Sub-type analyses from the UK’s Haematological Malignancy Research Network. Br. J. Cancer 2015, 112, 1575–1584.
40. Shankland, K.R.; Armitage, J.O.; Hancock, B.W. Non-Hodgkin lymphoma. Lancet 2012, 380, 848–857.
41. Lawrie, C.H.; Gal, S.; Dunlop, H.M.; Pushkaran, B.; Liggins, A.P.; Pulford, K.; Banham, A.H.; Pezzella, F.; Boultwood, J.; Wainscoat, J.S.; et al. Detection of elevated levels of tumour-associated microRNAs in serum of patients with diffuse large B-cell lymphoma. Br. J. Haematol. 2008, 141, 672–675.
42. Jemal, A.; Siegel, R.; Ward, E.; Murray, T.; Xu, J.; Smigal, C.; Thun, M.J. Cancer statistics, 2006. CA Cancer J. Clin. 2006, 56, 106.
43. Yanaihara, N.; Caplen, N.; Bowman, E.; Seike, M.; Kumamoto, K.; Yi, M.; Stephens, R.M.; Okamoto, A.; Yokota, J.; Tanaka, T.; et al. Unique microRNA molecular profiles in lung cancer diagnosis and prognosis. Cancer Cell 2006, 9, 189–198.
44. Takamizawa, J.; Konishi, H.; Yanagisawa, K.; Tomida, S.; Osada, H.; Endoh, H.; Harano, T.; Yatabe, Y.; Nagino, M.; Nimura, Y.; et al. Reduced expression of the let-7 microRNAs in human lung cancers in association with shortened postoperative survival. Cancer Res. 2004, 64, 3753–3756.
45. Blenkiron, C.; Goldstein, L.D.; Thorne, N.P.; Spiteri, I.; Chin, S.F.; Dunning, M.J.; Barbosa-Morais, N.L.; Teschendorff, A.E.; Green, A.R.; Ellis, I.O.; et al. MicroRNA expression profiling of human breast cancer identifies new markers of tumor subtype. Genome Biol. 2007, 8, 1–16.
46. Xu, F.; Wang, Y.; Ling, Y.; Zhou, C.; Wang, H.; Teschendorff, A.E.; Zhao, Y.; Zhao, H.; He, Y.; Zhang, G.; et al. dbDEMC 3.0: Functional exploration of differentially expressed miRNAs in cancers of human and model organisms. Genom. Proteom. Bioinform. 2022, 20, 446–454.

Figure 1. Overall architecture of DGNMDA.

Figure 2. AUC and AUPR curves for five-fold cross-validation. (a) AUC pair ratio. (b) AUPRC pair ratio.

Figure 3. Comparative graph of multi-source and single-source information.

Figure 4. Top 20 miRNAs associated with the three diseases.

Figure 5. Influence of feature-embedding dimensions on model efficacy.

Figure 6. Impact of gating layer count on model performance.

Figure 7. Effect of GCN layer and attention head counts on performance. (a) AUC pair ratio. (b) AUPRC pair ratio.

Table 1. Comparative evaluation of alternative approaches.

Methods	ACC	F1-Score	Recall	Precision	AUC	AUPRC
NIMCGCN	0.8131	0.8148	0.8220	0.8076	0.8945	0.8926
AGAEMD	0.8502	0.8507	0.8544	0.8481	0.9270	0.9286
HGANMDA	0.8489	0.8481	0.8433	0.8529	0.9265	0.9253
MAGCN	0.8483	0.8473	0.8425	0.8533	0.9245	0.9268
AMHMDA	0.8648	0.8623	0.8539	0.8755	0.9411	0.9403
DGNMDA	0.8773	0.8800	0.8768	0.8896	0.9455	0.9451

Bold values indicate the best performance.

Table 2. Ablation study results for the DGNMDA variants.

Metrics	DGN-A	DGN-B	DGN-C	DGN-D	DGNMDA
AUC	0.9367	0.9398	0.9392	0.9382	0.9455
AUPR	0.9358	0.9374	0.9383	0.9367	0.9451

Bold values indicate the best performance.

Table 3. Multi-source feature experiment.

Metrics	MS+DS	MS+DG	MG+DS	MG+DG	ALL
AUC	0.9406	0.9411	0.9403	0.9421	0.9455
AUPR	0.9399	0.9408	0.9400	0.9418	0.9451

Bold values indicate the best performance.

Table 4. Predicted top 20 miRNAs: highest associations with lymphoma, lung cancer, and breast cancer.

Cancer: Lymphoma
Rank	miRNA	Evidence	Rank	miRNA	Evidence
1	hsa-mir-21	dbDEMC	11	hsa-mir-150	dbDEMC
2	hsa-mir-34a	dbDEMC	12	hsa-mir-29b	dbDEMC
3	hsa-mir-17	dbDEMC	13	hsa-mir-222	dbDEMC
4	hsa-mir-92a	dbDEMC	14	hsa-mir-181a	dbDEMC
5	hsa-mir-145	dbDEMC	15	hsa-mir-29c	dbDEMC
6	hsa-mir-19a	dbDEMC	16	hsa-mir-132	dbDEMC
7	hsa-mir-126	dbDEMC	17	hsa-let-7g	dbDEMC
8	hsa-mir-146a	dbDEMC	18	hsa-mir-200a	dbDEMC
9	hsa-let-7b	dbDEMC	19	hsa-mir-26a	dbDEMC
10	hsa-mir-221	dbDEMC	20	hsa-mir-181b	dbDEMC
Cancer: Lung cancer
Rank	miRNA	Evidence	Rank	miRNA	Evidence
1	hsa-mir-21	dbDEMC	11	hsa-mir-145	dbDEMC
2	hsa-mir-155	dbDEMC	12	hsa-mir-125b	dbDEMC
3	hsa-mir-17	dbDEMC	13	hsa-mir-16	dbDEMC
4	hsa-mir-34a	dbDEMC	14	hsa-mir-29a	dbDEMC
5	hsa-mir-146a	dbDEMC	15	hsa-mir-31	dbDEMC
6	hsa-mir-15a	dbDEMC	16	hsa-mir-122	dbDEMC
7	hsa-mir-223	dbDEMC	17	hsa-mir-150	dbDEMC
8	hsa-mir-200b	dbDEMC	18	hsa-mir-29c	dbDEMC
9	hsa-let-7d	dbDEMC	19	hsa-mir-92a	dbDEMC
10	hsa-mir-106a	dbDEMC	20	hsa-mir-124	dbDEMC
Cancer: Breast cancer
Rank	miRNA	Evidence	Rank	miRNA	Evidence
1	hsa-mir-21	dbDEMC	11	hsa-mir-126	dbDEMC
2	hsa-mir-155	dbDEMC	12	hsa-let-7e	dbDEMC
3	hsa-mir-17	dbDEMC	13	hsa-let-7f	dbDEMC
4	hsa-mir-29a	dbDEMC	14	hsa-mir-31	dbDEMC
5	hsa-mir-205	dbDEMC	15	hsa-mir-210	dbDEMC
6	hsa-mir-145	dbDEMC	16	hsa-mir-34c	dbDEMC
7	hsa-mir-200c	dbDEMC	17	hsa-mir-206	dbDEMC
8	hsa-mir-429	dbDEMC	18	hsa-mir-27a	dbDEMC
9	hsa-mir-18a	dbDEMC	19	hsa-mir-125b	dbDEMC
10	hsa-mir-19b	dbDEMC	20	hsa-mir-199a	dbDEMC

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

