A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation

Dong, Tim; Llewellyn, Rhys D.; Hezzell, Melanie; Angelini, Gianni D.

doi:10.3390/biomedicines13061323

Open AccessArticle

A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation

by

Tim Dong

^1,*

,

Rhys D. Llewellyn

²,

Melanie Hezzell

³

and

Gianni D. Angelini

¹

Bristol Heart Institute, Translational Health Sciences, University of Bristol, Bristol BS2 8HW, UK

²

Pharmacy Department, Liverpool Heart and Chest Hospital, Thomas Dr, Liverpool L14 3PE, UK

³

Bristol Veterinary School, University of Bristol, Langford House, Langford, Bristol BS40 5DU, UK

^*

Author to whom correspondence should be addressed.

Biomedicines 2025, 13(6), 1323; https://doi.org/10.3390/biomedicines13061323

Submission received: 17 March 2025 / Revised: 19 May 2025 / Accepted: 24 May 2025 / Published: 28 May 2025

(This article belongs to the Special Issue Role of Natural Product in Cardiovascular Disease—2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

Background: The treatment and management of atrial fibrillation poses substantial complexity. A delicate balance in the trade-off between the minimising risk of stroke without increasing the risk of bleeding through anticoagulant optimisations. Natural compounds are often associated with low-toxicity effects, and their effects on atrial fibrillation have yet to be fully understood. Whilst deep learning (a subtype of machine learning that uses multiple layers of artificial neural networks) methods may be useful for drug compound interaction and discovery analysis, graphical processing units (GPUs) are expensive and often required for deep learning. Furthermore, in limited-resource settings, such as low- and middle-income countries, such technology may not be easily available. Objectives: This study aims to discover the presence of any new therapeutic candidates from a large set of natural compounds that may support the future treatment and management of atrial fibrillation anywhere using a low-cost technique. The objective is to develop a deep learning approach under a low-resource setting where suitable high-performance NVIDIA graphics processing units (GPUs) are not available and to apply to atrial fibrillation as a case study. Methods: The primary training dataset is the MINER-DTI dataset from the BIOSNAP collection. It includes 13,741 DTI pairs from DrugBank, 4510 drug compounds, and 2181 protein targets. Deep cross-modal attention modelling was developed and applied. The Database of Useful Decoys (DUD-E) was used to fine-tune the model using contrastive learning. This application and evaluation of the model were performed on the natural compound NPASS 2018 dataset as well as a dataset curated by a clinical pharmacist and a clinical scientist. Results: the new model showed good performance when compared to existing state-of-the-art approaches under low-resource settings in both the validation set (PR AUC: 0.8118 vs. 0.7154) and test set (PR AUC: 0.8134 vs. 0.7206). Tenascin-C (TNC; NPC306696) and deferoxamine (NPC262615) were identified as strong natural compound interactors of the arrhythmogenic targets ADRB1 and HCN1, respectively. A strong natural compound interactor of the bleeding-related target Factor X was also identified as sequoiaflavone (NPC194593). Conclusions: This study presented a new high-performing model under low-resource settings that identified new natural therapeutic candidates for pharmacological cardioversion and anticoagulation.

Keywords:

deep learning; bioinformatics; drug discovery; transcriptomics; proteomics; atrial fibrillation; machine learning

1. Introduction

Existing bleeding risk scores are notorious for poor prognostic capabilities. More recently, studies have examined the potential of using novel biomarkers for bleeding risk prognosis [1]. This is important not only for guiding oral anticoagulant administration in terms of dose for atrial fibrillation (AF) patients but also for preventing stroke since risk factors for bleeding and stroke often overlap. The patient subgroup of key importance is the low stroke risk group (CHA₂DS₂-VASc score 1), for which the appropriate anticoagulant dosage is most difficult to determine [1]. Overestimation of bleeding risk leads to anticoagulants not being sufficiently used, leading to a higher proportion of mortality and morbidity in AF patients, including stroke, which could have been avoided otherwise [2]. A study by Siegbahn et al. applied a Proximity Extension Assay to identify key biomarkers differentially regulated in bleeding: growth differentiation factor-15 (GDF-15), high-sensitivity cardiac troponin T (cTnT-hs), and seven novel biomarkers: osteopontin, ephrin type-B receptor 4, tumour necrosis factor (TNF) receptor 1, TNF receptor 2, soluble urokinase plasminogen activator receptor, TNF-related apoptosis-inducing ligand receptor 2, and osteoprotegerin from a large set of 268 unique protein biomarkers in plasma samples [3]. The increased levels of cytokine GDF-15 have been shown to be associated with cellular stress, tissue damage, and heightened risk of bleeding in AF patients. This was also shown in various other disease groups such as cerebral haemorrhage, acute coronary syndrome, pulmonary embolism, etc. [4]. However, the limitation of GDF-15 [5] and other biomarkers such as neutrophil-to-lymphocyte ratio (NLR) [6] is that it modifies the risk of not only bleeding but also a range of other cardiovascular and non-cardiovascular outcomes, including stroke, mortality, heart failure, and cancer [4]. Newer scores need to focus on biomarkers with simpler, more specific (direct) mechanisms to be incorporated into antithrombotic therapy guidelines.

The ABC bleeding risk score is an existing score of the risk of bleeding in AF patients and consists of two clinical risk factors (age and history of bleeding) and three biomarkers (GDF-15 (a marker of oxidative stress), cTnT-hs (marker of myocardial injury), and haemoglobin) [7]. This ABC score outperformed two other bleeding risk scores, HAS-BLED and ORBIT, achieving a c-index of 0.71 in the RE-LY external validated trial [7]. In terms of the use of artificial intelligence (AI) in the management of AF, studies have focused mainly on the use of AI for screening of AF rather than other aspects such as biomarker discovery [8,9].

1.1. Current Clinical Drug Practice for AF

NICE British National Formulary (BNF) recommends that cardioselective drugs (specifically affecting receptors in the heart) such as bisoprolol be used for AF treatment [10]. These drugs help reduce unwanted systemic effects by targeting cardiac cells. Alternative antiarrhythmic drugs include the potassium and β-receptor blocker sotalol, which does not interact with warfarin, although it is reported to have minor interactions with aspirin [11]. Flecainide, a sodium channel blocker is also used and does not appear to have any significant interactions with anticoagulant medications. Digoxin, is indicated for maintenance of atrial fibrillation or flutter and is used mostly to slow atrioventricular nodal conduction through parasympathomimetic effects, thereby decreasing the ventricular response rate and reducing the impact of the irregular rhythm. Digoxin may exacerbate the risk of bradycardia if co-administered with beta-blockers [11].

1.2. Anticoagulants for Managing Risk of Clotting in AF Patients

Whilst anticoagulants help to manage the risk of stroke and blood clot formation in patients with AF, these increase the risk of bleeding. Therefore, a fine balance needs to be achieved. Warfarin is a vitamin K antagonist anticoagulant and strongly affects anticoagulant interaction pathways [12]. One limitation of warfarin is that there needs to be continuous monitoring of dosage and safety due to its interaction with other drugs and food, as well as the variable natural response to warfarin treatment and impact on International Normalised Ratio (INR) [13]. Direct-acting oral anticoagulants (DOAC) such as rivaroxaban [14] and apixaban, both factor Xa inhibitors, are approved by NICE for the management of clotting risks in humans [15] and are increasingly preferred over warfarin, except for patients with recorded allergies or metallic heart valves. They avoid the need for routine laboratory monitoring and dosage adjustments. Whilst clopidogrel is used as the primary drug for preventing thrombotic events in veterinary cardiac settings (most commonly in cats), clopidogrel is considered a second-line treatment inpatients who cannot take warfarin [16].

1.3. Aim

This study aims to discover the presence of any new therapeutic candidates from a large set of natural compounds that may support the future treatment and management of atrial fibrillation using a low-cost technique. Graphics processing units (GPUs) are expensive and often required for deep learning. However, in low-resource settings, for example, such as low- and middle-income countries, such resources may not be a commodity [17]. The objectives were to develop a deep learning approach to apply to atrial fibrillation as a case study using a low-cost technique where suitable high-performance NVIDIA graphics processing units (GPUs) are not required and to identify candidate therapeutic compounds that could be used everywhere.

2. Methods

2.1. Dataset and Materials

The primary training dataset is the MINER drug–target interaction (DTI) dataset from the BIOSNAP collection (Table 1) [18]. It includes 13,741 DTI pairs from DrugBank, 4510 drug compounds, and 2181 protein targets. The Database of Useful Decoys (DUD-E) is used to fine-tune the model using contrastive learning. The DUD-E dataset analysed consisted of 57 protein targets and active compounds that are known to interact with these targets as well as decoys that are known not to bind the targets but have very similar chemical structures [19].

This evaluation of the model was performed on the Natural Product Activity & Species Source (NPASS) 2018 dataset as part of the case study (Table 2) as well as a dataset curated by a clinical pharmacist and a clinical scientist. The selection of clinical compounds and targets was based on the existing guidelines; the British National Formulary (BNF); commonly used compounds in the United Kingdom healthcare system; clinical expertise/prioritisation from the clinical pharmacist, cardiologist, and cardiac surgeon; as well as previous review work conducted, focusing on relevance for the treatment and management of AF [20]. The dataset curated by the clinical pharmacist consisted of (i) compound dataset: 12 antiarrhythmic drugs and 6 anticoagulant drugs commonly used in hospitals in the United Kingdom; (ii) target dataset: the target dataset consisted of representative targets whose genes are typically up-regulated in atrial fibrillation, consisting of beta-1 adrenergic receptor (ADRB1 gene) and potassium/sodium hyperpolarisation-activated cyclic nucleotide-gated channel 1 (HCN1 gene [20]) for antiarrhythmic drug-related targets and Factor Xa as the anticoagulation target. The antiarrhythmic drugs for AF were further divided into four main classes as well as other miscellaneous classes to visualise any relationships. Further subclassifications are beyond the scope of this work.

2.2. Target Featurisation

The chemical compounds were transformed into features using the Morgan fingerprint method. The protein sequences were transformed into features using the Skipgram Neural Network embedding approach as described by ProtVec [21]. The rationale for using this embedding approach over that of ProtBERT as in [22] is that the latter requires CUDA (NVIDIA GPU) to be available and configured to the appropriate version of the torch library on the device, which is not always available as is the case in this study. In addition, ProtBERT requires enormous computational power and requires continuous rapid connectivity to platforms such as Hugging Face, which may also pose challenges.

2.3. Modelling Approach

A Schematic overview of the approach is provided in Figure 1A below.

The embeddings

E_{i}

=

f_{i} (X_{i})

of the target (i = 1) and compound (i = 2) are entered separately into 3 layer feed-forward projection modules:

L_{1}^{i} = R e L U (W_{1}^{(i) T} E_{i} + b_{1}^{i})

L_{2}^{i} = R e L U (W_{2}^{(i) T} L_{1} + b_{2}^{i})

(1)

L_{3}^{i} = R e L U (W_{3}^{(i) T} L_{2} + b_{3}^{i})

where W represents the weights and b represents the bias in each layer. Subsequently, a dual-headed cross-modal self-attention component is added:

A = [σ (\frac{Q^{1} K^{2}}{\sqrt (p / 2)}) V^{2}, σ (\frac{Q^{2} K^{1}}{\sqrt (p / 2)}) V^{1}] W_{G}

(2)

where

p = 1024

,

W_{G}

is the weight capturing the global contextual information across attention heads,

Q^{1}

is the query matrix calculated using the target embedding, and

K^{2}

and

V^{2}

are calculated using the drug compound embeddings in the first attention head. In the second head, this is reversed, where

Q^{2}

is the query matrix calculated using the drug compound embedding,

K^{1}

and

V^{1}

are calculated using the target embeddings. This composition is used to maximise both the local and global information captured across the two modalities.

Subsequently, residuals were added and layer normalisation was performed to propagate gradient and normalise features. Finally, a single-layer feed-forward linear classifier is added to conduct the classification. Binary cross entropy (BCE) loss with the sigmoid function was used for the optimisation process rather than standard BCE loss to ensure values are normalised strictly between 0 and 1 during this process.

W_{4}^{T} (\frac{\hat{A} - μ_{\hat{A}}}{\sqrt{σ_{\hat{A}}^{2} + ε}} α + β) + b_{4}

(3)

where

\hat{A} = E_{1} + E_{2} + A, μ_{\hat{A}}

is the empirical average of

\hat{A}

,

σ_{\hat{A}}^{2}

is the variance of

\hat{A}

, and

α

and

W_{4}^{T}

are the weights of the normalisation and classification layers, respectively.

β

and

b_{4}

are the bias of the normalisation and classification layers, respectively.

ε

is a small constant in the denominator that reduces the likelihood of numerical instability.

The above modelling process was applied to the BIOSNAP as initial pre-training. This was then followed by contrastive learning as per [22] using the DUD-E training dataset. The Triplet (contrastive) loss function was used in order to perform the contrastive learning component of the training process. For each drug compound interaction pair, 50 non-interacting decoys were sampled randomly to produce the triplets, and the loss values were averaged across each triplet set. To explain in more detail, the DUD-E dataset of positive samples with random samples of negative sample combinations whilst used to facilitate the contrastive learning were only applied to the MINER-DTI training dataset such that it aims to enhance the contrast between the MINER-DTI training dataset’s positive and negative sample pairs for drug–target interactions, whilst maximising the likelihood of interaction between the active compound in the DUD-E dataset and the target in the MINER-DTI training dataset. In this sense, the DUD-E’s decoy non-active samples act like an augmentation to the negative samples in the MINER-DTI training dataset.

2.4. Representation Approach

Due to projection to the latent space resulting in negative embedding values, which can have arbitrary spatial meanings in terms of its negativity (i.e., the negative spatial orientation is not an indication of dissimilarity by itself), the cosine similarity was not used, but Euclidean distance was used instead, which cannot be negative. Due to the curse of dimensionality, Euclidean distances may not be robust in directly calculating the distance in non-linear distances across high dimensionality data functions; hence, the PaCMAP approach was applied to preserve global and local structure of data in lower dimensional space (i.e., two-dimensional projection space) before calculating the Euclidean distance [23]. This distance metric is then a measure of the predicted extent of interaction between targets and compounds with smaller distances representing higher interactions. The model embeddings of the target, active, and natural compounds were projected and distances calculated using this approach. Depending on the target, the active compounds were changed to either antiarrhythmics or anticoagulants.

2.5. Model Evaluation

The new model was benchmarked against the non-contrastive model in [22], whereby the only difference is that the ProtBERT featurisation of targets is replaced with using Skipgram Neural Network embedding instead. As the process is still very computationally costly and the dataset is large, training, validation, and test split evaluation methodology was used instead of cross-validation [24]. Precision–Recall Area Under the Curve (PR AUC) was used to evaluate the performance of models on the validation and test datasets. Both models with and without contrastive learning were evaluated. However, since only the overall precision–recall performance is of interest in this study, as it does not concern with diagnostics and risk prediction specifically, the precision curve plot is beyond the scope of the current study. Due to the computational cost required to run the contrastive learning model, consideration of analysis using any evaluation metrics (including uncertainty quantification) are beyond the scope of the current study.

Python version 3.9 and torch version 2.1.2 were used for the analysis.

3. Results

The ConPLex achieved a satisfactory performance of 0.7154 and 0.7206 on the validation and test dataset, respectively (Table 3). The New model demonstrated a higher magnitude of performance in both the validation (0.8140 vs. 0.7154) and test set (0.8369 vs. 0.7206).

Using the highest-performing model from the above analysis, both the ConPLex and New models were supplemented with contrastive learning. Whilst the regularisation effect decreased the performance of the model slightly compared to its non-contrastive counterpart, it still outperformed the ConPLex model without contrastive learning in both the validation set (0.8118 vs. 0.7154; Table 4) and test set (0.8134 vs. 0.7206). In addition, it also outperformed the ConPLex model with contrastive learning in both the validation set (0.8118 vs. 0.6999; Table 4) and test set (0.8134 vs. 0.6943).

Table 5 shows that as expected, ADRB1 interacts mostly strongly with bisoprolol. In the projection space (Figure 1B), it can be seen that Sotalol is also closely positioned to the target compared to many of the other antiarrhythmics. Lidocaine and flecainide were found to interact with ADRB1, although these are likely to be interactions through closely associated mechanisms rather than direct binding.

Interestingly, class I drugs can be seen to form two clusters, with one cluster being lidocaine and flecainide and the other cluster composed of quinidine, procainamide, and mexiletine, suggesting potential similarities in interaction mechanisms.

Although the top interactors for ADRB1 are not well known in the existing literature, tenascin-C (TNC; NPC306696) was identified as a strong natural compound interactor of ADRB1 (Figure 2B; Supplementary Materials Table S1). An ITScore-PP score of −280.142 using the MDockPP tool was obtained as a surrogate of a very high binding affinity for these compounds in this arrangement. The natural compounds did not interact with the ADRB1 target more strongly overall (p = 0.792). However, the tail of the grey distribution shows that some natural compounds demonstrated stronger binding scores than those of commonly used clinical antiarrhythmic drugs (Figure 2A).

As expected, lidocaine showed strong interaction with the HCN1 protein (Supplementary Materials Table S2). Interestingly, deferoxamine (NPC262615) was identified as the natural compound that interacts most strongly with the HCN1 protein (Supplementary Materials Table S3).

Factor Xa was found to bind most strongly with apixaban as expected (Table 6). Warfarin showed less intense binding in contrast (Figure 3).

The third-strongest natural compound interactor of Factor Xa was identified as sequoiaflavone (NPC194593; Table 7). Figure S1 shows that the attention matrix for each of the cross-modal attention heads captures different sparse sets of information and hence may complement each other in terms of the decision-making process. The natural compounds did not interact with the Factor Xa target more strongly overall (p = 0.658). However, the tail of the grey distribution shows that some natural compounds demonstrated stronger binding scores than those of commonly used clinical anticoagulants, though this difference is marginal (Figure S2).

Supplementary Materials Table S4 shows that both the new model and the ConPLex model using the approach described here outperformed the computational time and hardware cost of the approach described in Singh et al. (ConPLex with GPU with ProtBERT featurisation) [22].

Whilst the PR AUC metric used earlier is particularly suited in scenarios of class imbalance, focusing only on the sample pairs with positive class for interaction, the BIOSNAP validation and test dataset were relatively balanced in terms of interaction pairs/number of non-interaction pairs (1396/1352 and 2770/2727, respectively). Hence, the Area Under the Receiver Operating Characteristic Curve (ROC AUC) performance metric, which is suitable for use under a balanced class distribution, was also assessed as a sensitivity analysis to further validate the reliability of the approach. Using this metric, the performance obtained showed a similar relationship to that of the PR AUC, with the new model outperforming the ConPLex model under both scenarios with and without contrastive learning (Supplementary Materials Tables S5 and S6).

As a sensitivity analysis, docking was performed for sequoiaflavone and apixaban against Factor Xa using Autodock Vina. Sequoiaflavone demonstrated a slightly higher binding strength for Factor Xa compared to apixaban (ΔG −5.59 vs. −5.569 kcal/mol; Figure 4).

4. Discussion

Existing work in DTI prediction has focused on areas beyond atrial fibrillation and there has been limited consideration of the use of natural compounds and dataset curated by clinical pharmacist in the analytical process for this specific purpose. For example, Wong et al. have developed a link prediction (predicting interactions with a network) method using Gaussian kernel-based network similarity matrices of miRNA and lncRNA to feed into a linear optimisation algorithm for predicting their interactions [25]. In addition, Wang et al. developed an innovative approach by extracting Graph attention network attention weights for input into an optimised deep learning algorithm, called the ensemble deep RVFL network (edRVFL), for learning from intermediate feature forms rather than high-level features [26]. The approach was used to effectively fuse heterogeneous multi-disease phenotypes with circular RNA (cRNA) to predict their interactions. Other DTI prediction studies have focused on techniques requiring the use of CUDA (NVIDIA GPU) to be available and configured to the appropriate version of the torch library on the device, which is not always available, as is the case in this study, or may be limited in terms of model explainability in the latent space [27,28]. This study aims to bridge this gap, focusing not only on the development of new improved modelling approaches but also the application of such model for drug target interaction prediction and therapeutic compound discovery.

This study identified sequoiaflavone as a potential anticoagulant candidate for targeting Factor Xa. Sequoiaflavone is one of the five flavonoids present in the ginkgo biloba plant native to East Asia and may have anti-inflammatory [29] and cardioprotective effects through the inhibition of phosphodiesterases (PDEs) as well as anticancer effects through down-regulation of the PI3K/AKT signalling pathway [30]. The presence of phenolic hydrogens allows for these molecules to be donated to support the scavenging of reactive oxidative species (ROS) produced during inflammation (Figure 5A). The inhibition of PDE by sequoiaflavone could also decrease the hydrolysis of cyclic nucleotides [31] Cyclic adenosine 3′,5′-monophosphate (cAMP) and cyclic guanosine 3′,5′-monophosphate (cGMP), required for platelet aggregation [32]. It is interesting to note that in a previous study, sequoiaflavone was included as an ingredient in the ginkgo biloba extract (GBE50) that enhanced the antiplatelet effects of aspirin synergistically, reducing the effects of platelet aggregation [33]. Further clinical trials would be required to assess the independent effectiveness of sequoiaflavone in vivo for managing the risk of stroke and blood clot formation in animals and patients with AF, whilst minimizing the risk of bleeding. However, the parent drug Bio-Biloba—also containing the other flavonoids using ginkgo biloba, available in film-coated tablet form—is already approved by Medicines and Healthcare products Regulatory Agency (MHRA) and can readily be assessed further in clinical trials.

Deferoxamine is an iron chelator traditionally used to treat iron overload, transfusion-dependent anaemias, and chronic kidney disease (CKD)-related aluminium toxicity [34]. In parallel, it has been shown that increased accumulation of intra-cellular iron levels activates ferroptosis (produces ROS) and the Fenton reaction [34] (produces hydroxyl radicals) can lead to arrhythmia (including AF) development through ROS-induced ion channel remodelling, myocardial fibrosis, and mitochondrial dysfunction [35]. Previous studies have also shown that mycolactone, a toxin produced in bacteria, results in the hyperpolarisation of dorsal root ganglion (DRG) neurons [36]. In parallel, studies have shown that thoracic DRG depolarisation reduced ventricular arrhythmogenicity [37]. Supporting the results of this study, it was demonstrated previously that deferoxamine inhibited mycolactone-mediated cytotoxicity [38] and hence the associated understimulation through hyperpolarisation of DRG. Since deferoxamine is already approved for clinical use in the United States and other countries such as the United Kingdom (Figure 5B), it should be possible to conduct further clinical trials potentially repurposing these for in vivo testing of efficacy for the treatment of cardiac arrhythmia. Deferoxamine is typically administered through subcutaneous or intra-muscular injections. Hence, the drug is typically combined with the mesylate anion to increase solubility and hygroscopicity. A similar iron chelator that can be administered orally is also available, i.e., deferasirox, although such research has mainly focused on cancer treatment so far [39].

Interestingly, tenascin-C (TNC; NPC306696) was identified as a strong natural compound interactor of ADRB1. TNC is a large extracellular matrix glycoprotein characterised as a matricellular protein that is highly expressed during healthy and pathological tissue remodelling [40]. In particular, it can exert both harmful (proinflammatory and profibrotic effects) and beneficial effects in damaged hearts depending on its surrounding signalling factors. In particular, it is able to bind to more than 25 different proteins, including platelet-derived growth factor (PDGF), and hence have a wide range of functions, including oligomerisation, induction of mitogenic responses, cell migration, cell attachment, cell spreading, focal adhesion, cell survival, matrix assembly, and protease and proinflammatory cytokine synthesis [41]. TNC serum level has also been suggested as a prognostic marker of cardiac disease due to its inflammatory-related effects for a wide range of heart diseases including AF [42]. In terms of mechanism, it has been suggested that it works together with proinflammatory cytokines such as Interleukin-6 in a positive feed-back loop to enhance inflammation and promote myocardial fibrosis [42]. In addition, is has been found that in AF patients, the amount of TNC is correlated with the severity of atrial dilation [42]. While studies to date have mainly focused on targeting this TNC’s FNIII domain using antibodies and antagonists in cancer treatments [41], future therapeutic targeting to lower TNC levels could provide potential effective treatments for AF, but further evaluation would be required [43].

This study supported the results of previous studies that showed ADRB1 is inhibited by lidocaine with pH-associated effects on binding [44]. Flecainide was also identified previously as strongly interacting for ADRB1, with its potency effect for treating atrial fibrillation varying based on different genotypes [45]. However, the effect is indirect through the ADRB1 activation that facilitates the augmented inhibition of flecainide on sodium channels. NICE British National Formulary (BNF) recommends that cardioselective drugs (specifically affecting channels in the heart) such as bisoprolol (identified as the strongest-interacting drug for ADRB1) require less frequent dosing, since their duration of action is longer [10]. Sotalol, a non-cardioselective drug, was found to interact less strongly in this study, though different variants may interact differently [46]. In addition, sotalol is mostly used for its potassium channel blocking effects and hence may have comparatively lower effects on ADRB1. However, potassium channels were excluded from this study since potassium channels typically have decreased expression in atrial fibrillation patients [20]. Hence, further blockade of these channels may not always be beneficial. These drugs can be used in conjugation with digoxin (though used less nowadays) to control the ventricular response in atrial fibrillation [10].

Amiodarone, for example, is approved for the treatment of AF in both UK and European guidelines where other drugs are either not efficacious or are contra-indicated [47]. In this study, it was identified as a drug with low interaction effects to atrial fibrillation targets, potentially explaining why it might be helpful in scenarios, e.g., where drug interactions could lead to contra-indication. Amiodarone is an antiarrhythmic drug, classified as a potassium channel blocker, although it also has sodium-, calcium-, and β-receptor-blocking activity. Amiodarone is used to chemically cardiovert AF patients to a normal sinus rhythm. In terms of interactions with anticoagulant drugs, amiodarone increases the effect of anticoagulants such as warfarin through the inhibition of coumarin [12].

Surprisingly, despite rivaroxaban being approved by NICE, it was shown to demonstrate lower interaction with Factor X than warfarin, which is now decreasingly used. Nonetheless, clinical trials showed that rivaroxaban actually increased the risk of major bleeding compared to the use of blood-thinning aspirin only group in coronary artery and peripheral artery disease patients [48]. One additional limitation of these novel oral anticoagulants (NOACs) alone with dabigatran is that whilst they do not require dosage monitoring as with warfarin, they may have decreased persistence in usage by patients, leading to worse clinical outcomes.

Previous studies using the DUD-E dataset have been affected by limitations in the DUDE dataset in terms of limited chemical space, analogue bias, and decoy selection bias [49]. The decoys are small molecules that are known to not bind the target yet share physicochemical characteristics with the actual interacting compound for each target, and this could have a selective bias if manual selection of a subset of decoys is used for inclusion should that selection be carried out in a systematic manner. However, here, the random nature of the decoy sampling process mitigates such bias. Unlike other studies such as that in Chen et al. [49], which considers the DUD-E dataset as the sole dataset for training and test evaluation, the DUD-E dataset here is used to support the training process conducted on the MINER-DTI BIOSNAP dataset through contrastive learning, reducing any likelihood of restrictions to the chemical space exploration by the negative samples of non-interacting drug–target pairs.

In this specific study, there were no overestimations observed as a result of using the DUD-E dataset contrary to other studies [49], perhaps due to the difference in methodology of contrastive learning applied here. Instead, the use the of DUD-E dataset with contrastive learning showed a slight regularisation-like effect where the prediction performance decreased slightly in the trade-off to prevent overfitting. Future studies should also aim to further validate this approach on other datasets and problems to further explore the generalisability of this technique.

5. Future Work and Limitations

Although the new model holds potential for drug discovery in the scenarios considered, more research is needed to validate the finding in terms of experimentally using in vitro and eventually in vivo studies. For example, the SMILES structure for tenascin-C was not available and there is currently no easy way to conduct docking for protein-to-protein interactions. For example, the HADDOCK server, whilst potentially useful for protein–protein complex structure predictions, requires user input of known actively interacting residues [50], which is difficult to provide for novel compounds. In addition, these approaches, including, e.g., ClusPro, do not provide affinity estimates but rather less easy-to-interpret scoring methods such as cluster energy scores [51]. MDockPP can provide an intuitive ITScore-PP score that has a correlation of 0.71 in relation to binding affinity [52]. However, this approach is limited in that it takes about a day to generate the results, and further work is also required to improve the correlation to binding affinity. Nonetheless, future studies should aim to further research into approaches for docking across protein-to-protein interactions as well as conducting in vitro analyses to further validate the findings in this study. Future work may also aim to further improve upon existing available approaches such as Convolutional Neural Network based approaches, DeepDTA [27] and MolTrans [28], potentially incorporating knowledge of relationships in the substructural elements across compounds and target on top of the existing sample-level representation here, as part of a multi-scale approach. Toxicity studies have not been conducted and may be required to further understand the implications of the new compounds identified in terms of biological effects in animals and humans. Specifically, the inclusion of toxicity screening as well as absorption, distribution, metabolism, excretion, and toxicity (ADMET) of the newly identified compounds could further increase the clinical relevance of the current findings. Future work should also consider the application of the methodology developed herein to other domains and datasets. For example, it would be interesting to assess the approach here to predict miRNA–lncRNA interactions and disease–cRNA interactions [25,26]. Future work should also assess the interactions among the new antiarrhythmic drugs and the anticoagulant compounds in order to further ascertain how well the compounds fit into existing clinical workflows. In addition, future studies should consider Matthews Correlation Coefficient (MCC) and Balanced Accuracy (BA) metrics, which may also be useful for assessing imbalanced datasets. Future study should also further ascertain whether the regularisation-like effect of contrastive learning can help to mitigate against the correlation effects of similar binders in relation to analogue bias. While pre-trained large models such as ProtBERT are beyond the scope of the current study, future work should assess their potential effect on performance when combined with the methodologies in this study. In terms of thresholding activity (e.g., functional assay) datasets to consider active and inactive categories, this was considered where online functional assay datasets from ChEMBL were considered in relation to the drug compounds and targets. However, it was noted that there was a high level of missingness such that this was challenging to use without supplementation with experimentally assessed activity values, for which such resources were not available in this study. Although additional datasets such as the natural compound dataset have been considered here, future studies should aim to develop better linkage methods across experimental studies to aggregate activity-related datasets through wider collaborations, such as multi-institutional studies. In terms of future directions, it has also been suggested that improved assessment of scenarios for resuming anticoagulants, or alternative left atrial appendage (LAA) occlusion as well as new anticoagulants that inhibit Factor XI, would likely be beneficial [53]. The Factor XI-inhibiting drug abelacimab is currently being evaluated in clinical trials and it may therefore be worth evaluating novel candidate markers against this new drug in future studies of this kind [54].

6. Conclusions

The treatment and management of atrial fibrillation pose substantial complexity in terms of the delicate balance in the trade-off between the minimising risk of stroke without increasing the risk of bleeding through anticoagulant optimisations. This study presented a new high-performing model under low-resource settings that identified new natural therapeutic candidates for pharmacological cardioversion and anticoagulation as part of a case study.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biomedicines13061323/s1, Table S1: The top 10 natural products that interact with ADRB1; Table S2: The interaction of antiarrhythmic drugs with HCN1; Table S3: The top 10 natural products that interact with HCN1; Table S4: Evaluation of computational time and hardware cost performance of base models without contrastive learning (CL) in minutes against comparative ConPLex models here and that with GPU in original Singh et al. [22] paper; PR AUC: Precision-Recall Area Under the Curve; Table S5: Evaluation of performance of base models without contrastive learning; ROC AUC: Area Under the Receiver Operating Characteristic Curve; Table S6: Evaluation of performance of the best performing model with contrastive learning; ROC AUC: Area Under the Receiver Operating Characteristic Curve; Figure S1: Visualisation of the attention weights for the interaction between natural compounds and Factor X. The left and right sides show the attention matrices for each of the cross modal attention heads. The rows represent the samples in the natural compound dataset and columns represent the (

p / 2

= 512) dimension of the attention head parameter; Figure S2: Using a violin plot, the distribution of interaction scores for Factor X against the clinical anticoagulants (blue) and natural compounds (grey) are shown; p-value shows results from one-sided t test.

Author Contributions

Conceptualization, T.D. and M.H.; Methodology, T.D.; Software, T.D.; Validation, T.D.; Formal analysis, T.D.; Investigation, T.D., R.D.L., M.H. and G.D.A.; Resources, R.D.L., M.H. and G.D.A.; Data curation, T.D. and R.D.L.; Writing—original draft, T.D.; Writing—review & editing, T.D., R.D.L., M.H. and G.D.A.; Visualization, T.D.; Supervision, G.D.A.; Project administration, G.D.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The BIOSNAP dataset is available at: https://snap.stanford.edu/biodata/ (accessed on 27 December 2024) and the DUD-E dataset is available at https://dude.docking.org/targets (accessed on 27 December 2024). The natural compound dataset is available at: https://bidd.group/NPASS/ (accessed on 5 January 2025).

Conflicts of Interest

The authors declare no conflict of interest.

References

Berg, D.D.; Morrow, D.A. Improving Prediction of Anticoagulant-Related Major Bleeding in Atrial Fibrillation: The Search for New Biomarkers. J. Thromb. Haemost. 2021, 19, 2674–2676. [Google Scholar] [CrossRef] [PubMed]
Berg, D.D.; Ruff, C.T.; Morrow, D.A. Biomarkers for Risk Assessment in Atrial Fibrillation. Clin. Chem. 2021, 67, 87–95. [Google Scholar] [CrossRef] [PubMed]
Siegbahn, A.; Lindbäck, J.; Hijazi, Z.; Åberg, M.; Alexander, J.H.; Eikelboom, J.W.; Lopes, R.D.; Pol, T.; Oldgren, J.; Granger, C.B.; et al. Multiplex Protein Screening of Biomarkers Associated with Major Bleeding in Patients with Atrial Fibrillation Treated with Oral Anticoagulation. J. Thromb. Haemost. 2021, 19, 2726–2737. [Google Scholar] [CrossRef] [PubMed]
López-Gálvez, R.; Rivera-Caravaca, J.M. Growth Differentiation Factor 15 in Cardiovascular Diseases: Predicting Bleeding after Cardiac Surgery and Beyond That! Thromb. Haemost. 2022, 122, 657–660. [Google Scholar] [CrossRef]
Liu, W.; Bu, H. GDF-15: A Biomarker-Based Prediction for Bleeding-Cardiovascular Death. Eur. J. Intern. Med. 2022, 96, 125. [Google Scholar] [CrossRef]
Fagundes, A.; Ruff, C.T.; Morrow, D.A.; Murphy, S.A.; Palazzolo, M.G.; Chen, C.Z.; Jarolim, P.; Antman, E.M.; Braunwald, E.; Giugliano, R.P. Neutrophil-Lymphocyte Ratio and Clinical Outcomes in 19,697 Patients with Atrial Fibrillation: Analyses from ENGAGE AF- TIMI 48 Trial. Int. J. Cardiol. 2023, 386, 118–124. [Google Scholar] [CrossRef]
Hijazi, Z.; Oldgren, J.; Lindbäck, J.; Alexander, J.H.; Connolly, S.J.; Eikelboom, J.W.; Ezekowitz, M.D.; Held, C.; Hylek, E.M.; Lopes, R.D.; et al. The Novel Biomarker-Based ABC (Age, Biomarkers, Clinical History)-Bleeding Risk Score for Patients with Atrial Fibrillation: A Derivation and Validation Study. Lancet 2016, 387, 2302–2311. [Google Scholar] [CrossRef]
Harmon, D.M.; Sehrawat, O.; Maanja, M.; Wight, J.; Noseworthy, P.A. Artificial Intelligence for the Detection and Treatment of Atrial Fibrillation. Arrhythm. Electrophysiol. Rev. 2023, 12, e12. [Google Scholar] [CrossRef]
Sehrawat, O.; Kashou, A.H.; Noseworthy, P.A. Artificial Intelligence and Atrial Fibrillation. J. Cardiovasc. Electrophysiol. 2022, 33, 1932–1943. [Google Scholar] [CrossRef]
Reid, J.; Duker, G.; Almgren, O.; Nerme, V. (+)-Sotalol Causes Significant Occupation of Beta-Adrenoceptors at Concentrations That Prolong Cardiac Repolarization. Naunyn Schmiedebergs Arch. Pharmacol. 1990, 341, 215–220. [Google Scholar] [CrossRef]
Konieczny, K.M.; Dorian, P. Clinically Important Drug–Drug Interactions Between Antiarrhythmic Drugs and Anticoagulants. J. Innov. Card. Rhythm. Manag. 2019, 10, 3552–3559. [Google Scholar] [CrossRef] [PubMed]
BNF Content Published by NICE. Available online: https://bnf.nice.org.uk/ (accessed on 13 March 2025).
Shikdar, S.; Vashisht, R.; Zubair, M.; Bhattacharya, P.T. International Normalized Ratio: Assessment, Monitoring, and Clinical Implications. In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2025. [Google Scholar]
Berger, J.S.; Laliberté, F.; Kharat, A.; Lejeune, D.; Moore, K.T.; Jung, Y.; Lefebvre, P.; Ashton, V. Comparative Effectiveness and Safety of Rivaroxaban and Warfarin Among Nonvalvular Atrial Fibrillation (NVAF) Patients with Obesity and Polypharmacy in the United States (US). Adv. Ther. 2021, 38, 3771–3788. [Google Scholar] [CrossRef] [PubMed]
NHS England. Operational Note: Commissioning Recommendations for National Procurement for Direct-Acting Oral Anticoagulant(s) (DOACs). Available online: https://www.england.nhs.uk/long-read/commissioning-recommendations-for-national-procurement-for-doacs/ (accessed on 16 January 2025).
Clopidogrel | Drugs | BNF Content Published by NICE. Available online: https://bnf.nice.org.uk/drugs/clopidogrel/ (accessed on 16 January 2025).
Chib, A.; van Velthoven, M.H.; Car, J. mHealth Adoption in Low-Resource Environments: A Review of the Use of Mobile Healthcare in Developing Countries. J. Health Commun. 2015, 20, 4–34. [Google Scholar] [CrossRef]
Zitnik, M.; Sosič, R.; Maheshwari, J.L. Stanford Biomedical Network Dataset Collection. Available online: http://snap.stanford.edu/biodata/ (accessed on 20 February 2025).
Mysinger, M.M.; Carchia, M.; Irwin, J.J.; Shoichet, B.K. Directory of Useful Decoys, Enhanced (DUD-E): Better Ligands and Decoys for Better Benchmarking. J. Med. Chem. 2012, 55, 6582–6594. [Google Scholar] [CrossRef]
ESC Guidelines for the Management of Atrial Fibrillation. Available online: https://www.escardio.org/Guidelines/Clinical-Practice-Guidelines/Atrial-Fibrillation (accessed on 15 January 2025).
Asgari, E.; Mofrad, M.R.K. Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics. PLoS ONE 2015, 10, e0141287. [Google Scholar] [CrossRef]
Singh, R.; Sledzieski, S.; Bryson, B.; Cowen, L.; Berger, B. Contrastive Learning in Protein Language Space Predicts Interactions between Drugs and Protein Targets. Proc. Natl. Acad. Sci. USA 2023, 120, e2220778120. [Google Scholar] [CrossRef]
Huang, H.; Wang, Y.; Rudin, C.; Browne, E.P. Towards a Comprehensive Evaluation of Dimension Reduction Methods for Transcriptomic Data Visualization. Commun. Biol. 2022, 5, 719. [Google Scholar] [CrossRef]
Lee, S.B.; Gui, X.; Manquen, M.; Hamilton, E.R. Use of Training, Validation, and Test Sets for Developing Automated Classifiers in Quantitative Ethnography. In Proceedings of the International Conference on Quantitative Ethnography, Madison, WI, USA, 20–22 October 2019; Eagan, B., Misfeldt, M., Siebert-Evenstone, A., Eds.; Springer International Publishing: Cham, Switzerland, 2019; pp. 117–127. [Google Scholar]
Wong, L.; Wang, L.; You, Z.-H.; Yuan, C.-A.; Huang, Y.-A.; Cao, M.-Y. GKLOMLI: A Link Prediction Model for Inferring miRNA–lncRNA Interactions by Using Gaussian Kernel-Based Method on Network Profile and Linear Optimization Algorithm. BMC Bioinform. 2023, 24, 188. [Google Scholar] [CrossRef]
Wang, L.; Wong, L.; You, Z.-H.; Huang, D.-S. AMDECDA: Attention Mechanism Combined With Data Ensemble Strategy for Predicting CircRNA-Disease Association. IEEE Trans. Big Data 2024, 10, 320–329. [Google Scholar] [CrossRef]
Öztürk, H.; Özgür, A.; Ozkirimli, E. DeepDTA: Deep Drug–Target Binding Affinity Prediction. Bioinformatics 2018, 34, i821–i829. [Google Scholar] [CrossRef]
Huang, K.; Xiao, C.; Glass, L.M.; Sun, J. MolTrans: Molecular Interaction Transformer for Drug–Target Interaction Prediction. Bioinformatics 2021, 37, 830–836. [Google Scholar] [CrossRef] [PubMed]
Rakha, A.; Umar, N.; Rabail, R.; Butt, M.S.; Kieliszek, M.; Hassoun, A.; Aadil, R.M. Anti-Inflammatory and Anti-Allergic Potential of Dietary Flavonoids: A Review. Biomed. Pharmacother. 2022, 156, 113945. [Google Scholar] [CrossRef] [PubMed]
Yan, W.; Jinhui, Z.; Yongchen, Z.; Hongxiang, L.I.U.; Yawei, L.I.U.; Huanhuan, M.; Xincai, Y. Sequoiaflavone Inhibits Stem Cell Properties Such as Proliferation and Invasion of Gastric Cancer Cells by Down-Regulating PI3K/AKT Signaling Pathway. Chin. J. Clin. Pharmacol. Ther. 2023, 28, 508. [Google Scholar]
Quintal Martínez, J.P.; Segura Campos, M.R. Flavonoids as a Therapeutical Option for the Treatment of Thrombotic Complications Associated with COVID-19. Phytother. Res. 2022, 37, 1092–1114. [Google Scholar] [CrossRef]
Gresele, P.; Momi, S.; Falcinelli, E. Anti-Platelet Therapy: Phosphodiesterase Inhibitors. Br. J. Clin. Pharmacol. 2011, 72, 634–646. [Google Scholar] [CrossRef]
Ke, J.; Li, M.-T.; Huo, Y.-J.; Cheng, Y.-Q.; Guo, S.-F.; Wu, Y.; Zhang, L.; Ma, J.; Liu, A.-J.; Han, Y. The Synergistic Effect of Ginkgo Biloba Extract 50 and Aspirin Against Platelet Aggregation. Drug Des. Devel Ther. 2021, 15, 3543–3560. [Google Scholar] [CrossRef]
Deferoxamine Pharmacology, Monitoring, and Patient Outcomes. Available online: https://www.researchgate.net/publication/386250861_Deferoxamine_Pharmacology_Monitoring_and_Patient_Outcomes (accessed on 16 May 2025).
Shen, J.; Fu, H.; Ding, Y.; Yuan, Z.; Xiang, Z.; Ding, M.; Huang, M.; Peng, Y.; Li, T.; Zha, K.; et al. The Role of Iron Overload and Ferroptosis in Arrhythmia Pathogenesis. IJC Heart Vasc. 2024, 52, 101414. [Google Scholar] [CrossRef]
Song, O.-R.; Kim, H.-B.; Jouny, S.; Ricard, I.; Vandeputte, A.; Deboosere, N.; Marion, E.; Queval, C.J.; Lesport, P.; Bourinet, E.; et al. A Bacterial Toxin with Analgesic Properties: Hyperpolarization of DRG Neurons by Mycolactone. Toxins 2017, 9, 227. [Google Scholar] [CrossRef]
Kuwabara, Y.; Salavatian, S.; Howard-Quijano, K.; Yamaguchi, T.; Lundquist, E.; Mahajan, A. Neuromodulation With Thoracic Dorsal Root Ganglion Stimulation Reduces Ventricular Arrhythmogenicity. Front. Physiol. 2021, 12, 713717. [Google Scholar] [CrossRef]
Grönberg, A.; Zettergren, L.; Bergh, K.; Ståhle, M.; Heilborn, J.; Ängeby, K.; Small, P.L.; Akuffo, H.; Britton, S. Antioxidants Protect Keratinocytes against M. Ulcerans Mycolactone Cytotoxicity. PLoS ONE 2010, 5, e13839. [Google Scholar] [CrossRef]
Lui, G.Y.L.; Obeidy, P.; Ford, S.J.; Tselepis, C.; Sharp, D.M.; Jansson, P.J.; Kalinowski, D.S.; Kovacevic, Z.; Lovejoy, D.B.; Richardson, D.R. The Iron Chelator, Deferasirox, as a Novel Strategy for Cancer Treatment: Oral Activity Against Human Lung Tumor Xenografts and Molecular Mechanism of Action. Mol. Pharmacol. 2013, 83, 179–190. [Google Scholar] [CrossRef] [PubMed]
Imanaka-Yoshida, K.; Tawara, I.; Yoshida, T. Tenascin-C in Cardiac Disease: A Sophisticated Controller of Inflammation, Repair, and Fibrosis. Am. J. Physiol. Cell Physiol. 2020, 319, C781–C796. [Google Scholar] [CrossRef] [PubMed]
Giblin, S.P.; Midwood, K.S. Tenascin-C: Form versus Function. Cell Adh Migr. 2015, 9, 48–82. [Google Scholar] [CrossRef]
Imanaka-Yoshida, K. Tenascin-C in Heart Diseases—The Role of Inflammation. Int. J. Mol. Sci. 2021, 22, 5828. [Google Scholar] [CrossRef]
Grosse, A.; Mettke, F.; Kirsch, K.; Gruen, K.; Franz, M.; Schulze, P.C.; Surber, R. Regional Levels of Tenascin- C in Patients with Heart Failure and Atrial Fibrillation. Eur. Heart J. 2023, 44, ehad655.903. [Google Scholar] [CrossRef]
Modest, V.E.; Butterworth, J.F. Effect of pH and Lidocaine on Beta-Adrenergic Receptor Binding. Interaction during Resuscitation? Chest 1995, 108, 1373–1379. [Google Scholar] [CrossRef]
Nia, A.M.; Caglayan, E.; Gassanov, N.; Zimmermann, T.; Aslan, O.; Hellmich, M.; Duru, F.; Erdmann, E.; Rosenkranz, S.; Er, F. Beta1-Adrenoceptor Polymorphism Predicts Flecainide Action in Patients with Atrial Fibrillation. PLoS ONE 2010, 5, e11421. [Google Scholar] [CrossRef]
Belfiori, M.; Lazzari, L.; Hezzell, M.; Angelini, G.D.; Dong, T. Transcriptomics, Proteomics and Bioinformatics in Atrial Fibrillation: A Descriptive Review. Bioengineering 2025, 12, 149. [Google Scholar] [CrossRef]
Beta-Adrenoceptor Blocking Drugs | Treatment Summaries | BNF Content Published by NICE. Available online: https://bnf.nice.org.uk/treatment-summaries/beta-adrenoceptor-blocking-drugs/ (accessed on 11 March 2025).
Rivaroxaban (Xarelto) in Combination with Aspirin for Prevention of Major Cardiovascular Events in Coronary or Peripheral Artery Disease; National Institute for Health Research: London, UK, 2017.
Chen, L.; Cruz, A.; Ramsey, S.; Dickson, C.J.; Duca, J.S.; Hornak, V.; Koes, D.R.; Kurtzman, T. Hidden Bias in the DUD-E Dataset Leads to Misleading Performance of Deep Learning in Structure-Based Virtual Screening. PLoS ONE 2019, 14, e0220113. [Google Scholar] [CrossRef]
Honorato, R.V.; Trellet, M.E.; Jiménez-García, B.; Schaarschmidt, J.J.; Giulini, M.; Reys, V.; Koukos, P.I.; Rodrigues, J.P.G.L.M.; Karaca, E.; van Zundert, G.C.P.; et al. The HADDOCK2.4 Web Server for Integrative Modeling of Biomolecular Complexes. Nat. Protoc. 2024, 19, 3219–3241. [Google Scholar] [CrossRef]
Kozakov, D.; Hall, D.R.; Xia, B.; Porter, K.A.; Padhorny, D.; Yueh, C.; Beglov, D.; Vajda, S. The ClusPro Web Server for Protein-Protein Docking. Nat. Protoc. 2017, 12, 255–278. [Google Scholar] [CrossRef] [PubMed]
Huang, S.-Y.; Zou, X. An Iterative Knowledge-Based Scoring Function for Protein-Protein Recognition. Proteins 2008, 72, 557–579. [Google Scholar] [CrossRef] [PubMed]
Harrington, J.; Granger, C.B. Bleeding and Risk for Future Cardiovascular Events in Patients with Atrial Fibrillation on Oral Anticoagulation: Major Bleeding Is a Major Problem. Eur. Heart J. 2022, 43, 4909–4911. [Google Scholar] [CrossRef] [PubMed]
Reduced Bleeding with Abelacimab Could Transform Atrial Fibrillation Treatment. Available online: https://www.pharmacytimes.com/view/reduced-bleeding-with-abelacimab-could-transform-atrial-fibrillation-treatment (accessed on 15 January 2025).

Figure 1. (A) A schematic overview of the modelling approach. (B) Using PaCMAP, the learned latent space for ADRB1 (green), commonly used clinical antiarrhythmics (blue), and natural compounds (grey) are shown.

Figure 2. (A): Using a violin plot, the distribution of interaction scores for ADRB1 against the clinical antiarrhythmics (blue) and natural compounds (grey) are shown; p-value shows results from one-sided t test; (B): predicted 3D complex of ADRB1 (orange boxed region) bound to tenascin-C.

Figure 3. Using PaCMAP, the learned latent space for Factor Xa (green), commonly used clinical anticoagulants (blue), and natural compounds (grey) are shown.

Figure 4. Docking of sequoiaflavone within the archway between the intersection between the heavy and light chains of Factor X.

Figure 5. Chemical structure of (A): sequoiaflavone (PubChem) and (B): deferoxamine (DrugBank).

Table 1. Summary of the BIOSNAP and DUD-E dataset samples included. Training, validation, and test dataset values represent the total number of combinations (number of interaction pairs/number of non-interaction pairs); the DUD-E dataset is only used in the training process; hence, validation and test set boxes are left blank.

Dataset	Drugs	Targets	Training	Validation	Test
BIOSNAP	4510	2181	19,238 (9670/9568)	2748 (1396/1352)	5497 (2770/2727)
DUD-E	852,292	57	415,204 (8996/406,208)	—	—

Table 2. Summary of the NPASS 2018 dataset.

	N
Natural Products	30,926
Organisms	25,041

Table 3. Evaluation of the performance of base models without contrastive learning; PR AUC: Precision–Recall Area Under the Curve.

	PR AUC
	Validation Set	Test Set
ConPLex	0.7154	0.7206
New model	0.8140	0.8369

Table 4. Evaluation of performance of the best-performing model with contrastive learning; PR AUC: Precision–Recall Area Under the Curve.

	PR AUC
	Validation Set	Test Set
ConPLex	0.6999	0.6943
New model	0.8118	0.8134

Table 5. The interactions between antiarrhythmic drugs with that of the target protein ADRB1.

Euclidean Distance	Compound Name
0.00	ADRB1
5.92	Bisoprolol class 2
6.18	Lidocaine class 1
7.08	Flecainide class 1
8.86	Quinidine class 1
9.64	Disopyramide class 1
10.06	Diltiazem class 4
10.11	Sotalol class 3
10.48	Procainamide class 1
11.27	Mexiletine class 1
12.30	Digoxin-class cardiac glycoside
12.47	Amiodarone class 3
14.03	Verapamil class 4

Table 6. The interaction between anticoagulants and Factor X.

Euclidean_Distance	Compound_Name
0.00	factorX
5.98	Apixaban
7.82	Edoxaban
7.92	Dabigatran
9.80	Enoxaparin
14.55	Warfarin
15.01	Rivaroxaban

Table 7. The interaction between natural compounds and Factor Xa.

Euclidean Distance	Compound Name
0.00	factorX
0.08	NPC55443
0.10	NPC207866
0.15	NPC194593
0.18	NPC306696
0.18	NPC474341
0.20	NPC62927
0.21	NPC265856
0.25	NPC275027
0.26	NPC195466

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Dong, T.; Llewellyn, R.D.; Hezzell, M.; Angelini, G.D. A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation. Biomedicines 2025, 13, 1323. https://doi.org/10.3390/biomedicines13061323

AMA Style

Dong T, Llewellyn RD, Hezzell M, Angelini GD. A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation. Biomedicines. 2025; 13(6):1323. https://doi.org/10.3390/biomedicines13061323

Chicago/Turabian Style

Dong, Tim, Rhys D. Llewellyn, Melanie Hezzell, and Gianni D. Angelini. 2025. "A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation" Biomedicines 13, no. 6: 1323. https://doi.org/10.3390/biomedicines13061323

APA Style

Dong, T., Llewellyn, R. D., Hezzell, M., & Angelini, G. D. (2025). A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation. Biomedicines, 13(6), 1323. https://doi.org/10.3390/biomedicines13061323

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Deep Learning Methodology for Screening New Natural Therapeutic Candidates for Pharmacological Cardioversion and Anticoagulation in the Treatment and Management of Atrial Fibrillation

Abstract

1. Introduction

1.1. Current Clinical Drug Practice for AF

1.2. Anticoagulants for Managing Risk of Clotting in AF Patients

1.3. Aim

2. Methods

2.1. Dataset and Materials

2.2. Target Featurisation

2.3. Modelling Approach

2.4. Representation Approach

2.5. Model Evaluation

3. Results

4. Discussion

5. Future Work and Limitations

6. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI