Next Article in Journal
Insights on Antitumor Activity and Mechanism of Natural Benzophenanthridine Alkaloids
Previous Article in Journal
Design, Synthesis, and Biological Evaluation of Novel 3-Cyanopyridone/Pyrazoline Hybrids as Potential Apoptotic Antiproliferative Agents Targeting EGFR/BRAFV600E Inhibitory Pathways
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using the Correlation Intensity Index to Build a Model of Cardiotoxicity of Piperidine Derivatives

by
Alla P. Toropova
*,
Andrey A. Toropov
,
Alessandra Roncaglioni
and
Emilio Benfenati
Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Department of Environmental, Health Science, Via Mario Negri 2, 20156 Milano, Italy
*
Author to whom correspondence should be addressed.
Molecules 2023, 28(18), 6587; https://doi.org/10.3390/molecules28186587
Submission received: 1 August 2023 / Revised: 6 September 2023 / Accepted: 7 September 2023 / Published: 12 September 2023
(This article belongs to the Section Computational and Theoretical Chemistry)

Abstract

:
The assessment of cardiotoxicity is a persistent problem in medicinal chemistry. Quantitative structure–activity relationships (QSAR) are one possible way to build up models for cardiotoxicity. Here, we describe the results obtained with the Monte Carlo technique to develop hybrid optimal descriptors correlated with cardiotoxicity. The predictive potential of the cardiotoxicity models (pIC50, Ki in nM) of piperidine derivatives obtained using this approach provided quite good determination coefficients for the external validation set, in the range of 0.90–0.94. The results were best when applying the so-called correlation intensity index, which improves the predictive potential of a model.

1. Introduction

The risk of developing cardiotoxicity against the background of treating carcinogenic pathologies is one of the most urgent problems of modern oncology and cardiology. Piperidine derivatives are of exceptional interest due to their potential biological activity, such as antiviral, antibacterial, antitumor, and many others [1].
Current anticancer therapy includes many drugs with various mechanisms and a spectrum of actions. One of the most important groups in this number of drugs are antibiotics with antitumor activity, which play a massive role in the chemotherapy of various oncological diseases [2]. Their effectiveness has been clinically proven. However, despite the favorable course of the disease, when using this group of drugs, patients experience a number of undesirable side effects from various organs and systems that can develop not only during therapy but also after its completion. One of the main side effects is cardiotoxicity. This term includes various adverse events from the cardiovascular system against the background of drug therapy for oncological diseases. Such manifestations of cardiotoxicity as pain in the heart, blood pressure, heart rhythm disturbances, myocarditis, pericarditis, and heart attacks reduce a patient’s quality of life. Still, sometimes they become serious reasons for discontinuing or not prescribing the drug. For some medicines in this group, for example, alkylating agents, cardiotoxicity is a limiting factor.
Antibiotics with antitumor activity currently occupy a leading place in the treatment of oncological diseases [3]; as a result, the correction of their side effects, particularly cardiotoxicity, remains one of the most urgent problems for oncologists, cardiologists, and general practitioners. The frequency of development of various dysfunctions of the heart reaches greater values. At the same time, both reversible and irreversible consequences are quite dangerous. Prevention and treatment of cardiotoxicity remain mandatory, but complicated clinical tasks for a doctor due to the irreversibility and progressive nature of the disease changes most in the functioning of the cardiovascular system. Cardiotoxicity complications significantly impair patients’ quality of life and reduce the duration of life, and mortality from cardiovascular diseases still globally ranks first [4].
In addition, psychopharmacology and psychopharmacotherapy of depressive states are dynamically developing areas, and antidepressants are the second most prescribed drugs among all psychotropic drugs [5]. Such a high rating of these psychotropic drugs is because about 5% of the world’s population suffers from depression (according to WHO). However, high doses and long-term use of medications in this group lead to cardiotoxic effects. The cardiotoxicity of tricyclic antidepressants is manifested with conduction disturbances in the atrioventricular node and ventricles of the heart (quinine-like action), arrhythmias, and a decrease in myocardial contractility. Doxepin and amoxapine have the least cardiotoxicity. Treatment of patients with cardiovascular pathology with tricyclic antidepressants should be carefully monitored, and high doses should not be used.
Cardiovascular diseases such as coronary artery disease, valvular heart disease, arrhythmias, and heart failure are serious health risks and often require lifelong treatment. Much attention is paid to the psychological consequences of life for people with cardiac diseases [6]. Anxiety symptoms are common among patients with cardiovascular diseases and may worsen the prognosis for these patients. Symptoms of anxiety and depression can prevent lifestyle changes and adherence to therapy, as well as reduce the effectiveness of cardiac rehabilitation.
There is increasing evidence of the widespread use of psychotropic drugs in cardiac patients for comorbid psychiatric disorders [7]. The side effects of psychotropic medications on the part of the cardiovascular system include disturbances in the rhythm and conduction of the heart. For example, recent studies have shown that antidepressants were associated with increased mortality and an altered beta-blocker effect in patients with heart failure. In addition, the use of antipsychotic drugs in patients after acute myocardial infarction is necessary [6,7].
An increase in morbidity and mortality under the influence of depression in patients with cardiovascular diseases dictates the imperative need for preliminary analysis of both drugs’ therapeutic efficacy and cardiotoxic potential [8]. In other words, when developing a treatment plan for a depressed patient with heart disease, one should carefully weigh any intervention’s risk/benefit ratio. However, the choice of antidepressants is complicated because many can cause cardiovascular side effects, such as orthostatic hypotension, hypertension, and impaired cardiac conduction. In addition, clinically significant drug interactions should be considered when choosing treatment. Unfortunately, the number of clinical studies explicitly investigating the safety of antidepressants in patients with cardiovascular disease is limited, and the studies that are conducted have generally included a small number of patients.
The human ether-a-go-go-related gene (hERG) potassium channel plays a pivotal role in cardiac rhythm regulation, and the cardiotoxicity data associated with hERG inhibition using drugs and environmental chemicals provides important information for medicinal chemistry [9,10]. As stated above, cardiac problems are among the most complex in medicine [11], and there is a clear trend toward them becoming more important [12]. Identifying potential human ether-a-go-go-related gene (hERG) potassium channel blockers is an important part of drug discovery and checking up on drug safety processes in pharmaceutical industries and academic drug discovery centers [10]. The most popular idea at present is considered to begin the corresponding searches with the choice of idealization (a certain success), that is a molecule that absorbs the preferred qualities in the complete form. In order to prioritize molecules during the early drug discovery phase and to reduce the risk of the necessity of an additional preliminary checking-up of pharmaceutical agents, computational approaches have been developed to predict the potential of hERG blockage of new drug candidates. In other words, estimating the cardiac toxicity of organic hERG blockers is an important theoretical and practical task of medicinal chemistry. Potential hERG inhibitors must be identified for drug discovery and safety [13]; however, the experimental analysis of all potential hERG inhibitors is impossible because there are so many of them.
Computational models for the cardiac toxicity of organic hERG blockers are an attractive alternative to real experiments [14]. Quantitative structure–activity relationships (QSAR) are common computational approaches [15]. Such models can be obtained using machine learning based on graph theory, support vector machine, random forest, artificial neural networks, and other approaches [16,17].
CORAL software (http://www.insilico.eu/coral, accessed on 1 September 2023) is a tool for building up QSAR models for various endpoints with the Monte Carlo method [18,19,20,21,22,23,24,25,26,27]. The CORAL software was recently updated with what is called the index of ideality of correlation (IIC) [28] and the correlation intensity index (CII) [29]. IIC and CII are indicators of the predictive potential of QSAR.
IIC differs from other criteria of the statistical quality of linear regression models with a unique ability since it is a measure that is sensitive both to the value of the correlation coefficient and to the value of the mean absolute error (MAE).
In principle, CII has some analogy with the known cross-validation measures, but this analogy is partial. While the traditional cross-validated test is based on averaging the difference between the correlation coefficient before and after the “removing” of molecules from the set (training, calibration, or testing), the CII considers the average value of the difference observed removing only molecules which reduce the correlation coefficient in the set.
Here, the ability to improve the predictive potential of cardiotoxicity models using the IIC and CII is studied.

2. Results

2.1. QSAR Models Based on TF1

The Monte Carlo optimization with the target function TF1 for three random splits (#1, #2, and #3) provides the following models:
pIC50 = 3.456(±0.058) + 0.0632(±0.0019) × DCW(1, 15)
pIC50 = 4.082(±0.019) + 0.1498(±0.0025) × DCW(1, 15)
pIC50 = 2.618(±0.140) + 0.1654(±0.0090) × DCW(1, 35)
Table 1 provides the statistical quality of these models.

2.2. QSAR Models Based on TF2

The Monte Carlo optimization with the target function TF2 for three random splits (#1, #2, and #3) provides the following models:
pIC50 = 3.202(±0.068) + 0.0867 (±0.0027) × DCW(1, 15)
pIC50 = 3.036 (±0.084) + 0.0730 (±0.0028) × DCW(1, 15)
pIC50 = 3.490(±0.077) + 0.0803 (±0.0035) × DCW(1, 15)
Table 2 provides the statistical quality of these models. The average value of the coefficient of determination for these models is about 0.6 (for the set as a whole). However, there is a paradox described earlier [28]. The influence of the IICc leads to an improvement in the coefficient of determination for the calibration and the validation sets but not to the detriment of training sets, where the coefficient of determination is lower.
Figure 1 compares models calculated with the target functions TF1 and TF2. Models calculated using TF2 are preferred since Figure 2 confirms that the average determination coefficient values of TF2-models are larger than those of TF1-models for all three random splits.
Williams plots for all considered models indicated that there are practically no outliers for both TF1-models and TF2-models (Figure 3).

3. Discussion

The models’ advantage is their user-friendliness since their implementation requires only SMILES and numerical data for an endpoint without any other descriptors. There are special rules to define the mechanistic interpretation as well as the applicability domain. The described approach provides models following OECD principles [30,31]. The main essence of the above document concentrated on the well-known five OECD principles [26,30,31,32] is descripted below:
  • A defined endpoint;
  • An unambiguous algorithm;
  • A defined applicability domain;
  • Appropriate measures of goodness-of-fit, robustness, and predictivity;
  • A mechanistic interpretation, if possible.
Table 3 lists the molecular features of statistically stable promoters of an increase or decrease of the pIC50. These data are selected according to the following:
(i) Molecular features extracted from SMILES or HSG with significant prevalence in the training and calibration sets;
(ii) Molecular features which have positive correlation weights (CW) for all three runs of the Monte Carlo optimization;
(iii) Molecular features with negative CW for all three runs of the Monte Carlo optimization.
There are stable promoters of the pIC50 increase related to all distributions. For instance, the promoters of an increase of pIC50 are the presence of nitrogen connected with carbon when Morgan extended connectivity of carbon atoms is equal to 5, 6, and 7 or Morgan extended connectivity of nitrogen atoms equals 4. In contrast, promoters of decrease of pIC50 are vertex degrees of carbon atoms equal to 2 or 3 and degrees of nitrogen atoms equal to 2. Some other features become promoters of an increase or decrease of cardiotoxicity (Table 3).
Figure 4 contains examples of influence promoters of increase (C…=…….) and decrease (C………..) to the calculated cardiotoxicity values.
The comparison of the statistical quality of the models using the target functions TF1 and TF2 presented in Table 1 and Table 2 indicates that TF2 provides better results.
Table 4 contains the comparison of models for cardiotoxicity suggested in the literature.
The best model is observed for TF1 (split-1); however, the results for the other two splits in the case of the TF1-model are worse, and the variance of the coefficient of determination for the validation set is significant. In contrast, the average value of the coefficient of determination for the validation set in the case of the TF2-model is more significant, and the variance is less than those in the case of the TF1-model. Thus, despite the excellent result for split-1 with the TF1-model, on the whole, TF2-model is the preferable model.
The above-mentioned information allows us to state that the proposed models correspond to the five generally mentioned recognized principles of constructing a QSPR/QSAR model. However, it seems appropriate to dwell on a number of features of the considered method.
A very useful feature of the approach under consideration is its significant heuristic potential due to the possibility of approximately formulating statistical hypotheses as follows:
-
Whether (and if so, how much) the considered endpoint depends on the representation of molecules using SMILES;
-
Whether (and if so, to what extent) the considered endpoint depends on the representation of molecules using graphs;
-
Whether the representation of the molecular features extracted from SMILES and the graph provide a synergetic effect (i.e., improving the predictive potential of a model in the comparison of the separate cases considering the SMILES-based model and graph-based model);
-
Whether IIC improves the predictive potential of models based on SMILES-based representation of molecules;
-
Whether IIC improves the predictive potential of models based on a graph-based representation of molecules;
-
Whether CII improves the predictive potential of models based on SMILES-based representation of molecules;
-
Whether CII improves the predictive potential of models based on a graph-based representation of molecules;
-
Whether the combined use of IIC and CII has a synergistic effect, that is, whether observed improvement of the predictive potential of models occurs if applying IIC and CII together compared to the cases of using IIC and CII separately.
In principle, the list of similar hypotheses that can be formulated and, accordingly, tested within the framework of the approach under consideration, can also be expanded. However, it seems more appropriate to consider the mentioned possibilities, providing them with brief explanations.
In fact, only a part of the hypotheses listed above is considered here. The results can be formulated as follows:
1. The combined use of correlation weighting of SMILES attributes and graph invariants improves the predictive potential of the hERG inhibition model expressed as pIC50;
2. For the considered compounds, the use of CII provides a better predictive potential than that of models built using IIC;
3. The observed statistical results for the three random splits of the available connections in the training and control sets are in good agreement with each other.
Are there valuable models? If there are “valuable” models, then there must be models that are not “valuable”. How to distinguish valuable models from not very valuable ones? It has been stated that “All models are wrong, but some are useful” [35]. Thus, how to distinguish useful models from a set of wrong ones? The reproducibility of results and their clarity (graphical representation [36]) are most likely the main features of the utility model. In this paper, for this purpose, attempts were made to build several models using different splits. The development of criteria for the predictive potential of models is also part of the research designed to identify useful models. In this paper, for this purpose, attempts were made to compare two new criteria for the predictive potential of the model, the IIC (TF1) and the CII (TF2).
One can extract two basic components in the total large variety of QSAR studies: (i) “applicative” studies and (ii) “theoretical” studies. “Applicative” studies aim to integrate the results of applying current approaches to solve practical tasks. “Theoretical” studies aim to attempt to develop new conceptions of the QSPR/QSAR analysis. This study contains both applicative and theoreatical parts. On the one hand, here, the Monte Carlo optimization technique described in the literature is aimed to build up (almost) standard models (applicative part). On the other hand, new criteria of the predictive potential are studied (theoretical part).
Thus, the epistemological aspect of the provided QSAR research, here, is presented in the form of confirmation of two statements. First, all QSAR models are random events if they are built using random distributions in training and validation sets. Second, the usefulness of random QSAR models can be stated if the variance in the values of statistical characteristics is acceptably small.
The Supplementary Materials section contains the technical details related to the described approach.

4. Methods

4.1. Data

The numerical data on 113 piperidine derivatives (pyridine-substituted piperidines, tertiary alcohol-bearing piperidines, spirocyclic piperidines, and isoxazole-containing piperidines) were taken from the literature [9]. The activity is expressed as −logIC50 or pIC50 [9]. The set of compounds is split into (i) active training (≈25%), (ii) passive training (≈25%), (iii) calibration (≈25%), and (iv) validation sets (≈25%). Each set has a defined task. The active training set is used to build the model; molecular features extracted from the simplified molecular-input line-entry system (SMILES—which represents the structure) [28,29,37], of the active training set are involved in the Monte Carlo optimization to provide correlation weights for the above features, which provide the largest target function value on the active training set. The passive training checks whether the model for the active training set is satisfactory for SMILES that were not involved in the active training set. The calibration set should detect when overtraining (overfitting) starts. The validation set provides the possibility to assess the predictive potential of a model since the data from the validation set is unknown while building up a model. Our experience with CORAL shows that equal distribution over the four sets mentioned is likely the most rational strategy.
At the beginning of the optimization, the correlation coefficients between the experimental values of the endpoint and the descriptor simultaneously increase for all sets, but the correlation coefficient for the calibration set reaches a maximum; this is the start of overtraining, and further optimization leads to a decrease of the correlation coefficient for the calibration set. Optimization should be stopped when overtraining starts. After stopping the Monte Carlo optimization procedure, the validation set is needed to assess the model’s predictive potential.

4.2. Optimal Descriptor

The optimal descriptor, calculated with the representation of the molecular structure using the SMILES, 37serves as the basis of a model for cardiotoxicity. The optimal descriptor for the predictive model of the endpoint is calculated with Equation (7):
p I C 50 = C 0 + C 1 × D C W T , N
D C W T , N = k = 1 N A C W ( S k ) + k = 1 N A 1 C W ( S S k )  
where T is an integer that separates molecular features extracted from SMILES into rare and non-rare ones. The non-rare features serve to build up the model. The rare features are not used to build up the model. N is the number of epochs in the optimization of the correlation weights. Sk is a SMILES atom, i.e., one SMILES line symbol (e.g., ‘=’, ‘O’) or a group of symbols that cannot be examined separately (e.g., ‘Cu’, ‘%11’). SSk is a couple of SMILES atoms. CW(Sk) and CW(SSk) are the correlation weights of the SMILES attributes (SAk). NA is the number of non-rare SMILES attributes.
The Supplementary Materials (Table S1) contains an example of the DCW(1, 15) calculation.

4.3. Monte Carlo Optimization

Equation (2) needs the numerical data on the above correlation weights. Monte Carlo optimization is employed to calculate the correlation weights. Here, two target functions for the Monte Carlo optimization are examined:
T F 0 = r A T + r P T r A T r P T × 0.1
T F 1 = T F 0 + I I C C × 0.5
T F 2 = T F 1 + C I I C × 0.5
Equation (3) is defined empirically during the development of many different models. Variables r A T and r P T are correlation coefficients between the observed and predicted values of the endpoint for the active training set and passive training set, respectively. IICC is the index of ideality of correlation [28]. IICC is calculated with data on the calibration set as follows:
I I C C = r C m i n ( MAE C , MAE C + )   m a x ( MAE C , MAE C + )  
min x , y = x ,   i f   x < y y , o t h e r w i s e
max x , y = x ,   i f   x > y y , o t h e r w i s e
MAE C = 1 N Δ k ,   N   i s   t h e   n u m b e r   o f   Δ k < 0
MAE C + = 1 N + Δ k ,   N +   i s   t h e   n u m b e r   o f   Δ k 0
Δ k = o b s e r v e d k c a l c u l a t e d k
The corresponding values of the endpoint are observed and calculated.
The correlation intensity index (CII), similar to the IIC, was developed to improve the quality of the Monte Carlo optimization used to build up QSPR/QSAR models. CII is calculated as follows:
C I I C = 1 P r o t e s t k
P r o t e s t k = R k 2 R 2 , i f   R k 2 R 2 > 0 0 , o t h e r w i s e  
where R2 is the correlation coefficient for a set that contains n substances. R2k is the correlation coefficient for n − 1 substances of a set after removing the k-th substance. If (R2kR2) is larger than zero, the k-th substance is an “opponent” for the correlation between the experimental and predicted values of the set. A small sum of “protests” means a more “intense” correlation.

4.4. Applicability Domain

The described models’ applicability domain defines the “statistical defects” of molecular features extracted from SMILES or HSG. These are calculated as follows:
d k = P ( A k ) P ( A k ) N A k + N A k + P ( A k ) P ( A k ) N A k + N A k + P ( A k ) P ( A k ) N A k + N A k
where P(Ak), P′(Ak), and P″(Ak) are the probability of Ak in the active training, passive training, and calibration sets, respectively; N(Ak), N′(Ak), and N″(Ak) are frequencies of Ak in the active training, passive training, and calibration sets, respectively. The statistical SMILES defects (Dj) are calculated as follows:
D j = k = 1 N A d k
where NA is the number of non-blocked SMILES attributes in the SMILES.
A SMILES falls in the applicability domain if
D j < 2 D ¯
where D ¯ is the average statistical defect on all compounds.

5. Conclusions

The suggested approach provides reliable cardiotoxicity models since their predictive potential is confirmed for three random splits into training and validation sets. The Monte Carlo optimization with the target function TF2 calculated with the correlation intensity index (Equation (11)) is more accurate and more reliable than optimization with the index of ideality of correlation, i.e., the target function TF1 (Equation (10)). Figure 4 illustrates the simplicity of applying the model for comparison of the potential biological activity of different molecules. In fact, such analysis can be a tool for the preliminary assessment of biological activity only on the basis of a set of Monte Carlo computation experiments with different distributions of available data in the training and validation (test) sets.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules28186587/s1, Supplementary Materials section contains an example of the DCW(1, 15) calculation with Equation (1) for the TF1-model (split-1, Table S1) and details on the TF2-model calculated with Equation (4) (split-1, Table S2).

Author Contributions

Conceptualization, A.P.T., A.A.T., A.R. and E.B.; methodology, A.P.T. and A.A.T.; software, A.A.T.; validation, A.P.T. and A.A.T.; formal analysis, A.P.T.; data curation, A.P.T. and A.A.T.; writing—original draft preparation, A.P.T. and A.A.T.; writing—review and editing, A.P.T. and A.A.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the European Union’s Horizon 2020 research and innovation program (Project ALTERNATIVE, grant No. 101037090). The content of this manuscript reflects only the author’s view, and the Commission is not responsible for any use that may be made of the information it contains.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data available within the article or its Supplementary Materials.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Niedziałkowski, P.; Czaczyk, E.; Jarosz, J.; Wcisło, A.; Białobrzeska, W.; Wietrzyk, J.; Ossowski, T. Synthesis and electrochemical, spectral, and biological evaluation of novel 9,10-anthraquinone derivatives containing piperidine unit as potent antiproliferative agents. J. Mol. Struct. 2019, 1175, 488–495. [Google Scholar] [CrossRef]
  2. Godugu, K.; El-Far, A.H.; Al Jaouni, S.; Mousa, S.A. Nanoformulated ajwa (phoenix dactylifera) bioactive compounds improve the safety of Doxorubicin without compromising its anticancer efficacy in breast cancer. Molecules 2020, 25, 2597. [Google Scholar] [CrossRef]
  3. Dragojevic, S.; Ryu, J.S.; Hall, M.E.; Raucher, D. Targeted Drug Delivery Biopolymers Effectively Inhibit Breast Tumor Growth and Prevent Doxorubicin-Induced Cardiotoxicity. Molecules 2022, 27, 3371. [Google Scholar] [CrossRef] [PubMed]
  4. LeBaron, T.W.; Kura, B.; Kalocayova, B.; Tribulova, N.; Slezak, J. A new approach for the prevention and treatment of cardiovascular disorders. Molecular hydrogen significantly reduces the effects of oxidative stress. Molecules 2019, 24, 2076. [Google Scholar] [CrossRef] [PubMed]
  5. Berra, C.; Torta, R. Therapeutic rationale of antidepressant use in the elderly. Arch. Gerontol. Geriatr. 2007, 44, 83–90. [Google Scholar] [CrossRef]
  6. Peltzer, S.; Köstler, U.; Müller, H.; Müller, H.; Scholten, N.; Schulz-Nieswandt, F.; Jessen, F.; Albus, C. The CoRe-Net Study Group. The psychological consequences of living with coronary heart disease: Are patients’ psychological needs served? A mixed-method study in Germany. Health Expect. 2022, 25, 2746–2761. [Google Scholar] [CrossRef]
  7. Mackin, P. Cardiac side effects of psychiatric drugs. Hum. Psychopharmacol. 2008, 23 (Suppl. S1), 3–14. [Google Scholar] [CrossRef]
  8. Jordaan, P.; Dumotier, B.; Traebert, M.; Miller, P.E.; Ghetti, A.; Urban, L.; Abi-Gerges, N. Cardiotoxic potential of hydroxychloroquine, chloroquine and azithromycin in adult human primary cardiomyocytes. Toxicol. Sci. 2021, 180, 356–368. [Google Scholar] [CrossRef]
  9. Hari Narayana Moorthy, N.S.; Karthikeyan, C.; Manivannan, E. Multi-algorithm based machine learning and structural pattern studies for hERG ion channel blockers mediated cardiotoxicity prediction. Chemometr. Intell. Lab. Syst. 2021, 208, 104213. [Google Scholar] [CrossRef]
  10. Krishna, S.; Borrel, A.; Kleinstreuer, N.; Huang, R.; Zhao, J.; Xia, M. High-Throughput Chemical Screening and Structure-Based Models to Predict hERG Inhibition. Biology 2022, 11, 209. [Google Scholar] [CrossRef]
  11. Spadarella, G.; Perillo, T.; Ugga, L.; Cuocolo, R. Radiomics in Cardiovascular Disease Imaging: From Pixels to the Heart of the Problem. Curr. Cardiovasc. Imaging Rep. 2022, 15, 11–21. [Google Scholar] [CrossRef]
  12. Dearani, J.A.; Connolly, H.M.; Martinez, R.; Fontanet, H.; Webb, G.D. Caring for adults with congenital cardiac disease: Successes and challenges for 2007 and beyond. Cardiol. Young. 2007, 17 (Suppl. S2), 87–96. [Google Scholar] [CrossRef] [PubMed]
  13. Gobbi, M.; Beeg, M.; Toropova, M.A.; Toropov, A.A.; Salmona, M. Monte Carlo method for predicting of cardiac toxicity: hERG blocker compounds. Toxicol. Lett. 2016, 250–251, 42–46. [Google Scholar] [CrossRef] [PubMed]
  14. Obiol-Pardo, C.; Gomis-Tena, J.; Sanz, F.; Saiz, J.; Pastor, M. A multiscale simulation system for the prediction of drug-induced cardiotoxicity. J. Chem. Inf. Model. 2011, 51, 483–492. [Google Scholar] [CrossRef]
  15. Toropov, A.A.; Toropova, A.P. QSPR/QSAR: State-of-art, weirdness, the future. Molecules 2020, 25, 1292. [Google Scholar] [CrossRef]
  16. Balaban, A.T. Interdisciplinary aspects of organic chemistry-cum-graph theory. Curr. Org. Chem. 2015, 19(4), 387–395. [Google Scholar] [CrossRef]
  17. Wang, H.; Zhu, G.; Izu, L.T.; Chen-Izu, Y.; Ono, N.; Altaf-Ul-Amin, M.D.; Kanaya, S.; Huang, M. On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme. Front. Physiol. 2023, 14, 1156286. [Google Scholar] [CrossRef]
  18. Toropov, A.A.; Toropova, A.P.; Lombardo, A.; Roncaglioni, A.; Benfenati, E.; Gini, G. CORAL: Building up the model for bioconcentration factor and defining it’s applicability domain. Eur. J. Med. Chem. 2011, 46, 1400–1403. [Google Scholar] [CrossRef]
  19. Toropov, A.A.; Toropova, A.P.; Lombardo, A.; Roncaglioni, A.; De Brita, N.; Stella, G.; Benfenati, E. CORAL: The prediction of biodegradation of organic compounds with optimal SMILES-based descriptors. Cent. Eur. J. Chem. 2012, 10, 1042–1048. [Google Scholar] [CrossRef]
  20. Toropov, A.A.; Toropova, A.P.; Roncaglioni, A.; Benfenati, E. Prediction of biochemical endpoints by the coral software: Prejudices, paradoxes, and Results. Methods Mol. Biol. 2018, 1800, 573–583. [Google Scholar] [CrossRef]
  21. Toropov, A.A.; Toropova, A.P.; Roncaglioni, A.; Benfenati, E. The system of self-consistent semi-correlations as one of the tools of cheminformatics for designing antiviral drugs. New J. Chem. 2021, 45, 20713–20720. [Google Scholar] [CrossRef]
  22. Toropova, A.P.; Toropov, A.A.; Roncaglioni, A.; Benfenati, E. The index of ideality of correlation improves the predictive potential of models of the antioxidant activity of tripeptides from frog skin (Litoria rubella). Comput. Biol. Med. 2021, 133, 104370. [Google Scholar] [CrossRef] [PubMed]
  23. Li, Q.; Ding, X.; Si, H.; Gao, H. QSAR model based on SMILES of inhibitory rate of 2, 3-diarylpropenoic acids on AKR1C3. Chemometr. Intell. Lab. Syst. 2014, 139, 132–138. [Google Scholar] [CrossRef]
  24. Veselinović, J.B.; Nikolić, G.M.; Trutić, N.V.; Živković, J.V.; Veselinović, A.M. Monte Carlo QSAR models for predicting organophosphate inhibition of acetylcholinesterase. SAR QSAR Environ. Res. 2015, 26, 449–460. [Google Scholar] [CrossRef] [PubMed]
  25. Islam, M.A.; Pillay, T.S. Simplified molecular input line entry system-based descriptors in QSAR modeling for HIV-protease inhibitors. Chemometr. Intell. Lab. Syst. 2016, 153, 67–74. [Google Scholar] [CrossRef]
  26. Kumar, A.; Chauhan, S. Use of the Monte Carlo Method for OECD Principles-Guided QSAR Modeling of SIRT1 Inhibitors. Arch. Pharm. 2017, 350, e1600268. [Google Scholar] [CrossRef]
  27. Simon, L.; Imane, A.; Srinivasan, K.K.; Pathak, L.; Daoud, I. In Silico Drug-Designing Studies on Flavanoids as Anticolon Cancer Agents: Pharmacophore Mapping, Molecular Docking, and Monte Carlo Method-Based QSAR Modeling. Interdiscip. Sci. Comput. Life Sci. 2017, 9, 445–458. [Google Scholar] [CrossRef]
  28. Toropova, A.P.; Toropov, A.A. The index of ideality of correlation: A criterion of predictability of QSAR models for skin permeability? Sci. Total Environ. 2017, 586, 466–472. [Google Scholar] [CrossRef]
  29. Toropov, A.A.; Toropova, A.P. Correlation intensity index: Building up models for mutagenicity of silver nanoparticles. Sci. Total Environ. 2020, 737, 139720. [Google Scholar] [CrossRef]
  30. Lanevskij, K.; Didziapetris, R.; Sazonovas, A. Physicochemical QSAR analysis of hERG inhibition revisited: Towards a quantitative potency prediction. J. Comput. Aided Mol. Des. 2022, 36, 837–849. [Google Scholar] [CrossRef]
  31. OECD, (Organization for Economic Co-operation and Development). Guidance Document on The Validation of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models; OECD: Paris, France, 2007; No. 69. [Google Scholar]
  32. Toropova, A.P.; Toropov, A.A. Hybrid optimal descriptors as a tool to predict skin sensitization in accordance to OECD principles. Toxicol. Lett. 2017, 275, 57–66. [Google Scholar] [CrossRef]
  33. Chemi, G.; Gemma, S.; Campiani, G.; Brogi, S.; Butini, S.; Brindisi, M. Computational tool for fast in silico evaluation of hERG K+ channel affinity. Front. Chem. 2017, 5, 7. [Google Scholar] [CrossRef] [PubMed]
  34. Melnikov, F.; Anger, L.T.; Hasselgren, C. Toward Quantitative Models in Safety Assessment: A Case Study to Show Impact of Dose–Response Inference on hERG Inhibition Models. Int. J. Mol. Sci. 2023, 24, 635. [Google Scholar] [CrossRef] [PubMed]
  35. Box, G.E.P. Science and statistics. J. Am. Stat. Assoc. 1976, 71, 791–799. [Google Scholar] [CrossRef]
  36. Palacios-Can, F.J.; Silva-Sánchez, J.; León-Rivera, I.; Tlahuext, H.; Pastor, N.; Razo-Hernández, R.S. Identification of a Family of Glycoside Derivatives Biologically Active against Acinetobacter baumannii and Other MDR Bacteria Using a QSPR Model. Pharmaceuticals 2023, 16, 250. [Google Scholar] [CrossRef] [PubMed]
  37. Weininger, D. SMILES, a Chemical Language and Information System: 1: Introduction to Methodology and Encoding Rules. J. Chem. Inf. Comput. Sci. 1988, 28, 31–36. [Google Scholar] [CrossRef]
Figure 1. Examples of models calculated using target functions TF1 and TF2.
Figure 1. Examples of models calculated using target functions TF1 and TF2.
Molecules 28 06587 g001
Figure 2. The comparison of determination coefficients for the active training and the validation sets using TF1 and TF2 for splits 1–3.
Figure 2. The comparison of determination coefficients for the active training and the validation sets using TF1 and TF2 for splits 1–3.
Molecules 28 06587 g002
Figure 3. Williams plots of models calculated using target functions TF1 and TF2 for splits 1–3. The training set is the union of the active and passive training sets together with the calibration set; the compounds of the external test set are indicated in red.
Figure 3. Williams plots of models calculated using target functions TF1 and TF2 for splits 1–3. The training set is the union of the active and passive training sets together with the calibration set; the compounds of the external test set are indicated in red.
Molecules 28 06587 g003
Figure 4. The influence of presence/absence of promoters on the calculated cardiotoxicity.
Figure 4. The influence of presence/absence of promoters on the calculated cardiotoxicity.
Molecules 28 06587 g004
Table 1. The statistical quality of the QSAR model for cardiotoxicity was obtained with Monte Carlo optimization with target function TF1.
Table 1. The statistical quality of the QSAR model for cardiotoxicity was obtained with Monte Carlo optimization with target function TF1.
nR2CCCIICCIIQ2Q2F1Q2F2Q2F3RMSEMAEF
A *280.55520.71400.55880.78440.4940 0.4240.37132
P280.69110.74000.65250.83940.6454 0.4380.37358
C290.91010.95240.95360.94360.89550.90830.89980.94240.1600.129273
V280.9146--------0.200.15
A290.85480.92170.86290.93000.8306 0.2410.201159
P280.88930.89580.38770.93030.8740 0.2840.222209
C280.86800.92610.88360.92730.84510.83460.83400.86580.2420.194171
V280.8959--------0.270.24
A280.53470.69680.73120.78360.4239 0.4360.37930
P280.56070.70870.49300.77140.4976 0.4510.38733
C280.83740.91240.91500.89400.81730.82010.81920.88080.2280.179134
V290.9181--------0.210.16
* A = active training set; P = passive training set; C = calibration set; V = validation set.
Table 2. The statistical quality of the QSAR model for cardiotoxicity was obtained with Monte Carlo optimization with target function TF2.
Table 2. The statistical quality of the QSAR model for cardiotoxicity was obtained with Monte Carlo optimization with target function TF2.
nR2CCCIICCIIQ2Q2F1Q2F2Q2F3RMSEMAEF
A280.50670.67260.71180.77520.4439 0.4460.38627
P280.63410.66050.60430.84380.5801 0.4810.43645
C290.91430.95560.95620.94940.90110.92060.91320.95010.1490.123288
V280.9347--------0.190.15
A290.52530.68870.67640.79890.4496 0.4360.38730
P280.68060.75910.80090.83180.6362 0.3950.35455
C280.94940.97170.97410.97420.93960.94750.94730.95740.1360.105488
V280.9292--------0.190.16
A280.58700.73980.76620.78860.5004 0.4100.36137
P280.63470.77080.51160.79310.5878 0.4120.34145
C280.87730.93220.93660.92310.85870.85480.85400.90380.2050.174186
V290.9255--------0.220.18
Table 3. Promoters of an increase or decrease of cardiotoxicity (pIC50) that were observed for computational experiments with split-1.
Table 3. Promoters of an increase or decrease of cardiotoxicity (pIC50) that were observed for computational experiments with split-1.
IDSAkCWs Run1CWs Run2CWs Run3NA *NPNCdk
Promoters of increase
1(...........0.19470.74460.54402828290.0000
21...........0.72840.42110.35772828290.0000
3O...(.......0.37980.96500.57902828290.0000
4c...........0.13590.78770.30572828290.0000
5c...1.......0.60710.07980.17672827280.0009
6c...c.......0.68710.63690.62442828290.0000
7C...1.......0.26620.31890.43942523280.0038
81...(.......1.57160.94270.95252419260.0063
9N...(.......0.50490.65200.36102420230.0043
102...........0.37010.33640.37242020160.0058
11F...(.......0.40990.32200.15761724180.0085
12c...F.......0.23400.76880.0176131490.0105
13n...(.......0.42610.79840.62311210120.0042
14C...=.......1.26450.49200.864311560.0195
15F...1.......0.09910.35530.9094111190.0053
Promoters of decrease
1C...........−0.0788−0.0409−0.08202828290.0000
2=...(.......−0.6876−0.8715−0.28882626280.0009
3O...=.......−1.0322−0.3332−0.34442626280.0009
4O...C.......−0.4151−0.2547−0.09931712190.0094
5c...C.......−1.3073−0.8511−0.11591716150.0037
* NA, NP, and NC are the frequencies of a molecular feature in the active training, passive training, and calibration sets, respectively.
Table 4. The comparison of QSAR models for cardiotoxicity as suggested in the literature.
Table 4. The comparison of QSAR models for cardiotoxicity as suggested in the literature.
N TrainR2 TrainQ2 TrainRMSE TrainN ValidR2
Valid
RMSE ValidReference
1130.6576
0.6264
0.6872
0.6341
0.5801
0.6516
----[9] MOE-models
1130.6600
0.6896
0.7498
0.6272
0.6565
0.7118
----[9] MACCS-models
6230.29-0.6303450.410.550[30]
3090.911-0.2641120.8600.301[33]
40810.46-0.59---[34]
85
85
84
0.6774
0.8463
0.6169
0.6641
0.8379
0.5991
0.358
0.253
0.380
28
28
29
0.9146
0.8959
0.9181
0.203
0.275
0.210
Split-1
Split-2
Split-3
(In this work, TF1)
85
85
84
0.6348
0.7056
0.6746
0.6201
0.6931
0.6596
0.382
0.346
0.352
28
28
29
0.9347
0.9292
0.9255
0.189
0.186
0.221
Split-1
Split-2
Split-3
(In this work, TF2)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Toropova, A.P.; Toropov, A.A.; Roncaglioni, A.; Benfenati, E. Using the Correlation Intensity Index to Build a Model of Cardiotoxicity of Piperidine Derivatives. Molecules 2023, 28, 6587. https://doi.org/10.3390/molecules28186587

AMA Style

Toropova AP, Toropov AA, Roncaglioni A, Benfenati E. Using the Correlation Intensity Index to Build a Model of Cardiotoxicity of Piperidine Derivatives. Molecules. 2023; 28(18):6587. https://doi.org/10.3390/molecules28186587

Chicago/Turabian Style

Toropova, Alla P., Andrey A. Toropov, Alessandra Roncaglioni, and Emilio Benfenati. 2023. "Using the Correlation Intensity Index to Build a Model of Cardiotoxicity of Piperidine Derivatives" Molecules 28, no. 18: 6587. https://doi.org/10.3390/molecules28186587

APA Style

Toropova, A. P., Toropov, A. A., Roncaglioni, A., & Benfenati, E. (2023). Using the Correlation Intensity Index to Build a Model of Cardiotoxicity of Piperidine Derivatives. Molecules, 28(18), 6587. https://doi.org/10.3390/molecules28186587

Article Metrics

Back to TopTop