1. Introduction
Parkinson’s disease (PD) is the second most common neurodegenerative disorder after Alzheimer’s disease, affecting approximately 1% of the global population over the age of 60 [
1,
2,
3]. It is caused by the degeneration of dopaminergic neurons located in the
substantia nigra pars compacta [
4]. This neuronal loss leads to a decrease in dopamine release in the
striatum, which results in the hallmark symptoms of PD, which are bradykinesia, rigidity, postural instability, tremors, and a significant decline in patients’ quality of life [
5].
The diagnosis of PD still relies on clinical criteria and remains challenging [
6], particularly in its early stages when symptoms may resemble those of other disorders [
7] such as multisystem atrophy, progressive supranuclear palsy, essential tremor, and vascular or drug Parkinsonism [
5,
6,
8]. Therefore, the use of tools that allow for an accurate and early diagnosis of PD is crucial, given their confirmed impact on both prognosis and treatment outcomes [
9]. One such approach involves evaluating the integrity of the nigrostriatal dopaminergic pathway using dopamine active transporters (DATs) [
10]. The DATs are a group of transmembrane proteins associated with chloride channels, located at the presynaptic terminals of dopaminergic neurons [
11]. They are responsible for the reuptake of dopamine after its release into the synaptic cleft [
11,
12]. In patients with PD, because of the degeneration of these neurons, the density of DATs is significantly reduced [
13].
Non-invasive molecular neuroimaging techniques, such as SPECT (Single-Photon Emission Computed Tomography) and PET (Positron Emission Tomography), enable in vivo visualization of various tissues and organs using radiopharmaceuticals [
12,
14]. Several PET and SPECT radiotracers have been developed to target DATs [
15]. Among them are radio-iodinated cocaine analogues, such as
123I-Ioflupane (
123I-
N-ω-fluoropropyl-2β-carbomethoxy-3β-(4-iodophenyl)nortropane) and
123I-β-CIT (
123I-2β-carbomethoxy-3β-(4-iodophenyl)tropane) [
16]. These SPECT radiopharmaceuticals are widely used to assess the integrity of the nigrostriatal dopaminergic pathway and have demonstrated good diagnostic sensitivity and specificity for suspected PD cases [
15]. However, both tracers require pre-treatment with a thyroid-blocking agent to protect the thyroid gland from radioactive iodine uptake [
17]. Additionally, imaging with
123I-β-CIT must be conducted 24 h post-injection, which can be inconvenient for outpatient settings [
18]. There are also PET radiotracers with affinity for DAT, such as
11C-
N-(3-iodoprop-2(
E)-enyl)-2β-carbomethoxy-3β-(4′-methylphenyl)nortropane (
11C-PE2I) [
16]. In spite of its high specificity, the short half-life of
11C (approximately 20 min) limits its clinical utility, as it requires on-site cyclotron production [
19]. These limitations underscore the need for the development of new DAT-targeting radiopharmaceuticals that offer improved pharmacokinetic properties and logistical advantages, enabling better assessment of the nigrostriatal pathway in the early diagnosis of PD.
The estimated cost and time required to bring a new drug to market are approximately USD 2.8 billion and 7 to 12 years [
20]. Moreover, of the roughly 40,000 compounds tested in animals, only five typically proceed to human trials, and just one in five of those that reach clinical studies ultimately receives approval [
21]. This reflects a significant expenditure of financial, human, and time resources. In response, virtual (computational) screening has emerged as a promising alternative to the traditional “trial-and-error” approach in identifying new chemical entities with biological activity. As a result, the pharmaceutical industry has shown growing interest in computational methodologies [
22]. Among these approaches, QSAR (Quantitative Structure–Activity Relationship) modeling stands out as a valuable in silico technique connecting chemical structure with pharmacological activity. QSAR offers clear advantages in terms of cost- and time-efficiency, especially during the early stages of drug discovery, by enabling the prioritization of the most promising compounds for further evaluation [
23,
24,
25,
26].
In recent years, numerous computational studies have been undertaken to identify novel compounds with potential radiopharmaceutical applications (references). Notably, some of these investigations have targeted specific biological systems, such as the dopamine (D2) receptor and the adenosine A2A receptor, the latter with an emphasis on antagonist identification. A significant number of studies have also concentrated on modeling interactions with the dopamine transporter. However, many of these efforts have relied on relatively limited compound databases or congeneric series, which restricts the generalizability and predictive power of their findings.
In recent years, a growing number of computational studies have aimed to identify novel compounds with potential radiopharmaceutical applications [
27,
28,
29,
30]. These efforts have usually focused on specific molecular targets, including the dopamine (D
2) receptor [
27] and the adenosine A
2A receptor (particularly in the context of antagonist discovery) [
28]. In parallel, several studies have focused on modeling interactions with the dopamine transporter [
29,
30]. In spite of their contributions, many of these studies are based on relatively small datasets or congeneric compound series, which limits the extrapolation, robustness, and external validation of the resulting models. Given this context, the main aim of the present work is to propose a well-validated computational model based on a structurally diverse dataset, capable of identifying the key structural features required for DAT affinity, and to integrate it with molecular docking studies. This combined approach aims to facilitate the prediction of novel radiopharmaceutical candidates with potential utility in the diagnosis of Parkinson’s disease.
4. Discussion
The development of new computational models demands a suitable goodness-of-fit, robustness, applicability domain, and predictability. The model developed in this study showed an R
2 value of 0.755, indicating a suitable fit for modeling DAT binding affinity. This means that the model explains more than 75% of the experimental variance. There was no overfitting in the model, thus representing a good fit with a minimum number of descriptors, because of the low value of the LOF parameter (0.195) and an R
2adj = 0.729; the LOF is used to penalize the addition of descriptors in the model equation and should be as low as possible, while the R
2adj value should be as similar as possible to the R
2 value, which the developed model complied with. The correlation between the descriptors of the model was low since the K
XX value (0.236) was small, indicating that there was no redundant information in the selected descriptors. In addition, the correlation between the descriptors and the response variable was appropriate, according to the ΔK parameter (0.076), with a small error in the calculations of both training and prediction estimates (RMSE
tr = 0.358; MAE
tr = 0.276;
s = 0.381).
Figure 4 shows the values predicted by the model equation against the experimental LogK
d values for both training and prediction series. As can be seen, most of the points are close to the line, and three compounds with atypical behavior in comparison to the rest of the molecules in the database were detected as outliers and not used to develop the model.
Table 2 shows the predicted values for the compounds in the training set and the predictions in the LOO experiments as well as the residuals with respect to experimental values and leverage values (which were used to establish the applicability domain). The predictions are quite similar to the experimental values for most compounds. The molecular descriptors included in the model can be seen in
Table 6 with the general interpretation of the molecular feature that describes each descriptor and the descriptor block to which they belong in the software.
A critical aspect in QSAR studies is the definition of the AD of the models. Next, we used two approaches to establish the chemical space described by our model. First, we used a Williams graph (
Figure 5A): the AD is defined as the area at the left of a leverage threshold
h* = 0.3571 and within ±3 standard deviations. As shown in the figure, most compounds are within the applicability domain of the model. There is only one compound (49, belonging to the training set) with a leverage value (
h = 0.2327) greater than the critical leverage (
h*), although it shows the SD value within the limits; therefore, the prediction should be considered with caution. In addition, we used another approach available in QSARINS software, the Insubria graph (
Figure 5B), which plots the leverage values against the predicted responses for each compound. The results agree with the previous approach, identifying the same compound outside the AD with a leverage value greater than the cutoff value, but in this approach two compounds in the prediction set (36 and 56) present LogK
d values that were slightly overestimated; see
Table 3 for details.
To evaluate the robustness and stability of the model, several internal validation strategies were performed. First, an LOO cross-validation technique, which iteratively excludes a compound of the dataset: The variance explained in the prediction by LOO was Q
2LOO = 0.680, with a small error in the predictions (RMSE
cv = 0.409 and MAE
cv = 0.316); these results are shown in
Figure 6A. Second, an internal validation by LMO was performed, which was developed by leaving out 30% of the dataset to study the behavior of our model. The value obtained for this experiment (Q
2LMO = 0.648) was like the value previously achieved in LOO, although it is important to notice that this technique gives better results with larger databases. The Q
2LMO values (
red circles) were similar to each other and distributed around the Q
2 of the model (blue circle), corroborating its good fit and stability (see
Figure 6B). In the
Y-scrambling experiment, the absence of change in correlation for the model was checked. The expected behavior is that both values of R
2Y-scr and Q
2Y-scr (for each iteration) should differ appreciably from the R
2 and Q
2 values of the model. Both values of R
2 and Q
2 for every iteration are shown in
Figure 6 (
yellow and
red circles, respectively), and their averages were R
2Y-scr = 0.098 and Q
2Y-scr = −0.174, quite far from the respective values of these parameters for the model (
light and
dark blue for R
2 and Q
2, respectively), confirming its validity.
To evaluate the model’s ability to predict new compounds, an external validation was also performed. The procedure was carried out by applying the model equation developed with the training set to the set of external predictions. Based on the results with the external set, we can say that the model has a good predictive power (R
2ext = 0.709), with lower value of 0.339 for RMSE
ext. As can be seen in
Figure 5A, all compounds in the prediction set are within the applicability domain of the model.
Table 3 shows the predicted values for the compounds in the prediction set and the residuals with respect to experimental values and the leverage values. The predicted values are quite similar to the experimental values, demonstrating the predictive power of the developed model.
The validated model was then used to predict the affinity for DAT of new molecules of interest. For this, we selected a set of eleven compounds with different actions over the CNS; among the activities of these compounds, we can find psychostimulant, centrally acting sympathomimetic agent, anorectic psychostimulant, centrally acting anticholinergic, a radiopharmaceutical used to diagnose Parkinsonism, nasal decongestant for systemic use, and psychedelic. This feature ensures that they can reach the DAT and makes the evaluation of these compounds in virtual screening more interesting. This group of compounds was evaluated with the computational model and assessed by docking to estimate the binding affinities for the DAT.
Among the compounds selected for virtual screening, 123I-Ioflupane (one of the most widely used DAT radiotracers) was chosen, which achieved a good prediction for the LogKd value of 3.088; taking this as a point of reference, we highlighted the compounds with affinity for DAT potentially better than 123I-Ioflupane. We found that Modafinil and Armodafinil seem to be the better compounds showing the lowest LogKd value of 2.386, followed by Benztropine with 2.449 and Procaine with a value of 2.713, slightly lower than the 3.088 shown by 123I-Ioflupane. The results obtained with the QSAR model fully agree with those obtained in the docking experiments, which are described below.
The results of the docking experiments are shown in
Table 5. As can be seen, all the ligands present binding energies (ΔG
binding) between −5.278 and −7.303 kcal/mol, with the most negative energy corresponding to the complex formed between the ligand Ioflupane (009) and the dopamine transporter, with a binding free energy of −7.303 kcal/mol. The stability of this complex is based on the two H-bond interactions with Asp68 and Ser72, in addition to the π-π stacking interaction with Tyr355 (Sandwich Type) (
Figure 9C).
The second binding energy corresponded to the complex formed between Modafinil (001) and the dopamine receptor, with a binding free energy of −6.669 kcal/mol. The stability of this complex is based on two π-π stacking interactions shared with Phe332 (T-Shaped and Sandwich Type) (
Figure 9A). The third and fourth most negative binding free energies were found between the complexes formed between Procaine (003) (−6.452 kcal/mol) and Armodafinil (010) (−6.436 kcal/mol). The stability of the Procaine–dopamine transporter complex is based on one H-bond interaction with Asp79, and two π-π stacking interactions with Phe320 (Sandwich Type) and Phe326 (T-Shaped Type) (
Figure 9B). The stability of the Armodafinil–dopamine transporter complex is based on two hydrogen bond (H-Bond) interactions shared with Leu459 (
Figure 9D), in addition to the π-π stacking interaction with Phe462 (T-Shaped Type) (
Figure 9D). The least stable complex of all those subjected to docking experiments was the one formed between the ligand Phenmetrazine (004) and the dopamine transporter, as well as the least stable complex, corresponding to Phentermine (002) and the dopamine transporter. These two ligands were oriented away from the active center of the protein with loss of non-covalent interactions (
Figure 10).
Below, we present a detailed comparison of the results of our docking experiments (ΔGbinding and RMSD) with recently published data from computational and experimental studies. This comparison serves to validate the consistency and reliability of our findings. For example, a recent study developed robust QSAR models on approximately 61 DAT inhibitors, including drugs similar to ours, and obtained ΔGbinding values very similar to ours. Phenmetrazine (L-004) in this study behaved very similarly to ours, with ΔGbinding ≈ −6.8 kJ/mol [
78].
To support the validity of our data, we compared the docking results with recent experimental affinities in the scientific literature. Modafinil has been estimated to have a Ki ≈ 2600 nM (≈2.6 µM) per human DAT, which is in agreement with our finding of ΔGbinding ≈ −5.99 kJ/mol, reflecting a low affinity for the dopamine receptor, a result that is in agreement with our study. Furthermore, an analogous relationship between experimental affinity in this study and the binding energies observed in our set of molecules (with several ligands in the range −7.0 to −7.5 kJ/mol with RMSD < 2 Å) has been found, corroborating the predictive power of our docking experiments [
79].
Molecular dynamics simulations provide us with information about the behavior of ligand–protein complexes over time at the atomic level [
70,
80,
81,
82]. In our case, they also allow us to determine the stability of the systems and whether the interactions found in the docking experiments are maintained over time. To this end, we analyzed several parameters obtained from the trajectory, which will be discussed below. One of the important parameters when analyzing the stability of ligand–protein complexes over time is the RMSD [
47,
83,
84]. The behavior of this variable over time for all the complexes studied is shown in
Figure 11. As can be seen in
Figure 2, all complexes subjected to molecular dynamics simulations maintained low RMSD values (<2 Å), indicating that these complexes remained stable throughout the simulation period. This result is consistent with that obtained from the docking experiments. The lowest RMSD values were found in the complex formed by Benztropine and the dopamine receptor, meaning that it was the most stable complex over time. This complex had the most negative energy obtained in the docking experiments, coinciding with the results obtained from the docking experiments and the molecular dynamics experiments; therefore, we can expect that this molecule could be a good candidate as a radiotracer for Parkinson’s disease diagnosis.
One of the parameters that could explain the stability of compounds over time is hydrogen bond interactions (
Figure 12). As can be seen in
Figure 12, the number of hydrogen bond interactions during the trajectory was low. The highest number of hydrogen bond interactions was found in the complex formed by Procaine and the dopamine transporter. Given the structure of this molecule (
Figure 3), there is only one possibility of hydrogen bond interaction as the hydrogen acceptor: the carbonyl oxygen of this ligand, which interacts strongly and stably with Asp79 of the dopamine transporter with an occupancy of 88.81% (
Table 7). This interaction could give the complex strong stability during the simulation time.
The second complex with stable hydrogen bond interactions over time was found in the complex formed by diethylpropion with the dopamine transporter. This complex maintained strong interactions with Asp421 with an occupancy of 60.98% (
Table 7), which is considered a stable interaction over time. This behavior could explain the low RMSD values found during the 100 ns trajectory.
In the case of the complex formed by Benztropine and the dopamine transporter, no stable hydrogen bond interactions were found over time. Given the structure of this molecule (
Figure 11), it has little chance of interacting by hydrogen bond with the amino acids of the dopamine transporter, because it only has one hydrogen acceptor group and it is an oxygen of the ether group within Benztropine. The stability of this complex found in the RMSD parameter could be because of another type of interaction such as π-π staking, Van del Waals interactions, or solvation type.
Another parameter usually used to analyze ligand–protein complexes is the radius of gyration. This variable provides information about the degree of compaction of a complex during the simulation time. As can be seen in
Figure 12, the two most compact complexes were those formed by Procaine and Benztropine, with radii of gyration values below 4 Å. This result is consistent with the RMSD results and with the results obtained from the docking experiments, so we can conclude that these two compounds could be good markers for Parkinson’s disease.