Hologram QSAR Models of 4-[(Diethylamino)methyl]-phenol Inhibitors of Acetyl/Butyrylcholinesterase Enzymes as Potential Anti-Alzheimer Agents

Hologram QSAR models were developed for a series of 36 inhibitors (29 training set and seven test set compounds) of acetyl/butyrylcholinesterase (AChE/BChE) enzymes, an attractive molecular target for Alzheimer’s disease (AD) treatment. The HQSAR models (N = 29) exhibited significant cross-validated (AChE, q2 = 0.787; BChE, q2 = 0. 904) and non-cross-validated (AChE, r2 = 0.965; BChE, r2 = 0.952) correlation coefficients. The models were used to predict the inhibitory potencies of the test set compounds, and agreement between the experimental and predicted values was verified, exhibiting a powerful predictive capability. Contribution maps show that structural fragments containing aromatic moieties and long side chains increase potency. Both the HQSAR models and the contribution maps should be useful for the further design of novel, structurally related cholinesterase inhibitors.


Introduction
Alzheimer's disease (AD) is the most common cause of dementia in elderly people. As the worldwide population ages, AD is reaching epidemic levels, giving rise to a huge human, social, and economic burden [1,2]. AD is neuropathologically characterized by the extracellular deposition of amyloid plaques, as well as intracellular neurofibrillary tangles (NFTs) [3,4]. However, the cause of AD is still a subject of considerable discussion, and there are no available treatments that stop or reverse disease progression. Current treatments only help alleviate AD symptoms [5].
The main therapeutic approaches in AD treatment are based on the cholinergic hypothesis. According to this hypothesis, the neurodegenerative processes lead to a selective destruction of the cholinergic neurons, resulting in a lack of central cholinergic transmission. This deficiency causes cognitive declination, a loss of ability to perform daily tasks, reduced attention, and a loss of memory [6,7]. Thus, the symptomatic treatment of AD is based on acetylcholinesterase (AChE) inhibitors, which have positive effects on the cognitive, functional, and behavior symptoms of the disease and consequently increase acetylcholine levels and cholinergic neurotransmission in the brain [8]. The increase in cholinergic transmission by the use of drugs that enhance the central cholinergic function is currently thought to be the best approach for the treatment of AD [9].
Butyrylcholinesterase (BChE), another enzyme that hydrolyzes acetylcholine, is predominantly found in the glia neurons and exists in greater amounts than the progressively decreasing AChE in the brains of AD patients. Studies with BChE may hold promise in the search of new inhibitors for AD treatment [10].
Further studies using molecular modeling predictive techniques, such as hologram quantitative structure-activity relationship (HQSAR) [11], in conjunction with experimental techniques, such as X-ray 3D structure elucidation, may help uncover new human cholinesterase inhibitors as potential anti-Alzheimer agents. HQSAR, as with other fragment-based methods, uses fragment fingerprints [12] as a predictive variable of the biological activity variation or other related data. It is a method that develops statistically validated QSAR models [13] using topological (2D) parameters such as hologram length and fragment distinctions [14,15].
In an attempt to design new chemical entities with efficient anticholinesterase activities, we have explored the 2D molecular features of a series of 4-(diethylamino)methyl]-phenol AChE/BChE inhibitors [16] using the HQSAR method, a powerful ligand-based strategy in drug design [17].

HQSAR Analysis
A data set of 36 4-[(diethylamino)methyl]-phenol AChE/BChE inhibitors was compiled from the work of Yu et al. [16]. The chemical structures and biological data of these AChE/BChE inhibitors are listed in Table 1, as well as the distribution in the training set (29 compounds) and test set (seven compounds), an important step in the development of QSAR models aiming to maximize the test set diversity and to analyze the model prediction accuracy [18]. HQSAR investigations require selecting values for the parameters that specify the hologram length, as well as the size and type of fragments to be encoded. The generation of the molecular fragments was performed using the following fragment distinction parameters: atoms (A), bonds (B), connections (C), hydrogen (H) atoms, chirality (Ch), and donor/acceptor (DA) atoms. For both the AChE and BChE inhibitor training sets, a default fragment size (4-7) was employed to obtain relevant statistical indexes with different combinations of fragment distinction parameters (Table 2). To improve these statistical indexes (Table 2), we investigated the influence of fragment size (2-5, 3-6, 4-7, 5-8, 6-9, 7-10, 8-11, and 9-12) ( Table 3) on the fragment distinction parameters of those models having the highest statistical indexes.  Finally, the best HQSAR model for the AChE inhibitors was generated using bonds (B) and connections (C) as fragment distinction parameters and 9-12 as the fragment size, showing q 2 = 0.787 and r 2 = 0.965 (Table 3). For the BChE inhibitors, the best HQSAR model was developed using atoms (A) and bonds (B) as fragment distinction parameters and 9-12 as the fragment size, showing q 2 = 0.904 and r 2 = 0.952 (Table 3).
It is interesting to note that the best model of each data set (AChE and BChE inhibitors) was generated when long fragments (i.e., 9-12 for AChE and BChE) were considered. Notably, there is a direct relationship between the side chain length and the inhibitory potency where a length of eight methylene units seems to be optimal [1]. Therefore, the generation of models with a long fragment size was not unexpected.
As an additional validation test, we performed the randomization of the biological activity values (i.e., Y-randomization test) and re-run the HQSAR analysis for the AChE and BChE training sets, resulting in q 2 values ranging from 0.083 to 0.191 and −0.113 to −0.133, respectively. This result unsure the robustness of the original HQSAR regression models, since there are no chance correlations.
In HQSAR, the structure encoded within a 2D fingerprint can be directly related to biological activity variation, and the most important test of a QSAR model is its ability to predict the activity value for new compounds. The external validation of the HQSAR models was performed by predicting the biological potencies of the test set using the best models derived from the training set. The predictive ability of the HQSAR models is expressed with predictive r 2 values, which is similar to cross-validated r 2 (q 2 ), and is defined using the Equation (1): In Equation (1), SD is the sum of squared deviations between the biological activity of the test set and the mean activity of the training set molecules, and PRESS is the sum of squared deviations between the observed and the predicted activity values for every molecule in the test set [19].
The experimental pIC 50 , predicted pIC 50 and residual (pIC 50 Exp. − pIC 50 Pred.) values of the AChE and BChE inhibitors obtained by the best HQSAR model for each data set are reported in Table 4. The comparison plots between the experimental and predicted potencies of both the training and test sets for the AChE and BChE inhibitors, using the best HQSAR models, are shown in Figure 1.
In this work, we considered as outliers all compounds of the training and test sets where the residual value exceeded twice the standard error of estimate of the model [20] (AChE, 2*SD = 0.30; BChE, 2*SD = 0.52). According to this criterion, there are two outlier compounds for the BChE model: compound 33, which presented the highest residual values in BChE (residual = −0.66), and compound 22, BChE (residual = 0.66), but none of them presented residual values higher than one logarithmic unit. Compounds 33 and 22 belong to the test set and did not take part in the model building process. The outlier 33 is analogous to 26 (the most potent AChE inhibitor and the fourth most potent BChE inhibitor), and both have the same ethylpiperazine R-group moiety, differing only in their side chain lengths ( Table 1). The HQSAR models overestimated the biological potency of 33 because of the direct relationship between the number of methylene spacer units in the side chain and the inhibitory potency of this series, as discussed earlier [1]. However, for this particular series, there is an optimum side chain size of eight methylene units, instead of 10 units as in compound 33. The outlier 22 is analogous to 21, and both have the same chain length but differ in the number of methylene units in their R-group moieties. The HQSAR models underestimated the biological potency of 22 because this derivative has two more methylene units than compound 21, resulting in a potency estimate closer in value to 21.  A complete HQSAR analysis involves the investigation of important molecular fragments directly related to the biological activity variation so that one may propose structural modifications. Thus, the HQSAR models can be graphically displayed as color-coded structure diagrams in which the color of each atom reflects its contribution to the potency variation. The red and green ends of the spectrum reflect negative and positive contributions, respectively whereas atoms with intermediate contributions are colored white [15]. The individual atomic contributions of the most (compounds 26 and 24) and least (compounds 1 and 2) potent AChE and BChE inhibitors, according to the best HQSAR models, are displayed in Figure 2. The HQSAR contribution maps for AChE and BChE inhibitors show that the structural fragment containing aromatic moieties increases potency, reinforcing the importance of the aromatic system in establishing π-π stacking interactions with the aromatic residues present in both enzymes, as described elsewhere [16,21,22]. In addition, the HQSAR map obtained from the BChE inhibitors also revealed the importance of protonation of the amine nitrogen atom. This chemical group is important, as it is involved in electrostatic interactions, also in agreement with literature data [16].
The compounds of this series have been designed to behave as AChE/BChE dual site inhibitors, i.e., binding simultaneously to the catalytic anionic site (CAS) and the peripheral anionic site (PAS) [16]. For AChE inhibitors, the side chain length was another feature highlighted in the HQSAR map contribution, emphasizing the need for a long side chain fragment. In fact, the crystal structure of AChE from Torpedo california shows that the CAS is located at the bottom of a deep and narrow gorge composed of 14 aromatic residues, and the PAS is located at the entrance of this gorge at a distance of ~20 Å [16,23]. Moreover, the estimated CAS-PAS distance is ~14 Å [24].
Surprisingly, a comparison of the contribution maps of the most (26) and least (1) potent AChE inhibitors revealed that the positive and negative contributions to biological activity come from the fragment that is common to both molecules ( Figure 2). The same can be seen in the most (24) and least (2) potent BChE inhibitors. A possible explanation for this finding is that the compounds with a longer spacer group can reach both CAS and PAS simultaneously, enhancing binding. Conversely, compounds with shorter linkers cannot reach both binding sites and, thus, can bind only weakly, being readily displaced by water molecules.

Data Set and Molecular Modeling
The data set used for the HQSAR studies contains the 36 4-[(diethylamino)methyl]-phenol derivatives developed by Yu et al., showing cholinesterase inhibitory activity against both AChE and BChE enzymes [16]. The biological activity of all compounds, originally expressed as IC 50 (µM) values [16], were converted to pIC 50 (M) (−Log IC 50 , Table 4) values ( Table 1). The chemical structures of all compounds were constructed using the PC Spartan'10 program [25]. HQSAR HQSAR modeling was performed using the SYBYL 8.0 package [26]. The 36 compounds were divided in the same training (29 compounds) and test (seven compounds) sets for both the AChE and BChE studies, considering that test set molecules should represent high, middle and low potency compounds, also spanning structural diversity, to avoid potential problems during the HQSAR model external validation.
The HQSAR study has three key steps: the generation of fragments for each molecule in the training set, the encoding of these fragments in holograms, and the correlation with available biological data [27]. Parameters that are associated with the generation of holograms, such as hologram length (HL), fragment size, and fragment distinction, may affect the HQSAR model; thus, different combinations of these parameters were considered during the HQSAR runs [14,15].
The structures of the phenolic derivatives, which comprise the training set, were converted to structural fragments initially using a minimum and maximum number of four and seven atom fragments, respectively. All fragments produced were allocated to the defined size of the molecular hologram (53, 59, 61, 71, 83, 97, 151, 199, 257, 307, 353, 401 bins), and analysis was performed to distinguish the fragments based on a combination of two or more of the following parameters: atoms (A), bonds (B), connectivity(C), hydrogen (H) atoms, chirality (Ch), and donor/acceptor (DA) atoms. The influence of the fragment size on the statistical indexes was evaluated using the best combinations of distinguishing fragments.
All QSAR models were generated using partial least squares (PLS), and the internal validation was performed by leave-one-out (LOO) cross-validation (q 2 ). In order to access the risk of chance correlation, it has been carried out the y-randomization (y-scrambling or response randomization) test, an additional validation procedure, by randomization the biological activity values (i.e., Y-randomization or Y-scrambling test) and re-run the HQSAR analysis for the AChE and BChE training sets. An external validation was performed with the test set compounds, which was not considered for the HQSAR model development. The predictive ability of the models was investigated by predictive r 2 values (r 2 pred). The parameters used above were applied to both AChE and BChE studies.

Conclusions
HQSAR models were obtained using the same training (29 compounds) and test (seven compounds) sets for all analyses and demonstrated reliable predictive power for both AChE and BChE inhibitors. These models (N = 29) showed significant cross-validated (AChE, q 2 = 0.787; BChE, q 2 = 0.904) and non-cross-validated (AChE, r 2 = 0. 965; BChE, r 2 = 0.952) correlation coefficients. The strong correlation between the experimental and predicted potency values for the test set compounds (external validation) shows the reliability of the constructed HQSAR models for both studies. HQSAR contribution maps of these models were used to explain the importance of the structural fragment to the overall activity of this series; structural fragments containing aromatic moieties and long side chains increase potency, in agreement with previous works. Therefore, the HQSAR models and contribution maps should be useful for future efforts in the design of new cholinesterase inhibitors with improved biological activity.