Prediction of Multi-Target Networks of Neuroprotective Compounds with Entropy Indices and Synthesis, Assay, and Theoretical Study of New Asymmetric 1,2-Rasagiline Carbamates

In a multi-target complex network, the links (Lij) represent the interactions between the drug (di) and the target (tj), characterized by different experimental measures (Ki, Km, IC50, etc.) obtained in pharmacological assays under diverse boundary conditions (cj). In this work, we handle Shannon entropy measures for developing a model encompassing a multi-target network of neuroprotective/neurotoxic compounds reported in the CHEMBL database. The model predicts correctly >8300 experimental outcomes with Accuracy, Specificity, and Sensitivity above 80%–90% on training and external validation series. Indeed, the model can calculate different outcomes for >30 experimental measures in >400 different experimental protocolsin relation with >150 molecular and cellular targets on 11 different organisms (including human). Hereafter, we reported by the first time the synthesis, characterization, and experimental assays of a new series of chiral 1,2-rasagiline carbamate derivatives not reported in previous works. The experimental tests included: (1) assay in absence of neurotoxic agents; (2) in the presence of glutamate; and (3) in the presence of H2O2. Lastly, we used the new Assessing Links with Moving Averages (ALMA)-entropy model to predict possible outcomes for the new compounds in a high number of pharmacological tests not carried out experimentally.

targets on 11 different organisms (including human). Hereafter, we reported by the first time the synthesis, characterization, and experimental assays of a new series of chiral 1,2-rasagiline carbamate derivatives not reported in previous works. The experimental tests included: (1) assay in absence of neurotoxic agents; (2) in the presence of glutamate; and (3) in the presence of H2O2. Lastly, we used the new Assessing Links with Moving Averages (ALMA)-entropy model to predict possible outcomes for the new compounds in a high number of pharmacological tests not carried out experimentally.

Introduction
Entropy measures are universal parameters useful to codify biologically relevant information in many systems. In the 1970's Bonchev and Trinajstic et al. published works about the use of Shannon's entropy to calculate a structural information parameter [1][2][3][4]. Kier published another seminar works on the use of Shannon's entropy to encoding molecular structure in Cheminformatics studies in 1980 [4]. Many other authors used Shannon's entropy parameters for the same purpose on small molecule structure [5][6][7][8][9][10]. Graham et al. [11][12][13][14][15][16] used entropy measures to study the information properties of organic molecules. Entropy information measures were used to describe proteins [17,18], DNA sequences [19], protein networks [20], and magnetic resonance outcomes [21]. The software MARCH-INSIDE (MI) uses the theory of Markov chains to calculate the parameters θk(G). These values are the Shannon entropies of order k th of a graph G. The θk(G) values are useful quantify information about the structure of molecular systems [22]. The graph G represents a complex molecular system as a network of nodes interconnected by links (Lij = 1) or not connected (Lij = 0). MI algorithm associates a Markov matrix to the graph G in order to define the probabilities of interactions (ties or relationships) between nodes. These entropy parameters θk(G) can be calculated for many types of systems (molecular or otherwise). We have studied small molecules, RNA secondary structures, protein sequences, viral surfaces, cerebral cortex networks, metabolic networks, host-parasite networks, world trading networks, social networks, etc. In molecules, we know the information about links Lij (covalent bonds, hydrogen bonds, spatial contacts, etc.) beyond any reasonable doubt. However, we can use the information (θk(G) values)of the system to predict interactions with other systems in a network of a higher-structural level. For instance, we use the θk(G) values of drugs and targets structure to predict drug-target interactions (links) in drug-target network. In other cases, linking patterns change, are not known, or we find contradictory information. This is the case of the existence of different relationships between nodes in biological webs or social networks. In these cases, we can use the θk(G) values of known networks to find models useful to predict links in new networks [23][24][25].
On the other hand, the discovery of new drugs for the treatment of neurodegenerative diseases such as Alzheimer's, Parkison's, and Huntington's disease, Friedreich ataxia and others, is an important goal of medicinal chemistry [26][27][28][29]. The genes causing hereditary forms of some of these diseases have been identified but the molecular mechanisms of the neuronal degeneration have not been totally understood yet [30]. This picture, and some disappointing results in clinical trials, makes interesting the prediction of drug candidates with computational techniques [31,32]. In order to design these computational models we need to process chemical information from public databases. These databases have accumulated immense datasets of experimental results of pharmacological trials for many compounds. For instance, CHEMBL [33,34] is one of the biggest with more than 11,420,000 activity data for >1,295,500 compounds, and 9844 targets. This huge amount of information offers a fertile field for the application of computational techniques [34,35].
The analysis of all this data is very complex due to the presence of multi-target, multi-output, and multi-scale information. Multi-target complication emerges due to the existence of compounds with multiple targets [36][37][38]. This led to the formation of complex networks of drug-target interactions. We can represent drug-target networks as a graph with two types of nodes drugs (di) and targets (tj) interconnected by links (Lij). Barabasi et al. [39], constructed a drug-target network based on Food and Drug Administration (FDA) drugs and proteins linked by drug-target binary associations. Csermely et al. [40], reviewed the use of networks, including drug-target networks, for drug discovery.
Multi-output feature refers to the necessity of prediction of different experimental parameters (IC50, Ki, Km, etc.) to decided whether two nodes (drug and target) interact (Lij = 1) or not (Lij = 0). Multi-scaling refers to the different structural levels of the organization of matter. In this case, the input variables quantify molecular information (drugs structure) and macromolecular information (targets). They have to quantify also cellular (cellular targets) and organism information (specie that express the target). In these models we have a high number of assays carried out in very different conditions (cq) like time, concentrations, temperature, cellular targets, tissues, organisms, etc. In a recent work, we combined the θk(G) values calculated with MI and the idea of Moving Average (MA) operators with a similar purpose [41]. In time series analysis the MA operators are average values of characteristic of the system for different seasons. In fact, MA models became popular after the initial works of Box and Jenkins [42]. In time series analysis, MA models may combine other operators I = Integrated, AR = Autoregressive, N = Non-linear operators, or X = Exogenous effects. In this sense, others models have emerged combining different operators: ARMA, ARIMA, VARIMA, ARIMAX, NARMA, etc. In multi-output modeling, we calculate the MA operators as the average of the property of the system (molecular descriptors or others) for all drugs or targets with a specific response in one assay carry out under a sub-set of conditions (cj). Consequently, our MA operator is not acting over a time domain but over a sub-set of conditions of the pharmacological assays. The idea of application of MA operators to other domains different from time is gaining adepts due to its advantages. For instance, Botella-Rocamora et al. [43] developed a model for disease mapping using spatial Box-Jenkins operators with the form of MAs, to define dependence of the risk of a disease to occur. In our models, we use MA in relation with properties of nodes of networks (drugs, proteins, reactions, laws, neurons, etc.); which form links Lij(cq) in specific sub-set of conditions (cq). For this reason, we decided to call this strategy as ALMA (Assessing Links with Moving Averages) models. Speck-Planche and Cordeiro reported different multi-target or multi-output models using the same type of ALMA models [44][45][46].
In the specific area of neurodegenerative diseases, almost all these datasets includes also large sub-sets of assays involving potential neuroprotective drugs, targets, as well as drug-target and/or target-target interactions. The database NeuroDNet has interactive tools to create interaction networks for twelve neurodegenerative diseases. According to Vasaikar et al. [47], it is the first of its kind, which enables the construction and analysis of neurodegenerative diseases through protein interaction networks, regulatory networks and Boolean networks. In the case of neuroprotective compounds, some authors have reported multi-target ALMA models. García et al. used topological descriptors for a large series of 3370 active/non-active compounds to fit a classification function that can predict links Lij (interactions) of heterogeneous series of GSK inhibitors compounds with different neurological targets relevant to Alzheimer's disease and parasite species. Speck-Planche et al. [48], developed a multi-target model using a large and heterogeneous database of inhibitors against five proteins associated with Alzheimer's disease. The model correctly classified more than 90% of active and inactive compounds in the treatment of Alzheimer's disease on both, training and prediction series. Several guidelines are offered in other paper to show how the use of fragment-based descriptors can be determinant for the design of multi-target inhibitors of proteins associated with Alzheimer's disease [49].
In a recent work, we used the method TOPS-MODE (TM) [50] to calculate the structural parameters of drugs. The model correctly classified 4393 out of 4915 total cases with Specificity (Sp), Accuracy (Ac), and Sensitivity (Sn), of 80%-98%. We also used the method TM to develop one ALMA [51] model useful for the prediction of neuroprotective drugs. This dataset includes Multi-output assay endpoints of 2217 compounds for at least one out of 338 assays, with 148 molecular or cellular targets, and 35 types of activity measures in 11 model organisms (including human). In a third work [52], we introduced another ALMA model for neurotoxicity/neuroprotective effects of drugs based on the method MI. First, we used MI to calculate molecular descriptors of the type of stochastic spectral moments of all compounds. Next, we found a model that classified correctly 2955/3548 total cases on training and validation series with Ac, Sn, and Sp > 80%. Each data point (>8000) contains the values of 37 possible measures of activity, 493 assays, 169 molecular or cellular targets, and 11 different organisms (including human) for a given compound. The model has shown excellent results also in computational simulations of high-throughput screening experiments, with Ac = 90.6% for 4671 positive cases. Both models are able to predict the links Lij(cq) between i th drugs and j th targets according to the assay aq. However, we do not carried out a formal construction and a comparison of the drug-target networks for the CHEMBL data in previous papers. In any case, despite the high versatility of entropy measures to codify structural information, there is no report of a multi-target model for drug-target interactions for compounds with neuroprotective/neurotoxic effect. In this work, we report the first multi-target, multi-output, and multi-scale ALMA model for CHEMBL data of neuroprotective/neurotoxic effect of drugs. Then, we construct and compare for the first time three Multi-output assay complex networks for these CHEMBL dataset using the two previous models and the model reported in this work. From there, we reported by the first time the synthesis, characterization, and experimental assays of a new series of rasagiline carbamate derivatives not reported in previous works. We carried out three different experimental tests: assay (1) in absence of neurotoxic agents; (2) in the presence of glutamate; and (3) in the presence of H2O2. Finally, we used the new entropy model to predict possible outcomes for these compounds in a high number of pharmacological tests not carried out experimentally. The results presented here show the high potential of entropy parameters of chemical information for the design of neuroprotective drugs, the construction of complex bio-molecular networks, and the potential of ALMA models for multi-target, multi-output, and multi-scale modeling.

Model Training and Validation
We report a model to predicting when the i th compound may present a high (Lij(cq) = 1) or not (Lij(cq) = 0) value of the experimental parameter used to characterize interaction with a molecular or cellular target involved in a neuroprotective/neurodegenerative process. The output Sij(cq) of our multi-output model depend on both chemical structure of the i th drug di and the set of conditions selected to perform the biological assay (cq) including the j th target, of course. In consonance, the ALMA model should predict different probabilities if we change the organisms (c1), the biological assays (c2), the molecular/cellular target (c3), or the standard experimental parameter measured (c4), for the same compound [53].The best ALMA-entropy model found in this work was: The statistical parameters for the above equation in training are: Number of cases used to train the model (N), Canonical Regression Coefficient (Rc), Chi-square (χ 2 ), and p-level [54]. The probability cut-off for this Linear Discriminant Analysis (LDA) model is i p1(cq) > 0.5 ≥ Lij(cq) = 1. It means that the drug di predicted by the model, with probability p > 0.5, is expected to give a positive outcome in the q th assays carry out under the given set of conditions cq. This ALMA-entropy model presents excellent performance in both training and external validation series with Sn, Sp, and Ac > 80% (see Table 1). Values higher than 75% are acceptable for LDA-QSAR models, according to previous reports [55][56][57][58][59].
The first term in the equation, quantify both the quality of the input data p(cl) and the information θ i 5 about the structure of the drug (see material and methods and previous works [51]). We can expand the Box-Jenkins MA terms in the ALMA equation in order to clearly depict all the parameters involved: After inspection of this equation, we can see that the ALMA model can predict for the same compound different scores for different experimental parameters, targets, assays, or even different organisms. In Table 2 we illustrate the values of probability of drug-target interaction pij(cq) predicted with the previous model, for several examples of known drugs or new promising compounds. These are the probabilities with which the i th compound interact with the j th drug under the assay conditions cq. This is equivalent to pij(cq) > 0.5 ≥ Lij(cq)pred = 1. However, online supplementary material files contain a complete list with many examples of positive and control cases.    The Table 2 shows predictions of the same drug in different sets of conditions of assay cq, including different targets, organisms, or assays. Therefore, we only have to substitute in the equation the value of θ i 5 of the compound and the respective values p1(cq)·<θ i 5(cq)> for the MA operators of each condition. In the Table 3 we depict many examples of values of MA operators p1(cq)·<θ i 5(cq)> for different conditions.

Comparison with Other ALMA Models
An interesting exercise is the comparison of the present model and the network predicted with outcomes obtained with other methods. Until the best of our knowledge, there are only two similar models. Both models make use of the spectral moments of a molecular matrix as input variables (Di) to quantify the molecular structure of drugs. The first model [51] applies spectral moments µk of order k th of the bond adjacency matrix ( 1 B) calculated with the TM approach. The equation of this model is the following: The second model [52] employs as input the π i k values of the Markov matrix ( 1 Π) of atom-atom electron delocalization calculated with the software MI. In the TM method, we weighted the edges of the molecular graph with standard distances of chemical bonds whereas the MI algorithm employs atom standard electronegativities to weighting the nodes of molecular graph. The equation of the second model is: In both cases, as well as in the present ALMA-entropy model, we used MA terms to quantify the deviations of the structure of one compound from sub-sets of compounds with a positive outcome in different conditions cq. The three methods showed excellent values of Ac, Sp, and Sn on both training and validation series (see Table 1). Apparently, the TM model shows better values of these parameters but we have to take into consideration the differences in the complexity of the data sets used to train and validate these models. The TM-spectral moment model is able to classify correctly 83%-82% of 4915 cases in total (on training and validation series respectively). The MI-spectral moment model is able to classify correctly 89%-92% of 3598 cases. Notably, the MI-entropy model is able to classify correctly 89%-92% of 8309 cases. Consequently, the statistics for the present model refer to a dataset with more than twice the number of data points present in previous models.  ALMA models may be useful both (1) for computational or virtual High-Throughput Screening (HTS) screening of large databases like CHEMBL and/or (2) for construction of drug-target networks. All the results, discussed in previous section, indicate that many compounds may act as multi-target drugs with non-linear or indirect effect (orthosteric and/or allosteric) over different targets in different pathways. In a recent special issue edited by Csermely, Nussinov, and Szilágyi [60], different research groups discussed about this topic and related concepts such as allo-networks. In one of these papers, Mueller et al. [61] have developed a computational model for the HTS of drugs with action over mGluR5; which represent a promising strategy for the treatment of schizophrenia. Considering the relevance of allotropy for these and other receptors for our study, and all previous comments about allo-network drugs, we decided to use our model to construct a drug-target network. The interest in doing so is that this type of network-based tools may be applied for the discovery of new drugs, including perhaps allo-network drugs [40,60,62].
Considering these points, we constructed here by the first time a drug-target network with CHEMBL experimental outcomes of multiplex assays of neuroprotective effects of drugs with the same dataset used in the previous section. This is probably the first drug-target network representation of the interaction of neuroprotective compounds with cellular or protein targets; many of them susceptible to allosteric modulators. In this directed network, we used three classes of nodes, drugs (di), targets (tj), and pharmacological assays (aq). They are connected by only three classes of arcs (directed links) drug ≥ (di ≥ tj), drug ≥ assay (di ≥ aq), and target-assay (tj ≥ aq). Other types of relationships were not considered. The observed drug-target network was constructed with the input dependent variable Lij(cq). In consequence, if CHEMBL reports the case of drug di that causes an strong biological response (Lij(cq) = 1) in one biological experiment carry out under the conditions cq = (tj, aq), we have to draw in the network the path di ≥ tj ≥ aq. We omitted here the representation of nodes for the type of experimental measure and the organism that express the target. This avoids very highly connected nodes that may cause a strong distortion in network topology and mask or hidden the relevance of important drugs or targets.
The observed network constructed with the dataset published in the previous work has 968 nodes = 721 drugs + 72 targets + 175 pharmacological assays for neuroprotective effects. We apply, also, the software MI to quantify the structural information of the drug-target networks. In so doing, we calculated the Shannon entropy (Sh), as well as δ = node degrees for the nodes (drugs, targets, and assays) in the network, see Table 4. Please note that the Sh entropy values for the nodes in the drug-target network (supra-molecular structural level) are different from the θk entropy values use to quantify the information about the structure of the drug (molecular structural level). Actually, we do not use a classic Shannon entropy (H) but a first-order Markov-Shannon entropy [25].
After a first inspection, we can observe that the degree of a node (δ) in the network has average values of δ = 4.8 ≈ 5 for all nodes, δ = 4.8 ≈ 5 for drugs, and δ = 4.3 ≈ 4 for assays. It means that, on average, each drug interacts with five targets and we can measure this interaction with approximately four assays. It is easy to realize that the higher δ for targets may be determined in part by their position in the network. For each link of drug or assay node, we have two interactions for the target di ≥ tj and tj ≥ aq). As a result, we can decompose the δ into δ = δin + δout = node degree = in-degree + out-degree [63].
For this reason, we carried out all calculations eliminating the direction of arcs. In so doing, we considered them as symmetric links to avoid this "over-booking" of target nodes. Consequently, the average is δ = 6.1 ≈ 6 for targets, a value still higher, but closer to 5 than to 8-10, the double is expected. In a second stage, we use our model to reconstructing/predicting the same network, based on the probability p(mj) outputs of the model. Two nodes are connected when the probability predicted by the model is p(cq) > 0.5, it means that p(di, tj), or p(di, aq), or p(tj, aq) are >0.5, for different pairs of links. We can perceive that the values of the drug-target network predicted by the model are very similar to those of the observed network. Consequently, we can conclude that the model is efficient not only in the overall prediction of links in the network (high Ac, Sp, and Sn, see Table 1) but in the reconstruction of topological patterns. For instance, from information theory we can deduce that the uncertainty of links is similar in both networks because Shannon entropy calculated for all links is Shobs = 0.005 − 0.007 ≈ Shpred = 0.004 − 0.006. In Figure 1, we represented the Observed (A) vs. Predicted (B) complex networks.

Synthesis and Experimental Assay of New 1,2-Rasagiline Derivatives
The compounds 2, 3, 4, 5, 6, 7, 8, and 9 were synthesized according to the strategy given in Figure 2. As shown in this scheme, they were synthesized from the aminoalcohol 1 [(1R,2S)-(+)-1-amino-2-indanol], a commercial product. The alkylation of 1 with propargyl bromide and potassium carbonate in hot acetonitrile provided, in a global yield of 92%, a mixture of the corresponding mono-and dipropargylated derivatives (2 and 3), which were separated by flash column chromatography using hexane/EtOAc (3:1) as eluent. Compound 3 was converted to the corresponding acetate (4) and benzoate (5) by treatment with acetic anhydride or benzoyl chloride, Et3N and catalytic amounts of 4-dimethylaminopyridine (DMAP) in MeCN. The carbamate derivatives (6, 7, 8, and 9) were synthesized, from the hidroxy mono-or dipropargylaminoindans (2 and 3), by reaction with the corresponding dialkylcarbamyl chloride in NaH and acetonitrile following the procedure described in the literature [64]. The new compounds synthesized in this work (2, 3, 4, 5, 6, 7, 8, and 9) were subjected to an initial study to determinate its neuroprotective ability in both the presence and the absence of neurotoxic agents (ANA). The method of reduction of the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) was used to ascertain the cell viability, given by the number of cells present in the culture. The ability of cells to reduce MTT is an indicator of the integrity of mitochondria, and its functional activity is interpreted as a measure of cell viability [65]. Three assays were conducted in a culture of motor cortex neurons of 19-day-old Sprague-Dawley rat embryos. All results are expressed as the mean ± S.E.M. [51,52] of at least three independent experiments (Table 5).   Firstly, we studied the ability to induce a neuroprotective effect in the absence of any neurotoxic stimulation. Secondly, we studied the neuroprotective effect in the presence of glutamate, a compound that causes a pathological process, in which neurons are damaged leading to apoptosis when its receptors, such as the NMDA and AMPA, are over-activated. Lastly, the ability of the compounds synthesized to protect from damage by H2O2, that causes neuronal death by oxidative stress, was analyzed. The results obtained allow to deduce the existence of a moderate neuroprotective effect in the absence of any toxic stimulus, presenting the best results type 6 and 9 carbamate derivatives, with values of 11.5% and 8.4%, respectively, followed by the compound 3, 4, and 7 with values slightly above 4% (see Figure 3).

Using ALMA-Entropy Model to Predicting New Drugs in Other Assays
We used the ALMA-entropy model to predicting the more probable results for all the new rasagiline derivatives synthesized in this work, in >500 assays not carried out experimentally. When the molecular descriptors (entropy indices) of the new rasagiline derivatives were introduced in our model, we obtained the probable interaction with different targets. The model predicts that most of them could interact with the subunits A and B of the 5-hidroxy-tryptamine type 3 receptors (5-HT3Rs), see Table 6. These results seem to be consistent with the literature, since the antagonists of 5-HT3Rs have been related to neuroprotective properties in vitro and in vivo [66]. In fact, this could be a potential mechanism of neuroprotection added to several described mechanisms for rasagiline derivatives [67,68]. Rasagiline is also known for promoting serotoninergic activity by other ways, which is a clinically relevant fact in certain circumstances [69]. All in one highlights the intricate relationships of these drugs with the 5-hidroxy-tryptamine (serotonine) system. In any case, we need to analyze these results with caution. In our previous works [51,52], we predicted with new models and confirmed experimentally that some rasagiline derivatives (similar to the derivatives studied in this work) presented activity over glutamate receptors (GluRs) pathway. In the first of these works [51], we study experimental measures of neuroprotective capacity of new 1,3-rasagiline derivatives. All the compounds, except one of them, had a high protective activity against damage mediated by H2O2. The best one of all, a monopropargyl trans derivative, showed also a high neuroprotective action in all three type of assays. Our first model predicted for this compound high probability of activity in relationship with acetylcholine and GABA, in addition to GluRs. In coincidence, acetylcholine receptors (AChRs) have been associated with neuroprotective proprieties in several recent experimental works, and there are also reports of association of GABA and GluRs with neuroprotective ability [70,71]. Nuritova et al. [72], discussed a neuroprotective strategy involving retrograde release of glutamate.
In our second work [52], we studied two types of substituent groups (propargyl groups attached to the nitrogen and a carbamate or esther group instead of hydroxyl). The compounds also presented two different chirality patterns but with 1,3 substitutions pattern. The compounds of this second series were active experimentally in the absence and presence of neurotoxic agents. The best compound of this second series, a dipropargyl derivative, was predicted to have brain nitric oxide synthase (bNOS) as the most probable target and certain probability of multi-target ligand. Again, bNOS was associated experimentally with neuroprotective action in several works [73,74].
The compounds studied this third work present similar substituent groups and stereochemistry but one 1,2 substitution pattern. Based on the previous results, we should expect a similar experimental activity and predictions. However, in the previous section we shown experimentally that the present set of compounds seems not to be very active over GluRs and the model predicts the higher scores of activity over 5-HT3Rs instead of the expected receptors. As we stated in the previous paragraph, 5-HT3Rs have been related to neuroprotective properties in vitro and in vivo [66]. A plausible hypothesis (pendent of further experimental confirmation) is the variation in receptor affinity (from GluRs to 5-HT3Rs pathway) due to the change from 1,3 to 1,2 substitution pattern. From our point of view, these correspondences between targets that our equations predict, and the references cited from the literature could indicate biological plausibility of our models.

ALMA-Entropy Models
ALMA models may be classified as a general type of model to assessing the links in different systems. They are adaptable to all molecular descriptors and/or graphs invariants or descriptors for complex networks. In general, we refer to a descriptor D i k of type k th of the i th system (compound or drug di in this case) represented by a matrix M. In fact, in this work we are going to compare the model based on entropy values θ i k of a Markov matrix 1 Π with other ALMA models based on other invariants of the same matrix 1 Π, or invariants of the bond adjacency matrix 1 B. Consequently, we describe first the general equations of the model using a generic descriptor, or graph theoretical invariant D i k, and later we give the specific equation for the entropy model based on θ i k values. The aim of this model is to link the scores Sij(cq) with the molecular descriptors D i k of a given compound di and the Box-Jenkins MA operators written in the form of deviation terms ΔD i k(cq) = D i k -<D i k(cq)>. The model has the following general form: The output dependent variable is Sij(cq) = Sij(cl, c2, c3, c4, c5) = Sij(cl, aq, ot, tj, sx). The variable Sij(cq) is a numerical score of the biological activity of the i th drug (di) vs. the j th target measured in one assay carried out under the set of q th conditions cq. Our hypothesis is H0: we can calculate the output Si(cq) as a linear combination of scores. We have two types of scores. The first type are the scores 'Si k = 'ak·p(cl)· i Dk that account for the quality of data p(cl) and for contributions of the k th molecular descriptors to the final activity score Sij(cq). In fact, we used the probability p(c1) = 1.0; 0.75; or 0.5 for data curated in CHEMBL database at levels of expert, intermediate, or auto-curation level, respectively. The second type are scores ''Sij k (cq>1) = ''ak·ΔD i k(cj) for the contributions of deviations ΔD i k(cq) = (D i k − <D i k(cq)>) of the descriptors of di from the average of those of active molecules Lij(cq) = 1 for different cq. In general, cj refers to different Multi-output assay conditions, e.g., targets, assays, cellular lines, organisms, organs, etc. In this sense, c0 = is the accuracy of the data for this assay, c1 = au is the assay per se, c2 = ot is the organism that express the target, c3 = tj is the j th cellular or molecular target, and c5 = sx is standard experimental measure of activity. Then, the parameter D i k and ΔD i k(cq) are the input independent variables and Lij(cq) = 1 is the input dependent variable. Here, <D i k(cq)> is the average of the k th descriptors D i k of all i th compounds considered as active (Lij(cq) = 1) in an assay carry out under the set of conditions cq. The parameters ΔD i k(cq) are similar to the MA used in time series analysis for Bob-Jenkins ARIMA models and others [42]. This type of MA model has been used before to solve different problems in Cheminformatics before. It means that, firstly, we sum the values of D i k for all the nj drugs with Lij(cq) = 1 in the assay carry out in the conditions cj. Next, we divide this sum by the number of compounds nj with this condition.
In this model, we used only one molecular descriptor θ i 5. This is the Shannon entropy of order k = 5 calculated with MI. We do not use low-order entropies k = 0, 1, 2, 3, and 4. Accordingly, the general equation is: This type of moving average or deviation-like models was coined by us as the ALMA models, and has been used before to solve different problems [54,[75][76][77]. In order to seek the model we used the technique Linear Discriminant Analysis (LDA) implemented in the software package STASTICA 6.0 [78]. The statistical parameters used to corroborate the model were: Number of cases in training (N), and overall values of Sp, Sn, and Ac [54].

CHEMBL Dataset
We downloaded from the public database CHEMBL a general data set composed of >8000 Multi-output assay endpoints (results of multiple assays) [33,34]. We assigned a value of the observed (obs) class variable Lij(cq)obs = 1 (active compound) or Lij(cq)obs = 0 (non-active compounds) to every i th drug biologically assayed in different conditions cj. The dataset used to train and validate the model includes N = 3548 statistical cases, formed by Nd = 3091 unique drugs which have been assayed each one in at least one out of 37 possible standard type measures determined in, at least, one out of 493 assays. Each assay involves, in turn, at least one out of 169 molecular or cellular targets expressed in the tissues of at least one out of 11 different organisms (including human).

Synthesis of 1,2-Rasagiline Derivatives
Melting points are uncorrected and were determined in Reichert Kofler Thermopan (Reichert, Vienna, Austria) or in capillary tubes on a Büchi 510 apparatus (BÜCHI Labortechnik AG, Flawil, Switzerland). Infrared spectra were recorded on a JASCO FT/IR-4100 spectrophotometer (JASCO Analytical Instruments, Easton, PA, USA). The 1 H-NMR spectra (300 MHz) and 13 C-NMR spectra (75 MHz) were recorded in a Bruker AMX spectrometer (Bruker BioSpin Corporation, Fremont, CA, USA), using TMS as internal reference (chemical shifts in δ values, J. in Hz). EI Mass spectra were recorded on a HEWLETT-PACKARD 5988A spectrometer (Hewlett-Packard Company, Palo Alto, CA, USA). FABMS were obtained using MICROMASS AUTOSPEC mass spectrometer (WATERS, Milford, MA, USA) and ESIMS were determined on a BRUKER AMAZON ETD spectrometer (Bruker BioSpin Corporation). We performed microanalyses in a Perkin-Elmer 240B elemental analyzer (PerkinElmer, Waltham, MA, USA) by the Microanalysis Service of the University of Santiago de Compostela. The specific rotation was measured with a PERKIN-ELMER 241 polarimeter

Culture of Rat Cortical Neurons
Embryos were selected from 19 to 20 days pregnant rats by caesarean section. Meninges were removed and cortex was isolated after the dissection of the brain. The fragments obtained from several embryos were subjected to mechanic digestion. We re-suspended the cells in a Neurobasal medium with 2% B-27. We seeded in 48-well plates at a density of 100,000 cells/mL. Neuronal cultures were allowed to grow for 8-10 days. Incubations with different CSF were done when the microscope showed the existence of a dense neuronal network. Embryos were selected from 19 to 20 days pregnant rats, which were decapitated and embryos were extracted from the womb by caesarean section. Meninges were removed and a portion of motor cortex was isolated after the dissection of the brain. Fragments obtained from several embryos were subjected to mechanic digestion and cells were re-suspended in Neurobasal medium with 2% B-27 and seeded in 48-well plates at a density of 100,000 cells/mL. Neuronal cultures were allowed to grow for 8-10 days and when the microscope showed the existence of a dense neuronal network, incubations with different CSF were done [79].

Measurement of Neuronal Viability
We used the MTT reduction assay following the procedure previously described [65]. After the appropriate incubations with the compounds alone, or co-incubated with 100 µM H2O2 or glutamate, 0.5 mg/mL MTT were added to each well and incubation was performed at 37 °C for 2 h. Formazan salt formed was dissolved in DMSO, and colorimetric determination were performed at 540 nm. Control cells without compounds or toxic stimulus were considered 100% viability. Neuronal viability after exposure to compounds or different treatments was expressed as% of control within each individual experiment. Graph Pad Prism Software (GraphPad Software, San Diego, CA, USA) was used to perform statistical analyses and graphical presentation. Experiments were reproduced at least three times. Data were expressed as mean ± S.E.M. values. Groups were compared by ANOVA/Dunnett's test. A p-value ≤0.05 was accepted as the limit of statistical significance.

Conclusions
We can use Shannon entropy measures to developing predictive models for multi-target networks of neuroprotective/neurotoxic compounds. In doing so, we can use Box-Jenkins operators of molecular descriptors to obtain multi-target, multi-scale, and multi-output models able to predict different outcomes for multiple combinations of output experimental measures, experimental protocols, organisms, and molecular and cellular targets. One of these models has been demonstrated here to be useful as a complementary tool in the organic synthesis and evaluation of the multi-target biological activity of new compounds with potential neuroprotective activity. The model is also a very useful tool to predict complex networks of drug-target interactions with possible applications to the study of non-linear effects in the biological activity of neuroprotective drugs.