Structure-Activity Relationships Based on 3D-QSAR CoMFA/CoMSIA and Design of Aryloxypropanol-Amine Agonists with Selectivity for the Human β3-Adrenergic Receptor and Anti-Obesity and Anti-Diabetic Profiles

The wide tissue distribution of the adrenergic β3 receptor makes it a potential target for the treatment of multiple pathologies such as diabetes, obesity, depression, overactive bladder (OAB), and cancer. Currently, there is only one drug on the market, mirabegron, approved for the treatment of OAB. In the present study, we have carried out an extensive structure-activity relationship analysis of a series of 41 aryloxypropanolamine compounds based on three-dimensional quantitative structure-activity relationship (3D-QSAR) techniques. This is the first combined comparative molecular field analysis (CoMFA) and comparative molecular similarity index analysis (CoMSIA) study in a series of selective aryloxypropanolamines displaying anti-diabetes and anti-obesity pharmacological profiles. The best CoMFA and CoMSIA models presented values of r2ncv = 0.993 and 0.984 and values of r2test = 0.865 and 0.918, respectively. The results obtained were subjected to extensive external validation (q2, r2, r2m, etc.) and a final series of compounds was designed and their biological activity was predicted (best pEC50 = 8.561).


Introduction
The β3 adrenergic receptor (β3-AR) is a transmembrane protein that belongs to the superfamily of G protein-coupled receptors [1,2].There are three subtypes of β adrenergic receptors.The β1 adrenergic receptor is mainly located in the cardiovascular system, where it is the target of selective blockers such as atenolol or bisoprolol, which are used for the treatment of hypertension [3].The β2 adrenergic receptor is mainly located in smooth muscles, where their activation by agonists such as salbutamol or salmeterol enables asthma treatment [4].On the other hand, the β3-AR is widely distributed in the human body.It is present in the brain [5], the cardiovascular system [6], colon, bladder, and adipose tissue [7].Therefore, it could be a therapeutic target for the treatment of diseases such as depression [8], hypertension, heart failure [9], overactive bladder (OAB) syndrome [10], colon cancer [11], metabolic syndrome, and obesity [12].
Until now, the pharmacophore for the design and synthesis of new β3-AR ligands has been the ethanolamine chain.Most of the compounds are of the phenylethanolamine or aryloxypropanol-amine type.To achieve β3 adrenergic selectivity, the insertion of bulky groups on the right-hand side (RHS) of the molecule is favorable (Figure 1).However, since the approval of mirabegron in 2012, few selective agonist compounds for the β3-AR receptor have been reported [13][14][15].Selective β3 adrenergic agonists include CL 316,243 [16], amibegron (SR58611A) [17,18], mirabegron (YM-178) [10], and vibegron (RVT-901) [19] (Figure 1).CL 316,243 has anti-obesity and anti-diabetic profiles [20].Amibegron presents antidepressant effects in animal models [21].Mirabegron is the only selective β3 drug currently approved by the U.S. Food and Drug Administration for the treatment of OAB syndrome [22], however, there have been reports of upper airway angioedema following the administration of mirabegron [23].On the other hand, recent studies have shown that mirabegron raises blood pressure and prolongs the QTc interval in electrocardiograms [24].This information calls into question the continuity of mirabegron in the market.In this scenario, Merck Laboratories reported in 2016 the discovery of vibegron (Figure 1), a new potent and selective β3-AR agonist, which is currently under development in human clinical trials for the treatment of OAB [19].

Introduction
The β3 adrenergic receptor (β3-AR) is a transmembrane protein that belongs to the superfamily of G protein-coupled receptors [1,2].There are three subtypes of β adrenergic receptors.The β1 adrenergic receptor is mainly located in the cardiovascular system, where it is the target of selective blockers such as atenolol or bisoprolol, which are used for the treatment of hypertension [3].The β2 adrenergic receptor is mainly located in smooth muscles, where their activation by agonists such as salbutamol or salmeterol enables asthma treatment [4].On the other hand, the β3-AR is widely distributed in the human body.It is present in the brain [5], the cardiovascular system [6], colon, bladder, and adipose tissue [7].Therefore, it could be a therapeutic target for the treatment of diseases such as depression [8], hypertension, heart failure [9], overactive bladder (OAB) syndrome [10], colon cancer [11], metabolic syndrome, and obesity [12].
Until now, the pharmacophore for the design and synthesis of new β3-AR ligands has been the ethanolamine chain.Most of the compounds are of the phenylethanolamine or aryloxypropanolamine type.To achieve β3 adrenergic selectivity, the insertion of bulky groups on the right-hand side (RHS) of the molecule is favorable (Figure 1).However, since the approval of mirabegron in 2012, few selective agonist compounds for the β3-AR receptor have been reported [13][14][15].Selective β3 adrenergic agonists include CL 316,243 [16], amibegron (SR58611A) [17,18], mirabegron (YM-178) [10], and vibegron (RVT-901) [19] (Figure 1).CL 316,243 has anti-obesity and anti-diabetic profiles [20].Amibegron presents antidepressant effects in animal models [21].Mirabegron is the only selective β3 drug currently approved by the U.S. Food and Drug Administration for the treatment of OAB syndrome [22], however, there have been reports of upper airway angioedema following the administration of mirabegron [23].On the other hand, recent studies have shown that mirabegron raises blood pressure and prolongs the QTc interval in electrocardiograms [24].This information calls into question the continuity of mirabegron in the market.In this scenario, Merck Laboratories reported in 2016 the discovery of vibegron (Figure 1), a new potent and selective β3-AR agonist, which is currently under development in human clinical trials for the treatment of OAB [19].From the works of Cramer and Klebe [25,26], comparative molecular field analysis (CoMFA) and comparative molecular similarity index analysis (CoMSIA) are considered useful methodologies to understand the pharmacological properties of a series of compounds.Contour maps generated from CoMFA and CoMSIA show regions of the molecular structure where modifications in the steric, electrostatic, hydrophobic, and H-bond properties generate a favorable or unfavorable change in biological activity.Therefore, the contour maps obtained help to: (a) understand the nature of ligand-receptor interactions; (b) predict biological activity; and (c) aid in the rational design of new compounds.
In the last 10 years, there have been only two reports of quantitative structure-activity relationship (QSAR) studies on selective compounds for the β3-AR [27,28], one of which was conducted by our research group [28].In both cases, the studies were carried out on phenylethanolamine-type compounds.Since then, there have been no reports of QSAR studies on aryloxypropanolamines.In the present work, we present a three-dimensional (3D)-QSAR study of a series of potent and selective human β3-AR agonists [29,30].The reported compounds showed an interesting profile as potential drugs for the treatment of obesity and noninsulin-dependent (type II) diabetes.The compounds have a wide structural variability on both the RHS and left-hand side (LHS) of the general aryloxypropanolamine structure (Figure 1).The information obtained from the CoMFA and CoMSIA contour maps was systematized in a useful structural-activity relationship diagram.With this information, we finally reported the design of new compounds with promising β3 adrenergic activity.

Statistical Results
A summary of the statistical results for CoMFA and CoMSIA are presented in Table 1.Details of all possible combinations are given in Table S1 (Supplementary Material).The best models were searched through successive field combinations.The first parameter to evaluate the statistical robustness of a QSAR model is the value of q 2 , which must be greater than 0.5.q 2 is an indicator of the internal predictive capacity of a QSAR model.For CoMFA, the model that considered both field contributions (CoMFA-SE) presented the highest value (0.537), while with CoMSIA, several models presented a significant q 2 value.The CoMSIA steric + electrostatic + hydrophobic + acceptor (CoMSIA-SEHA) and CoMSIA steric + electrostatic + acceptor (CoMSIA-SEA) models generated similar values for this coefficient (0.674 and 0.651, respectively).However, the value of r 2 , which evaluates the external predictive capacity of the model, allowed for discrimination between the models.In this case, the CoMSIA model that considered all the field contributions (CoMSIA-All) presented the highest value of r 2 (0.918).The best models also had a low SEE and a high r 2 .The optimal number of components (N) is also low in all models presented (N = 6 for the best CoMFA and CoMSIA model).Ideally, a good model should have as few components as possible (N should be less than one-third of the total number of compounds studied), which ensures that the predictions will be based on meaningful information from field contributions, rather than on overtraining of the model.There is also a balance in the percentages of field contribution (approximately 50% for each field in CoMFA-SE and approximately 20% for each field in CoMSIA-All), which supports the reliability of the conclusions obtained from each contour map.
Table 2 presents a summary of the external validation of the CoMFA-SE and CoMSIA-All models (hereafter they are referred to as "CoMFA" and "CoMSIA" models).Both models have a high value for r 2 (0.865 and 0.918, respectively), which is an indication of an adequate external predictive capacity.However, according to Golbraikh and Tropsha [31,32], high values of q 2 and r 2 (conditions 1 and 2) are necessary but not sufficient conditions for the validation of a model.For a QSAR model to have a reliable predictive capability, the line for experimental versus predicted activity should be as close as possible to the line y = x.This is observed in the fulfillment of conditions 3a or 3b, 4a or 4b, 5a or 5b, and 6 listed in Table 2. Finally, condition 7, known as r 2 m metrics, is a quantitative measure to determine the proximity between the observed and the predicted activity for the test set.The CoMFA and CoMSIA models reported here fulfilled all the conditions for internal and external validation and, in general, the CoMSIA model displays better statistical parameters than CoMFA.1; r 2 0 and k are the correlation coefficient between the experimental (x) versus predicted activities (y) for test set through the origin and the respective slope of regression; and r 2 0 and k are the correlation coefficient between the predicted (y) versus experimental activities (x) for test set through the origin and the respective slope of regression.r 2 m was defined in Equation (6).
The values of experimental activity, predicted activity, and residual values for the best CoMFA and CoMSIA models are shown in Table 3.All the compounds showed low residual values and deviations in the predicted activity over a logarithmic unit were not observed.Figure 2A,B show the graphs of experimental versus predicted activity, from which it can be seen that the data distribution is close to the y = x line.Both models show a good balance in terms of predictive capacity.The CoMFA model presents 21 compounds with negative residual values and 20 with positive deviations (Figure 2C), while the CoMSIA model presents 19 compounds with negative residual values and 22 with positive deviations (Figure 2D).The residual ranges were −0.44 to 0.45 for CoMFA and −0.53 to 0.37 for CoMSIA.As shown in Figure 2E,F, the CoMFA and CoMSIA models show a satisfactory predictive capability throughout the whole set of data (training and test set) as well as a good predictive power for both, less active (8, 20, and 21) and most active compounds (16, 32, and 33).Furthermore, to assess the robustness of the model, the Y-randomization test [33] was applied (see Table S2 of the Supplementary Material for randomizations).The dependent variable (biological activity) was randomly shuffled and a new QSAR model was developed using the original independent variable matrix.If after multiple randomizations the new values of q 2 and r 2 ncv are negative or below the limit of acceptability (q 2 < 0.5, r 2 ncv < 0.6), then it is corroborated that the results obtained in the formulation of the final models are not by chance.In our case, the new QSAR models (after several repetitions) have low q 2 and r 2 ncv values (Table 4).

CoMFA
In summary, the best CoMFA and CoMSIA models were selected based on their statistical robustness and good validated external predictability.In the case of CoMFA, both potentials contribute equally to biological activity (41.2% for the steric field and 58.8% for the electrostatic field).In the case of CoMSIA there is a homogeneous contribution of each field to the activity, however, the electrostatic field presents a slightly higher contribution (27.9%), so its contribution to biological activity is more important.

CoMFA
In the electrostatic contour map (Figure 3A), we can see a blue polyhedron around the methylene connector that connects the amide with imidazole ring.This suggests that the replacement of this connector by electropositive groups would be favorable for activity.For example, groups such as CONH, CO(NH) 2 , or a protonated amino group (projecting the proton towards the blue polyhedron) could be evaluated.This could explain why compounds 32 and 33 are among the most active since they project the polar proton of the urea function towards the blue zone, while compounds 8 and 20 present low activities due to the absence of said function.On the other hand, as seen in Figure 3B, the most inactive compound of the series (compound 21) intersects the blue polyhedron.Therefore, one way to improve the activity of this type of derivative would be to increase the electronic deficiency of the benzene ring.This could be achieved by inserting electronic attractor groups into the ring or by inserting groups such as OH or NH that project the hydrogen atom to the blue region.
In the steric contour map around the most active compound 16 (Figure 3C), a green polyhedron is seen around position 5 of the imidazole ring.Therefore, the insertion of bulky substituents in that position would be favorable.This could explain the high activities reported for compounds 12 and 32, which project a benzyl and nitro group, respectively, to the green region.In Table 5, the proposals 1x to 5x were based on this observation.These compounds contain OH, NH, F, and acetyl groups in that position.The use of more voluminous halogens did not generate better predictions (e.g., Cl, Br, or I).On the other hand, in the less active compounds 7, 20, and 21 (Figure 3D), the benzene ring intersects the yellow region, which could explain the low activity of these compounds.
Molecules 2018, 23, x FOR PEER REVIEW 7 of 20 In summary, the best CoMFA and CoMSIA models were selected based on their statistical robustness and good validated external predictability.In the case of CoMFA, both potentials contribute equally to biological activity (41.2% for the steric field and 58.8% for the electrostatic field).In the case of CoMSIA there is a homogeneous contribution of each field to the activity, however, the electrostatic field presents a slightly higher contribution (27.9%), so its contribution to biological activity is more important.

CoMFA
In the electrostatic contour map (Figure 3A), we can see a blue polyhedron around the methylene connector that connects the amide with imidazole ring.This suggests that the replacement of this connector by electropositive groups would be favorable for activity.For example, groups such as CONH, CO(NH)2, or a protonated amino group (projecting the proton towards the blue polyhedron) could be evaluated.This could explain why compounds 32 and 33 are among the most active since they project the polar proton of the urea function towards the blue zone, while compounds 8 and 20 present low activities due to the absence of said function.On the other hand, as seen in Figure 3B, the most inactive compound of the series (compound 21) intersects the blue polyhedron.Therefore, one way to improve the activity of this type of derivative would be to increase the electronic deficiency of the benzene ring.This could be achieved by inserting electronic attractor groups into the ring or by inserting groups such as OH or NH that project the hydrogen atom to the blue region.
In the steric contour map around the most active compound 16 (Figure 3C), a green polyhedron is seen around position 5 of the imidazole ring.Therefore, the insertion of bulky substituents in that position would be favorable.This could explain the high activities reported for compounds 12 and 32, which project a benzyl and nitro group, respectively, to the green region.In Table 5, the proposals 1x to 5x were based on this observation.These compounds contain OH, NH, F, and acetyl groups in that position.The use of more voluminous halogens did not generate better predictions (e.g., Cl, Br, or I).On the other hand, in the less active compounds 7, 20, and 21 (Figure 3D), the benzene ring intersects the yellow region, which could explain the low activity of these compounds.

CoMSIA
The CoMSIA electrostatic contour map (Figure 4A,B) is similar to that obtained for CoMFA (Figure 3A,B).A blue polyhedron is visible near the methylene linker (Figure 4A).The concordance of this information in both models led us to propose the insertion of a urea group, after which we

CoMSIA
The CoMSIA electrostatic contour map (Figure 4A,B) is similar to that obtained for CoMFA (Figure 3A,B).A blue polyhedron is visible near the methylene linker (Figure 4A).The concordance of this information in both models led us to propose the insertion of a urea group, after which we obtained derivatives with high predicted activity (Table 5).As in CoMFA, the less active compound 21 intersects the blue polyhedron in the benzene region, therefore the insertion of electropositive functions before the ring would be most favorable for activity.obtained derivatives with high predicted activity (Table 5).As in CoMFA, the less active compound 21 intersects the blue polyhedron in the benzene region, therefore the insertion of electropositive functions before the ring would be most favorable for activity.Like the CoMFA map (Figure 3C), the steric contour map shows a green region intersecting position 5 of the imidazole ring of compound 16 (Figure 4C).However, on the CoMSIA map, a yellow region surrounds the green region, therefore the increase in volume should be explored with caution.In fact, in the proposal for new structures, it was found that the insertion of large groups in position 5, such as Br and Cl, generated a reduction in biological activity, however, the insertion of mediumvolume groups such as OH, NH2, and F improved activity considerably.This also suggests that the increase in molar refractivity is not favorable for activity.
The hydrophobic contour map (Figure 4E,F) shows two yellow polyhedrons, one near the carbonyl oxygen and the other near the benzene ring.This indicates that the presence of lipophilic groups in these regions would be favorable for activity.The high activity reported for compounds 32, 33, 35, and 39 could be explained by this fact since they position the sulfur atom of the sulfonylurea linker towards the smaller yellow polyhedron.In addition, in those same derivatives, the proton of the NH group at the right of the connector is directly positioned towards a grey polyhedron, which suggests that the presence of hydrophilic groups in that area is favorable.On the other hand, around the most active compound 16, there is a second grey polyhedron intersecting the imidazole ring (Figure 4E), which implies that this ring could be replaced by other hydrophilic isostere rings, but not by systems such as benzene or thiophene.Finally, the second yellow polyhedron intersects the benzene ring of the most active compound 16, but not the benzene ring of the least active compounds 19-23 (Figure 4F), which may in part explain the lower activity observed for these derivatives.
A large purple polyhedron surrounding the imidazole ring and the ortho position of benzene around the most active compound 16 (Figure 4G) is shown on the H-bond donor map (Figure 4G,H), suggesting that the presence of H-bond donor groups in these positions is not favorable for activity.This may explain the low activity of compound 8 (which is among the series of compounds 5-18) because it directly positions the NH group of the imidazole ring to the upper purple region.On the other hand, a smaller cyan polyhedron in the lower area suggests that the selective insertion of Hbond donor groups in the methylene connector area of imidazole would be beneficial.This is corroborated by the fact that the most active compounds 32, 33, and 35 position the NH group of the sulfonylurea towards this polyhedron.Finally, a small purple polyhedron on the LHS of the molecule suggests that the presence of NH groups of the dihydrobenzimidazolone and indole ring systems would not be beneficial for activity.
The H-bond acceptor contour map (Figure 4I,J) shows two magenta polyhedra close to position 3 and 5 of the imidazole ring, which means that incorporation of H-bond acceptor atoms in these positions is favorable.In fact, compounds 32 and 33 position the oxygen atoms from nitro groups to the magenta region.Other groups that could be inserted are F, OH, and pyridine rings.Like the CoMFA map (Figure 3C), the steric contour map shows a green region intersecting position 5 of the imidazole ring of compound 16 (Figure 4C).However, on the CoMSIA map, a yellow region surrounds the green region, therefore the increase in volume should be explored with caution.In fact, in the proposal for new structures, it was found that the insertion of large groups in position 5, such as Br and Cl, generated a reduction in biological activity, however, the insertion of mediumvolume groups such as OH, NH2, and F improved activity considerably.This also suggests that the increase in molar refractivity is not favorable for activity.
The hydrophobic contour map (Figure 4E,F) shows two yellow polyhedrons, one near the carbonyl oxygen and the other near the benzene ring.This indicates that the presence of lipophilic groups in these regions would be favorable for activity.The high activity reported for compounds 32, 33, 35, and 39 could be explained by this fact since they position the sulfur atom of the sulfonylurea linker towards the smaller yellow polyhedron.In addition, in those same derivatives, the proton of the NH group at the right of the connector is directly positioned towards a grey polyhedron, which suggests that the presence of hydrophilic groups in that area is favorable.On the other hand, around the most active compound 16, there is a second grey polyhedron intersecting the imidazole ring (Figure 4E), which implies that this ring could be replaced by other hydrophilic isostere rings, but not by systems such as benzene or thiophene.Finally, the second yellow polyhedron intersects the benzene ring of the most active compound 16, but not the benzene ring of the least active compounds 19-23 (Figure 4F), which may in part explain the lower activity observed for these derivatives.
A large purple polyhedron surrounding the imidazole ring and the ortho position of benzene around the most active compound 16 (Figure 4G) is shown on the H-bond donor map (Figure 4G,H), suggesting that the presence of H-bond donor groups in these positions is not favorable for activity.This may explain the low activity of compound 8 (which is among the series of compounds 5-18) because it directly positions the NH group of the imidazole ring to the upper purple region.On the other hand, a smaller cyan polyhedron in the lower area suggests that the selective insertion of Hbond donor groups in the methylene connector area of imidazole would be beneficial.This is corroborated by the fact that the most active compounds 32, 33, and 35 position the NH group of the sulfonylurea towards this polyhedron.Finally, a small purple polyhedron on the LHS of the molecule suggests that the presence of NH groups of the dihydrobenzimidazolone and indole ring systems would not be beneficial for activity.
The H-bond acceptor contour map (Figure 4I,J) shows two magenta polyhedra close to position 3 and 5 of the imidazole ring, which means that incorporation of H-bond acceptor atoms in these positions is favorable.In fact, compounds 32 and 33 position the oxygen atoms from nitro groups to the magenta region.Other groups that could be inserted are F, OH, and pyridine rings.

8.000
Like the CoMFA map (Figure 3C), the steric contour map shows a green region intersecting position 5 of the imidazole ring of compound 16 (Figure 4C).However, on the CoMSIA map, a yellow region surrounds the green region, therefore the increase in volume should be explored with caution.In fact, in the proposal for new structures, it was found that the insertion of large groups in position 5, such as Br and Cl, generated a reduction in biological activity, however, the insertion of medium-volume groups such as OH, NH 2 , and F improved activity considerably.This also suggests that the increase in molar refractivity is not favorable for activity.
The hydrophobic contour map (Figure 4E,F) shows two yellow polyhedrons, one near the carbonyl oxygen and the other near the benzene ring.This indicates that the presence of lipophilic groups in these regions would be favorable for activity.The high activity reported for compounds 32, 33, 35, and 39 could be explained by this fact since they position the sulfur atom of the sulfonylurea linker towards the smaller yellow polyhedron.In addition, in those same derivatives, the proton of the NH group at the right of the connector is directly positioned towards a grey polyhedron, which suggests that the presence of hydrophilic groups in that area is favorable.On the other hand, around the most active compound 16, there is a second grey polyhedron intersecting the imidazole ring (Figure 4E), which implies that this ring could be replaced by other hydrophilic isostere rings, but not by systems such as benzene or thiophene.Finally, the second yellow polyhedron intersects the benzene ring of the most active compound 16, but not the benzene ring of the least active compounds 19-23 (Figure 4F), which may in part explain the lower activity observed for these derivatives.
A large purple polyhedron surrounding the imidazole ring and the ortho position of benzene around the most active compound 16 (Figure 4G) is shown on the H-bond donor map (Figure 4G,H), suggesting that the presence of H-bond donor groups in these positions is not favorable for activity.This may explain the low activity of compound 8 (which is among the series of compounds 5-18) because it directly positions the NH group of the imidazole ring to the upper purple region.On the other hand, a smaller cyan polyhedron in the lower area suggests that the selective insertion of H-bond donor groups in the methylene connector area of imidazole would be beneficial.This is corroborated by the fact that the most active compounds 32, 33, and 35 position the NH group of the sulfonylurea towards this polyhedron.Finally, a small purple polyhedron on the LHS of the molecule suggests that the presence of NH groups of the dihydrobenzimidazolone and indole ring systems would not be beneficial for activity.
The H-bond acceptor contour map (Figure 4I,J) shows two magenta polyhedra close to position 3 and 5 of the imidazole ring, which means that incorporation of H-bond acceptor atoms in these positions is favorable.In fact, compounds 32 and 33 position the oxygen atoms from nitro groups to the magenta region.Other groups that could be inserted are F, OH, and pyridine rings.

Outliers
In the CoMFA model, compounds 18 and 23 were outliers.Compound 18 has a pEC 50 = 6.3010 and unlike its analog 16 (the most active compound in the series), it has an alkylation in the N of the amide.Therefore, the spatial conformation of the imidazole ring may be altered.On the other hand, alkylation of the ethyl chain in compound 18 restricts rotation and could fix a different conformation within the target.With respect to the underestimation of activity for compound 23, this may be because CoMFA does not consider the favorable effects of the presence of the urea group.Effects that are considered by the hydrophobic and H-bond acceptor maps of CoMSIA, where compounds 18 and 23 were not outliers.
The outlier compounds in CoMSIA were 40 and 41, for which the model predicts greater activity than the real one.This imprecision may be because, in the case of compound 40, it positions a sulfonamide group towards the magenta polyhedral of the H-bond acceptor map, which is favorable for activity.However, this group falls into the yellow region of the steric map, but the greater contribution of the H-bond acceptor potential to the activity overestimates the predicted activity.In the case of compound 41, the overestimation of the biological activity value may be due to the reduction in the electronic density of the benzene ring, given by the thiourea group, which has the highest percentage of contribution to biological activity.

Applicability Domain
The applicability domain (AD) is a theoretical region in chemical space encompassing both the model descriptors and modeled response, which allows one to estimate the uncertainty in the prediction of a compound based on how similar it is to the training compounds employed in the model development.In this work, we used the method developed by Roy et al. [34] for the determination of AD.This method is based on the basic theory of the standardization approach.
The calculation was carried out using the free application available on the author's page, after which it was obtained that all compounds were within the domain of applicability, except compound 18.This reinforces what was described in the previous section, that alkylation of the urea connector could result in significant changes in receptor binding.For this reason, none of the designed compounds (see next section) included alkyl groups in the urea connector.

Design of Novel Derivatives
Based on the information provided by CoMFA and CoMSIA, we have designed a series of structures of the aryloxypropanolamine type.In Table 5, we present the best derivatives with their predicted pEC 50 value by the best model (CoMSIA, r 2 = 0.918).The first proposed molecule 1x was as active as compound 16 (pEC 50 = 7.208).The other structures (2x-12x) are more active than compound 16.The best candidates are compounds 3x (pEC 50 = 8.561) and 7x (pEC 50 = 8.520).The presence of polar functions like OH, NH 2, and F at position 5 of the imidazole ring yielded good candidates.Other interesting proposals are the replacement of imidazole with a benzimidazole ring (comp.9x-12x), in which the presence of polar functions improves the activity.

Selection of Conformers and Molecular Alignment
CoMFA and CoMSIA studies were performed with Sybyl X-1.2 software (1.2, Tripos International, St. Louis, MS, USA) installed in a Windows 10 environment on a PC with an Intel Core i7 CPU.To acquire the best conformers for each molecule, every compound was drawn in ChemDraw and subjected to a preliminary geometry optimization using MM2 molecular mechanics as is implemented in ChemBio3D software (15.1.0,PerkinElmer, Waltham, MA, USA).Following this, the structure of compound 16 (the most active of the series) was further minimized by quantum mechanics using the DFT B3LYP/6.311+g**method in Gaussian software (09, Gaussian Inc., Wallingford, CT, USA).This structure was used as a template for the alignment.The mol2 structures were imported to Sybyl and MMFF94 charges were assigned to each atom.The minimized structures were superimposed by the atom-by-atom fit method choosing the aryloxypropanolamine nucleus as the common scaffold for alignment (Figure 5).In addition, a minimization was carried out based on the Powell method [35] (as implemented in Sybyl, Figure S1 in the Supplementary Material).However, the statistical results were much lower than those reported by the method used in this manuscript (Table S3 in the Supplementary Material).

CoMFA and CoMSIA Field Calculation
To derive the CoMFA and CoMSIA descriptor fields, the aligned training set molecules were placed in a three-dimensional cubic lattice with a grid spacing of 2 Å in the x, y, and z directions such that the entire set was included in it.The CoMFA steric and electrostatic field energies were calculated using an sp 3 carbon probe atom with a van der Waals radius of 1.52 Å and a charge of +1.0.Cut-off values for both steric and electrostatic fields were set to 30.0 kcal/mol.For CoMSIA analysis, the standard settings (probe with charge +1.0, radius 1 Å, hydrophobicity +1.0, H-bond donating +1.0, and H-bond accepting +1.0 [26]) were used to calculate five different fields: steric, electrostatic, hydrophobic, donor, and acceptor.Gaussian-type distance dependence was used to measure the relative attenuation of the field position of each atom in the lattice and led to a much smoother sampling of the fields around the molecules when compared to CoMFA.The default value of 0.3 was set for attenuation factor α.

Data Set Selection and β3-Adrenergic Activity
CoMFA and CoMSIA studies were performed on a set of 41 phenoxypropanolamine derivatives reported by Astellas Pharma [29,30] (Table 6).The derivatives displayed potent agonistic activity at the β3-AR.Agonistic activity (EC50) was assessed by measuring cAMP accumulation in CHO cells expressing β3-ARs.The EC50 values were converted to pEC50 (−logEC50).Several combinations of training and test sets were evaluated.The compounds were manually and randomly divided into training (29 compounds, 70%) and test sets (12 compounds, 30%), ensuring that both sets contained structurally diverse compounds with high, medium, and low activity, and a uniform distribution to avoid possible problems during external validation.For this purpose, most of the test set compounds were randomly extracted from the range of 6-8 logarithmic units of pEC50, while a smaller number were randomly selected from the range of 4-6 logarithmic units.Several test set groups were evaluated.For the construction of the final models, the test set that generated the highest r 2 value in each case (CoMFA and CoMSIA) was finally selected.The distribution of pEC50 values for the whole set, the training set, and the test set is shown in Figure 6.In all three cases, the biological activity

CoMFA and CoMSIA Field Calculation
To derive the CoMFA and CoMSIA descriptor fields, the aligned training set molecules were placed in a three-dimensional cubic lattice with a grid spacing of 2 Å in the x, y, and z directions such that the entire set was included in it.The CoMFA steric and electrostatic field energies were calculated using an sp 3 carbon probe atom with a van der Waals radius of 1.52 Å and a charge of +1.0.Cut-off values for both steric and electrostatic fields were set to 30.0 kcal/mol.For CoMSIA analysis, the standard settings (probe with charge +1.0, radius 1 Å, hydrophobicity +1.0, H-bond donating +1.0, and H-bond accepting +1.0 [26]) were used to calculate five different fields: steric, electrostatic, hydrophobic, donor, and acceptor.Gaussian-type distance dependence was used to measure the relative attenuation of the field position of each atom in the lattice and led to a much smoother sampling of the fields around the molecules when compared to CoMFA.The default value of 0.3 was set for attenuation factor α.

Data Set Selection and β3-Adrenergic Activity
CoMFA and CoMSIA studies were performed on a set of 41 phenoxypropanolamine derivatives reported by Astellas Pharma [29,30] (Table 6).The derivatives displayed potent agonistic activity at the β3-AR.Agonistic activity (EC 50 ) was assessed by measuring cAMP accumulation in CHO cells expressing β3-ARs.The EC 50 values were converted to pEC 50 (−logEC 50 ).Several combinations of training and test sets were evaluated.The compounds were manually and randomly divided into training (29 compounds, 70%) and test sets (12 compounds, 30%), ensuring that both sets contained structurally diverse compounds with high, medium, and low activity, and a uniform distribution to avoid possible problems during external validation.For this purpose, most of the test set compounds were randomly extracted from the range of 6-8 logarithmic units of pEC 50 , while a smaller number were randomly selected from the range of 4-6 logarithmic units.Several test set groups were evaluated.For the construction of the final models, the test set that generated the highest r 2 value in each case (CoMFA and CoMSIA) was finally selected.The distribution of pEC 50 values for the whole set, the training set, and the test set is shown in Figure 6.In all three cases, the biological activity followed a Gaussian distribution.The range of the biological activities spans 3.5 log units, from 3.89 to 7.37.

Internal Validation and Partial Least Squares (PLS) Analysis
PLS analysis was used to construct a linear correlation between the CoMFA and CoMSIA descriptors (independent variables) and the activity values (dependent variables) [36].To select the best model, the cross-validation analysis was performed using the leave-one-out (LOO) method (and sample distance PLS [SAMPLS]), which generates the square of the cross-validation coefficient (q 2 ) and the optimum number of components (N).The non-cross validation was performed with a column filter value of 2.0 to speed up the analysis and reduce the noise.The q 2 , which is a measure of the internal quality of the models, was obtained according to the following Equation (1): where , , and are observed, mean, and predicted activity in the training set, respectively.

External Validation
The models were subjected to external validation criteria according to the proposed test by

Internal Validation and Partial Least Squares (PLS) Analysis
PLS analysis was used to construct a linear correlation between the CoMFA and CoMSIA descriptors (independent variables) and the activity values (dependent variables) [36].To select the best model, the cross-validation analysis was performed using the leave-one-out (LOO) method (and sample distance PLS [SAMPLS]), which generates the square of the cross-validation coefficient (q 2 ) and the optimum number of components (N).The non-cross validation was performed with a designed.The predicted biological activity for the new derivatives is high and, in general, the presence of polar groups and cycles like the benzimidazole ring on the RHS are predicted to improve activity.This could be due the presence of polar functions may be important for interaction with the Arg315 residue, and aromatic rings may establish pi-stacking interactions with a Phe198 residue as it is reported in literature [38].
Taking into account the information derived from the CoMFA and CoMSIA studies, we have summarized the principal structure-activity relationships for the studies series of compounds in Figure 7.This information will be useful for the design of new compounds with promising therapeutic applications in several pathological disorders such as obesity, diabetes, OAB, depression, and cancer.
aryloxypropanolamines with selective potency for the β3-AR.The CoMFA and CoMSIA models presented good internal (q 2 = 0.537 and 0.669 for CoMFA and CoMSIA, respectively) and external (r 2 = 0.865 and 0.918 for CoMFA and CoMSIA, respectively) validation.The models were further validated following the criteria given by Tropsha and Roy [31,32,37], and were determined to be statistically reliable and robust.In both models, there was an equilibrium among the steric, electrostatic, hydrophobic, H-bond acceptor, and H-bond donor contribution to the activity.With this information, a new series of compounds was designed.The predicted biological activity for the new derivatives is high and, in general, the presence of polar groups and cycles like the benzimidazole ring on the RHS are predicted to improve activity.This could be due the presence of polar functions may be important for interaction with the Arg315 residue, and aromatic rings may establish pistacking interactions with a Phe198 residue as it is reported in literature [38].
Taking into account the information derived from the CoMFA and CoMSIA studies, we have summarized the principal structure-activity relationships for the studies series of compounds in Figure 7.This information will be useful for the design of new compounds with promising therapeutic applications in several pathological disorders such as obesity, diabetes, OAB, depression, and cancer.

Supplementary Materials:
The following are available online, Figure S1: Distill-based alignment of optimized molecules by Powell method, Table S1: q 2 and N values for all field combinations of CoMFA and CoMSIA, Table S2: Randomizations of biological activity for the execution of the Y-random test, Table S3

Figure 1 .
Figure 1.Structures of CL 316,243, amibegron, mirabegron, vibegron and the general structure of the aryloxypropanolamines studied here.

Figure 1 .
Figure 1.Structures of CL 316,243, amibegron, mirabegron, vibegron and the general structure of the aryloxypropanolamines studied here.

Figure 2 .
Figure 2. Plots of experimental versus predicted pEC50 values for the training and test set molecules for CoMFA (A) and CoMSIA (B) models.Residual plots between predicted and experimental values for CoMFA (C) and CoMSIA (D); CoMFA (E) and CoMSIA (F) predictions for every molecule in the complete set.

Figure 2 .
Figure 2. Plots of experimental versus predicted pEC 50 values for the training and test set molecules for CoMFA (A) and CoMSIA (B) models.Residual plots between predicted and experimental values for CoMFA (C) and CoMSIA (D); CoMFA (E) and CoMSIA (F) predictions for every molecule in the complete set.

Table 4 .
q 2 and r 2 ncv values after several Y-randomization tests.

Figure 4 .
Figure 4. CoMSIA electrostatic (A,B); steric (C,D); hydrophobic (E,F); donor (G,H) and acceptor (I,J) contour maps around compounds 16 (left) and 21 (right), the most active and less active of the series respectively.The colors in A-D have the same meaning as in the CoMFA contour maps.Hydrophobic favored areas are in yellow and unfavorable areas in grey (E,F).Donor and acceptor favored areas are in cyan and magenta respectively, and donor and acceptor unfavorable areas are in purple and red, respectively (G-J).

Figure 4 .
Figure 4. CoMSIA electrostatic (A,B); steric (C,D); hydrophobic (E,F); donor (G,H) and acceptor (I,J) contour maps around compounds 16 (left) and 21 (right), the most active and less active of the series respectively.The colors in A-D have the same meaning as in the CoMFA contour maps.Hydrophobic favored areas are in yellow and unfavorable areas in grey (E,F).Donor and acceptor favored areas are in cyan and magenta respectively, and donor and acceptor unfavorable areas are in purple and red, respectively (G-J).

Figure 4 .
Figure 4. CoMSIA electrostatic (A,B); steric (C,D); hydrophobic (E,F); donor (G,H) and acceptor (I,J) contour maps around compounds 16 (left) and 21 (right), the most active and less active of the series respectively.The colors in A-D have the same meaning as in the CoMFA contour maps.Hydrophobic favored areas are in yellow and unfavorable areas in grey (E,F).Donor and acceptor favored areas are in cyan and magenta respectively, and donor and acceptor unfavorable areas are in purple and red, respectively (G-J).

Molecules 2018 , 20 Figure 5 .
Figure 5.The superimposed structures of all compounds used in the CoMFA/CoMSIA models.

Figure 5 .
Figure 5.The superimposed structures of all compounds used in the CoMFA/CoMSIA models.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 6 .
Figure 6.Histogram of frequency distribution data.

Figure 7 .
Figure 7. Main structure-activity relationships found in this study.
: Statistical parameters for CoMFA and CoMSIA based in the Powell minimization method Author Contributions: C.M.-V., D.V.-V., J.A.-L., and J.C.-S.Collected and processed data.J.S.-D.and G.R.-G.performed the DFT calculations.M.L. performed the statistical analysis and constructed the CoMFA and CoMSIA models.J.M. conducted the study and wrote the manuscript.

Table 1 .
Statistical parameters and Field combinations for CoMFA and CoMSIA a .
a q 2 = the square of the leave-one-out (LOO) cross-validation (CV) coefficient; N = the optimum number of components; SEP = standard error of prediction; SEE is the standard error of estimation of non-CV analysis; r 2 ncv is the square of the non CV coefficient; F is the F-test value; r 2 is the predictive r 2 for test set compounds; S, E, H, D and A are the steric, electrostatic, hydrophobic, hydrogen-bond donor, and hydrogen-bond acceptor contributions respectively.

Table 2 .
Summary of external validation parameters for CoMFA and CoMSIA.

Table 3 .
Experimental and predicted pEC 50 and residual values for analyzed compounds according to CoMFA and CoMSIA.

Table 5 .
The proposed structures of new molecules and their predicted pEC50 values using the best model.

Table 5 .
The proposed structures of new molecules and their predicted pEC 50 values using the best model.

Table 5 .
The proposed structures of new molecules and their predicted pEC50 values using the best model.

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC 50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .

Table 6 .
Chemical structures and pEC50 values of the studied β3-adrenergic ligands a .