2.2. In Silico Study Results
The 3D structure of each molecule of the C. edulis stem compound was obtained from the ChemBL database. All of the information on all of the structures was stored in a phase called the C. edulis phase of the Maestro11.2 software (Schrodinger). This information includes compound properties such as AlogP, HBA, HBD, and molecular weight.
To treat vitiligo, the JAK 1/JAK 3 and JAK 1/JAK 2 molecular targets were, respectively, inhibited using JAK chemical inhibitors, tofacitinib and ruxolitinib. In our work, these two complexes were the targets of our review and study.
The inhibitors of the JAK 1/JAK 3 and JAK 1/JAK 2 complexes used in the current research were collected in July 2022. Their molecular information was extracted from the BindingDB database and supplemented to Maestro11.2 using the standard molecular property method. The information from the BindingDB database included 53 inhibitor ligands for the JAK 1/JAK 2 complex and 136 for the JAK 1/JAK 3 complex. In the BindingDB database, the ligand bioactivity of the molecular targets can vary significantly. For example, for the JAK 1/JAK 3 complex, the IC50 value of the molecule with the accession number BDBM50527406 was 2.6 nM. However, the bioactivity of the molecule with the accession number BDBM50530785 was 39,000 nM.
To find out the bioactive molecule class, based on the lower IC50, a classification using the k-means method was implemented in Python on the Anaconda platform. The results were analyzed based on the parameters SSE (sum of squared errors) and silhouette (distance between the class centers). The obtained SSE value shown in
Figure 3 indicates that the overall molecules had to be arranged into three different classes according to the elbow at class level three.
Figure 4 represents the distance between the class centers, indicating that about 95% of all the data were already classified into three classes.
An analysis of the properties of each class was performed by Python.
Figure 5 represents the boxplot of each property of the active class (AlogP, the number of hydrogen-bond donors and acceptors HBD and, HBA, respectively). Based on the boundaries of the boxes in
Figure 6, the results suggested that the most desirable regions for bioactivity were MW > 400, AlogP < 3, and HBA > 6. More details for all the other parameters, which are molecular weight, targets bioactivities, RO5 violations, rotatable bonds, QED weighted, CX BpKa, CX LogP, CX LogD, aromatic rings, inorganic flag, heavy atoms, HBA Lipinski, HBD Lipinski, RO5 violations (Lipinski), and molecular weight (monoisotopic), are presented in
Table 3. The resulting quartiles of all the molecules of active classes were computed and used as a filter for all the molecules in the
C. edulis database by the ligand preparation method (LigPrep).
Pharmacophores were created by Maestro11.2 based on the filter molecules of the active class. The results of pharmacophore creation are shown in
Table 4. Twenty of the best hypotheses were selected, and the performance of each hypothesis is presented in
Table 4. These hypotheses contain four and five features. The hypothesis with the highest AUC was chosen for the next step which is the screening of the
C. edulis phase molecules.
Hypothesis 7 presents the best performance of all the hypotheses with an area under the curve (AUC) value of 0.98.
Figure 6 presents this hypothesis, which was created by Maestro11.2 and formed by a hydrophobic-hydrophobic-ring-ring functional group (HHRR_7) (
Figure 6a). There are 84 active molecules and 48 inactive molecules. All the active molecules have the same features and the same positions of hypothesis HHRR_7 (
Figure 6b).
The receiver operating characteristic (ROC) curve of hypothesis 7 is presented in
Figure 7.
Table 5 represents the results of the screening of the pharmacophoric molecules of the stems and flowers compared to hypothesis 7. At the level of the flower, a single molecule (C
12H
26O
2) with only one conformation was obtained which respects the pharmacophore; however, four molecules (C
16H
17N
3O
2, C
19H
26O
2, C
20H
30O
2, and C
19H
23NO
4) respect hypothesis 7 in the stem. They have 23, 5, 1, and 1 conformations, respectively. These five molecules that were found were subjected to the next step of the pharmacokinetic analysis using the Swiss ADME program.
The pharmacokinetic properties of the five molecules were determined by the Swiss ADME method, which represents the properties in a six-axis landmark: LIPO (lipophilicity), FLEX (flexibility), SIZE, POLAR (polarity), INSATU (saturation) and INSOLU (solubility). From the bioavailability radar plot of the molecule C
16H
17N
3O
2 (
Figure 8), which was extracted from the
C. edulis stem, it can be observed that this molecule does not exceed the limits (pink area). Therefore, this molecule has acceptable properties of adsorption, distribution, metabolism, and excretion. This molecule was selected for the next molecular docking process.
Figure 9 shows a comparison between the molecular binding of natural and chemical ligands. The crystal structure of the JAK 1 and JAK 2 complexes with their inhibitors was extracted by the RCSB PDB database (
Figure 9a), which shows the position of the chemical ligand (in red) in the JAK 1 and JAK 2 complexes. However,
Figure 9b shows the position of selected natural ligands (in purple) in the JAK 1 and JAK 2 complexes. The obtained results of molecular docking show that the two molecules have the same attachment site in the JAK 1/JAK 2 inhibitor complex.
Figure 10 represents the amino acids attached to the inhibitory chemical molecule of the JAK 1/JAK 2 complexes (
Figure 10a) versus the amino acids attached to the natural molecule extracted by the plant
C. edulis (
Figure 10b). This figure clearly shows that the same amino acids are attached to the chemical and natural molecules (Leu1010, ASN1008, Ser963, Gly962, Leu881, Gly882, Glu883, and Val884,); therefore, they potentially have similar activity. These results confirmed that the chemical molecule can be replaced with the natural molecule extracted from the plant
C. edulis which has anti-vitiligo activity.
A three-dimensional quantitative structure activity relationship (3D-QSAR) model was created using the fingerprint parameters as output variables and all the other molecular properties as inputs for the model. Four machine learning models (LR, SVM, RF, and NN) and a bagging/stacking process were used. The results of the learning tests are given by the R-squared values presented in
Table 6. The best model was created using the bagging and stacking techniques. The R-squared coefficients were computed for each model. The best model was the one that was obtained by the bagging/stacking process with an R-squared value 0.999. This model will be validated using experimental data for IC50 and will be the subject of another study in the future. The stability of this natural potential molecule C
16H
17N
3O
2 will be studied for dynamic molecular analysis.