Next Article in Journal
Biosynthesis, Regulation, and Biotechnological Production Strategies of Riboflavin (Vitamin B2) and Its Derivatives: A Review
Next Article in Special Issue
Biological and Teratogenic Evaluations of Nitrogen Heterocycles for Anticancer Therapy
Previous Article in Journal
The Sigma-1 Receptor Agonist Fluvoxamine Is Protective in Hyperglycaemia-Induced Dysfunction of Trabecular Meshwork Cells
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

In Silico Discovery of ABZI Nitrogen Heterocycle STING Agonists via 3D-QSAR, Molecular Dynamics, and AI-Based Synthesis Prediction

1
The First Clinical School, Zhengzhou University, Zhengzhou 450001, China
2
The First Affiliated Hospital, Xinxiang Medical University, Xinxiang 453003, China
3
Henan International Joint Laboratory of Neural Information Analysis and Drug Intelligent Design, School of Medical Engineering, Xinxiang Medical University, Xinxiang 453003, China
4
School of Basic Medical Sciences, Xinxiang Medical University, Xinxiang 453003, China
5
Department of Theoretical Chemistry, Chemical Centre, Lund University, SE-221 00 Lund, Sweden
*
Author to whom correspondence should be addressed.
Pharmaceuticals 2026, 19(3), 387; https://doi.org/10.3390/ph19030387
Submission received: 23 January 2026 / Revised: 10 February 2026 / Accepted: 21 February 2026 / Published: 28 February 2026

Abstract

Background/Objectives: The stimulator of interferon genes (STING) pathway plays a central role in innate immune signaling and represents an attractive therapeutic target for cancer immunotherapy. Amidobenzimidazole (ABZI) derivatives have emerged as promising non-nucleotide STING agonists with improved drug-like properties compared to cyclic dinucleotides. However, current ABZI compounds still exhibit limited oral bioavailability and cross-species potency discrepancies. In addition, potential systemic toxicity remains a concern, indicating the need for further structural optimization. Methods: In this study, a comprehensive computer-aided drug design strategy was employed to systematically investigate ABZI derivatives and identify novel STING agonists with enhanced activity and favorable pharmacokinetic profiles. A 3D quantitative structure–activity relationship (3D-QSAR) model was constructed using the Topomer CoMFA approach based on a dataset of 109 reported ABZI compounds. Guided by the contour map analysis, new chemical groups were introduced through a fragment growth method, generating a large virtual library. The library was subsequently filtered via molecular docking, molecular dynamics simulations, and MM-PBSA binding free energy calculations. Results: Among the newly designed ABZI compounds, five compounds displayed lower binding free energies than D59, with M13 and M44 showing reductions exceeding 6.7 kcal/mol. This work demonstrates the effectiveness of an integrated in silico design strategy for the discovery of novel STING agonists. Conclusions: The identified compounds represent promising candidates for subsequent experimental validation and may support the development of nitrogen heterocycle-based STING agonists for antitumor applications.

1. Introduction

STING is a pivotal adaptor protein in innate immunity, primarily localized on the endoplasmic reticulum (ER). Upon sensing cytosolic DNA, STING undergoes conformational changes and oligomerization, thereby activating downstream signaling pathways. The cyclic GMP–AMP synthase (cGAS)–STING axis constitutes the core mechanism for cytosolic DNA recognition in mammalian cells. Specifically, cGAS senses cytoplasmic DNA and catalyzes the synthesis of 2′3′-cyclic GMP–AMP (2′3′-cGAMP), which binds to and activates STING, ultimately inducing the production of type I interferons (IFNs) and multiple pro-inflammatory cytokines [1,2,3,4,5]. This pathway plays a critical role in antiviral defense, antitumor immunity, and the initiation of adaptive immune responses [6,7,8,9,10]. Consequently, dysregulation of STING signaling is closely associated with infectious diseases, malignancies, and autoimmune disorders, highlighting its potential as a therapeutic target.
Early efforts to develop STING agonists primarily focused on cyclic dinucleotides (CDNs). Among these, the endogenous ligand 2′3′-cGAMP was the first identified natural agonist [11,12], followed by the development of analogs such as ADU-S100 [13,14], MK-1454 [15], and SB11285 [16]. Although CDNs are highly active in vitro and bind strongly, their large size, poor membrane permeability, susceptibility to enzymatic degradation, and short in vivo half-life limit their clinical use [17,18]. Moreover, several CDN candidates have demonstrated unsatisfactory efficacy in clinical trials, whether administered as monotherapy or in combination with immune checkpoint inhibitors [14,19,20,21]. These limitations have prompted the exploration of non-nucleotide agonists (non-CDNs) with improved drug-like properties.
Among non-CDN STING agonists, ABZI derivatives have emerged as particularly promising. Ramanjulu et al. first reported the dimeric ABZI (diABZI) in 2018, which achieved high-affinity binding by spanning the STING dimer pocket and induced complete and durable tumor regression in syngeneic mouse models via intravenous administration, overcoming the intratumoral delivery limitation of CDNs [22]. Nevertheless, diABZI compounds generally require systemic administration and exhibit limited oral bioavailability and potential systemic inflammatory risk [23]. In 2020, Xi et al. reported a series of ABZI derivatives that robustly induced IFN-β, CXCL10, and IL-6 production in vitro. These compounds also demonstrated favorable antitumor activity in mouse models and were evaluated comprehensively for their pharmacokinetic properties [24]. Despite these advances, systemic administration remained necessary, and inter-allelic differences among human STING variants pose translational challenges [25]. Further optimization by Song et al. led to the synthesis of Triazole-40, which maintained high activity, exhibited over 20-fold improved aqueous solubility compared with earlier analogs, and stably bound multiple human STING alleles, highlighting enhanced drug development potential [26]. Nonetheless, significant potency discrepancies between human and murine STING underscore the need for careful cross-species model selection [26]. In addition, ABZI compounds have demonstrated utility beyond oncology; for example, diABZI-4 administered intranasally effectively inhibited SARS-CoV-2 replication and conferred protection in K18-ACE2 transgenic mice [26]. Collectively, ABZI derivatives combine potent STING agonistic activity with favorable drug-like characteristics, yet challenges including oral bioavailability, cross-species potency differences, and systemic inflammatory risk remain, offering opportunities for further structural optimization and delivery strategy development [23,26,27].
In this work, a 3D quantitative structure–activity relationship (3D-QSAR) model was developed to investigate the structure–activity relationships of ABZI derivatives targeting STING. A dataset of 109 ABZI analogs with reported in vitro agonistic activities was collected from the literature [24,28,29,30,31] and used to construct a Topomer CoMFA model, enabling quantitative evaluation of key steric and electrostatic features governing biological activity [32]. Key structural features were extracted from the 3D-QSAR contour maps and used to guide subsequent molecular design. Subsequently, a fragment-based molecular growth strategy was employed to design new candidate compounds, followed by molecular docking to predict potential binding modes, molecular dynamics simulations to assess complex stability, and MM-PBSA calculations to estimate binding free energies. Five prioritized compounds were identified, among which compounds 1–3 exhibited favorable predicted affinity, Caco-2 permeability, microsomal metabolic stability, and toxicity profiles. Additionally, computer-aided synthetic planning (CASP) using AI tools provided feasible synthetic routes for the selected compounds, supporting subsequent experimental synthesis and activity validation. This study provides a framework to support the rational design of ABZI-derived non-CDNs STING agonists with potential antitumor applications.

2. Result and Discussion

2.1. 3D-QSAR Model Construction and Validation

Based on 89 training compounds, a 3D-QSAR model for ABZI derivatives was constructed using the Topomer CoMFA approach. The fragmentation scheme applied to the first compound in the training set is illustrated in Figure 1, which ensures that the designed agonists retain the core scaffold while capturing the influence of various R-group substitutions on biological activity. To quantitatively evaluate model performance, statistical parameters were calculated using both leave-one-out cross-validation and non-cross-validation methods. The optimal number of components (ONC) was identified as 3, resulting in a cross-validated correlation coefficient of 0.55, a conventional correlation coefficient of 0.72. These metrics satisfy the commonly accepted standards for 3D-QSAR modeling, indicating that the model captures meaningful structure–activity relationships within the training set.
To further evaluate predictive reliability the model, an external test set comprising 20 compounds (compound 90–109, Table S1) was employed. The external predictive correlation coefficient ( R e x t e r n a l 2 ) was 0.69, suggesting reasonable generalization. The correlation between experimental and predicted pEC50 values for both the training and test sets is depicted in Figure 2, providing a clear visualization of the model’s ability to capture the activity trends of ABZI derivatives. These results demonstrate that the Topomer CoMFA model’s ability to reproduce overall activity trends of ABZI derivatives.
Finally, Y-randomization tests were further performed to validate the robustness of the model. In this procedure, the independent variables were kept unchanged, while the dependent variable (pEC50) was shuffled 10 times. New models were then reconstructed. Compared to the original model, all the randomized models yielded markedly lower q2 and r2 values (Table 1), which indicates that the constructed Topomer CoMFA model is robust and free from spurious correlations.

2.2. COMFA-Based Structure–Activity Relationship Analysis

The quantitative structure–activity relationship of ABZIs derivatives was analyzed using the Topomer CoMFA model. Colored contour maps generated from the StDev × Coeff coefficients were employed to visualize the relationship between molecular features and biological activity. The contour maps correlate variations in compound activity with steric and electrostatic fields. Steric contours are represented in green and yellow, while electrostatic contours are shown in blue and red (Figure 3). Green steric regions indicate that increasing bulk at these positions is favorable for activity, whereas yellow regions suggest that steric expansion should be avoided. In electrostatic maps, blue regions are favorable for electronegative substituents, while red regions are favorable for electropositive groups to enhance activity. Here, it was also compound D59 used as a reference, the analysis of substituents at each position is as follows:
R1 substituent: The steric map shows limited green regions, indicating that increasing bulk at this position is unfavorable. This observation is supported by the comparison of compounds 3 and D5, and compounds 4 and 14b with their respective pIC50 values.
R2 substituent: A small green area is present near the ethyl group on the pyrazole ring, suggesting that introducing small functional groups or substituting the ethyl group may enhance activity.
R3 substituent: Large green polyhedral regions are observed below the fragment, indicating that introducing bulk at this position is beneficial. Experimental data of compounds 24b vs. 16c and 14f vs. 24c confirms this conclusion.
R4 substituent: The terminal methyl group on the phenyl ring is located in a large green area, and electrostatic analysis indicates that incorporating electronegative groups at this position could further enhance activity.
The steric and electrostatic field analysis suggests that R3 is suitable for bulk optimization, while the terminal phenyl ring of R4 is appropriate for the introduction of electronegative functional groups. Based on these insights, newly designed ABZIs derivatives will focus on fragment growth at the R3 position and the introduction of electronegative groups at the R4 terminal phenyl ring to improve biological activity.
In this work, the R3 site was diversified using the fragment-growing function in MOE, while the R4 site was substituted with groups such as amino, nitro, and halogens. The built-in Linker fragment library in MOE covers a wide chemical space, providing structural diversity for fragment growth. The newly generated compounds, together with seven template molecules, gave a total of 94,850 candidates. Their binding affinity to the STING protein was first evaluated using the London dG scoring function, which was applied during the fragment growth process. Finally, a total of 1030 compounds with binding energies below −15.00 kcal/mol were selected for further analysis [33], and their London dG scoring and molecular docking are shown in Table S2.

2.3. Molecular Docking

Molecular docking was performed on the 1030 newly designed compounds obtained from Section 3.2. Among them, 773 showed better binding energies than the reference compound D59 (−8.47 kcal/mol), suggesting stronger affinity for STING. This outcome also reinforces the reliability of the CoMFA model established in Section 3.1 and Section 3.2.
To identify the most promising candidates, the top 50 compounds based on docking scores were further analyzed, and 10 representative molecules were selected for detailed evaluation (Table 2). These compounds exhibited notably stronger binding to STING than D59, with CoMFA-predicted pEC50 values ranging from 5.00 to 6.00.
Binding mode analysis showed that these 10 compounds consistently occupied the STING active pocket and formed extensive non-covalent interactions with surrounding residues (Figure 4). Notably, nine of them introduced negatively charged substituents (–NH2, –NO2, or halogens) at the R4 position (Table 3). These groups established stable hydrogen bonds with key residues and enhanced hydrophobic contacts, leading to improved binding stability between the ligands and STING.

2.4. Multi-Step Binding Free Energy Calculation

In this study, the 10 ABZI derivatives selected in Section 3.3, together with the reference compound D59, were subjected to 100 ns molecular dynamics (MD) simulations in complex with the STING protein, and the binding free energies were also calculated from the trajectories. As shown in Table 2, four compounds (M1, M11, M13, and M44) exhibited binding free energies lower than that of D59 (−38.88 kcal/mol), with M44 showing a reduction of 4.59 kcal/mol relative to D59, indicating stronger binding affinity to STING.
To address the potential low accuracy associated with using a single short MD simulation for MM-PBSA calculations, we extended the analysis for the four selected complexes (STING-M1, STING-M11, STING-M13, and STING-M44) along with the reference STING-D59 and STING-M7. Each system underwent four independent 300 ns MD simulations in parallel, with the final binding free energies reported as the average across these replicates. This multi-replicate approach was specifically implemented to ensure statistical reliability and to effectively mitigate sampling variability inherent in shorter or single-trajectory simulations. Although M7 exhibited a slightly higher binding free energy than D59 (0.11 kcal/mol), it demonstrated a superior docking score and a more favorable binding mode. Therefore, STING-M7 was also included (the structures of these six compounds were presented in Figure 5). The calculated root-mean-square deviation (RMSD), listed in Figure 6, showed that all systems converged and stabilized around 3 Å, indicating that the complexes maintained stable conformations throughout the 300 ns simulations.
Finally, for each simulation, one frame was saved every 20 ps during the last 20 ns of the equilibrated trajectory, giving 1000 frames per system for subsequent binding free energy calculations. As summarized in Table 4, the binding free energies calculated from four independent 300 ns MD simulations are highly reproducible, with standard errors of 0.86–1.88 kcal/mol across all compounds. In particular, M13 (−43.30 ± 0.98 kcal/mol), M11 (−34.66 ± 0.95 kcal/mol), and M1 (−37.62 ± 0.86 kcal/mol) display SE < 1 kcal/mol, demonstrating robust convergence and reliable statistical meaning of the results. Compared to the reference compound D59 (−35.78 kcal/mol), all derivatives except M11 display more favorable binding free energies. Notably, M13 and M44, exhibited reductions of 7.5 kcal/mol and 6.7 kcal/mol respectively, demonstrating exceptionally high binding affinity to the STING protein.

2.5. Binding Mode

To further elucidate the binding mechanisms of the candidate ABZI derivatives with STING, representative conformations from the equilibrated MD trajectories were analyzed in combination with hydrogen bond occupancy and per-residue decomposition of the MM-PBSA energies. As shown in Figure 7, the reference compound D59 as well as the derivatives M13 and M44 were stably embedded in the STING binding pocket while retaining the conserved ABZI scaffold. The central amide group, serving as a key pharmacophore, formed conserved hydrogen bonds with loop-region residues in the binding site (Figure 8). However, compared with D59, which formed only two primary hydrogen bonds with SER162(A) and SER162(B) (with the highest occupancy of merely 59.62%), M13 and M44 adopted more extended conformations due to the substitutions introduced at the R3 and R4 positions (Table 5). This spatial rearrangement facilitated the formation of a more extensive and stable noncovalent interaction network with surrounding residues.
Specifically, M13 not only maintained the core amide–SER interaction but also established a strong hydrogen bond between the N5 atom of its 1H-pyrazole group and SER162(A)-OG, with an occupancy of 79.71%. In addition, the amide carbonyl formed a hydrogen bond with SER241(A)-N (64.52% occupancy), together with a hydrogen bond to THR263(A) (50.67% occupancy), constituting a stable “Ser162–Ser241–Thr263” three-point hydrogen-bonding network. Moreover, the electron-rich –NH2 group introduced at the R4 position of M13 formed a stable salt bridge with Asp237(A), further extending the ligand conformation and enhancing binding specificity toward the pocket residues.
M44 also preserved multiple hydrogen bonds with SER162(A), SER241(A), and THR263(A), each with an occupancy greater than 60%. In addition, the piperazine substituent introduced a new hydrogen bond donor, forming a unique cross-chain hydrogen bond with VAL239(B)-O, albeit with a lower occupancy of 21.04%. Furthermore, the bulky R3 substituent of M44 enlarged the hydrophobic contact surface, strengthening van der Waals and π–π stacking interactions with Tyr167(A) and surrounding residues, while the amino group at the R4 position engaged in a weaker hydrogen bond with Asp237(A).
Therefore, the amide groups of both M13 and M44 formed stable hydrogen bonds with the Ser241 residue located in the loop region of chain A at the upper part of the STING active pocket, while their 1H-pyrazole moieties established hydrogen bond interactions with Ser162 and other residues at the lower part of the pocket. In addition, the benzimidazole rings of both compounds engaged in hydrophobic interactions and π–π stacking with Tyr167(A), thereby firmly anchoring the ligands within the active pocket. The residue-based free energy decomposition analysis (Figure 9) further revealed that these key residues, including Ser162, Tyr167, and Ser241, made substantial contributions to the binding free energy and played central roles in maintaining the stability of ABZIs–STING interactions.

2.6. ADMET Evaluation

To evaluate the drug-likeness of the designed compounds, ADMET predictions were performed for the candidate compounds M13 and M44, as well as the reference compound D59 [34]. In terms of physicochemical properties, parameters such as aqueous solubility (Log S) and the octanol/water partition coefficient (Log P) not only determine solubility, membrane permeability, and potency, but also exert a crucial impact on pharmacokinetic (PK) and pharmacodynamic (PD) characteristics. Since physicochemical features are largely determined by chemical structure, the introduction of carbon chains or cyclic moieties at the R3 position enhanced hydrophobic and van der Waals interactions of M13 and M44 within the binding pocket, thereby strengthening their binding affinity and specificity toward STING. However, the increase in molecular weight generally leads to decreased solubility. This drawback was partially compensated in M13 and M44 by introducing hydrophilic groups such as ether, hydroxyl, or quaternary ammonium substituents at the R3 position, which improved their Log S and Log P values compared with the reference compound D59.
Regarding drug distribution (Table 6), the predicted blood–brain barrier (BBB) permeability of the reference compound D59 was poor, suggesting its suitability for peripheral targeting. By contrast, certain designed molecules (e.g., M13) showed favorable BBB penetration, implying potential as central nervous system (CNS)-targeted agents. Plasma protein binding (PPB) analysis indicated that D59 exhibits strong binding to plasma proteins, which may lower the free drug concentration in vivo and reduce oral bioavailability. In comparison, M13 and M44 displayed more favorable PPB profiles.
Toxicity assessment revealed that cardiotoxicity remains a potential concern. The human ether-a-go-go related gene (hERG) channel blockade, a key predictor of drug-induced cardiotoxicity, indicated that both D59 and M13 present relatively high risks, which may pose challenges for their further developability and therefore warrant careful consideration in lead optimization, as well as subsequent in vitro validation of cardiac safety. For skin sensitization, all tested molecules contained alerting structures such as 1H-benzo[d]imidazol-2-amine, which may trigger hypersensitivity reactions; moreover, the amino substituents introduced in M13 and M44 could further increase sensitization risk. These structural alerts suggest that future optimization should focus on scaffold modification or substituent tuning to mitigate sensitization liabilities while preserving STING activity. In cytotoxicity prediction assays against A549 (lung adenocarcinoma) and normal cells, the overall toxicity levels of the candidate compounds were lower than that of D59, indicating a comparatively favorable cellular safety profile and supporting their prioritization as in silico starting points rather than fully validated leads.

2.7. Retrosynthetic Analysis

Structurally, the compounds consist of 4-, 5-, or 6-ring systems with moderate complexity, lacking excessive fused or bridged rings that are known to elevate synthetic challenges due to increased stereochemical and regioselective demands. Additionally, all four compounds exhibited zero violations of Lipinski’s Rule of Five, confirming their drug-like properties with balanced physicochemical profiles and no evident structural liabilities such as atomic diversity extremes or atypical functional groups. This enhances their post-synthesis application potential in drug development.
To evaluate the synthetic accessibility of the identified compounds, we employed a combination of established metrics, starting with the widely used Synthetic Accessibility Score (SAscore), a molecular structure-based heuristic [34]. As shown in Table 6, the SAscore values for these four compounds are all <6, indicating they are all relatively easy to synthesize. Complementing this, we utilized DeepSA, a deep learning-driven chemical language model trained on over 3.5 million molecules using natural language processing techniques to predict synthesis accessibility [35]. Table S3 presents the DeepSA predictions: ES probabilities spanned 0.77 to 0.997, while HS probabilities were consistently low, spanning 0.0029 to 0.222. All HS values were well below the model’s thresholds of 0.47 for ES/HS balance and 0.5 for definitive HS classification. These results collectively indicate low synthetic difficulty, aligning with DeepSA’s high discriminative performance (AUROC of 89.6% in validation studies).
To assess overall synthetic accessibility based on molecular structure, we applied the Retrosynthetic Accessibility Score (RAscore), a machine learning model trained on AI-driven retrosynthesis data to predict synthesizability by estimating whether a viable synthetic pathway can be identified via computer-aided synthesis planning [36]. As shown in Table S4, most compounds exhibited high RAscores (0.59–0.87), indicating favorable synthetic feasibility, except for M11 (0.33), which suggests higher potential synthesis challenges.
Building on these predictions, we further validated feasibility by generating retrosynthetic routes using ChemAIRS (https://www.chemiscal.com/), an AI-powered retrosynthesis platform developed by Chemical. AI that leverages machine learning and expert chemical rules to generate diverse, efficient pathways by analyzing thousands of potential routes. The highest-ranked route for each compound was evaluated, with that of M13 serving as a representative example and selected as the primary synthesis scheme for this study (Figure S1). These assessments collectively support the compounds’ practical viability for experimental synthesis in drug development.

3. Materials and Method

3.1. Data Sets and Structure Alignment

In this work, a total of reported 109 ABZI derivatives reported were collected as the dataset for 3D-QSAR modeling, and their chemical structures and biological activities are summarized in Table S1. The EC50 values of the collected compounds were within the range of 0.065 to 51.07 μM. For compatibility with the 3D-QSAR model, EC50 values were converted to pEC50 (−logEC50), with values ranging from 3.991 to 7.187.
All compounds were preprocessed using SYBYL-X v2.0. Molecular 3D conformations were optimized with the Tripos force field, and Gasteiger–Hückel charges were assigned. Energy minimization was performed using the Powell gradient algorithm, with a maximum of 1000 iterations and an energy convergence criterion of 0.001 kcal/mol/Å2. To ensure conformational reliability, the lowest-energy conformation of each molecule was selected as the representative structure for subsequent modeling.
The selection of a template compound is critical for the accuracy of 3D-QSAR alignment. In this study, the most potent compound, D59, was chosen as the template, and all other molecules were aligned to its core scaffold. The dataset was randomly divided into a training set (89) for model construction and a test set (20) for external validation.

3.2. Topomer CoMFA Model Construction

The 3D-QSAR model was constructed using the Topomer CoMFA approach implemented in SYBYL-X v2.0 [37]. Topomer CoMFA is a fragment-based method in which molecules are divided into a core scaffold and R-group substituents. Conformations of the fragments are automatically generated and aligned according to empirical rules, thereby creating a three-dimensional lattice encompassing all compounds in the training set. The lattice spacing was set to 2.0 Å, and an sp3-hybridized carbon atom probe was applied to calculate steric and electrostatic field energies around the molecules. The resulting descriptors were used as independent variables, while the pEC50 values served as dependent variables for regression analysis using the partial least squares (PLS) method [38].
The construction of the Topomer CoMFA model was based on the optimal number of principal components, which was determined through cross-validation. The predictive quality of the established model was assessed using both cross-validated and non-cross-validated approaches, yielding the cross-validation coefficient (q2) and the correlation coefficient (r2), respectively. To further evaluate the robustness of the model, Y-randomization testing was performed: while keeping the independent variables (X) unchanged, the dependent variable (pEC50) was randomly permuted 10 times and the models were reconstructed. In all randomized cases, the resulting q2 and r2 values were markedly lower than those of the original model, indicating that the Topomer CoMFA model possessed reliable predictive capability. The validated 3D-QSAR model further enabled the interpretation of structure–activity relationships of ABZI derivatives through contour map analysis and facilitated the prediction of their bioactivity values.

3.3. Fragment Growing

Based on the structure–activity relationships revealed by the Topomer CoMFA model constructed in Section 3.2, a fragment-based growth strategy was employed to optimize the ABZI scaffold and design potential derivatives with enhanced bioactivity. The most active compound, D59, was selected as the template, with the R3 and R4 positions designated as primary modification sites. Fragment growth was performed using the Linker fragment library integrated in MOE v2019 (Chemical Computing Group, Montreal, QC, Canada) [39], which encompasses a variety of common functional groups and structural motifs, thereby enabling coverage of a broad chemical space.
For the newly generated molecules, preliminary evaluation of their binding affinity toward the STING protein was conducted using the London dG [40] scoring function. London dG is an empirical scoring function that estimates the relative binding free energy between small molecules and their receptor, allowing rapid assessment of potential interactions. In the present study, the obtained scores were used to prioritize candidate compounds and guide subsequent molecular docking and molecular dynamics simulations.

3.4. Molecular Docking

To investigate the binding affinity and interaction patterns between the newly designed ABZI derivatives and STING, molecular docking simulations were performed using MOE v2019 [39]. Crystal structures of STING complexed with ABZI ligands (PDB IDs: 6DXL) [22] were obtained from the RCSB Protein Data Bank [41]. All protein and ligand structures were prepared and minimized using the Amber10:EHT force field in MOE prior to docking. Missing amino acid residues in the protein were completed using QuickPrep (MOE), followed by protonation via Protonate3D, and energy minimization with an RMS gradient of 0.1 kcal/mol/Å2. Ligands were prepared by adding hydrogen atoms, computing partial charges, and performing energy minimization under the same parameters.
The prepared protein structures were designated as receptors and the ligands as docking partners, with the binding site defined by the location of the co-crystallized ligand. Docking was conducted in two stages: placement and refinement. During the placement stage, the Triangle Matcher method was used to generate initial poses, which were scored by the London dG function, and the top 50 poses were selected for further refinement. In the refinement stage, a semi-flexible docking protocol was employed, keeping the protein rigid while allowing ligand flexibility, and the GBVI/WSA dG scoring function was used to evaluate the binding affinity of each pose. The final docking results were reported as the best-scoring conformations.

3.5. Molecular Dynamics Simulation and Binding Free Energy Calculation

Molecular dynamics (MD) simulations were performed using Amber18. Topology and initial coordinates of all the system were prepared with the tleap module of AmberTools18. The ff14SB force filed [42] was applied to the protein, while the ligand was described using GAFF [43], with missing parameters generated via parmchk2. The complex was solvated in an octahedral TIP3P water box, ensuring a minimum distance of 12 Å between any solute atom and the box edge, and counterions (Na+/Cl) were added to neutralize the system.
Energy minimization was conducted in two stages. In the first stage, a Cartesian restraint of 500 kcal/mol/Å2 was applied to the protein and ligand atoms, and the solvent molecules were minimized for 10,000 steps (first 5000 steps by steepest descent, followed by 5000 steps of conjugate gradient). In the second stage, all restraints were removed, and the entire system underwent an 10,000-step unrestrained minimization.
The minimized system was gradually heated from 0 K to 300 K under the NVT ensemble, with restraints on the protein backbone atoms to prevent unrealistic conformational changes, using the Langevin thermostat. Pre-equilibration was then performed under NPT conditions (1 atm, 300 K) for 2 ns. The production phase consisted of three independent 300 ns MD simulations under the same NPT conditions, each initiated with a different random seed to ensure independent trajectories. A time step of 2 fs was used, and all bonds involving hydrogen atoms were constrained using the SHAKE algorithm [44]. Non-bonded interactions were truncated at 10 Å, and long-range electrostatics were treated using the Particle Mesh Ewald method [45]. Atomic coordinates were saved every 50 ps for subsequent analysis.
Trajectory analysis was performed using Cpptraj, and the RMSD of the system was calculated to assess equilibration. To evaluate the binding affinity between receptor and ligand, Molecular Mechanics/Poisson–Boltzmann Surface Area (MM-PBSA) calculations were carried out using MMPBSA.py [46,47]. The average binding free energy was computed over 2000 frames extracted from the last 20 ns of stable trajectory. The binding free energy Δ G b i n d could be calculated from:
Δ G b i n d = G c o m p l e x ( G r e c e p t o r + G l i g a n d )
Subsequently, per-residue decomposition analysis was performed for the last 20 ns of the trajectory to identify key amino acid residues contributing significantly to the binding free energy and to elucidate their roles in ligand–receptor interactions.

3.6. Retrosynthetic Analysis

To reduce trial costs and shorten the synthesis cycle for target compounds, this work used the online server ChemAirs (pro.chemiscal.com/home) to assess the synthetic difficulty of candidate compounds and predict synthetic routes. The server uses the SA score (1–10, with higher scores indicating greater synthetic difficulty) to quantify synthetic accessibility. The retrosynthetic analysis based on chemical reaction patterns, uses chain scission to gradually decompose the target molecule into intermediates and commercially available precursors. The server’s default parameters were used throughout the analysis. Retrosynthetic recommendations were screened based on synthetic difficulty and raw material costs to determine the optimal synthetic strategy.

4. Conclusions

In this work, an integrated computational framework combining 3D-QSAR modeling, fragment-based molecular design, molecular docking, molecular dynamics simulations, binding free energy calculations, and ADMET prediction was applied to the rational optimization of ABZI-derived STING agonists, identifying steric expansion at the R3 position and the introduction of electronegative substituents at the R4 terminal phenyl ring as favorable strategies for enhancing STING agonistic activity. Guided by these findings, a large virtual compound library was generated and systematically screened, leading to the identification of several high-affinity candidates.
M13 and M44 exhibited markedly enhanced binding affinity compared with the reference compound D59, as confirmed by extensive molecular dynamics simulations and MM-PBSA calculations. These compounds formed stable and extensive noncovalent interaction networks with key residues in the STING binding pocket, including conserved hydrogen bonds with Ser162 and Ser241, as well as strengthened hydrophobic and π–π interactions with Tyr167. ADMET analyses further indicated that M13 and M44 possess improved physicochemical and pharmacokinetic properties, while maintaining acceptable safety profiles, highlighting their potential as lead compounds for further development. In addition, AI-assisted synthetic route prediction provided practical guidance for their experimental synthesis and biological evaluation.
Although this is an in silico discovery study, the identified computational leads may offer valuable structural insights and inspiration for non-CDN STING agonist research. However, synthesis of these novel ABZI derivatives is currently challenging due to limited access to key intermediates; we are actively trying to obtain these intermediates, exploring viable synthetic routes, and planning to experimentally validate the top candidates in subsequent work.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ph19030387/s1, Table S1: Activity data (pEC50) and predicted pEC50 values (Pred) of the training set; Table S2: Compound molecular docking score top50; Table S3: The prediction results of DeepSA. HA_num rule_of_five represent the number of heavy atoms, Lipinski’s Rule of Five respectively. ES and HS represent probability values, a molecule requires less than or equal to 10 synthetic steps was labeled as ES, otherwise, if the required step is larger than 10 or can’t be successfully predicted by Retro* was labeled as HS; Table S4: The prediction results of RAscore using XGB model. The values represent synthetic accessibility probability; Figure S1: Predicted synthetic route of compound M13; Figure S2: Details of thepredicted synthetic route for compound M13. References in Supplementary Materials can be found [28,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75].

Author Contributions

Conceptualization, B.Z.; Methodology, M.W.; Software, X.P.; Validation, H.R., B.Z., Y.J. and S.Z.; Formal analysis, B.Z., Y.J. and S.Z.; Data curation, H.R.; Writing—original draft, H.R. and X.P.; Writing—review & editing, M.W.; Visualization, X.P.; Supervision, M.W.; Project administration, M.W.; Funding acquisition, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (No. 22303076).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article and Supplementary Materials. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Schoggins, J.W. Interferon-stimulated genes: What do they all do? Annu. Rev. Virol. 2019, 6, 567–584. [Google Scholar] [CrossRef]
  2. Ishikawa, H.; Barber, G.N. STING is an endoplasmic reticulum adaptor that facilitates innate immune signalling. Nature 2008, 455, 674–678. [Google Scholar] [CrossRef]
  3. Barber, G.N. STING: Infection, inflammation and cancer. Nat. Rev. Immunol. 2015, 15, 760–770. [Google Scholar] [CrossRef] [PubMed]
  4. Zhang, H.; You, Q.-D.; Xu, X.-L. Targeting stimulator of interferon genes (STING): A medicinal chemistry perspective. J. Med. Chem. 2019, 63, 3785–3816. [Google Scholar] [CrossRef] [PubMed]
  5. Wu, J.; Sun, L.; Chen, X.; Du, F.; Shi, H.; Chen, C.; Chen, Z.J. Cyclic GMP-AMP is an endogenous second messenger in innate immune signaling by cytosolic DNA. Science 2013, 339, 826–830. [Google Scholar] [CrossRef]
  6. Ergun, S.L.; Fernandez, D.; Weiss, T.M.; Li, L. STING polymer structure reveals mechanisms for activation, hyperactivation, and inhibition. Cell 2019, 178, 290–301. e10. [Google Scholar] [CrossRef]
  7. Li, X.-D.; Wu, J.; Gao, D.; Wang, H.; Sun, L.; Chen, Z.J. Pivotal roles of cGAS-cGAMP signaling in antiviral defense and immune adjuvant effects. Science 2013, 341, 1390–1394. [Google Scholar] [CrossRef]
  8. Shang, G.; Zhang, C.; Chen, Z.J.; Bai, X.-c.; Zhang, X. Cryo-EM structures of STING reveal its mechanism of activation by cyclic GMP–AMP. Nature 2019, 567, 389–393. [Google Scholar] [CrossRef] [PubMed]
  9. Ablasser, A.; Goldeck, M.; Cavlar, T.; Deimling, T.; Witte, G.; Röhl, I.; Hopfner, K.-P.; Ludwig, J.; Hornung, V. cGAS produces a 2′-5′-linked cyclic dinucleotide second messenger that activates STING. Nature 2013, 498, 380–384. [Google Scholar] [CrossRef]
  10. Cai, X.; Chiu, Y.-H.; Chen, Z.J. The cGAS-cGAMP-STING pathway of cytosolic DNA sensing and signaling. Mol. Cell 2014, 54, 289–296. [Google Scholar] [CrossRef]
  11. Zhang, X.; Shi, H.; Wu, J.; Zhang, X.; Sun, L.; Chen, C.; Chen, Z.J. Cyclic GMP-AMP containing mixed phosphodiester linkages is an endogenous high-affinity ligand for STING. Mol. Cell 2013, 51, 226–235. [Google Scholar] [CrossRef]
  12. Gao, P.; Ascano, M.; Zillinger, T.; Wang, W.; Dai, P.; Serganov, A.A.; Gaffney, B.L.; Shuman, S.; Jones, R.A.; Deng, L. Structure-function analysis of STING activation by c [G (2′, 5′) pA (3′, 5′) p] and targeting by antiviral DMXAA. Cell 2013, 154, 748–762. [Google Scholar] [CrossRef]
  13. Corrales, L.; Glickman, L.H.; McWhirter, S.M.; Kanne, D.B.; Sivick, K.E.; Katibah, G.E.; Woo, S.-R.; Lemmens, E.; Banda, T.; Leong, J.J. Direct activation of STING in the tumor microenvironment leads to potent and systemic tumor regression and immunity. Cell Rep. 2015, 11, 1018–1030. [Google Scholar] [CrossRef]
  14. Zandberg, D.; Ferris, R.; Laux, D.; Mehra, R.; Nabell, L.; Kaczmar, J.; Gibson, M.; Kim, Y.; Neupane, P.; Bauman, J. 71P A phase II study of ADU-S100 in combination with pembrolizumab in adult patients with PD-L1+ recurrent or metastatic HNSCC: Preliminary safety, efficacy and PK/PD results. Ann. Oncol. 2020, 31, S1446–S1447. [Google Scholar] [CrossRef]
  15. Chang, W.; Altman, M.D.; Lesburg, C.A.; Perera, S.A.; Piesvaux, J.A.; Schroeder, G.K.; Wyss, D.F.; Cemerski, S.; Chen, Y.; DiNunzio, E. Discovery of MK-1454: A potent cyclic dinucleotide stimulator of interferon genes agonist for the treatment of cancer. J. Med. Chem. 2022, 65, 5675–5689. [Google Scholar] [CrossRef]
  16. Janku, F.; Strauss, J.; Karim, R.; Olszanski, A.J.; Luke, J.J.; Leach, K.; Iyer, R.; Abbas, A. A phase Ia/Ib dose-escalation study of intravenously administered SB 11285 alone and in combination with nivolumab in patients with advanced solid tumors. J. Clin. Oncol. 2020, 38, TPS3162. [Google Scholar] [CrossRef]
  17. Gao, J.; Tao, J.; Liang, W.; Zhao, M.; Du, X.; Cui, S.; Duan, H.; Kan, B.; Su, X.; Jiang, Z. Identification and characterization of phosphodiesterases that specifically degrade 3′ 3′-cyclic GMP-AMP. Cell Res. 2015, 25, 539–550. [Google Scholar] [CrossRef] [PubMed]
  18. Corrales, L.; McWhirter, S.M.; Dubensky, T.W.; Gajewski, T.F. The host STING pathway at the interface of cancer and immunity. J. Clin. Investig. 2016, 126, 2404–2411. [Google Scholar] [CrossRef]
  19. Garon, E.B.; Rizvi, N.A.; Hui, R.; Leighl, N.; Balmanoukian, A.S.; Eder, J.P.; Patnaik, A.; Aggarwal, C.; Gubens, M.; Horn, L. Pembrolizumab for the treatment of non–small-cell lung cancer. N. Engl. J. Med. 2015, 372, 2018–2028. [Google Scholar] [CrossRef]
  20. Endo, A.; Kim, D.-S.; Huang, K.-C.; Hao, M.-H.; Mathieu, S.; Choi, H.-w.; Majumder, U.; Zhu, X.; Shen, Y.; Sanders, K. Discovery of E7766: A representative of a novel class of macrocycle-bridged STING agonists (MBSAs) with superior potency and pan-genotypic activity. Cancer Res. 2019, 79, 4456. [Google Scholar] [CrossRef]
  21. Dumbrava, E.E.; Friedman, C.F.; Hamilton, E.P.; Chen, C.T.; Sumrow, B.J.; JeBailey, L.; Klempner, S.J. 654 XMT-2056: A phase 1 study of a HER2-directed STING agonist antibody drug conjugate (ADC) in HER2+ solid tumors. J. Immunother. Cancer 2024, 12, A751. [Google Scholar] [CrossRef]
  22. Ramanjulu, J.M.; Pesiridis, G.S.; Yang, J.; Concha, N.; Singhaus, R.; Zhang, S.-Y.; Tran, J.-L.; Moore, P.; Lehmann, S.; Eberl, H.C. Design of amidobenzimidazole STING receptor agonists with systemic activity. Nature 2018, 564, 439–443. [Google Scholar] [CrossRef]
  23. Villanueva, M.T. STINGing systemically. Nat. Rev. Drug Discov. 2019, 18, 15–16. [Google Scholar] [CrossRef]
  24. Xi, Q.; Wang, M.; Jia, W.; Yang, M.; Hu, J.; Jin, J.; Chen, X.; Yin, D.; Wang, X. Design, synthesis, and biological evaluation of amidobenzimidazole derivatives as stimulator of interferon genes (STING) receptor agonists. J. Med. Chem. 2019, 63, 260–282. [Google Scholar] [CrossRef] [PubMed]
  25. Kennedy, R.B.; Haralambieva, I.H.; Ovsyannikova, I.G.; Voigt, E.A.; Larrabee, B.R.; Schaid, D.J.; Zimmermann, M.T.; Oberg, A.L.; Poland, G.A. Polymorphisms in STING affect human innate immune responses to poxviruses. Front. Immunol. 2020, 11, 567348. [Google Scholar] [CrossRef]
  26. Song, Z.; Wang, X.; Zhang, Y.; Gu, W.; Shen, A.; Ding, C.; Li, H.; Xiao, R.; Geng, M.; Xie, Z. Structure–activity relationship study of amidobenzimidazole analogues leading to potent and systemically administrable stimulator of interferon gene (STING) agonists. J. Med. Chem. 2021, 64, 1649–1669. [Google Scholar] [CrossRef]
  27. Humphries, F.; Shmuel-Galia, L.; Jiang, Z.; Wilson, R.; Landis, P.; Ng, S.-L.; Parsi, K.M.; Maehr, R.; Cruz, J.; Morales, A. A diamidobenzimidazole STING agonist protects against SARS-CoV-2 infection. Sci. Immunol. 2021, 6, eabi9002. [Google Scholar] [CrossRef]
  28. Chen, N.-N.; Zhang, H.; Zhu, Q.-S.; Zeng, T.; Dai, W.; Zhou, Y.-L.; Xin, G.-F.; Wu, B.-D.; Gong, S.-J.; Jiang, Z.-Y. Development of orally bioavailable amidobenzimidazole analogues targeting stimulator of interferon gene (STING) receptor. J. Med. Chem. 2023, 66, 5584–5610. [Google Scholar] [CrossRef] [PubMed]
  29. Jeon, M.J.; Lee, H.; Lee, J.; Baek, S.Y.; Lee, D.; Jo, S.; Lee, J.-Y.; Kang, M.; Jung, H.R.; Han, S.B. Development of potent immune modulators targeting stimulator of interferon genes receptor. J. Med. Chem. 2022, 65, 5407–5432. [Google Scholar] [CrossRef]
  30. Jeon, M.J.; Lee, H.; Jo, S.; Kang, M.; Jeong, J.H.; Jeong, S.H.; Lee, J.-Y.; Song, G.Y.; Choo, H.; Lee, S. Discovery of novel amidobenzimidazole derivatives as orally available small molecule modulators of stimulator of interferon genes for cancer immunotherapy. Eur. J. Med. Chem. 2023, 261, 115834. [Google Scholar] [CrossRef] [PubMed]
  31. Liu, X.; Wang, M.; Yang, M.; Sun, H.; Wang, B.; Pan, X.; Chen, X.; Jin, J.; Wang, X. Structure-activity relationship study of amidobenzimidazole derivatives as stimulator of interferon genes (STING) agonists. Eur. J. Med. Chem. 2023, 246, 114943. [Google Scholar] [CrossRef]
  32. Cramer, R.D. Topomer CoMFA: A design methodology for rapid lead optimization. J. Med. Chem. 2003, 46, 374–388. [Google Scholar] [CrossRef] [PubMed]
  33. Muteeb, G.; Alshoaibi, A.; Aatif, M.; Rehman, M.T.; Qayyum, M.Z. Screening marine algae metabolites as high-affinity inhibitors of SARS-CoV-2 main protease (3CLpro): An in silico analysis to identify novel drug candidates to combat COVID-19 pandemic. Appl. Biol. Chem. 2020, 63, 79. [Google Scholar] [CrossRef]
  34. Fu, L.; Shi, S.; Yi, J.; Wang, N.; He, Y.; Wu, Z.; Peng, J.; Deng, Y.; Wang, W.; Wu, C. ADMETlab 3.0: An updated comprehensive online ADMET prediction platform enhanced with broader coverage, improved performance, API functionality and decision support. Nucleic Acids Res. 2024, 52, W422–W431. [Google Scholar] [CrossRef]
  35. Wang, S.; Wang, L.; Li, F.; Bai, F. DeepSA: A deep-learning driven predictor of compound synthesis accessibility. J. Cheminform. 2023, 15, 103. [Google Scholar] [CrossRef]
  36. Thakkar, A.; Chadimová, V.; Bjerrum, E.J.; Engkvist, O.; Reymond, J.-L. Retrosynthetic Accessibility Score (RAscore)—Rapid Machine Learned Synthesizability Classification from AI Driven Retrosynthetic Planning. Chem. Sci. 2021, 12, 3339–3349. [Google Scholar] [CrossRef]
  37. Cramer, R.D.; Patterson, D.E.; Bunce, J.D. Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. J. Am. Chem. Soc. 1988, 110, 5959–5967. [Google Scholar] [CrossRef] [PubMed]
  38. Wold, S.; Sjöström, M.; Eriksson, L. PLS-regression: A basic tool of chemometrics. Chemom. Intell. Lab. Syst. 2001, 58, 109–130. [Google Scholar] [CrossRef]
  39. Ulc, C. Molecular Operating Environment (MOE); Chemical Computing Group: Montreal, QC, Canada, 2019. [Google Scholar]
  40. Li, Y.; Han, L.; Liu, Z.; Wang, R. Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results. J. Chem. Inf. Model. 2014, 54, 1717–1736. [Google Scholar] [CrossRef]
  41. Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The protein data bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef] [PubMed]
  42. Maier, J.A.; Martinez, C.; Kasavajhala, K.; Wickstrom, L.; Hauser, K.E.; Simmerling, C. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 2015, 11, 3696–3713. [Google Scholar] [CrossRef]
  43. Wang, J.; Wolf, R.M.; Caldwell, J.W.; Kollman, P.A.; Case, D.A. Development and testing of a general amber force field. J. Comput. Chem. 2004, 25, 1157–1174. [Google Scholar] [CrossRef] [PubMed]
  44. van Gunsteren, W.F.; Berendsen, H.J. Algorithms for macromolecular dynamics and constraint dynamics. Mol. Phys. 1977, 34, 1311–1327. [Google Scholar] [CrossRef]
  45. Darden, T.; York, D.; Pedersen, L. Particle mesh Ewald: An N log (N) method for Ewald sums in large systems. J. Chem. Phys. 1993, 98, 10089. [Google Scholar] [CrossRef]
  46. Miller, B.R., III; McGee, T.D., Jr.; Swails, J.M.; Homeyer, N.; Gohlke, H.; Roitberg, A.E. MMPBSA.py: An efficient program for end-state free energy calculations. J. Chem. Theory Comput. 2012, 8, 3314–3321. [Google Scholar] [CrossRef] [PubMed]
  47. Genheden, S.; Ryde, U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert. Opin. Drug Discov. 2015, 10, 449–461. [Google Scholar] [CrossRef]
  48. Kim, H.; Lee, K.; Kim, C.; Lim, J.; Kim, W.Y. DFRscore: Deep Learning-Based Scoring of Synthetic Complexity with Drug-Focused Retrosynthetic Analysis for High-Throughput Virtual Screening. J. Chem. Inf. Model. 2024, 64, 2432–2444. [Google Scholar] [CrossRef]
  49. Nimbus Titan, Inc. STING Pyrazole Agonists and Uses Thereof. WO 2020132566 A1, 25 June 2020. [Google Scholar]
  50. Bradley, P.A.; Carroll, R.J.; Lecouturier, Y.C.; Moore, R.; Noeureuil, P.; Patel, B.; Wheeler, S. The synthesis of two potent β-3 adrenergic receptor agonists. Org. Process Res. Dev. 2010, 14, 1326–1336. [Google Scholar] [CrossRef]
  51. Medical Research Council. Cyclopropene Amino Acids and Methods. WO 2015136265 A1, 17 September 2015. [Google Scholar]
  52. Parker, J.S.; McCormick, M.; Anderson, D.W.; Maltman, B.A.; Gingipalli, L.; Toader, D. The development and scale-up of an antibody drug conjugate tubulysin payload. Org. Process Res. Dev. 2017, 21, 1602–1609. [Google Scholar] [CrossRef]
  53. Li, J.; Zhang, D.Y.; Pan, F.; Ma, R.; Zhu, W.J.; Lü, K.Z.; Chen, X.; Zhang, Y. A Macrocyclic Immunomodulator. CN 111471056 A, 2 July 2021. [Google Scholar]
  54. Moseley, J.D.; Moss, W.O.; Welham, M.J. Investigation of an alternative route to ZD3638 and cost-benefit analysis comparison of raw materials with the previous route. Org. Process Res. Dev. 2001, 5, 491–497. [Google Scholar] [CrossRef]
  55. Bonrath, W.; Bourgeois, F.; Medlock, J.A.; Sparr, C. Stereoselective Synthesis of Enantiomerically-Enriched Pantolactone. WO 2019228874 A1, 5 December 2019. [Google Scholar]
  56. Rowbottom, M.W.; Hutchinson, J.H. Lysyl Oxidase-Like 2 Inhibitors and Uses Thereof. WO 2017003862 A1, 5 January 2017. [Google Scholar]
  57. Kim, D.S.; Yoshizawa, K.; Mitasev, B.; Schnaderbeck, M.; Zhang, H.; Omori, M.; Kayano, A.; Nagai, M.; Wakasugi, K.; Watanabe, Y.; et al. Synthesis of BACE1 Inhibitors E2609/E2071 via Oxime–Olefin Cycloaddition Following a Process Risk Mitigation Strategy. Org. Process Res. Dev. 2021, 26, 804–816. [Google Scholar] [CrossRef]
  58. Yasuma, T.; Sasaki, S.; Sakai, N. Phenylpropanoic Acid Derivatives. EP 1698624 A1, 6 September 2006. [Google Scholar]
  59. Meyers, M.J.; Singh, M.; Stallings, C.L.; Weiss, L.A.; Wildman, S.; Arnett, S.D. Thieno[2,3-d]pyrimidines and Benzofuro[3,2-d]pyrimidines as Antimicrobial Agents. WO 2019018359 A1, 24 January 2019. [Google Scholar]
  60. Yokoyama, K.; Kato, K.; Kitahara, T.; Ohno, H.; Nishina, T.; Kumakura, M.; Awaya, A.; Nakano, T.; Watanabe, K.; Saruta, S. Quinazoline Compounds and Antihypertensives. US 4734418 A, 29 March 1988. [Google Scholar]
  61. Ji, N.; Sintchak, M.D.; Zhang, Y.; Zheng, X. Fused-Glutarimide CRBN Ligands and Uses Thereof. WO 2021011631 A1, 21 January 2021. [Google Scholar]
  62. You, Q.D.; Xu, X.L.; Chen, N.N.; Zhang, H.; Zhu, Q.S.; Dai, W.; Zhou, Y.L.; Jiang, Z.Y.; Guo, X.K.; Wang, L.; et al. A Benzimidazole Compound, a Pharmaceutical Composition Comprising the Benzimidazole Compound, and The Use of Both. CN 115141182 A, 4 October 2022. [Google Scholar]
  63. Bode, C.M.; Cheng, A.C.; Choquette, D.; Lewis, R.T.; Potashman, M.H.; Romero, K.; Stellwagen, J.C.; Whittington, D.A. Benzimidazole and Azabenzimidazole Compounds That Inhibit Anaplastic Lymphoma Kinase. WO 2012018668 A1, 9 February 2012. [Google Scholar]
  64. Thottumkara, K.V.; Thottumkara, A.P. User- and Eco-Friendly Hypervalent Iodine Reagent and Method of Synthesis. US 20040030187 A1, 12 February 2004. [Google Scholar]
  65. Barda, D.A.; Mader, M.M. PI3 Kinase/mTOR Dual Inhibitor. US 20120184577 A1, 19 July 2012. [Google Scholar]
  66. Ashwell, S.; Gero, T.; Ioannidis, S.; Janetka, J.; Lyne, P.; Oza, V.; Springer, S.; Su, M.; Yu, D. Substituted Thiophenes and Uses Thereof. WO 2005016909 A1, 24 February 2005. [Google Scholar]
  67. Lu, Z.; Yoon, T.P. Visible Light Photocatalysis of [2+2] Styrene Cycloadditions via Energy Transfer. Angew. Chem. Int. Ed. 2012, 51, 10329. [Google Scholar] [CrossRef]
  68. Collier, P.N.; Campbell, A.D.; Patel, I.; Raynham, T.M.; Taylor, R.J. Enantiomerically pure alpha-amino acid synthesis via hydroboration-suzuki cross-coupling. J. Org. Chem. 2002, 67, 1802–1815. [Google Scholar] [CrossRef]
  69. Amin, R.; Chen, J.X.; Cotterill, I.C.; Emrich, D.; Ganley, D.; Khmelnitsky, Y.L.; Yang, Q. Improved Synthesis of the C16–C20 Segment of Resolvin E1 Using Enantioselective Ketone Reduction and Lipase-Catalyzed Resolution. Org. Process Res. Dev. 2013, 17, 915–920. [Google Scholar] [CrossRef]
  70. Goto, T.; Urabe, D.; Isobe, Y.; Arita, M.; Inoue, M. Total synthesis of four stereoisomers of (5Z, 8Z, 10E, 14Z)-12-hydroxy-17,18-epoxy-5,8,10,14-eicosatetraenoic acid and their anti-inflammatory activities. Tetrahedron 2015, 71, 8320–8332. [Google Scholar] [CrossRef]
  71. Pflästerer, D.; Rettenmeier, E.; Schneider, S.; de Las Heras Ruiz, E.; Rudolph, M.; Hashmi, A.S.K. Highly Efficient Gold-Catalyzed Synthesis of Dibenzocycloheptatrienes. Chem. Eur. J. 2014, 20, 6752–6755. [Google Scholar] [CrossRef] [PubMed]
  72. Austad, B.C.; Calkins, T.L.; Chase, C.E.; Fang, F.G.; Horstmann, T.E.; Hu, Y.; Watanabe, T. Commercial manufacture of Halaven®: Chemoselective transformations en route to structurally complex macrocyclic ketones. Synlett 2013, 24, 333–337. [Google Scholar] [PubMed]
  73. Bravo, P.; Fustero, S.; Guidetti, M.; Volonterio, A.; Zanda, M. Stereoselective Mannich-type reaction of an acyclic ketimine with a substituted chlorotitanium enolate: Efficient approach to d-erythro-α-trifluoromethyl-β-hydroxyaspartic units. J. Org. Chem. 1999, 64, 8731–8735. [Google Scholar] [CrossRef]
  74. Tsuritani, T.; Mizuno, H.; Nonoyama, N.; Kii, S.; Akao, A.; Sato, K.; Mase, T. Efficient synthesis of 1,4-diaryl-5-methyl-1,2,3-triazole, a potential mGluR1 antagonist, and the risk assessment study of arylazides. Org. Process Res. Dev. 2009, 13, 1407–1412. [Google Scholar] [CrossRef]
  75. Harter, M.; Kosemund, D.; Delbeck, M.; Kalthof, B.; Wasnaire, P.; Submeier, F.; Lustig, K. Heterocyclylmethyl-Thienouracile as Antagonists of the Adenosine-A2B-Receptor. US 20180065981 A1, 8 March 2018. [Google Scholar]
Figure 1. The common structure of ABZI agonists (the part inside the dotted line box is the core group, and R1–R4 represent substituents).
Figure 1. The common structure of ABZI agonists (the part inside the dotted line box is the core group, and R1–R4 represent substituents).
Pharmaceuticals 19 00387 g001
Figure 2. The linear regression relationship between experimental and predicted activity for training set (A) and test set (B). (The correlation coefficients of are 0.7170 and 0.6873 respectively.)
Figure 2. The linear regression relationship between experimental and predicted activity for training set (A) and test set (B). (The correlation coefficients of are 0.7170 and 0.6873 respectively.)
Pharmaceuticals 19 00387 g002
Figure 3. Contour Maps of Steric (A) and Electrostatic Fields (B) for R1-R4 Substituents of Reference Compound D59. For Contour Maps of Steric: green regions indicate where bulky substituents increase biological activity, while yellow regions indicate where less bulky substituents favor higher activity; For Electrostatic Fields: blue regions favor positively charged groups, and red regions favor electronegative groups for improved agonist activity.
Figure 3. Contour Maps of Steric (A) and Electrostatic Fields (B) for R1-R4 Substituents of Reference Compound D59. For Contour Maps of Steric: green regions indicate where bulky substituents increase biological activity, while yellow regions indicate where less bulky substituents favor higher activity; For Electrostatic Fields: blue regions favor positively charged groups, and red regions favor electronegative groups for improved agonist activity.
Pharmaceuticals 19 00387 g003
Figure 4. Surface representation of STING with ten compounds bound in the ligand-binding pocket. Each color represents a different compound.
Figure 4. Surface representation of STING with ten compounds bound in the ligand-binding pocket. Each color represents a different compound.
Pharmaceuticals 19 00387 g004
Figure 5. Structures of six STING agonists.
Figure 5. Structures of six STING agonists.
Pharmaceuticals 19 00387 g005
Figure 6. RMSD values of (A) the complexes and (B) ligands from three parallel 300 ns molecular dynamics simulations (black, red, and blue).
Figure 6. RMSD values of (A) the complexes and (B) ligands from three parallel 300 ns molecular dynamics simulations (black, red, and blue).
Pharmaceuticals 19 00387 g006
Figure 7. Surface representation of STING and structural superposition of four ligands after MD simulations.
Figure 7. Surface representation of STING and structural superposition of four ligands after MD simulations.
Pharmaceuticals 19 00387 g007
Figure 8. Binding modes of the STING protein complex systems. (A) STING-D59, (B) STING-M13 complex and (C) STING-M44. Dashed lines of different colors indicate specific intermolecular interactions: yellow for hydrogen bonds, cyan for salt bridges, magenta for π-π stacking interactions, and salmon for hydrophobic interactions.
Figure 8. Binding modes of the STING protein complex systems. (A) STING-D59, (B) STING-M13 complex and (C) STING-M44. Dashed lines of different colors indicate specific intermolecular interactions: yellow for hydrogen bonds, cyan for salt bridges, magenta for π-π stacking interactions, and salmon for hydrophobic interactions.
Pharmaceuticals 19 00387 g008
Figure 9. Residue energy contributions to the binding free energy (only residues with an energy contribution magnitude greater than 0.8 kcal/mol are shown).
Figure 9. Residue energy contributions to the binding free energy (only residues with an energy contribution magnitude greater than 0.8 kcal/mol are shown).
Pharmaceuticals 19 00387 g009
Table 1. Y-randomization validation results of the Topomer CoMFA model.
Table 1. Y-randomization validation results of the Topomer CoMFA model.
Iterationq2r2
Random 1−0.00260.2172
Random 2−0.0455−0.0622
Random 3−0.03870.0885
Random 4−0.0235−0.1351
Random 5−0.0445−0.0147
Random 60.0100−0.2098
Random 7−0.03860.0363
Random 8−0.04950.0368
Random 9−0.04540.0463
Random 10−0.04050.0827
Table 2. Docking scores of the top 50 compounds based on the London dG scoring function, fragment growth scores, and protein–ligand binding free energies obtained from 100 ns molecular dynamics simulations. Na denotes unavailable values.
Table 2. Docking scores of the top 50 compounds based on the London dG scoring function, fragment growth scores, and protein–ligand binding free energies obtained from 100 ns molecular dynamics simulations. Na denotes unavailable values.
IdCompound NamePred (pEC50)London dG (kcal/mol)Docking Scores (kcal/mol)Binding Free Energy PB (kcal/mol)
1D596.34Na−8.47−38.88 ± 0.13
2O4H26.41Na−8.24Na
3O4O26.24Na−8.75Na
4O4B6.43Na−7.99Na
5O4C6.42Na−8.82Na
6O4F6.40Na−8.29Na
7O4CH6.49Na−8.34Na
8M16.33−15.10−9.83−40.57 ± 0.15
9M36.71−15.18−9.68−34.14 ± 0.20
10M66.50−15.35−9.65−30.60 ± 0.34
11M76.09−16.36−9.63−38.77 ± 0.18
12M116.31−16.41−9.58−40.31 ± 0.26
13M136.57−15.15−9.57−41.15 ± 0.18
20M445.98−15.16−9.36−43.47 ± 0.18
Table 3. Structures of the design compound of the present study.
Table 3. Structures of the design compound of the present study.
IdCompound NameCompound StructureR3R4
1D59Pharmaceuticals 19 00387 i001Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i003
2O4H2 Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i004
3O4O2 Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i005
4O4B Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i006
5O4C Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i007
6O4F Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i008
7O4CH Pharmaceuticals 19 00387 i002Pharmaceuticals 19 00387 i009
8M1 Pharmaceuticals 19 00387 i010Pharmaceuticals 19 00387 i003
10M3 Pharmaceuticals 19 00387 i011Pharmaceuticals 19 00387 i009
11M6 Pharmaceuticals 19 00387 i012Pharmaceuticals 19 00387 i007
12M7 Pharmaceuticals 19 00387 i013Pharmaceuticals 19 00387 i005
13M11 Pharmaceuticals 19 00387 i014Pharmaceuticals 19 00387 i005
14M13 Pharmaceuticals 19 00387 i015Pharmaceuticals 19 00387 i004
15M16 Pharmaceuticals 19 00387 i016Pharmaceuticals 19 00387 i004
16M20 Pharmaceuticals 19 00387 i017Pharmaceuticals 19 00387 i005
17M23 Pharmaceuticals 19 00387 i018Pharmaceuticals 19 00387 i005
18M25 Pharmaceuticals 19 00387 i019Pharmaceuticals 19 00387 i004
19M38 Pharmaceuticals 19 00387 i020Pharmaceuticals 19 00387 i004
20M44 Pharmaceuticals 19 00387 i021Pharmaceuticals 19 00387 i004
Table 4. Binding free energies obtained from three parallel 300 ns MD simulations (kcal/mol).
Table 4. Binding free energies obtained from three parallel 300 ns MD simulations (kcal/mol).
Parallel 1Parallel 2Parallel 3Parallel 4Total Gbind
D59−32.02−40.87−34.30−35.91−35.78 ± 1.88
M1−39.18−35.42−37.10−38.78−37.62 ± 0.86
M7−34.95−41.16−39.92−40.66−39.17 ± 1.43
M11−34.64−35.83−31.98−36.20−34.66 ± 0.95
M13−41.01−45.78−43.27−43.14−43.30 ± 0.98
M44−42.89−39.60−45.77−41.77−42.51 ± 1.28
Table 5. Hydrogen bonds formed in these systems.
Table 5. Hydrogen bonds formed in these systems.
SystemAcceptorDonorHydrogen OccupancyAvgDist
(Å)
AvgAng
(°)
STING-M13Lig-N5SER162(A)-OG79.71%2.83163.57
Lig-O3SER241(A)-N64.52%2.87160.81
Lig-O1THR263(A)-OG150.67%2.73156.40
SER241(A)-OLig-N641.43%2.88162.17
STING-M44Lig-N5SER162(A)-OG69.52%2.84163.13
Lig-O1THR263(A)-OG167.27%2.70158.59
Lig-O3SER241(A)-N63.32%2.85157.79
SER241(A)-OLig-N673.66%2.85162.27
VAL_239(B)-OLig-N1121.04%2.88148.53
STING-D59Lig-O3SER162(A)-OG59.62%2.76164.39
SER162(B)-OLig-N649.43%2.86152.50
Table 6. The bioavailability and pharmacokinetics prediction.
Table 6. The bioavailability and pharmacokinetics prediction.
PropertiesD59M11M13M44
MW (g/mol)556.25687.28670.33667.33
LogS (logmol/L)−4.519−3.831−2.964−2.945
LogP (logmol/L)3.3432.2951.8591.769
SAscoreeasyeasyeasyeasy
Lipinskiexcellentexcellentexcellentexcellent
Pfizerexcellentexcellentexcellentexcellent
Absorption
HIAexcellentexcellentexcellentexcellent
PAMPA Permeabilityexcellentexcellentexcellentexcellent
Distribution
PPBpoorpoorexcellentexcellent
Volume Distributionexcellentexcellentexcellentexcellent
BBB Penetrationpoorpoorexcellentmedium
Metabolism
CYP1A2 inhibitor/substrateNo/YesNoNo/YesNo/Yes
CYP2C19 inhibitor/substrateNoNoNoNo
CYP2C9 inhibitor/substrateNoYES/NoNoNo
CYP2D6 inhibitor/substrateNo/YesNoNoNo
CYP3A4 inhibitor/substrateNoYES/NoNoNo
CYP2B6 inhibitor/substrateNoNoNoNo
Excretion
CLplasmaexcellentexcellentexcellentexcellent
T1/2poormediummediumpoor
Toxicity
hERG Blockers (10 uM)poorpoorpoormedium
Skin Sensitizationexcellentpoormediummedium
AMES Mutagenicitypoorpoormediummedium
Carcinogencitypoorpoorexcellentmedium
Hematotoxicityexcellentmediumexcellentexcellent
A549 Cytotoxicitymediumexcellentexcellentexcellent
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ren, H.; Jin, Y.; Zhao, B.; Peng, X.; Zhao, S.; Wang, M. In Silico Discovery of ABZI Nitrogen Heterocycle STING Agonists via 3D-QSAR, Molecular Dynamics, and AI-Based Synthesis Prediction. Pharmaceuticals 2026, 19, 387. https://doi.org/10.3390/ph19030387

AMA Style

Ren H, Jin Y, Zhao B, Peng X, Zhao S, Wang M. In Silico Discovery of ABZI Nitrogen Heterocycle STING Agonists via 3D-QSAR, Molecular Dynamics, and AI-Based Synthesis Prediction. Pharmaceuticals. 2026; 19(3):387. https://doi.org/10.3390/ph19030387

Chicago/Turabian Style

Ren, Houcheng, Yuhong Jin, Baipu Zhao, Xiangbing Peng, Shan Zhao, and Meiting Wang. 2026. "In Silico Discovery of ABZI Nitrogen Heterocycle STING Agonists via 3D-QSAR, Molecular Dynamics, and AI-Based Synthesis Prediction" Pharmaceuticals 19, no. 3: 387. https://doi.org/10.3390/ph19030387

APA Style

Ren, H., Jin, Y., Zhao, B., Peng, X., Zhao, S., & Wang, M. (2026). In Silico Discovery of ABZI Nitrogen Heterocycle STING Agonists via 3D-QSAR, Molecular Dynamics, and AI-Based Synthesis Prediction. Pharmaceuticals, 19(3), 387. https://doi.org/10.3390/ph19030387

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop