Next Article in Journal
Invariant Natural Killer T Cells in Cancer Immunotherapy: Lipid-Based Modulation, Nanotechnology, and Translational Advances
Next Article in Special Issue
From Local Tissue Repair to Fibrosis: Deciphering Gene Co-Expression Networks in Benign Pulmonary Nodules and Idiopathic Pulmonary Fibrosis Comorbidity via Bioinformatics and Machine Learning
Previous Article in Journal
Metallomic Analysis of Vitreous Humor of the Human Eye—A Post-Mortem Multielemental Study
Previous Article in Special Issue
Benchmarking edgeR and methylKit for the Detection of Differential DNA Methylation: A Methodological Evaluation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Computational Identification of Potential Novel Allosteric IHF Inhibitors Using QSAR Modeling to Inhibit Plasmid-Mediated Antibiotic Resistance

by
Oscar Saurith-Coronell
1,2,
Olimpo Sierra-Hernandez
1,2,
Juan David Rodríguez-Macías
3,*,
José R. Mora
4,
Noel Perez-Perez
5,
Jackson J. Alcázar
6,
Ricardo Olimpio de Moura
7,
Igor José dos Santos Nascimento
7,
Edgar A. Márquez Brazón
2,* and
Yovani Marrero-Ponce
8,9
1
Departamento de Medicina, División Ciencias de la Salud, Universidad del Norte, Km 5, Vía Puerto Colombia, Puerto Colombia 081007, Colombia
2
Grupo de Investigaciones en Química y Biología, Departamento de Química y Biología, Facultad de Ciencias Básicas, Universidad del Norte, Carrera 51B, Km 5, Vía Puerto Colombia, Barranquilla 081007, Colombia
3
Facultad de Ciencias de la Salud, Exactas y Naturales, Universidad Libre, Barranquilla 080001, Colombia
4
Grupo de Química Computacional y Teórica (QCT-USFQ), Departamento de Ingeniería Química, Universidad San Francisco de Quito, Diego de Robles y Vía Interoceánica, Quito 170901, Ecuador
5
Colegio de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito USFQ, Quito 170157, Ecuador
6
Centro de Química Médica, Facultad de Medicina Clínica Alemana, Universidad del Desarrollo, Santiago 7780272, Chile
7
Programa de Pós-Graduação em Ciências Farmacêuticas (PPGCF), Universidade Estadual da Paraíba (UEPB), Campina Grande 58429-500, Brazil
8
Facultad de Ingeniería, Universidad Panamericana, Ciudad de México 03920, Mexico
9
Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Universidad San Francisco de Quito (USFQ), Quito 170157, Ecuador
*
Authors to whom correspondence should be addressed.
Int. J. Mol. Sci. 2026, 27(6), 2526; https://doi.org/10.3390/ijms27062526
Submission received: 25 January 2026 / Revised: 23 February 2026 / Accepted: 5 March 2026 / Published: 10 March 2026
(This article belongs to the Special Issue Benchmarking of Modeling and Informatic Methods in Molecular Sciences)

Abstract

The rapid spread of antibiotic resistance through plasmid-mediated conjugation remains a primary global health concern. Despite its critical role in horizontal gene transfer, no approved drugs currently target this process, leaving a critical therapeutic gap. Integration Host Factor (IHF), a DNA-binding protein essential for plasmid replication and mobilization, emerges as a promising yet underexplored target for anti-conjugation strategies. This work aimed to develop a predictive computational model and identify small molecules that disrupt IHF function, thereby reducing plasmid transfer and limiting resistance gene dissemination. A curated dataset of 65 compounds with reported anti-plasmid activity was analyzed using a 3D-QSAR model based on algebraic descriptors computed with QuBiLS-MIDAS. The model was validated through leave-one-out cross-validation (Q2 = 0.82), Tropsha’s criteria, and Y-scrambling. Representative compounds were selected via pharmacophore clustering and evaluated through molecular docking at both the DNA-binding site and a predicted allosteric pocket of IHF. The most promising complexes underwent 200 ns molecular dynamics simulations to assess stability and interaction patterns. The QSAR model demonstrated strong predictive performance (R2 = 0.90). Docking simulations revealed more favorable binding energies at the allosteric site (up to −12.15 kcal/mol) compared to the DNA-binding site. Molecular dynamics confirmed the stability of these interactions, with allosteric complexes showing lower RMSD fluctuations and consistent binding energy profiles. Dynamic cross-correlation analysis revealed that allosteric ligand binding induces conformational changes in key catalytic residues, including Pro65, Pro61, and Leu66. These alterations may compromise DNA recognition and disrupt the initiation of replication. To our knowledge, this is the first computational study proposing allosteric inhibition of IHF as an anti-conjugation strategy. These findings provide a foundation for experimental validation and the development of novel agents to prevent horizontal gene transfer, offering a promising approach to restoring antibiotic efficacy against multidrug-resistant pathogens.

1. Introduction

Antibiotic resistance has emerged as a critical global public health threat, undermining the effectiveness of antibiotics in treating bacterial infections [1]. Current estimates suggest that bacterial infections are responsible for approximately 7.7 million deaths annually, with nearly 5 million attributed to multidrug-resistant (MDR) pathogens. Infections caused by resistant organisms are not only more challenging to treat but are also associated with markedly prolonged hospital stays. For instance, Gram-negative bacterial infections extend hospitalization by an average of 32.6 days, while severe bloodstream infections can increase hospital stays by up to two months compared with non-infected patients [2]. Beyond the clinical burden, antibiotic resistance imposes substantial economic strain on healthcare worldwide by increasing the consumption of medical resources. Consequently, the World Health Organization has prioritized the development of new therapeutic strategies targeting MDR bacteria [3].
Antimicrobial resistance arises from complex interactions between microorganisms and their environment. In clinical settings, acquired resistance in previously susceptible bacterial populations typically arise from prolonged and uncontrolled antibiotic exposure. Under selective pressure, bacteria develop two principal genetic mechanisms for survival: gene mutation and horizontal gene transfer (HGT) [4,5,6]. Mutations can modify antimicrobial targets, activate enzymatic pathways that inactivate drugs, decrease antibiotic permeability, or induce metabolic changes that bypass drug effects. HGT, recognized as a central driver of antimicrobial resistance emergence and spread, enables bacteria to incorporate foreign genetic material through three distinct pathways: transformation, transduction mediated by bacteriophages, and conjugation, the most common mechanism in clinical settings involving plasmid transfer between cells via direct contact [7,8].
Conjugation is the most important mechanism for horizontal dissemination of antibiotic resistance genes among bacterial populations. This multi-step process begins with cellular contact between donor and recipient cells, mediated by a pilus structure facilitating DNA transfer [9]. The Integration Host Factor (IHF) is a heterodimeric protein (~20 kDa) that binds specifically to DNA and functions as an accessory factor in various cellular processes in prokaryotes, primarily by inducing DNA bending, inducing a U-turn in the DNA structure that is necessary for the assembly of the relaxosome a protein complex. These essential functions position IHF as a promising molecular target for inhibiting plasmid maintenance and horizontal gene transfer, potentially through disruption of its DNA-binding and structural modulation activities [10]. Once cell-to-cell contact is established and plasmid replication is completed, conjugative machinery composed of a type IV coupling protein (T4CP) and type IV secretion system (T4SS) is activated. These genes, typically located on autonomously replicating plasmids, are transcribed upon recognition of the origin of transfer (oriT) by the relaxase protein, which triggers precise cleavage and the mobilization of plasmid DNA [11,12]. Conjugative plasmids are transmitted between bacteria with remarkable efficiency and are classified based on compatibility groups, relaxase families, or the resistance genes they carry. For example, the incompatibility plasmid linage Fertility plasmid group (IncF) belonging to the mobility plasmidum F-type (MOBF) family comprises low-copy number conjugative plasmids (45–200 Kb) commonly found in Gram-negative bacteria, including Escherichia coli, Klebsiella pneumoniae, and Enterobacteriaceae members. Their clinical significance stems from the frequent presence of extended-spectrum β-lactamase genes (ESBLs), such as bla-CTX-M, and carbapenemase genes conferring carbapenem resistance in Enterobacteriaceae (CRE), alongside resistance genes to aminoglycosides, quinolones, and tetracyclines [13,14,15,16,17,18].
While recent antibiotics such as cefiderocol target multidrug-resistant Gram-negative bacteria, this approach offers only short-term solutions. Alternative strategies are being explored, including adjuvants like SPR741, a polymyxin-B–derived cationic peptide that acts as an antibiotic potentiator by disrupting the outer membrane of Gram-negative bacteria, thereby enhancing the efficacy of co-administered antibiotics reducing nephrotoxicity and enabling higher antibiotic dosing [19,20]. However, neither strategy directly addresses the prevention of resistance gene spread or the elimination of the genetic machinery. Currently, no approved clinical drugs directly inhibit horizontal gene transfer in bacteria. Promising compounds under investigation include abacavir and azidothymidine (AZT), antiretroviral analogs reducing pCT plasmid conjugation in E. coli by 83% and pKpQIL plasmid conjugation in K. pneumoniae by 80%. Linoleic acid, which inhibits TrwD ATPase activity within the T4SS, also demonstrates conjugation-inhibiting properties. These compounds promote plasmid-borne genetic material loss, reducing acquired resistance mechanisms in bacterial populations [21,22].
In silico strategies have emerged as efficient, cost-effective tools for drug development, enabling rapid screening of thousands of potential molecules using computational methods that are significantly faster than traditional approaches. These strategies contribute to a deeper molecular understanding of drug behavior and mechanisms of action, which is crucial for early modifications that optimize biological activity and ADME-Tox properties (absorption, distribution, metabolism, excretion, and toxicity). Molecular docking and molecular dynamics simulations represent the most widely used computational methods, exploring ligand-protein complex conformations and stability while estimating interaction energies. MM-PBSA and MM-GBSA approaches calculate these energies by accounting for bond lengths, angles, torsions, van der Waals interactions, electrostatic forces, and solvent-accessible surface area, enabling optimization of candidate molecules’ pharmacological and toxicological profiles. Quantitative Structure Activity Relationship (QSAR) analysis, a predictive modeling technique increasingly incorporated into drug development, establishes quantitative relationships between compound structural features and biological activity. QSAR models identify molecular characteristics associated with specific pharmacological properties, including mechanism of action, toxicity, and selectivity, enabling informed modifications to optimize performance and safety profiles [23,24,25]. QSAR strategies have been successfully applied to HIV-1 protease inhibitor development, leading to the identification of novel potential inhibitors and the optimization of properties such as hydrophobicity. Clinically available drugs such as Indinavir and Darunavir served as reference compounds guiding the design and refinement of new drugs [26,27,28].
Given the high prevalence of multidrug-resistant organisms that utilize conjugation to transfer virulence and resistance factors, we propose a comprehensive computational approach integrating QSAR modeling, target identification, molecular docking, and molecular dynamics simulations [29,30]. This strategy investigates the mechanisms of action of compounds that demonstrate plasmid conjugation-inhibition activity. Our primary objective is to understand how specific molecules interfere with conjugation, elucidate structure-activity relationships, predict binding affinity and complex stability, and establish a predictive model to guide the development of new potential plasmid conjugation inhibitors. Such inhibitors could meaningfully reduce the spread of antibiotic resistance among clinically relevant bacterial strains.

2. Results

To develop a reliable predictive model, we first established a structured methodological sequence. This process began with data collection, followed by the construction of a QSAR model to identify molecular properties that contribute to plasmid conjugation inhibition. Next, a pharmacophore search was conducted to predict potential molecular targets affected by the compounds. Finally, molecular docking and molecular dynamics simulations were performed to analyze interactions between the compounds and their targets, assess the stability of the complexes, and propose plausible mechanisms of action for inhibiting plasmid transfer.

2.1. Dataset Preparation

To construct the training dataset for our predictive model, we selected 65 compounds with reported anti-plasmid activity against the pCRE BIDMC20a plasmid. This selection was guided by defined inclusion criteria applied to the ICCB-Longwood Screening Facility database at Harvard Medical School, which houses a chemically diverse array of bioactive molecules, including FDA-approved drugs, kinase inhibitors, ion channel modulators, and other pharmacologically relevant agents [31]. These compounds were chosen to ensure both clinical relevance and mechanistic diversity, providing a solid foundation for subsequent modeling and analysis. All selected compounds were retrieved from the PubChem database https://pubchem.ncbi.nlm.nih.gov (accessed on 22 February 2025), and their corresponding Compound ID (CID) codes were extracted for standardized reference and integration. To facilitate consistent referencing, each compound was assigned a unique identifier in the format ICCB-n. Biological activity was expressed as the Anti-Plasmid Inhibition Index (APII), a metric designed to capture plasmid stability relative to bacterial growth. To calculate APII, we used the ratio of relative fluorescence units (RFU), which reflect plasmid-associated fluorescence, to optical density at 600 nm (OD600), an indicator of cell density. This normalization ensures that fluorescence signals are not biased by variations in bacterial growth, providing a more accurate measure of plasmid maintenance under different conditions.
To improve data consistency and reduce skewness, APII values were log-transformed before statistical modeling. Log transformation is a standard practice in high-throughput screening and QSAR studies because it stabilizes variance and helps approximate normality in biological datasets. Similar approaches have been reported in fluorescence-based plasmid transfer assays and conjugation frequency studies [32,33,34,35,36,37]. Therefore, higher log-transformed values correspond to greater inhibition of plasmid maintenance [38]. The complete dataset, including these normalized and transformed values, is presented in Table 1.
Following the initial compound selection, structural refinement was performed using Density Functional Theory (DFT) calculations with the B3LYP functional. To ensure that each optimized geometry represented a true energy minimum, vibrational frequency analyses were performed; only structures exhibiting exclusively positive frequencies were retained [39]. This optimization step enabled the determination of an extensive set of molecular descriptors, encompassing thermodynamic, electronic, and topological properties. In total, approximately 16,000 descriptors were computed for each compound. Subsequent correlation analyses facilitated the development of a predictive mathematical model based on a subset of seven descriptors [40,41]. These selected parameters, primarily algebraic in nature, are presented in Table 2.
In the final model, descriptors X1–X7 correspond to 3D indices generated using the QuBiLS-MIDAS (MID) methodology at the atom-based (AB) level. These descriptors encode geometric patterns between atoms through duplex and ternary relationships, employing different distance metrics and various matrix types, including global indices (T) and, in some cases, full matrix preservation (KA). Regarding their chemical content, X1 is a non-chiral bilinear index associated with a hardness (h) softness (s) term; X2 is a linear, chiral index with kurtosis, using a topological cutoff LGP[2] and weighted by polar surface area (PSA); X3 is a non-chiral bilinear index with skewness that incorporates a terminal-methyl (M) structural term, applies a geometric cutoff LGL[2–3], and is weighted by AlogP (a) and polarizability (p); X4 is a non-chiral bilinear index with skewness, constructed with an LGP[2] cutoff and weighted by electronegativity (e) and polarizability (p); X5 is a linear, chiral index with kurtosis, centered on heteroatoms (X) and weighted by charge (c); X6 is a non-chiral trilinear-cubic index (triplets) including SS2 transformations, cutoffs LG3P[1]/LGP[1], and hardness (h) weighting; and X7 is a linear, chiral index with skewness focused on heteroatoms (X) and weighted by charge (c). Together, these descriptors capture a broad combination of geometric, electronic, and physicochemical information.
The QSAR model points to two complementary factors driving the APII. First, the feature importance analysis based on standardized coefficients and the relative R2 contributions shows that descriptors X3, X1, and X2 provide the largest share of explanatory power. All three have negative coefficients, indicating that higher values of the structural and physicochemical patterns captured by X1–X4, particularly polar-surface and electronic-gradient terms, PSA-related information in X2, and the LogP/polarizability/electronic components represented in X3 and X4, are associated with reduced predicted activity. This outcome is consistent with the observation that lower polar surface area, more favorable lipophilicity profiles, and specific electronic-property balances correlate with increased APII within the dataset.
Second, descriptors reflecting charge distribution and electronic stability (X5–X7) exhibit positive coefficients. This suggests a modest beneficial contribution of charge-related patterns (X5 and X7) and global hardness (X6) to activity. However, the importance rankings indicate that these contributions are secondary in magnitude, and only X5 shows strong statistical support in the current model; X6 and X7 contribute positively but more weakly. Taken together, these results indicate that the model preferentially favors ligands with reduced polarity (lower PSA-related signatures) and physicochemical features compatible with membrane penetration and hydrophobic interactions, while charge-distribution patterns and hardness-related stability offer an additional, though comparatively smaller, positive contribution. Overall, this integrated interpretation provides a coherent framework to guide the design of compounds with an improved APII.

2.2. QSAR Modeling and Validation

For modeling purposes, the Anti-Plasmid Inhibitor Index (APII) was designated as the dependent variable, and the computed descriptors were used as independent variables. To explore the predictive relationship between molecular structure and biological activity, a range of statistical modeling techniques was employed. Among these, regression-based approaches were particularly instrumental in constructing robust mathematical models capable of correlating the calculated molecular descriptors with the APII [42,43]. Several models were generated during this process; however, one demonstrating the highest statistical significance and predictive performance was selected for further analysis. This final model incorporates the previously described descriptors and is presented in Equation (1).
A P I I   =   0.6445     1.5200   X 1     0.0038 X 2     0.0203 X 3     0.0363 X 4   +   0.0038 X 5   +   6.0267 X 6     +   0.0046 X 7
N = 65, R2 = 0.90, Q2 = 0.82
Analysis of the descriptors included in the model suggests that compounds exhibiting higher biological activity tend to possess greater electronic hardness and branched or cyclic structures. Molecules with a balanced global electronic hardness and softness, combined with intermediate physicochemical properties such as polarizability, charge distribution, logP, and polar surface area (PSA) were favored. The corresponding descriptor values are presented in detail within Supplementary Table S1. Conversely, structures lacking these characteristics were penalized during model optimization. So far, the model indicates a preference for rigid molecular frameworks with regions of high electronic density and a well-defined three-dimensional architecture, as these features appear to enhance the likelihood of effective interaction with the target protein.
To validate the predictive performance of the model, leave-one-out cross-validation (LOOCV) was employed [44]. This technique involves iteratively excluding one compound from the dataset, training the model on the remaining n − 1 compounds, and then predicting the excluded compound’s activity. The process is repeated for each compound, and the average prediction error across all iterations is calculated to assess model reliability [45]. The resulting model demonstrated strong predictive power, with a cross-validated coefficient of determination (Q2) of 0.8232, alongside low error metrics (RMSEP = 0.0413; MAEext = 0.0337). Figure 1 compares predictions obtained using the OLS-Hold-Out method and LOOCV.
To ensure that all training molecules fell within the applicability domain of the QSAR model, a Williams plot was generated (Supplementary Figure S1). This plot shows standardized residuals reflecting prediction accuracy against leverage values, which indicate each compound’s influence on the regression model. Higher leverage values correspond to a stronger impact on the equation. The horizontal boundaries at ±3σ (three standard deviation units) define the acceptable residual range, while the vertical threshold at h = 0.369 represents the calculated leverage limit. All training compounds were located within these boundaries, confirming the absence of structural or response outliers. This outcome supports the reliability of the predictions and reinforces the robustness and interpretability of the QSAR model [46,47,48].
Additional validation of the model was carried out using Tropsha’s criteria for predictive QSAR models. In Table 3, K and K′ represent the slopes of the regression lines obtained from the plot of predicted versus observed activities for the external and internal validation sets, respectively; values close to 1 indicate that the QSAR model has appropriate scaling and is neither over- nor under-estimating the response. Moreover, the value of ∣R2 − R02∣/R2 was 0.0354, reflecting strong model robustness. Similarly, the difference ∣R02 − R02∣ was below 0.1, further supporting the reliability of the model [49,50]. A comprehensive summary of the statistical analysis and metric validation is presented in Table 3.
To further assess the model’s robustness, a Y-scrambling analysis was performed. This validation technique yielded negative R2 values (mean = −0.19; standard deviation = 0.10), effectively ruling out spurious correlations and confirming the statistical validity of the model. Additionally, the model’s performance was compared with alternative regression algorithms. Notably, the Random Forest approach produced a lower predictive accuracy (R2 = 0.55), underscoring the superior performance of the linear model. These findings support the appropriateness of the linear structure in capturing. All those validation schemes support the predictive reliability of the model to predict the APII of different molecules [51]. Taken together, these validation approaches support the model’s reliability in predicting APII values across structurally diverse compounds. However, it is important to recognize that the relatively small number of compounds used during the development of the QSAR model may limit its performance when applied to external datasets. This reduced dataset size could increase the risk of overfitting and, consequently, hinder the model’s ability to generalize to new chemical structures [52].

2.3. Cluster Analysis and Selection of Representative Compound

Following the development of the predictive model, a cluster analysis was conducted to identify structurally related groups among the 65 compounds used in model training. The three-dimensional structures of these molecules were utilized to generate pharmacophore maps using the pmapper Python 3.12.3 library, which integrates the feature-detection capabilities of RDKit https://www.rdkit.org (accessed on 1 April 2025). Key pharmacophoric features such as hydrogen bond donors and acceptors, hydrophobic regions, aromatic rings, and ionizable groups were included.
Each feature was encoded based on its 3D coordinates, establishing a geometric center that reflects the principal structural characteristics of each molecule. These pharmacophores were then represented using canonical signatures and 2048-bit binary fingerprints. Importantly, the 3D signatures allowed for the identification of pharmacophoric similarities across compounds without requiring molecular alignment, thereby avoiding biases introduced by differences in molecular size. To ensure consistency, spatial parameters such as distances and planarity were discretized using tolerance settings defined by pmapper. The resulting fingerprints were used to compute Tanimoto similarity scores, which served as the basis for clustering [53]. This process yielded a major cluster comprising 31 compounds, from which a single representative molecule was selected. The remaining compounds were distributed across smaller clusters. The representative compound of the largest cluster is ICCB-3 (8-Hydroxydaidzein), illustrated in Figure 2.
ICCB-3 has a molecular weight of 270.24 g/mol and belongs to the isoflavone, a class of natural compounds within the flavonoid family, a group of polyphenols predominantly found in legumes. Isoflavones have garnered significant interest in pharmaceutical research due to their antioxidant and cardiovascular properties, as well as their potential role in hormone-dependent cancers [54,55].

2.4. Target Prediction

To explore the potential mechanism of action underlying the observed inhibitory activity, a target prediction analysis was conducted using PharmMapper https://www.lilab-ecust.cn/pharmmapper (accessed on 5 April 2025), to identify plausible biological targets for the representative compound ICCB-3. Its 3D structure was first used to generate pharmacophoric features [56,57]. These features were then aligned and analyzed using PharmMapper. To enhance the accuracy and flexibility of the mapping process, a conformer search was performed, generating approximately 500 conformations using the MMFF94 force field. The pharmacophore mapping was conducted against a curated database of 16,159 druggable pharmacophore models. This extensive dataset includes experimentally validated protein-ligand interactions derived from natural ligands, known drug targets, and repurposed drugs. Based on pharmacophoric similarity, PharmMapper identified approximately 1000 potential targets. From this pool, a rigorous selection process was undertaken to prioritize targets involved in key biological processes such as plasmid replication, regulation, and transfer. Among the top candidates, two targets were identified, with particular emphasis on the Integration Host Factor (IHF).

2.5. Docking Protocol

Following the QSAR analysis and subsequent target identification, we proceeded with molecular docking simulations to deepen our understanding of the potential mechanisms of action of the selected compounds. This approach enabled us to explore potential interactions between the ligands and IHF, providing valuable insights into their binding behavior. By integrating QSAR modeling with molecular docking techniques, we aimed to establish a qualitative and, potentially, quantitative relationship between molecular structure and biological activity [58,59]. The docking simulations were conducted using IHF as the target protein, focusing on a set of 31 compounds that belong to a representative cluster. These molecules were selected based on their apparent shared mechanism of action, which involves plasmid inhibition. The combined computational strategy not only supports the hypothesis of a typical interaction pathway but also enhances the predictive power of our analysis regarding ligand efficacy and specificity [60,61].
The initial hypothesis proposes that the selected compounds interfere with the interaction between IHF and plasmid DNA. This inhibition could occur through two primary mechanisms: (i) direct binding to amino acid residues involved in DNA recognition, or (ii) induction of conformational changes in the protein that alter its interaction with DNA, potentially through an allosteric mechanism [62]. To investigate these possibilities, two distinct approaches were used to define the docking grid parameters. The first approach relied on a comprehensive literature review to identify key residues responsible for DNA binding. Among these, Pro65 was highlighted for its role in binding to the minor groove and inducing DNA bending. Additional residues, such as Arg63 and Thr60, were also considered for their functional importance in IHF-DNA interactions [63]. Based on this information, the docking grid was centered at X: 2, Y: 14, Z: 4, with dimensions of 50 Å × 50 Å × 50 Å. The second approach involved a cavity search using the web-based platform CBDock2 and the open-source tool fpocket. CBDock2 integrates three methodologies: solvent-accessible surface area (SASA) analysis, structure-based docking with AutoDock Vina 1.1.2, and template-based docking, which compares the target protein with structurally similar proteins and their known ligands [64]. In parallel, fpocket employs a Voronoi tessellation algorithm to partition space into regions of atomic influence, identifying empty pockets on the protein surface and interior that may serve as druggable sites. Both methods converged on the same potential allosteric pocket, which was selected as the second docking grid, centered at X: −2, Y: −9, Z: 34, with the same dimensions of 50 Å × 50 Å × 50 Å [65].
Molecular docking simulations were performed using AutoDockGPU, a high-performance tool that leverages GPU resources to efficiently generate a large number of potential binding poses between the receptor and each ligand [66]. For each compound, 1000 conformations were generated, using both docking grids previously described: one targeting the DNA-binding site and the other a potential allosteric site. The resulting docking data were analyzed based on several parameters: the best binding energy (lowest conformational energy), the mean binding energy across all poses, the standard deviation of the energy values, and the root mean square deviation (RMSD) of the conformational clusters formed between the ligand and the active site. These metrics provided a comprehensive view of the binding stability and variability for each compound. The results of the docking simulations are summarized in Table 4, which presents data for both proposed mechanisms of action: direct interaction with DNA-binding residues and allosteric modulation of the IHF structure.
The results of the molecular docking simulations provide valuable insights into potential interactions between the selected compounds and the IHF protein. Moreover, correlating these findings with the QSAR model outcomes contributes to validating the predictive power of the computational approach. In general, the binding energies associated with the direct inhibition mechanism ranged between −6 and −9 kcal/mol. In contrast, the docking results targeting the proposed allosteric site showed slightly more favorable binding energies, ranging from −7 to −11 kcal/mol. This suggests that the compounds may interact more effectively with the allosteric region of the protein. Among the tested molecules, ICCB-26 stood out for the most favorable binding energy at the allosteric site and for strong interaction in the direct inhibition model. Notably, this compound also demonstrated the highest biological activity in experimental assays. ICCB-24 showed high binding affinity and significant biological activity, whereas ICCB-53, despite its strong binding energy, displayed lower biological activity. This apparent discrepancy between docking scores and biological activity suggests that additional factors may influence the observed bioactivity. These could include differences in membrane permeability, compound stability, or the possibility of a multitarget mechanism of action. To further elucidate these findings, a detailed analysis of the molecular interactions between the compounds and IHF was conducted, focusing on both the DNA-binding site and the proposed allosteric site. To further explore the molecular basis underlying the docking and biological activity results, a detailed interaction analysis was performed for ICCB-26, ICCB-24, and ICCB-53, as well as ICCB-3 the representative compound of the cluster and ICCB-43, which exhibited the weakest performance in both docking simulations and biological assays. The chemical structures of these compounds are provided in Supplementary Figure S2. The interactions of the docking results for the analysis of direct inhibition are shown in Figure 3, and those for allosteric inhibition are shown in Figure 4.
The molecular interaction analysis derived from the docking simulations revealed a diverse range of non-covalent interactions between the ligands and the IHF protein; These included Van der Waals forces, conventional hydrogen bonds, carbon-hydrogen bonds, π-cation, π-sigma, alkyl, and π-alkyl interactions. Notably, within the allosteric pocket, several complexes display π-sulfur (π-S) contacts, wherein the ligand’s aromatic π-system interacts favourably with the polarizable sulfur of methionine/cysteine, contributing to pose stabilization and orientational bias. This interaction is well documented in proteins: methionine-aromatic motifs are common and provide ~1–1.5 kcal·mol−1 of additional stabilization, and sulfur-involving noncovalent contacts are increasingly recognized as determinants of protein structure and function; in line with this, proximity to aromatic residues has been shown to reduce methionine’s susceptibility to oxidation [67,68].
As illustrated in Figure 3, ICCB-26 exhibited a high number of π-sigma, π-alkyl, and hydrogen bond interactions, along with van der Waals contacts. These interaction types were also present in ICCB-53, supporting the hypothesis that such interactions may contribute to the favorable docking scores observed for these compounds. In contrast, ICCB-24 and ICCB-43 displayed fewer of these key interactions, which may explain their comparatively lower binding affinities. ICCB-3, used as the reference molecule, helped identify the most relevant residues involved in ligand binding. Among these, Pro 65 stood out for its critical role in DNA binding and in inducing the characteristic U-turn in the DNA structure. Other vital residues included Arginine 76, Arginine 63, and Isoleucine 73. The compounds interacted with these residues through different mechanisms, including hydrogen bonding, π-sigma, and alkyl interactions, highlighting the structural diversity in ligand-receptor engagement.
In contrast to the results from direct inhibition docking simulations, Figure 4 shows that compounds ICCB-26 and ICCB-24 exhibit distinct interaction profiles. These molecules interact by Pi-Sulfur and Pi-Cation interactions, alongside conventional hydrogen bonds and Van der Waals forces. Such interactions are likely to contribute significantly to the enhanced binding scores observed during docking, suggesting a potential role in stabilizing the ligand-protein complex. Conversely, ICCB-53, despite demonstrating a favorable binding score, showed poor biological activity. Notably, this compound lacks Pi-Sulfur and Pi-Cation interactions, which may account for its reduced efficacy. Although ICCB-53 forms Pi-Sigma, Alkyl, van der Waals, and hydrogen bond interactions, the absence of Pi-Sulfur contacts appears to be a critical factor influencing its functional performance. Similarly, ICCB-43 lacks these key interactions and exhibits fewer overall molecular contacts, further supporting this hypothesis. Among the compounds analyzed, ICCB-3 serves as a representative example, exhibiting consistent and meaningful interactions with key amino acid residues, including Leu 12, Leu 26, and Phe 30. These residues participate in hydrogen bonding and Pi-Sigma interactions, which may play a pivotal role in the compound’s mechanism of action. If the inhibitory effect of these molecules is mediated through this interaction pathway, their interaction with these residues could induce conformational changes that disrupt DNA binding, alter protein folding, or interfere with dimerization processes essential for the protein’s active structure. These possibilities warrant further investigation through molecular dynamics simulations, which could provide deeper insights into the conformational behavior of the protein-ligand complexes. Figure 5 illustrates the results of the redocking validation procedure performed with AutoDock Vina (version 1.2.7), showing the superposition of all compounds within both the allosteric and direct inhibition pockets of the IHF protein [69,70]. The visualization highlights their spatial arrangement and key interactions with relevant residues, including Pro65.
This comparative analysis was designed to elucidate specific interaction patterns and binding behaviors at both the DNA-binding site and the proposed allosteric site of the IHF protein. By examining the nature and strength of interactions such as hydrogen bonds, hydrophobic contacts, and electrostatic forces across these regions, we aimed to identify structural determinants that influence the inhibitory potential of the compounds. The inclusion of ICCB-3 as a reference compound provided a consistent framework for evaluating interaction profiles within the cluster, while ICCB-43 served as a contrasting example, highlighting structural and chemical features associated with reduced activity. All ligands analyzed share an extended polyaromatic scaffold with peripheral carbonyl and/or hydroxyl substituents, forming a rigid π-conjugated core that enables π–π and π–alkyl interactions within the hydrophobic pocket, complemented by polar groups capable of establishing directional hydrogen bonds with key residues. ICCB-3, in particular, represents the most compact and symmetric analogue, resembling a substituted anthraquinone in which oxygenated groups are directly attached to the tricyclic aromatic core without bulky sulfonyl or amide side chains. This architecture minimizes rotatable bonds and heteroatom diversity, yielding a highly planar, conformationally restricted scaffold whose interactions are dominated by centrally located hydrogen bonds and extensive aromatic and hydrophobic contacts. These observations support the hypothesis that biological efficacy depends not only on binding energy but also on the quality and nature of molecular interactions within the target site [71].

2.6. Molecular Dynamics Simulations

To gain deeper insight into the potential mechanisms of action of the plasmid inhibitors and into how these compounds form stable complexes with their targets, molecular dynamics (MD) simulations were conducted to assess the conformational stability and binding energy of the ligand-protein complexes over time. The compounds ICCB-26 and ICCB-53 were selected for this analysis due to their strong performance in docking simulations and their contrasting biological activity profiles, which represent two extremes in the experimental results. This contrast is particularly valuable for exploring possible correlations between in silico predictions and in vitro efficacy. Additionally, ICCB-3 was included as a reference compound due to its representative interaction profile. Its inclusion may help define the minimum interaction requirements necessary to elicit a biological response. Following the docking protocol, two MD simulations were performed for each compound, corresponding to the two proposed mechanisms of action. The initial conformations used for the simulations were derived from the lowest energy docking poses. Simulations were carried out using YASARA Structure, employing the AMBER14 force field, which has been previously validated for modeling interactions between proteins and small molecules [72,73,74]. Each simulation was run for 200 nanoseconds, allowing sufficient time to observe dynamic behavior and interaction stability. For the initial analysis, two key parameters were evaluated: the binding energy (Figure 5) and Root Mean Square Deviation (RMSD) (Figure 6). These metrics provide insight into the binding affinity and structural stability of the complexes throughout the simulation period, offering valuable information on the potential effectiveness and reliability of the compounds as inhibitors [75].
Throughout the molecular dynamics simulations, distinct interaction behaviors were observed between the compounds and the target protein, depending on the inhibitory mechanism being evaluated. Binding energy was calculated using the YASARA macro md_Analyzebindenergy.mcr, which estimates this metric as the difference between the solvation and potential energies of the ligand-protein complex and those of the individual reactants. This analysis employs the MM-PBSA (Molecular Mechanics Poisson–Boltzmann Surface Area) methodology, integrating an entropy approximation to yield a more comprehensive binding energy profile. Within this framework, the calculated binding energy values are interpreted such that more positive values indicate stronger and more favorable interactions, whereas more negative values correspond to weaker or unfavorable interactions [76]. The results revealed that allosteric inhibition generally produced more favorable and stable interaction energies. For instance, the reference compound ICCB-3 exhibited a mean interaction energy of −15.10 kJ/mol, while ICCB-26 and ICCB-53 showed values of −6.53 kJ/mol and 84.39 kJ/mol, respectively. On the other hand, the direct inhibition mechanism yielded less favorable and more variable energy profiles. ICCB-3 recorded a mean interaction energy of −12.27 kJ/mol, whereas ICCB-26 and ICCB-53 showed values of −64.22 kJ/mol and 5.11 kJ/mol, respectively. A clear trend emerged from these results: ICCB-53 consistently showed the most favorable energy interactions across both inhibition mechanisms, suggesting a strong and stable binding affinity. On the other hand, ICCB-26 displayed highly variable interaction energies, suggesting not only a weaker binding affinity but also instability in its interaction with the target over time.
To complement and deepen the interpretation of the binding energy analysis, it is essential to consider the RMSD of the entire protein-ligand complex. The combined evaluation of these two metrics provides a more comprehensive understanding of the interaction dynamics and stability of the system throughout the simulation. In the case of allosteric inhibition, the RMSD values indicate a relatively stable interaction across all compounds, with fluctuations averaging around 3 Å. Notably, a stabilization period was observed during the first 20 nanoseconds, likely reflecting initial conformational adjustments as the system reached an energetically favorable state following structural optimization. Conversely, the direct inhibition mechanism exhibited greater conformational variability, with RMSD values reaching approximately 4 Å. The compound ICCB-53 showed the highest instability, undergoing fluctuations of up to 6 Å during the first 75 nanoseconds before settling into a more stable conformation. This behavior suggests that the system required a longer period to reach equilibrium, possibly due to initial dynamic instability [77].
Taken together, the binding energy and RMSD analyses suggest that the allosteric inhibition mechanism is associated with stronger and more stable interactions, supporting the hypothesis that it may represent the primary mode of action by which these compounds inhibit plasmid transfer. However, to fully understand how this mechanism operates, it is crucial to examine the specific interactions between the ligands and the protein, and how these interactions influence the protein’s structural behavior. These molecular interactions between the compounds and the IHF protein are illustrated in Figure 7, providing visual insight into the nature and location of the binding contacts.
As shown in Figure 8, in the direct inhibition scenario, a high number of classical hydrogen bonds were observed, particularly involving residues such as Arg63, Thr74, and Pro72. Although these interactions occur within the catalytic site and are generally strong, they appear to be highly sensitive to minor conformational changes in either the protein or the ligand. This sensitivity is reflected in the binding energy and RMSD analyses, which suggest a less stable interaction over time. In contrast, the allosteric site is characterized by a hydrophobic and aromatic pocket, with frequent non-polar contacts between the ligands and residues such as Phe30, Phe31, Tyr11, and Met8. The most prominent interactions in this region are of the types of Pi–Pi stacking and Pi–Alkyl, which likely play a crucial role in maintaining the stability of the complex throughout the simulation. Moreover, these interactions may contribute to conformational changes that indirectly affect the active site, particularly residues like Arg63 and Pro65.
To further investigate these structural modifications, we analyzed the radius of gyration (Rg) and the solvent-accessible surface area (SASA), as presented in Figure 9. These metrics provide additional insight into the compactness and exposure of the protein structure, helping to clarify the impact of ligand binding on overall protein dynamics [78].
The analysis of the Rg and SASA provides valuable analysis into the overall conformational changes in the protein during molecular dynamics simulations. These metrics are particularly useful for assessing both the degree of structural compaction and the distribution of solvent-exposed regions, which may reflect alterations in the protein’s tertiary structure [79]. Notable differences were observed between the two inhibitory mechanisms. In the case of direct inhibition, the protein exhibited a greater degree of compaction, as indicated by a lower average radius of gyration. This observation was further supported by reduced SASA, suggesting that the protein adopts a more compact conformation upon ligand binding at the catalytic site. In contrast, the allosteric inhibition mechanism was associated with a larger solvent-accessible surface area, implying a more relaxed or expanded conformation. This may result from conformational changes in distal secondary structure elements, potentially induced by ligand binding at the allosteric site.
To further investigate which residues undergo the most significant structural fluctuations, and to understand how these changes might influence the protein’s functional activity, we analyzed the Root Mean Square Fluctuation (RMSF) of each residue. These results are presented in Figure 10, providing a residue-level view of the protein’s dynamic behavior under different inhibitory conditions.
The RMSF analysis revealed a consistent pattern of conformational changes across both proposed mechanisms of inhibition. In both simulation sets, the most significant fluctuations were observed in the region encompassing residues 60 to 70, notably affecting Pro65, a residue previously identified as functionally relevant. These findings suggest that this region may play a central role in the structural response of the protein to ligand binding. In the case of allosteric inhibition, the observed fluctuations were more consistent across the different compounds, supporting the hypothesis of a shared mechanism of action among molecules belonging to the same conformational cluster identified within the applicability domain of the QSAR model [80]. This consistency may indicate a common structural pathway through which these compounds exert their inhibitory effects. On the other hand, the direct inhibition mechanism also showed fluctuations in the same residue region; however, the pattern was less uniform across the compounds. For instance, ICCB-26 did not exhibit the same degree of structural variation observed in other compounds, suggesting a less stable or less consistent interaction with the protein. Additional fluctuations in other regions of the protein further support the notion of a less stable conformational state under direct inhibition. Of particular interest is the observation that, despite binding at a site distant from the active center, compounds acting through allosteric inhibition were still able to induce structural changes in key catalytic residues such as Arg63 and Pro65. To explore how these distal interactions might influence the active site, a Dynamic Cross-Correlation Matrix (DCCM) analysis was performed (Figure 11). This approach allows for the identification of correlated motions between residues interacting with the ligands and those located in the active site, offering further insight into the allosteric communication pathways within the protein.
The DCCM analysis between the residues involved in ligand binding and those located in the active site of the protein provides valuable insight into the mechanistic relationship between ligand interaction and functional modulation. This analysis helps to explain the observed correlation between biological activity and the results of the in silico simulations. In the DCCM plots, blue regions represent anti-correlated motions, where residues move in opposite directions, while red regions indicate positively correlated motions, meaning the residues move in the same direction [81,82]. White areas reflect a lack of correlation between residue movements. The compound ICCB-26, which exhibited the highest biological activity, showed strong anti-correlated motions between residues in the active site, including Pro65 and those interacting directly with the ligand. This suggests a possible mechanical decoupling between the binding site and the catalytic core, potentially mediated by conformational changes in the β-sheets and loops that form the active site. Such decoupling could hinder the protein’s ability to interact effectively with DNA, thereby inhibiting its function. This behaviour has been reported in several works in which an allosteric mechanism is suggested [83,84].
In contrast, ICCB-53, which demonstrated the lowest biological activity, displayed positively correlated motions between the ligand-binding residues and the active site. This may indicate a lack of structural disruption, allowing the protein to maintain its functional conformation despite ligand binding. Notably, the most significant anti-correlated interactions in the ICCB-26 complex were observed between Phe52–Pro65, Leu26–Pro61, and Leu26–Lys66. These residues may represent key nodes within the allosteric pocket, whose dynamic behavior has a pronounced impact on the structural integrity of the active site. For ICCB-3, which exhibited intermediate biological activity, a mixed pattern of correlated and anti-correlated motions was observed.
This observations aligns with the moderate efficacy observed and suggests that partial allosteric modulation may be occurring. All compounds interact with a common set of hydrophobic hotspots LeuA3/A26, ValA27, PheA30/31, and TyrA11 but differ significantly in how these contacts are distributed along the binding cleft. ICCB-26 and ICCB-53 establish extended, multi-point interaction networks: distal aromatic rings engage peripheral Leu and Phe residues, while the central sulfonyl group forms persistent polar contacts with residues such as GluA7 and IleA34, effectively bridging adjacent subsites within the pocket. In contrast, ICCB-3 concentrates its interactions around PheA30, TyrA11, and LeuA26, with a shorter interaction radius and fewer peripheral van der Waals contacts. This pattern reflects a more localized anchoring within the central cavity and limited engagement of lateral subpockets. Such an interaction topology supports ICCB-3 as a minimal binder that retains essential pharmacophoric contacts but lacks the extended subpocket coverage observed for ICCB-26 and ICCB-53. These findings underscore the importance of long-range dynamic interactions in modulating protein function and reinforce the hypothesis that allosteric inhibition may represent a more effective strategy for disrupting IHF activity [85].

3. Discussion

Antibiotic resistance is a major global health threat that compromises the effectiveness of antimicrobial therapies. It occurs when bacteria develop mechanisms to evade antibiotics, often through complex interactions with their environment. Resistance can arise through horizontal gene transfer, in which bacteria acquire resistance genes from other microorganisms, or through selective pressure from prolonged antibiotic exposure, especially in aquatic environments. Additionally, genetic mutations in susceptible bacteria can alter drug targets or enzymatic pathways, reducing antibiotic binding and leading to treatment failure [86,87]. Bacterial conjugation is a key mechanism for the spread of antibiotic resistance. This process uses a sex pilus to transfer plasmids between cells. Among these, IncF plasmids are particularly important because they carry genes for resistance to drugs such as β-lactams, carbapenems, and tetracyclines.
This study implemented an integrated computational approach combining 3D-QSAR predictive modeling, pharmacophore-based screening, molecular docking, and molecular dynamics simulations to predict and propose the potential biological effects of compounds with demonstrated plasmid inhibition activity. The first step involved developing a predictive QSAR model using a dataset of 65 compounds and seven molecular algebraic descriptors that capture electronic, thermodynamic, and topological properties, calculated with QuBiLS-MIDAS. The resulting model exhibited high predictive performance (R2 = 0.90, Q2 = 0.82) and was validated through multiple strategies, including Leave-One-Out Cross-Validation (LOOCV), Tropsha’s criteria, and Y-scrambling, confirming its robustness and reliability. Analysis of the selected descriptors revealed a preference for rigid molecular frameworks with regions of high electronic density and a well-defined three-dimensional architecture. Furthermore, the presence of a balanced distribution of electronic hardness and softness, along with intermediate physicochemical properties such as polarizability, charge, logP, and polar surface area (PSA), was associated with enhanced interaction with the target protein and improved biological activity.
Following the QSAR analysis, a pharmacophore-based search was conducted to identify representative compounds from the original set of 65 molecules used to build the predictive model [88]. This process involved generating pharmacophoric clusters based on molecular fingerprints calculated using pmapper and analyzed with RDKit. Key features such as hydrogen bond donors and acceptors, hydrophobic regions, aromatic rings, and ionizable groups were considered to create a 3D pharmacophore representation of all compounds in the dataset. From these clusters, a central compound was selected to represent the largest group based on structural similarity and biological activity. As a result, ICCB-3 was chosen as the representative compound. This molecule was subsequently used for target identification through the PharmMapper web platform, which performs extensive searches for potential druggable pockets using a large database of receptor-ligand complexes. The analysis identified the Integration Host Factor (IHF) as the most promising target among more than 1000 candidates, a conclusion supported by bibliographic evidence. IHF plays a critical role in plasmid replication, which is essential for bacterial conjugation. Multiple studies have shown that structural alterations in this protein can significantly impair its normal function, leading to a reduction in plasmid transfer efficiency. These structural changes disrupt the interactions between the monomers that form the functional dimer, resulting in improper dimerization of the complex. In addition, modifications in key residues involved in DNA recognition interfere with the initiation of the plasmid-transfer process. Together, these findings highlight the importance of the protein’s structural integrity and demonstrate that even subtle modifications can directly compromise plasmid replication [89,90]. Structurally, it is a heterodimer with a central α-helical core flanked by β-sheets that mediate DNA binding. Proline residues, such as Pro65, interact with DNA bases, inducing bends in the double helix that facilitate origin activation, DNA unwinding, and assembly of replication proteins, including RepE and DnaA. Additionally, IHF interacts with components of the type IV secretion system (T4SS), such as TraY and Tram, promoting DNA curvature required for plasmid mobilization. This coordinated activity accelerates the dissemination of antibiotic resistance genes. Therefore, inhibiting IHF function represents a promising strategy to block plasmid replication and limit the spread of antimicrobial resistance [91,92].
The Integration Host Factor (IHF) was selected as the molecular target for docking simulations, evaluating two potential inhibition mechanisms: direct inhibition of the active site where the protein interacts with DNA to induce structural modifications and allosteric inhibition, which triggers conformational changes that alter protein functionality. Overall, the allosteric mechanism exhibited more favorable binding energies. Notably, ICCB-26 achieved strong docking scores in both mechanisms (>8.6 kcal/mol) and displayed the highest Anti-Plasmid Inhibition Index (APII). In contrast, ICCB-53 also showed good binding energies (>8.4 kcal/mol) but poor biological activity, suggesting that APII is influenced not only by ligand-protein affinity but also by additional factors such as compound permeability and complex stability over time.
The molecular dynamics (MD) simulations confirmed both the affinity and stability of the ligand-protein complexes. Simulations were performed over 200 nanoseconds, providing strong evidence in favor of an allosteric inhibition mechanism, as compounds bound to this site exhibited greater stability than those that interacted directly. The mean RMSD for these complexes was approximately 3 Å, and binding energy values reached up to 84.4 kJ/mol, indicating a highly stable interaction. Additional analyses, including radius of gyration, solvent-accessible surface area (SASA), and root mean square fluctuation (RMSF), offered further insights into the structural dynamics of the protein-ligand complexes. These metrics revealed conformational changes and stability patterns that reinforce the hypothesis of a potential allosteric mechanism of inhibition [93].
Despite these favorable results, a direct correlation with biological activity has not yet been fully established. To address this, and considering the findings from the RMSF analysis, which revealed positional changes in active-site residues, particularly Pro65 and Arg63, under the allosteric mechanism, we examined the interactions of the compounds at their minimum-energy conformations during the simulation. This analysis identified key residues involved in complex formation at the allosteric site, including Phe30, Phe31, Tyr11, and Met8, which primarily engaged in Pi–Pi, Pi–Alkyl, and van der Waals interactions. To further explore the functional impact of these interactions, a Dynamic Cross-Correlation Matrix (DCCM) was generated to assess the relationship between residues in the allosteric pocket and those in the active site. The results revealed a clear correlation with biological activity: compounds with higher activity tended to induce anti-correlated motions between interacting and functional residues, suggesting that this dynamic decoupling may enhance inhibitory effects.
These anti-correlated motions may be associated with a mechanical decoupling between the allosteric binding site and the catalytic core, likely mediated by conformational changes in the β-sheets and loops that form the active site. Such structural rearrangements could impair the protein’s ability to interact effectively with DNA, thereby inhibiting its biological function [94]. Although this mechanism may provide a plausible explanation for the observed changes in biological activity, it is also important to address the apparent decoupling between binding stability and the experimental activity reported. This discrepancy suggests that the mechanism of action proposed could play a decisive role. Typically, molecules that inhibit a protein by binding directly to the active site show a relationship between their binding stability and the corresponding biological activity. However, in the present case, an allosteric mechanism has been proposed. Under such circumstances, the primary determinant of biological activity may not be the binding stability itself, but rather other contributing factors. These may include specific interactions with key residues, the three-dimensional shape of the ligand, or particular electronic and thermodynamic features that influence downstream conformational or functional changes in the protein [95,96], although further experimental validation will be necessary to confirm this proposed mechanism, it is important to acknowledge the inherent limitations of in silico simulations when attempting to describe or verify functional biological processes of this complexity.
This potential allosteric mechanism of functional inhibition represents a promising novel strategy for anti-plasmid activity, offering an alternative approach to combat antimicrobial resistance. By preventing the horizontal transfer of resistance genes, this strategy could significantly reduce the rate of dissemination of multidrug-resistant microorganisms. Furthermore, inhibiting plasmid replication may lead to the eventual loss of extrachromosomal genetic material, which is not integrated into the bacterial chromosome. Over time, this could result in a progressive re-sensitization of bacterial populations to antibiotics that had previously lost efficacy, restoring the therapeutic potential of existing antimicrobial agents.
Despite the advantages of in silico methodologies, these approaches cannot fully replicate the complexity of the biological environment during an infection. Computational simulations do not account for interactions between the drug and various substances produced by the host immune system or by the bacteria themselves. Furthermore, the activity of these compounds may vary with structural changes caused by mutations, potentially altering binding mechanisms and complex formation. For these reasons, it is essential to validate the findings presented here through in vitro and in vivo experiments to confirm the efficacy of the proposed compounds. It is also important to consider the potential collateral effects that may arise when structural alterations disrupt the various functions of this protein. Several cellular processes rely on proper DNA architecture and the coordinated assembly of multiprotein complexes. For example, site-specific recombination, the initiation and synchronization of chromosomal replication at oriC, and global transcriptional regulation all depend on the structural integrity and dynamic behavior of this protein. In addition, changes that affect nucleoid organization or DNA topology could further influence essential cellular processes. Such collateral alterations may ultimately lead to modifications in bacterial physiology, potentially altering their typical interactions and behavior within the environment [97,98]. Nevertheless, the results obtained from these simulations provide a valuable foundation for the IHF development of future drugs with potential clinical applications [99,100].

4. Materials and Methods

4.1. Database and Calculation of Descriptors

To construct the dataset for training the QSAR machine learning model, an initial set of 366 compounds was selected from a previously published study that investigated antimicrobial and anti-plasmid activity through high-throughput screening of over 12,000 known bioactive molecules. Following a detailed curation process, 65 compounds were retained. This selection excluded duplicates, compounds not indexed in major chemical databases, and those lacking retrievable structural information. The 2D chemical structures of the selected compounds were retrieved from the PubChem database. To ensure accurate geometries for descriptor calculation, energy minimization was performed using GaussView version 6 [101], applying Density Functional Theory (DFT) with the B3LYP functional and the 6–31G(d) basis set. Subsequently, molecular descriptors including both topological and electronic properties were calculated using the QuBiLs-MIDAS software https://tomocomd.com/software (accessed on 10 June 2025). These descriptors served as input features for the QSAR model. To prepare the biological activity data, the experimental values expressed as the RFU/OD600 ratio were log-transformed. This normalization ensured that higher values reflected greater biological activity, thereby enhancing data interpretability and improving the robustness of statistical analyses during model development.

4.2. Building and Analysis of the QSAR

Once the dataset was compiled into a .csv file containing the 65 selected compounds, their corresponding biological activity values, and the full set of calculated molecular descriptors, it was subjected to a preprocessing pipeline using a custom script. The first step involved eliminating autocorrelated variables, which helped reduce multicollinearity, minimize redundancy and noise, and improve the model’s interpretability. Subsequently, columns containing missing or null values were removed. After this cleaning process, approximately 16,000 descriptors remained for further analysis. From this refined set, only the descriptors showing the strongest correlation with the dependent variable (biological activity) were selected for model training.
A linear regression model was developed, yielding a predictive equation that estimates plasmid inhibition activity from molecular features. To ensure the robustness and reliability of the model, several validation techniques were employed, including Hold-Out, 10-Fold Cross Validation, Leave-One-Out Cross Validation, and Repeated K-Fold. For each validation method, key performance metrics were calculated: R2, Q2, Root Mean Square Error of Prediction (RMSEP), and Mean Absolute Error (MAE). The validity of the model was further confirmed using Tropsha’s criteria and Y-scrambling tests, which assess the risk of chance correlations. Finally, the applicability domain of the model was defined using Interquartile Range (IQR) and Cook’s Distance, ensuring that predictions are made only within the chemical space where the model is considered reliable [102].

4.3. Selection of Representative Compound and Target Identification

The 3D structures of the molecules used to train the QSAR model were subjected to an automated pharmacophoric clustering process, using structures stored in .sdf format. Through an iterative procedure, all molecules were loaded, and pharmacophores were generated using the pmapper Python library. Subsequently, various fingerprints representing key structural features of the molecules were computed using RDKit. To improve the accuracy of structural comparisons, molecular planarity alignment was performed. After this step, Tanimoto similarity scores [103,104] were applied within each cluster to identify the compound most representative of its group. The largest cluster contained 31 compounds, and its representative molecule was ICCB-3.
This compound was used to identify potential molecular targets through PharmMapper, a well-established web-based platform for reverse pharmacophore mapping. This tool identifies likely biological targets for small molecules by comparing their pharmacophoric features against a comprehensive database of experimentally validated protein-ligand interactions. PharmMapper utilizes a diverse collection of pharmacophore models derived from natural ligands, known drug targets, and repurposed therapeutic agents. By aligning the pharmacophoric features of the query compound with those in its database, the platform predicts potential protein targets with high confidence. This approach allows for the identification of plausible molecular interactions without requiring prior knowledge of the compound’s biological activity, highlighting the predictive strength and versatility of the software.

4.4. Docking and Molecular Dynamics Protocol

Molecular docking was performed using the E. coli Integration Host Factor (IHF) structure crystallized in complex with DNA (PDB 1IHF, 2.5 Å). This structure captures the characteristic DNA U-turn generated by IHF, in which Pro65 and neighboring residues intercalate into the minor groove and initiate sharp bending of the DNA duplex, thereby defining the native DNA-binding interface of the protein. The DNA-binding site used in this study was defined directly according to the residue–DNA contacts observed in the 1IHF crystal structure, following the binding architecture described by Rice et al. [105]. For docking, the DNA strands were removed while preserving the protein conformation adopted in the DNA-bound state. All crystallographic water molecules and non-protein or non-DNA heteroentities were removed prior to receptor preparation. Hydrogen atoms were added, and protonation states were assigned at pH 7.4. Two docking grids were created: (i) one centered on the experimentally defined DNA-binding site from 1IHF, at X: 2, Y: 14, Z: 4 with dimensions of 50 Å × 50 Å × 50 Å and (ii) a second grid positioned at a putative allosteric pocket identified using fpocket and CB-Dock2, with centered at X: −2, Y: −9, Z: 34 with the same dimensions for the cubid grid (50 Å × 50 Å × 50 Å). Both grids were used for exhaustive sampling of ligand conformations.
All docking calculations were carried out in flexible-ligand mode, i.e., with full torsional freedom for each compound throughout the search. This choice is particularly relevant for allosteric evaluation, because ligand adaptability often underpins binding at cryptic or partially formed pockets. All docking simulations were performed using AutoDockGPU, which employs the ADADELTA optimization algorithm and an enhanced local search method, enabling extensive conformational sampling through GPU acceleration. For each compound, 1000 conformations were generated, with scoring parameters set to 2,500,000 evaluations and a maximum of 42,000 generations. Data analysis was conducted using a custom script that extracted the best binding scores, calculated the mean score across all conformations, and determined the standard deviation and RMSD for each cluster of generated complexes (Supplementary Material S1).
Molecular dynamics (MD) simulations were performed using YASARA Structure over a period of 200 nanoseconds, employing the AMBER14 force field. A cubic simulation box was defined with a 5 Å margin around all atoms of the complex, ensuring complete coverage of the system, and periodic boundary conditions were applied to maintain continuity throughout the simulation. Long-range electrostatic interactions were calculated using the Particle-Mesh Ewald method. Physiological conditions were simulated by setting pH to 7.4, ionic strength to 0.9% NaCl, temperature at 298 K, and adjusting pressure and water density dynamically to maintain standard values. The water density was kept at 0.997 g/mL using the TIP3P solvation model. Post-simulation analysis was conducted using two YASARA macros; the first calculated structural and dynamic parameters, including RMSD, RMSF, radius of gyration, solvent-accessible surface area (SASA), and dynamic cross-correlation matrices (DCCM). The second estimated binding energy between receptor and ligand, using an MM/PBSA-like approach, incorporates an approximation of the entropic cost associated with solvent exposure. This calculation considers that water molecules in contact with the protein surface reorganize themselves and lose conformational freedom, reducing entropy and increasing energy cost. A coefficient of 0.65 was applied to account for Van der Waals and long-range interactions between the protein surface and water molecules. All results were compiled and tabulated for subsequent statistical analysis. The 2D interaction diagrams from docking and molecular dynamics simulations were generated using Discovery Studio Visualizer for subsequent analysis https://discover.3ds.com (accessed on 20 September 2025).

5. Conclusions

The computational workflow developed in this study, integrating QSAR-based machine learning (R2 = 0.90), molecular docking, and 200 ns molecular dynamics simulations, identified a promising allosteric mechanism for inhibiting plasmid transfer in bacteria. This mechanism targets the Integration Host Factor (IHF), a protein essential for initiating plasmid replication and mobilization. Pharmacophore-based screening selected ICCB-3 as a representative compound, and reverse pharmacophore mapping using PharmMapper supported IHF as its potential molecular target. Docking analysis showed that compounds binding to the allosteric site generally exhibited more favorable binding energies than those acting through direct inhibition. For example, ICCB-26 reached −12.15 kcal/mol, while ICCB-53 scored −11.62 kcal/mol; these values correspond specifically to interactions within the proposed allosteric pocket. However, the variation in APII values among compounds with similar binding energies suggests that biological activity is not determined solely by affinity. This indicates that additional factors-such as structural rearrangements induced upon ligand binding-likely contribute to the mechanism of action. Molecular dynamics simulations confirmed the stability and favorable energetics of these complexes over 200 ns, supported by RMSD, RMSF, radius of gyration, SASA, and binding energy analyses. Dynamic cross-correlation analysis revealed that ligand binding at the allosteric site induces conformational changes that weaken IHF–DNA interactions, particularly involving residues Phe52 and Leu26, which in turn affect the positioning of Pro65 and other residues critical for DNA binding. These structural alterations may hinder the assembly required for plasmid replication, reducing plasmid copy numbers, and limiting both horizontal and vertical gene transfer. Such inhibition represents a promising strategy to curb the dissemination of antibiotic resistance and restore bacterial susceptibility to previously ineffective antibiotics. While these computational findings are encouraging, experimental validation through in vitro and in vivo studies is essential to confirm the proposed mechanism and advance these compounds toward preclinical and clinical evaluation.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms27062526/s1.

Author Contributions

Conceptualization, O.S.-C., O.S.-H., and E.A.M.B.; methodology, O.S.-C.; software, J.D.R.-M.; validation, O.S.-C., O.S.-H. and E.A.M.B.; formal analysis, J.D.R.-M.; investigation, J.R.M., N.P.-P., J.J.A., R.O.d.M. and I.J.d.S.N.; resources, Y.M.-P.; data curation, J.D.R.-M.; writing—original draft preparation, O.S.-C.; writing review and editing, O.S.-C., O.S.-H. and E.A.M.B.; visualization, J.D.R.-M.; supervision, O.S.-H. and E.A.M.B.; project administration, O.S.-C.; funding acquisition, Y.M.-P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors would like to express their sincere gratitude to Universidad del Norte, Universidad San Francisco de Quito, and Universidad Libre for providing all the computational resources necessary to carry out this project. Additionally, they would like to thank Minciencias for providing grant number RC-746-2017 and for their invaluable support in facilitating the J.D.M.R. scholarship through the 909 call.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Abbas, A.; Barkhouse, A.; Hackenberger, D.; Wright, G.D. Antibiotic Resistance: A Key Microbial Survival Mechanism That Threatens Public Health. Cell Host Microbe 2024, 32, 837–851. [Google Scholar] [CrossRef]
  2. Giraldi, G.; Montesano, M.; Napoli, C.; Frati, P.; La Russa, R.; Santurro, A.; Scopetti, M.; Orsi, G.B. Healthcare-Associated Infections Due to Multidrug-Resistant Organisms: A Surveillance Study on Extra Hospital Stay and Direct Costs. Curr. Pharm. Biotechnol. 2019, 20, 643–652. [Google Scholar] [CrossRef]
  3. Mancuso, G.; Midiri, A.; Gerace, E.; Biondo, C. Bacterial Antibiotic Resistance: The Most Critical Pathogens. Pathogens 2021, 10, 1310. [Google Scholar] [CrossRef]
  4. Davidovich, N.V.; Kukalevskaya, N.N.; Bashilova, E.N.; Bazhukova, T.A. General Principles of Antibiotic Resistance Evolution in Bacteria (Review of Literature). Russ. Clin. Lab. Diagn. 2020, 65, 387–393. [Google Scholar] [CrossRef]
  5. Baquero, F.; Alvarez-Ortega, C.; Martinez, J.L. Ecology and Evolution of Antibiotic Resistance. Environ. Microbiol. Rep. 2009, 1, 469–476. [Google Scholar] [CrossRef]
  6. Munita, J.M.; Arias, C.A. Mechanisms of Antibiotic Resistance. Microbiol. Spectr. 2016, 4, 464–473. [Google Scholar] [CrossRef] [PubMed]
  7. Lerminiaux, N.A.; Cameron, A.D.S. Horizontal Transfer of Antibiotic Resistance Genes in Clinical Environments. Can. J. Microbiol. 2019, 65, 34–44. [Google Scholar] [CrossRef] [PubMed]
  8. Liu, G.; Thomsen, L.E.; Olsen, J.E. Antimicrobial-Induced Horizontal Transfer of Antimicrobial Resistance Genes in Bacteria: A Mini-Review. J. Antimicrob. Chemother. 2022, 77, 556–567. [Google Scholar] [CrossRef] [PubMed]
  9. Gago-Córdoba, C.; Val-Calvo, J.; Abia, D.; Díaz-Talavera, A.; Miguel-Arribas, A.; Aguilar Suárez, R.; van Dijl, J.M.; Wu, L.J.; Meijer, W.J.J. A Conserved Class II Type Thioester Domain-Containing Adhesin Is Required for Efficient Conjugation in Bacillus Subtilis. mBio 2021, 12, e00104-21. [Google Scholar] [CrossRef]
  10. Lin, S.H.; Zhao, D.; Deng, V.; Birdsall, V.K.; Ho, S.; Buzovetsky, O.; Etson, C.M.; Mukerji, I. Integration Host Factor Binds DNA Holliday Junctions. Int. J. Mol. Sci. 2022, 24, 580. [Google Scholar] [CrossRef]
  11. von Wintersdorff, C.J.H.; Penders, J.; van Niekerk, J.M.; Mills, N.D.; Majumder, S.; van Alphen, L.B.; Savelkoul, P.H.M.; Wolffs, P.F.G. Dissemination of Antimicrobial Resistance in Microbial Ecosystems through Horizontal Gene Transfer. Front. Microbiol. 2016, 7, 173. [Google Scholar] [CrossRef]
  12. Smillie, C.; Garcillán-Barcia, M.P.; Francia, M.V.; Rocha, E.P.C.; de la Cruz, F. Mobility of Plasmids. Microbiol. Mol. Biol. Rev. 2010, 74, 434–452. [Google Scholar] [CrossRef]
  13. Rozwandowicz, M.; Brouwer, M.S.M.; Fischer, J.; Wagenaar, J.A.; Gonzalez-Zorn, B.; Guerra, B.; Mevius, D.J.; Hordijk, J. Plasmids Carrying Antimicrobial Resistance Genes in Enterobacteriaceae. J. Antimicrob. Chemother. 2018, 73, 1121–1137. [Google Scholar] [CrossRef] [PubMed]
  14. Partridge, S.R.; Kwong, S.M.; Firth, N.; Jensen, S.O. Mobile Genetic Elements Associated with Antimicrobial Resistance. Clin. Microbiol. Rev. 2018, 31, e00088-17. [Google Scholar] [CrossRef]
  15. Garcillán-Barcia, M.P.; Alvarado, A.; de la Cruz, F. Identification of Bacterial Plasmids Based on Mobility and Plasmid Population Biology. FEMS Microbiol. Rev. 2011, 35, 936–956. [Google Scholar] [CrossRef]
  16. Pearson, J.C.; Gillett, E.; Gadri, N.D.; Dionne, B. Tetracyclines, the Old and the New: A Narrative Review. CMI Commun. 2025, 2, 105059. [Google Scholar] [CrossRef]
  17. Buglak, A.A.; Chebotaev, P.P.; Zherdev, A.V.; Hendrickson, O.D. QSAR and Machine Learning Applied for the Analysis of (Fluoro)Quinolone Activity. Expert Opin. Drug Discov. 2025, 20, 1525–1547. [Google Scholar] [CrossRef] [PubMed]
  18. Thy, M.; Timsit, J.-F.; de Montmollin, E. Aminoglycosides for the Treatment of Severe Infection Due to Resistant Gram-Negative Pathogens. Antibiotics 2023, 12, 860. [Google Scholar] [CrossRef]
  19. Alamneh, Y.A.; Antonic, V.; Garry, B.; Pucci, M.J.; Abu-Taleb, R.; Shearer, J.P.; Demons, S.T.; Getnet, D.; Swierczewski, B.E.; Lister, T.; et al. Minocycline and the SPR741 Adjuvant Are an Efficacious Antibacterial Combination for Acinetobacter Baumannii Infections. Antibiotics 2022, 11, 1251. [Google Scholar] [CrossRef]
  20. Naseer, S.; Weinstein, E.A.; Rubin, D.B.; Suvarna, K.; Wei, X.; Higgins, K.; Goodwin, A.; Jang, S.H.; Iarikov, D.; Farley, J.; et al. US Food and Drug Administration (FDA): Benefit-Risk Considerations for Cefiderocol (Fetroja®). Clin. Infect. Dis. 2021, 72, e1103–e1111. [Google Scholar] [CrossRef]
  21. Buckner, M.M.C.; Ciusa, M.L.; Meek, R.W.; Moorey, A.R.; McCallum, G.E.; Prentice, E.L.; Reid, J.P.; Alderwick, L.J.; Di Maio, A.; Piddock, L.J.V. HIV Drugs Inhibit Transfer of Plasmids Carrying Extended-Spectrum β-Lactamase and Carbapenemase Genes. mBio 2020, 11, e03355-19. [Google Scholar] [CrossRef]
  22. Li, G.; Xia, L.-J.; Zhou, S.-Y.; Wang, X.-R.; Cui, C.-Y.; He, Y.-Z.; Diao, X.-Y.; Liu, M.; Lian, X.-L.; Kreiswirth, B.N.; et al. Linoleic Acid and A-linolenic Acid Inhibit Conjugative Transfer of an IncX4 Plasmid Carrying Mcr-1. J. Appl. Microbiol. 2021, 130, 1893–1901. [Google Scholar] [CrossRef]
  23. Son, A.; Park, J.; Kim, W.; Yoon, Y.; Lee, S.; Ji, J.; Kim, H. Recent Advances in Omics, Computational Models, and Advanced Screening Methods for Drug Safety and Efficacy. Toxics 2024, 12, 822. [Google Scholar] [CrossRef] [PubMed]
  24. Vidal-Limon, A.; Aguilar-Toalá, J.E.; Liceaga, A.M. Integration of Molecular Docking Analysis and Molecular Dynamics Simulations for Studying Food Proteins and Bioactive Peptides. J. Agric. Food Chem. 2022, 70, 934–943. [Google Scholar] [CrossRef] [PubMed]
  25. Acharya, A.; Nagpure, M.; Roy, N.; Gupta, V.; Patranabis, S.; Guchhait, S.K. How to Nurture Natural Products to Create New Therapeutics: Strategic Innovations and Molecule-to-Medicinal Insights into Therapeutic Advancements. Drug Discov. Today 2024, 29, 104221. [Google Scholar] [CrossRef]
  26. Gayathri, P.; Pande, V.; Sivakumar, R.; Gupta, S.P. A Quantitative Structure–Activity Relationship Study on Some HIV-1 Protease Inhibitors Using Molecular Connectivity Index. Bioorg. Med. Chem. 2001, 9, 3059–3063. [Google Scholar] [CrossRef]
  27. Ul-Haq, Z.; Usmani, S.; Shamshad, H.; Mahmood, U.; Halim, S.A. A Combined 3D-QSAR and Docking Studies for the In-Silicoprediction of HIV-Protease Inhibitors. Chem. Cent. J. 2013, 7, 88. [Google Scholar] [CrossRef] [PubMed]
  28. Jayatilleke, P.R.N.; Nair, A.C.; Zauhar, R.; Welsh, W.J. Computational Studies on HIV-1 Protease Inhibitors: Influence of Calculated Inhibitor−Enzyme Binding Affinities on the Statistical Quality of 3D-QSAR CoMFA Models. J. Med. Chem. 2000, 43, 4446–4451. [Google Scholar] [CrossRef]
  29. Vemula, D.; Bhandari, V. Proteome-Wide Identification of Druggable Targets and Inhibitors for Multidrug-Resistant Pseudomonas Aeruginosa Using an Integrative Subtractive Proteomics and Virtual Screening Approach. Heliyon 2025, 11, e42584. [Google Scholar] [CrossRef]
  30. Matamoros-Recio, A.; Franco-Gonzalez, J.F.; Forgione, R.E.; Torres-Mozas, A.; Silipo, A.; Martín-Santamaría, S. Understanding the Antibacterial Resistance: Computational Explorations in Bacterial Membranes. ACS Omega 2021, 6, 6041–6054. [Google Scholar] [CrossRef]
  31. Zulauf, K.E.; Kirby, J.E. Discovery of Small-Molecule Inhibitors of Multidrug-Resistance Plasmid Maintenance Using a High-Throughput Screening Approach. Proc. Natl. Acad. Sci. USA 2020, 117, 29839–29850. [Google Scholar] [CrossRef] [PubMed]
  32. Mira, P.; Yeh, P.; Hall, B.G. Estimating Microbial Population Data from Optical Density. PLoS ONE 2022, 17, e0276040. [Google Scholar] [CrossRef]
  33. Arbé-Carton, K.; Rey-Sogo, A.; Santos-Fernández, N.; Altube, O.; Garbisu, C.; Arana, L.; Alkorta, I. Development of a High-Throughput Platform to Measure Plasmid Transfer Frequency. Front. Cell. Infect. Microbiol. 2023, 13, 1269732. [Google Scholar] [CrossRef]
  34. Noonan, A.J.C.; Qiu, Y.; Ho, J.C.H.; Ocampo, J.; Vreugdenhil, K.A.; Marr, R.A.; Zhao, Z.; Yoshikuni, Y.; Hallam, S.J. CRAGE-Mediated Insertion of Fluorescent Chromosomal Markers for Accurate and Scalable Measurement of Co-Culture Dynamics in Escherichia Coli. Synth. Biol. 2020, 5, ysaa015. [Google Scholar] [CrossRef] [PubMed]
  35. Couturier, A.; Virolle, C.; Goldlust, K.; Berne-Dedieu, A.; Reuter, A.; Nolivos, S.; Yamaichi, Y.; Bigot, S.; Lesterlin, C. Real-Time Visualisation of the Intracellular Dynamics of Conjugative Plasmid Transfer. Nat. Commun. 2023, 14, 294. [Google Scholar] [CrossRef] [PubMed]
  36. Djermoun, S.; Rode, D.K.H.; Jiménez-Siebert, E.; Netter, N.; Lesterlin, C.; Drescher, K.; Bigot, S. Biofilm Architecture Determines the Dissemination of Conjugative Plasmids. Proc. Natl. Acad. Sci. USA 2025, 122, e2417452122. [Google Scholar] [CrossRef]
  37. Qi, Q.; Kamruzzaman, M.; Iredell, J.R. A Streamlined Approach for Fluorescence Labelling of Low-Copy-Number Plasmids for Determination of Conjugation Frequency by Flow Cytometry. Microorganisms 2023, 11, 878. [Google Scholar] [CrossRef]
  38. Aadland, E.; Andersen, L.B.; Migueles, J.H.; Ortega, F.B.; Kvalheim, O.M. Interpretation of Associations between the Accelerometry Physical Activity Spectrum and Cardiometabolic Health and Locomotor Skills in Two Cohorts of Children Using Raw, Normalized, Log-Transformed, or Compositional Data. J. Sports Sci. 2020, 38, 2708–2719. [Google Scholar] [CrossRef]
  39. Arumugam, T.; Ramalingam, A.; Guerroudj, A.R.; Sambandam, S.; Boukabcha, N.; Chouaih, A. Conformation and Vibrational Spectroscopic Analysis of 2,6-Bis(4-Fluorophenyl)-3,3-Dimethylpiperidin-4-One (BFDP) by DFT Method: A Potent Anti-Parkinson’s, Anti-Lung Cancer, and Anti-Human Infectious Agent. Heliyon 2023, 9, e21315. [Google Scholar] [CrossRef]
  40. Chen, D.; Huang, X.; Fan, Y. Thermodynamics-Based Model Construction for the Accurate Prediction of Molecular Properties From Partition Coefficients. Front. Chem. 2021, 9, 737579. [Google Scholar] [CrossRef]
  41. Izquierdo, R.; Zadorosny, R.; Rosales, M.; Marrero-Ponce, Y.; Cubillan, N. Molecular and Descriptor Spaces for Predicting Initial Rate of Catalytic Homogeneous Quinoline Hydrogenation with Ru, Rh, Os, and Ir Catalysts. ACS Omega 2025, 10, 18312–18331. [Google Scholar] [CrossRef] [PubMed]
  42. Staszak, M.; Staszak, K.; Wieszczycka, K.; Bajek, A.; Roszkowski, K.; Tylkowski, B. Machine Learning in Drug Design: Use of Artificial Intelligence to Explore the Chemical Structure–Biological Activity Relationship. WIREs Comput. Mol. Sci. 2022, 12, e1568. [Google Scholar] [CrossRef]
  43. Niazi, S.K.; Mariam, Z. Recent Advances in Machine-Learning-Based Chemoinformatics: A Comprehensive Review. Int. J. Mol. Sci. 2023, 24, 11488. [Google Scholar] [CrossRef]
  44. Geroldinger, A.; Lusa, L.; Nold, M.; Heinze, G. Leave-One-out Cross-Validation, Penalization, and Differential Bias of Some Prediction Model Performance Measures—A Simulation Study. Diagn. Progn. Res. 2023, 7, 9. [Google Scholar] [CrossRef]
  45. Rodríguez-Macías, J.; Saurith-Coronell, O.; Vargas-Echeverria, C.; Delgado, D.I.; Márquez Brazón, E.A.; De Aguas, R.G.; Mora, J.R.; Paz, J.L.; Marrero-Ponce, Y. Multitarget Design of Steroidal Inhibitors Against Hormone-Dependent Breast Cancer: An Integrated In Silico Approach. Int. J. Mol. Sci. 2025, 26, 7477. [Google Scholar] [CrossRef] [PubMed]
  46. Abdullahi, M.; Adeniji, S.E.; Arthur, D.E.; Musa, S. Quantitative Structure-Activity Relationship (QSAR) Modelling Study of Some Novel Carboxamide Series as New Anti-Tubercular Agents. Bull. Natl. Res. Cent. 2020, 44, 136. [Google Scholar] [CrossRef]
  47. Furusjö, E.; Svenson, A.; Rahmberg, M.; Andersson, M. The Importance of Outlier Detection and Training Set Selection for Reliable Environmental QSAR Predictions. Chemosphere 2006, 63, 99–108. [Google Scholar] [CrossRef]
  48. Schultz, T.W.; Hewitt, M.; Netzeva, T.I.; Cronin, M.T.D. Assessing Applicability Domains of Toxicological QSARs: Definition, Confidence in Predicted Values, and the Role of Mechanisms of Action. QSAR Comb. Sci. 2007, 26, 238–254. [Google Scholar] [CrossRef]
  49. De, P.; Kar, S.; Ambure, P.; Roy, K. Prediction Reliability of QSAR Models: An Overview of Various Validation Tools. Arch. Toxicol. 2022, 96, 1279–1295. [Google Scholar] [CrossRef]
  50. Lowe, C.N.; Charest, N.; Ramsland, C.; Chang, D.T.; Martin, T.M.; Williams, A.J. Transparency in Modeling through Careful Application of OECD’s QSAR/QSPR Principles via a Curated Water Solubility Data Set. Chem. Res. Toxicol. 2023, 36, 465–478. [Google Scholar] [CrossRef] [PubMed]
  51. Kigo, S.N.; Omondi, E.O.; Omolo, B.O. Assessing Predictive Performance of Supervised Machine Learning Algorithms for a Diamond Pricing Model. Sci. Rep. 2023, 13, 17315. [Google Scholar] [CrossRef] [PubMed]
  52. Rácz, A.; Bajusz, D.; Héberger, K. Effect of Dataset Size and Train/Test Split Ratios in QSAR/QSPR Multiclass Classification. Molecules 2021, 26, 1111. [Google Scholar] [CrossRef] [PubMed]
  53. Vogt, M.; Bajorath, J. Ccbmlib—A Python Package for Modeling Tanimoto Similarity Value Distributions. F1000 Res. 2020, 9, 100. [Google Scholar] [CrossRef]
  54. Paździora, W.; Paśko, P.; Grabowska, K.; Galanty, A. Can Isoflavone-Rich Legume Plants Be Useful in the Chemoprevention of Hormone-Dependent Cancers?—A Systematic Review. Int. J. Mol. Sci. 2024, 25, 7389. [Google Scholar] [CrossRef]
  55. Ferriere, F.; Aasi, N.; Flouriot, G.; Pakdel, F. Exploring the Complex Mechanisms of Isoflavones: From Cell Bioavailability, to Cell Dynamics and Breast Cancer. Phytother. Res. 2025, 39, 957–979. [Google Scholar] [CrossRef]
  56. Liu, X.; Ouyang, S.; Yu, B.; Liu, Y.; Huang, K.; Gong, J.; Zheng, S.; Li, Z.; Li, H.; Jiang, H. PharmMapper Server: A Web Server for Potential Drug Target Identification Using Pharmacophore Mapping Approach. Nucleic Acids Res. 2010, 38, W609–W614. [Google Scholar] [CrossRef]
  57. Wang, X.; Shen, Y.; Wang, S.; Li, S.; Zhang, W.; Liu, X.; Lai, L.; Pei, J.; Li, H. PharmMapper 2017 Update: A Web Server for Potential Drug Target Identification with a Comprehensive Target Pharmacophore Database. Nucleic Acids Res. 2017, 45, W356–W360. [Google Scholar] [CrossRef]
  58. Pantsar, T.; Poso, A. Binding Affinity via Docking: Fact and Fiction. Molecules 2018, 23, 1899. [Google Scholar] [CrossRef]
  59. Megantara, S.; Rusdin, A.; Budiman, A.; Puluhulawa, L.; Ikram, N.; Muchtaridi, M. Demonstrating the Absence of Correlation Between Molecular Docking and in Vitro Cytotoxicity in Anti-Breast Cancer Research: Root Causes and Practical Resolutions. Breast Cancer Targets Ther. 2025, 17, 1005–1023. [Google Scholar] [CrossRef]
  60. Mahmud, A.W.; Shallangwa, G.A.; Uzairu, A. QSAR and Molecular Docking Studies of 1,3-Dioxoisoindoline-4-Aminoquinolines as Potent Antiplasmodium Hybrid Compounds. Heliyon 2020, 6, e03449. [Google Scholar] [CrossRef]
  61. Ricci-Lopez, J.; Aguila, S.A.; Gilson, M.K.; Brizuela, C.A. Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning. J. Chem. Inf. Model. 2021, 61, 5362–5376. [Google Scholar] [CrossRef]
  62. Nikolova, E.N.; Stanfield, R.L.; Dyson, H.J.; Wright, P.E. A Conformational Switch in the Zinc Finger Protein Kaiso Mediates Differential Readout of Specific and Methylated DNA Sequences. Biochemistry 2020, 59, 1909–1926. [Google Scholar] [CrossRef]
  63. Yoshua, S.B.; Watson, G.D.; Howard, J.A.L.; Velasco-Berrelleza, V.; Leake, M.C.; Noy, A. Integration Host Factor Bends and Bridges DNA in a Multiplicity of Binding Modes with Varying Specificity. Nucleic Acids Res. 2021, 49, 8684–8698. [Google Scholar] [CrossRef]
  64. Liu, Y.; Yang, X.; Gan, J.; Chen, S.; Xiao, Z.-X.; Cao, Y. CB-Dock2: Improved Protein–Ligand Blind Docking by Integrating Cavity Detection, Docking and Homologous Template Fitting. Nucleic Acids Res. 2022, 50, W159–W164. [Google Scholar] [CrossRef] [PubMed]
  65. Le Guilloux, V.; Schmidtke, P.; Tuffery, P. Fpocket: An Open Source Platform for Ligand Pocket Detection. BMC Bioinform. 2009, 10, 168. [Google Scholar] [CrossRef] [PubMed]
  66. Santos-Martins, D.; Solis-Vasquez, L.; Tillack, A.F.; Sanner, M.F.; Koch, A.; Forli, S. Accelerating AutoDock4 with GPUs and Gradient-Based Local Search. J. Chem. Theory Comput. 2021, 17, 1060–1073. [Google Scholar] [CrossRef]
  67. Kojasoy, V.; Tantillo, D.J. Impacts of Noncovalent Interactions Involving Sulfur Atoms on Protein Stability, Structure, Folding, and Bioactivity. Org. Biomol. Chem. 2023, 21, 11–23. [Google Scholar] [CrossRef]
  68. Aledo, J.C.; Cantón, F.R.; Veredas, F.J. Sulphur Atoms from Methionines Interacting with Aromatic Residues Are Less Prone to Oxidation. Sci. Rep. 2015, 5, 16955. [Google Scholar] [CrossRef] [PubMed]
  69. Trott, O.; Olson, A.J. AutoDock Vina: Improving the Speed and Accuracy of Docking with a New Scoring Function, Efficient Optimization, and Multithreading. J. Comput. Chem. 2010, 31, 455–461. [Google Scholar] [CrossRef]
  70. Eberhardt, J.; Santos-Martins, D.; Tillack, A.F.; Forli, S. AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. J. Chem. Inf. Model. 2021, 61, 3891–3898. [Google Scholar] [CrossRef]
  71. Srivastava, M.; Mittal, L.; Kumari, A.; Agrahari, A.K.; Singh, M.; Mathur, R.; Asthana, S. Characterizing (Un)Binding Mechanism of USP7 Inhibitors to Unravel the Cause of Enhanced Binding Potencies at Allosteric Checkpoint. Protein Sci. 2022, 31, e4398. [Google Scholar] [CrossRef] [PubMed]
  72. Xue, B.; Yang, Q.; Zhang, Q.; Wan, X.; Fang, D.; Lin, X.; Sun, G.; Gobbo, G.; Cao, F.; Mathiowetz, A.M.; et al. Development and Comprehensive Benchmark of a High-Quality AMBER-Consistent Small Molecule Force Field with Broad Chemical Space Coverage for Molecular Modeling and Free Energy Calculation. J. Chem. Theory Comput. 2024, 20, 799–818. [Google Scholar] [CrossRef]
  73. Huai, Z.; Shen, Z.; Sun, Z. Binding Thermodynamics and Interaction Patterns of Inhibitor-Major Urinary Protein-I Binding from Extensive Free-Energy Calculations: Benchmarking AMBER Force Fields. J. Chem. Inf. Model. 2021, 61, 284–297. [Google Scholar] [CrossRef] [PubMed]
  74. Land, H.; Humble, M.S. YASARA: A Tool to Obtain Structural Guidance in Biocatalytic Investigations. In Protein Engineering; Springer: Berlin, Germany, 2018; pp. 43–67. [Google Scholar]
  75. Rampogu, S.; Shaik, B.; Kim, J.H.; Jung, T.S.; Ha, M.W.; Lee, K.W. Explicit Molecular Dynamics Simulation Studies to Discover Novel Natural Compound Analogues as Mycobacterium Tuberculosis Inhibitors. Heliyon 2023, 9, e13324. [Google Scholar] [CrossRef]
  76. Dong, L.; Li, P.; Wang, B. Enhancing MM /P(G) BSA Methods: Integration of Formulaic Entropy for Improved Binding Free Energy Calculations. J. Comput. Chem. 2025, 46, e70093. [Google Scholar] [CrossRef]
  77. Dessertaine, T.; Moran, J.; Benzaquen, M.; Bouchaud, J.-P. Out-of-Equilibrium Dynamics and Excess Volatility in Firm Networks. J. Econ. Dyn. Control 2022, 138, 104362. [Google Scholar] [CrossRef]
  78. Sraphet, S.; Javadi, B. Application of Hierarchical Clustering to Analyze Solvent-Accessible Surface Area Patterns in Amycolatopsis Lipases. Biology 2022, 11, 652. [Google Scholar] [CrossRef]
  79. Ahmad, B.; Saeed, A.; Al-Amery, A.; Celik, I.; Ahmed, I.; Yaseen, M.; Khan, I.A.; Al-Fahad, D.; Bhat, M.A. Investigating Potential Cancer Therapeutics: Insight into Histone Deacetylases (HDACs) Inhibitions. Pharmaceuticals 2024, 17, 444. [Google Scholar] [CrossRef]
  80. Benabderrahmane, M.; Bureau, R.; Voisin-Chiret, A.S.; Sopkova-de Oliveira Santos, J. Insights into Mcl-1 Conformational States and Allosteric Inhibition Mechanism from Molecular Dynamics Simulations, Enhanced Sampling, and Pocket Crosstalk Analysis. J. Chem. Inf. Model. 2020, 60, 3172–3187. [Google Scholar] [CrossRef]
  81. Ose, N.J.; Butler, B.M.; Kumar, A.; Kazan, I.C.; Sanderford, M.; Kumar, S.; Ozkan, S.B. Dynamic Coupling of Residues within Proteins as a Mechanistic Foundation of Many Enigmatic Pathogenic Missense Variants. PLoS Comput. Biol. 2022, 18, e1010006. [Google Scholar] [CrossRef] [PubMed]
  82. Zhu, S.; Sridhar, A.; Teng, J.; Howard, R.J.; Lindahl, E.; Hibbs, R.E. Structural and Dynamic Mechanisms of GABAA Receptor Modulators with Opposing Activities. Nat. Commun. 2022, 13, 4582. [Google Scholar] [CrossRef]
  83. Zhu, R.; Wu, C.; Zha, J.; Lu, S.; Zhang, J. Decoding Allosteric Landscapes: Computational Methodologies for Enzyme Modulation and Drug Discovery. RSC Chem. Biol. 2025, 6, 539–554. [Google Scholar] [CrossRef]
  84. Hertig, S.; Latorraca, N.R.; Dror, R.O. Revealing Atomic-Level Mechanisms of Protein Allostery with Molecular Dynamics Simulations. PLoS Comput. Biol. 2016, 12, e1004746. [Google Scholar] [CrossRef] [PubMed]
  85. Zhu, J.; Wang, J.; Han, W.; Xu, D. Neural Relational Inference to Learn Long-Range Allosteric Interactions in Proteins from Molecular Dynamics Simulations. Nat. Commun. 2022, 13, 1661. [Google Scholar] [CrossRef]
  86. Bobate, S.; Mahalle, S.; Dafale, N.A.; Bajaj, A. Emergence of Environmental Antibiotic Resistance: Mechanism, Monitoring and Management. Environ. Adv. 2023, 13, 100409. [Google Scholar] [CrossRef]
  87. Nadeem, S.F.; Gohar, U.F.; Tahir, S.F.; Mukhtar, H.; Pornpukdeewattana, S.; Nukthamna, P.; Moula Ali, A.M.; Bavisetty, S.C.B.; Massa, S. Antimicrobial Resistance: More than 70 Years of War between Humans and Bacteria. Crit. Rev. Microbiol. 2020, 46, 578–599. [Google Scholar] [CrossRef]
  88. dos Santos, K.L.B.; Cruz, J.N.; Silva, L.B.; Ramos, R.S.; Neto, M.F.A.; Lobato, C.C.; Ota, S.S.B.; Leite, F.H.A.; Borges, R.S.; Silva, C.H.T.P.d.; et al. Identification of Novel Chemical Entities for Adenosine Receptor Type 2A Using Molecular Modeling Approaches. Molecules 2020, 25, 1245. [Google Scholar] [CrossRef]
  89. Dempsey, W.B.; Fee, B.E. Integration Host Factor Affects Expression of Two Genes at the Conjugal Transfer Origin of Plasmid R100. Mol. Microbiol. 1990, 4, 1019–1028. [Google Scholar] [CrossRef] [PubMed]
  90. Gamas, P.; Caro, L.; Galas, D.; Chandler, M. Expression of F Transfer Functions Depends on the Escherichia Coli Integration Host Factor. Mol. Gen. Genet. 1987, 207, 302–305. [Google Scholar] [CrossRef] [PubMed]
  91. Bragagnolo, N.; Rodriguez, C.; Samari-Kermani, N.; Fours, A.; Korouzhdehi, M.; Lysenko, R.; Audette, G.F. Protein Dynamics in F-like Bacterial Conjugation. Biomedicines 2020, 8, 362. [Google Scholar] [CrossRef]
  92. Kasho, K.; Oshima, T.; Chumsakul, O.; Nakamura, K.; Fukamachi, K.; Katayama, T. Whole-Genome Analysis Reveals That the Nucleoid Protein IHF Predominantly Binds to the Replication Origin OriC Specifically at the Time of Initiation. Front. Microbiol. 2021, 12, 697712. [Google Scholar] [CrossRef]
  93. Ghahremanian, S.; Rashidi, M.M.; Raeisi, K.; Toghraie, D. Molecular Dynamics Simulation Approach for Discovering Potential Inhibitors against SARS-CoV-2: A Structural Review. J. Mol. Liq. 2022, 354, 118901. [Google Scholar] [CrossRef]
  94. Misra, R.; Maity, A.; Kundu, S.; Bhunia, M.; Nanda, B.; Maiti, N.C.; Pal, U. Loop Dynamics and Conformational Flexibility in Dengue Serine Protease Activity: Noninvasive Perturbation by Solvent Exchange. J. Chem. Inf. Model. 2023, 63, 2122–2132. [Google Scholar] [CrossRef]
  95. Astore, M.A.; Pradhan, A.S.; Thiede, E.H.; Hanson, S.M. Protein Dynamics Underlying Allosteric Regulation. Curr. Opin. Struct. Biol. 2024, 84, 102768. [Google Scholar] [CrossRef] [PubMed]
  96. Hernández González, J.E.; Alberca, L.N.; Masforrol González, Y.; Reyes Acosta, O.; Talevi, A.; Salas-Sarduy, E. Tetracycline Derivatives Inhibit Plasmodial Cysteine Protease Falcipain-2 through Binding to a Distal Allosteric Site. J. Chem. Inf. Model. 2022, 62, 159–175. [Google Scholar] [CrossRef] [PubMed]
  97. Kasho, K.; Ozaki, S.; Katayama, T. IHF and Fis as Escherichia Coli Cell Cycle Regulators: Activation of the Replication Origin OriC and the Regulatory Cycle of the DnaA Initiator. Int. J. Mol. Sci. 2023, 24, 11572. [Google Scholar] [CrossRef]
  98. Pozdeev, G.; Beckett, M.C.; Mogre, A.; Thomson, N.R.; Dorman, C.J. Reciprocally Rewiring and Repositioning the Integration Host Factor (IHF) Subunit Genes in Salmonella enterica Serovar Typhimurium: Impacts on Physiology and Virulence. Microb. Genom. 2022, 8, 000768. [Google Scholar] [CrossRef] [PubMed]
  99. Shaker, B.; Ahmad, S.; Lee, J.; Jung, C.; Na, D. In Silico Methods and Tools for Drug Discovery. Comput. Biol. Med. 2021, 137, 104851. [Google Scholar] [CrossRef]
  100. Verma, A.; Kumar, V.; Naik, B.; Masood Khan, J.; Singh, P.; Erik Joakim Saris, P.; Gupta, S. Screening and Molecular Dynamics Simulation of Compounds Inhibiting MurB Enzyme of Drug-Resistant Mycobacterium Tuberculosis: An in-Silico Approach. Saudi J. Biol. Sci. 2023, 30, 103730. [Google Scholar] [CrossRef]
  101. Dennington, R.; Keith, T.A.; Millam, J.M. GaussView, Version 6.1; Semichem Inc.: Shawnee, KS, USA, 2016.
  102. Schultz, L.E.; Wang, Y.; Jacobs, R.; Morgan, D. A General Approach for Determining Applicability Domain of Machine Learning Models. NPJ Comput. Mater. 2025, 11, 95. [Google Scholar] [CrossRef]
  103. Chung, N.C.; Miasojedow, B.; Startek, M.; Gambin, A. Jaccard/Tanimoto Similarity Test and Estimation Methods for Biological Presence-Absence Data. BMC Bioinform. 2019, 20, 644. [Google Scholar] [CrossRef] [PubMed]
  104. Bajusz, D.; Rácz, A.; Héberger, K. Why Is Tanimoto Index an Appropriate Choice for Fingerprint-Based Similarity Calculations? J. Cheminform. 2015, 7, 20. [Google Scholar] [CrossRef] [PubMed]
  105. Rice, P.A.; Yang, S.; Mizuuchi, K.; Nash, H.A. Crystal Structure of an IHF-DNA Complex: A Protein-Induced DNA U-Turn. Cell 1996, 87, 1295–1306. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Correlation between the observed APII values and those predicted by the 3D-QSAR model, using both OLS regression and LOOCV approaches.
Figure 1. Correlation between the observed APII values and those predicted by the 3D-QSAR model, using both OLS regression and LOOCV approaches.
Ijms 27 02526 g001
Figure 2. 3D structure of the representative compound, ICCB-3.
Figure 2. 3D structure of the representative compound, ICCB-3.
Ijms 27 02526 g002
Figure 3. 2D interaction map illustrating the key molecular contacts observed in the docking results for the most representative compounds bound to the IHF protein via the direct inhibition mechanism.
Figure 3. 2D interaction map illustrating the key molecular contacts observed in the docking results for the most representative compounds bound to the IHF protein via the direct inhibition mechanism.
Ijms 27 02526 g003
Figure 4. 2D interaction map illustrating the key molecular contacts observed in the docking results for the most representative compounds bound to the IHF protein via the allosteric inhibition mechanism.
Figure 4. 2D interaction map illustrating the key molecular contacts observed in the docking results for the most representative compounds bound to the IHF protein via the allosteric inhibition mechanism.
Ijms 27 02526 g004
Figure 5. Validation of the redocking protocol for assessing both direct and allosteric inhibition mechanisms targeting the IHF protein.
Figure 5. Validation of the redocking protocol for assessing both direct and allosteric inhibition mechanisms targeting the IHF protein.
Ijms 27 02526 g005
Figure 6. Variation in binding energy (expressed in kJ/mol) throughout a 200-nanosecond molecular dynamics simulation for both direct and allosteric inhibition mechanisms of the IHF protein.
Figure 6. Variation in binding energy (expressed in kJ/mol) throughout a 200-nanosecond molecular dynamics simulation for both direct and allosteric inhibition mechanisms of the IHF protein.
Ijms 27 02526 g006
Figure 7. RMSD profiles over the simulation time for both direct and allosteric inhibition mechanisms of the IHF protein.
Figure 7. RMSD profiles over the simulation time for both direct and allosteric inhibition mechanisms of the IHF protein.
Ijms 27 02526 g007
Figure 8. 2D interaction map illustrating the key molecular contacts observed in the MD results for the most representative compounds bound to the IHF protein via the direct (A) and allosteric (B) inhibition mechanisms.
Figure 8. 2D interaction map illustrating the key molecular contacts observed in the MD results for the most representative compounds bound to the IHF protein via the direct (A) and allosteric (B) inhibition mechanisms.
Ijms 27 02526 g008
Figure 9. Temporal evolution of the Rg and SASA throughout the molecular dynamics simulations for both direct and allosteric inhibition mechanisms of the IHF protein.
Figure 9. Temporal evolution of the Rg and SASA throughout the molecular dynamics simulations for both direct and allosteric inhibition mechanisms of the IHF protein.
Ijms 27 02526 g009
Figure 10. RMSF profiles obtained from molecular dynamics simulations for both direct and allosteric inhibition mechanisms of the IHF protein.
Figure 10. RMSF profiles obtained from molecular dynamics simulations for both direct and allosteric inhibition mechanisms of the IHF protein.
Ijms 27 02526 g010
Figure 11. DCCM analysis, highlighting the correlated motions between the ligand and the contact residues within the IHF protein, particularly those associated with the DNA-binding site.
Figure 11. DCCM analysis, highlighting the correlated motions between the ligand and the contact residues within the IHF protein, particularly those associated with the DNA-binding site.
Ijms 27 02526 g011
Table 1. Anti-Plasmid Inhibition Index (APII) values and corresponding Compound Identifiers (CIDs) for the molecules in the QSAR training dataset.
Table 1. Anti-Plasmid Inhibition Index (APII) values and corresponding Compound Identifiers (CIDs) for the molecules in the QSAR training dataset.
ICCB-IDTrivial-NAMECIDAPIIICCB-IDTrivial-NAMECIDAPII
ICCB-1Bromfenac607260.347ICCB-34Lonidamine395620.569
ICCB-2NPPB45490.553ICCB-35MG-1324623820.409
ICCB-38-Hydroxydaidzein54661390.367ICCB-36MK-2461441379460.733
ICCB-4AZD8055252629650.347ICCB-37Alisertib247718670.382
ICCB-5Amorolfine542600.648ICCB-38Laselipag99318910.328
ICCB-6Aripiprazole607950.403ICCB-39FG 714243750.561
ICCB-7Azacitidine94440.530ICCB-40BRN 21434512019860.530
ICCB-8MS-2733069145730.523ICCB-41SCHEMBL991061161783050.545
ICCB-9Baicalein52816050.469ICCB-42Oxaprozin46140.367
ICCB-10CFTRinh 17215542100.362ICCB-43Proflavine70990.444
ICCB-11Cinchophen85930.352ICCB-44Pentamidine47350.658
ICCB-12Carprofen25810.553ICCB-45Hezamidine651300.642
ICCB-13CH5561846670.530ICCB-46Piceatannol6676390.638
ICCB-14Chrysin52816070.450ICCB-47Pipemidic acid48310.611
ICCB-15DCPIB100711660.415ICCB-48Pro-Banthine92790.432
ICCB-16CHEMBL161871853788250.415ICCB-49Quinalizarin50040.319
ICCB-17Dequalinium29930.409ICCB-50AC1LCVGT6567170.367
ICCB-18Diflunisal30590.495ICCB-51Rifabutina1354155640.420
ICCB-19Dobutamine368110.561ICCB-52SC581251152390.415
ICCB-20Efloxate83950.382ICCB-53SH-4-54721886430.310
ICCB-21FG 714243750.495ICCB-54Lintitript1220770.393
ICCB-22Febuxostat1340180.481ICCB-55STX-011942532360.502
ICCB-23Genistein52809610.398ICCB-56Sertaconazole2001030.456
ICCB-24Glyburide34880.577ICCB-57Shikonin4795030.409
ICCB-25Glimepiride34760.530ICCB-58DTXSID60587903167605130.438
ICCB-26Gliquidone916100.668ICCB-59TPCA-199037860.469
ICCB-27Icatibant69181730.382ICCB-60Terbinafine15490080.387
ICCB-28Idoxuridine59050.481ICCB-61Tipifarnib1593240.495
ICCB-29Imperatorin102120.377ICCB-62Toceranib53291060.456
ICCB-30Isoliquiritigenin6382780.577ICCB-63402-71-14396470.409
ICCB-31Kasugamycin651740.620ICCB-64UK-38336798186820.403
ICCB-32LY-2552831220230.561ICCB-65Vidofludimus98200080.509
ICCB-33Lincomycin647100.415
Table 2. Molecular descriptors and their corresponding regression coefficients contribute to the construction of the 3D-QSAR model.
Table 2. Molecular descriptors and their corresponding regression coefficients contribute to the construction of the 3D-QSAR model.
CodeDescriptorCoefficient (β)R2 Contributions
Intercept0.6445
X1I50_B_AB_nCi_2_M3_SS1_T_KA_h−s_MID−1.520022.02%
X2AC[4]_K_F_AB_Ci(−1.25;1.25)_2_M10_MP0_T_LGP[2]_psa_MID−0.003819.35%
X3IB_S_B_AB_nCi_2_M3_NS0_M_LGL[2–3]_a_p_MID−0.020326.10%
X4AC[1]_S_B_AB_nCi_2_M3_NS0_T_LGP[2]_e_p_MID−0.03639.32%
X5AC[3]_K_F_AB_Ci(0.25;−0.25)_2_M15_MP1_X_KA_c_MID0.00389.60%
X6MN_TrC_AB_nCi_3_M20(M16)_SS2_T_LG3P[1]_LGP[1]_h_MID6.02676.77%
X7IB_S_F_AB_Ci(0.25;−0.25)_2_M10_SS0_X_KA_c_MID0.00466.84%
Table 3. Statistical parameters of the QSAR model developed for steroidal compounds exhibiting inhibitory activity against plasmid transfer.
Table 3. Statistical parameters of the QSAR model developed for steroidal compounds exhibiting inhibitory activity against plasmid transfer.
MetricOLS-Hold-OutLOOCV
R (Correlation Coefficient)0.96730.9076
R2 (Determination Coefficient)0.90340.8232
R2ext0.9034NaN
Q2 (Predictive Squared Correlation)NaN0.8232
QMRF (Mean Squared Prediction Error)0.00140.0017
RMSEP (Square Root of MSE)0.03780.0413
MAEext0.02830.0337
(R2 − R02)/R2 < 0.10.03540.0000
(R2 − R02)/R2 < 0.10.02770.0313
|R02 − R02| < 0.10.00700.0258
0.85 < K < 1.151.04660.9992
0.85 < k’ < 1.150.95170.9935
Table 4. Docking binding energies (ΔG, in kcal/mol) calculated for both direct and allosteric inhibition modes of the QSAR training set compounds when docked against the Integration Host Factor (IHF) protein (PDB ID: 1IHF).
Table 4. Docking binding energies (ΔG, in kcal/mol) calculated for both direct and allosteric inhibition modes of the QSAR training set compounds when docked against the Integration Host Factor (IHF) protein (PDB ID: 1IHF).
Direct Inhibition
ICCB_IDMeanBestEnergyDesvStRMSDICCB_IDMeanBestEnergyDesvStRMSD
ICCB-3−6.06−6.220.061.96ICCB-38−6.42−7.10.271.81
ICCB-4−7.83−8.010.081.33ICCB-43−5.27−5.280.000.05
ICCB-7−6.28−6.640.170.43ICCB-44−6.38−6.920.270
ICCB-8−7.36−7.890.221.76ICCB-45−7.80−8.230.20.46
ICCB-9−6.38−6.470.071.76ICCB-47−7.46−7.810.181.85
ICCB-15−6.89−7.290.270.88ICCB-49−7.01−7.07−6.970.45
ICCB-23−6.47−6.690.150.3ICCB-51−9.31−9.570.140.46
ICCB-24−6.47−6.690.151.63ICCB-53−8.48−9.490.321
ICCB-25−8.07−8.690.21.75ICCB-54−7.34−7.880.261.18
ICCB-26−8.68−9.410.310ICCB-55−8.26−8.420.070.28
ICCB-27−7.89−8.470.260.98ICCB-56−6.81−7.960.321.37
ICCB-31−7.63−7.960.171.21ICCB-58−7.63−7.980.161.64
ICCB-32−6.35−7.070.340.83ICCB-61−7.02−7.410.161.48
ICCB-33−7.67−7.940.160.87ICCB-62−7.24−7.540.121.71
ICCB-36−8.55−9.460.341.93ICCB-65−7.26−7.790.181.15
ICCB-37−8.06−8.410.16−7.67
Allosteric Inhibition
ICCB_IDMeanBestEnergyDesvStRMSDICCB_IDMeanBestEnergyDesvStRMSD
ICCB-3−7.47−7.490.010.13ICCB-38−7.98−8.230.140.88
ICCB-4−10.2−10.260.040.4ICCB-43−7.06−7.120.040.23
ICCB-7−7.3−7.390.020.84ICCB-44−8.17−8.480.141.01
ICCB-8−8.95−9.250.141.8ICCB-45−10.39−10.740.171.26
ICCB-9−7.35−7.410.060.15ICCB-47−8.15−8.280.080.31
ICCB-15−7.74−8.210.160.86ICCB-49−7.26−7.360.050.49
ICCB-23−7.1−7.340.060.9ICCB-51−10.26−10.620.160.66
ICCB-24−10.59−11.270.231.04ICCB-53−11.34−11.620.181.08
ICCB-25−11−11.360.140.98ICCB-54−8.83−9.120.120.63
ICCB−26−11.58−12.150.221.12ICCB-55−10.25−10.290.020.48
ICCB-27−7.88−8.960.71.14ICCB-56−8.86−8.730.131.38
ICCB-31−8.2−8.910.260.76ICCB-58−9.85−10.070.110.60
ICCB-32−7.68−8.840.481.58ICCB-61−9.22−9.640.061.70
ICCB-33−8.59−8.360.161.16ICCB-62−9.64−10.040.151.60
ICCB-36−10.38−10.610.131.64ICCB-65−8.86−9.180.161.75
ICCB-37−9.73−10.240.251.8
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Saurith-Coronell, O.; Sierra-Hernandez, O.; Rodríguez-Macías, J.D.; Mora, J.R.; Perez-Perez, N.; Alcázar, J.J.; Moura, R.O.d.; Nascimento, I.J.d.S.; Márquez Brazón, E.A.; Marrero-Ponce, Y. Computational Identification of Potential Novel Allosteric IHF Inhibitors Using QSAR Modeling to Inhibit Plasmid-Mediated Antibiotic Resistance. Int. J. Mol. Sci. 2026, 27, 2526. https://doi.org/10.3390/ijms27062526

AMA Style

Saurith-Coronell O, Sierra-Hernandez O, Rodríguez-Macías JD, Mora JR, Perez-Perez N, Alcázar JJ, Moura ROd, Nascimento IJdS, Márquez Brazón EA, Marrero-Ponce Y. Computational Identification of Potential Novel Allosteric IHF Inhibitors Using QSAR Modeling to Inhibit Plasmid-Mediated Antibiotic Resistance. International Journal of Molecular Sciences. 2026; 27(6):2526. https://doi.org/10.3390/ijms27062526

Chicago/Turabian Style

Saurith-Coronell, Oscar, Olimpo Sierra-Hernandez, Juan David Rodríguez-Macías, José R. Mora, Noel Perez-Perez, Jackson J. Alcázar, Ricardo Olimpio de Moura, Igor José dos Santos Nascimento, Edgar A. Márquez Brazón, and Yovani Marrero-Ponce. 2026. "Computational Identification of Potential Novel Allosteric IHF Inhibitors Using QSAR Modeling to Inhibit Plasmid-Mediated Antibiotic Resistance" International Journal of Molecular Sciences 27, no. 6: 2526. https://doi.org/10.3390/ijms27062526

APA Style

Saurith-Coronell, O., Sierra-Hernandez, O., Rodríguez-Macías, J. D., Mora, J. R., Perez-Perez, N., Alcázar, J. J., Moura, R. O. d., Nascimento, I. J. d. S., Márquez Brazón, E. A., & Marrero-Ponce, Y. (2026). Computational Identification of Potential Novel Allosteric IHF Inhibitors Using QSAR Modeling to Inhibit Plasmid-Mediated Antibiotic Resistance. International Journal of Molecular Sciences, 27(6), 2526. https://doi.org/10.3390/ijms27062526

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop