Computational Workflow for Chemical Compound Analysis: From Structure Generation to Molecular Docking

García-Díaz, Jesus Magdiel; Garibaldi-Ríos, Asbiel Felipe; Gallegos-Arreola, Martha Patricia; Gutiérrez-Gutiérrez, Filiberto; Delgado-Saucedo, Jorge Iván; Martínez-Velázquez, Moisés; Puebla-Pérez, Ana María

doi:10.3390/scipharm94010009

Open Access Review

by Jesus Magdiel García-Díaz ¹, Asbiel Felipe Garibaldi-Ríos ²,³, Martha Patricia Gallegos-Arreola ², Filiberto Gutiérrez-Gutiérrez ⁴, Jorge Iván Delgado-Saucedo ⁴, Moisés Martínez-Velázquez ¹,* and Ana María Puebla-Pérez ⁴,*

¹ Unidad de Biotecnología Médica y Farmacéutica, Centro de Investigación y Asistencia en Tecnología y Diseño del Estado de Jalisco A.C., Guadalajara 44270, Jalisco, Mexico

² División de Genética, Centro de Investigación Biomédica de Occidente (CIBO), Centro Médico Nacional de Occidente (CMNO), Instituto Mexicano del Seguro Social (IMSS), Guadalajara 44340, Jalisco, Mexico

³ Doctorado en Genetica Humana, Centro Universitario de Ciencias de la Salud (CUCS), Universidad de Guadalajara (UdeG), Guadalajara 44340, Jalisco, Mexico

⁴ Departamento de Farmacobiología, Centro Universitario de Ciencias Exactas e Ingenierías (CUCEI), Universidad de Guadalajara (UdeG), Guadalajara 4430, Jalisco, Mexico

* Authors to whom correspondence should be addressed.

Sci. Pharm. 2026, 94(1), 9; https://doi.org/10.3390/scipharm94010009

Submission received: 29 November 2025 / Revised: 27 December 2025 / Accepted: 8 January 2026 / Published: 13 January 2026

(This article belongs to the Topic Bioinformatics in Drug Design and Discovery—2nd Edition)

Abstract

Drug discovery is a complex and expensive process in which only a small proportion of candidate molecules reach clinical approval. Computational methods, particularly computer-aided drug design (CADD), have become fundamental to accelerate and optimize early stages of discovery by integrating chemical, biological, and pharmacokinetic information into predictive models. This review outlines a complete computational workflow for chemical compound analysis, covering molecular structure generation, database selection, evaluation of absorption, distribution, metabolism, excretion and toxicity (ADMET), target prediction, and molecular docking. It focuses on freely accessible and web-based tools that enable reproducible, cost-effective, and scalable in silico studies. Key platforms such as PubChem, ChEMBL, RDKit, SwissADME, TargetNet, and SwissDock are highlighted as examples of how different resources can be integrated to support rational compound design and prioritization. The article also discusses essential methodological principles, data curation strategies, and common limitations in virtual screening and docking analyses. Finally, it explores future directions in computational drug discovery, including the incorporation of artificial intelligence, multi-omics integration, and quantum simulations, to enhance predictive accuracy and translational relevance.

Keywords: computer-aided drug design; cheminformatics; molecular docking; virtual screening; ADMET prediction; target prediction; QSAR modeling; computational drug discovery

1. Introduction

Drug discovery and development are highly complex, costly, and time-consuming, with only a small fraction of candidates reaching the market [1]. Between 2009 and 2018, the U.S. Food and Drug Administration (FDA) authorized 355 new drugs; after accounting for failed trials, the median capitalized investment per drug was estimated at USD 985.3 million, with a mean of USD 1335.9 million [2,3]. To mitigate these challenges, computer-aided drug design (CADD) and artificial intelligence (AI) have become central to modern discovery, complementing structure-based and ligand-based approaches [4].

CADD provides a framework for storing, managing, and modeling chemical entities, supporting hit identification, lead optimization, ADMET profiling, and early safety evaluation [4,5]. A key advantage of CADD is its capacity to screen large chemical libraries efficiently, thereby reducing experimental burden as well as overall time and cost. Its predictive capabilities facilitate the early prioritization of promising lead candidates, minimizing the resources allocated to compounds with limited therapeutic potential. Beyond these operational advantages, CADD affords mechanistic insight into drug–receptor interactions, enabling a rational interpretation of binding modes and affinity determinants [6,7].

Between 2021 and 2023, numerous CADD-driven studies reported the successful identification of bioactive compounds across a broad range of therapeutic targets, including inhibitors of promiscuous ABC transporters [8]. In prostate cancer research, VPC-17821 and VPC-17160 were discovered as potential treatments through targeting of the androgen receptor and DNA binding domain dimerization [9], while VPC-70619 was reported to target N-Myc–Max signaling [10]. Additional studies described the discovery of an inhibitor of the MyD88–Toll/interleukin-1 receptor domain [11] and compound 64, an inhibitor of Bruton’s tyrosine kinase [12]. Further examples include XST-119, a novel inhibitor of FOXM1 [13], and Zinc-09, which targets serum- and glucocorticoid-regulated kinase 3 (SGK3) [14], together with the identification of a novel class of D-amino acid oxidase inhibitors for schizophrenia [15]. CADD-based investigations also enabled the discovery of a cannabinoid antagonist [16], a 5-HT2A receptor agonist [17], and ROCK1 kinase inhibitors [18]. Beyond human targets, computational approaches supported the identification of 4,5-pin2bpy (L4), an inhibitor of the Staphylococcus aureus MurG enzyme [19], as well as CDMS-01, which targets the Trypanosoma cruzi sirtuin 2 enzyme for Chagas disease [20]. Finally, drug repositioning efforts led to the identification of nifuroxazide (NFZ) as a potent inhibitor of E-26 transformation-specific (ETS)-related gene (ERG) in prostate cancer [21]. These examples highlight the versatility and translational relevance of CADD-based pipelines. Importantly, CADD has also enabled the identification and repositioning of FDA-approved drugs, with deep learning-based approaches predicting antiviral activity against SARS-CoV-2 for agents such as atazanavir, remdesivir, efavirenz, ritonavir, and dolutegravir, underscoring the clinical impact of CADD-assisted drug discovery [22].

Key methodologies of CADD include virtual screening, molecular docking, quantitative structure–activity relationship (QSAR) modeling, and de novo design, which together enable prediction of drug–target interactions and optimization of bioactive compounds [4,5,6]. These strategies are generally divided into structure-based drug design (SBDD), which relies on target structures obtained from X-ray crystallography, nuclear magnetic resonance, or homology modeling, and ligand-based drug design, which relies on the properties of known active ligands when a target structure is unavailable [23].

The exponential growth of genomic data, high-throughput screening results, and curated databases such as ChemBank, DrugBank, and ChemSpider [24] has further strengthened the role of cheminformatics and bioinformatics. This vast data growth facilitates both SBDD (by providing more target protein structures) and ligand-based design (by offering more bioactivity data for machine learning and QSAR models). These fields support property analysis, pharmacokinetic and toxicity prediction, drug–target interaction modeling, and the exploration of biological pathways and genetic variants [25,26]. Together, they provide the foundation for integrated workflows that link chemical structure generation, predictive modeling, and molecular docking.

This review proposes a computational workflow for drug discovery, spanning from linear molecular representations and three-dimensional structure generation to molecular docking. The manuscript is organized to first discuss chemical databases and virtual libraries for compound selection, followed by molecular structure representation and input formats, ADMET property evaluation, target prediction strategies, and molecular docking methodologies. Emphasis is placed on open-access and web-based platforms, with the aim of illustrating how representative tools can be integrated into practical pipelines that support rational compound design and assist researchers during early-stage drug discovery.

2. Chemical Databases and Libraries in Virtual Screening

Molecules can be regarded as the fundamental units of a chemical language that encodes atomic composition, bonding patterns, charges, physicochemical attributes, and biological activities. Mastery of this language is essential for de novo drug design, where the objective is to generate chemically valid molecules with favorable pharmacological properties [27].

Historically, lead identification relied on high-throughput screening of large chemical libraries against biological targets [28]. Although this strategy proved effective, its high cost and limited scalability have prompted the adoption of computational alternatives. Virtual screening has since become a cornerstone of computer-aided drug discovery, broadly categorized into structure-based and ligand-based approaches. Both strategies rely fundamentally on chemical databases and virtual libraries, which provide the structural and physicochemical data required to identify and prioritize bioactive compounds [29].

Initial applications of virtual screening demonstrated its capacity to identify ligands and predict receptor-bound conformations with notable accuracy [28]. Since then, the rapid expansion of compound collections has transformed the field: commercial vendors now provide “make-on-demand” services and virtual catalogs of up to 20 million readily synthesizable compounds [30,31,32]. To support these strategies, chemical libraries are generally classified into three groups: general, natural product-based, and specialized resources (Figure 1). This classification provides a conceptual framework for understanding how different resources contribute to complementary aspects of drug discovery pipelines.

A wide array of open-access repositories is available, covering general, natural, and specialized compound collections. Here we highlight just a few representative examples. Among general databases, PubChem https://pubchem.ncbi.nlm.nih.gov (accessed on 17 September 2025) provides access to over 200 million compounds with detailed structural and physicochemical information [33], while ChEMBL https://www.ebi.ac.uk/chembl/ (accessed on 19 November 2025) adds pharmacological depth with around 2.5 million annotated bioactive molecules [34,35]. The ZINC20/22 platforms https://cartblanche.docking.org (accessed on 19 September 2025), widely used in virtual screening, host billions of purchasable compounds [36,37]. DrugBank, although smaller (~500,000 compounds), provides curated drug–target information of high value for pharmaceutical research [38]. Complementing these, natural product libraries such as COCONUT 2.0 https://coconut.naturalproducts.net (accessed on 21 September 2025) integrate structural and provenance metadata [39,40].
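As a rough illustration of programmatic access to such repositories, the sketch below builds a PubChem PUG REST request for a compound name. The helper names `pubchem_property_url` and `fetch_properties` are hypothetical, the requested properties are only examples, and the network call is defined but not executed so that the block runs offline.

```python
import json
import urllib.parse
import urllib.request

# Base address of the PubChem PUG REST service.
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"


def pubchem_property_url(name, properties=("MolecularFormula", "MolecularWeight", "InChIKey")):
    """Build a PUG REST URL requesting selected properties for a compound name."""
    quoted = urllib.parse.quote(name, safe="")  # encode spaces and slashes in names
    return f"{PUG_REST}/compound/name/{quoted}/property/{','.join(properties)}/JSON"


def fetch_properties(name):
    """Download and decode the JSON response (requires network access)."""
    with urllib.request.urlopen(pubchem_property_url(name), timeout=30) as response:
        return json.load(response)


url = pubchem_property_url("aspirin")
print(url)
```

For anything beyond a handful of compounds, bulk downloads or batched requests that respect the provider's usage policy are usually the better choice.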

Taken together, chemical databases and virtual libraries provide the essential foundation for computational drug discovery. However, their utility depends on data quality and the need for rigorous curation (e.g., standardization, deduplication, and bioactivity cleaning) prior to processing. The next step in the workflow, therefore, focuses on chemical structure generation and the use of linear input formats, which transform molecular representations into standardized, machine-readable data suitable for computational analysis.
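The curation step mentioned above can be sketched in a few lines. This is a minimal example under stated assumptions: each record is a dictionary that already carries an `inchikey` field, the function name `curate` and the record layout are hypothetical, and real pipelines would also standardize the structures themselves (salts, tautomers, charges) with a cheminformatics toolkit.

```python
def curate(records):
    """Normalize identifiers, drop incomplete records, and remove duplicates.

    The first record seen for each InChIKey is kept.
    """
    seen = set()
    curated = []
    for record in records:
        key = (record.get("inchikey") or "").strip().upper()
        if not key:
            continue  # no structure identifier: cannot be matched or deduplicated
        if key in seen:
            continue  # same compound already collected from another source
        seen.add(key)
        curated.append({**record, "inchikey": key})
    return curated


# Illustrative records merged from two hypothetical sources.
raw = [
    {"name": "aspirin", "inchikey": "BSYNRYMUTXBXSQ-UHFFFAOYSA-N"},
    {"name": "acetylsalicylic acid", "inchikey": " bsynrymutxbxsq-uhfffaoysa-n "},
    {"name": "caffeine", "inchikey": "RYYVLZVUVIJVGH-UHFFFAOYSA-N"},
    {"name": "unknown extract", "inchikey": None},
]

clean = curate(raw)
print([r["name"] for r in clean])
```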

3. Chemical Structure Generation and Linear Input Formats

Before virtual screening can be performed, chemical structures must be expressed in machine-readable formats. As shown in Figure 2, these include two-dimensional depictions, three-dimensional encodings, linear notations, and connection table-based files. Two-dimensional and three-dimensional formats capture connectivity and spatial coordinates, providing the basis for computational analyses.

Linear notations, among which SMILES is the most widely adopted, encode molecular graphs as alphanumeric strings that are compact, efficient, and suitable for cheminformatics, ADMET prediction, and machine learning [41,42,43]. SMILES provides a concise representation of molecular structures using ASCII characters [44,45,46,47]. Because atom ordering during graph traversal may vary, a single compound can be represented by multiple valid strings. These variants, known as enumerated or randomized SMILES, are generated by selecting different starting atoms, expanding the representation space available for predictive modeling and machine learning [41,48,49,50].
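To make the idea of multiple valid strings concrete, the sketch below tokenizes SMILES with a simplified regular expression and counts heavy atoms. It is an illustrative assumption-laden helper, not a full SMILES parser: it covers bracket atoms, the organic subset, and common bond and ring symbols, and it does not validate chemistry. The function names are hypothetical.

```python
import re
from collections import Counter

# Simplified tokenizer: bracket atoms, two-letter halogens, organic-subset atoms
# (aliphatic and aromatic), bond/branch symbols, and ring-closure labels.
SMILES_TOKEN = re.compile(
    r"\[[^\]]+\]|Br|Cl|[BCNOSPFI]|[bcnosp]|[()=#\-+\\/:.~@?>*$]|%[0-9]{2}|[0-9]"
)
# Element symbol inside a bracket atom, after an optional isotope number.
BRACKET_SYMBOL = re.compile(r"\[\d*([A-Za-z][a-z]?)")


def tokenize(smiles):
    """Split a SMILES string into tokens; fail if any character is not recognized."""
    tokens = SMILES_TOKEN.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError(f"unrecognized characters in SMILES: {smiles!r}")
    return tokens


def heavy_atom_counts(smiles):
    """Count non-hydrogen atoms per element, treating aromatic atoms like aliphatic ones."""
    counts = Counter()
    for token in tokenize(smiles):
        if token.startswith("["):
            match = BRACKET_SYMBOL.match(token)
            if match is None:
                continue  # e.g. wildcard atoms such as [*]
            symbol = match.group(1)
        elif token[0].isalpha():
            symbol = token
        else:
            continue  # bonds, branches, ring closures
        symbol = symbol.capitalize()
        if symbol != "H":
            counts[symbol] += 1
    return counts


# Three valid SMILES strings for ethanol, written from different starting atoms.
variants = ["CCO", "OCC", "C(O)C"]
print([dict(heavy_atom_counts(s)) for s in variants])
```

All three variants yield the same composition, which is why a canonicalization step is normally applied before comparing or deduplicating strings.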

SMILES encodes chemical structures according to fundamental rules that ensure accuracy and prevent misinterpretation [51,52]. To support their use, several platforms provide tools for generating SMILES strings from molecular drawings or identifiers. Marvin JS Editor 17.21.0 by ChemAxon (Budapest, Hungary) offers comprehensive capabilities under a commercial license, while free web-based alternatives such as the Chemical Identifier Resolver by the NCI/CADD Group [53] provide practical options. Furthermore, the open-source toolkit RDKit is foundational, enabling high-throughput programmatic processing of structures (e.g., cleaning, canonicalization, and representation generation), which is essential in machine learning pipelines. Collectively, these tools integrate structure representation into computational workflows, connecting chemical data to in silico analyses.

4. ADMET Property Evaluation

Many drug development failures stem from poor pharmacokinetics or toxicity, often detected only in costly late-stage trials. To reduce these setbacks, evaluation of absorption, distribution, metabolism, excretion and toxicity (ADMET) has become crucial to rational drug design [54]. Computational methods provide a practical way to integrate ADMET evaluation into early discovery workflows, enabling rapid prediction of pharmacokinetic and toxicity profiles and complementing docking and QSAR analyses [55,56].

Beyond estimating ADMET profiles, these models also provide insights into bioavailability and potential safety liabilities [57]. Importantly, protein-based simulations allow the study of compound interactions with critical ADMET-related proteins such as cytochrome P450 isoenzymes, the hERG potassium channel, and P-glycoprotein [58,59,60]. By combining predictive and structural data, these methods enhance rational compound selection and optimization.

Current ADMET prediction strategies can be broadly divided into two complementary approaches [54]: structure-based methods using docking/dynamics [61,62], and ligand-based methods using QSAR models derived from chemical and biological datasets [63,64].

Building on these approaches, numerous open-access platforms have been developed to predict ADMET properties and assist in compound prioritization. Among the most frequently used are SwissADME https://www.swissadme.ch (accessed on 25 September 2025), which provides predictions of physicochemical parameters, pharmacokinetics, and drug-likeness, making it a valuable first step in compound prioritization [65]; pkCSM https://biosig.lab.uq.edu.au/pkcsm/ (accessed on 25 September 2025), which uses graph-based signatures to encode molecular structure to train predictive models [66]; and admetSAR https://lmmd.ecust.edu.cn/admetsar2 (accessed on 27 September 2025), one of the largest curated resources for large-scale virtual screening [67,68]. Additional resources include ADMETlab (versions 2.0 and 3.0) https://admet-mesh.scbdd.com (accessed on 27 September 2025) [69,70] and the machine learning-based platform ADMET-AI https://admet.ai.greenstonebio.com (accessed on 28 September 2025) [71]. Furthermore, specialized platforms have been designed for specific ADMET parameters, such as MetaPred for cytochrome P450 isoform prediction (https://webs.iiitd.edu.in/oscadd/metapred/submit.php, accessed on 28 September 2025) [72] and PASS for biological activity prediction https://way2drug.com/dr/ (accessed on 28 September 2025) [73].

In addition to pharmacokinetic and toxicity prediction, many of the platforms discussed also incorporate drug-likeness evaluation, providing an early filter for prioritizing viable candidates [74]. Among the most widely applied filters are Lipinski’s Rule of Five [75], Ghose [76], Veber [77], Muegge [78,79], and Egan [80,81].

However, it is crucial to recognize that these rules are primarily count-based heuristic filters that fail to capture molecular topology or specific target interactivity. Their use in early filtering must be complemented with QSAR analysis and structural evaluations, particularly when screening unconventional candidates like natural products or molecules designed for specialized barriers (e.g., the blood–brain barrier).
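The count-based nature of these filters is easy to see in code. The sketch below applies the commonly cited Lipinski and Veber thresholds to precomputed descriptors; the `Descriptors` class and function names are hypothetical, the aspirin values are approximate, and the second compound is invented purely for illustration. Descriptor calculation itself is assumed to happen elsewhere (for example in RDKit or SwissADME).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Descriptors:
    mol_weight: float        # molecular weight, Da
    logp: float              # octanol-water partition coefficient
    h_bond_donors: int
    h_bond_acceptors: int
    rotatable_bonds: int
    tpsa: float              # topological polar surface area, square angstroms


def lipinski_violations(d):
    """Return the Rule of Five criteria that the compound fails."""
    rules = {
        "MW <= 500": d.mol_weight <= 500,
        "logP <= 5": d.logp <= 5,
        "H-bond donors <= 5": d.h_bond_donors <= 5,
        "H-bond acceptors <= 10": d.h_bond_acceptors <= 10,
    }
    return [name for name, passed in rules.items() if not passed]


def passes_veber(d):
    """Veber criteria: at most 10 rotatable bonds and TPSA of at most 140."""
    return d.rotatable_bonds <= 10 and d.tpsa <= 140


aspirin = Descriptors(180.16, 1.2, 1, 4, 3, 63.6)          # approximate values
macrocycle = Descriptors(720.0, 6.3, 6, 12, 15, 190.0)     # hypothetical compound

for name, d in [("aspirin", aspirin), ("macrocycle", macrocycle)]:
    print(name, lipinski_violations(d), passes_veber(d))
```

Because the functions only compare counts against thresholds, two compounds with very different topology can receive identical verdicts, which is the limitation noted above.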

Complementing ADMET evaluations, specialized web platforms focus on toxicity prediction. Among freely accessible tools, ProTox 3.0 https://tox.charite.de/protox3/index.php (accessed on 29 September 2025) integrates molecular similarity, fragment analysis, and machine learning [82], while Toxtree applies decision tree algorithms https://apps.ideaconsult.net/data/ui/toxtree (accessed on 1 October 2025) [83]. Beyond open-access solutions, commercial platforms such as eTox-Drug Toxicity Prediction, available through the Neurosnap platform (Neurosnap Inc. Computational Biology Platform for Research. Wilmington, DE, USA, 2022) https://neurosnap.ai/ (accessed on 1 October 2025), and MultiCASE https://multicase.com (accessed on 1 October 2025) [84] offer in silico tools to support comprehensive evaluation of compound toxicity and facilitate risk assessment.

Taken together, these platforms, both free and commercial, provide complementary resources that substantially enhance the predictive assessment of compound toxicity.

5. Target Prediction and Receptor Selection

An essential step in compound development is the identification of molecular targets with which a compound may interact [85]. In silico strategies addressing this challenge include comparative genomics [86,87], network-based approaches [87,88,89], and target fishing [90]. Target fishing can be executed through target-centric methods, such as machine learning and QSAR modeling [90,91], or ligand-centric methods, based on molecular similarity assessment [90,92].

Building on these strategies, comparative genomics integrates large-scale genomic data with computational tools to identify essential genes whose inhibition may compromise cellular viability, providing insights for both infectious diseases and complex conditions like cancer [86,87]. Meanwhile, network-based approaches enable the systematic exploration of molecular interactions within biological systems, supporting biomarker discovery, disease diagnosis, and target identification [87,88,89]. Complementarily, target fishing frameworks combine both paradigms—target- and ligand-centric—often enhanced by machine learning algorithms such as Random Forest or Naive Bayes, trained on extensive bioactivity datasets to estimate compound–target interaction probabilities [90,91].

Target prediction is made broadly accessible through web platforms that typically process SMILES input structures, which are converted into molecular fingerprints for prediction [93]. Fingerprints are central to cheminformatics, enabling virtual screening and exploration of chemical space, with substructure-based descriptors often providing the most accurate results. A widely used example is the MACCS fingerprint, where each bit denotes the presence or absence of predefined substructural features [94].

These types of fingerprints are widely employed across web-based prediction platforms. For instance, TargetNet http://targetnet.scbdd.com (accessed on 2 October 2025) integrates seven fingerprinting schemes to evaluate binding potential by generating high-quality QSAR models for each target [95]. SuperPred https://prediction.charite.de/subpages/target_prediction.php (accessed on 2 October 2025) combines target prediction with the ATC classification system [96]. Ligand-based strategies also underpin widely used tools such as SwissTargetPrediction http://www.swisstargetprediction.ch (accessed on 2 October 2025), which predicts the most probable protein targets of bioactive molecules in humans [97], and the Similarity Ensemble Approach (SEA) https://sea.bkslab.org (accessed on 2 October 2025), which infers protein function from the chemical similarity of ligands [98].
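The ligand-centric principle behind such tools can be illustrated with the Tanimoto coefficient on binary fingerprints. The sketch below is a deliberately simplified assumption, not the algorithm of any platform named above: fingerprints are represented as sets of "on" bit positions, the bit values and target labels are invented, and each target is scored by its most similar known ligand.

```python
def tanimoto(bits_a, bits_b):
    """Tanimoto coefficient: shared on-bits divided by all on-bits in either fingerprint."""
    a, b = set(bits_a), set(bits_b)
    union = a | b
    if not union:
        return 0.0  # two empty fingerprints carry no information
    return len(a & b) / len(union)


def rank_targets(query_bits, ligands_by_target):
    """Score each target by its most similar known ligand and sort from high to low."""
    scores = {
        target: max(tanimoto(query_bits, fp) for fp in fingerprints)
        for target, fingerprints in ligands_by_target.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical on-bit positions for a query compound and annotated reference ligands.
query = {3, 17, 42, 88, 120}
known_ligands = {
    "TARGET_A": [{3, 17, 42, 88, 121}, {5, 9}],
    "TARGET_B": [{42, 150, 151}],
}

for target, score in rank_targets(query, known_ligands):
    print(f"{target}: {score:.2f}")
```

Production tools add statistical corrections and much larger reference sets, but the core comparison is of this form.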

6. Molecular Docking: Evaluating Ligand–Protein Binding Affinity

Molecular docking has emerged as a central tool in structure-guided drug design [99]. It is applied to predict ligand conformation and compatibility within active sites. The accuracy of these predictions depends on the docking algorithm, which incorporates spatial orientation, ligand flexibility, and scoring functions [100]. According to Priya et al. (2014), molecular docking can be grouped into two main categories: protein–protein docking, which predicts the interface between interacting macromolecules, often without requiring prior experimental information, and protein–ligand docking, the most widely used approach, which models how a small molecule fits into the active site of a protein, where it may act as either an activator or an inhibitor of receptor activity [101].

Conceptually, docking involves two steps: first, the algorithm generates alternative ligand poses; second, a scoring function evaluates and ranks these poses according to noncovalent interactions, providing an estimate of binding affinity [102,103]. Once these principles are established, the general workflow of molecular docking becomes clearer (Figure 3). It involves (i) ligand and receptor preparation, (ii) active site identification, (iii) docking simulation to generate multiple poses, and (iv) ranking and selection of the most favorable binding conformation [104,105].

The reliability of docking results is intrinsically limited by the scoring functions used. These functions are typically empirical and simplify key thermodynamic terms, making them generally more effective at predicting the correct pose (orientation) than at predicting the absolute affinity (Ki or ΔG values).

Major limitations include their poor handling of solvation effects (the energetic cost of displacing water molecules) and the conformational entropy of the system. Therefore, docking results must be complemented by more rigorous computational methods, such as molecular dynamics (MD) and MM/GBSA or MM/PBSA calculations, for a more accurate estimation of binding affinity and complex stability.
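To relate the two affinity measures mentioned above, the following sketch converts between an inhibition constant and a binding free energy using the standard thermodynamic relation ΔG = RT ln(Ki), with Ki expressed in mol/L relative to a 1 M standard state. This relation is textbook thermodynamics rather than something taken from the reviewed tools, and the function names are hypothetical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def delta_g_from_ki(ki_molar, temperature_k=298.15):
    """Binding free energy (kcal/mol) from an inhibition constant given in mol/L."""
    if ki_molar <= 0:
        raise ValueError("Ki must be positive")
    return R_KCAL * temperature_k * math.log(ki_molar)


def ki_from_delta_g(delta_g_kcal, temperature_k=298.15):
    """Inhibition constant (mol/L) from a binding free energy in kcal/mol."""
    return math.exp(delta_g_kcal / (R_KCAL * temperature_k))


for ki in (1e-6, 1e-9):
    print(f"Ki = {ki:.0e} M -> dG = {delta_g_from_ki(ki):.2f} kcal/mol")
```

At 298 K, a tenfold change in Ki corresponds to roughly 1.36 kcal/mol, so a scoring error of only one or two kcal/mol already shifts the implied affinity by one order of magnitude or more.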

Today, researchers can choose from a wide array of docking tools, each tailored to specific applications and presenting distinct strengths and limitations. Although many powerful options are available for structure-based approaches, no single program is universally optimal for all molecular systems. For this reason, numerous molecular docking programs have been developed, ranging from commercial packages to freely accessible tools; representative programs that exemplify this methodological diversity and are relevant to different computational applications are shown in Table 1 and Table 2. Despite methodological variations, the overall goal remains the same: to generate plausible ligand conformations and rank them according to predicted affinity.

To support the appropriate selection of docking software, performance should be evaluated using clear and reproducible criteria, particularly docking accuracy, computational efficiency, and screening enrichment. Docking accuracy is commonly assessed based on the rank-one solution, which reflects whether the top-ranked pose accurately reproduces the experimental binding mode. Using this criterion, comparative benchmarks have reported acceptable docking accuracies of approximately 0.47 for AutoDock, 0.31 for DOCK, 0.35 for FlexX, and 0.52 for GOLD, indicating that widely used tools can differ substantially in pose prediction performance. In terms of computational efficiency, FlexX has been consistently reported as one of the fastest approaches, whereas AutoDock typically exhibits longer execution times, a factor that becomes particularly relevant in large-scale virtual screening campaigns. Moreover, screening studies using DOCK and FlexX have demonstrated notable differences in ligand enrichment, emphasizing that the recovery of potentially active compounds within the top-ranked fraction of a library depends strongly on the selected platform [127].
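Screening enrichment is often summarized with an enrichment factor, which compares the hit rate in the top-ranked fraction of a library with the hit rate of the whole library. The sketch below uses this common definition as an assumption (the cited benchmark may have used a different metric), and the ranked list is synthetic.

```python
def enrichment_factor(ranked_labels, top_fraction):
    """Enrichment factor for a score-ranked library.

    ranked_labels: 1 for active and 0 for inactive, ordered from best to worst score.
    top_fraction: fraction of the library inspected, e.g. 0.1 for the top 10%.
    """
    n_total = len(ranked_labels)
    n_actives = sum(ranked_labels)
    if n_total == 0 or n_actives == 0:
        raise ValueError("need a non-empty library with at least one active")
    n_top = max(1, round(n_total * top_fraction))
    hits_top = sum(ranked_labels[:n_top])
    return (hits_top / n_top) / (n_actives / n_total)


# Synthetic library: 100 compounds, 10 actives, 4 of them ranked in the top 10.
ranked = [1, 1, 0, 1, 0, 0, 1, 0, 0, 0] + [1] * 6 + [0] * 84

print(round(enrichment_factor(ranked, 0.10), 2))
```

A value of 1 means the ranking is no better than random selection; values above 1 indicate that actives are concentrated near the top of the list.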

Additional comparative studies using diverse protein–ligand datasets further highlight variability in robustness and pose prediction accuracy across docking platforms. In a study involving 195 diverse protein–ligand complexes, Glide (v4.5), GOLD (v3.2), and LigandFit (v2.3) were assessed for their ability to reproduce crystallographic binding orientations. GOLD successfully processed the complete dataset, whereas Glide and LigandFit failed to process 25 and 8 complexes, respectively, indicating differences in robustness when handling structurally diverse systems. Regarding docking accuracy, approximately 40% of the docking solutions generated by these programs achieved an RMSD below 1.0 Å, while Glide and GOLD exhibited higher success rates of approximately 60% on this highly diverse test set [128].
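The RMSD criterion used in such benchmarks can be sketched as follows. This minimal version assumes that both coordinate sets list the same heavy atoms in the same order and share the receptor frame, so no superposition is applied; it also ignores ligand symmetry, which dedicated tools correct for. The coordinates are invented for illustration.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation (in the input units) between matched atom positions."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Hypothetical crystallographic pose and a docked pose shifted by 1 angstrom along x.
crystal = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
docked = [(x + 1.0, y, z) for x, y, z in crystal]

print(rmsd(crystal, docked))
```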

Recent comparative evaluations of multiple docking programs reported success rates of approximately 40–60% for top-scored poses and 60–80% for best poses, with RMSD values generally below 2 Å relative to native conformations. Among academic tools, AutoDock Vina (49.0%), AutoDock (PSO) (47.3%), UCSF DOCK (44.0%), and AutoDock (LGA) (37.4%) showed variable performance in predicting top-ranked poses. In parallel, commercial platforms such as GOLD (59.8%) and Glide, operating in Extra Precision (57.8%) and Standard Precision (53.8%) modes, demonstrated comparable or slightly higher success rates, while LigandFit (46.1%) showed moderate performance. Overall, the averaged success rates of commercial and academic docking programs were similar (54.0% vs. 47.4% for top-scored poses and 67.8% vs. 68.4% for best poses), indicating that both categories of algorithms are capable of adequately sampling conformational space and generating reliable docking solutions across diverse protein–ligand complexes [129].
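The distinction between top-scored and best-pose success rates can be made explicit with a short calculation. In this sketch each complex is represented by the RMSD values of its poses, ordered from best to worst docking score; the 2 Å cutoff follows the threshold mentioned above, while the data and the function name are hypothetical.

```python
def success_rates(pose_rmsds_per_complex, cutoff=2.0):
    """Return (top-scored success rate, best-pose success rate).

    pose_rmsds_per_complex: one list per complex with the RMSD of each pose to the
    native conformation, ordered by docking score (best-scored pose first).
    """
    n = len(pose_rmsds_per_complex)
    if n == 0:
        raise ValueError("no complexes supplied")
    top = sum(1 for poses in pose_rmsds_per_complex if poses[0] < cutoff)
    best = sum(1 for poses in pose_rmsds_per_complex if min(poses) < cutoff)
    return top / n, best / n


# Hypothetical results for five complexes, three poses each.
results = [
    [0.8, 3.1, 4.0],
    [2.9, 1.4, 5.2],
    [6.3, 4.8, 7.1],
    [1.7, 1.9, 2.5],
    [3.5, 3.9, 1.1],
]

top_rate, best_rate = success_rates(results)
print(f"top-scored: {top_rate:.0%}, best pose: {best_rate:.0%}")
```

The gap between the two numbers reflects cases where a near-native pose was sampled but not ranked first, which points to the scoring function rather than the search algorithm as the limiting factor.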

In addition to traditional docking programs, a growing number of free web-based servers now provide accessible platforms for molecular docking (Table 2). These resources integrate streamlined workflows with user-friendly interfaces, enabling efficient docking simulations without the need for local installations or advanced computational infrastructure. By lowering technical barriers, web servers have become valuable tools for both specialized research and educational purposes, facilitating rapid prototyping, virtual screening, and exploratory studies in structure-based drug discovery.

Table 2. Examples of online web servers for molecular docking.

Web Server	Tools/Features	Suggested Use Cases	URL	Cite
PatchDock	Geometry-based docking; detects shape complementarity with minimal clashes and large interface areas	Protein–protein, protein–ligand, and protein–DNA docking	https://bioinfo3d.cs.tau.ac.il/PatchDock/ (accessed on 5 October 2025)	[130]
ZDOCK	Protein–protein docking using 3D FFT; statistical potentials; improved speed (>8×) and reduced memory	Large scale protein–protein docking, flexible molecule docking	https://zdock.wenglab.org (accessed on 5 October 2025)	[131]
CB-DOCK2	Blind protein–ligand docking; cavity detection + AutoDock Vina + template guidance (FitDock)	Binding site prediction and docking for homologous proteins	https://cadd.labshare.cn/cb-dock2/index.php (accessed on 5 October 2025)	[111,132]
SwissDock	Docking with AutoDock Vina (fast) and Attracting Cavities (accurate); flexible input formats; web access	Small-molecule docking, virtual screening, quick tests, covalent docking	https://www.swissdock.ch (accessed on 7 October 2025)	[133]
HADDOCK	Data-driven docking using experimental or biophysical restraints (AIRs)	Protein–protein docking guided by NMR or mutagenesis data	https://rascar.science.uu.nl/haddock2.4/ (accessed on 7 October 2025)	[134]
Webina 1	In-browser AutoDock Vina via WebAssembly; includes PDBQT Convert	Quick ligand–receptor docking; teaching and rapid tests	https://durrantlab.pitt.edu/webina/ (accessed on 7 October 2025)	[135]
ProteinsPlus	Tools for structure check (EDIA), hydrogen placement (Protoss), conformations (SIENA), interaction diagrams (PoseView), interface classification (HyPPI), pocket detection and druggability (DoGSiteScorer)	Preprocessing, binding site analysis, early-stage modeling	https://proteins.plus (accessed on 7 October 2025)	[136]
HPEPDOCK 2.0	Blind protein–peptide docking; hierarchical algorithm with MODPEP ensembles; global and local docking	Protein–peptide interaction modeling; global and local docking	http://huanglab.phys.hust.edu.cn/hpepdock/ (accessed on 11 October 2025)	[137]
HawkDock	Deep learning flexible docking (GeoDock); binding affinity (VD-MM/GBSA); mutation analysis	Protein–protein docking, affinity prediction, mutation impact studies	https://cadd.zju.edu.cn/hawkdock/ (accessed on 11 October 2025)	[138,139]
EDock	Blind docking with replica exchange Monte Carlo; integrates I-TASSER and COACH	Docking on low-resolution protein models; binding site prediction	https://zhanggroup.org/EDock/ (accessed on 11 October 2025)	[140]

In summary, molecular docking has consolidated its role as a central component of modern drug discovery workflows. Its successful application in both urgent contexts, such as COVID-19 [141,142], and long-term research fields like oncology highlights its versatility [143,144]. Nevertheless, docking outcomes are inherently dependent on scoring functions and structural quality, and must be complemented with experimental validation to ensure biological accuracy.

Despite the central role of molecular docking and other in silico approaches in modern drug discovery, multiple factors continue to limit their translational success. In addition to methodological constraints inherent to docking, such as simplified scoring functions and sensitivity to structural quality, broader challenges related to data sourcing, integration, and representation substantially contribute to the high attrition of CADD-derived candidates. Redundant and inconsistent datasets, data sparsity, class imbalance, and the limited availability of well-annotated negative samples undermine predictive reliability, while feature representation strategies often struggle to capture the biological complexity of drug–target interactions [145]. Furthermore, the limited interpretability of advanced machine learning models and the incomplete integration of pharmacokinetic, toxicological, and clinical data exacerbate the gap between in silico predictions and experimental or clinical outcomes. Collectively, these limitations underscore the need for integrative, high-quality datasets, improved predictive models, and complementary experimental validation to enhance the success of CADD-driven drug discovery pipelines.

7. Integrative Workflow Summary

This review proposes a computational drug discovery workflow centered on compound properties and target interactions. The workflow integrates the selection of candidate molecules, conversion of structures into linear and three-dimensional formats, ADMET evaluation, and chemical database mining, through to virtual library generation and molecular docking studies (Figure 4). This scheme synthesizes the key methodological steps discussed throughout the review, providing an integrative overview of the in silico pipeline for bioactive compound discovery and evaluation.

8. Conclusions and Future Perspectives

Computational methods have reshaped drug discovery by enabling coherent workflows that connect molecular representation, chemical databases, ADMET evaluation, target prediction, and molecular docking. These approaches streamline early-stage research, reduce costs and timelines, and minimize reliance on animal testing by filtering unsuitable candidates before experimental validation. The combination of open-access platforms, curated libraries, and predictive algorithms has made these strategies broadly accessible, offering reproducible pipelines that enhance efficiency and selectivity in drug design. Looking forward, the continued integration of emerging technologies will expand these capabilities. Generative artificial intelligence holds promise for the de novo design of novel compounds; multi-omics data can strengthen the biological context of predictions; and advances in quantum simulations may enable unprecedented accuracy in modeling molecular interactions. Together, these developments point to a future where computational pipelines not only complement but also guide experimental research, accelerating the translation of in silico predictions into clinically relevant therapies.

Author Contributions

Conceptualization, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V. and A.M.P.-P.; Methodology, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and J.I.D.-S.; Software, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V. and A.M.P.-P.; Validation, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and F.G.-G.; Formal Analysis, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V. and A.M.P.-P.; Investigation, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and J.I.D.-S.; Resources, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and F.G.-G.; Data Curation, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and J.I.D.-S.; Writing—Original Draft Preparation, J.M.G.-D., A.F.G.-R. and M.P.G.-A.; Writing—Review and Editing, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and F.G.-G.; Visualization, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V., A.M.P.-P. and J.I.D.-S.; Supervision, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V. and A.M.P.-P.; Project Administration, J.M.G.-D., A.F.G.-R., M.P.G.-A., M.M.-V. and A.M.P.-P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Majumder, S.; Panigrahi, G.K. Advancements in Contemporary Pharmacological Innovation: Mechanistic Insights and Emerging Trends in Drug Discovery and Development. Intell. Pharm. 2025, 3, 118–126. [Google Scholar] [CrossRef]
Kapetanovic, I.M. Computer-aided drug discovery and development (CADDD): In Silico-Chemico-Biological Approach. Chem. Biol. Interact. 2008, 171, 165–176. [Google Scholar] [CrossRef]
Ou-Yang, S.; Lu, J.; Kong, X.; Liang, Z.; Luo, C.; Jiang, H. Computational Drug Discovery. Acta Pharmacol. Sin. 2012, 33, 1131–1140. [Google Scholar] [CrossRef]
Shah, M.; Patel, M.; Shah, M.; Patel, M.; Prajapati, M. Computational Transformation in Drug Discovery: A Comprehensive Study on Molecular Docking and Quantitative Structure Activity Relationship (QSAR). Intell. Pharm. 2024, 2, 589–595. [Google Scholar] [CrossRef]
Brogi, S. Computational Approaches for Drug Discovery. Molecules 2019, 24, 3061. [Google Scholar] [CrossRef] [PubMed]
Baig, H.M.; Ahmad, K.; Roy, S.; Ashraf, J.M.; Adil, M.; Siddiqui, M.H.; Khan, S.; Kamal, M.A.; Provaznik, I.; Choi, I. Computer Aided Drug Design: Success and Limitations. Curr. Pharm. Des. 2016, 22, 572–581. [Google Scholar] [CrossRef] [PubMed]
Pârvu, L. QSAR—A piece of drug design. J. Cell. Mol. Med. 2003, 7, 333–335. [Google Scholar] [CrossRef]
Namasivayam, V.; Silbermann, K.; Wiese, M.; Pahnke, J.; Stefan, S.M. C@PA: Computer-Aided Pattern Analysis to Predict Multitarget ABC Transporter Inhibitors. J. Med. Chem. 2021, 64, 3350–3366. [Google Scholar] [CrossRef]
Radaeva, M.; Ban, F.; Zhang, F.; LeBlanc, N.; Lallous, N.; Rennie, P.S.; Gleave, M.E.; Cherkasov, A. Development of Novel Inhibitors Targeting the D-Box of the DNA Binding Domain of Androgen Receptor. Int. J. Mol. Sci. 2021, 22, 2493. [Google Scholar] [CrossRef]
Ton, A.T.; Foo, J.; Singh, K.; Lee, J.; Kalyta, A.; Morin, H.; Perez, C.; Ban, F.; Leblanc, E.; Lallous, N.; et al. Development of VPC-70619, a Small-Molecule N-Myc Inhibitor as a Potential Therapy for Neuroendocrine Prostate Cancer. Int. J. Mol. Sci. 2022, 23, 2588. [Google Scholar] [CrossRef]
Song, J.; Chen, D.; Pan, Y.; Shi, X.; Liu, Q.; Lu, X.; Xu, X.; Chen, G.; Cai, Y. Discovery of a Novel MyD88 Inhibitor M20 and its Protection Against Sepsis-Mediated Acute Lung Injury. Front. Pharmacol. 2021, 12, 775117. [Google Scholar] [CrossRef]
Hopkins, B.T.; Bame, E.; Bell, N.; Bohnert, T.; Bowden-Verhoek, J.K.; Bui, M.; Cancilla, M.T.; Conlon, P.; Cullen, P.; Erlanson, D.A.; et al. Utilizing structure based drug design and metabolic soft spot identification to optimize the in vitro potency and in vivo pharmacokinetic properties leading to the discovery of novel reversible Bruton's tyrosine kinase inhibitors. Bioorg. Med. Chem. 2021, 44, 116275. [Google Scholar] [CrossRef]
Xie, Z.S.; Zhou, Z.Y.; Sun, L.Q.; Yi, H.; Xue, S.T.; Li, Z.R. Structure-based virtual screening towards the discovery of novel FOXM1 inhibitors. Future Med. Chem. 2022, 14, 207–219. [Google Scholar] [CrossRef]
Zhou, D.; Yu, X.; Song, Y.; Zeng, H.; Zhang, H.; Chen, B.; Wang, Y.; Li, X.; He, Q.; Zhou, W. Screening of and mechanism underlying the action of serum and glucocorticoid-regulated kinase 3-targeted drugs against estrogen receptor-positive breast cancer. Eur. J. Pharmacol. 2022, 927, 174982. [Google Scholar] [CrossRef]
Tan, H.; Jensen, K.; Houang, E.; McRobb, F.M.; Bhat, S.; Svensson, M.; Bochevarov, A.; Day, T.; Dahlgren, M.K.; Bell, J.A.; et al. Discovery of a Novel Class of d-Amino Acid Oxidase Inhibitors Using the Schrödinger Computational Platform. J. Med. Chem. 2022, 65, 6775–6802. [Google Scholar] [CrossRef]
Sadybekov, A.A.; Sadybekov, A.V.; Liu, Y.; Iliopoulos-Tsoutsouvas, C.; Huang, X.P.; Pickett, J.; Houser, B.; Patel, N.; Tran, N.K.; Tong, F.; et al. Synthon-based ligand discovery in virtual libraries of over 11 billion compounds. Nature 2022, 601, 452–459. [Google Scholar] [CrossRef]
Kaplan, A.L.; Confair, D.N.; Kim, K.; Barros-Álvarez, X.; Rodriguez, R.M.; Yang, Y.; Kweon, O.S.; Che, T.; McCorvy, J.; Kamber, D.N.; et al. Bespoke library docking for 5-HT2A receptor agonists with antidepressant activity. Nature 2022, 610, 582–591. [Google Scholar] [CrossRef] [PubMed]
Beroza, P.; Crawford, J.J.; Ganichkin, O.; Gendelev, L.; Harris, S.F.; Klein, R.; Miu, A.; Steinbacher, S.; Kligler, F.M.; Lemmen, C. Chemical space docking enables large-scale structure-based virtual screening to discover ROCK1 kinase inhibitors. Nat. Commun. 2022, 13, 6447. [Google Scholar] [CrossRef] [PubMed]
Cortat, Y.; Nedyalkova, M.; Schindler, K.; Kadakia, P.; Demirci, G.; Sovari, S.N.; Crochet, A.; Salentinig, S.; Lattuada, M.; Steiner, O.M.; et al. Computer-Aided Drug Design and Synthesis of Rhenium Clotrimazole Antimicrobial Agents. Antibiotics 2023, 12, 619. [Google Scholar] [CrossRef]
Ferreira, G.M.; Kronenberger, T.; Maltarollo, V.G.; Poso, A.; de Moura Gatti, F.; Almeida, V.M.; Marana, S.R.; Lopes, C.D.; Tezuka, D.Y.; de Albuquerque, S.; et al. Trypanosoma cruzi Sirtuin 2 as Relevant Druggable Target: New Inhibitors Developed by Computer-Aided Drug Design. Pharmaceuticals 2023, 16, 428. [Google Scholar] [CrossRef] [PubMed]
Li, C.; Zhang, J.; Wu, Q.; Kumar, A.; Pan, G.; Kelvin, D.J. Nifuroxazide Activates the Parthanatos to Overcome TMPRSS2: ERG Fusion-Positive Prostate Cancer. Mol. Cancer Ther. 2023, 22, 306–316. [Google Scholar] [CrossRef]
Gurung, A.B.; Ali, M.A.; Lee, J.; Farah, M.A.; Al-Anazi, K.M. An Updated Review of Computer-Aided Drug Design and Its Application to COVID-19. BioMed Res. Int. 2021, 2021, 8853056. [Google Scholar] [CrossRef]
Batool, M.; Ahmad, B.; Choi, S. A Structure-Based Drug Discovery Paradigm. Int. J. Mol. Sci. 2019, 20, 2783. [Google Scholar] [CrossRef]
Ivanenkov, Y.A.; Savchuk, N.P.; Ekins, S.; Balakin, K.V. Computational Mapping Tools for Drug Discovery. Drug Discov. Today 2009, 14, 767–775. [Google Scholar] [CrossRef]
Prada-Gracia, D.; Huerta-Yépez, S.; Moreno-Vargas, L.M. Application of Computational Methods for Anticancer Drug Discovery, Design, and Optimization. Bol. Méd. Hosp. Infant. Méx. Engl. Ed. 2016, 73, 411–423. [Google Scholar] [CrossRef]
Romano, J.D.; Tatonetti, N.P. Informatics and Computational Methods in Natural Product Drug Discovery: A Review and Perspectives. Front. Genet. 2019, 10, 368. [Google Scholar] [CrossRef] [PubMed]
Grisoni, F. Chemical Language Models for de Novo Drug Design: Challenges and Opportunities. Curr. Opin. Struct. Biol. 2023, 79, 102527. [Google Scholar] [CrossRef]
Shoichet, B.K. Virtual Screening of Chemical Libraries. Nature 2004, 432, 862–865. [Google Scholar] [CrossRef] [PubMed]
Sabe, V.T.; Jhamba, L.A.; Maguire, G.E.M.; Govender, T.; Naicker, T.; Kruger, H.G. Current Trends in Computer Aided Drug Design and a Highlight of Drugs Discovered via Computational Techniques: A Review. Eur. J. Med. Chem. 2021, 224, 113705. [Google Scholar] [CrossRef] [PubMed]
Carlsson, J.; Luttens, A. Structure-Based Virtual Screening of Vast Chemical Space as a Starting Point for Drug Discovery. Curr. Opin. Struct. Biol. 2024, 87, 102829. [Google Scholar] [CrossRef]
Grygorenko, O.O.; Radchenko, D.S.; Dziuba, I.; Chuprina, A.; Gubina, K.E.; Moroz, Y.S. Generating Multibillion Chemical Space of Readily Accessible Screening Compounds. iScience 2020, 23, 101681. [Google Scholar] [CrossRef]
Lyu, J.; Wang, S.; Balius, T.E.; Singh, I.; Levit, A.; Moroz, Y.S.; O’Meara, M.J.; Che, T.; Algaa, E.; Tolmachova, K.; et al. Ultra-Large Library Docking for Discovering New Chemotypes. Nature 2019, 566, 224–229. [Google Scholar] [CrossRef] [PubMed]
Kim, S.; Chen, J.; Cheng, T.; Gindulyte, A.; He, J.; He, S.; Li, Q.; Shoemaker, B.A.; Thiessen, P.A.; Yu, B.; et al. PubChem 2025 Update. Nucleic Acids Res. 2025, 53, D1516–D1525. [Google Scholar] [CrossRef]
Davies, M.; Nowotka, M.; Papadatos, G.; Dedman, N.; Gaulton, A.; Atkinson, F.; Bellis, L.; Overington, J. ChEMBL Web Services: Streamlining Access to Drug Discovery Data and Utilities. Nucleic Acids Res. 2015, 43, 612–620. [Google Scholar] [CrossRef] [PubMed]
Zdrazil, B.; Felix, E.; Hunter, F.; Manners, E.J.; Blackshaw, J.; Corbett, S.; de Veij, M.; Ioannidis, H.; Lopez, D.M.; Mosquera, J.F.; et al. The ChEMBL Database in 2023: A Drug Discovery Platform Spanning Multiple Bioactivity Data Types and Time Periods. Nucleic Acids Res. 2024, 52, D1180–D1192. [Google Scholar] [CrossRef]
Irwin, J.J.; Tang, K.G.; Young, J.; Dandarchuluun, C.; Wong, B.R.; Khurelbaatar, M.; Moroz, Y.S.; Mayfield, J.; Sayle, R.A. ZINC20—A Free Ultralarge-Scale Chemical Database for Ligand Discovery. J. Chem. Inf. Model. 2020, 60, 6065–6073. [Google Scholar] [CrossRef]
Tingle, B.I.; Tang, K.G.; Castanon, M.; Gutierrez, J.J.; Khurelbaatar, M.; Dandarchuluun, C.; Moroz, Y.S.; Irwin, J.J. ZINC-22─A Free Multi-Billion-Scale Database of Tangible Compounds for Ligand Discovery. J. Chem. Inf. Model. 2023, 63, 1166–1176. [Google Scholar] [CrossRef]
Knox, C.; Wilson, M.; Klinger, C.M.; Franklin, M.; Oler, E.; Wilson, A.; Pon, A.; Cox, J.; Chin, N.E.L.; Strawbridge, S.A.; et al. DrugBank 6.0: The DrugBank Knowledgebase for 2024. Nucleic Acids Res. 2024, 52, D1265–D1275. [Google Scholar] [CrossRef]
Chandrasekhar, V.; Rajan, K.; Kanakam, S.R.S.; Sharma, N.; Weißenborn, V.; Schaub, J.; Steinbeck, C. COCONUT 2.0: A Comprehensive Overhaul and Curation of the Collection of Open Natural Products Database. Nucleic Acids Res. 2025, 53, D634–D643. [Google Scholar] [CrossRef] [PubMed]
Sorokina, M.; Merseburger, P.; Rajan, K.; Yirik, M.A.; Steinbeck, C. COCONUT Online: Collection of Open Natural Products Database. J. Cheminform. 2021, 13, 2. [Google Scholar] [CrossRef]
David, L.; Thakkar, A.; Mercado, R.; Engkvist, O. Molecular Representations in AI-Driven Drug Discovery: A Review and Practical Guide. J. Cheminform. 2020, 12, 56. [Google Scholar] [CrossRef]
Polanski, J.; Gasteiger, J. Computer Representation of Chemical Compounds. In Handbook of Computational Chemistry; Leszczynski, J., Kaczmarek-Kedziera, A., Puzyn, T.G., Papadopoulos, M., Reis, H.K., Shukla, M., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 1997–2039. ISBN 978-3-319-27282-5. [Google Scholar]
Walters, W.P. Virtual Chemical Libraries. J. Med. Chem. 2019, 62, 1116–1124. [Google Scholar] [CrossRef]
Han, H.; Shaker, B.; Lee, J.; Choi, S.; Yoon, S.; Singh, M.; Basith, S.; Cui, M.; An, J.; Kang, S.; et al. Employing Automated Machine Learning (AutoML) Methods to Facilitate the In Silico ADMET Properties Prediction. J. Chem. Inf. Model. 2025, 65, 3215–3225. [Google Scholar] [CrossRef] [PubMed]
Paul, A. Dextrosinistral Reading of SMILES Notation: Investigation into Origin of Non-Sense Code from String Manipulations. Digit. Chem. Eng. 2025, 15, 100222. [Google Scholar] [CrossRef]
Pinheiro, G.; Mucelini, J.; Soares, M.; Prati, R.; Da Silva, J.; Quiles, M. Machine Learning Prediction of Nine Molecular Properties Based on the SMILES Representation of the QM9 Quantum-Chemistry Dataset. J. Phys. Chem. A 2020, 124, 9854–9866. [Google Scholar] [CrossRef]
Saini, V. Machine Learning Prediction of Empirical Polarity Using SMILES Encoding of Organic Solvents. Mol. Divers. 2023, 27, 2331–2343. [Google Scholar] [CrossRef] [PubMed]
Bjerrum, E.J. SMILES Enumeration as Data Augmentation for Neural Network Modeling of Molecules. arXiv 2017. [Google Scholar] [CrossRef]
Bjerrum, E.J.; Sattarov, B. Improving Chemical Autoencoder Latent Space and Molecular De Novo Generation Diversity with Heteroencoders. Biomolecules 2018, 8, 131. [Google Scholar] [CrossRef]
Mswahili, M.E.; Jeong, Y.-S. Transformer-Based Models for Chemical SMILES Representation: A Comprehensive Literature Review. Heliyon 2024, 10, e39038. [Google Scholar] [CrossRef]
EPA Appendix F SMILES Notation Tutorial. Sustainable Futures P2 Framework Manual 2012 EPA-748-B12-001. 2012. Available online: https://www.epa.gov/sites/default/files/2015-05/documents/appendf.pdf (accessed on 18 September 2025).
O’Boyle, N.M. Towards a Universal SMILES Representation—A Standard Method to Generate Canonical SMILES Based on the InChI. J. Cheminform. 2012, 4, 22. [Google Scholar] [CrossRef]
Muresan, S.; Sitzmann, M.; Southan, C. Mapping Between Databases of Compounds and Protein Targets. In Bioinformatics and Drug Discovery, 2nd ed.; Methods in Molecular Biology; Humana Press: Totowa, NJ, USA, 2012; Volume 910, pp. 145–164. [Google Scholar] [CrossRef]
Wu, F.; Zhou, Y.; Li, L.; Shen, X.; Chen, G.; Wang, X.; Liang, X.; Tan, M.; Huang, Z. Computational Approaches in Preclinical Studies on Drug Discovery and Development. Front. Chem. 2020, 8, 726. [Google Scholar] [CrossRef]
Roney, M.; Mohd Aluwi, M.F.F. The Importance of In-Silico Studies in Drug Discovery. Intell. Pharm. 2024, 2, 578–579. [Google Scholar] [CrossRef]
Valerio, L.G. In Silico Methods. In Encyclopedia of Toxicology, 3rd ed.; Wexler, P., Ed.; Academic Press: Oxford, UK, 2014; pp. 1026–1029. ISBN 978-0-12-386455-0. [Google Scholar]
Jamrozik, E.; Śmieja, M.; Podlewska, S. ADMET-PrInt: Evaluation of ADMET Properties: Prediction and Interpretation. J. Chem. Inf. Model. 2024, 64, 1425–1432. [Google Scholar] [CrossRef]
Creanza, T.M.; Delre, P.; Ancona, N.; Lentini, G.; Saviano, M.; Mangiatordi, G.F. Structure-Based Prediction of hERG-Related Cardiotoxicity: A Benchmark Study. J. Chem. Inf. Model. 2021, 61, 4758–4770. [Google Scholar] [CrossRef]
Kato, H. Computational Prediction of Cytochrome P450 Inhibition and Induction. Drug Metab. Pharmacokinet. 2020, 35, 30–44. [Google Scholar] [CrossRef]
Vilar, S.; Sobarzo-Sánchez, E.; Uriarte, E. In Silico Prediction of P-Glycoprotein Binding: Insights from Molecular Docking Studies. Curr. Med. Chem. 2019, 26, 1746–1760. [Google Scholar] [CrossRef]
Miller, E.B.; Hwang, H.; Shelley, M.; Placzek, A.; Rodrigues, J.P.G.L.M.; Suto, R.K.; Wang, L.; Akinsanya, K.; Abel, R. Enabling Structure-Based Drug Discovery Utilizing Predicted Models. Cell 2024, 187, 521–525. [Google Scholar] [CrossRef] [PubMed]
Moroy, G.; Martiny, V.Y.; Vayer, P.; Villoutreix, B.O. Toward in Silico Structure-Based ADMET Prediction in Drug Discovery. Drug Discov. Today 2011, 17, 44–55. [Google Scholar] [CrossRef] [PubMed]
Peter, S.C.; Dhanjal, J.K.; Malik, V.; Radhakrishnan, N.; Jayakanthan, M.; Sundar, D. Quantitative Structure-Activity Relationship (QSAR): Modeling Approaches to Biological Applications. Encycl. Bioinform. Comput. Biol. 2019, 2, 661–676. [Google Scholar] [CrossRef]
Reddy, M.B.; Clewell, H.J.; Lave, T.; Andersen, M.E. Physiologically Based Pharmacokinetic Modeling: A Tool for Understanding ADMET Properties and Extrapolating to Human. In New Insights into Toxicity and Drug Testing; IntechOpen: London, UK, 2013. [Google Scholar]
Daina, A.; Michielin, O.; Zoete, V. SwissADME: A Free Web Tool to Evaluate Pharmacokinetics, Drug-Likeness and Medicinal Chemistry Friendliness of Small Molecules. Sci. Rep. 2017, 7, 42717. [Google Scholar] [CrossRef]
Pires, D.E.V.; Blundell, T.L.; Ascher, D.B. pkCSM: Predicting Small-Molecule Pharmacokinetic and Toxicity Properties Using Graph-Based Signatures. J. Med. Chem. 2015, 58, 4066–4072. [Google Scholar] [CrossRef]
Cheng, F.; Li, W.; Zhou, Y.; Shen, J.; Wu, Z.; Liu, G.; Lee, P.W.; Tang, Y. admetSAR: A Comprehensive Source and Free Tool for Assessment of Chemical ADMET Properties. J. Chem. Inf. Model. 2012, 52, 3099–3105. [Google Scholar] [CrossRef]
Yang, H.; Lou, C.; Sun, L.; Li, J.; Cai, Y.; Wang, Z.; Li, W.; Liu, G.; Tang, Y. admetSAR 2.0: Web-Service for Prediction and Optimization of Chemical ADMET Properties. Bioinformatics 2019, 35, 1067–1069. [Google Scholar] [CrossRef]
Fu, L.; Shi, S.; Yi, J.; Wang, N.; He, Y.; Wu, Z.; Peng, J.; Deng, Y.; Wang, W.; Wu, C.; et al. ADMETlab 3.0: An Updated Comprehensive Online ADMET Prediction Platform Enhanced with Broader Coverage, Improved Performance, API Functionality and Decision Support. Nucleic Acids Res. 2024, 52, W422–W431. [Google Scholar] [CrossRef]
Xiong, G.; Wu, Z.; Yi, J.; Fu, L.; Yang, Z.; Hsieh, C.; Yin, M.; Zeng, X.; Wu, C.; Lu, A.; et al. ADMETlab 2.0: An Integrated Online Platform for Accurate and Comprehensive Predictions of ADMET Properties. Nucleic Acids Res. 2021, 49, W5–W14. [Google Scholar] [CrossRef] [PubMed]
Swanson, K.; Walther, P.; Leitz, J.; Mukherjee, S.; Wu, J.C.; Shivnaraine, R.V.; Zou, J. ADMET-AI: A Machine Learning ADMET Platform for Evaluation of Large-Scale Chemical Libraries. Bioinformatics 2024, 40, btae416. [Google Scholar] [CrossRef] [PubMed]
Mishra, N.K.; Agarwal, S.; Raghava, G.P. Prediction of Cytochrome P450 Isoform Responsible for Metabolizing a Drug Molecule. BMC Pharmacol. 2010, 10, 8. [Google Scholar] [CrossRef]
Filimonov, D.A.; Lagunin, A.A.; Gloriozova, T.A.; Rudik, A.V.; Druzhilovskii, D.S.; Pogodin, P.V.; Poroikov, V.V. Prediction of the Biological Activity Spectra of Organic Compounds Using the Pass Online Web Resource. Chem. Heterocycl. Compd. 2014, 50, 444–457. [Google Scholar] [CrossRef]
Li, B.; Wang, Z.; Liu, Z.; Tao, Y.; Sha, C.; He, M.; Li, X. DrugMetric: Quantitative Drug-Likeness Scoring Based on Chemical Space Distance. Brief. Bioinform. 2024, 25, bbae321. [Google Scholar] [CrossRef]
Roskoski, R., Jr. Properties of FDA-approved small molecule protein kinase inhibitors. Pharmacol. Res. 2019, 144, 19–50. [Google Scholar] [CrossRef]
Ghose, A.K.; Viswanadhan, V.N.; Wendoloski, J.J. A Knowledge-Based Approach in Designing Combinatorial or Medicinal Chemistry Libraries for Drug Discovery. 1. A Qualitative and Quantitative Characterization of Known Drug Databases. J. Comb. Chem. 1999, 1, 55–68. [Google Scholar] [CrossRef] [PubMed]
Veber, D.F.; Johnson, S.R.; Cheng, H.-Y.; Smith, B.R.; Ward, K.W.; Kopple, K.D. Molecular Properties That Influence the Oral Bioavailability of Drug Candidates. J. Med. Chem. 2002, 45, 2615–2623. [Google Scholar] [CrossRef]
Muegge, I.; Heald, S.L.; Brittelli, D. Simple Selection Criteria for Drug-like Chemical Matter. J. Med. Chem. 2001, 44, 1841–1846. [Google Scholar] [CrossRef]
Sicak, Y. Synthesis, Predictions of Drug-Likeness, and Pharmacokinetic Properties of Some Chiral Thioureas as Potent Enzyme Inhibition Agents. Turk. J. Chem. 2021, 46, 665–676. [Google Scholar] [CrossRef]
Egan, W.J.; Merz, K.M., Jr.; Baldwin, J.J. Prediction of Drug Absorption Using Multivariate Statistics. J. Med. Chem. 2000, 43, 3867–3877. [Google Scholar] [CrossRef]
Kralj, S.; Jukič, M.; Bren, U. Molecular Filters in Medicinal Chemistry. Encyclopedia 2023, 3, 501–511. [Google Scholar] [CrossRef]
Banerjee, P.; Kemmler, E.; Dunkel, M.; Preissner, R. ProTox 3.0: A Webserver for the Prediction of Toxicity of Chemicals. Nucleic Acids Res. 2024, 52, W513–W520. [Google Scholar] [CrossRef]
Patlewicz, G.; Jeliazkova, N.; Safford, R.J.; Worth, A.P.; Aleksiev, B. An Evaluation of the Implementation of the Cramer Classification Scheme in the Toxtree Software. SAR QSAR Environ. Res. 2008, 19, 495–524. [Google Scholar] [CrossRef]
Chakravarti, S.; Saiakhov, R.D. MultiCASE Platform for In Silico Toxicology. In In Silico Methods for Predicting Drug Toxicity; Methods in Molecular Biology; Humana: New York, NY, USA, 2022; Volume 2425, pp. 497–518. ISBN 978-1-0716-1960-5. [Google Scholar]
Raheem, A.K.A.; Dhannoon, B.N. Comprehensive Review on Drug-Target Interaction Prediction—Latest Developments and Overview. Curr. Drug Discov. Technol. 2024, 21, 56–57. [Google Scholar] [CrossRef]
Rasul, H.O.; Ghafour, D.D.; Aziz, B.K.; Hassan, B.A.; Rashid, T.A.; Kivrak, A. Decoding Drug Discovery: Exploring A-to-Z In Silico Methods for Beginners. Appl. Biochem. Biotechnol. 2025, 197, 1453–1503. [Google Scholar] [CrossRef] [PubMed]
Zhang, X.; Wu, F.; Yang, N.; Zhan, X.; Liao, J.; Mai, S.; Huang, Z. In Silico Methods for Identification of Potential Therapeutic Targets. Interdiscip. Sci. Comput. Life Sci. 2022, 14, 285–310. [Google Scholar] [CrossRef]
Wu, Z.; Li, W.; Liu, G.; Tang, Y. Network-Based Methods for Prediction of Drug-Target Interactions. Front. Pharmacol. 2018, 9, 1134. [Google Scholar] [CrossRef]
Yu, Z.; Wu, Z.; Wang, Z.; Wang, Y.; Zhou, M.; Li, W.; Liu, G.; Tang, Y. Network-Based Methods and Their Applications in Drug Discovery. J. Chem. Inf. Model. 2024, 64, 57–75. [Google Scholar] [CrossRef]
He, T.; Caba, K.; Ballester, P. A Precise Comparison of Molecular Target Prediction Methods. Digit. Discov. 2025, 4, 2548–2558. [Google Scholar] [CrossRef]
Jimenes-Vargas, K.; Pazos, A.; Munteanu, C.R.; Perez-Castillo, Y.; Tejera, E. Prediction of Compound-Target Interaction Using Several Artificial Intelligence Algorithms and Comparison with a Consensus-Based Strategy. J. Cheminform. 2024, 16, 27. [Google Scholar] [CrossRef] [PubMed]
Yang, S.-Q.; Ye, Q.; Ding, J.; Yin, M.-Z.; Lu, A.-P.; Chen, X.; Hou, T.; Cao, D.-S. Current Advances in Ligand-Based Target Prediction. WIREs Comput. Mol. Sci. 2020, 11, e1504. [Google Scholar] [CrossRef]
Durant, J.L.; Leland, B.A.; Henry, D.R.; Nourse, J.G. Reoptimization of MDL Keys for Use in Drug Discovery. J. Chem. Inf. Comput. Sci. 2002, 42, 1273–1280. [Google Scholar] [CrossRef] [PubMed]
Li, Z.; Ren, P.; Yang, H.; Zheng, J.; Bai, F. TEFDTA: A Transformer Encoder and Fingerprint Representation Combined Prediction Method for Bonded and Non-Bonded Drug–Target Affinities. Bioinformatics 2024, 40, btad778. [Google Scholar] [CrossRef] [PubMed]
Yao, Z.-J.; Dong, J.; Che, Y.-J.; Zhu, M.-F.; Wen, M.; Wang, N.-N.; Wang, S.; Lu, A.-P.; Cao, D.-S. TargetNet: A Web Service for Predicting Potential Drug–Target Interaction Profiling via Multi-Target SAR Models. J. Comput. Aided Mol. Des. 2016, 30, 413–424. [Google Scholar] [CrossRef]
Gallo, K.; Goede, A.; Preissner, R.; Gohlke, B.-O. SuperPred 3.0: Drug Classification and Target Prediction—A Machine Learning Approach. Nucleic Acids Res. 2022, 50, W726–W731. [Google Scholar] [CrossRef]
Daina, A.; Michielin, O.; Zoete, V. SwissTargetPrediction: Updated Data and New Features for Efficient Prediction of Protein Targets of Small Molecules. Nucleic Acids Res. 2019, 47, W357–W364. [Google Scholar] [CrossRef]
Keiser, M.J.; Roth, B.L.; Armbruster, B.N.; Ernsberger, P.; Irwin, J.J.; Shoichet, B.K. Relating Protein Pharmacology by Ligand Chemistry. Nat. Biotechnol. 2007, 25, 197–206. [Google Scholar] [CrossRef]
Magalhães, R.P.; Vieira, T.F.; Melo, A.; Sousa, S.F. Chapter 15—In Silico Development of Quorum Sensing Inhibitors. In Recent Trends in Biofilm Science and Technology; Academic Press: Cambridge, MA, USA, 2020; pp. 329–357. ISBN 978-0-12-819497-3. [Google Scholar]
Cheung, L.K.Y.; Yada, R.Y. Predicting Global Diet-Disease Relationships at the Atomic Level: A COVID-19 Case Study. Curr. Opin. Food Sci. 2022, 44, 100804. [Google Scholar] [CrossRef]
Priya, C.G.; Chakraborty, C.; Narayan, V.; Kumar, T. Chapter Ten—Computational Approaches and Resources in Single Amino Acid Substitutions Analysis Toward Clinical Research. In Advances in Protein Chemistry and Structural Biology; Academic Press: Cambridge, MA, USA, 2014; Volume 94, pp. 365–423. ISBN 978-0-12-800168-4. [Google Scholar]
Gianti, E.; Carnevale, V. Chapter Two—Computational Approaches to Studying Voltage-Gated Ion Channel Modulation by General Anesthetics. In Methods in Enzymology; Academic Press: Cambridge, MA, USA, 2018; Volume 602, pp. 25–59. [Google Scholar]
Morgnanesi, D.; Heinrichs, E.J.; Mele, A.R.; Wilkinson, S.; Zhou, S.; Kulp, J.L., III. A Computational Chemistry Perspective on the Current Status and Future Direction of Hepatitis B Antiviral Drug Discovery. Antivir. Res. 2015, 123, 204–215. [Google Scholar] [CrossRef] [PubMed]
Khanna, V.; Ranganathan, S.; Petrovsky, N. Rational Structure-Based Drug Design. In Encyclopedia of Bioinformatics and Computational Biology; Academic Press: Cambridge, MA, USA, 2019; Volume 2, pp. 585–600. ISBN 978-0-12-811432-2. [Google Scholar]
Roy, K.; Kar, S.; Das, R.N. Other Related Techniques; Academic Press: Cambridge, MA, USA, 2015; pp. 357–425. ISBN 978-0-12-801505-6. [Google Scholar]
Morris, G.; Huey, R.; Lindstrom, W.; Sanner, M.F.; Belew, R.; Goodsell, D.S.; Olson, A.J. AutoDock4 and AutoDockTools4: Automated Docking with Selective Receptor Flexibility. J. Comput. Chem. 2009, 30, 2785–2791. [Google Scholar] [CrossRef]
BIOVIA, Dassault Systèmes. Discovery Studio, Web-Based Software; Dassault Systèmes: San Diego, CA, USA, 2023. [Google Scholar]
Allen, W.J.; Balius, T.E.; Mukherjee, S.; Brozell, S.R.; Moustakas, D.; Lang, T.; Case, D.A.; Kuntz, I.D.; Rizzo, R.C. DOCK 6: Impact of New Features and Current Docking Performance. J. Comput. Chem. 2015, 36, 1132–1156. [Google Scholar] [CrossRef]
Du, L.; Geng, C.; Zeng, Q.; Huang, T.; Tang, J.; Chu, Y.; Zhao, K. Dockey: A Modern Integrated Tool for Large-Scale Molecular Docking and Virtual Screening. Brief. Bioinform. 2023, 24, bbad047. [Google Scholar] [CrossRef]
Roberts, V.A.; Thompson, E.E.; Pique, M.E.; Perez, M.S.; Ten Eyck, L.F. DOT2: Macromolecular Docking with Improved Biophysical Models. J. Comput. Chem. 2013, 34, 1743–1758. [Google Scholar] [CrossRef] [PubMed]
Yang, X.; Liu, Y.; Gan, J.; Xiao, Z.-X.; Cao, Y. FitDock: Protein–Ligand Docking by Template Fitting. Brief. Bioinform. 2022, 23, bbac087. [Google Scholar] [CrossRef] [PubMed]
Rarey, M.; Kramer, B.; Lengauer, T.; Klebe, G. A Fast Flexible Docking Method Using an Incremental Construction Algorithm. J. Mol. Biol. 1996, 261, 470–489. [Google Scholar] [CrossRef]
Friesner, R.A.; Banks, J.L.; Murphy, R.B.; Halgren, T.A.; Klicic, J.; Mainz, D.T.; Repasky, M.; Knoll, E.H.; Shelley, M.; Perry, J.K.; et al. Glide: A New Approach for Rapid, Accurate Docking and Scoring. 1. Method and Assessment of Docking Accuracy. J. Med. Chem. 2004, 47, 1739–1749. [Google Scholar] [CrossRef]
Jones, G.; Willett, P.; Glen, R.C.; Leach, A.R.; Taylor, R. Development and Validation of a Genetic Algorithm for Flexible Docking. J. Mol. Biol. 1997, 267, 727–748. [Google Scholar] [CrossRef]
Singh, A.; Copeland, M.M.; Kundrotas, P.J.; Vakser, I.A. GRAMM Web Server for Protein Docking. In Computational Drug Discovery and Design; Methods in Molecular Biology; Humana: New York, NY, USA, 2024; Volume 2714, pp. 101–112. ISBN 978-1-0716-3441-7. [Google Scholar]
Hsu, K.-C.; Chen, Y.-F.; Lin, S.-R.; Yang, J.-M. iGEMDOCK: A Graphical Environment of Enhancing GEMDOCK Using Pharmacological Interactions and Post-Screening Analysis. BMC Bioinform. 2011, 12, S33. [Google Scholar] [CrossRef]
Liu, N.; Xu, Z. Using LeDock as a Docking Tool for Computational Drug Design. IOP Conf. Ser. Earth Environ. Sci. 2019, 218, 012143. [Google Scholar] [CrossRef]
Terwilliger, T.C.; Klei, H.; Adams, P.D.; Moriarty, N.W.; Cohn, J. Automated Ligand Fitting by Core-Fragment Fitting and Extension into Density. Acta Crystallogr. D Biol. Crystallogr. 2006, 62, 915–922. [Google Scholar] [CrossRef]
Hakkennes, M.L.A.; Buda, F.; Bonnet, S. MetalDock: An Open Access Docking Tool for Easy and Reproducible Docking of Metal Complexes. J. Chem. Inf. Model. 2023, 63, 7816–7825. [Google Scholar] [CrossRef] [PubMed]
Chemical Computing Group ULC. Molecular Operating Environment (MOE); Chemical Computing Group: Montreal, QC, Canada, 2025. [Google Scholar]
Thomsen, R.; Christensen, M.H. MolDock: A New Technique for High-Accuracy Molecular Docking. J. Med. Chem. 2006, 49, 3315–3321. [Google Scholar] [CrossRef] [PubMed]
Schnecke, V.; Swanson, C.A.; Getzoff, E.D.; Tainer, J.A.; Kuhn, L.A. Screening a Peptidyl Database for Potential Ligands to Proteins with Side-Chain Flexibility. Proteins Struct. Funct. Genet. 1998, 33, 74–87. [Google Scholar] [CrossRef]
Kabier, M.; Gambacorta, N.; Trisciuzzi, D.; Kumar, S.; Nicolotti, O.; Mathew, B. MzDOCK: A Free Ready-to-Use GUI-Based Pipeline for Molecular Docking Simulations. J. Comput. Chem. 2024, 45, 1980–1986. [Google Scholar] [CrossRef] [PubMed]
Murphy, R.B.; Philipp, D.M.; Friesner, R.A. A Mixed Quantum Mechanics/Molecular Mechanics (QM/MM) Method for Large-Scale Modeling of Chemistry in Protein Environments. J. Comput. Chem. 2000, 21, 1442–1457. [Google Scholar] [CrossRef]
Morley, S.D.; Afshar, M. Validation of an Empirical RNA-Ligand Scoring Function for Fast Flexible Docking Using RiboDock. J. Comput.-Aided Mol. Des. 2004, 18, 189–208. [Google Scholar] [CrossRef]
Verdonk, M.L.; Cole, J.C.; Taylor, R. SuperStar: A Knowledge-Based Approach for Identifying Interaction Sites in Proteins. J. Mol. Biol. 1999, 289, 1093–1108. [Google Scholar] [CrossRef]
Bursulaya, B.; Totrov, M.; Abagyan, R.; Brooks, C.L., III. Comparative study of several algorithms for flexible ligand docking. J. Comput.-Aided Mol. Des. 2003, 17, 755–763. [Google Scholar] [CrossRef]
Li, X.; Li, Y.; Cheng, T.; Liu, Z.; Wang, R. Evaluation of the performance of four molecular docking programs on a diverse set of protein complexes. J. Comput. Chem. 2010, 31, 2109–2125. [Google Scholar] [CrossRef]
Wang, Z.; Sun, H.; Yao, X.; Li, D.; Xu, L.; Li, Y.; Tian, S.; Hou, T. Comprehensive evaluation of ten docking programs on a diverse set of protein-ligand complexes: The prediction accuracy of sampling power and scoring power. Phys. Chem. Chem. Phys. 2016, 18, 12964–12975. [Google Scholar] [CrossRef]
Duhovny, D.; Wolfson, H.J. Efficient Unbound Docking of Rigid Molecules. Lect. Notes Comput. Sci. 2002, 2452, 185–200. [Google Scholar] [CrossRef]
Pierce, B.G.; Wiehe, K.; Hwang, H.; Kim, B.-H.; Vreven, T.; Weng, Z. ZDOCK Server: Interactive Docking Prediction of Protein-Protein Complexes and Symmetric Multimers. Bioinformatics 2014, 30, 1771–1773. [Google Scholar] [CrossRef] [PubMed]
Liu, Y.; Yang, X.; Gan, J.; Chen, S.; Xiao, Z.-X.; Cao, Y. CB-Dock2: Improved Protein–Ligand Blind Docking by Integrating Cavity Detection, Docking and Homologous Template Fitting. Nucleic Acids Res. 2022, 50, W159–W164. [Google Scholar] [CrossRef]
Grosdidier, A.; Zoete, V.; Michielin, O. SwissDock, a Protein-Small Molecule Docking Web Service Based on EADock DSS. Nucleic Acids Res. 2011, 39, 270–277. [Google Scholar] [CrossRef] [PubMed]
Honorato, R.V.; Trellet, M.E.; Jiménez-García, B.; Schaarschmidt, J.J.; Giulini, M.; Reyes, V.; Koukos, P.I.; Rodrigues, J.P.; Karaca, E.; van Zundert, G.C.; et al. The HADDOCK2.4 Web Server for Integrative Modeling of Biomolecular Complexes. Nat. Protoc. 2024, 19, 3219–3241. [Google Scholar] [CrossRef]
Kochnev, Y.; Hellemann, E.; Cassidy, K.C.; Durrant, J.D. Webina: An Open-Source Library and Web App That Runs AutoDock Vina Entirely in the Web Browser. Bioinformatics 2020, 36, 4513–4515. [Google Scholar] [CrossRef]
Schöning-Stierand, K.; Diedrich, K.; Ehrt, C.; Flachsenberg, F.; Graef, J.; Sieg, J.; Penner, P.; Poppinga, M.; Ungethüm, A.; Rarey, M. ProteinsPlus: A Comprehensive Collection of Web-Based Molecular Modeling Tools. Nucleic Acids Res. 2022, 50, 611–615. [Google Scholar] [CrossRef]
Zhou, P.; Jin, B.; Li, H.; Huang, S.-Y. HPEPDOCK: A Web Server for Blind Peptide-Protein Docking Based on a Hierarchical Algorithm. Nucleic Acids Res. 2018, 46, 443–450. [Google Scholar] [CrossRef]
Weng, G.; Wang, E.; Wang, Z.; Liu, H.; Zhu, F.; Li, D.; Hou, T. HawkDock: A Web Server to Predict and Analyze the Protein–Protein Complex Based on Computational Docking and MM/GBSA. Nucleic Acids Res. 2019, 47, 322–330. [Google Scholar] [CrossRef]
Zhang, X.; Jiang, L.; Weng, G.; Shen, C.; Zhang, O.; Liu, M.; Zhang, C.; Gu, S.; Wang, J.; Wang, X.; et al. HawkDock Version 2: An Updated Web Server to Predict and Analyze the Structures of Protein–Protein Complexes. Nucleic Acids Res. 2025, 53, 306–315. [Google Scholar] [CrossRef] [PubMed]
Zhang, W.; Bell, E.W.; Yin, M.; Zhang, Y. EDock: Blind Protein–Ligand Docking by Replica-exchange Monte Carlo Simulation. J. Cheminform. 2020, 12, 37. [Google Scholar] [CrossRef]
GERÇEK, Z.; CEYHAN, D.; ERÇAĞ, E. Synthesis and Molecular Docking Study of Novel COVID-19 Inhibitors. Turk. J. Chem. 2021, 45, 704–718. [Google Scholar] [CrossRef]
Haque, S.K.E.; Bhadra, S.; Kumar, N. Exploring Potential Therapeutic Candidates against COVID-19: A Molecular Docking Study. Discov. Mol. 2024, 1, 5. [Google Scholar] [CrossRef]
Phosrithong, N.; Ungwitayatorn, J. Molecular Docking Study on Anticancer Activity of Plant-Derived Natural Products. Med. Chem. Res. 2010, 19, 817–835. [Google Scholar] [CrossRef]
Sharma, V.; Chander, P.C.; Kumar, V. In Silico Molecular Docking Analysis of Natural Pyridoacridines as Anticancer Agents. Adv. Chem. 2016, 2016, 5409387. [Google Scholar] [CrossRef]
Ru, X.; Xu, L.; Han, W.; Zou, Q. In silico methods for drug-target interaction prediction. Cell Rep. Methods 2025, 5, 101184. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Classification of major chemical compound databases according to their scope and level of information detail. Created in BioRender. Garibaldi, A. (2025) https://BioRender.com/8r6ykd9 (accessed on 23 September 2025).

Figure 2. Representative chemical structure formats used in computational workflows: 2D, 3D, SMILES, and MOL/SDF. Created in BioRender. Garibaldi, A. (2025) https://BioRender.com/oltwvn1 (accessed on 23 September 2025).

Figure 3. General workflow for ligand–receptor docking simulations. Created in BioRender. Garibaldi, A. (2025) https://BioRender.com/uqrmiyc (accessed on 6 October 2025).

Figure 4. Integrative workflow for in silico chemical compound analysis. Created in BioRender. Garibaldi, A. (2025) https://BioRender.com/v6sfioa (accessed on 7 October 2025).

Table 1. Examples of molecular docking tools commonly used in drug design studies.

Software	License Type	Main Features	Recommended Use	Reference
AutoDock	Open-source	Predicts the binding orientation and affinity of small molecules to 3D receptors	Virtual screening and structure-based design; recommended for academic projects	[106]
Discovery Studio	Commercial and Academic	Docking with ligand conformational search (Monte Carlo) and LigandFit preparation	Comprehensive modeling, docking workflows, industry-level projects	[107]
DOCK	Commercial and Academic	Geometric matching to place ligands/fragments in binding sites; includes solvent effects	Academic research, fragment docking, solvent-inclusive studies	[108]
Dockey	Open-source	Graphical interface integrating preparation, parallel docking, interaction detection, and visualization	Comprehensive and user-friendly docking workflows	[109]
DOT	Open-source	Docking of macromolecule interactions; predicts binding via electrostatic and van der Waals energies	Protein–protein and large complex docking; biologically relevant models	[110]
FitDock	Academic	Improves protein–ligand docking by using similar co-crystal structures; enhances sampling and scoring	Structure-based drug design with accuracy improvement	[111]
FlexX	Commercial and Academic	Incremental construction: docks a base fragment, then grows the ligand fragment by fragment	Fast docking of fragment-based ligands in diverse binding pockets	[112]
Glide	Commercial	Ligand–receptor docking, supports virtual screening and binding mode prediction	High-precision docking, virtual screening in pharma research	[113]
GOLD	Commercial and Academic	Genetic algorithm for ligand binding predictions; flexible across diverse protein targets	Reliable docking in drug discovery, protein–ligand interaction studies	[114]
GRAMM	Commercial	Explores intermolecular energy landscape; predicts stable and transient protein–protein docking poses	Protein–protein interaction modeling and complex prediction	[115]
iGEMDOCK	Open-source	Identifies pharmacological interactions by virtual screening	Ligand screening and pharmacological interaction prediction	[116]
LeDock	Academic	Fast and accurate flexible docking of small molecules	High-throughput virtual screening and pose prediction	[117]
LigandFit	Commercial	Shape-based docking using cavity detection, Monte Carlo conformational search, and grid-based scoring	Protein–ligand docking, pose prediction, and high-throughput virtual screening	[118]
MetalDock	Open-source	Specialized in metal–organic docking; supports multiple metal types and automates workflow	Protein, DNA, and biomolecule docking with metal complexes	[119]
MOE	Commercial	Integrated modeling platform: docking, QSAR, pharmacophore design, homology modeling	Comprehensive drug discovery workflows, method development, academic evaluation	[120]
Molegro Virtual Docker	Commercial	Docking platform with novel optimization algorithm and user-friendly interface	Protein–ligand docking, virtual screening with high usability	[121]
MSU SLIDE	Commercial and Academic	Manages large binding-site templates with multi-stage indexing; ranks ligands by steric complementarity	Efficient virtual screening of large libraries with binding-site template matching	[122]
MzDOCK	Open-source	GUI-based docking tool; simplifies workflows and improves reproducibility	User-friendly option for beginners and teaching	[123]
QSite	Commercial	QM/MM multi-scale tool combining quantum and molecular mechanics to predict configurations, energetics, and electronic structures	Accurate modeling of reactive systems, catalytic sites, and mechanistic studies	[124]
rDock	Open-source	Docking of small molecules to proteins and nucleic acids	Virtual screening, binding mode prediction, protein and nucleic acid targets	[125]
SuperStar	Commercial	Generates protein interaction maps from crystallographic data; predicts “hot-spots” for favorable interactions	Binding site analysis, hot-spot prediction, and molecular design support	[126]


© 2026 by the authors. Published by MDPI on behalf of the Österreichische Pharmazeutische Gesellschaft. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

