Virtual Combinatorial Chemistry and Pharmacological Screening: A Short Guide to Drug Design

Suay-García, Beatriz; Bueso-Bordils, Jose I.; Falcó, Antonio; Antón-Fos, Gerardo M.; Alemán-López, Pedro A.

doi:10.3390/ijms23031620

Open Access Review

by Beatriz Suay-García ¹,*, Jose I. Bueso-Bordils ², Antonio Falcó ¹, Gerardo M. Antón-Fos ² and Pedro A. Alemán-López ²

¹ ESI International @ UCHCEU, Departamento de Matemáticas, Física y Ciencias Tecnológicas, Universidad Cardenal Herrera-CEU, CEU Universities San Bartolomé 55, Alfara del Patriarca, 46115 Valencia, Spain

² Departamento de Farmacia, Universidad Cardenal Herrera-CEU, CEU Universities, C/Ramón y Cajal s/n, Alfara del Patriarca, 46115 Valencia, Spain

* Author to whom correspondence should be addressed.

Int. J. Mol. Sci. 2022, 23(3), 1620; https://doi.org/10.3390/ijms23031620

Submission received: 23 December 2021 / Revised: 24 January 2022 / Accepted: 28 January 2022 / Published: 30 January 2022

(This article belongs to the Special Issue Drug Design and Virtual Screening)

Abstract

Traditionally, drug development involved the individual synthesis and biological evaluation of hundreds to thousands of compounds with the intention of highlighting their biological activity, selectivity, and bioavailability, as well as their low toxicity. On average, this process of new drug development involved, in addition to high economic costs, a period of several years before finding a drug with suitable characteristics for commercialization. Therefore, the chemical synthesis of new compounds became the limiting step in the process of searching for or optimizing leads for new drug development. This need for large chemical libraries led to the birth of high-throughput synthesis methods and combinatorial chemistry. Virtual combinatorial chemistry is based on the same principle as real chemistry: many different compounds can be generated from a few building blocks at once. The difference lies in its speed, as millions of compounds can be produced in a few seconds. In parallel, many virtual screening methods, such as QSAR (Quantitative Structure–Activity Relationship), pharmacophore models, and molecular docking, have been developed to study these libraries. These models allow for the selection of molecules to be synthesized and tested with a high probability of success. The virtual combinatorial chemistry–virtual screening tandem has become a fundamental tool in the process of searching for and developing a drug, as it allows the process to be accelerated with extraordinary economic savings.

Keywords: virtual combinatorial chemistry; virtual screening; QSAR; drug development

1. Introduction

Traditionally, drug development included the individual synthesis and biological evaluation of hundreds of organic compounds with the intention of characterizing their biological activity, selectivity, bioavailability, and toxicity. On average, this process involved high economic costs and several years of research before identifying a drug with suitable characteristics to be commercialized [1]. Thus, the identification and synthesis of new compounds rapidly became the limiting step in the discovery and optimization of lead compounds for the development of new drugs [2]. In the past, chemical libraries used in biological assays were obtained by gathering compounds via purification and identification of biologically active ingredients from natural, marine, or fermentative products among other sources [3]. This was a time-consuming process that led to the appearance of combinatorial chemistry as a method to obtain large chemical libraries in a time-effective manner [2].

De Julian-Ortiz defined Virtual Combinatorial Chemistry (VCC) as the computational simulation of the generation of new chemical structures by using a combinatorial strategy to generate a virtual library [4]. Since the generated compounds do not necessarily have to be new, VCC could be defined more precisely as computational simulation to generate structurally related compounds. Moreover, the concept of a virtual combinatorial library should be clearly separated from databases in which compounds are not structurally related. In other words, a virtual combinatorial library can be generated by combining a limited number of chemical building blocks. The emergence of VCC, along with the publication of many databases with hundreds or thousands of compounds, has propelled the development of computational methods designed to analyze the rapidly increasing amount of chemical information being generated [5]. Initially, these libraries or databases were analyzed using High-Throughput Screening (HTS), which involved the experimental screening of entire compound collections. However, the growing number of compounds available for screening promoted the development of computational approaches to complement HTS, such as Virtual Screening (VS) [6]. The main advantage of VS is that, while HTS requires experimentation to obtain results, VS consists of the computational evaluation of databases with the aim of selecting a small number of reliable and experimentally testable candidate compounds that have a high probability of being active [5].

Different methodologies have been developed to carry out VS and they can be divided into two main categories: ligand-based VS (LBVS) and structure-based VS (SBVS) [7]. LBVS methods use the structural and biological data from a set of known active compounds to identify promising candidates for experimental screening [8]. These chemical data can be based on either 2D or 3D representations of the molecules. On the other hand, SBVS requires the 3D representation of the target, as this approach aims to find molecules that fit within a binding site in the best position and orientation possible [9].

Furthermore, besides identifying the appropriate chemical structure, other factors must be considered during the drug design process. For example, variations in crystal structure can lead to different polymorphs of a solid compound with different physicochemical characteristics that can translate to pharmacokinetic differences that, in turn, may affect their activity [10,11,12]. For this reason, understanding crystallization has become increasingly important to have a reproducible drug production process. In fact, Density Functional Theory (DFT) has become increasingly popular in drug design because it can predict this behavior in active pharmaceutical ingredients, among many other things [13].

This review discusses chemical combinatorial libraries as well as other existing databases available for VS and the different methodologies used for VS. This review is divided into three main parts. In the first part, we analyze the different strategies used to generate virtual combinatorial libraries as well as the methods that can be used to do so. In the second part, we review the methodologies used to carry out the virtual screening of combinatorial libraries and non-combinatorial databases. Lastly, the third part includes examples and applications of the aforementioned methodologies in the discovery and development of new drugs.

2. Virtual Combinatorial Library Creation

The design of virtual combinatorial libraries (VCLs) is a critical part in the early phases of the drug discovery process as these libraries are used in lead generation projects to identify series of analogues around hit and lead compounds to explore structure–activity relationships (SARs) [14]. Starting from a single known bioactive molecule acting as a template, a set of theoretically isofunctional molecules can be virtually assembled mimicking the pharmacophore pattern [15]. In the following, we discuss the different approaches that can be followed to create a VCL as well as the different software platforms available to do so.

2.1. Types of Combinatorial Libraries

There are two main classifications of VCLs regarding their generation process: based on a synthetic route or based on a scaffold structure.

The synthetic route approach starts with the identification of the chemical reactions to be followed to obtain the designed compounds. This includes the reaction rules, the reaction strategy, allowed products, forbidden products, parameter values that define the logical conditions for reaction application, and the sites where reactions occur [16]. Basically, the library is made up of the products of carrying out a certain reaction with n reactants of type A and n reactants of type B. This approach closely imitates the steps followed in real chemical synthesis. In fact, its similarity to in situ chemical synthesis is the reason why this is the approach generally followed by the pharmaceutical industry. Examples of the application of reaction-based VCLs in the pharmaceutical industry include BI-Claim developed by Boehringer Ingelheim, Eli Lilly’s Proximal Collection, and Pfizer global virtual library (PGVL) [17,18,19]. All these VCLs were built using prevalidated or reported reactions as well as accessible chemical reagents. Similarly, Humbeck et al. developed CHIPMUNK, a VCL that covers over 95 million compounds [20]. This combinatorial database is composed of three sub-libraries, each being the product of a special set of in-silico-performed reactions: heterocycle forming reactions, medicinal chemistry reactions, and multicomponent reactions. Another example of a VCL based on a synthetic route is ZINClick [21]. This combinatorial library contains over 16 million 1,4-disubstituted-1,2,3-triazoles that can be synthesized via a “click” 1,3-dipolar cycloaddition reaction between azides and alkynes catalyzed by copper salts. Similarly, Saldívar-González et al. applied a Diversity-Oriented Synthesis strategy to design a library of lactams that could be easily synthesized by performing a series of intramolecular pairing reactions to form an amide bond between carboxylic acids and primary or secondary amines [22].
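The reaction-based idea described above, in which every reactant of type A is combined with every reactant of type B under one reaction rule, can be sketched in a few lines of Python. This is only a minimal illustration and not the method used by any of the cited libraries: it assumes an amide coupling, represents molecules as SMILES strings, and relies on naive string handling. The helper `couple_amide` and both reactant lists are hypothetical; a real workflow would apply validated reaction templates with a cheminformatics toolkit.

```python
from itertools import product

# Hypothetical building blocks written as SMILES strings.
# Acids are written so that the string ends with the carboxyl group "C(=O)O".
ACIDS = ["CC(=O)O", "c1ccccc1C(=O)O"]   # acetic acid, benzoic acid
# Amines are written so that the string starts with the reacting nitrogen "N".
AMINES = ["NC", "NCC", "NC1CCCCC1"]     # methylamine, ethylamine, cyclohexylamine


def couple_amide(acid: str, amine: str) -> str:
    """Naive amide coupling: drop the acid's hydroxyl O and attach the amine N."""
    if not acid.endswith("C(=O)O"):
        raise ValueError(f"acid must end with C(=O)O: {acid}")
    if not amine.startswith("N"):
        raise ValueError(f"amine must start with N: {amine}")
    return acid[:-1] + amine  # "...C(=O)" + "N..."


def enumerate_library(acids, amines):
    """Return one product per (acid, amine) pair."""
    return [couple_amide(a, b) for a, b in product(acids, amines)]


library = enumerate_library(ACIDS, AMINES)
print(len(library), "virtual products")
```

The library size is the product of the two reactant list lengths, which is why a few building blocks are enough to produce very large virtual libraries.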

The other main approach to VCL design is that based on a scaffold structure. This method consists in the determination of a common skeleton with variable sites tagged as R₁, R₂, R₃… R_n, where each one is associated with a list of possible substituents [23]. This approach is ideal in those cases where there are different synthetic routes described to obtain a common scaffold [24]. This type of VCL is focused on a specific target, structural class, or pharmacophore as it stresses the exploration of a specific area of the chemical space, resulting in a small number of structurally related compounds based on a known target or family [24]. Examples of this type of VCL include the combinatorial library of 1001 6-fluoroquinolones developed by Bueso-Bordils et al. [25] to identify new compounds with antibacterial activity against methicillin-resistant Staphylococcus aureus (MRSA). The library was built using a 6-fluoroquinolone skeleton with structural variations in positions 1, 7, and 8. Similarly, Kouman et al. designed a VCL based on a benzamide scaffold to identify new Mycobacterium tuberculosis 2-trans enoyl-acyl carrier protein reductase inhibitors with favorable pharmacokinetic profiles [26]. Lauro et al. have also built a library containing approximately 2.0 × 10⁴ virtual compounds by following a multicomponent-based chemical route for the decoration of the 2,4-thiazolidinedione core [27].
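The scaffold-based approach can be sketched in the same spirit. The example below is an illustrative assumption rather than a reconstruction of any cited library: the scaffold is a placeholder string with three tagged sites, and the substituent lists are hypothetical. It only shows how a common skeleton with R-group lists expands into a combinatorial library.

```python
from itertools import product
from math import prod

# Hypothetical scaffold with three tagged substitution sites.
SCAFFOLD = "core({R1})({R2})({R3})"

# Hypothetical substituent lists, one per tagged site.
SUBSTITUENTS = {
    "R1": ["H", "CH3", "C2H5"],
    "R2": ["F", "Cl"],
    "R3": ["OH", "NH2", "OCH3", "CN"],
}


def enumerate_scaffold(scaffold, substituents):
    """Yield one decorated scaffold per combination of substituents."""
    sites = list(substituents)
    for combo in product(*(substituents[site] for site in sites)):
        yield scaffold.format(**dict(zip(sites, combo)))


library = list(enumerate_scaffold(SCAFFOLD, SUBSTITUENTS))
expected_size = prod(len(options) for options in SUBSTITUENTS.values())
print(len(library), "compounds; expected", expected_size)
```

Because the library size is the product of the substituent list lengths, restricting each list keeps the library small and focused on one region of chemical space.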

2.2. Generation of Combinatorial Libraries

Virtual combinatorial libraries can be generated using different computational tools and software [28]. Table 1 summarizes different tools that can be used to build VCLs of small molecules. Some of these tools, such as KNIME, RDKit, DataWarrior, and Reactor, allow for the creation of a VCL based on a list of prevalidated reactions [29,30,31,32,33,34]. Others, such as Library Synthesizer, SimLib, MOE, Schrödinger, and Nova, use the scaffold-based approach to create the combinatorial library by allowing the user to select a common scaffold or molecular skeleton with tagged substitution points to which different R groups will be attached [35,36,37,38,39,40]. Finally, a third type of model includes those using multi-objective algorithms such as CCLab and MoSELECT [29,30]. In this case, the tool does not only provide a set of combinatorial compounds, but also provides filtering options regarding aspects such as synthesis cost, drug-likeness, physicochemical properties, and structural diversity. These tools allow the relationship between different objectives to be explored with competing objectives easily identified. Thus, the library designer can make an informed choice on which solution to explore.
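To illustrate how competing objectives can be explored, the following sketch computes a Pareto front over two objectives, synthesis cost (to be minimized) and structural diversity (to be maximized). It does not reproduce the algorithms of CCLab or MoSELECT; the candidate libraries and their values are hypothetical and serve only to show the dominance test.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    name: str
    cost: float        # objective to minimize
    diversity: float   # objective to maximize


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is at least as good as b on both objectives and better on one."""
    no_worse = a.cost <= b.cost and a.diversity >= b.diversity
    strictly_better = a.cost < b.cost or a.diversity > b.diversity
    return no_worse and strictly_better


def pareto_front(candidates):
    """Keep the candidates that no other candidate dominates."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Hypothetical candidate libraries with made-up objective values.
CANDIDATES = [
    Candidate("lib_A", cost=10.0, diversity=0.60),
    Candidate("lib_B", cost=12.0, diversity=0.80),
    Candidate("lib_C", cost=15.0, diversity=0.70),  # dominated by lib_B
    Candidate("lib_D", cost=8.0, diversity=0.40),
]

front = pareto_front(CANDIDATES)
print([c.name for c in front])
```

The non-dominated set is what a designer would inspect: each remaining candidate represents a different trade-off, and none is better than another on every objective.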

3. Virtual Screening

Virtual screening can be defined as a computational technique that is generally used in the early stages of the drug discovery process to search libraries of small molecules to identify chemical compounds that are likely to bind to one or several drug targets [42]. In other words, VS is a step-by-step method with a series of filters able to narrow down and choose a set of lead-like hits with potential biological activity against intended drug targets [43]. Essentially, VS could be considered as an experimental high-throughput screening (HTS) performed in silico [44]. VS presents two main advantages when compared to the traditional experimental HTS. Firstly, it acts as a filter, selecting only those candidates with the most favorable characteristics to be active, which can then be tested in vitro. This leads to the second main advantage, which is the fact that, since the compounds studied do not necessarily exist, their “testing” does not consume valuable substance material, which, in turn, improves the time- and cost-effectiveness of the drug development process. Therefore, any molecule can, in theory, be evaluated using VS.
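The view of VS as a series of filters can be sketched as a simple cascade. The example is a hedged illustration only: the compounds and their property values are hypothetical, the first filter is a strict form of Lipinski's rule of five (no violations allowed), and `predicted_score` stands for the output of some activity model that is assumed here rather than taken from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Compound:
    name: str
    mol_weight: float
    log_p: float
    h_donors: int
    h_acceptors: int
    predicted_score: float  # hypothetical model output between 0 and 1


def passes_rule_of_five(c: Compound) -> bool:
    """Strict rule of five: MW <= 500, log P <= 5, donors <= 5, acceptors <= 10."""
    return (c.mol_weight <= 500 and c.log_p <= 5
            and c.h_donors <= 5 and c.h_acceptors <= 10)


def passes_activity(c: Compound, threshold: float = 0.7) -> bool:
    """Keep compounds whose predicted score reaches the (assumed) threshold."""
    return c.predicted_score >= threshold


def run_cascade(compounds, filters):
    """Apply filters in order; return survivors and the count after each step."""
    remaining = list(compounds)
    counts = [len(remaining)]
    for keep in filters:
        remaining = [c for c in remaining if keep(c)]
        counts.append(len(remaining))
    return remaining, counts


# Hypothetical compounds with made-up property values.
COMPOUNDS = [
    Compound("cpd_1", 320.0, 2.1, 2, 5, 0.90),
    Compound("cpd_2", 610.0, 4.0, 3, 8, 0.95),  # too heavy
    Compound("cpd_3", 450.0, 5.6, 1, 6, 0.80),  # too lipophilic
    Compound("cpd_4", 280.0, 1.2, 1, 4, 0.30),  # low predicted activity
]

hits, counts = run_cascade(COMPOUNDS, [passes_rule_of_five, passes_activity])
print(counts, [c.name for c in hits])
```

Ordering cheap filters before expensive ones is the usual design choice, since each step reduces the number of compounds passed to the next.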

3.1. Methods Used in Virtual Screening

Virtual screening techniques can be grouped into two major categories, depending on the type of information used to develop the screening models. Ligand-based virtual screening relies on structural and physicochemical properties of the chemical scaffold of known active and inactive molecules and is based on the molecular similarity principle [7]. On the other hand, SBVS exploits the three-dimensional structure of the target protein [9]. In the following, we will describe different methodologies used in LBVS and SBVS.

3.1.1. Ligand-Based Virtual Screening (LBVS)

As mentioned above, LBVS is based on molecular similarity through the comparison of different structural and physicochemical properties [7]. The main hypothesis behind LBVS is that similar compounds will cause similar biological effects. Essentially, large ligand libraries are searched to identify compounds with similar chemical properties or shapes to molecules with known pharmacological activity, which can in turn result in the identification of new active compounds [45]. The search can be performed using several screening methods that differ in the measure of similarity, ranging from two-dimensional descriptors to shape comparisons and three-dimensional descriptors.
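One common way to quantify two-dimensional similarity is the Tanimoto coefficient between molecular fingerprints. The sketch below is an assumption introduced for illustration: the text does not prescribe a particular measure, and the fingerprints are hypothetical sets of "on" bit positions rather than the output of a real fingerprinting algorithm.

```python
def tanimoto(a: frozenset, b: frozenset) -> float:
    """Tanimoto coefficient: shared bits divided by bits present in either set."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_by_similarity(query: frozenset, library: dict) -> list:
    """Return (name, similarity) pairs sorted from most to least similar."""
    scored = [(name, tanimoto(query, fp)) for name, fp in library.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical fingerprints: each set holds the positions of the bits that are on.
QUERY = frozenset({1, 2, 3, 4})
LIBRARY = {
    "cpd_a": frozenset({1, 2, 3, 4, 5}),
    "cpd_b": frozenset({1, 2, 9}),
    "cpd_c": frozenset({7, 8}),
}

ranking = rank_by_similarity(QUERY, LIBRARY)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

Under the similarity principle, the compounds at the top of such a ranking are the ones prioritized for experimental testing.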

Quantitative Structure–Activity Relationship (QSAR) models are one of the main methods used in LBVS. These models can identify the correlation between structure-based molecular descriptors and biological activity [46]. Traditionally, these models were used retrospectively, with scientists focused on developing explanatory models of existing data [47]. However, the substantial increase in the size of available experimental datasets has led to an increase in the use of QSAR models as a virtual screening tool to discover active compounds in chemical databases and VCLs [48]. There are many QSAR approaches, which differ in the structural parameters (also known as descriptors) used to characterize molecules as well as in the mathematical approaches used to establish the correlation between descriptor values and pharmacological activity [49].

The molecular descriptors used in QSAR models can be divided into five groups: topological, geometrical, thermodynamic, electronic, and constitutional [50,51,52]. Topological and geometrical descriptors represent the connectivity of atoms in a molecule as well as its shape but, while topological descriptors are based on 2D molecular graphs, geometrical descriptors are calculated from the 3D coordinates of the atoms. Thermodynamic descriptors relate the chemical structure to an observed chemical behavior. Examples of these include molar refractivity as a combined measure of molecular size and polarizability, log P to characterize the hydrophobicity of the molecule, and solvation free energies [53]. Electronic descriptors describe electronic aspects of the molecule or atom bonds such as the charge distribution in a molecule. Lastly, constitutional descriptors reflect simple chemical information about a molecule, such as the molecular weight or the number of bonds in the molecule.
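As a concrete example of a topological descriptor computed from a 2D molecular graph, the sketch below calculates the Wiener index (the sum of shortest-path distances between all pairs of heavy atoms) together with a simple constitutional descriptor, the bond count. The Wiener index is chosen here only for illustration and is not singled out by the text; the molecules are given as hydrogen-suppressed adjacency lists.

```python
from collections import deque


def wiener_index(adjacency: dict) -> int:
    """Sum of shortest-path lengths over all unordered atom pairs (BFS per atom)."""
    total = 0
    for source in adjacency:
        distance = {source: 0}
        queue = deque([source])
        while queue:
            node = queue.popleft()
            for neighbour in adjacency[node]:
                if neighbour not in distance:
                    distance[neighbour] = distance[node] + 1
                    queue.append(neighbour)
        total += sum(distance.values())
    return total // 2  # each pair was counted from both ends


def bond_count(adjacency: dict) -> int:
    """Constitutional descriptor: number of bonds between heavy atoms."""
    return sum(len(neighbours) for neighbours in adjacency.values()) // 2


# Hydrogen-suppressed carbon skeletons of two C4H10 isomers.
N_BUTANE = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
ISOBUTANE = {0: [1, 2, 3], 1: [0], 2: [0], 3: [0]}

print("n-butane:", wiener_index(N_BUTANE), bond_count(N_BUTANE))
print("isobutane:", wiener_index(ISOBUTANE), bond_count(ISOBUTANE))
```

The two isomers share the same constitutional descriptor but differ in the topological one, which shows why the descriptor classes carry complementary information.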

There are many mathematical methods used to build the QSAR predictive models. These could be grouped into linear and machine learning approaches [54]. Linear methods, which include linear discriminant analysis, multiple linear regression, and partial least squares, among others, fit data to an equation and report the coefficients derived from it. On the other hand, machine learning methods, among which one can find neural networks and support vector machines, process input information and recognize patterns.
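For the linear case, the phrase "fit data to an equation and report the coefficients" can be made explicit with multiple linear regression. The notation below is an assumption introduced for illustration: y_i is the measured activity of compound i, x_ij is the value of descriptor j for that compound, and p is the number of descriptors.

```latex
% Multiple linear regression QSAR model for compound i with p descriptors
\hat{y}_i = \beta_0 + \sum_{j=1}^{p} \beta_j \, x_{ij}

% Ordinary least-squares estimate of the coefficients, where X is the
% descriptor matrix (with a leading column of ones) and y the measured activities
\hat{\boldsymbol{\beta}} = \left( X^{\top} X \right)^{-1} X^{\top} \mathbf{y}
```

Linear discriminant analysis uses the same kind of linear combination of descriptors, but the resulting value serves as a discriminant function that assigns a compound to a class (for example, active or inactive) instead of predicting a continuous activity.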

Another widely used LBVS approach is pharmacophore-based modeling. In this case, different algorithms are applied to identify configurations or spatial arrangements of chemical features that are common to molecules with a known activity [55]. These chemical features include, but are not limited to, hydrogen bonds, charges, and hydrophobic areas [56]. The analysis can be carried out in either a 2D or 3D space [57]. Pharmacophore models are based on the principle that novel compounds able to fulfill a certain interaction pattern regarding the aforementioned chemical features should bind and show comparable biological activity to that of the known active molecule. Pharmacophore modeling starts with the identification of the pharmacophore of a molecule with a desired activity. Subsequently, a conformational analysis is carried out where the flexibility of small molecules is handled by enumerating multiple conformations for each molecule in the database. Pharmacophore-based LBVS can sometimes be confused with molecular docking, an SBVS method. The main differences between them will be discussed after molecular docking is explained.
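The matching step can be sketched as a comparison of distances between pharmacophore features across enumerated conformations. This is a simplified illustration under stated assumptions, not the algorithm of any specific tool: each feature type occurs once per molecule, the coordinates and the 0.5 Å tolerance are hypothetical, and only pairwise distances are compared (so mirror-image arrangements are not distinguished).

```python
from itertools import combinations
from math import dist

# Hypothetical query pharmacophore: feature type -> 3D position (in angstroms).
QUERY = {
    "donor": (0.0, 0.0, 0.0),
    "acceptor": (3.0, 0.0, 0.0),
    "hydrophobe": (0.0, 4.0, 0.0),
}


def pair_distances(features: dict, names) -> dict:
    """Distances between every pair of the named features."""
    return {(a, b): dist(features[a], features[b])
            for a, b in combinations(sorted(names), 2)}


def conformer_matches(conformer: dict, query: dict, tol: float = 0.5) -> bool:
    """A conformer matches if all query feature distances agree within tol."""
    if not set(query) <= set(conformer):
        return False
    wanted = pair_distances(query, query)
    found = pair_distances(conformer, query)
    return all(abs(wanted[pair] - found[pair]) <= tol for pair in wanted)


def molecule_matches(conformers: list, query: dict, tol: float = 0.5) -> bool:
    """A flexible molecule matches if any enumerated conformer matches."""
    return any(conformer_matches(c, query, tol) for c in conformers)


# Two hypothetical conformations of the same molecule.
EXTENDED = {"donor": (0.0, 0.0, 0.0), "acceptor": (6.0, 0.0, 0.0),
            "hydrophobe": (0.0, 4.0, 0.0)}
FOLDED = {"donor": (1.0, 1.0, 1.0), "acceptor": (4.2, 1.0, 1.0),
          "hydrophobe": (1.0, 5.0, 1.0)}

flexible_hit = molecule_matches([EXTENDED, FOLDED], QUERY)
rigid_miss = molecule_matches([EXTENDED], QUERY)
print(flexible_hit, rigid_miss)
```

The example shows why conformations are enumerated: the molecule is only recognized as a hit when its folded conformation is included in the search.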

3.1.2. Structure-Based Virtual Screening (SBVS)

SBVS, also known as target-based virtual screening (TBVS), aims to predict the best interaction between ligands and a molecular target to form a complex [9]. In other words, the affinity of different ligands to the target is assessed and ranked. Thus, to perform SBVS, the 3D structure of the target protein must be known to be able to predict the interactions between the target and each chemical compound in silico [58]. This technique is based on a series of algorithms that explore the geometrically feasible alignments of different ligands with a specific drug target [59]. As a result, the ligands are ranked according to their affinity with the receptor site, allowing for the identification of molecules that are more likely to present pharmacological activity. In order to carry out this ranking, scoring functions are used to approximate the binding free energy between the protein and the ligand in each docking pose [60]. Lastly, the results are processed to examine the validity of the generated pose, undesirable chemical moieties, metabolic liabilities, desired physicochemical properties, lead-likeness, and chemical diversity [61].

Scoring functions play a key role in molecular docking. These functions can be divided into three categories: empirical, knowledge-based, and physics-based [62]. Empirical functions are some of the most widely used as they are easy to compute. These functions try to capture relevant elements of binding free energy, such as solvent-accessible surface, entropy, and hydrogen bonds, and then fit them to experimental data [63]. In fact, because of their simple energy terms, these scoring functions can be applied to binding affinity prediction, ligand pose prediction, and virtual screening with low computing costs; however, their accuracy is lower compared with the other two types of functions [64]. On the other hand, knowledge-based scoring functions derive pairwise potentials from the three-dimensional structures of a large set of protein–ligand complexes based on the inverse Boltzmann statistical principle [65]. In this case, the size and quality of the databases used to derive the statistical potentials have a great impact on the accuracy of knowledge-based scoring functions. Lastly, physics-based scoring functions include scoring functions based on force field, solvation model, and quantum mechanics methods [66,67,68]. These scoring functions can directly compute the interactions between the atoms of a protein and ligand, and have a greater predictive accuracy than other types of scoring functions because they take enthalpy, solvation, and entropy into account.
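The general form of the first two categories can be written compactly. The symbols below are assumptions introduced for illustration and do not correspond to any specific published scoring function: f_k are interaction terms evaluated on a docking pose, w_k are fitted weights, g_ij(r) is the observed distribution of distances between atom types i and j in known complexes, and g_ref(r) is a reference distribution.

```latex
% Empirical scoring: weighted sum of interaction terms f_k evaluated on a
% docking pose, with weights w_k fitted to experimental binding data
\Delta G_{\mathrm{bind}} \approx w_0 + \sum_{k} w_k \, f_k(\text{pose})

% Knowledge-based scoring: inverse Boltzmann relation for the atom-type
% pair (i, j) at distance r
u_{ij}(r) = -k_{B} T \, \ln \frac{g_{ij}(r)}{g_{\mathrm{ref}}(r)}

% Pose score: sum of pair potentials over protein atoms p and ligand atoms l,
% where t(.) denotes the atom type and r_{pl} the interatomic distance
S = \sum_{p} \sum_{l} u_{t(p)\,t(l)}(r_{pl})
```

Atom-pair distances that occur more often than in the reference state receive a negative (favorable) potential, which is why the size and quality of the structural database directly affect the resulting scores.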

Having seen molecular docking and pharmacophore-based VS, it is easy to confuse one with the other as both aim to identify molecules capable of binding to a certain drug target. However, the difference between them lies, essentially, in the methodology. While pharmacophore-based VS uses the structures of ligands with known pharmacological activity to predict chemical structures that should bind to proteins in the same way, molecular docking requires the defined 3D structure of the target protein to study which compounds will bind more effectively to it and, thus, have the highest probability of being pharmacologically active [64,69].

4. Applications and Current Trends

The different methodologies of VS have been widely used for the discovery and development of new drugs. This VS can be either performed on virtual combinatorial libraries or on large databases of chemical compounds available online (Figure 1). The number of chemical databases available for VS has increased exponentially in the last few years as the advances in computational methods have vastly increased the information output [70]. These databases include chemical, biomolecular, drug–target interaction, and/or disease information and can be used for drug discovery and drug repurposing. Some of the most widely used databases in medicinal chemistry include PubChem, ZINC, ChemSpider, and DrugBank [71,72,73,74]. In the following, we will present successful examples of the different VS techniques applied in both VCLs and chemical databases for the discovery of new drugs in the early stages of the development process.

As was mentioned earlier, QSAR models were initially used to interpret the structure–activity relationship of lead compounds. However, this technique evolved and QSAR models began to be applied in the prediction of pharmacological activity. For example, Bueso-Bordils et al. built a QSAR model based on linear discriminant analysis to predict antibacterial activity against MRSA [25]. They used this model to virtually screen a fluoroquinolone VCL, identifying 117 theoretically active molecules of which five were synthesized and three showed anti-MRSA activity comparable to that of ciprofloxacin. Similarly, Suay-García et al. developed a tree-based QSAR model based on quinolones that was applied to the DrugBank database to screen for active compounds against Escherichia coli [75]. The model identified 134 drugs with theoretical activity against E. coli of which eight were already commercialized as antibacterial drugs, 67 were approved for different pathologies, and 55 were drugs in experimental stages. The same methodology was used by Luo et al. to develop a binary classification QSAR prediction model that was used to mine drug-like, diversity, and GPCR-targeted libraries to identify novel anxiolytics and potential antischizophrenic drugs [76]. Another QSAR model was developed using GUSAR software to identify novel HIV-1 integrase inhibitors [77]. This model was used to virtually screen a subset of 308 structurally distinct compounds from the BindingDB database. Of these, 236 compounds were selected as potential candidates for synthesis due to their good druglikeness. Finally, six compounds were chosen to be synthesized and one of them was experimentally confirmed to inhibit the strand transfer reaction in HIV. More recently, Zaki et al. developed a balanced QSAR model based on the genetic similarity between SARS-CoV-2 and SARS-CoV to identify novel molecules with inhibitory potential against the main protease of SARS-CoV-2 [78]. The study combines a predictive QSAR model with molecular docking and molecular dynamics to screen 26,467 food compounds and 360 heterocyclic variants of a benzotriazole–indole hybrid molecule to identify promising hits to treat COVID-19.

Pharmacophore-based models are the other most common LBVS approach in virtual screening. For instance, a pharmacophore-based model was developed to identify potential σ1 receptor ligands to treat Alzheimer’s Disease [79]. This model was applied to screen 8543 compounds from the Life Chemicals database, of which five candidates presented excellent druglikeness and ADMET properties. Along these lines, Liu et al. generated a pharmacophore model from the structures of active amino alcohols to perform a virtual screening to discover novel compounds with anti-echinococcal activity [80]. The screening was performed on the ZINC15 database and, out of the 62 compounds selected by the model, 10 were found to be experimentally active against Echinococcus multilocularis. Kouman et al. followed a similar procedure to identify benzamides capable of inhibiting 2-trans enoyl-acyl carrier protein reductases in Mycobacterium tuberculosis [26]. In this case, a pharmacophore model generated from the active conformations of N-benzyl-4-((heteroaryl)methyl) benzamides (BHMBs) was used as a virtual screening tool of novel analogs included in a VCL of compounds containing benzamide scaffolds. The model identified 90 new and potent BHMBs with enhanced cell membrane permeability and high human oral absorption compared with current treatments for tuberculosis. Screening of a virtual combinatorial library with a pharmacophore model was also used to identify novel µ-opioid receptor inverse agonists to treat narcotic overdose or drug addiction [81]. More specifically, a library including 19,800 tetrapeptides was created to perform the virtual screening and three candidates were selected for binding assays.

Regarding SBVS, molecular docking is the most widely used technique. However, the latest VS trends aim for a consensus approach in which different VS techniques are used in combination to optimize results. Thus, molecular docking is generally found to be used along with LBVS models. For example, a combination of a pharmacophore-based model with 3D-QSAR and molecular docking was used to virtually screen the ZINC and ASINEX databases to identify potential dipeptidyl peptidase IV inhibitors to be used as oral antidiabetics [82]. More specifically, the pharmacophore and 3D QSAR model was used to virtually screen the aforementioned databases and the hit molecules were used to design a VCL that was evaluated using molecular docking. A similar procedure was followed by Bommu et al. to predict potential epigallocatechin gallate (EGCG) analogs against epidermal growth factor receptors [83]. In this case, log P and log S predictions along with the toxicity endpoint were modeled using QSAR, which was combined with a pharmacophore model and molecular docking to identify seven high-potential EGCG analogs as promising pharmacological, anticancer, and drug-like templates that could be used towards moderating lung cancer progression. This consensus approach was also used to identify natural compounds against mosquito-borne Chikungunya virus targets [84]. To do so, a subset of compounds from natural sources found on PubChem was studied using molecular docking and the selected potential ligands were subjected to 3D-QSAR studies to predict biological activity. Finally, Lipinski’s rule and ADMET studies were also performed, leading to the identification of the four best-fit compounds of natural origin against targets of the Chikungunya virus.
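One simple way to combine several VS techniques into a consensus is to average the rank each method assigns to a compound. The sketch below is an illustrative assumption and not the procedure used in the cited studies: the three score tables are hypothetical, and ties within a method are not given special treatment.

```python
def ranks(scores: dict, higher_is_better: bool = True) -> dict:
    """Convert raw scores into ranks, where 1 is the best compound."""
    ordered = sorted(scores, key=scores.get, reverse=higher_is_better)
    return {name: position + 1 for position, name in enumerate(ordered)}


def consensus_order(rank_tables: list) -> list:
    """Order compounds by mean rank across methods (name breaks exact ties)."""
    names = list(rank_tables[0])
    mean_rank = {name: sum(table[name] for table in rank_tables) / len(rank_tables)
                 for name in names}
    return sorted(names, key=lambda name: (mean_rank[name], name))


# Hypothetical outputs of three screening methods for the same compounds.
QSAR_SCORE = {"cpd_A": 0.90, "cpd_B": 0.70, "cpd_C": 0.40}      # higher is better
DOCKING_ENERGY = {"cpd_A": -7.0, "cpd_B": -9.5, "cpd_C": -6.0}  # lower is better
PHARMACOPHORE_FIT = {"cpd_A": 0.80, "cpd_B": 0.60, "cpd_C": 0.70}

tables = [
    ranks(QSAR_SCORE),
    ranks(DOCKING_ENERGY, higher_is_better=False),
    ranks(PHARMACOPHORE_FIT),
]
order = consensus_order(tables)
print(order)
```

Working with ranks instead of raw scores avoids having to put a docking energy and a QSAR probability on the same scale, which is one reason rank-based consensus is a convenient first choice.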

5. Conclusions

Virtual Combinatorial Chemistry and the different Virtual Screening tools are key to developing new drugs in a time- and cost-effective manner. These in silico methods, whether combined or on their own, accelerate the drug discovery process by acting as filters, allowing experimental evaluation to be focused only on the most drug-like compounds.

Author Contributions

Conceptualization, P.A.A.-L., G.M.A.-F. and A.F.; resources, A.F.; data curation, B.S.-G.; writing—original draft preparation, B.S.-G. and J.I.B.-B.; writing—review and editing, P.A.A.-L. and B.S.-G.; visualization, G.M.A.-F.; supervision, P.A.A.-L.; project administration, A.F.; funding acquisition, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by grant number INDI21/15 from Universidad CEU Cardenal Herrera.

Conflicts of Interest

The authors declare no conflict of interest.

References

Garrett, M.D.; Workman, P. Discovering novel chemotherapeutic drugs for the third millennium. Eur. J. Cancer 1999, 35, 2010–2030. [Google Scholar] [CrossRef]
Guido, R.V.C.; Oliva, G.; Andricopulo, A.D. Modern drug discovery technologies: Opportunities and challenges in lead discovery. Comb. Chem. High Throughput Screen. 2011, 14, 830–839. [Google Scholar] [CrossRef] [PubMed]
Cho, S.J.; Zheng, W.; Tropsha, A. Rational Combinatorial Library Design. 2. Rational Design of Targeted Combinatorial Peptide Libraries Using Chemical Similarity Probe and the Inverse QSAR Approaches. J. Chem. Inf. Comput. Sci. 1998, 38, 259–268.
De Julian-Ortiz, J.V. Virtual darwinian drug design: QSAR inverse problem, virtual combinatorial chemistry, and computational screening. Comb. Chem. High Throughput Screen. 2001, 4, 295–310.
López-Vallejo, F.; Caulfield, T.; Martinez-Mayorga, K.; Giulianotti, M.A.; Nefzi, A.; Houghten, R.A.; Medina-Franco, J.L. Integrating Virtual Screening and Combinatorial Chemistry for Accelerated Drug Discovery. Comb. Chem. High Throughput Screen. 2011, 14, 475–487.
Bajorath, J. Integration of virtual and high-throughput screening. Nat. Rev. Drug Discov. 2002, 1, 882–894.
Lill, M. Virtual Screening in Drug Design. Methods Mol. Biol. 2013, 993, 1–12.
Jahn, A.; Hinselmann, G.; Fechner, N.; Zell, A. Optimal assignment methods for ligand-based virtual screening. J. Cheminformatics 2009, 1, 14–23.
Maia, E.H.B.; Assis, L.C.; De Oliveira, T.A.; Da Silva, A.M.; Taranto, A.G. Structure-Based Virtual Screening: From Classical to Artificial Intelligence. Front. Chem. 2020, 8, 343.
Bauer, J.; Spanton, S.; Henry, R.; Quick, J.; Dziki, W.; Porter, W.; Morris, J. Ritonavir: An Extraordinary Example of Conformational Polymorphism. Pharm. Res. 2001, 18, 859–866.
Zhou, Y.; Wang, J.; Xiao, Y.; Wang, T.; Huang, X. The Effects of Polymorphism on Physicochemical Properties and Pharmacodynamics of Solid Drugs. Curr. Pharm. Des. 2018, 24, 2375–2382.
Drebushchak, V.A.; McGregor, L.; Rychkov, D.A. Cooling rate “window” in the crystallization of metacetamol form II. J. Therm. Anal. Calorim. 2017, 127, 1807–1814.
Mazurek, A.H.; Szeleszczuk, Ł.; Pisklak, D.M. Periodic DFT Calculations—Review of Applications in the Pharmaceutical Sciences. Pharmaceutics 2020, 12, 415.
Vainio, M.J.; Kogej, T.; Raubacher, F. Automated Recycling of Chemistry for Virtual Screening and Library Design. J. Chem. Inf. Model. 2012, 52, 1777–1786.
Schneider, G. Trends in virtual combinatorial library design. Curr. Med. Chem. 2002, 9, 2095–2101.
Nikolay, K.; Svetlana, A.; Nina, J. Combinatorial generation of molecules by virtual software reactor. Sci. Work Union Sci. Bulg. Plovdiv 2017, 11, 214–219.
Lessel, U.; Wellenzohn, B.; Lilienthal, M.; Claussen, H. Searching Fragment Spaces with Feature Trees. J. Chem. Inf. Model. 2009, 49, 270–279.
Nicolaou, C.A.; Watson, I.A.; Hu, H.; Wang, J.-B. The Proximal Lilly Collection: Mapping, Exploring and Exploiting Feasible Chemical Space. J. Chem. Inf. Model. 2016, 56, 1253–1266.
Hu, Q.; Peng, Z.; Sutton, S.C.; Na, J.; Kostrowicki, J.; Yang, B.; Thacher, T.; Kong, X.; Mattaparti, S.; Zhou, J.Z.; et al. Pfizer Global Virtual Library (PGVL): A Chemistry Design Tool Powered by Experimentally Validated Parallel Synthesis Information. ACS Comb. Sci. 2012, 14, 579–589.
Humbeck, L.; Weigang, S.; Schäfer, T.; Mutzel, P.; Koch, O. CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein-Protein Interaction Modulators. ChemMedChem 2018, 13, 532–539.
Massarotti, A. Investigation of the Click-Chemical Space for Drug Design Using ZINClick. Methods Mol. Biol. 2021, 2266, 3–10.
Saldívar-González, F.I.; Lenci, E.; Calugi, L.; Medina-Franco, J.L.; Trabocchi, A. Computational-aided design of a library of lactams through a diversity-oriented synthesis strategy. Bioorganic Med. Chem. 2020, 28, 115539.
Karthikeyan, M.; Pandit, D.; Vyas, R. ChemScreener: A Distributed Computing Tool for Scaffold based Virtual Screening. Comb. Chem. High Throughput Screen. 2015, 18, 544–561.
Krier, M.; de Araújo-Júnior, J.X.; Schmitt, M.; Duranton, J.; Justiano-Basaran, H.; Lugnier, C.; Bourguignon, J.-J.; Rognan, D. Design of Small-Sized Libraries by Combinatorial Assembly of Linkers and Functional Groups to a Given Scaffold: Application to the Structure-Based Optimization of a Phosphodiesterase 4 Inhibitor. J. Med. Chem. 2005, 48, 3816–3822.
Bueso-Bordils, J.I.; Perez-Gracia, M.T.; Suay-Garcia, B.; Duart, M.J.; Algarra, R.V.M.; Zamora, L.L.; Anton-Fos, G.M.; Lopez, P.A.A. Topological pattern for the search of new active drugs against methicillin resistant Staphylococcus aureus. Eur. J. Med. Chem. 2017, 138, 807–815.
Kouman, K.C.; Keita, M.; N’Guessan, R.K.; Owono, L.C.O.; Megnassan, E.; Frecer, V.; Miertus, S. Structure-Based Design and in Silico Screening of Virtual Combinatorial Library of Benzamides Inhibiting 2-trans Enoyl-Acyl Carrier Protein Reductase of Mycobacterium tuberculosis with Favorable Predicted Pharmacokinetic Profiles. Int. J. Mol. Sci. 2019, 20, 4730.
Lauro, G.; Terracciano, S.; Cantone, V.; Ruggiero, D.; Fischer, K.; Pace, S.; Werz, O.; Bruno, I.; Bifulco, G. A Combinatorial Virtual Screening Approach Driving the Synthesis of 2,4-Thiazolidinedione-Based Molecules as New Dual mPGES-1/5-LO Inhibitors. ChemMedChem 2020, 15, 481–489.
Saldívar-González, F.I.; Huerta-García, C.S.; Medina-Franco, J.L. Chemoinformatics-based enumeration of chemical libraries: A tutorial. J. Cheminformatics 2020, 12, 1–25.
Fang, G.; Xue, M.; Su, M.; Hu, D.; Li, Y.; Xiong, B.; Ma, L.; Meng, T.; Chen, Y.; Li, J.; et al. CCLab—a multi-objective genetic algorithm based combinatorial library design software and an application for histone deacetylase inhibitor design. Bioorganic Med. Chem. Lett. 2012, 22, 4540–4545.
Gillet, V.J.; Khatib, W.; Willett, P.; Fleming, P.J.; Green, D.V.S. Combinatorial Library Design Using a Multiobjective Genetic Algorithm. J. Chem. Inf. Comput. Sci. 2002, 42, 375–385.
Berthold, M.R.; Cebron, N.; Dill, F.; Gabriel, T.R.; Kötter, T.; Meinl, T.; Ohl, P.; Thiel, K.; Wiswedel, B. KNIME—The Konstanz information miner: Version 2.0 and beyond. ACM SIGKDD Explor. Newsl. 2009, 11, 26–31.
Landrum, G. RDKit. Available online: https://www.rdkit.org/ (accessed on 28 October 2021).
Sander, T.; Freyss, J.; Von Korff, M.; Rufener, C. DataWarrior: An Open-Source Program For Chemistry Aware Data Visualization and Analysis. J. Chem. Inf. Model. 2015, 55, 460–473.
Reactor|ChemAxon. Available online: https://chemaxon.com/products/reactor (accessed on 28 October 2021).
Library synthesizer—Tripod Development. Available online: https://tripod.nih.gov/?p=370 (accessed on 28 October 2021).
Schüller, A.; Hähnke, V.; Schneider, G. SmiLib v2.0: A Java-Based Tool for Rapid Combinatorial Library Enumeration. QSAR Comb. Sci. 2007, 26, 407–410.
Chemical Computing Group (CCG)|Computer-Aided Molecular Design. Available online: https://www.chemcomp.com/ (accessed on 28 October 2021).
Schrödinger. Available online: https://www.schrodinger.com/ (accessed on 28 October 2021).
Optibrium. Available online: https://www.optibrium.com/stardrop/stardrop-nova.php (accessed on 28 October 2021).
ChemDraw. Available online: https://perkinelmerinformatics.com/products/research/chemdraw/ (accessed on 4 November 2021).
GLARE. Available online: https://glare.sourceforge.net/ (accessed on 28 October 2021).
Shoichet, B.K. Virtual screening of chemical libraries. Nature 2004, 432, 862–865.
Lavecchia, A.; Di Giovanni, C. Virtual Screening Strategies in Drug Discovery: A Critical Review. Curr. Med. Chem. 2013, 20, 2839–2860.
Tanrikulu, Y.; Krüger, B.; Proschak, E. The holistic integration of virtual screening in drug discovery. Drug Discov. Today 2013, 18, 358–364.
Ripphausen, P.; Nisius, B.; Bajorath, J. State-of-the-art in ligand-based virtual screening. Drug Discov. Today 2011, 16, 372–376.
Spiegel, J.; Senderowitz, H. Evaluation of QSAR Equations for Virtual Screening. Int. J. Mol. Sci. 2020, 21, 7828.
Tropsha, A.; Golbraikh, A. Predictive QSAR Modeling Workflow, Model Applicability Domains, and Virtual Screening. Curr. Pharm. Des. 2007, 13, 3494–3504.
Suay-Garcia, B.; Bueso-Bordils, J.I.; Falcó, A.; Pérez-Gracia, M.T.; Antón-Fos, G.; Alemán-López, P. Quantitative structure–activity relationship methods in the discovery and development of antibacterials. WIREs Comput. Mol. Sci. 2020, 10, e1472.
Gini, G. QSAR: What Else? Methods Mol. Biol. 2018, 1800, 79–105.
Khan, A.U. Descriptors and their selection methods in QSAR analysis: Paradigm for drug design. Drug Discov. Today 2016, 21, 1291–1302.
Todeschini, R.; Consonni, V. Handbook of Molecular Descriptors; Wiley-VCH: Weinheim, Germany, 2000.
LaPointe, S.M.; Weaver, D.F. A Review of Density Functional Theory Quantum Mechanics as Applied to Pharmaceutically Relevant Systems. Curr. Comput. Aided-Drug Des. 2007, 3, 290–296.
Perkins, R.; Fang, H.; Tong, W.; Welsh, W.J. Quantitative structure-activity relationship methods: Perspectives on drug discovery and toxicology. Environ. Toxicol. Chem. 2003, 22, 1666–1679.
Liu, P.; Long, W. Current Mathematical Methods Used in QSAR/QSPR Studies. Int. J. Mol. Sci. 2009, 10, 1978–1998.
Li, S.; Zhang, S.; Chen, D.; Jiang, X.; Liu, B.; Zhang, H.; Rachakunta, M.; Zuo, Z. Identification of Novel TRPC5 Inhibitors by Pharmacophore-Based and Structure-Based Approaches. Comput. Biol. Chem. 2020, 87, 107302.
Wolber, G. 3D pharmacophore elucidation and virtual screening. Drug Discov. Today Technol. 2011, 7, e203–e204.
Hessler, G.; Baringhaus, K.-H. The scaffold hopping potential of pharmacophores. Drug Discov. Today Technol. 2011, 7, e263–e269.
Liu, S.; Alnammi, M.; Ericksen, S.S.; Voter, A.F.; Ananiev, G.E.; Keck, J.L.; Hoffmann, F.M.; Wildman, S.A.; Gitter, A. Practical Model Selection for Prospective Virtual Screening. J. Chem. Inf. Model. 2018, 59, 282–293.
Kuntz, I.D.; Blaney, J.M.; Oatley, S.J.; Langridge, R.; Ferrin, T.E. A geometric approach to macromolecule-ligand interactions. J. Mol. Biol. 1982, 161, 269–288.
Lionta, E.; Spyrou, G.; Vassilatis, D.K.; Cournia, Z. Structure-Based Virtual Screening for Drug Discovery: Principles, Applications and Recent Advances. Curr. Top. Med. Chem. 2014, 14, 1923–1938.
Reddy, A.S.; Pati, S.P.; Kumar, P.P.; Pradeep, H.N.; Sastry, G.N. Virtual Screening in Drug Discovery—A Computational Perspective. Curr. Protein Pept. Sci. 2007, 8, 329–351.
Sun, H. Pharmacophore-Based Virtual Screening. Curr. Med. Chem. 2008, 15, 1018–1024.
Eldridge, M.D.; Murray, C.W.; Auton, T.R.; Paolini, G.V.; Mee, R.P. Empirical scoring functions: I. The development of a fast empirical scoring function to estimate the binding affinity of ligands in receptor complexes. J. Comput. Aided Mol. Des. 1997, 11, 425–445.
Li, Y.; Liu, Z.; Li, J.; Han, L.; Zhao, Z.; Wang, R. Comparative Assessment of Scoring Functions on an Updated Benchmark: 1. Compilation of the Test Set. J. Chem. Inf. Model. 2014, 54, 1700–1716.
Gohlke, H.; Hendlich, M.; Klebe, G. Knowledge-based scoring function to predict protein-ligand interactions. J. Mol. Biol. 2000, 295, 337–356.
Jorgensen, W.L.; Chandrasekhar, J.; Madura, J.D.; Impey, R.W.; Klein, M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983, 79, 926–935.
Zheng, Z.; Wang, T.; Li, P.; Merz, K.M., Jr. KECSA-Movable Type Implicit Solvation Model (KMTISM). J. Chem. Theory Comput. 2014, 11, 667–682.
Raha, K.; Peters, M.B.; Wang, B.; Yu, N.; Wollacott, A.M.; Westerhoff, L.; Merz, K.M. The role of quantum mechanics in structure-based drug design. Drug Discov. Today 2007, 12, 725–731.
Chen, Z.; Li, H.-L.; Zhang, Q.-J.; Bao, X.-G.; Yu, K.-Q.; Luo, X.-M.; Zhu, W.-L.; Jiang, H.-L. Pharmacophore-based virtual screening versus docking-based virtual screening: A benchmark comparison against eight targets. Acta Pharmacol. Sin. 2009, 30, 1694–1708.
Tanoli, Z.; Seemab, U.; Scherer, A.; Wennerberg, K.; Tang, J.; Vähä-Koskela, M. Exploration of databases and methods supporting drug repurposing: A comprehensive survey. Briefings Bioinform. 2021, 22, 1656–1678.
Kim, S. Getting the most out of PubChem for virtual screening. Expert Opin. Drug Discov. 2016, 11, 843–855.
ZINC15. Available online: http://zinc15.docking.org/ (accessed on 17 November 2021).
ChemSpider–Chemical Database. Royal Society of Chemistry, Cambridge, UK. Available online: http://www.chemspider.com/ (accessed on 15 December 2021).
DrugBank. Available online: https://go.drugbank.com/ (accessed on 8 December 2021).
Suay-Garcia, B.; Falcó, A.; Bueso-Bordils, J.I.; Anton-Fos, G.M.; Pérez-Gracia, M.T.; Alemán-López, P.A. Tree-Based QSAR Model for Drug Repurposing in the Discovery of New Antibacterial Compounds Against Escherichia coli. Pharmaceuticals 2020, 13, 431.
Luo, M.; Wang, X.S.; Roth, B.L.; Golbraikh, A.; Tropsha, A. Application of Quantitative Structure–Activity Relationship Models of 5-HT1A Receptor Binding to Virtual Screening Identifies Novel and Potent 5-HT1A Ligands. J. Chem. Inf. Model. 2014, 54, 634–647.
Guasch, L.; Zakharov, A.V.; Tarasova, O.A.; Poroikov, V.V.; Liao, C.; Nicklaus, M.C. Novel HIV-1 Integrase Inhibitor Development by Virtual Screening Based on QSAR Models. Curr. Top. Med. Chem. 2016, 16, 441–448.
Zaki, M.E.A.; Al-Hussain, S.A.; Masand, V.H.; Akasapu, S.; Bajaj, S.O.; El-Sayed, N.N.E.; Ghosh, A.; Lewaa, I. Identification of Anti-SARS-CoV-2 Compounds from Food Using QSAR-Based Virtual Screening, Molecular Docking, and Molecular Dynamics Simulation Analysis. Pharmaceuticals 2021, 14, 357.
Alamri, M.A.; Alamri, M.A. Pharmacophore and docking-based sequential virtual screening for the identification of novel Sigma 1 receptor ligands. Bioinformation 2019, 15, 586–595.
Liu, C.; Yin, J.; Yao, J.; Xu, Z.; Tao, Y.; Zhang, H. Pharmacophore-Based Virtual Screening Toward the Discovery of Novel Anti-echinococcal Compounds. Front. Cell. Infect. Microbiol. 2020, 10, 118.
Poli, G.; Dimmito, M.P.; Mollica, A.; Zengin, G.; Benyhe, S.; Zador, F.; Stefanucci, A. Discovery of Novel µ-Opioid Receptor Inverse Agonist from a Combinatorial Library of Tetrapeptides through Structure-Based Virtual Screening. Molecules 2019, 24, 3872.
Shah, B.M.; Modi, P.; Trivedi, P. Pharmacophore-based virtual screening, 3D-QSAR, molecular docking approach for identification of potential dipeptidyl peptidase IV inhibitors. J. Biomol. Struct. Dyn. 2021, 39, 2021–2043.
Bommu, U.D.; Konidala, K.K.; Pabbaraju, N.; Yeguvapalli, S. QSAR modeling, pharmacophore-based virtual screening, and ensemble docking insights into predicting potential epigallocatechin gallate (EGCG) analogs against epidermal growth factor receptor. J. Recept. Signal Transduct. 2019, 39, 18–27.
Vora, J.; Patel, S.; Sinha, S.; Sharma, S.; Srivastava, A.; Chhabria, M.; Shrivastava, N. Structure based virtual screening, 3D-QSAR, molecular dynamics and ADMET studies for selection of natural inhibitors against structural and non-structural targets of Chikungunya. J. Biomol. Struct. Dyn. 2018, 37, 3150–3161.

Figure 1. General flowchart used in virtual screening.

Table 1. Examples of chemoinformatic tools available to create chemical libraries of small molecules. (Adapted from Saldívar-González et al. [28]).

Tool/Software	Main Features	Ref.
CCLab	Based on a multi-objective genetic algorithm, including synthesis cost and drug-likeness.	[29]
MoSELECT	Based on a multi-objective genetic algorithm, including diversity and “drug-like” physicochemical properties, and a fitness function.	[30]
KNIME	Based on generic reactions.	[31]
RDKit	Based on generic reactions.	[32]
DataWarrior	Molecules are designed following a given generic reaction and a list of real reactant structures.	[33]
Library synthesizer	Creates libraries through specification of a central scaffold with connection points and a list of R groups.	[35]
SmiLib v2.0	Libraries are built using SMILES and a scaffold-based approach.	[36]
GLARE	Allows one to optimize reagent lists for the design of combinatorial libraries.	[41]
Reactor (ChemAxon)	Library generated using generic reactions and considering reaction rules that yield chemically feasible products.	[34]
Molecular Operating Environment (MOE)	Scaffold-based. New chemical compounds are generated by attaching R groups to a common skeleton with marked points.	[37]
Schrödinger	Creates library by substituting attachments on a core structure with fragments from reagent compounds.	[38]
Nova	Uses central scaffolds and a list of R groups.	[39]
ChemDraw	Uses central scaffolds and a list of R groups.	[40]
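
Several tools in Table 1 build libraries from a central scaffold with marked attachment points and a list of R groups. The following is a minimal sketch of that idea in plain Python, assuming SMILES strings and simple text substitution. The placeholder syntax ({R1}, {R2}), the function name enumerate_library, and the example scaffold and fragments are all hypothetical and are not taken from any of the listed programs. The sketch does not check chemical validity, which a real cheminformatics toolkit would do.

```python
from itertools import product


def enumerate_library(scaffold, r_groups):
    """Yield one product SMILES per combination of R groups.

    scaffold -- SMILES string containing placeholders such as "{R1}"
                (hypothetical syntax chosen for this sketch)
    r_groups -- dict mapping each placeholder name to a list of
                SMILES fragments that may be attached at that point
    """
    names = sorted(r_groups)
    # Cartesian product: every fragment at R1 with every fragment at R2, etc.
    for combo in product(*(r_groups[name] for name in names)):
        smiles = scaffold
        for name, fragment in zip(names, combo):
            smiles = smiles.replace("{" + name + "}", fragment)
        yield smiles


# Hypothetical benzamide-like scaffold with two attachment points.
scaffold = "O=C(N{R1})c1ccc({R2})cc1"
r_groups = {
    "R1": ["C", "CC", "C1CC1"],       # methyl, ethyl, cyclopropyl
    "R2": ["F", "Cl", "OC", "C#N"],   # fluoro, chloro, methoxy, cyano
}

library = list(enumerate_library(scaffold, r_groups))
print(len(library))  # 3 fragments x 4 fragments = 12 products
print(library[0])
```

The library size is the product of the R-group list lengths, so adding a third attachment point with ten fragments would grow this example from 12 to 120 virtual compounds. This multiplicative growth is why a few building blocks can yield very large virtual libraries.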

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

