Introduction
Small molecule biological probes that influence protein function are useful tools for studying the cell and can also serve as lead structures in the development of new therapeutic agents [1,2,3,4,5]. The NIH Roadmap [6] project has created opportunities for the development of new chemical libraries for subsequent screening by the Molecular Libraries Screening Center Network (MLSCN) in a combined effort to identify unique and useful biological probes.
Selecting compounds for screening is an important, albeit challenging, task [7,8]. Ideally, the compounds are pure and of known structure, contain only non-reactive functional groups, have a possibility of some level of selectivity towards a given protein target or at least toward a protein class, and have physical properties suitable for use in biological assays, including aqueous solubility and cellular permeability. Compounds that meet some or all of these criteria are generally described as “drug-like,” even though their chemical structures can be quite divergent, ranging from marine natural products with multiple stereocenters and many polar functional groups to small achiral synthetic compounds that are quite hydrophobic [9,10].
This paper describes the synthesis and characterization of an imidazole-4,5-dicarboxamide (I45DC) library substituted with two α-amino acid esters. We have been using the imidazole-4,5-dicarboxylic acid (I45DA) scaffold to design compounds that inhibit specific biological targets [11,12,13], as well as for the synthesis of chemical libraries to be used in the MLSCN screening effort. Nevertheless, no dissymmetrically-disubstituted I45DCs bearing only α-amino acid esters and only two symmetrically-disubstituted I45DCs bearing α-amino acid esters have been previously reported in the literature. There are related bis-I45DCs bearing one α-amino acid ester per I45DC that we used to target a protein-protein interaction [12], as well as dissymmetrically-disubstituted I45DCs bearing one α-amino acid ester along with either a primary or secondary alkanamine that we are concurrently reporting [14].
Amino acid side chains are natural recognition elements in substrate-enzyme, ligand-receptor, and protein-protein interactions, and we have shown that I45DCs form a strong intramolecular hydrogen bond that remains stable even in water at pH 7 [15]. Moreover, the strong intramolecular hydrogen bond anticipated in each of these I45DCs accomplishes three important tasks: it predisposes the amino acid ester substituents to be separated by a distance comparable to that between neighboring side chains on one face of an α-helix or on one side of a β-strand [12,16], forms a quasi-ring that combines with the imidazole to mimic substituted purines [13,16], and offers added low-energy conformational flexibility for the compound to adapt to the binding site, which is an advantage over a scaffold held together entirely by covalent bonds.
Our design of this library therefore incorporated two amino acid ester pharmacophores in each I45DC. All of the final compounds are of analytical purity and have been submitted to the Molecular Libraries Small Molecule Repository (MLSMR) for use by the MLSCN.
Results and Discussion
A library for high-throughput screening and hit identification in the discovery phase of identifying small molecule probes and drugs must balance having enough compounds in a class to sample pharmacophoric space against avoiding significant redundancy. This balance saves time and energy in both delivering the compounds and performing the bioassay, and it reduces the overall cost of the effort.
The α-amino acid esters used in the synthesis of this library were of the S-configuration and are given in Table 1. We opted for a set of mostly hydrophobic side chains in this work, since a hydrophobic side chain that is less than perfectly buried in a binding cleft would still be expected to show some bioactivity in a screening effort. In contrast, a polar α-amino acid relies on electrostatic interactions, such as hydrogen bonding, that are geometrically quite specific, and it may not bind at all if those conditions are not met. Accordingly, the amino acids in this library include Ala, Leu, and Phe, along with Gly. Lysine was also included, but its side chain was left protected by a Boc group in this initial library. The carboxylic acid was protected as either a tert-butyl or benzyl ester to provide a hydrophobic group for binding interactions, while also providing a convenient deprotection and modification strategy for derivatizing bioactive compounds.
Table 1.
Amino acid ester salts, 2{1-9}, used in library synthesis.
Member | R1 | R2 | X |
---|
2{1} | H | C(CH3)3 | Cl |
2{2} | H | CH2Ph | Cl |
2{3} | CH3 | C(CH3)3 | Cl |
2{4} | CH3 | CH2Ph | Cl |
2{5} | CH2CH(CH3)2 | C(CH3)3 | Cl |
2{6} | CH2CH(CH3)2 | CH2Ph | OSO2C6H4CH3 |
2{7} | CH2Ph | C(CH3)3 | Cl |
2{8} | CH2Ph | CH2Ph | Cl |
2{9} | (CH2)4NH2 | C(CH3)3 | Cl |
The resulting I45DC library members have reasonable drug-like properties, as illustrated by their molecular weights, which range from 382 to 724 g/mol with an average value of 522 g/mol, their cLogP values, which range from 0.35 to 4.83 with an average value of 2.95, and their relatively few rotatable bonds [17,18,19].
The synthetic strategy to the pyrazine intermediates and the final symmetrically- and dissymmetrically-disubstituted I45DCs (sI45DCs and dI45DCs, respectively) is shown in Scheme 1. The starting pyrazine diacid chloride, 1, was prepared as previously described [8], as were the amino acid ester substituted pyrazines, 3 [14]. Reaction of the appropriate pyrazine, 3, with two equivalents of a second amino acid ester hydrochloride or tosylate salt, 2{1-9}, and two equivalents of N,N-diisopropylethylamine as an acid scavenger gave the final I45DC products, 4, in a good average yield (73%) following purification by column chromatography. All of the compounds were analyzed by LC-MS with LC detection at 214 nm, while selected compounds were also analyzed by 1H-NMR spectroscopy. The yields of the 9 sI45DCs and 36 dI45DCs are given in Table 2 and Table 3, respectively. A comparative analysis of the low, high, average, and median yields as a function of the α-amino acid ester is given in Table 4 and indicates no systematic effect of amino acid ester structure on reaction yield. We have included tables of data (formula, molecular weight, cLogP values, physical form of the compound, Rf values, and retention times in the LC-MS) for library members 4{1-45}, LC-MS spectra for all library members, LC-MS data for 10 reactions to crude library members, as well as 1H-NMR spectra for the crude reactions and purified library members for 15 representative compounds in a supporting file. The same information is also available for each compound at the project website [20].
Scheme 1.
Synthesis of library members.
Table 2.
Symmetrically-disubstituted Amino Acid Ester I45DCs, 4{1-9}.
Cmpd. | Reactant 1 | Reactant 2 | Yield (%) |
---|
4{1} | 2{1} | 2{1} | 79 |
4{2} | 2{2} | 2{2} | 22 |
4{3} | 2{3} | 2{3} | 69 |
4{4} | 2{4} | 2{4} | 12 |
4{5} | 2{5} | 2{5} | 83 |
4{6} | 2{6} | 2{6} | 97 |
4{7} | 2{7} | 2{7} | 62 |
4{8} | 2{8} | 2{8} | 73 |
4{9} | 2{9} | 2{9} | 86 |
Table 3.
Dissymmetrically-disubstituted Amino Acid Ester I45DCs, 4{10-45}.
Cmpd. | Reactant 1 | Reactant 2 | Yield (%) | Cmpd. | Reactant 1 | Reactant 2 | Yield (%) |
---|
4{10} | 2{1} | 2{2} | 83 | 4{28} | 2{3} | 2{7} | 54 |
4{11} | 2{1} | 2{3} | 86 | 4{29} | 2{3} | 2{8} | 56 |
4{12} | 2{1} | 2{4} | 85 | 4{30} | 2{3} | 2{9} | 77 |
4{13} | 2{1} | 2{5} | 91 | 4{31} | 2{4} | 2{5} | 86 |
4{14} | 2{1} | 2{6} | 83 | 4{32} | 2{4} | 2{6} | 97 |
4{15} | 2{1} | 2{7} | 78 | 4{33} | 2{4} | 2{7} | 59 |
4{16} | 2{1} | 2{8} | 84 | 4{34} | 2{4} | 2{8} | 90 |
4{17} | 2{1} | 2{9} | 61 | 4{35} | 2{4} | 2{9} | 63 |
4{18} | 2{2} | 2{3} | 56 | 4{36} | 2{5} | 2{6} | 94 |
4{19} | 2{2} | 2{4} | 70 | 4{37} | 2{5} | 2{7} | 87 |
4{20} | 2{2} | 2{5} | 69 | 4{38} | 2{5} | 2{8} | 85 |
4{21} | 2{2} | 2{6} | 55 | 4{39} | 2{5} | 2{9} | 75 |
4{22} | 2{2} | 2{7} | 67 | 4{40} | 2{6} | 2{7} | 88 |
4{23} | 2{2} | 2{8} | 64 | 4{41} | 2{6} | 2{8} | 96 |
4{24} | 2{2} | 2{9} | 62 | 4{42} | 2{6} | 2{9} | 95 |
4{25} | 2{3} | 2{4} | 57 | 4{43} | 2{7} | 2{8} | 48 |
4{26} | 2{3} | 2{5} | 76 | 4{44} | 2{7} | 2{9} | 59 |
4{27} | 2{3} | 2{6} | 94 | 4{45} | 2{8} | 2{9} | 56 |
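The compound numbering used in Tables 2 and 3 can be reproduced with a short enumeration. The following is a minimal sketch, not part of the original work: it assumes that the order in which the two amino acid esters are attached does not create a distinct library member (as the tables imply), and the function name `library_numbering` is hypothetical.

```python
from itertools import combinations

# Building blocks 2{1}..2{9} from Table 1, referred to by index only.
BLOCKS = range(1, 10)


def library_numbering():
    """Map n in 4{n} to the pair of building-block indices (i, j)."""
    numbering = {}
    # 4{1}-4{9}: symmetric members, both amides carry the same ester.
    for i in BLOCKS:
        numbering[i] = (i, i)
    # 4{10}-4{45}: dissymmetric members, unordered pairs with i < j,
    # taken in lexicographic order as in Table 3.
    for n, pair in enumerate(combinations(BLOCKS, 2), start=10):
        numbering[n] = pair
    return numbering


numbering = library_numbering()
print(len(numbering))   # 9 symmetric + 36 dissymmetric members
print(numbering[39])    # building blocks used for 4{39}
```

Nine building blocks give 9 symmetric members plus 9 × 8 / 2 = 36 unordered pairs, which accounts for the 45 compounds in the library.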
Table 4.
Percent yields of purified library members based on amino acid esters, 2{1-9}.
Member | Low | High | Average | Median |
---|
2{1} | 61 | 91 | 81 | 83 |
2{2} | 22 | 83 | 61 | 64 |
2{3} | 54 | 94 | 69 | 69 |
2{4} | 12 | 97 | 69 | 70 |
2{5} | 69 | 94 | 83 | 85 |
2{6} | 55 | 97 | 89 | 94 |
2{7} | 48 | 88 | 67 | 62 |
2{8} | 48 | 96 | 72 | 73 |
2{9} | 56 | 95 | 70 | 63 |
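The entries in Table 4 can be recomputed from the yields in Tables 2 and 3. The sketch below is illustrative only: the yields are transcribed from those tables, the helper name `block_stats` is hypothetical, and it assumes each building block's statistics cover all nine library members that contain it, including the symmetric one.

```python
from itertools import combinations
from statistics import mean, median

# Purified yields (%) from Tables 2 and 3, in order 4{1}..4{45}.
yields = [
    79, 22, 69, 12, 83, 97, 62, 73, 86,   # 4{1}-4{9}
    83, 86, 85, 91, 83, 78, 84, 61,       # 4{10}-4{17}
    56, 70, 69, 55, 67, 64, 62,           # 4{18}-4{24}
    57, 76, 94, 54, 56, 77,               # 4{25}-4{30}
    86, 97, 59, 90, 63,                   # 4{31}-4{35}
    94, 87, 85, 75,                       # 4{36}-4{39}
    88, 96, 95,                           # 4{40}-4{42}
    48, 59,                               # 4{43}-4{44}
    56,                                   # 4{45}
]

# Building-block pairs in the same order as the yields above.
pairs = [(i, i) for i in range(1, 10)] + list(combinations(range(1, 10), 2))


def block_stats(block):
    """Return (low, high, rounded average, median) for one building block."""
    vals = [y for (i, j), y in zip(pairs, yields) if block in (i, j)]
    return min(vals), max(vals), round(mean(vals)), median(vals)


print(round(mean(yields)))   # overall average yield, in percent
for b in range(1, 10):
    print(f"2{{{b}}}", block_stats(b))
```

Under these assumptions the computed values agree with Table 4 and with the 73% average yield quoted in the text.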
Ten crude reactions were compared to the corresponding pure compounds in order to estimate the level of purity in the crude library and to provide some indication of the major impurities present. As with the dI45DC libraries bearing amino acid esters and alkanamines [14], the major impurity in this library results from hydrolysis of the pyrazine intermediate, 3, to give an imidazole-4-substituted carboxamide-5-carboxylic acid. We observed evidence of this impurity by LC-MS in 8 out of 10 of the crude reactions, although its level appears low relative to the desired product based on the UV trace at 214 nm. The corresponding ion was also detected in the other 2 crude reactions, but it gave a relatively low number of counts in the mass spectrum and, more significantly, its signal fell under the product peak, so we think that in these two instances the ion likely arises from an ionization pathway of the product rather than from significant hydrolysis in the reaction. Indeed, one of these reactions provided a 97% purified yield and the other a 79% purified yield, supporting this hypothesis. There is no evidence in the LC-MS traces of any other substantial impurities in the crude reactions, although we know that diisopropylethylamine hydrochloride (DIEA·HCl) is still present.
The 1H-NMR spectra of the crude reactions are similar to those of their purified products, aside from the extra DIEA·HCl signals and variations in the amide and imidazole NH signals (see supporting information). The amide and imidazole NH chemical shifts are known to be sensitive to both acidity and solvent [15], and we hypothesize that the presence of DIEA·HCl causes the difference in these chemical shifts between the crude and purified compounds.
These compounds have two intramolecularly hydrogen bonded conformations, as illustrated in Figure 1. The percentage of each conformation for these compounds has not been determined, although we have noted in the past that the structure can bias one conformation over the other by as much as 80% to 20% [13,21]. One advantage of this intramolecular hydrogen bond is that it yields two conformations of every compound in this library, either or both of which may be bioactive. We reason that a compound with significant bioactivity would be discovered in preliminary biological screens even when tested at half of the initial screening concentration, as would be the case when there are two equally populated hydrogen bonded conformations. Thus, this library provides two possible bioactive structures for each compound. Subsequent structure-activity relationship development can include the synthesis of imidazole-4-ester-5-amide derivatives with chiral α-hydroxyacid esters, incorporating one N-methylamino acid ester in the dI45DC, or N-alkylating the imidazole ring in order to control the hydrogen bond donor and acceptor and thereby gain insight into the bioactive conformation [16].
Figure 1.
Two intramolecularly hydrogen bonded conformations of the dI45DCs.
The library members have all been submitted to and accepted into the MLSMR for screening by the MLSCN. At the time of this writing, two of the compounds in this library, 4{9} and 4{39}, showed bioactivity in initial screens towards calpain II, but each narrowly missed the activity cutoff in the follow-up investigation, which required greater than 50% inhibition when tested at 0.225 mM [22]. The current biological data for all of these compounds are best accessed through a structure search of the PubChem database [23]. Table 5 provides the compound numbers in this report with the corresponding PubChem SID numbers, which can also be used to find information in PubChem. There is also recently reported software, PubChemSR, that operates in a Windows environment and may be useful in helping researchers mine the data found in PubChem [24].
Table 5.
Cross reference guide for library members with the PubChem database (SID identification number).
Cmpd. | PubChem SID‡ | Cmpd. | PubChem SID‡ |
---|
4{1} | 26732547 | 4{24} | 49733438 |
4{2} | 49713852 | 4{25} | 49713850 |
4{3} | 26732537 | 4{26} | 26732546 |
4{4} | 49733436 | 4{27} | 26732531 |
4{5} | 26732515 | 4{28} | 26732516 |
4{6} | 49734139 | 4{29} | 49713848 |
4{7} | 26732535 | 4{30} | 26732538 |
4{8} | 49733439 | 4{31} | 49713847 |
4{9} | 26732539 | 4{32} | 50096426 |
4{10} | 49713854 | 4{33} | 49734319 |
4{11} | 26732541 | 4{34} | 49714451 |
4{12} | 49713857 | 4{35} | 49713851 |
4{13} | 26732520 | 4{36} | 49713845 |
4{14} | 26732540 | 4{37} | 26732533 |
4{15} | 26732542 | 4{38} | 49731974 |
4{16} | 49713858 | 4{39} | 26732534 |
4{17} | 26732514 | 4{40} | 49713844 |
4{18} | 49713855 | 4{41} | 49713843 |
4{19} | 49731975 | 4{42} | 49713846 |
4{20} | 49713856 | 4{43} | 49713849 |
4{21} | 49713853 | 4{44} | 26732536 |
4{22} | 49731976 | 4{45} | 50096427 |
4{23} | 49733857 | | |
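As one way of using the SID numbers in Table 5, a substance record can be requested from PubChem programmatically. The sketch below only builds request URLs and makes no network calls. It assumes the PubChem PUG REST pattern `substance/sid/<SID>/<format>`, which should be checked against the current PubChem documentation; the helper name `substance_url` and the three-entry lookup table are illustrative.

```python
# Base address of the PubChem PUG REST service (assumed, see lead-in).
PUG_REST = "https://pubchem.ncbi.nlm.nih.gov/rest/pug"

# A few compound-number -> SID pairs copied from Table 5.
SIDS = {
    9: 26732539,    # 4{9}
    39: 26732534,   # 4{39}
    45: 50096427,   # 4{45}
}


def substance_url(compound_number, output="JSON"):
    """Build a PUG REST URL for the substance record of 4{compound_number}."""
    sid = SIDS[compound_number]
    return f"{PUG_REST}/substance/sid/{sid}/{output}"


# The two compounds noted above for their initial calpain II activity.
for n in (9, 39):
    print(f"4{{{n}}}", substance_url(n))
```

The resulting URLs can be opened in a browser or passed to any HTTP client; extending `SIDS` to all 45 entries of Table 5 is straightforward.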