A Structural-Reporter Group to Determine the Core Conformation of Sialyl Lewisx Mimetics

The d-GlcNAc moiety in sialyl Lewisx (sLex, 1) acts predominantly as a linker to position the d-Gal and the l-Fuc moieties in the bioactive spatial orientation. The hypothesis has been made that the NHAc group of GlcNAc pushes the fucose underneath the galactose and, thus, contributes to the stabilization of the bioactive conformation of the core of sLex (1). To test this hypothesis, GlcNAc mimetics consisting of (R,R)-1,2-cyclohexanediols substituted with alkyl and aryl substituents adjacent to the linking position of the fucose moiety were synthesized. To explore a broad range of extended and spatially demanding R-groups, an enzymatic approach for the synthesis of 3-alkyl/aryl-1,2-cyclohexanediols (3b-n) was applied. These cyclohexanediol derivatives were incorporated into the sLex mimetics 2b-n. For analyzing the relationship of affinity and core conformation, a 1H NMR structural-reporter-group concept was applied. Thus, the chemical shift of H-C5Fuc proved to be a sensitive indicator for the degree of pre-organization of the core of this class of sLex mimetics and therefore could be used to quantify the contribution of the R-groups.


Introduction
Lectins, such as selectins [1], galectins [2,3], or siglecs [4,5] have gained increasing attention as drug targets. However, although being valuable leads to the development of new drugs, carbohydrates rarely find therapeutic application, as they typically suffer from complex synthesis and poor pharmacokinetic and pharmacodynamic properties. As a consequence, small molecules mimicking the carbohydrate epitope, e.g., the sialidase inhibitor oseltamivir [6], have been developed to overcome these unfavorable pharmacokinetic and pharmacodynamic properties (PK/PD).
One important reason for the low affinity of carbohydrate/lectin interactions are the high polarity of carbohydrate ligands and, therefore, substantial enthalpic desolvation costs. This becomes obvious from the thermodynamic profile of the sialyl Lewis x (1, sLe x )/Eselectin interaction (Figure 1) [7]. The positive enthalpy term ∆H • is a result of the high desolvation costs (∆H • desolv ) originating from the numerous hydroxy groups and the carboxylate, which are not compensated by the interaction enthalpy ∆H • int . As described by Cabani [8], the desolvation of a single hydroxy group costs approx. 26 kJ mol −1 . This is an amazingly high penalty, considering that it cannot be compensated by a single H-bond which yields a maximum of 20.7 kJ mol −1 [9]. However, when hydroxy groups are not involved in binding or are not part of extended H-bond networks, the desolvation enthalpy can be substantially reduced by tdehydroxylation.
Molecules 2023, 28, 2595 2 of 10 to benefit from a smaller entropic penalty during complexation. Thus, for example, due to the loss of conformational entropy, one rotatable bond that becomes immobilized upon binding carries a Gibbs energy penalty close to 2 kJ mol −1 [10]. The highly beneficial entropy term (−TΔS° = 22.6 kJ mol −1 ) for the interaction of sLe x (1)/E-selectin originates from a large desolvation entropy −TΔS°desolv (numerous water molecules are released to bulk), only slightly reduced by a small conformational entropy penalty −TΔS°conf, suggesting that tetrasaccharide 1 is pre-shaped in the binding conformation. According to the pharmacophore of sLe x (1) (Figure 1), Neu5Ac contributes only with its carboxylate to binding. However, when Neu5Ac was replaced by glycolic acid, the induced conformational flexibility led to a substantial loss of affinity (IC50 4.5 mM) [11]. To reduce flexibility, a large number of non-carbohydrate acids with substituents in the αposition were evaluated, leading to the discovery of (S)-cyclohexyl lactic acid as suitable substitute of Neu5Ac [11][12][13].
The D-GlcNAc moiety acts predominantly as a linker to position the D-Gal and L-Fuc moieties in the bioactive spatial orientation. With the (R,R)-1,2-cyclohexanediol derivative 2a (R = H, KD 60.7 μM), an expedient replacement of the GlcNAc moiety was identified [10]. When mimic 2b (R = CH3, KD 14 μM) showed a four-fold improved affinity compared to 2a, the hypothesis was raised that the methyl group of the GlcNAc mimic pushes the fucose underneath the galactose moiety in the same way as the NHAc group in sLe x (1) does. Because of the synthetic access via asymmetric CBS (Corey-Bakshi-Shibata) reduction [14,15] and epoxide opening with higher order cyanocuprates [16] was limited to Me, nBu, cPr and CH=CH2 [17], we explored an enzymatic approach to explore a broader range of R-groups. For analyzing the relationship of affinity and core conformation, a 1 H NMR structural-reporter-group concept was applied [18,19]. It was shown on the trisaccharide Lewis x , that a non-conventional hydrogen bond between H5 of the fucose moiety (H-C5 Fuc ) and O5 of the galactose moiety (O5 Gal ) results in a low field shift for H-C5 Fuc [20][21][22]. This chemical shift proved to be a sensitive indicator for the degree of pre-organization of the core of this new class of sLe x mimetics and, therefore, could be used as structuralreporter group to quantify the contribution of the R-groups.

Results
Originally, 1,2-cyclohexanediol (3a, R = H) was used as GlcNAc mimetic [11][12][13]. To counteract the pushing effect exerted by the NHAc group in sLe x , additional alkyl or aryl It is generally assumed that pre-shaping a ligand into its bioactive conformation will provide increased binding affinity, primarily because the rigidified molecule is expected to benefit from a smaller entropic penalty during complexation. Thus, for example, due to the loss of conformational entropy, one rotatable bond that becomes immobilized upon binding carries a Gibbs energy penalty close to 2 kJ mol −1 [10]. The highly beneficial entropy term (−T∆S • = 22.6 kJ mol −1 ) for the interaction of sLe x (1)/E-selectin originates from a large desolvation entropy −T∆S • desolv (numerous water molecules are released to bulk), only slightly reduced by a small conformational entropy penalty −T∆S • conf , suggesting that tetrasaccharide 1 is pre-shaped in the binding conformation.
According to the pharmacophore of sLe x (1) (Figure 1), Neu5Ac contributes only with its carboxylate to binding. However, when Neu5Ac was replaced by glycolic acid, the induced conformational flexibility led to a substantial loss of affinity (IC 50 4.5 mM) [11]. To reduce flexibility, a large number of non-carbohydrate acids with substituents in the α-position were evaluated, leading to the discovery of (S)-cyclohexyl lactic acid as suitable substitute of Neu5Ac [11][12][13].
The D-GlcNAc moiety acts predominantly as a linker to position the D-Gal and L-Fuc moieties in the bioactive spatial orientation. With the (R,R)-1,2-cyclohexanediol derivative 2a (R = H, K D 60.7 µM), an expedient replacement of the GlcNAc moiety was identified [10]. When mimic 2b (R = CH 3, K D 14 µM) showed a four-fold improved affinity compared to 2a, the hypothesis was raised that the methyl group of the GlcNAc mimic pushes the fucose underneath the galactose moiety in the same way as the NHAc group in sLe x (1) does. Because of the synthetic access via asymmetric CBS (Corey-Bakshi-Shibata) reduction [14,15] and epoxide opening with higher order cyanocuprates [16] was limited to Me, nBu, cPr and CH=CH 2 [17], we explored an enzymatic approach to explore a broader range of R-groups. For analyzing the relationship of affinity and core conformation, a 1 H NMR structural-reporter-group concept was applied [18,19]. It was shown on the trisaccharide Lewis x , that a non-conventional hydrogen bond between H5 of the fucose moiety (H-C5 Fuc ) and O5 of the galactose moiety (O5 Gal ) results in a low field shift for H-C5 Fuc [20][21][22]. This chemical shift proved to be a sensitive indicator for the degree of pre-organization of the core of this new class of sLe x mimetics and, therefore, could be used as structural-reporter group to quantify the contribution of the R-groups.
Since Rotticci et al. [23] successfully applied the stereo-selective enzymatic acylation to 3-methyl-2-cyclohexenol (3b), we were confident that the enzymatic approach can be successfully transferred to a wide range of cyclohexenol derivatives.

Synthesis of the Mimetics 2b-n of Sialyl Lewis x (1)
Fucosylation of all-trans-14b under in situ anomerization conditions [30] gave 16b (Scheme 4), which was smoothly deprotected with tetrabutylammonium fluoride (TBAF), affording pseudodisaccharide 17b [17] in excellent yield over two steps. Galactosylation with donor 18 [31] promoted by dimethyl(methylthio)sulfonium triflate (DMTST) afforded 19b β-selectively. Debenzylation by hydrogenolysis followed by saponification with lithium hydroxide and ion exchange chromatography finally gave 2b. In a similar approach, the test compounds 2c-n were obtained.  19 F is an important nucleus in NMR spectroscopy because of its receptivity and large chemical shift dispersion. 19 F NMR spectroscopy is therefore perfectly suited to determine the enantiomeric excess (ee) based on a comparison of the Mosher esters of rac-3b and (R)-3b. The diastereomeric excess (de) for the Mosher ester (R,R)-12b of R-3b amounted to 97%. For the Mosher esters 12c-n, excellent de's resulting in ee's for 3c-n in the range of 85 to 98% were obtained as well.
Since the protecting group of the hydroxy group in (R)-3b-n has to be stable under strongly basic and acidic conditions, should not hamper fucosylation by steric bulk, and finally, should allow cleavage under mild conditions orthogonal to benzyl protecting groups, a tert-butyldimethylsilyl (TBS) ether (→(R)-13b-n) was chosen. Hydroboration followed by oxidation yielded all-trans-14b-n in 50-92% over two steps. By this short sequence all-trans-14b-n were obtained in acceptable to excellent yields, requiring only two chromatographic purifications.

Structural-Reporter Group for the Core Conformation
The 1 H chemical shift of H-C5 Fuc was monitored as a structural-reporter group for the strength of the non-conventional H-bond between H-C5 Fuc and O5 Gal . This chemical shift is highly sensitive to the frequency and distance of the non-conventional H-bond

Structural-Reporter Group for the Core Conformation
The 1 H chemical shift of H-C5 Fuc was monitored as a structural-reporter group for the strength of the non-conventional H-bond between H-C5 Fuc and O5 Gal . This chemical shift is highly sensitive to the frequency and distance of the non-conventional H-bond interaction and thus represents a population-weighted average of all conformations present in the solution (Table 2).

Discussion
The chemical shift of H-C5 Fuc of sLe x (1) is 4.83 ppm [32] and thus 0.9 ppm higher than that of the fucose monomer 20 [33] (Table 2). When Neu5Ac in sLe x (1) was replaced by (S)-cyclohexyl lactic acid (→22) [11][12][13]17], the chemical shift of H-C5 Fuc remains unchanged, i.e., the replacement has no influence on the core conformation. However, when the GlcNAc moiety was replaced by (R,R)-cyclohexane-1,2-diol (→2a) [17], a substantially high field shift (Δδ = 0.33) occurred, indicating that compared to sLe x (1) the pre-organization of the core conformation is disturbed. Nevertheless, the affinity was improved fivefold, although the chemical shift of H-C5 Fuc of 4.50 ppm in 2a is a clear indication for raised conformational entropy costs. This penalty, however, is obviously overcompensated by lower enthalpic desolvation costs originating from a reduction of the polar surface area (PSA) [34] due to the replacement of GlcNAc by (R,R)-cyclohexane-1,2-diol (PSA22 = 324 Å 2 vs. PSA2a = 205 Å 2 ). The transition from (R,R)-cyclohexane-1,2-diol in 2a to (R,R,S)-3-methylcyclohexane-1,2-diol in 2b caused a 3.5-fold improvement of KD. Whereas binding and desolvation enthalpy should remain unchanged (almost identical water shell, same PSA and same pharmacophore), the improved pre-organization of the core in the bioactive conformation (H-C5 Fuc increases from 4.50 ppm for 2a to 4.84 ppm for 2b) is obviously responsible for the improved KD of 2b.
As reported earlier [17], the additional methyl group in 2b pushes the Fuc moietyas the NHAc group in sLe x (1)-underneath the Gal moiety, leading to comparable preorganization of the core as indicated by the same chemical shift for H-C5 Fuc . Linear R- The transition from (R,R)-cyclohexane-1,2-diol in 2a to (R,R,S)-3-methylcyclohexane-1,2-diol in 2b caused a 3.5-fold improvement of K D . Whereas binding and desolvation enthalpy should remain unchanged (almost identical water shell, same PSA and same pharmacophore), the improved pre-organization of the core in the bioactive conformation (H-C5 Fuc increases from 4.50 ppm for 2a to 4.84 ppm for 2b) is obviously responsible for the improved K D of 2b.
As reported earlier [17], the additional methyl group in 2b pushes the Fuc moiety-as the NHAc group in sLe x (1)-underneath the Gal moiety, leading to comparable preorganization of the core as indicated by the same chemical shift for H-C5 Fuc . Linear R-groups as in 2b-d,i,j,m lead to chemical shifts of H-C5 Fuc similar to sLe x (1). They obviously support the bioactive core conformation, resulting in single-digit micromolar binding affinities. However, with a bulkier R-group as in 2h (R = t-Bu) a high-field chemical shift of H-C5 Fuc accompanied by a loss in affinity was observed. Finally, although 2e (R = benzyl), 2f (R = CH 2 C 6 H 11 ), and 2l (R = CH 2 O(CH 2 ) 2 OMe) exhibit chemical shifts for H-C5 Fuc characteristic for an optimal core conformation, they show slightly lower affinities. According to saturation transfer difference (STD)-NMR, hydrophobic contacts of the methyl group in 2b can be excluded [17]. This was also confirmed by the X-ray structure of 2b (R = Me) co-crystallized with E-selectin (PDB 4C16), where both Arg84 and Gln85 as parts of the E-selectin binding site have thermal B-factors of~74 (at carbon atom CZ) and~78 (at carbon atom CD), respectively, which is a clear sign for a high flexibility of their amino acid side chains compared to the backbone (B-factors of~45 at CA). However, bulky R-groups can establish van der Waals contacts with Arg84 and Gln85. This limits the flexibility of the amino acid side chains causing an entropy penalty and reduced affinities. In most pronounced cases a steric clash may also disrupt the optimal core conformation or prevent from proper access to the binding site as observed for 2g (R = phenyl) and 2k (R = (CH 2 ) 3 Ph). In the case of 2l (R = CH 2 O(CH 2 ) 2 OMe), an additional desolvation penalty for the rather polar R-group may be involved as well. Surprisingly, although virtually the same K D 's were obtained for 2c (R = CH 2 CH 3 ) and 2n (R = CH 2 CF 3 ), there is a large difference between the chemical shifts of H-C5 Fuc (∆δ = 0.23) for the two compounds.
Based on the binding affinities K D , the Gibbs free energies ∆G were calculated (Table 3). Because in 21, which contains a flexible ethanediol as GlcNAc replacement, the chemical shift for H-C5 Fuc is close to that of the Fuc monosaccharide 20 (4.12 and 3.93 ppm, respectively, Table 2), a contribution related to a direct neighborhood of H-C5 Fuc and O5 Gal can therefore be excluded. The 80-fold improvement of affinity for 2a compared to 21 is related to reduced entropy costs, predominantly as a consequence of the reduced flexibility of the linker. All other thermodynamic parameters defining affinity (similar pharmacophore and desolvation costs) remain approximately the same size. Thus, the ∆∆G 21→2a of 10.89 kJ mol −1 is, if at all, related to the formation of the non-conventional H-bond but mainly to the rigidified linker. Finally, the introduction of an R group further limits the flexibility of the core by pushing the Fuc moiety underneath the Gal unit enabling the formation of a non-conventional H-bond between H-C5 Fuc and O5 Gal . Thus, ∆∆G 2a→2b of 3.06 kJ mol −1 and ∆∆G 2a→2j of 6.57 kJ mol −1 result from the steric effect of the R-group as well as from the formation of a non-conventional H-bond.

Conclusions
A common strategy for the design of glycomimetics is the substitution of carbohydrate moieties by carbocyclic scaffolds. It results in lower polarity, i.e., reduced desolvation costs but also increased hydrolytic and metabolic stability. In addition, the facile synthetic accessibility of the mimetic structures, which, as a consequence of the chirality of the parent carbohydrate moiety includes stereochemical challenges, is of cardinal importance.
For evaluating the core conformation of the sLe x mimetics (R)-2a-n, the chemical shift of H-C5 Fuc proved to be a valuable structural-reporter group [18,19]. When mimetics exhibit a chemical shift for H-C5 Fuc between 4.83 and 4.86 ppm, their core conformation is comparable to the natural counterpart sLe x (1) [21,32]. In contrast, high-field shifts indicate a disturbed pre-organization of the core, leading to an increase in conformational entropy costs. Examples with pre-shaped cores are 2c,d,i,j,m exhibiting affinities in the range of 4.3-9.5 µM. On the other hand, when the R-group is increasingly space-demanding as in 2h and 2k, the optimal core conformation is disrupted (H-C5 Fuc 4.03, 4.75 ppm) or optimal access of the ligand to the binding site is hindered, consequently leading to a substantial loss of affinity.
In summary, H-C5 Fuc proved to be a reliable reporter group for evaluating to what extent the core of the sLe x mimetics 2b-2n is pre-organized in the bioactive conformation. The correlation is, therefore, a qualitative tool to quickly predict the efficacy of a derivative as an E-selectin binder. Furthermore, the improvement of the Gibbs free energy ∆G • by >6.5 kJ mol −1 , i.e., a 10-fold improvement of affinity, impressively shows the relevance of the contribution by the pre-shaped core and the non-conventional H-bond to binding.

Materials and Methods
E-selectin production (cloning, transfection, expression, and purification) and labeling were previously described for the E-selectinSCR6-IgGFc construct [12] and for the E-selectinSCR2 construct [36].
E-selectinLEC2 was labeled using the amine reactive protein labeling kit BLUE-NHS as described in the Supporting Information. Buffer exchange and labeling were performed according to the manufacturer's protocol. To protect the lysines in the binding site from being labeled, the protein was saturated with 600 µM of compound 2b. The labeled protein was dialyzed over night against assay buffer (10 mM HEPES pH 7.4, 150 mM NaCl, 1 mM CaCl 2 ) using Slide-A-Lyzer dialysis cassettes (10 kDa MWCO). Protein concentration was determined by HPLC-UV against a BSA standard.
Microscale thermophoresis (MST) experiments were carried out at 25 • C with 100% LED power, 50% laser power, a laser on time of 30 s, and a laser off time of 5 sec using standard treated capillaries. Ligands were dissolved in assay buffer supplemented with 0.05% v/v Tween20 and titrated 1:1 for a total of 16 dilution steps. The dilution series of ligand was mixed 1:1 with a solution of 0.2 µM labeled E-SelectinSCR2 and incubated for 10 min at room temperature before measurement. Datapoints were normalized using the bound and unbound borders achieved by NanoTemper Analysis 1.2.205 software (NanoTemper Technologies GmbH, Munich, Germany) and analyzed/illustrated with GraphPad Prism 5.0 (GraphPad Software, La Jolla, CA, USA). The measurements were globally fitted using Equation (1) for single site binding [37].
[PL] = (C P +C L +K D ) − (C P +C L +K D ) − 4C P C L 2C P (1) where [PL] is the protein-ligand complex concentration and K D is the dissociation constant. C P represents the total concentration of protein and C L the total concentration of ligand.

Funding:
We gratefully acknowledge the financial support by Glycomimetics Inc., Gaithersburg, Maryland 20878, USA.