3.1. Binding Scores and RMSD Values
The linker variants from
Figure 1, with lengths ranging from 0 to 6 carbons, were docked into Ag85B using AutoDock Vina, generating 10 binding poses per docking run (with 10 independent random-seed runs per ligand; 100 poses total per linker variant).
Figure 2 presents the distribution of Vina binding scores across the 100 poses for each linker variant (10 poses per run × 10 runs), along with the binding scores of each individual pose for each ligand. Binding scores are reported in the native Vina convention (kcal/mol), where more negative values indicate more favorable predicted binding. It should be noted that AutoDock Vina scores are estimates of binding free energy and are most appropriate for relative comparisons among closely related ligands under the same docking protocol, rather than as exact measurements of binding strength. In particular, for large and flexible ligands, Vina does not explicitly capture important entropic and solvent-related contributions. Therefore, differences on the order of ~1–2 kcal/mol should be interpreted as suggestive trends rather than conclusive quantitative differences.
For all linker variants docked, shorter linkers were consistently associated with lower binding scores, indicating greater relative binding affinity for Ag85B. The zero-carbon linker exhibited one of the strongest predicted binding affinities, with a mean binding score of −10.5 ± 0.7 kcal/mol, while the six-carbon linker demonstrated the weakest predicted affinity, with a mean binding score of −9.4 ± 0.7 kcal/mol. This represents an approximately 1 kcal/mol difference in estimated binding affinity between the shortest and longest linkers. Given the approximate nature of Vina scoring—especially for large, flexible ligands—this magnitude should be interpreted as a qualitative ranking trend rather than as a precise quantitative free-energy difference. The trend is consistent with the hypothesis that shorter linkers may more readily adopt pocket-compatible conformations in these docking models, potentially enabling more recurrent packing and contact patterns with pocket residues.
While binding affinity scores are an important factor for evaluating linker selectivity, it is equally critical to consider the consistency with which different linker variants bind to the same region. In the docking simulations, blind docking was performed with a search space spanning the entire enzyme to avoid biasing the results toward the canonical trehalose site. This provides a more stringent test for nonspecific binding because ligands are permitted to sample alternative surface sites, revealing whether binding is dispersed across multiple regions. In addition to the binding scores presented in
Figure 2, Root Mean Square Deviation (RMSD) values were calculated for each pose of each ligand relative to the best-scoring (most favorable) pose of the same ligand. These RMSD values measure how consistently the ligand poses align with the most favorable binding conformation, which serves as the reference pose. Linker variants that bind opportunistically at multiple regions can produce a more heterogeneous set of poses (and therefore higher RMSD relative to the reference pose), whereas linkers that repeatedly return to the same region can yield tightly clustered poses (lower RMSD) even when alternative sites are available. The results, summarized in
Figure 3, show the distribution of RMSD values for 99 poses relative to the best-scoring reference pose for each linker variant.
Notably, increasing linker length was associated with a modest increase in mean RMSD, suggesting greater variability in predicted ligand binding poses. The zero-carbon linker showed a mean RMSD of 3.0 ± 0.1 Å (SEM), consistent with relatively clustered poses. In comparison, the six-carbon linker showed an RMSD of 5.2 ± 0.2 Å (SEM), indicating a broader spread of predicted poses. Although these differences are modest, the overall pattern is consistent with greater pose heterogeneity for longer linker variants. This may reflect reduced geometric constraint and/or increased conformational flexibility, which could contribute to the less favorable binding affinity scores observed for longer linkers. This trend is illustrated in
Figure 4, which compares all binding poses of the zero-carbon linker (
Figure 4a) and six-carbon linker (
Figure 4b), highlighting tighter clustering in the former and greater dispersion in the latter.
To compare how linker length affects trehalose-core anchoring, RMSD was computed using the conserved trehalose-ring heavy atoms from the top-ranked pose of each independent docking run (10 runs per linker variant), while preserving the atoms in their intact docked poses. Specifically, RMSD comparisons were performed using only the heavy atoms shared between trehalose and the porphyrin analogs (i.e., atoms present in both structures), while ignoring all non-shared heavy atoms. The C6 oxygen was excluded from the shared atom set. This procedure was applied identically across all variants to enable a consistent, internal comparison. The shared-heavy-atom subset of each top-ranked docked pose was then compared to the experimentally observed trehalose conformation from the PDB structure used in this study (PDB ID: 1F0P). To quantify the structural alignment, the RMSD between the top-ranked docked pose (most negative Vina score within a run) and the crystallographic trehalose pose was calculated. The RMSD comparison results are summarized in
Figure 5.
Figure S1 (Supplemental Materials) displays the highest-affinity binding pose for each linker variant, while
Figure S2 (Supplemental Materials) shows these poses relative to the trehalose cocrystal structure. Trehalose was also re-docked using the same receptor preparation and global docking parameters employed for the conjugates. Under these conditions, top-ranked trehalose poses from 10 independent docking runs aligned, on average, to the crystallographic trehalose with an RMSD of 5.4 ± 0.2 Å (SEM) (
Figure 5). Accordingly, RMSD values for the conjugates are interpreted primarily as a relative measure of similarity to the crystallographic reference within a single, consistent protocol.
The shared-atom subset of the top-ranked pose for the zero-carbon linker exhibits an RMSD of 6.0 ± 0.2 Å (SEM) relative to the experimentally determined cocrystal pose, whereas the six-carbon linker shows a higher RMSD of 7.2 ± 0.4 Å (SEM). A clear trend emerges—as linker length increases, so does the RMSD between the shared structures and the known trehalose pose. Importantly, RMSD is a geometric measure of pose similarity and does not itself establish biological truth, since multiple conformations can be compatible with productive binding. The crystallographic trehalose pose provides an experimentally observed reference conformation under crystallographic conditions. Within this consistent workflow, the lower RMSD for the 0C variant indicates that shorter linkers constrain the shared-atom alignment into orientations closer to the crystallographic reference than those sampled by longer linkers. This is consistent with the hypothesis that shorter linkers restrict the conjugates to more recurrent trehalose-core placements relative to the crystallographic reference within these docking models.
Taken together, shorter linkers (0–2 carbons) showed more favorable docking scores and lower RMSD values (both pose-clustering RMSD and trehalose-core RMSD relative to the crystallographic reference) than longer linkers (3–6 carbons) within this protocol, consistent with more recurrent placement in the same pocket region in these docking models. Conversely, longer linkers showed broader RMSD distributions and less favorable docking scores; in representative poses, portions of the linker/porphyrin extend farther from the pocket region, consistent with increased conformational freedom.
The stronger docking behavior of shorter linkers may reflect their ability to maximize favorable non-covalent interactions while improving packing. For instance, the zero-carbon linker, in the top-ranked pose, formed hydrogen bonds and aromatic interactions (π–π with HIS 262 and π–cation with ARG 43). In contrast, the six-carbon linker, despite showing reduced tight packing in the clash/contact analysis, exhibited poor binding scores and high RMSD values, suggesting that its excessive flexibility limits its ability to adopt a recurrent trehalose-core placement relative to the crystallographic reference under this docking protocol. This suggests that an optimal linker length must balance flexibility with spatial fit.
3.2. Clashes and Contacts
Clashes and contacts of the linker variants within the binding pocket were evaluated by analyzing Van der Waals (VDW) overlaps between atoms. Contacts were defined as interactions in which the VDW surfaces of two atoms were within 0.4 Å of each other (overlap ≥ −0.4 Å), indicating potential non-covalent interactions. Clashes, on the other hand, were identified as instances where VDW overlaps exceeded 0.6 Å, representing tight packing due to close atomic proximity. Importantly, in the present docking analysis, these overlaps are not necessarily severe, physically impossible collisions; rather, they represent tight packing events that are allowed by the docking program and can occur in otherwise well-scoring poses. Thus, a residue can contribute many contacts without tight overlaps, and conversely a small number of tight overlaps can reflect a snug fit rather than failure to bind. To ensure that only poses within the binding pocket were analyzed, clashes and contacts were evaluated for the top-ranked pose from each independent docking run for each linker variant (10 poses total per linker variant).
Figure 6 summarizes the average number of contacts and clashes for each linker variant.
As linker length increases, the number of clashes varies modestly, peaking for the zero-carbon linker and then decreasing for longer linkers. The zero-carbon linker exhibits an average of 1.9 ± 0.2 (SEM) clashes, while the six-carbon linker shows the least clashes (0.7 ± 0.3), indicating reduced packing with longer linkers.
The number of contacts increases with linker length up to the four-carbon linker, after which it begins to decline. Notably, the six-carbon linker forms nearly the same number of contacts as the zero-carbon linker, suggesting reduced packing efficiency (despite comparable contact counts) for longer linkers. The zero-carbon linker has an average of 143 ± 3 (SEM) contacts, the four-carbon linker achieves the highest contact count with 158 ± 4 (SEM) contacts, and the six-carbon linker decreases to an average of 148 ± 6 (SEM) contacts. These trends highlight the balance between steric clashes and non-covalent interactions, with shorter linkers forming tighter but potentially strained binding, while longer linkers reduce strain but sacrifice packing efficiency.
To identify which specific residues are responsible for the clashes summarized in
Figure 6, we analyzed all clashing residues for the top-ranked pose from each independent docking run for each linker variant (10 poses total per linker variant). The results, shown in
Figure 7a, summarize the frequency of clashes per residue for each linker variant. To complement the clash analysis and capture close packing that does not rise to steric interference,
Figure 7b summarizes the frequency of contacts per residue for each linker variant. To provide spatial context for these interactions,
Figure 8 presents structural visualizations highlighting the locations of these clashes. In this figure, magenta regions highlight areas of steric clashes between ligand poses within the binding pocket and surrounding residues, illustrating how linker length and ligand conformation may influence interactions and contribute to clashes with specific residues throughout the pocket.
Notably, the zero-carbon linker shows substantial clashes with ARG 43 in
Figure 7a (10 clash occurrences), indicating that ARG 43 contributes through both tight packing and favorable contacts. By comparison, HIS 262 shows no clashes and relatively few contact occurrences for the zero-carbon linker, indicating limited interaction for this variant. Consistent with this,
Figure 7b shows that ARG 43 is among the most frequently contacted residues for the shortest linker (449 total atom-pair contact occurrences across the 10 best poses for the zero-carbon variant), supporting strong engagement that includes steric overlap. In contrast, HIS 262 shows relatively few contact occurrences for the zero and two-carbon variants (9 and 24 respectively), but its contact frequency increases for intermediate/longer linkers (65 for the three-carbon and 61 for the five-carbon variant), suggesting that linker length modulates how often the ligand samples conformations that approach HIS 262 closely. Concurrently, several hydrophobic pocket residues show broadly similar contact frequencies across linker lengths—for example, LEU 229 (176–201 across variants), TRP 264 (121–224), LEU 42 (92–116), and LEU 163 (71–91)—which is consistent with a conserved hydrophobic anchoring of the porphyrin scaffold, while linker length primarily tunes polar/charged contacts and overall pose compactness.
Shorter linkers, such as the zero-carbon and two-carbon variants, exhibited localized clashes that coincided with compact binding and favorable binding scores (
Figure 8). The zero-carbon linker clashed exclusively with ARG 43, LEU 163, and TRP 264, residues positioned in/near the trehalose-binding pocket region, suggesting that deep-pocket insertion can produce close packing without necessarily introducing steric interference at HIS 262 (
Figure 8a). The two-carbon linker, while clashing with LEU 163, also clashed with LEU 152 in the deep pocket, indicating its ability to effectively engage with multiple regions of the pocket (
Figure 8b). This balance between surface strain and deep-pocket interaction may have allowed shorter linkers to achieve superior binding scores.
As linker length increased, additional residues became involved in clashes, potentially disrupting the ligand’s ability to adopt a stable pose. The three-carbon and four-carbon linkers (
Figure 8c and
Figure 8d, respectively) frequently clashed with residues such as LEU 42, LEU 163, and HIS 262 in the deeper pocket. For example, the four-carbon linker showed significant clashes in regions where the linker bridges the porphyrin and trehalose moieties, as visualized in
Figure 8d, highlighting the steric strain introduced by the increased length. Despite avoiding most clashes, the six-carbon linker showed the least favorable docking performance, as its flexibility may have prevented recurrent engagement with residues like ARG 43 and LEU 163, resulting in reduced packing specificity and less recurrent pocket placement in
Figure 8f.
In summary, clashes, particularly for shorter linkers, played a dual role by enabling tight packing within the binding pocket without necessarily causing prohibitive steric interference. Residues such as TRP 264 and LEU 152 interact closely with the shorter linkers, consistent with deep-pocket engagement that helps stabilize compact poses. For key residues such as ARG 43 (and, depending on linker length, HIS 262), the dominant contribution is often through frequent contacts rather than clashes, although ARG 43 also shows recurring clashes (notably for the zero-carbon linker), whereas HIS 262 shows limited interaction for the zero-carbon linker and more frequent contacts for intermediate/longer linkers. Conversely, longer linkers often alter the balance of clashes and contacts, with increased conformational flexibility and/or redistributed steric interactions that can reduce packing efficiency and binding specificity. These results suggest that controlling steric strain can enhance predicted binding specificity and pose stability, provided that it does not disrupt stronger interactions or prevent productive contacts within the binding pocket.
3.3. Protein–Ligand Interactions
While the previous analysis of steric clashes and contacts provided insights into spatial fit and packing efficiency across all poses, hydrogen bonding and other interactions offer a complementary perspective by highlighting specific stabilizing forces within the binding pocket. To focus on the interactions observed in representative low-energy poses, this section examines all interactions in the best-scoring pose for each linker variant, as determined by the lowest (most favorable) Vina score. Hydrogen bonds within 2.8 Å, π–π stacking within 4.4 Å, π–cation interactions within 6.6 Å, and sites of solvent exposure are illustrated in
Figure 9.
Grey regions represent areas of the ligand exposed to the solvent environment, indicating regions less involved in direct interactions with the binding pocket. Boundary regions are color-coded: green boundaries indicate hydrophobic residues, light blue boundaries represent polar residues, and dark blue boundaries denote positively charged residues.
The zero-carbon linker (
Figure 9a) demonstrates multiple interactions within the binding pocket, forming hydrogen bonds with HIS 262 and ARG 43, as well as engaging in π–π stacking interactions with HIS 262 and π–cation interactions with ARG 43. The two-carbon (
Figure 9b) linker exhibits a broader range of interactions, forming hydrogen bonds at ARG 43 (twice), TRP 264, and ASP 40, alongside π–π stacking interactions at HIS 262. The three-carbon linker (
Figure 9c) maintains robust binding through hydrogen bonds at TRP 264, ARG 43, GLN 45, and ASN 54, and establishes π–π stacking interactions at TRP 264 and HIS 262. The four-carbon (
Figure 9d) linker forms hydrogen bonds with TRP 264 and ARG 43, while also displaying π–π stacking interactions at HIS 262. The five-carbon linker (
Figure 9e) interacts through hydrogen bonds at TRP 264, HIS 262 (twice), and GLN 45. Finally, the six-carbon linker (
Figure 9f) forms a single hydrogen bond with ARG 43 and engages in π–π stacking interactions at HIS 262.
Shorter linkers, such as the zero- and two-carbon variants, exhibited minimal solvent exposure, allowing their trehalose and porphyrin moieties to deeply embed within the binding pocket and form strong interactions. In contrast, longer linkers, like the five- and six-carbon variants, showed greater solvent exposure, particularly around the trehalose moiety, suggesting suboptimal alignment and reduced engagement with deep pocket residues. This increased exposure likely contributes to the diminished predicted docking scores (i.e., less favorable Vina scores) observed for longer linkers.
In summary, hydrogen bonding and π interactions in the best-scoring poses provide a complementary view of linker-dependent binding geometry. Short linkers, such as the zero- and two-carbon variants, showed multiple interactions with residues such as ARG 43, HIS 262, and TRP 264 in their best-scoring poses. These residues are strategically located to stabilize ligands within the binding pocket, and their engagement by shorter linkers underscores the importance of deep-pocket interactions. Longer linkers, in contrast, generally showed greater solvent exposure around the trehalose moiety and less compact pocket engagement in the best-scoring poses, despite still forming hydrogen-bonding and/or π interactions. This diminished engagement is consistent with the less favorable docking scores and broader pose distributions.
The zero-carbon linker uniquely forms a π–cation interaction between its triazole ring and ARG 43, a key residue also involved in hydrogen bonding. This dual interaction likely anchors the ligand within the pocket, enhancing stability and contributing to its more favorable predicted docking score. π–Cation interactions, driven by electrostatic attraction between an aromatic ring (here, the triazole ring) and ARG 43’s positively charged guanidinium group, provide a strong stabilizing force. Longer linkers did not show this interaction in the best-scoring poses analyzed here, likely due to increased flexibility and solvent exposure, which prevent optimal positioning of the triazole/linked aromatic moiety.
The solvent-exposure patterns reinforce the conclusion that shorter linkers embed more effectively within the binding pocket, allowing both the porphyrin and trehalose moieties to engage in meaningful interactions. The increased solvent exposure observed for longer linkers, particularly around the trehalose moiety, suggests that these ligands often fail to achieve comparably deep and compact binding poses, leading to reduced engagement with critical residues and weaker binding scores.