3.1. The Difference in Chemical Shifts between Mono- and Disaccharides
As each disaccharide molecule contains two identical or different monosaccharide units, we investigated how the chemical shifts change when transitioning from the free monosaccharides to the monosaccharide unit in a disaccharide. In
Table 1,
Table 2 and
Table 3 the changes in the chemical shifts of an
-glucose,
-glucose and
-galactose unit are shown when they are incorporated in a disaccharide. In each case we take the chemical shifts of the monosaccharide (
-glucose,
-glucose or
-galactose) as reference and compare with the corresponding monosaccharide unit in the disaccharide by computing the RMSD between the two. The RMSD is calculated as
where
i is the index of the atom,
N is the number of atoms in the monosaccharide and
and
are the chemical shifts for the
i-th monosaccharide unit and the reference (isolated) monosaccharide atoms, respectively. These are shown in the last column in the tables. As there are two monosaccharide units in the disaccharides they are distinguished by unprimed (Unit 1) and primed (Unit 2) indices on
Figure 2. In addition, we considered two polymorph structures of both sucrose and lactose. These are referred to as Structure 1 and Structure 2.
In
Table 1 we present our analysis of the
13C chemical shifts of the
-glucose unit in the disaccharides presented in
Figure 2. For the
-glucose units in both sucrose polymorph structures, the RMSD between the chemical shifts in sucrose and in the free
-glucose is 3.3 ppm. The deviations between the chemical shifts of the individual atoms C
, C
and C
in free
-glucose and
-glucose in sucrose are in the range 3.96–4.57 ppm which are significant and indicate that the
-glucose in sucrose is affected by the presence of the second unit.
The second disaccharide, trehalose consists of two units of -glucose, each attached via C. The RMSD of the changes in the glucose chemical shifts is similar for both rings in trehalose. For instance, the deviation of an individual carbon atom is large for C, C and C for the first unit. These are actually the same atoms for which the largest changes were observed in both sucrose structures. However, the changes are smaller in trehalose than in sucrose both for these atoms as well as the other ones, thus also resulting in a smaller RMSD than for sucrose. Interestingly, for both disaccharides, it is not the atom that makes the glycosidic linkage, i.e., C, for which the largest change in chemical shift is observed. For the second -glucose unit in trehalose, the changes are large for C, C and C and thus also for the atom involved in the glycosidic linkage.
For the -glucose in the first -maltose unit, the RMSD is larger than for glucose in both sucrose and trehalose. This is mostly due to the significantly larger change in the chemical shift, 8.68 ppm, of the C atom which forms the glycosidic linkage. In contrast to the smaller changes in sucrose and trehalose unit 2, in -maltose the C atom is deshielded in comparison to -glucose. In the second -glucose unit in -maltose the largest change is observed for C, i.e., again for the carbon atom involved in the glycosidic linkage.
Finally, we will look at
-lactose. The unit cell of
-lactose contains two molecules forming a triclinic crystal system. This means that these two
-lactose in the unit cell have different chemical shift values for the same atom. For that we will look at RMSD between the free monosaccharide and the two different
-lactose, which are called structure 1 and structure 2 in the table. As the two
-glucose units in
-lactose do not have the same chemical shift for the same atoms, the RMSD for these two rings compared to the free
-glucose will not be the same, as seen in
Table 1. The RMSD is very high for one of them and is smaller for the other, which means it is difficult to compare triclinic crystals with other crystals. The two
-glucose units in
-lactose have in common that the largest deviation is for the C
glycosidic linkage atom, which is 11.10 and 14.69 ppm, respectively.
We can deduce so far from the comparison of these calculations that the second unit has an impact on glucose in various disaccharides. When we have a disaccharide with two identical monomers linked via the same atom (trehalose), the effect is the smallest, while the effect increases for sucrose molecules. For compounds with 1–4 glycoside linkages, such as maltose and lactose, the disparity is the greatest. Furthermore, the most significant deviation does not always originate from the atom that forms the glycosidic bond, as seen for sucrose and the two trehalose units. For both maltose units and -lactose, however, the most significant divergence occurs from the glycosidic linkage atom.
Table 2 presents the calculated chemical shifts of
-glucose monosaccharide and
-glucose in the two disaccharides: cellobiose and
-lactose. The RMSD for the first unit in cellobiose is lower than the RMSD for the second unit, which are 3.54 ppm and 6.03 ppm, respectively. Looking at the individual atoms, we see that the largest changes in the chemical shifts of individual carbon atoms in the first unit of cellobiose are 7.38 and 3.68 ppm for C
and C
, i.e., for the glycosidic linkage atom, C
, and its neighbour. For the second
-glucose in cellobiose unit the largest deviation of individual carbon atoms is 14.17 and 3.62 ppm (C
and C
), which are again the glycosidic linkage atom, C
, and its neighbour atom C
. This demonstrates that the RMSD is smaller when a monosaccharide is connected to another ring via the initial atom (C
) than when it is bonded by another atomic number, i.e., C
, as we have seen already in
Table 1 for
-glucose. The RMSD for
-glucose in
-lactose is 4.77 ppm, with the highest deviation being 9.64 ppm for C
. We can deduce from these calculations that glycosidic linkage atoms have the biggest variations in all of these units.
In
Table 3, we compared
-galactose monosaccharide with
-galactose units in the two lactose polymorphs. For both
-lactose the RMSD is 1.76 and for the two polymorphs of
-lactose the RMSD are 3.47 and 3.39 ppm. The largest deviation between
-galactose monosaccharide and
-galactose in both
-lactose and
-lactose are for C
, being the glycosidic linkage atom, which are 2.67, 7.33 and 5.92 ppm, respectively.
Based on all the calculations so far, we can conclude that moving from monosaccharides to disaccharides affects the chemical shift in a specific way for all studied disaccharides. While for most disaccharides the chemical shifts of the two carbon atoms involved in the glycosidic linkage were mostly changed, in the case of sucrose and trehalose other atoms were influenced more than glycosidic linkage atoms. Trehalose differs from the other disaccharides because it has two identical monomers with the same glycosidic linkage atom in both. When transitioning from mono- to di-saccharides, the difference in chemical shift can reach up to 15 ppm.
One might ask whether the changes in the chemical shifts of the glycosidic linkage carbon atoms are correlated with the bond angles at the glycosidic linkage and in particular the angles
and
, which are illustrated in
Figure 7 for maltose. The values for these angles, along with the changes in the chemical shift of the corresponding linkage atoms, are presented in
Table 4. Our analysis reveals a remarkable diversity in the observed angles across different disaccharides. Furthermore, upon comparing these angles with the changes in the chemical shift for the linkage atoms, we found no apparent overall correlation between the variations in chemical shift and the aforementioned angles. While for trehalose, with both angles between 60 and 61 degress the changes in chemical shift are quite small, 0.23 ppm and −3.17 ppm, for sucrose the change in chemical shift is also small with −1.12 ppm but the angles are vastly different, i.e.,
and
. On the other hand, for
-cellobiose and
-lactose,
is roughly around −70
and
above 105
leading to a negative change in the chemical shift of unit 1 roughly around −5 ppm and around −12 ppm for unit 2, i.e., meaning similar changes for similar angles. In the same way, the similar angles in the two
-lactose structures lead to similar changes in the chemical shifts. Therefore, it is evident that the observed variation in the chemical shifts cannot be attributed solely to these angles.
3.2. The Difference in Chemical Shifts between Di- and Tri-Saccharides
In this section we will examine the extent to which the chemical shift is affected on transitioning from di- to tri-saccharides, as seen in
Table 5. Both melezitose and planteose molecules,
Figure 3, contain a sucrose unit, where the additional glycosidic linkage is from the
-D-fructofuranosyl part of sucrose. Melezitose has a smaller RMSD than planteose when compared to
-glucose in sucrose; the same is true for fructose in melezitose and planteose. For sucrose in melezitose and planteose the deviation is greatest for C
and C
, respectively. When we compare free
-glucose with
-glucose in sucrose, melezitose, and planteose, these two atoms are among the most affected. The changes in chemical shift for fructose in melezitose are largest for C
followed by C
, where C
is the new glycosidic linkage atom. Similarly, for planteose, the largest deviation is for C
and C
, even though C
is the new linkage atom. This demonstrates again that the glycosidic linkage atoms do not have to be the most significant difference.
Water molecules are present in the crystals of both melezitose (monohydrate) and planteose (dihydrate), which affects the nuclear shifts of the atoms. We may, therefore, have obtained a good understanding of what happens with the chemical shift on transitioning from the free monosaccharide to disaccharides as no water molecules would interfere with the chemical shifts. However, because both trisaccharides contain water molecules, we actually examine not only the effect of size but also the effect of water. To see how much the chemical shift will truly alter due to the additional unit, one should find some trisaccharides that do not contain water molecules. In the following section we will therefore also study the effect of crystal water on the chemical shifts.
3.3. The Impact of Water Molecules on Chemical Shifts in Mono- and Di-Saccharides
Some of the crystals, for which the crystal structures have been determined, contain water in the unit cell. These crystals are
-glucose,
-maltose [
17], trehalose [
19] and
-lactose [
23]. All of these crystals also have an anhydrate crystal structure. We will therefore compare the monohydrate crystals with the anhydrate to see the impact of water molecules on the chemical shifts in mono- and disaccharides. The anhydrate is always the reference component in our comparisons. We start by looking at
-glucose in
Table 6. The RMSD of the chemical shifts between
-glucose with and without water is 1.80 ppm. The largest change is 3.24 ppm for C
followed by C
, since water molecules are close to these atoms as can be seen in
Figure 8.
According to these calculations, the chemical shift is influenced by the water molecule. The atoms nearest to the water molecules are the ones who suffer the most. The influence of the water molecule ranges between 2 and 5 ppm, which is smaller than the effect of the difference between mono-, di-, and trisaccharides, as well as the effect of various disaccharides with each other.
In
Table 7 we present the results for maltose anhydrate and monohydrate. One should note that the maltose monohydrate is a
-maltose anhydrate, meaning that the second unit is a
-D-glucopyranoside and not an
-D-glucopyranoside as in
-maltose. The differences in the chemical shifts of the second unit between the anhydrate and monohydrate are thus not solely due to the water molecules. However, we chose to include these numbers anyway. The RMSDs for the two units of maltose with and without water are 2.22 and 3.61 ppm, respectively. The largest deviation is 3.18 ppm (C
) in the first ring, since the water molecule is closer to that atom. For the second ring one observes also a change in the chemical shift of C
due to a neighbouring water molecule (4.81 ppm). Furthermore, there is also a significant change of 5.70 ppm for C
, which has nothing to do with water molecules and must therefore arise from the difference in crystal structure and the atoms around it, as seen in
Figure 8.
For the trehalose, the largest deviation between the anhydrate and the monohydrate trehalose is 1.96 ppm (C
) for the first ring ase well as 2.19 and 2.42 ppm (C
and C
) for the second ring, as seen in
Table 8. These changes are due to the influence of water molecules, which are between C
and C
in the first ring and C
in the second ring which then affect the C
and C
, as seen in
Figure 8.
Table 9 presents the results for different
-lactose crystals. The lactose anhydrate is triclinic, which means there are two distinct
-lactose anhydrate molecules involved. Starting with the galactose ring in lactose, the water molecules are close to C
and C
when the structure is treated as a crystal rather than simply a unit cell. The difference in chemical shift between the galactose in
-lactose anhydrate and monohydrate is considerable for C
in both molecules and C
in the second molecule. On the other hand, the highest difference between the
-glucose in
-lactose anhydrate and monohydrate is for C
in the first molecule and C
in the second. This deviation occurs because a water molecule is close to the oxygen atom between C
and C
, and another water molecule is close to C
, which impacts C
in the second, as seen in
Figure 8.
To investigate the potential correlation between the distance of the carbon atoms to the oxygen atom in the closest water molecule and the corresponding change in chemical shift, we have collected these date in
Table 10. We observe three ranges of distances. The shortest are roughly around 3.5 Å, the intermediate around 4 Å and the largest distances are around roughly around 4.5 Å. For the shortest distances the changes in chemical shift range from −0.5 ppm all to 3.6 ppm, while for the distances around 4 Å the changes in chemical shift are between −4.8 ppm and 2.6 ppm. Finally for the largest distances the changes are in the range from −2.4 ppm to −1.6 ppm, which implies that there is no simple direct correlation between the change in chemical shift and the distance to the nearest water molecule.