Sulfur Analogs of the Core Formose Cycle: A Free Energy Map

Kua, Jeremy; Peña, Maria T.; Cotter, Samantha N.; Leca, John

doi:10.3390/life15010001

by Jeremy Kua *, Maria T. Peña, Samantha N. Cotter and John Leca

Department of Chemistry & Biochemistry, University of San Diego, San Diego, CA 92110, USA

* Author to whom correspondence should be addressed.

Life 2025, 15(1), 1; https://doi.org/10.3390/life15010001

Submission received: 26 November 2024 / Revised: 16 December 2024 / Accepted: 19 December 2024 / Published: 24 December 2024

(This article belongs to the Special Issue Feature Papers in Origins of Life 2024)

Abstract

Using computational methods, we examine whether the presence of H₂S can tame the unruly formose reaction by generating a free energy map of the reaction thermodynamics and kinetics of sulfur analogs within the core cycle. With mercaptoaldehyde as the linchpin C₂ species, and feeding the cycle with CH₂O, selected aldol additions and enolizations are kinetically more favorable. Thione formation is thermodynamically less favored compared to aldehydes and ketones, but all these species can be connected by enolization reactions. In some sulfur analogs, the retroaldol transformation of a C₄ species back into linchpin species is thermodynamically favorable, and we have found one route where incorporating sulfur selects for a specific pathway over others. However, as CH₂O diminishes, the aldol addition of larger species is less favorable for the sulfur analogs. Our results also suggest that competing Cannizzaro side reactions are kinetically less favored and thermodynamically disfavored when H₂S is abundant.

Keywords: origins of life; thermodynamics; kinetics; prebiotic chemistry; formose reaction; sulfur

1. Introduction

In extant biochemistry, autocatalytic cycles are a key feature of metabolism [1]. However, outside of living systems, there are few instances of such reaction networks that utilize simple substances not artificially designed. One exception, of interest to the origins-of-life research community, is the formose reaction whereby the C₁ “food” molecule (CH₂O) is converted to (CH₂O)_n sugars of increasing size and diversity. In the 1970s, the formose reaction was extensively investigated as a way to boost carbohydrate and food production [2], but ultimately proved unfeasible. The core sugar-forming reaction mechanism utilizing aldol and retro-aldol reactions is now well known [3,4,5]. However, because aldehydes are present, the alkaline conditions used in the formose reaction lead to a complex mixture due to Cannizzaro disproportionation reactions [6], and this mixture includes a plethora of acids and polyols.

There has been a recent resurgence in the investigation of the glorious mess that is the formose reaction. In a series of systematic studies, the Huck group examined how the observed product distribution was affected by changing the environmental variables [7,8,9]. Paschek and co-workers examined how catalysts, plausibly present in carbon-containing meteorites, influenced sugar synthesis [10], while Vinogradoff et al. used olivine silicate catalysts [11]. Haas et al. investigated the effects of mechanochemistry [12]. Omran, who highlighted the messiness of the formose reaction [6], more recently looked at the self-construction of chemical gardens under conditions resembling hydrothermal vents [13]. Large-scale computational methods have also been applied to the formose reaction to test various chemical network models [14,15,16].

Our focus is on the smallest autocatalytic core of the formose reaction, as shown in Figure 1. The C₁ “food” species is CH₂O, and in the absence of any other compounds in the autocatalytic cycle, the reaction has a slow induction phase. This is because the direct dimerization of CH₂O into the C₂ linchpin molecule glycolaldehyde has a high activation barrier due to the absence of an umpolung species to create new C–C bonds. However, once a small amount of the linchpin is present, the reaction accelerates rapidly. The direct dimerization can now be bypassed because the aldol addition reactions, C₂ + C₁ → C₃ and C₃ + C₁ → C₄, proceed with much lower barriers. Autocatalysis is triggered when a retro-aldol reaction regenerates more of the linchpin species (C₄ → C₂ + C₂) which leads to increasing rates of CH₂O consumption. While lower concentrations of larger sugars (C₅ to C₈) may be observed relatively early in the reaction, most of these are produced in later stages after CH₂O has been consumed. Note that the Cannizzaro disproportionation reaction is always present, and not just for the C₁ molecules (as shown in Figure 1); any sugar molecule in the cycle can disproportionate.

In our early work [17] on formaldehyde oligomerization, we found that polyol formation may compete with the aldol addition, but although C–O bond formation is kinetically favored over C–C bond formation, the former is thermodynamically reversible and hydrolyzes in aqueous solution. In contrast, the aldol addition of CH₂O with C–C bond formation is thermodynamically quite favorable and has significantly higher barriers for the reverse reaction. Also, isomerization reactions (e.g., glyceraldehyde to dihydroxyacetone) or ring closures (e.g., erythrose or threose for the C₄ species) may lead to an equilibrating pool of off-cycle species that reduce the reactivity of the in-cycle and more reactive aldehyde compounds.

Our group is interested in proto-metabolism. After completing the initial (albeit small) thermodynamic maps of CHO and CHOS compounds [18,19], we wondered if the presence of sulfur could play a role in taming the complex product distribution of the formose reaction. Sulfur has a long, storied history in prebiotic chemistry. While the autocatalytic metabolic core [20] in present life (exemplified by the tricarboxylic acid cycle and its analogs) mainly consists of CHO molecules, the importance of coenzyme A may be a vestige of sulfur’s broader involvement in proto-metabolism as proposed by De Duve [21]. The Sutherland group’s prebiotic map proposes a “cyanosulfidic” world [22], and sulfur’s significance at the origin of life has been highlighted in a recent review [23]. Sulfur was also prominently featured in Wächtershäuser’s pyrite world [24], which helped usher proto-metabolism to the forefront of recent origin-of-life research.

As previously reported [19], we identified mercaptoaldehyde as the C₂ linchpin species analogous to glycolaldehyde, and showed that its formation from glycolaldehyde and H₂S was thermodynamically favorable and kinetically feasible. We outlined the potential favorable thermodynamics of a subset of the C₃ and C₄ sulfur analogs, but we did not examine detailed pathways (the purpose of that paper was to look at broad thermodynamic trends). We speculated that depending on where the thiol groups were positioned, they may provide some degree of selectivity but we only provided broad trends: thiol groups on terminal carbons were thermodynamically favored, and sulfur in the aldose rings could shift the equilibrium from exclusively favoring ketoses over aldoses. We did not look at sulfur’s influence on the retroaldol reaction, nor calculate most of the reaction barriers that would influence the kinetics of the aldol additions, isomerizations, or Cannizzaro reactions.

The present work takes a detailed look at the core autocatalytic cycle and examines how sulfur analogs influence the thermodynamics and kinetics of the many interconnected reaction steps. Thus, we provide an analogous (albeit messier) map to our recent study of the smallest core [25]. While H₂S was mentioned in the previous work, it was only in the context of sequestering formaldehyde or as an external catalyst rather than being incorporated as thiols throughout the cycle.

This article is organized as follows: After describing our computational protocol and its limitations, the combined results and discussion section addresses (1) how the presence of H₂S impacts the C₁ food and C₂ linchpin compounds; (2) how the competing Cannizzaro reactions drive the reduction of food species to methanol; (3) the impact of thiols in the two aldol reactions that add CH₂O to form C₃ and C₄ species, respectively; (4) how thiols may shift the favorability of the C₄ → C₂ + C₂ retroaldol; and (5) the potential of C₅- and C₆-forming reactions and their limitations.

2. Computational Methods

We use the same computational protocol as our recent work on the thermodynamic map of small CHOS molecules and the exploration of the core formose cycle [19,25] so we can make direct comparisons and extend our free energy map. Here, we provide a brief description of that protocol for convenience. Much of the text in this section is reproduced from those two articles (published in this journal) [19,25] since we think the description is both clear and succinct. Essentially, we calculate the free energies using quantum chemical methods, and our protocol showed good agreement with the available experimental results for CHO systems [17,18,26,27] (there are no experimental data for sulfur-containing compounds involved in the formose reaction).

The computational details are as follows: The geometry of each molecule is optimized and its electronic energy is calculated at the B3LYP [28,29,30,31] flavor of density functional theory with the 6-311G** basis set. To maximize the probability of finding the global minima, multiple conformers are generated using molecular mechanics (MMFFs force field [32]). The optimized structures are embedded in a Poisson–Boltzmann continuum to calculate the aqueous solvation contribution to the free energy. While this does not provide a specific concentration, it assumes a dilute solution such that the electrostatic field generated by a neighboring solute molecule is effectively screened by the water solvent. One can consider all solutes to have the same relative concentrations in our calculations. In a handful of cases, when the solvation calculation gave a seemingly spurious free energy, we made empirical corrections as explained in the Supplementary Materials.

Zero-point energy corrections are included, and we apply the standard temperature-dependent enthalpy correction term (for 298.15 K) from statistical mechanics by assuming translational and rotational corrections are a constant times kT, and that low frequency vibrational modes generally cancel out when calculating enthalpy differences. However, entropic corrections in aqueous solution are problematic [33,34,35]. Changes in free energy terms for translation and rotation are poorly defined in solution due to restricted complex motion, particularly as the size of the molecule increases (thus, increasing its conformational entropy). Free energy corrections come from two different sources: thermal corrections and implicit solvent. Neither of these parameters is easily separable, nor do they constitute all the required parts of the free energy. We follow the approach of Deubel and Lau [36], assigning the solvation entropy of each species as half its gas-phase entropy (calculated using standard statistical mechanics approximations similar to the enthalpy calculations described above), based on proposals by Wertz [37] and Abraham [38] that upon dissolving in water, molecules lose a constant fraction (~0.5) of their entropy.
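As a schematic summary of the corrections described above, the aqueous free energy of each species can be written roughly as follows. The symbol names are illustrative shorthand rather than notation defined in this article, and the exact bookkeeping of the thermal terms is an assumption; only the halving of the gas-phase entropy is stated explicitly in the text.

```latex
G_{\mathrm{aq}} \approx E_{\mathrm{elec}} + \mathrm{ZPE} + \Delta H_{298} + \Delta G_{\mathrm{solv}} - T\,S_{\mathrm{aq}},
\qquad S_{\mathrm{aq}} \approx 0.5\,S_{\mathrm{gas}}, \quad T = 298.15\ \mathrm{K}
```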

To estimate activation energies, transition states were optimized by including several explicit water and/or catalytic molecules to aid in transferring H moieties. All calculated transition states have one significant negative eigenvalue corresponding to the reaction coordinate (eigenvector) involving bond breaking/forming. Several conformers built by hand are tested in each case and we only report the lowest calculated barriers.

In comparing the equilibrium concentrations in a self-oligomerizing solution of 1 M glycolaldehyde at 298 K, our protocol fared very well compared to subsequent NMR measurements [27]. Our relative Gibbs free energies in aqueous solution are typically within 0.5 kcal/mol compared to experiment. That being said, our protocol shows systematic errors of 2–3 kcal/mol when calculating barriers involving carbonyl chemistry when compared to experimental results. Going to a higher level of theory does not reduce this error [39], nor does using anionic species [18]. There are also specific computational problems when cations are included in our protocol, as discussed in previous work [25]. Quantum chemistry is about error cancelation, and our protocol (with its foibles, including the simplistic entropy correction) has worked well even with this systematic error for activation barriers. Thus, we do well on thermodynamics, while our error bar for kinetics is larger, though still reasonable for the carbonyl chemistry and aldol reactions in this work.

3. Results and Discussion

To connect this work to our previously published CHOS thermodynamic map [19], we will use the same set of reference compounds: CO₂, H₂, H₂O, and H₂S will be assigned a relative free energy, G_rel of 0.0 kcal/mol. The G_rel of all other species can be determined by calculating the change in free energy, ΔG, for forming the species, analogous to a free energy of formation. For example, the formation of the linchpin species mercaptoaldehyde (C₂H₄OS) can be written as follows:

2 CO₂ + 4 H₂ + H₂S → C₂H₄OS + 3 H₂O

Since ΔG of this reaction is −6.2 kcal/mol, we assign G_rel(C₂H₄OS) = −6.2 kcal. For the rest of this paper, we will use the unit kcal as shorthand to signify kcal/mol.
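A formation reaction written against the reference compounds only defines G_rel meaningfully if it is atom-balanced. The following minimal Python sketch counts atoms on each side of the mercaptoaldehyde example; the dictionary layout and the function name are illustrative choices, not part of the computational protocol.

```python
from collections import Counter

# Elemental compositions of the species in the formation reaction.
FORMULAS = {
    "CO2": {"C": 1, "O": 2},
    "H2": {"H": 2},
    "H2S": {"H": 2, "S": 1},
    "C2H4OS": {"C": 2, "H": 4, "O": 1, "S": 1},  # mercaptoaldehyde
    "H2O": {"H": 2, "O": 1},
}

def atom_count(side):
    """Sum atoms over a dict mapping species -> stoichiometric coefficient."""
    total = Counter()
    for species, coeff in side.items():
        for element, n in FORMULAS[species].items():
            total[element] += coeff * n
    return total

# 2 CO2 + 4 H2 + H2S -> C2H4OS + 3 H2O
reactants = {"CO2": 2, "H2": 4, "H2S": 1}
products = {"C2H4OS": 1, "H2O": 3}

balanced = atom_count(reactants) == atom_count(products)
print("balanced:", balanced)
```

The same check can be reused for any other species on the map by adding its formula to the table.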

A consistent set of reference compounds allows us to globally compare energies. In our figures, G_rel values are found next to each compound and in square brackets next to an arrow for transition states. Since some reactions may have more than one non-reference compound, we will also use ΔG to designate the change in free energy when focusing on a particular reaction, where ΔG = G_rel(products) − G_rel(reactants). Similarly, when we refer to the barrier of a specific reaction, we will designate this ΔG^‡ which compares G_rel of the transition state to either the reactants or products depending on whether the forward or reverse reaction is being discussed. Throughout this section, we will regularly compare our ΔG and ΔG^‡ values to their non-sulfur counterparts in our previous work [25].

3.1. Formaldehyde: The “Food” Species

As shown in Figure 2, G_rel of CH₂O is +7.9 kcal. In line with our previous work [25], it is thermodynamically favorable for CH₂O to exist predominantly as its hydrate in aqueous solution. CH₂(OH)₂ has a G_rel of +3.3 kcal, so ΔG for hydration is −4.6 kcal. The transition state has a G_rel of +21.1 kcal, giving a barrier ΔG^‡ of +13.2 kcal. Both our ΔG and ΔG^‡ values are in good agreement with experimental values [40].
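The hydration arithmetic, and what ΔG implies for the position of the equilibrium, can be sketched in a few lines of Python. The conversion K = exp(−ΔG/RT) is the standard thermodynamic relation and is not a quantity reported in this article; treating water at its reference state is an assumption of this sketch.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)
T = 298.15            # temperature of the thermal corrections, in K

# G_rel values (kcal/mol) quoted in the text; H2O is a reference compound (0.0).
G_REL = {"CH2O": 7.9, "H2O": 0.0, "CH2(OH)2": 3.3, "TS_hydration": 21.1}

def g_sum(species):
    """Total G_rel of a list of species."""
    return sum(G_REL[s] for s in species)

reactants = ["CH2O", "H2O"]
dG = g_sum(["CH2(OH)2"]) - g_sum(reactants)        # reaction free energy
dG_barrier = G_REL["TS_hydration"] - g_sum(reactants)  # forward barrier
K = math.exp(-dG / (R_KCAL * T))                   # standard-state equilibrium constant

print(round(dG, 1), round(dG_barrier, 1))  # -4.6 13.2, matching the text
print(f"K ~ {K:.0f}")
```

With these numbers the hydrate is favored by roughly three orders of magnitude, consistent with the statement that CH₂O exists predominantly as methanediol in water.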

If H₂S is present, it can potentially compete with water and add to the carbonyl. Addition of H₂S is marginally less exergonic (ΔG = −3.9 kcal) with a marginally lower barrier (ΔG^‡ = +12.1 kcal). Thus, depending on the concentration of H₂S we expect to see both addition products in solution in equilibrium with CH₂O. The dehydration of CH₂(OH)(SH) to form the thione CH₂S is significantly uphill (ΔG = +17.2 kcal); any CH₂S formed would easily rehydrate. However, under dehydrating conditions, or if CH₂S is ever found in higher than transient concentrations, its direct reaction with CH₂O to form mercaptoaldehyde is highly exergonic, although the barrier is high; G_rel of the transition state is +56.1 kcal and this reaction is kinetically rather unfavorable.

CH₂O and its hydrate can undergo a Cannizzaro disproportionation reaction to form HCOOH and CH₃OH. This reaction is thermodynamically favorable (ΔG = −19.6 kcal) but has a modest barrier (ΔG^‡ = +25.9 kcal). Thermodynamic favorability is essentially driven by the reduction of CH₂O to CH₃OH with a G_rel difference of −11.2 − (+7.9) = −19.1 kcal, while the oxidation of methanediol to HCOOH has a tiny G_rel difference. With H₂S, we expect some concentration of CH₂(OH)(SH) to be present, thus, the Cannizzaro reaction can lead to a thioacid or a thione-acid as shown in Figure 3. Both reactions are exergonic (although significantly less so) and the barriers are ~5 kcal/mol higher. This is consistent with our thermodynamic map on CHOS compounds where both thioacids and thioneacids were significantly less stable than their corresponding carboxylic acid [19].
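To make the split between the reduction and oxidation halves explicit, the short Python sketch below decomposes the overall Cannizzaro ΔG using only values quoted in this section. The G_rel of HCOOH that it prints is back-calculated from those values and is not a number reported in the text, so it should be read as an implied estimate.

```python
# G_rel values (kcal/mol) quoted in the text.
G_CH2O = 7.9     # formaldehyde
G_DIOL = 3.3     # methanediol, CH2(OH)2
G_MEOH = -11.2   # methanol
DG_CANNIZZARO = -19.6  # CH2O + CH2(OH)2 -> CH3OH + HCOOH

# Reduction half: CH2O -> CH3OH
dG_reduction = G_MEOH - G_CH2O

# Oxidation half: CH2(OH)2 -> HCOOH, obtained as the remainder
dG_oxidation = DG_CANNIZZARO - dG_reduction

# Implied (not reported) G_rel of formic acid
g_formic_implied = G_DIOL + dG_oxidation

print(round(dG_reduction, 1), round(dG_oxidation, 1), round(g_formic_implied, 1))
```

The remainder assigned to the oxidation half is well under 1 kcal, which is what the text means by a tiny G_rel difference.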

The sulfur analogs for addition to carbonyl or a Cannizzaro reaction have similar transition states to their non-sulfur analogs, the hydration reaction or the carboxylic-acid Cannizzaro-forming reaction. The optimum transition state for an addition reaction has two catalytic water molecules (an 8-center transition state) while the lowest barrier Cannizzaro has zero catalytic waters (a 6-center transition state) as shown in Figure 4.

3.2. Mercaptoaldehyde: The C₂ Linchpin Species

Glycolaldehyde is the linchpin C₂ species in the formose reaction; only a small amount is needed to kick-start the autocatalytic cycle (the presence of any member of the cycle will also suffice). For the non-sulfur analog, we have examined its role using the same protocol in our previous work [25]. In the presence of H₂S, glycolaldehyde can be converted into its sulfur analog, mercaptoaldehyde. The energetics of the reaction pathway was shown in our previous work [19] and is repeated in Figure 5. The reaction is overall exergonic by 5.7 kcal. The first two steps, addition of H₂S followed by dehydration are slightly endergonic but the barriers are low. The subsequent two steps, conversion of thione to enol to aldehyde, are both exergonic and also have low barriers. Thus, this reaction is both thermodynamically and kinetically feasible.

As shown in Figure 6, hydration of mercaptoaldehyde is marginally endergonic by 0.4 kcal. In a dilute aqueous solution, the equilibrium will shift towards the hydrated species. In the presence of food species, the hydrate can undergo a cross-Cannizzaro reaction with CH₂O to form mercaptoacetic acid. ΔG of this reaction is −19.9 kcal (similar to the C₁ Cannizzaro), while the barrier (ΔG^‡ = +28.2 kcal) is higher by ~2 kcal. Mercaptoaldehyde can undergo a self-Cannizzaro reaction or a cross-Cannizzaro with glycolaldehyde, but both these have higher barriers.

Addition of H₂S to mercaptoaldehyde is uphill (ΔG = +3.5 kcal), and the subsequent cross-Cannizzaro reaction with CH₂O to form 2-mercaptothioacetic acid is not as exergonic (ΔG = −12.8 kcal) and has a higher barrier (ΔG^‡ = +30.0 kcal); C₂ species containing two sulfur atoms are minor at best (or more likely not found) in the complex mixture.

The way forward into the autocatalytic cycle is the aldol addition of CH₂O to mercaptoaldehyde via its enol. The C₂ enolization is 7.6 kcal uphill and the barrier is +21.5 kcal (in Figure 5 on the left, starting from mercaptoaldehyde, this is the reverse step). This is similar thermodynamically to the enolization of glycolaldehyde (ΔG = +7.6 kcal, ΔG^‡ = +24.3 kcal), but kinetically, mercaptoaldehyde enolization has a barrier that is ~3 kcal lower. Hence, the presence of sulfur analogs may accelerate entry into the autocatalytic cycle.
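To give a rough sense of what a ~3 kcal lower barrier means for rates, the following Python sketch applies the Eyring equation from transition state theory. This is an assumption layered on top of the article, which reports free energies rather than rate constants; the transmission coefficient is taken as 1, and given the 2–3 kcal systematic error in barriers noted in the Computational Methods, the ratio is only qualitative.

```python
import math

R_KCAL = 1.987204e-3        # gas constant, kcal/(mol K)
KB_OVER_H = 2.083661912e10  # Boltzmann constant / Planck constant, 1/(s K)

def eyring_rate(dg_barrier_kcal, temperature=298.15):
    """First-order Eyring rate constant (1/s) for a free energy barrier in kcal/mol."""
    return KB_OVER_H * temperature * math.exp(-dg_barrier_kcal / (R_KCAL * temperature))

# Enolization barriers quoted in the text (kcal/mol).
k_mercapto = eyring_rate(21.5)  # mercaptoaldehyde
k_glycol = eyring_rate(24.3)    # glycolaldehyde

ratio = k_mercapto / k_glycol
print(f"mercaptoaldehyde enolizes ~{ratio:.0f}x faster under these assumptions")
```

The prefactor cancels in the ratio, so only the 2.8 kcal barrier difference matters here.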

3.3. Sulfur Analogs of the C₃ Species: Formation and Interconversion

Before launching into the details, Figure 7 shows our big-picture map of the many reactions that can take place involving C₁ to C₄ species that could be involved directly or indirectly in the core autocatalytic cycle. The top row shows the relevant C₁ and C₂ species. The second row and the top half of the leftmost column are the C₃ species. The rest of Figure 7 contains the C₄ species with retro-aldol products shown in blue boxes. All numerical values (in kcal) are G_rel of the species (if next to a structure) or a transition state (if next to an arrow and in square brackets). The nomenclature of each compound is based on how many carbon atoms it has, its main functional group, the location of the sulfur, and in some enols, the location of the double bond. For example, the C₂ species aldehyde, enol, and thione are named 2a, 2e, and 2t respectively. Black arrows refer to enolization reactions. Aldol additions are shown with red arrows. The aldol addition of CH₂O to 2e has two possible products, the thione with sulfur on the first carbon (3t1) and the aldehyde with sulfur on the second carbon (3a2). Further nomenclature will be discussed as we cover the relevant compounds and reactions.

The two possible products for this first C₂ + C₁ → C₃ aldol addition are 2-thioglyceraldehyde (3a2) and the thione analog of glyceraldehyde (3t1). Stereochemically, these are the analogs of D-glyceraldehyde, consistent with our previous work [25]. Forming the aldehyde 3a2 is thermodynamically very favorable from the enol 2e (ΔG = −5.9 − (1.4 + 7.9) = −15.2 kcal) and less so from the aldehyde 2a (ΔG = −5.9 − (−6.2 + 7.9) = −7.6 kcal). This is 2.4 kcal less exergonic than its non-sulfur counterpart, the addition of CH₂O to glycolaldehyde. However, the barrier for the sulfur analog (ΔG^‡ = +16.5 kcal from the enol) is ~3 kcal lower than its non-sulfur counterpart. Thus, not only is forming the C₂ enol enhanced kinetically by the presence of the thiol group, but the subsequent aldol addition is also enhanced kinetically. Note that we use the enol rather than an enolate in aldol reactions because our calculations with neutral molecules gave far better results than using anions (see Computational Methods), similar to our previous calculations on the formose reaction [25].

Not surprisingly the thione product (3t1) is less favored thermodynamically, but the barrier to form the thione (ΔG^‡ = +16.8 kcal from the enol) is essentially similar to forming the aldehyde. Thus, we expect both C₃ products to be formed in this system. Interestingly, the transition states have very different distances for the forming C–C bond as shown in Figure 8. In both cases the H transfer is essentially completed before the C–C bond is formed; however, the formation of 3a2 has a shorter distance of 2.07 Å in the transition state, while the less concerted 3t1 has a forming C…C distance of 2.66 Å. We tried several transition state conformations; the structures shown in Figure 8 are the ones with the lowest barriers. Note that the G_rel values for these transition states at +24.3 and +24.7 kcal are some of the most positive, and therefore, in the overall map, this C₂ + C₁ → C₃ aldol addition may represent the rate-determining step globally.

Interconversion of the C₃ species to their isomeric counterparts is possible via enolizations. 3a2 can enolize into 3e2 (ΔG = +6.3 kcal, ΔG^‡ = +25.2 kcal) but is much less likely to form the thione 3t2 (as indicated by the dashed arrow) which is less favored both kinetically and thermodynamically. The enol is more likely to revert back to the aldehyde 3a2. On the other hand, 3t1 favorably enolizes to form 3e1-1 (C₃ enol with thiol on the first carbon, and the double bond at the first carbon). The reaction is 5.6 kcal downhill and the barrier is low (ΔG^‡ = +14.8 kcal) due to the instability of the thione which can be considered a higher-energy or “activated” species. The enol favorably converts to the ketone 3k, the thermodynamic sink of the C₃ species. While the ketone could enolize at the other end to form 3e1-2 and subsequently 3a3 (3-mercaptoglyceraldehyde), this is overall less favorable. Thus, 3a3 may be a minor species in equilibrium with 3k.

While we calculated both the cis and trans enols and their corresponding transition states, we found that in the vast majority of cases, the cis enol was favored both kinetically and thermodynamically; hence, we show only the cis isomers in Figure 7 with their corresponding G_rel values. The free energy differences comparing cis and trans structures can be found in Supplementary Materials. In Figure 9, we show an example of a cis transition state (interconverting 3k and 3e1-1) and a trans transition state (interconverting 3t1 and 3e1-1). The C…H distance in both transition states is similar (1.58 and 1.61 Å). Most of the O…H distances are in the expected range, except the transition state on the left has one that is noticeably longer (1.60 Å) and one noticeably shorter (0.99 Å), and this is likely due to the longer S…H distance of 2.19 Å.

Globally in our map, the G_rel values for the enolization transition states range from +16.7 to +21.9 kcal. Thus, if the C₁ + C₂ → C₃ barrier can be traversed under some experimental conditions, we expect these enolization reactions to also be kinetically accessible. If the C₃ enols are formed transiently, and the food species CH₂O is plentiful, C₁ + C₃ → C₄ aldol addition will proceed. Globally, these aldol addition transition states have G_rel values ranging from +20.4 to +25.0 kcal, which are slightly higher than for the enolizations. It is more kinetically favorable for a C₃ enol to convert back to a ketone or aldehyde, but the aldol addition is more thermodynamically favorable, as discussed in the next section.

3.4. Sulfur Analogs of the C₄ Species: Formation and Interconversion

Each of the three C₃ enols can potentially undergo the aldol addition with CH₂O to form a C₄ compound. On the right hand side of Figure 7, 3e2 can either form the branched aldehyde 4ba2 or the thione 4t2. While the formation of 4ba2 is exergonic, it is a “dead end” where the formose reaction is concerned, and its only route back into the cycle is the reverse (retroaldol) reaction back to C₃ and C₁. Forming 4t2 is only slightly exergonic from the enol (ΔG = +2.5 − (0.4 + 7.9) = −5.8 kcal) and barely endergonic from the aldehyde 3a2 (ΔG = +0.5 kcal). Not surprisingly, the non-sulfur counterpart forming the ketone is significantly more exergonic. The 3e2 + CH₂O → 4t2 addition has a relatively low barrier (ΔG^‡ = +22.4 − (0.4 + 7.9) = +14.1 kcal from the enol, or +20.4 kcal from the aldehyde 3a2). We were unsuccessful in isolating the transition state to form 4ba2, and our optimizations went to the transition state for the formation of 4t2. In the non-sulfur counterpart, the barrier to the ketone is 4 kcal lower than to the branched aldehyde. The presence of the sulfur loosens the transition state and is likely why we were unable to isolate the transition state to 4ba2. Regardless, we do not expect 4ba2 to play an important role in this system.
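Because every species on the map carries a G_rel against the same reference set, the same product or transition state can be measured from either the enol or the aldehyde starting point. The Python sketch below reproduces the 4t2 numbers quoted above; the dictionary keys and the helper name are illustrative labels, not identifiers used in the article.

```python
# G_rel values (kcal/mol) quoted in the text for the 3e2 + CH2O -> 4t2 addition.
G_REL = {
    "CH2O": 7.9,
    "3a2": -5.9,            # 2-thioglyceraldehyde
    "3e2": 0.4,             # its enol
    "4t2": 2.5,             # thione product
    "TS_3e2_to_4t2": 22.4,  # aldol transition state
}

def relative_to(state, reactants):
    """Free energy of `state` measured from the summed G_rel of `reactants`."""
    return G_REL[state] - sum(G_REL[r] for r in reactants)

starting_points = {"enol": ["3e2", "CH2O"], "aldehyde": ["3a2", "CH2O"]}
results = {}
for label, reactants in starting_points.items():
    dG = relative_to("4t2", reactants)
    barrier = relative_to("TS_3e2_to_4t2", reactants)
    results[label] = (round(dG, 1), round(barrier, 1))
    print(label, results[label])
```

Measured from the enol the step looks easy, while measured from the aldehyde the barrier includes the cost of enolization, which is why both numbers are quoted in the text.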

On the left hand side, CH₂O addition to the enol 3e1-1 leads to the branched aldehyde 4ba3 or the ketone 3-thioerythrulose (4k3). Forming the ketone is both thermodynamically and kinetically more favorable (ΔG = −16.7 kcal, ΔG^‡ = +16.6 kcal) from the enol. Forming the branched aldehyde is ~2 kcal less exergonic and the barrier is ~2 kcal higher. However, globally, aldol addition is kinetically less favored than enolization when comparing the transition state G_rel values as discussed earlier. Similarly to what we see for 3e1-1, the enol 3e1-2 can add CH₂O to form the same branched aldehyde 4ba3 or 1-thioerythrulose (4k1). Once again, forming the ketone is both thermodynamically and kinetically more favored (ΔG = −17.2 kcal, ΔG^‡ = +16.0 kcal) from the enol. Should the branched aldehyde 4ba3 form, it can undergo a retroaldol, eliminating CH₂O to form either enol, although the path to 3e1-2 is kinetically slightly favored over 3e1-1.

Similarly to what we found for the C₃ species, the ketones are the thermodynamic sink for the C₄ compounds. 4k1 is unlikely to isomerize into the much less stable thione 4t1, and it most likely equilibrates with 4k4 in solution (with a computationally insignificant G_rel difference of 0.1 kcal). 4k4 can isomerize into the aldehyde 4a4 (via a terminal enol) and a small amount of the aldehyde likely exists at equilibrium. Similarly, the ketone 4k3 can isomerize into the aldehyde 4a3 (center of Figure 7) as a minor species at equilibrium. On the right hand side of Figure 7, the less stable thione 4t2 has two pathways forward. It could isomerize to the ketone 4k3 or to the aldehyde 4a2. Both pathways are rather exergonic, with the ketone being thermodynamically favored over the aldehyde. However, the path to the aldehyde (via enol 4e2) is favored kinetically.

While we expect the open chain aldehydes 4a3 and 4a4 to be minor species in solution equilibrating with their respective ketones 4k3 and 4k4, once 4a2 is formed it is unlikely to reverse to the much less stable thione 4t2. This pathway will be of particular interest in light of the C₄ → 2 C₂ retro-aldol that facilitates autocatalysis to be discussed in the next section. Also, any of the open chain aldehydes can be partially sequestered by ring-closing reactions to form the furanoses (4r2, 4r3, 4r4). The furanoses are all slightly favored thermodynamically over their open chain aldehyde counterparts and the barriers to ring closure (and ring opening) are low (~12–16 kcal) and similar to aldehyde hydration barriers [17]. We expect these furanoses to be part of an equilibrating mixture. Since the formation of the aldehyde-thione 4t1 is less likely, we do not expect to see much of its ring-closed counterpart 4r1 either.

All values and structures shown in Figure 7 are sulfur analogs for D-erythrose and D-erythrulose. While we also calculated the G_rel values for D-threose, the overall story does not change and the differences in energies only show minor variances of ~0–2 kcal/mol. Thus, we do not include the threose/erythrose differences in the main body of this article to keep the discussion tractable. The relevant data for D-threose sulfur analogs can be found in Supplementary Materials.

3.5. Tetrose Aldol and Retro-Aldol Reactions

In the non-sulfur formose reaction, the (D-erythrose) C₄ → 2 C₂ retro-aldol reaction is endergonic by 2.2 kcal. Hence, autocatalysis likely does not kick in until there is a sufficient concentration buildup of C₄ versus C₂. The barrier is high at 31.5 kcal. Can the presence of sulfur change this situation?

Of the four retroaldol reactions in Figure 7 (blue arrows and boxes), only one is significantly exergonic: 4t1 → 2a + GA (glycolaldehyde). The split initially produces GA and the enol of mercaptoaldehyde, which easily tautomerizes into 2a. ΔG = −10.0 kcal for the overall reaction and the barrier is low (ΔG^‡ = +15.0 kcal) because the reactant is an activated species. However, it is very unlikely that high-energy 4t1 is formed in the first place, and we do not expect this pathway to be practically realized. (For all our C₄ → 2 C₂ retro-aldol reactions, the barriers for erythrose were consistently lower than for threose by 2–3 kcal.)

We expect some (albeit small) amount of 4a4 to be present in the mixture since it can be formed from the ketone thermodynamic sink 4k4. The reaction 4a4 → 2a + GA is endergonic by 1.6 kcal/mol, marginally less unfavorable than its non-sulfur counterpart, and the barrier is also marginally lower (ΔG^‡ = +29.9 kcal). Perhaps sulfur can accelerate this autocatalytic step ever so slightly. Unfortunately, the same cannot be said for 4a3 (formed in equilibrium with 4k3) because the retroaldol split leads to the thione 2t, and thus, the reaction is significantly endergonic (ΔG = +11.5 kcal). We were unable to cleanly locate the transition state because it forms a four-membered heterocycle intermediate that looks like the cycloaddition product of 2t and the enol of glycolaldehyde. More details are shown in Supplementary Materials, but this pathway is unlikely to occur.

The most interesting retroaldol reaction is 4a2 → 2a + GA. From our calculations, the reaction is marginally exergonic (ΔG = −0.2 kcal), although this is within the computational error, so we consider the reaction essentially thermoneutral. The barrier is still relatively high (ΔG^‡ = +28.7 kcal) although lower than that of the 4a4 retroaldol, and it is ~3 kcal lower than its non-sulfur counterpart. Thus, the situation is more promising in the presence of sulfur when the linchpin mercaptoaldehyde is present. Autocatalysis could begin earlier because the C₄ → 2 C₂ reaction is no longer endergonic, and the kinetics are slightly more favorable. The transition state for this retroaldol reaction is shown in Figure 10. It has a longer C…C distance of 2.64 Å and the H has not quite transferred to the carbonyl oxygen with an O…H distance of 1.40 Å (other distances are as expected).

Globally, the G_rel value for this retroaldol transition state of +22.2 kcal is on par with those of the C₂ + C₁ → C₃ and C₃ + C₁ → C₄ aldol addition reactions, which range from +20.4 to +25.0 kcal. We therefore expect that the retroaldol could potentially compete kinetically with further aldol additions such as C₄ + C₁ → C₅. If the C₄ enol is 4e2, addition of CH₂O will lead to the thermodynamically less stable thione-pentose as shown in the first row of Figure 11. Thus, we expect the C₄ → 2 C₂ retroaldol from 4a2 to be favored over further aldol addition of CH₂O. This is a unique situation, because it is not true thermodynamically for other C₄ + C₁ → C₅ additions, where formation of the linear 3-ketopentoses (via 4e1, 4e1-1 or 4e3-1) is significantly exergonic as shown in Figure 11. The 3-ketopentoses, however, are dead ends in the formose cycle: they are thermodynamic sinks that do not undergo retroaldol C₅ → C₃ + C₂ reactions, and their formation removes C₄ species from the pool, so such reactions are parasites of the autocatalytic cycle. And as we have seen in Figure 7, since aldol additions to branched products are much less favorable, enolization of the 3-ketopentoses followed by addition of CH₂O in a C₅ + C₁ → C₆ reaction is much less likely.

3.6. When C₁ Food Is Depleted

The scenarios discussed in the previous sections assume that the C₁ food species is abundant. But what happens when it begins to deplete? Depending on the relative concentration of the various C₂, C₃, and C₄ species, the following reactions could begin to be important: C₂ + C₂ → C₄ aldoses (the opposite of the retroaldol), C₂ + C₃ → C₅ aldoses or ketoses (depending on whether the C₃ or C₂ enolizes), C₂ + C₄ → C₆ aldoses or ketoses (depending on whether the C₄ or C₂ enolizes), and C₃ + C₃ → C₆ ketoses. This also opens up the possibility of incorporating more than one thiol group into a C₄, C₅ or C₆ species.

Of the C₂ + C₂ → C₄ reactions, the most favorable reaction between glycolaldehyde and mercaptoaldehyde is to form the 4-thioaldose 4a4 (see Figure 7). The reaction is overall exergonic (ΔG = −1.6 kcal) but the barrier is higher (ΔG^‡ = +28.3 kcal overall, or +20.7 kcal from the enol) compared to aldol additions involving the C₁ food species, which have barriers 2–5 kcal lower. If two mercaptoaldehyde molecules dimerize, they form the 2,4-dithioaldoses. For this reaction, ΔG = +0.6 kcal overall to form 2,4-dithioerythrose (the threose is 0.1 kcal/mol less stable) and the overall barrier is 27.6 kcal/mol. Neither of these sulfur analogs is as thermodynamically favorable as the dimerization of glycolaldehyde (ΔG = −2.2 kcal) to form erythrose.
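The free energy differences above can be translated into approximate equilibrium constants through K = exp(−ΔG/RT). The sketch below does this for the three C₂ + C₂ → C₄ dimerizations; the temperature of 298.15 K, the treatment of the quoted ΔG values as standard-state free energies, and the dictionary labels are assumptions made for illustration.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def equilibrium_constant(delta_g_kcal, temperature=298.15):
    """Equilibrium constant from a reaction free energy in kcal/mol."""
    return math.exp(-delta_g_kcal / (R_KCAL * temperature))


# Overall reaction free energies (kcal/mol) quoted in the text
dimerizations = {
    "GA + 2a -> 4a4": -1.6,
    "2a + 2a -> 2,4-dithioerythrose": +0.6,
    "GA + GA -> erythrose": -2.2,
}

constants = {name: equilibrium_constant(dg) for name, dg in dimerizations.items()}

for name, k_eq in constants.items():
    print(f"{name}: K ~ {k_eq:.2f}")
```

With these assumptions, the glycolaldehyde dimerization has the largest equilibrium constant and the mercaptoaldehyde dimerization is the only one of the three with K below 1, in line with the ordering described in the text.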

For the C₂ + C₃ → C₅ reactions, we see a similar story. The non-sulfur analog reactions are thermodynamically more favorable in the forward direction (and their products are therefore less likely to undergo the corresponding retroaldol). In Figure 12, the addition of glycolaldehyde (via its enol) to glyceraldehyde forms open-chain ribose with ΔG = −1.9 kcal, and the product can favorably undergo ring closure (the pyranose is more stable than the furanose; values shown are for the β anomers). The addition of glycolaldehyde to dihydroxyacetone (via its enol) to form ribulose has similar thermodynamics, with ΔG = −2.1 kcal (xylulose differed from ribulose by less than 0.1 kcal in free energy).

For the sulfur analogs, the addition of 3a2 and glycolaldehyde to form the open-chain 4-thioribose has ΔG = +0.1 kcal. Ring closure to both the pyranose and furanose is favorable; having sulfur in the ring stabilizes the furanose in this case. Starting from 3a3 and forming 5-thioribose has ΔG = −1.6 kcal because thiols on the terminal carbon are favored. Ring closure is favorable and the pyranose is more stable than the furanose (see our previous work [19] for a more detailed discussion on the position of thiol groups in open chain aldoses and rings). Based on our discussion of Figure 7, we expect 3a3 to be less accessible because 3k is the thermodynamic sink in that pathway. Thus, the more relevant sulfur analog is 3a2, certainly less favorable than its non-sulfur counterpart. If the C₂ species is mercaptoaldehyde, the results are similar but slightly less favorable.

For sulfur analogs forming C₅ ketoses, 3k can form two distinct enols and thus adding glycolaldehyde leads to 3-thioribulose (ΔG = +0.7 kcal) or 1-thioribulose (ΔG = 0.0 kcal). Both these reactions are thermodynamically less favorable than the non-sulfur counterpart. (The corresponding thio-xyluloses are within 0.4 kcal of the thio-ribuloses. Using mercaptoaldehyde as the C₂ species gives similar but slightly less favorable thermodynamics, as shown in Supplementary Materials.) Another possibility is to add glycolaldehyde to the enol of 3a2, but the ketose product is a thione and the reaction is significantly endergonic.

For C₂ + C₄ → C₆, as shown in Figure 13, we see the same trend. The aldol addition of erythrose and glycolaldehyde to form glucose is exergonic (ΔG = −1.6 kcal) while its sulfur analog is endergonic (ΔG = +1.1 kcal) starting from 4a2 (the most promising of the C₄ aldehydes) and glycolaldehyde. If mercaptoaldehyde is used as the C₂ species to form 2,4-dithioglucose, the reaction is more endergonic. Similar results are obtained for the C₃ + C₃ → C₆ reactions forming fructose (see Supplementary Materials).
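The trend across the C₂ + C₂, C₂ + C₃ and C₂ + C₄ additions can be summarized by tabulating the difference between the sulfur and non-sulfur reaction free energies. The sketch below uses only the ΔG values quoted in this section; the pairing of each sulfur analog with its non-sulfur counterpart, the labels, and the helper name `sulfur_penalty` are choices made here for illustration.

```python
# Reaction free energies (kcal/mol) quoted in this section,
# stored as (non-sulfur reaction, sulfur analog)
aldol_additions = {
    "C2 + C2 -> C4 (erythrose vs 4a4)": (-2.2, -1.6),
    "C2 + C3 -> C5 aldose (ribose vs 4-thioribose)": (-1.9, +0.1),
    "C2 + C3 -> C5 ketose (ribulose vs 1-thioribulose)": (-2.1, 0.0),
    "C2 + C4 -> C6 (glucose vs sulfur analog from 4a2)": (-1.6, +1.1),
}


def sulfur_penalty(pair):
    """Difference in reaction free energy: sulfur analog minus non-sulfur reaction."""
    oxygen_dg, sulfur_dg = pair
    return round(sulfur_dg - oxygen_dg, 1)


penalties = {name: sulfur_penalty(pair) for name, pair in aldol_additions.items()}

for name, ddg in penalties.items():
    print(f"{name}: {ddg:+.1f} kcal/mol")
```

For these four pairs the sulfur analog is less favorable in every case, by between 0.6 and 2.7 kcal/mol.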

While C₁ food is plentiful, the most favorable cross-Cannizzaro reaction, both thermodynamically and kinetically, involves the reduction of CH₂O to CH₃OH as discussed in the sections describing C₁ and C₂ reactions. When the food runs out, larger aldehydes could undergo Cannizzaro reactions which parasitize the autocatalytic cycle. A preliminary analysis of our calculations suggests that C₃ and C₄ species show similar energetics to the C₂ species shown in Figure 6. Hence, while Cannizzaro reactions are thermodynamically favorable, they have larger barriers and are kinetically less favorable than the aldol addition reactions.

4. Conclusions

If H₂S is incorporated as a thiol group in the formose reaction, its most salient contribution is utilizing mercaptoaldehyde as the C₂ linchpin species. Both its enolization barrier and the entry into the cycle via the first aldol addition (C₂ + C₁ → C₃) are kinetically more favorable in the sulfur analog, with barriers lowered by ~3 kcal. While there is no kinetic selectivity in forming the C₃ species, there is significant thermodynamic selectivity for the aldehyde 3a2 over 3t1. This could shunt the cycle through the reactions on the right-hand side of Figure 7. While the initial C₃ + C₁ → C₄ product is 4t2, it more favorably enolizes to 4e2 (over 4e3-2), thus favoring the formation of 4a2, the only C₄ aldehyde that has a thermodynamically favorable C₄ → 2 C₂ retroaldol reaction (the barrier is also ~3 kcal lower than the non-sulfur analog). Thus, the presence of sulfur could accelerate the core autocatalytic cycle of the formose reaction compared to its non-sulfur analog, and this pathway is the most significant positive result of our work.

However, the messiness does not go away. A wide diversity of C₃ and C₄ compounds is accessible as shown in Figure 7. Having the thiol group in different positions provides selectivity for some species over others, but also adds more compounds to the mix. Sulfur analogs do not slow down the competing Cannizzaro reaction, since its most likely channel is the reduction of CH₂O to methanol, which depletes the C₁ food. As CH₂O depletes, sulfur analogs less favorably undergo further aldol additions to form C₄, C₅, and C₆ species compared to their non-sulfur counterparts, although as concentrations of C₂ and C₃ build up, the equilibrium will shift towards larger species. It is unclear if slowing down the formation of larger species is favorable for kick-starting a proto-metabolism.

A question we asked but did not sufficiently answer in our previous work [19] was whether thiol groups could provide additional selectivity, especially if more than one thiol was present, and if there was a possibility that thiol groups could have served as precursor tags to phosphates in extant sugar metabolism. Having collected more data in this study, our answer at present is no. Forming dithiolated sugars is unfavorable, and even for monothiolated sugars, the thermodynamic favorability of the ketoses now causes these thermodynamic sinks to retard formose autocatalytic pathways. We are now considering if bisulfite analogs could lead to more pronounced selectivity instead of thiols.

Not included in this work is the intramolecular disproportionation of a thiolated sugar to form thioacids, or more promisingly, the addition of an organothiol to an aldehyde which disproportionates into a thioester. We are currently pursuing this possibility and expect to continue the story of the role of sulfur analogs in potential proto-metabolic autocatalytic cycles in a future publication.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/life15010001/s1, Energy breakdown of molecules and transition states; additional discussion sections and results referred to in main text.

Author Contributions

Conceptualization, J.K.; methodology, J.K.; writing—original draft, J.K.; subsequent edits, J.K.; supervision, J.K.; project administration, J.K.; formal analysis, J.K., M.T.P., S.N.C. and J.L.; investigation, J.K., M.T.P., S.N.C. and J.L.; data curation, J.K., M.T.P., S.N.C. and J.L.; visualization, J.K., M.T.P., S.N.C. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by NASA-Exobiology under Award 80NSSC24K0679.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available in Supplementary Materials.

Acknowledgments

This research was supported by the University of San Diego and a research grant from NASA-Exobiology (award # 80NSSC24K0679). J.L. acknowledges a BURST grant from the University of San Diego. Shared computing facilities were provided by the saber3 computing cluster at the University of San Diego.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Xavier, J.C.; Hordijk, W.; Kauffman, S.; Steel, M.; Martin, W.F. Autocatalytic chemical networks at the origin of metabolism. Proc. Biol. Sci. 2020, 287, 20192377.
2. Mizuno, T.; Weiss, A.H. Synthesis and Utilization of Formose Sugars. Adv. Carbohydr. Chem. Biochem. 1974, 29, 173–227.
3. Breslow, R. On the Mechanism of the Formose Reaction. Tetrahedron Lett. 1959, 1, 22–26.
4. Appayee, C.; Breslow, R. Deuterium studies reveal a new mechanism for the formose reaction involving hydride shifts. J. Am. Chem. Soc. 2014, 136, 3720–3723.
5. Ricardo, A.; Frye, F.; Carrigan, M.A.; Tipton, J.D.; Powell, D.H.; Benner, S.A. 2-Hydroxymethylboronate as a reagent to detect carbohydrates: Application to the analysis of the formose reaction. J. Org. Chem. 2006, 71, 9503–9505.
6. Omran, A.; Menor-Salvan, C.; Springsteen, G.; Pasek, M. The Messy Alkaline Formose Reaction and Its Link to Metabolism. Life 2020, 10, 125.
7. Robinson, W.E.; Daines, E.; van Duppen, P.; de Jong, T.; Huck, W.T.S. Environmental conditions drive self-organization of reaction pathways in a prebiotic reaction network. Nat. Chem. 2022, 14, 623–631.
8. van Duppen, P.; Daines, E.; Robinson, W.E.; Huck, W.T.S. Dynamic Environmental Conditions Affect the Composition of a Model Prebiotic Reaction Network. J. Am. Chem. Soc. 2023, 145, 7559–7568.
9. Bris, A.; Baltussen, M.G.; Tripodi, G.L.; Huck, W.T.S.; Franceschi, P.; Roithova, J. Direct Analysis of Complex Reaction Mixtures: Formose Reaction. Angew. Chem. Int. Ed. 2024, 63, e202316621.
10. Paschek, K.; Kohler, K.; Pearce, B.K.D.; Lange, K.; Henning, T.K.; Trapp, O.; Pudritz, R.E.; Semenov, D.A. Possible Ribose Synthesis in Carbonaceous Planetesimals. Life 2022, 12, 404.
11. Vinogradoff, V.L.; Leyva, V.; Mates-Torres, E.; Pepino, R.; Danger, G.; Rimola, A.; Cazals, L.; Serra, C.; Pascal, P.; Meinert, C. Olivine-catalyzed glycolaldehyde and sugar synthesis under aqueous conditions: Application to prebiotic chemistry. Earth Planet. Sci. Lett. 2024, 626, 118558.
12. Haas, M.; Lamour, S.; Christ, S.B.; Trapp, O. Mineral-mediated carbohydrate synthesis by mechanical forces in a primordial geochemical setting. Commun. Chem. 2020, 3, 140.
13. Omran, A. Plausibility of the Formose Reaction in Alkaline Hydrothermal Vent Environments. Orig. Life Evol. Biosph. 2023, 53, 113–125.
14. Rappoport, D.; Galvin, C.J.; Zubarev, D.Y.; Aspuru-Guzik, A. Complex Chemical Reaction Networks from Heuristics-Aided Quantum Chemistry. J. Chem. Theory Comput. 2014, 10, 897–907.
15. Simm, G.N.; Reiher, M. Context-Driven Exploration of Complex Chemical Reaction Networks. J. Chem. Theory Comput. 2017, 13, 6108–6119.
16. Roszak, R.; Wolos, A.B.; Benke, M.; Glen, L.; Konka, J.; Jensen, P.; Burgchardt, P.; Zadlo-Dobrowolska, A.; Janiuk, P.; Szymkuc, S.; et al. Emergence of metabolic-like cycles in blockchain-orchestrated reaction networks. Chem 2024, 10, 952–970.
17. Kua, J.; Avila, J.E.; Lee, C.G.; Smith, W.D. Mapping the Kinetic and Thermodynamic Landscape of Formaldehyde Oligomerization under Neutral Conditions. J. Phys. Chem. A 2013, 117, 12658–12667.
18. Kua, J.; Hernandez, A.L.; Velasquez, D.N. Thermodynamics of Potential CHO Metabolites in a Reducing Environment. Life 2021, 11, 1025.
19. Kua, J.; Miller, N.A. Preliminary Free Energy Map of Prebiotic Compounds Formed from CO₂, H₂ and H₂S. Life 2022, 12, 1763.
20. Braakman, R.; Smith, E. The compositional and evolutionary logic of metabolism. Phys. Biol. 2013, 10, 011001.
21. de Duve, C. The Beginnings of Life on Earth. Am. Sci. 1995, 83, 428–437.
22. Patel, B.H.; Percivalle, C.; Ritson, D.J.; Duffy, C.D.; Sutherland, J.D. Common origins of RNA, protein and lipid precursors in a cyanosulfidic protometabolism. Nat. Chem. 2015, 7, 301–307.
23. Youssef-Saliba, S.; Vallee, Y. Sulfur Amino Acids: From Prebiotic Chemistry to Biology and Vice Versa. Synthesis-Stuttgart 2021, 53, 2798–2808.
24. Wachtershauser, G. Before enzymes and templates: Theory of surface metabolism. Microbiol. Rev. 1988, 52, 452–484.
25. Kua, J.; Tripoli, L.P. Exploring the Core Formose Cycle: Catalysis and Competition. Life 2024, 14, 933.
26. Kua, J.; Hanley, S.W.; De Haan, D.O. Thermodynamics and Kinetics of Glyoxal Dimer Formation: A Computational Study. J. Phys. Chem. A 2008, 112, 66–72.
27. Kua, J.; Galloway, M.M.; Millage, K.D.; Avila, J.E.; De Haan, D.O. Glycolaldehyde Monomer and Oligomer Equilibria in Aqueous Solution: Comparing Computational Chemistry and NMR Data. J. Phys. Chem. A 2013, 117, 2997–3008.
28. Vosko, S.H.; Wilk, L.; Nusair, M. Accurate spin-dependent electron liquid correlation energies for local spin density calculations: A critical analysis. Can. J. Phys. 1980, 58, 1200–1211.
29. Becke, A.D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 1988, 38, 3098–3100.
30. Lee, C.; Yang, W.; Parr, R.G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 1988, 37, 785–789.
31. Becke, A.D. Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 1993, 98, 5648–5652.
32. Halgren, T.A. MMFF VII. Characterization of MMFF94, MMFF94s, and other widely available force fields for conformational energies and for intermolecular-interaction energies and geometries. J. Comput. Chem. 1999, 20, 730–748.
33. Warshel, A.; Florian, J. Computer simulations of enzyme catalysis: Finding out what has been optimized by evolution. Proc. Natl. Acad. Sci. USA 1998, 95, 5950–5955.
34. Wiberg, K.B.; Bailey, W.F. Chiral diamines 4: A computational study of the enantioselective deprotonation of Boc-pyrrolidine with an alkyllithium in the presence of a chiral diamine. J. Am. Chem. Soc. 2001, 123, 8231–8238.
35. Nielsen, R.J.; Keith, J.M.; Stoltz, B.M.; Goddard, W.A., 3rd. A computational model relating structure and reactivity in enantioselective oxidations of secondary alcohols by (-)-sparteine-Pd(II) complexes. J. Am. Chem. Soc. 2004, 126, 7967–7974.
36. Deubel, D.V.; Lau, J.K. In silico evolution of substrate selectivity: Comparison of organometallic ruthenium complexes with the anticancer drug cisplatin. Chem. Comm. 2006, 23, 2451–2453.
37. Wertz, D.H. Relationship between the gas-phase entropies of molecules and their entropies of solvation in water and 1-octanol. J. Am. Chem. Soc. 1980, 102, 5316–5322.
38. Abraham, M.H. Relationship between solution entropies and gas phase entropies of nonelectrolytes. J. Am. Chem. Soc. 1981, 103, 6742–6744.
39. Krizner, H.E.; De Haan, D.O.; Kua, J. Thermodynamics and Kinetics of Methylglyoxal Dimer Formation: A Computational Study. J. Phys. Chem. A 2009, 113, 6994–7001.
40. Rivlin, M.; Eliav, U.; Navon, G. NMR studies of the equilibria and reaction rates in aqueous solutions of formaldehyde. J. Phys. Chem. B 2015, 119, 4479–4487.

Figure 1. Core autocatalytic cycle of the formose reaction.

Figure 2. Reactions of CH₂O in the presence of H₂S.

Figure 3. C₁ Cannizzaro reactions in the presence of H₂S.

Figure 4. Transition states for adding H₂S to CH₂O and a Cannizzaro reaction (bond distances in Å).

Figure 5. Formation of mercaptoaldehyde from glycolaldehyde.

Figure 6. Addition reactions and Cannizzaro reactions of mercaptoaldehyde.

Figure 7. Overall relative free energy map of sulfur analogs and their possible reactions. Blue boxes show retroaldol products.

Figure 8. Transition state structures for the C₁ + C₂ → C₃ aldol addition. (Left: 2e + CH₂O → 3a2, Right: 2e + CH₂O → 3t1).

Figure 9. Examples of cis (3t1 → 3e1-1) and trans (3e1-1 → 3k) enolization transition states.

Figure 10. Transition state for the slightly exergonic retroaldol C₄ → 2 C₂ reaction (4a2 → 2a + GA).

Figure 11. Thermodynamics of C₄ + C₁ → C₅ aldol additions; G_rel of CH₂O is +7.9 kcal.

Figure 12. Thermodynamics of C₂ + C₃ → C₅ aldol additions.

Figure 13. Thermodynamics of C₂ + C₄ (erythrose) → C₆ (glucose) aldol additions.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

