Straightforward Creation of Possibly Prebiotic Complex Mixtures of Thiol-Rich Peptides

At the origin of life, extremely diverse mixtures of oligomers and polymers could be obtained from relatively simple molecular bricks. Here, we present an example of the polymerization of two amidonitriles derived from cysteine, Cys-Ala-CN and Cys-Met-CN. The thiol function in a molecule adds onto the nitrile group of another one, allowing efficient condensation reactions and making available an extensive range of polymers containing amide bonds and/or five-membered heterocycles, namely thiazolines. Macrocycles were also identified, the biggest one containing sixteen residues (cyclo(Cys-Met)8). MALDI-TOF mass spectrometry was used to identify all the present species. What these examples show is that complex mixtures are likely to have formed on the primitive Earth and that, ultimately, the selection that must have followed may have been an even more crucial step towards life than the synthesis of the pre-biological species themselves.


Introduction
The concept of life's origin is related to the chemistry of primitive molecules and minerals [1][2][3][4][5]. The early Earth [6] was a complex reactor in which a continuous flow of numerous chemical reactions occurred. The variety of possible reagents and small organic molecules was huge [7]. Their reactions produced masses of larger molecules, oligomers, and polymers, which could lead to a crowded situation [8]. In such a messy environment, the way to life was not a straight road but an entanglement of uncertain paths, dead ends, backtracking, narrow passes, and roundabouts.
Even a very simple starting material, such as formaldehyde [9], may lead to an entire library of derivatives, including hydroxy acetic acid, glycine, N-substituted glycines, and hydroxyacetaldehyde, hence providing lots of sugars [10] and various nitrogen heterocycles. The reaction of any given aldehyde with ammonia and cyanide anion would give an aminonitrile, which would be hydrolysed to an aminoamide and then to an amino acid [11,12]. Amino acids would condense, delivering messy mixtures of peptides, including not only linear ones but also branched peptides thanks to diamino acids (e.g., lysine). However, such polymerization would be counterbalanced by amide bond hydrolysis, so there would always be monomers and short chains, and the most stable sequences would be favored in long chains. Esters would be formed from serine residues, adding ester functions to the polyamide main chains, and thioesters may also be formed from cysteine residues (compared to non-ribosomal aminoacyl transferases nowadays) [13]. In addition, nothing precludes monomeric amino acids from reacting with other types of monomers. They may react with sugars at the anomeric position or at other positions (as they do currently at the 3 and 2 positions of the terminal adenosine of t-RNAs).They may also react with many other acids to give diverse amides (different from peptides), just as aspartic acid takes part in the de novo synthesis of inosine, for instance [14]. Many heterocycles would also be present [15,16]. Thus, starting from a great multiplicity of Figure 1. Part of the "prebiotic mess" from which life originated. Traffic circle sign: chemical link between groups of reactions; no entry sign: non-efficient processes. Some non-represented reagents are necessary, such as HCN and ammonia.
Looking into proteins, cysteine residues with their thiol functional group shows a remarkable tendency for their presence in catalytic sites [17]. The presence of thiol groups allows protein folding and redox reactions via the formation of disulfide bonds. As they are part of iron-sulfur clusters, they are key players in the formation of metalloproteins, such as ferredoxins (four cysteine residues involved), and in zinc fingers (in which up to six cysteine units are involved), both of which are important in protein-protein and DNA (or RNA)-protein interactions. In view of the importance of thiols, the hypothesis that cysteine has been integrated into the primitive peptide chains is persuasive [18]. Plausible prebiotic syntheses of cysteine have been described [19,20]. The next question is: how were cysteine residues able to assemble with the other amino acids to form thiol-containing peptides that grow and finally gain a function? It is possible that both the formation and polymerization of amino acid units were related and synchronized while in harmony with the geochemical environment. Hydrogen cyanide may have played a crucial role in such chemistry. HCN is widespread in the cosmos and seen in different areas, such as near Looking into proteins, cysteine residues with their thiol functional group shows a remarkable tendency for their presence in catalytic sites [17]. The presence of thiol groups allows protein folding and redox reactions via the formation of disulfide bonds. As they are part of iron-sulfur clusters, they are key players in the formation of metalloproteins, such as ferredoxins (four cysteine residues involved), and in zinc fingers (in which up to six cysteine units are involved), both of which are important in protein-protein and DNA (or RNA)-protein interactions. In view of the importance of thiols, the hypothesis that cysteine has been integrated into the primitive peptide chains is persuasive [18]. Plausible prebiotic syntheses of cysteine have been described [19,20]. The next question is: how were cysteine residues able to assemble with the other amino acids to form thiol-containing peptides that grow and finally gain a function? It is possible that both the formation and polymerization of amino acid units were related and synchronized while in harmony with the geochemical environment. Hydrogen cyanide may have played a crucial role in such chemistry. HCN is widespread in the cosmos and seen in different areas, such as near carbon stars, proto-planetary nebulae, and comets [21,22]. Therefore, it is likely that HCN existed on the early Earth. This would have given a central role to the Strecker reaction  From the  aminonitrile intermediate of the Strecker reaction, dipeptides containing terminal nitrile  function may have been formed, including Cys-AA-CN (where AA stands for any amino  acid) [23]. In this paper, we describe our results from the condensation of two representative examples of these aminothiol-containing amidonitriles and demonstrate that they readily polymerize, thereby giving highly complex mixtures of thiol-rich polypeptides (Scheme 1c).
Life 2023, 13, 983 3 of 17 carbon stars, proto-planetary nebulae, and comets [21,22]. Therefore, it is likely that HCN existed on the early Earth. This would have given a central role to the Strecker reaction (Scheme 1a) in the synthesis of amino acids, including cysteine (Scheme 1b) . From the  aminonitrile intermediate of the Strecker reaction, dipeptides containing terminal nitrile  function may have been formed, including Cys-AA-CN (where AA stands for any amino  acid) [23]. In this paper, we describe our results from the condensation of two representative examples of these aminothiol-containing amidonitriles and demonstrate that they readily polymerize, thereby giving highly complex mixtures of thiol-rich polypeptides (Scheme 1c).

Materials and Methods
Products were observed in reaction mixtures by NMR spectroscopy ( 1 H and 13 C) and mass spectrometry. NMR-monitored reactions were run in D2O solutions in NMR tubes. The NMR apparatus was Bruker Avance III 400 or 500. For the mass experiments, H2O was used as the solvent. High-resolution mass spectra were recorded on a Waters G2-S Q-TOF mass spectrometer or on a LTQ Orbitrap XL (Thermo Scientific, Waltham, MA, USA) spectrometer. Low-resolution ESI analysis was performed on an Amazon speed (Brucker Daltonics, Billerica, MA, USA) IonTrap spectrometer. For MALDI analysis, DMSO was used as the solvent to dissolve the obtained polymers, and DHB (2,5-Dihydroxybenzoic acid) was used as the matrix.
General synthetic pathway was used for the preparation of amidonitriles Cys-Met-CN and Cys-Ala-CN (Scheme 2).
Boc-Cys(Trt)-OH (N-(tert-butoxycarbonyl)-S-trityl-L-cysteine) was activated with IBCF (isobutyl chloroformate) in the presence of NMM (N-methyl morpholine). Then, methyl esters of, respectively, methionine 1 and alanine 2 were added to give the corresponding esters: respectively, 3 and 4. The esters were then transformed into the corresponding acids 5 and 6. These acids were activated with IBCF and NMM, and then gaseous ammonia was added to give the amides 7 and 8. The obtained amides were then transformed to nitriles 9 and 10 using cyanuric chloride in DMF. The last step of the synthesis was the cleavage of Boc and Trt groups to obtain the nitriles 11 and 12. The final salts were recovered in D2O for NMR studies or in H2O for mass analysis.

Materials and Methods
Products were observed in reaction mixtures by NMR spectroscopy ( 1 H and 13 C) and mass spectrometry. NMR-monitored reactions were run in D 2 O solutions in NMR tubes. The NMR apparatus was Bruker Avance III 400 or 500. For the mass experiments, H 2 O was used as the solvent. High-resolution mass spectra were recorded on a Waters G2-S Q-TOF mass spectrometer or on a LTQ Orbitrap XL (Thermo Scientific, Waltham, MA, USA) spectrometer. Low-resolution ESI analysis was performed on an Amazon speed (Brucker Daltonics, Billerica, MA, USA) IonTrap spectrometer. For MALDI analysis, DMSO was used as the solvent to dissolve the obtained polymers, and DHB (2,5-Dihydroxybenzoic acid) was used as the matrix.
General synthetic pathway was used for the preparation of amidonitriles Cys-Met-CN and Cys-Ala-CN (Scheme 2).
Boc-Cys(Trt)-OH (N-(tert-butoxycarbonyl)-S-trityl-L-cysteine) was activated with IBCF (isobutyl chloroformate) in the presence of NMM (N-methyl morpholine). Then, methyl esters of, respectively, methionine 1 and alanine 2 were added to give the corresponding esters: respectively, 3 and 4. The esters were then transformed into the corresponding acids 5 and 6. These acids were activated with IBCF and NMM, and then gaseous ammonia was added to give the amides 7 and 8. The obtained amides were then transformed to nitriles 9 and 10 using cyanuric chloride in DMF. The last step of the synthesis was the cleavage of Boc and Trt groups to obtain the nitriles 11 and 12. The final salts were recovered in D 2 O for NMR studies or in H 2 O for mass analysis. Characterization data for intermediates 7 and 8 Characterization data for intermediates 7 and 8 tert-butyl((R)-1-(((S)-1-amino-4-(methylthio)-1-oxobutan-2-yl)amino)-1-oxo-3-(trityl thio)propan-2-yl)carbamate (7) m.

Preparation of nitriles 9 and 10
General procedure: amide 7 or 8 (1 equiv.) was dissolved in DMF (6 mL). Cyanuric chloride (1 equiv.) was added. The mixture was stirred for 1 h at rt and then quenched with water and extracted three times with EA. The organic phase was washed five times with water, dried, and evaporated under vacuum. Flash chromatography afforded the pure nitriles 9 and 10.

Results
As indicated in the Materials and Methods section, we have chosen to carry out nonprebiotic syntheses of our starting compounds, which allowed us to obtain homogeneous materials 11 and 12. As we expected the formation of complex mixtures from these materials, this was a necessary compromise between authenticity and feasibility of the mixture analyses. On the primitive Earth, these compounds could have been obtained by one of the many prebiotic methods proposed for the formation of amide bonds [24].
When the salts of the activated derivatives 11 and 12 were left in aqueous solution (pH fixed at 6.5-7 by adding sodium bicarbonate), the solution became cloudy after a few hours, and a white precipitate was deposited after one day. For NMR studies, the mixtures were left in NMR tubes without further purification. For MALDI analysis, the precipitate was filtered, washed several times with water, and, finally, dried under vacuum.

Formation of Cysteine-Methionine Polymers
The first test was run in deuterated water (D2O). After the deprotection of compound 9, Cys-Met-CN 11 was extracted as a TFA salt in D2O. The obtained nitrile was found to For more details, see Supplementary Files.

Results
As indicated in the Materials and Methods section, we have chosen to carry out nonprebiotic syntheses of our starting compounds, which allowed us to obtain homogeneous materials 11 and 12. As we expected the formation of complex mixtures from these materials, this was a necessary compromise between authenticity and feasibility of the mixture analyses. On the primitive Earth, these compounds could have been obtained by one of the many prebiotic methods proposed for the formation of amide bonds [24].
When the salts of the activated derivatives 11 and 12 were left in aqueous solution (pH fixed at 6.5-7 by adding sodium bicarbonate), the solution became cloudy after a few hours, and a white precipitate was deposited after one day. For NMR studies, the mixtures were left in NMR tubes without further purification. For MALDI analysis, the precipitate was filtered, washed several times with water, and, finally, dried under vacuum.

Formation of Cysteine-Methionine Polymers
The first test was run in deuterated water (D 2 O). After the deprotection of compound 9, Cys-Met-CN 11 was extracted as a TFA salt in D 2 O. The obtained nitrile was found to be stable in acidic water (pH 1-2) for several days. The acidic solution was neutralized using sodium bicarbonate to reach a neutral pH (6.5-7). At this stage, we observed a turbid solution that, within a day, gave a white solid precipitate. This indicates the formation of polymeric materials based on cysteine and methionine amino acids (Scheme 3). be stable in acidic water (pH 1-2) for several days. The acidic solution was neutralized using sodium bicarbonate to reach a neutral pH (6.5-7). At this stage, we observed a turbid solution that, within a day, gave a white solid precipitate. This indicates the formation of polymeric materials based on cysteine and methionine amino acids (Scheme 3). The experiment was conducted again using degassed water H2O. After allowing the solution to remain undisturbed at room temperature overnight, a thin white solid layer was seen at the container's base (Figure 2). This solid was extracted, dissolved in DMSO, and analyzed through MALDI spectrometry to discern the peptide mixture present. The experiment was conducted again using degassed water H 2 O. After allowing the solution to remain undisturbed at room temperature overnight, a thin white solid layer was seen at the container's base (Figure 2). This solid was extracted, dissolved in DMSO, and analyzed through MALDI spectrometry to discern the peptide mixture present.

Scheme 3. Polymerization of amidonitrile 11.
The experiment was conducted again using degassed water H2O. After allowing the solution to remain undisturbed at room temperature overnight, a thin white solid layer was seen at the container's base (Figure 2). This solid was extracted, dissolved in DMSO, and analyzed through MALDI spectrometry to discern the peptide mixture present. The polymerization process starts with the formation of thiazoline rings (Figure 3) [23]. A nucleophilic attack of the thiol of one unit on the nitrile group of a second unit is the starting point. Then, the amine adds onto the obtained iminium double bond in 13 to form a transient thiazolidine 14, which, upon NH3 loss, gives the more stable thiazoline cycle 15. This thiazoline is then slowly hydrolyzed to give dipeptide 16. The formed dimer 16 still has both active sides (a thiol and a nitrile) and it can continue the polymerization process to reach higher-length peptides, which indeed have been observed in mass spectrometry analysis. The polymerization process starts with the formation of thiazoline rings (Figure 3) [23]. A nucleophilic attack of the thiol of one unit on the nitrile group of a second unit is the starting point. Then, the amine adds onto the obtained iminium double bond in 13 to form a transient thiazolidine 14, which, upon NH 3 loss, gives the more stable thiazoline cycle 15. This thiazoline is then slowly hydrolyzed to give dipeptide 16. The formed dimer 16 still has both active sides (a thiol and a nitrile) and it can continue the polymerization process to reach higher-length peptides, which indeed have been observed in mass spectrometry analysis.
The MALDI mass spectrometry analysis ( Figure 4) revealed multiple polypeptide chains of varying lengths up to 25 units (which corresponds to a 50 amino acid sequence). The spectrum that was obtained did not present the anticipated peptides as individual masses (one mass for each (Cys-Met) n ). Instead, a collection of masses was observed for each given peptide length. These masses were attributed to the presence of a varied number of hydrolyzed and non-hydrolyzed thiazoline rings in each chain.
As an example, the multiple masses detected for (Cys-Met) 14 -CN are shown in Figure 5. The first (and lowest) mass detected in this zone was 3068.278, which is compatible with the presence of thirteen thiazoline rings, the highest number of thiazoline rings possible for fourteen Cys-Met units. The following mass, 3091.303, corresponds to one amide (e.g., one hydrolyzed thiazoline) and twelve thiazoline rings. The mass, 3106.822, stands for eleven thiazoline rings, and so on. With less rings, higher masses are expected; the difference between two peaks is 18, the mass of a water molecule, which is required for hydrolysis of one thiazoline ring.
As an example, the peptide structure 18, which consists of six Cys-Met units, contains three thiazoline rings, and ends with a nitrile, is presented in Figure 6.   The MALDI mass spectrometry analysis ( Figure 4) revealed multiple polypeptide chains of varying lengths up to 25 units (which corresponds to a 50 amino acid sequence). The spectrum that was obtained did not present the anticipated peptides as individual masses (one mass for each (Cys-Met)n). Instead, a collection of masses was observed for each given peptide length. These masses were attributed to the presence of a varied number of hydrolyzed and non-hydrolyzed thiazoline rings in each chain. As an example, the multiple masses detected for (Cys-Met)14-CN are shown in Figure  5. The first (and lowest) mass detected in this zone was 3068.278, which is compatible with the presence of thirteen thiazoline rings, the highest number of thiazoline rings possible   The MALDI mass spectrometry analysis (Figure 4) revealed multiple polypeptide chains of varying lengths up to 25 units (which corresponds to a 50 amino acid sequence). The spectrum that was obtained did not present the anticipated peptides as individual masses (one mass for each (Cys-Met)n). Instead, a collection of masses was observed for each given peptide length. These masses were attributed to the presence of a varied number of hydrolyzed and non-hydrolyzed thiazoline rings in each chain. As an example, the multiple masses detected for (Cys-Met)14-CN are shown in Figure  5. The first (and lowest) mass detected in this zone was 3068.278, which is compatible with the presence of thirteen thiazoline rings, the highest number of thiazoline rings possible Most observed masses belonged to peptides ending with a nitrile function. However, we also observed some chains ending with a carboxylic acid group (resulting from nitrile hydrolysis). Some other examples of masses observed from MALDI experiments are given in Table 1.
In many cases, if we focus on a specific region in the MALDI spectrum, we notice an odd peak that is not consistent with the rest of the distribution. An example of such a unique peak is presented in Figure 7. for fourteen Cys-Met units. The following mass, 3091.303, corresponds to one amide (e.g., one hydrolyzed thiazoline) and twelve thiazoline rings. The mass, 3106.822, stands for eleven thiazoline rings, and so on. With less rings, higher masses are expected; the difference between two peaks is 18, the mass of a water molecule, which is required for hydrolysis of one thiazoline ring. As an example, the peptide structure 18, which consists of six Cys-Met units, contains three thiazoline rings, and ends with a nitrile, is presented in Figure 6. Most observed masses belonged to peptides ending with a nitrile function. However, we also observed some chains ending with a carboxylic acid group (resulting from nitrile hydrolysis). Some other examples of masses observed from MALDI experiments are given in Table 1. one hydrolyzed thiazoline) and twelve thiazoline rings. The mass, 3106.822, stands for eleven thiazoline rings, and so on. With less rings, higher masses are expected; the difference between two peaks is 18, the mass of a water molecule, which is required for hydrolysis of one thiazoline ring. As an example, the peptide structure 18, which consists of six Cys-Met units, contains three thiazoline rings, and ends with a nitrile, is presented in Figure 6. Most observed masses belonged to peptides ending with a nitrile function. However, we also observed some chains ending with a carboxylic acid group (resulting from nitrile hydrolysis). Some other examples of masses observed from MALDI experiments are given in Table 1.  At any degree of polymerization, the obtained polymer contains the two active functions: a nitrile group and an aminothiol. Thus, the molecule has two pathways to choose from: either to react with another monomer in order to give a longer peptide chain, or to perform a cyclization reaction to give a macrocycle. For example, (Cys-Met) 5 -CN 19 (Figure 8) can undertake further reaction to give a longer linear peptide chain, or it may cyclize if the thiol function at the opposite end of the chain performs an intramolecular reaction with the nitrile. Such a process eventually yields the macrocycle 20, containing five thiazoline rings, which agrees with the observed mass of 1081.959 (Figure 7). The observed structures of cyclic poly(Cys-Met) are presented in Table 2. In many cases, if we focus on a specific region in the MALDI spectrum, we notice an odd peak that is not consistent with the rest of the distribution. An example of such a unique peak is presented in Figure 7. At any degree of polymerization, the obtained polymer contains the two active functions: a nitrile group and an aminothiol. Thus, the molecule has two pathways to choose from: either to react with another monomer in order to give a longer peptide chain, or to perform a cyclization reaction to give a macrocycle. For example, (Cys-Met)5-CN 19 (Figure 8) can undertake further reaction to give a longer linear peptide chain, or it may cyclize if the thiol function at the opposite end of the chain performs an intramolecular reaction with the nitrile. Such a process eventually yields the macrocycle 20, containing five thiazoline rings, which agrees with the observed mass of 1081.959 (Figure 7). The observed structures of cyclic poly(Cys-Met) are presented in Table 2.  Masses presented in Table 2 are average masses. To confirm the proposed structures, we also performed high-resolution mass spectrometry (HRMS-SEI). As shown in Figure  9, the found exact masses agree with the proposed thiazoline-containing cyclopeptides cyclo(CysMet)n for n between 3 and 6. Unfortunately, the quantities of cyclo(CysMet)7 and cyclo(CysMet)8 in the mixture of polymers were not sufficient to allow the measurement of their exact masses.  Masses presented in Table 2 are average masses. To confirm the proposed structures, we also performed high-resolution mass spectrometry (HRMS-SEI). As shown in Figure 9, the found exact masses agree with the proposed thiazoline-containing cyclopeptides cyclo(CysMet) n for n between 3 and 6. Unfortunately, the quantities of cyclo(CysMet) 7 and cyclo(CysMet) 8 in the mixture of polymers were not sufficient to allow the measurement of their exact masses. In the case of the observed macrocycles, none of the thiazoline rings underwent hydrolysis. It is probable that the bent geometry of the chain containing thiazoline rings favors the reaction between its two active ends. Precipitation of these molecules then prevents any hydrolysis from occurring.

Formation of Cysteine-Alanine Polymers
The behaviour of Cys-Ala-CN 12 was very similar to the behaviour of 11. It was stable in acidic water, but a precipitate formed when the pH was raised to 6.5-7. The solution started to become turbid, then a white solid slowly precipitated out of the solution, indicating the formation of polymeric materials (Scheme 4). As shown in Figure 10, thiazoline rings were characterized by 1 H-NMR spectroscopy thanks to the appearance of new signals around 5 ppm (CH of thiazolines) and in the region from 3.3 to 3.6 ppm (CH2 of thiazolines). In the case of the observed macrocycles, none of the thiazoline rings underwent hydrolysis. It is probable that the bent geometry of the chain containing thiazoline rings favors the reaction between its two active ends. Precipitation of these molecules then prevents any hydrolysis from occurring.

Formation of Cysteine-Alanine Polymers
The behaviour of Cys-Ala-CN 12 was very similar to the behaviour of 11. It was stable in acidic water, but a precipitate formed when the pH was raised to 6.5-7. The solution started to become turbid, then a white solid slowly precipitated out of the solution, indicating the formation of polymeric materials (Scheme 4). In the case of the observed macrocycles, none of the thiazoline rings underwent hydrolysis. It is probable that the bent geometry of the chain containing thiazoline rings favors the reaction between its two active ends. Precipitation of these molecules then prevents any hydrolysis from occurring.

Formation of Cysteine-Alanine Polymers
The behaviour of Cys-Ala-CN 12 was very similar to the behaviour of 11. It was stable in acidic water, but a precipitate formed when the pH was raised to 6.5-7. The solution started to become turbid, then a white solid slowly precipitated out of the solution, indicating the formation of polymeric materials (Scheme 4). As shown in Figure 10, thiazoline rings were characterized by 1 H-NMR spectroscopy thanks to the appearance of new signals around 5 ppm (CH of thiazolines) and in the region from 3.3 to 3.6 ppm (CH2 of thiazolines). As shown in Figure 10, thiazoline rings were characterized by 1 H-NMR spectroscopy thanks to the appearance of new signals around 5 ppm (CH of thiazolines) and in the region from 3.3 to 3.6 ppm (CH 2 of thiazolines). The experiment was repeated in degassed H2O in order to perform mass spectrometry. A solution of 12 was neutralized by sodium bicarbonate, and the solution was left unstirred overnight under inert atmosphere to give a white powder at the bottom of the reaction container. Then, the solid was recovered and subjected to MALDI-TOF analysis ( Figure 11). Taking as an example the region of masses shown in Figure 12, we can observe seven single masses, which correspond to seven Cys-Ala units that end with a nitrile. The highest number of possible thiazoline rings in the case of (Cys-Ala)7-CN is six rings, and this is reflected in the calculated mass of 1134.4598 (observed 1134.965). The adjacent peak, 1152.168, contains one hydrolyzed thiazoline ring, and five rings are still present. The experiment was repeated in degassed H 2 O in order to perform mass spectrometry. A solution of 12 was neutralized by sodium bicarbonate, and the solution was left unstirred overnight under inert atmosphere to give a white powder at the bottom of the reaction container. Then, the solid was recovered and subjected to MALDI-TOF analysis ( Figure 11). The experiment was repeated in degassed H2O in order to perform mass spectrometry. A solution of 12 was neutralized by sodium bicarbonate, and the solution was left unstirred overnight under inert atmosphere to give a white powder at the bottom of the reaction container. Then, the solid was recovered and subjected to MALDI-TOF analysis ( Figure 11). Taking as an example the region of masses shown in Figure 12, we can observe seven single masses, which correspond to seven Cys-Ala units that end with a nitrile. The highest number of possible thiazoline rings in the case of (Cys-Ala)7-CN is six rings, and this is reflected in the calculated mass of 1134.4598 (observed 1134.965). The adjacent peak, 1152.168, contains one hydrolyzed thiazoline ring, and five rings are still present. Taking as an example the region of masses shown in Figure 12, we can observe seven single masses, which correspond to seven Cys-Ala units that end with a nitrile. The highest number of possible thiazoline rings in the case of (Cys-Ala) 7 -CN is six rings, and this is reflected in the calculated mass of 1134.4598 (observed 1134.965). The adjacent peak, 1152.168, contains one hydrolyzed thiazoline ring, and five rings are still present.
It is impossible to discern the exact location of the thiazoline rings in the chain. For instance, the peak of 1184.979, observed in (Cys-Ala) 7 -CN, is likely due to a mixture of oligomers with three hydrolyzed and three remaining thiazoline rings. All of these structures are heptamers of Cys-Ala, meaning that they consist of a chain of fourteen amino acids 17 ( Figure 13). It is impossible to discern the exact location of the thiazoline rings in the chain. For instance, the peak of 1184.979, observed in (Cys-Ala)7-CN, is likely due to a mixture of oligomers with three hydrolyzed and three remaining thiazoline rings. All of these structures are heptamers of Cys-Ala, meaning that they consist of a chain of fourteen amino acids 17 ( Figure 13). Some examples of the found linear polypeptides that contain thiazoline rings are shown in Table 3.

Discussion
What we have tried to prove here is that starting from relatively simple molecules, complex mixtures can be obtained [25]. Thus, from the aminothiol-containing amidonitrile  It is impossible to discern the exact location of the thiazoline rings in the chain. For instance, the peak of 1184.979, observed in (Cys-Ala)7-CN, is likely due to a mixture of oligomers with three hydrolyzed and three remaining thiazoline rings. All of these structures are heptamers of Cys-Ala, meaning that they consist of a chain of fourteen amino acids 17 ( Figure 13). Some examples of the found linear polypeptides that contain thiazoline rings are shown in Table 3.

Discussion
What we have tried to prove here is that starting from relatively simple molecules, complex mixtures can be obtained [25]. Thus, from the aminothiol-containing amidonitrile Some examples of the found linear polypeptides that contain thiazoline rings are shown in Table 3. Table 3. Representative examples of (Cys-Ala) n structures.

Discussion
What we have tried to prove here is that starting from relatively simple molecules, complex mixtures can be obtained [25]. Thus, from the aminothiol-containing amidonitrile emphasized [26,27]. For example, thiamine pyrophosphate, containing a key thiazole ring and present in all living things, cooperates with pyruvate synthase to catalyze one of the oldest anabolic methods of CO 2 fixation, the Wood-Ljungdahl pathway [28,29]. The presence of thiazoles in the primitive environment may have facilitated the rise of a proto-metabolism.
For this spontaneous diversification to happen, the starting molecules must be functionalized and even multi-functionalized. Our starting molecules contain an amide function, which is poorly reactive, a nitrile, an amine, and a thiol. Indeed, nitriles are not so reactive; for instance, their hydrolysis is pretty slow [30], and their reaction with alcohols (the so-called Pinner reaction) [31] requires a strong acid catalyst. Amines are nucleophilic. Unfortunately, they are protonated in water, even at a moderately basic pH, which makes their reactivity dependent on the presence of scarce amounts of their non-protonated forms. Thiols do not face this problem. They are more reactive when deprotonated (pH > 7-8), but remain nucleophilic even in their neutral form.
Of the four functions present in our starting molecules, it is indeed the thiol function that plays the leading role. It is sufficiently nucleophilic to attack the nitrile group of a second molecule, even if this nitrile is relatively weakly electrophilic. Without this thiol, nothing would happen (except probably the slow hydrolysis of the nitrile). It is only after the action of the thiol that the amine intervenes. This amine is, of course, protonated most of the time, but as soon as it is deprotonated, its free form attacks the intermediate C=N double bond. This is a rapid attack because it corresponds to the closing of a five-membered cycle. The attack of the thiol is a prerequisite [32].
We believe that at the origin of life, thiols, which are easy to obtain and which are reactive in any conditions (except perhaps at a very low temperature), participated in the creation of the necessary molecular diversity in which life picked up the molecules that were essential to its development.
To go towards even more diversity, it would of course be appropriate to mix, for example, several amidonitriles. However, this would create such complex mixtures that they could no longer be analyzed by mass spectrometry (at least not by the techniques at our disposal). A whole world of molecules can emerge from actually very simple starting compounds; a world of linear chains and cycles, big and small, and not just thiazolines, of course [33][34][35]. Chemistry before life may have been a "prebiotic mess" from which emerged the chemistry of life today; as complex as this may seem to us, it is perhaps only a small extract.
Supplementary Materials: The following supporting information can be downloaded at: https: //www.mdpi.com/article/10.3390/life13040983/s1, supporting information containing detailed experimental part, 1 H and 13 C NMR spectra and MALDI spectra.