Diatom-Specific Oligosaccharide and Polysaccharide Structures Help to Unravel Biosynthetic Capabilities in Diatoms

Diatoms are marine organisms that represent one of the most important sources of biomass in the ocean, accounting for about 40% of marine primary production, and in the biosphere, contributing up to 20% of global CO2 fixation. There has been a recent surge in developing the use of diatoms as a source of bioactive compounds in the food and cosmetic industries. In addition, the potential of diatoms such as Phaeodactylum tricornutum as cell factories for the production of biopharmaceuticals is currently under evaluation. These biotechnological applications require a comprehensive understanding of the sugar biosynthesis pathways that operate in diatoms. Here, we review diatom glycan and polysaccharide structures, thus revealing their sugar biosynthesis capabilities.


Introduction
Among ocean phytoplankton, diatoms are highly diverse with an estimated 10 5 to 10 7 species [1]. Marine diatoms make up an important group: they contribute to approximately 40% of primary productivity in marine ecosystems and 20% of global carbon fixation [2][3][4][5]. Diatoms also participate in the ocean silica cycle [6][7][8][9], iron cycle [8,[10][11][12] and nitrogen cycle [13][14][15][16]. Due to their high diversity and very specific metabolism, diatoms have been used as bio-indicators and filters for controlling and purifying contaminated water [17][18][19][20]. For example, some diatoms such as Cylindrotheca fusiformis, Cyclotella cryptica, Phaeodactylum tricornutum, Skeletonema costatum, and Thalassiosira pseudonana have been used to absorb high quantities of heavy metals [21][22][23]. Diatoms are also used in nanotechnology to produce living nano-scale structures because they can build a silica shell at room temperature from a very small amount of silica dissolved in water [24][25][26][27]. In parallel, diatoms have been explored as sources of bioactive metabolites. Such compounds have many uses in the food industry. For example, diatoms have long been used as feedstock in aquaculture [28] and more recently in human health and food supplements ( Figure 1). Given their ability to produce carotenoids, phytosterols, vitamins, and antioxidants, diatoms have become valuable sources of food supplements for humans [29]. Moreover, they can synthesize large amounts of polyunsaturated fatty acids, which are bioactive substances proven to promote human health (e.g., decrease in frequency of cardiovascular diseases and cancers) and growth in animals [19,30,31]. Additionally, the major carotenoid of diatoms, the brown-colored fucoxanthin, is used as an antioxidant, anti-inflammatory, anti-diabetes and anti-cancer drug [32,33], as well as for its protective effect on liver, eyes, blood vessels, skin, and lungs [32,34]. Anti-inflammatory and immunostimulating activities of diatoms polysaccharides such as laminarin have also been reported as effective in various fish species [35][36][37]. Additionally, other polysaccharides such as chrysolaminarin from the diatom Chaetoceros muelleri have been shown to be promising candidates as immuno-stimulatory food additives in aquaculture [38]. Chrysolaminarin isolated from the diatom Synedra acus shows anti-tumor activity by inhibiting the proliferation of human colon cancer cells and colony formation [39]. Recently, polysaccharides from algae (including diatoms) have attracted interest in the cosmetic industry: some sulfated polysaccharides have already been tested to prevent the accumulation and the activity of free radicals and reactive chemical species, therefore acting as protective systems against oxidative stress [40]. The use of diatoms is thus likely to expand in the future. Additionally, the diatom P. tricornutum has recently been evaluated as a potential solar-fueled expression system to produce bioplastics [41] and biopharmaceuticals. In the biopharmaceutical field, diatoms have successfully been used to produce functional monoclonal human IgG antibodies directed against the hepatitis B virus surface antigen [42,43]. Understanding post-translational modifications (including glycosylation processing) in diatoms is fundamental, because they determine the critical quality attributes that can influence folding, half-life, activity, and immunogenicity of biopharmaceuticals [44,45].
Glycoconjugates, such as glycans and polysaccharides, are assembled and modified within the endomembrane system [46]. Their synthesis involves three steps, the first being the formation of activated nucleotide sugars, such as NDP-sugars or NMP-sugars within the cytosol [47]. Then, the nucleotide sugars are actively transported to the endoplasmic reticulum (ER) and Golgi apparatus where they serve as donor substrates for glycosyltransferases (GT) that transfer a specific sugar from its activated nucleotide form to a specific acceptor leading to the extension of the glycoconjugates.
In this review, in regard to the capability of diatoms to synthesize glycoconjugates, we focus on the composition, structure and properties of diatom polysaccharides-whether they be intracellular, cell wall-bound or secreted in the culture medium-and on the structure and biosynthesis of N-glycans attached to proteins.

Monosaccharide Composition and Structures of Polysaccharides in Diatoms
Given the thousands of diatom species, with their large variety of forms, symmetry and cell wall shapes, the monosaccharide composition and structure of diatom glycoconjugates are likely to be highly specific. Diatoms are usually described as single cells with a protoplast embedded in a frustule-the name for the diatom cell wall-composed of two overlapping valves or thecae: the larger, upper epitheca and the smaller, lower hypotheca ( Figure 2A). The frustule is composed of three successive layers: (1) the inner-most, organic layer, called the diatotepum, is in contact with the plasmalemma; (2) a mineral, silicified shell that contains organic matter; and finally (3) an external organic coat that is trapped in secreted mucilage, that we call here "cell wall-bound exopolysaccharides (EPSs)" ( Figure 2B). Numerous studies have characterized the monosaccharide composition of cell wall polysaccharides, intracellular food storage polymers, and extracellular mucilage, but results must be carefully interpreted according to the extraction techniques used, which depend on the solubility of the respective components [48][49][50][51][52][53].

Frustules, or Cell Wall Polysaccharides
In diatoms, cell wall silica is associated with organic matter, composed mainly of proteins [54][55][56], polyamines [57], and polysaccharides [58]. Three families of proteins have been isolated from C. fusiformis cell walls: frustulins, pleuralins, and silaffins (see reference [59] for a review). Long polyamine chains, together with silaffins, are likely involved in frustule biosynthesis. However, the location and role of each component involved in frustule biosynthesis are not well-understood. Experimental studies on the chemical composition of the organic matter in diatom cell walls have demonstrated that polysaccharide content dominates that of proteins and lipids [60], although X-ray photoelectron spectroscopy measurements of cell surface components in T. pseudonana appear to show that polysaccharides are not predominant [61]. To characterize and localize these polysaccharides, accurate sequential extraction is necessary for at least three reasons. First, depending on their composition and structure, polysaccharides have various water-solubility properties, which in turn vary with temperature and chemical treatments. Second, cell wall polysaccharides may be more or less tightly bound either together, to silica, or to other insoluble components of the frustule. Third, a secreted or an extracted substance is not necessarily water-soluble once outside the cell; thus an insoluble extracted polysaccharide is not necessarily a cell wall component.
Based on typical sequential extractions performed either on live or mechanically disrupted cells, we reviewed the monosaccharide composition reported in the three final fractions: (1) hot alkali soluble fraction; (2) hot alkali insoluble fraction; and (3) residual material that makes up the insoluble organic cell wall fraction (see Table 1). Different monosaccharide profiles have been observed in alkali soluble fractions: fucose dominates in Thalassiosira gravida and Corethron hystrix; rhamnose is mostly encountered in Chaetoceros affinis [62], galactose in Thalassiosira weissflogii [63] and mannose in P. tricornutum. Fractions with high levels of ribose were attributed to the cross-contamination of the extracts from intracellular content [50,62]. In the alkali insoluble fraction, the monosaccharide composition profile is not that different from that found in insoluble organic cell walls in which mannose is preponderant. Insoluble organic cell walls, as summarized over 19 taxa in Table 1, and first shown by Coombs and Volcani [64], and reviewed by Hoagland et al. [48], contain fucose, galactose, glucose, mannose, xylose, glucuronic acid residues and, to a much lesser extent, rhamnose and arabinose. Although mannose is obviously the most abundant monosaccharide, fucose is dominant in Nitzschia brevirostris, whereas glucose predominates in Melosira granulata and Cyclotella stelligera [55], Nitzschia curvilineata and Amphora salina [53]. It is difficult to determine the abundance of other monosaccharide components due to their heterogeneous representation.
The best studied frustule polysaccharides have been extracted from P. tricornutum. Based on successive alkali extraction, deproteination, chromatographic separation, Percival and co-workers extracted a cell wall polysaccharide from P. tricornutum [65] that is mainly composed of mannose, glucuronic acid residues and sulfate groups. Mild acid hydrolysis of the polysaccharides combined with chemical analysis of the oligosaccharide fragments revealed moieties that should be present in the overall polysaccharide structure. Blocks of 3-linked mannose have been identified and are assumed to form the backbone of the polysaccharides ( Figure 3A). Substitution at position 2 of the mannose of the main chain with di-and trisaccharides composed of mannose and glucuronic acid ( Figure 3B), or with sulfate groups have also been described in Pinnularia viridis [52] as well as in P. tricornutum [66]. However, the detailed configuration of the linkage between residues, as well as the size and distribution of the ramification are still unknown. The cell wall monosaccharide composition of several diatom species includes high amounts of mannose and glucuronic acid and low, but more variable amounts of fucose and xylose. According to these results, the glucuronomannan described in P. tricornutum may be synthesized more generally by other diatoms [52,66,67]. Environmental conditions have been shown to influence the monosaccharide composition of cell wall polysaccharides, thus affecting their respective structures as illustrated in P. tricornutum [66]. Variations in culture conditions, such as phosphate limitation in the culture medium, an increase in salinity, switching culture from liquid to solid medium-all considered as stress conditions-may cause the observed enrichment in rhamnose, uronic acid, sulfate, and O-methylated sugars in the insoluble polymeric fraction. Such variation in monosaccharide composition is assumed to modify polysaccharide structures, enabling cells to adapt to environmental changes. Therefore, the effects of culture conditions on the monosaccharide composition of glycoconjugates must be considered when comparing experimental results. Table 1. Summary of the monosaccharide composition of diatom extracts: alkali soluble fraction, alkali insoluble fraction, and insoluble organic cell wall residues. Values are expressed in mol% of total monosaccharides detected in extracts. Horizontal sums of values lower than 100% indicate that some monosaccharides were not clearly identified in the corresponding study.

Chitinous Spines
Cyclotella and Thalassiosira species produce stiff and highly crystalline fibers of chitin (poly-N-acetyl-D-glucosamine), as demonstrated using chemical, crystallographic, and enzymatic methods [70][71][72]. Due to high crystallinity (chitin is probably the most crystalline polysaccharide material on earth); the crystal structure of chitin fibers has been resolved at the molecular level [73,74]. Diatoms secrete β-chitin (Figure 4), which has a crystalline structure similar to that described in worms [75][76][77], but different from that of arthropods, crustaceans, and fungi, which all synthesize α-chitin. α-Chitin shows anti-parallel chain packing, whereas β-chitin polymer chains show parallel packing, meaning that reducing ends all point out in the same direction. In diatoms, chitin fibers are excreted through specialized pores within the thecae called fultoportulae. Cross-sections examined under transmission electron microscopy show invaginations of the plasma membrane at the site of chitin polymerization [78][79][80]. Similar secretion systems have been reported for the giant tube worm Riftia pachyptila [81,82]. Crystallographic analyses of chitin fibers bound to thecae demonstrate that chitin polymerization occurs by elongation at the non-reducing end, consistent with the reducing chain end being the furthest from the biosynthesis site [83,84].
Genes encoding chitin synthase were discovered in the T. pseudonana genome. Homologous genes, but no chitin fibers, have been described in Skeletonema costatum, Chaetoceros socialis, Lithodesmium undulatum and P. tricornutum, suggesting a common origin of chitin synthase in diatoms, but also indicating potential occurrence of yet undescribed chitin [85]. Chitin occurs in the silica frustule of T. pseudonana [86] and is probably an underestimated component of diatom cell walls in general. Inhibition of chitin synthase or chitin crystallization mainly increases the sedimentation rate of diatoms. This effect suggests that chitin fibers are involved in the buoyancy of the dense siliceous diatom cells. The importance of chitin fibers in diatoms has also been highlighted: the chitin content accounts for an estimated 30% of the organic carbon pool in Cyclotella species [87].

Food Storage Polysaccharides
The food storage polysaccharide in diatoms is a β(1,3) glucan, also called chrysolaminarin because it resembles the β(1,3) glucan found in chrysophyte algae [88]. This particular polysaccharide has been localized in the vacuole using aniline blue dye [89] and anti-β(1,3) glucan antibodies [51]. Vacuolar accumulation is enhanced during photosynthesis and is mobilized in the dark. β(1,3) glucan content can reach up to 20%-30% of dry matter during the exponential growth phase of the diatom [90] and up to 80% during the stationary phase.
Treating diatom cells with hot or boiling water is often sufficient to extract chrysolaminarin as the main polysaccharide component. Mild acid hydrolysis and freeze-drying also help cell wall disruption. Chrysolaminarin is insoluble in organic solvents and can be easily recovered by precipitation in alcohol or acetone [91][92][93]. The structure of diatom β(1,3) glucans was first described from a mixed bloom of freshwater diatom species that included Nitzschia sigmoidea, Cymatopleura solea, Pinnularia sp. and Melosira varians [91]. Since then, chrysolaminarin structures from several diatom species (Skeletonema, Phaeodactylum, Chaetoceros, Thalassiosira) have been studied using chemical analysis and spectroscopic methods such as NMR ( Figure 5). The NMR spectra given in Figure 5 show high degrees of similarities between the β(1,3) glucan extracted from Saccharina latissima ( Figure 5A) and from P. tricornutum ( Figure 5B) , with the exception of fewer β(1,6) branching signals for chrysolamaninarin and a slightly higher reducing-end signal. The P. tricornutum β(1,3) glucan may have lower molecular weight or lack a mannitol residue at the reducing end.  6) branching signals (4.5-4.6 ppm) than laminarin. The slightly higher reducing end signal at 5.26 ppm (α-anomer) in the chrysolaminarin spectrum can be attributed to a lower molecular weight or the absence of a mannitol residue at the reducing end.
Based on the numerous studies, a general picture of diatom chrysolaminarin structure has emerged. It is usually composed of a β(1,3) glucan backbone chain ramified with β(1,6) glucose and sometimes with β(1,2) glucose. The length of the backbone chain and the degree of ramification vary with the diatom species (Table 2). During the growth phase of T. weissflogii and C. muelleri, the structure of chrysolaminarin does not change noticeably, suggesting that culture conditions do not influence the chrysolaminarin structure [94]. Table 2. Overview of the structural features of diatom chrysolaminarins. For comparison, Laminaria digitata laminarin has a degree of polymerization (DP) of 20-30 residues and a degree of branching (DB) of 0.05, [95]. Yield extraction of chrysolaminarin is expressed in % of diatom dry weight.

Exopolysaccharides
Diatoms synthesize extracellular mucilage, which mainly consists of complex heteroglycans. Although EPS usually refers to exopolysaccharides, in its broad sense it includes all extracellular polymeric substances, which have high carbohydrate contents [99,102,103], and can even be used to mean any macromolecule secreted from the plasmalemma (see review by Hoagland et al., reference [48]). EPSs have been described in many forms, such as stalks, tubes, apical pads, adhering films, fibrils, and cell coatings, which imply that EPS components have a wide variety of morphologies, ranging from highly crystalline rigid fibrils to highly hydrated mucilaginous capsules, and including polymers that are tightly bound to or integrated in the cell wall. In this section, we focus only on soluble EPSs.
Excretion of EPSs by diatoms provides a food source for heterotrophic organisms and affects the erodibility of biofilms [102][103][104]. EPS production rates and their monosaccharide compositions differ according to the growth phase and the physiological status of the cells [68,[103][104][105]. EPS secretion depends on environmental conditions such as nutrient availability, daily fluctuations, irradiance, and even metal toxicity [106][107][108][109][110]. Studies have shown that nitrogen (N) and phosphate (P) limitations affect the production rate of EPSs as well as their monosaccharide compositions. For example, N or P limitation have been shown to stimulate EPS production in various diatoms [66,108,110]. Under P-limited conditions in C. fusiformis, monosaccharide composition shows an increase in galactose and a decrease in glucose, whereas the composition of the remaining monosaccharides is almost unaffected [108]. EPS production also increases under P-depleted conditions in other diatom species [109] with reduced glucose content in Cylindrotheca closterium. Likewise, fucose and rhamnose appear to be involved in adhesion, either by enrichment in some biofilm EPS structures with those residues, or by modification of the linkage types of those residues [67,111]. Mass spectrometry on T. pseudonana EPSs has shown that the degree of polymerization and the distribution of EPSs vary in response to nutrient depletion and different nutrient sources [110].
As in cell wall polysaccharides, the monosaccharide composition of EPSs can vary drastically depending on the extraction method [112]. However, diatom EPSs have two general compositional features: (1) they consist of heteropolysaccharides that can be sulfated and (2) they contain rhamnose, fucose, galactose, glucose, mannose, xylose and/or uronic acids as well as some arabinose in lower proportions (Table 3). Furthermore, diatom EPSs have high proportions of methylpentoses compared with intracellular soluble and cell wall polysaccharides. To date, no complete fine structure has been resolved for diatom EPSs and only a few studies report data on linkages. However, the available data show a large diversity of linkages found in diatom EPSs, with many glycosyl residues being typical of branched structures (Table 3).  [116,117] Chaetoceros curvisetus e Fuc (   reported. Ara, arabinose; Fuc, fucose; Gal, galactose; Glc, glucose; GlcA, glucuronic acid; Man, mannose; Rha, rhamnose; Rib, ribose; UA, uronic acid; Unk, unknown; Xyl, xylose. Numbers in brackets following abbreviations give relative proportions of monosaccharide residues expressed as mol%, wt%, molar ratio, etc., as reported. The sugars are ordered from high to low percentages for each species; c nd, not determined; d The ratio varies with hydrolysis conditions; e Data also available in the review [48]; f Glycosyl linkages expressed as the position(s) of substitution in addition to C-1 (t-Fuc, terminal fucosyl; 3-Rha, 3-rhamnosyl); Subscript "f" following sugar abbreviation indicates furanose form.

Structures and Biosynthesis of Protein N-Glycans in Diatoms
N-glycosylation is a major co-and post-translational modification of proteins in eukaryotes occurring in both the ER and the Golgi apparatus ( Figure 6, [122]). In this process, a lipid-linked oligosaccharide composed of three glucose (Glc), nine mannose (Man) and two GlcNAc residues (Glc3Man9GlcNAc2) is first assembled by the stepwise addition of monosaccharides on a dolicholpyrophosphate on the cytosolic side of and then in the lumen of the ER [123]. This oligosaccharide precursor is then transferred by the oligosaccharyl transferase (OST) complex onto the asparagine residues of consensus Asn-X-Ser/Thr sequences of a protein [123]. In 3.5% of studied cases, other sequences such as Asn-X-Cys, Asn-X-Val are glycosylated in endogenous or recombinant proteins produced in mammals or plant cells [124][125][126]. The glycoprotein is deglucosylated by α-glucosidases I and II and then reglucosylated by an uridine diphosphate (UDP)-glucose glycoprotein glucosyl transferase (UGGT) to ensure proper folding of the nascent protein through its interaction with ER-resident chaperones, such as calnexin and calreticulin [127]. These ER events are conserved in eukaryotes because they are crucial for efficient protein folding [127]. Bioinformatic analyses demonstrate that most of the genes encoding enzymes involved in the biosynthesis of the dolicholpyrophosphate-linked oligosaccharide, named asparagine-linked glycosylation (ALG) [128], are predicted in the genomes of diatoms (P. tricornutum [129], T. pseudonana [130], Fragilariopsis cylindrus [131] and Aureococcus anophagereffens [132]) ( Figure 6), [133][134][135]. The only exception is ALG 10, an α(1,2)-glucosyl transferase responsible for the transfer of the terminal Glc residue of the triglucosyl extension of the N-glycan precursor, for which no homology has been found ( Figure 6) [133,135]. In addition to ALG genes, genes encoding subunits of the oligosaccharyl transferase have also been identified in diatom genomes, especially in P. tricornutum (Figure 6), in which α-glucosidase II (but not α-glucosidase I) as well as ER-resident UGGT and chaperones such as calreticulin are also predicted ( Figure 6) [133,135]. These proteins are key elements of the quality control of proteins occurring in the ER. Large oligomannosides, with sizes of up to Man9GlcNAc2, have been found in P. tricornutum glycoproteins [133], suggesting that the synthesis of the oligosaccharide precursor and the quality control of secreted proteins may occur in a similar manner as that observed in other eukaryotes. However, in the ER, α-glucosidase I appears to remove the terminal α(1,2)-glucosyl transferase that is transferred by ALG10. Absence of ALG 10 and α-glucosidase I genes in P. tricornutum suggests that the N-glycan precursor is not fully glucosylated in diatoms into Glc3Man9GlcNAc2 ( Figure 6) [133,135]. In contrast to ER events, evolutionary adaptation of N-glycan processing in the Golgi apparatus has given rise to a variety of organism-specific complex structures [136]. First, α-mannosidases (α-Man I) degrade the oligosaccharide precursor into oligomannosides ranging from Man9GlcNAc2 to Man5GlcNAc2 (Man-9 to Man-5) ( Figure 6). N-acetylglucosaminyl transferase I (GnT I) then transfers a first N-acetylglucosaminyl (GlcNAc) residue onto Man-5 and initiates the synthesis of a large variety of structurally different complex-type N-glycans. This processing continues with the removal of two mannosyl residues and then decoration of the N-glycans by the action of a specific repertoire of glycosyl transferases such as α-fucosyl transferases (FuT). Therefore, mature proteins leaving the secretory pathway carry organism-specific complex N-glycans allowing the protein to acquire a set of glycan-mediated biological functions [137,138]. Searches in diatom genomes for candidate genes encoding Golgi glycosidases and glycosyl transferases involved in N-glycan processing have led to the identification of α-Man I, GnT I and a FuT (putative α(1,3)-FuT) candidates [133,134], (Figure 6). The GnT I gene predicted in the P. tricornutum genome has been demonstrated to encode an active functional enzyme able to restore the maturation of N-linked glycans into complex-type N-glycans in the CHO Lec1 mutant which is affected in its endogenous GnT I [133]. Moreover, structural analysis of glycans N-linked to proteins secreted by the diatom P. tricornutum indicate that these oligosaccharides are processed through a GnT I-dependent pathway into partially fucosylated Man3GlcNAc2 ( Figure 6) [133]. This truncated and fucosylated N-linked glycan likely results from the trimming of two mannose residues from Man-5 by an α-Man II and then transfer of an α(1,3)-fucose residue [133]. Later, the terminal GlcNAc introduced by the Golgi P. tricornutum GnT I are probably eliminated by β-hexosaminidases, as previously described in land plants and insects [139,140]. Two putative β-hexosaminidases have already been identified in P. tricornutum [133].

Nucleotide Sugar Biosynthesis in Diatoms
Monosaccharides represent the building blocks of glycans and polysaccharides (see Section 1). They are usually synthesized and converted into nucleotide sugars through a cytosolic interconversion metabolism (KEGG map 00520; [143]. In turn, nucleotide sugars-which are universal sugar donors-are involved in the formation of polysaccharides, glycoproteins, proteoglycans, and glycolipids. This metabolism is highly conserved in prokaryotes and eukaryotes and involves a set of phosphorylases, epimerases and reductases, as well as fructose-6-phosphate amino transferases enabling the synthesis of aminosugars. Nucleotide sugars may also result from the salvage pathway that involves the hydrolysis of glycans to free sugars, their phosphorylation and finally their nucleotidylation. Searches in diatom genomes (P. tricornutum [129], T. pseudonana [130], Fragilariopsis cylindrus [131], and Aureococcus anophagefferens [132] for genes encoding cytosolic enzymes of both the interconversion and salvage pathways have led to the identification of putative candidates for the synthesis of UDP-sugars such as UDP-galactose (UDP-Gal), UDP-galacturonic acid (UDP-GalA), UDP-glucuronic acid (UDP-GlcA), UDP-xylose (UDP-Xyl) and UDP-rhamnose (UDP-Rha) that are directly derived from UDP-Glc ( Figure 7). Moreover, gene predictions also include enzymes required for the synthesis of the guanine diphosphate (GDP)-sugars originating from Man-6P (Figure 7). Aminosugar such as GlcNAc biosynthesis likely occurs by amination of the C2 on fructose-6P (Fru-6P) as reported in other organisms. Other gene predictions based on diatom genomes include genes encoding several sugar phosphorylases of the salvage pathway ( Figure 7). However, neither UDP-galacturonate decarboxylases, nor UDP-arabinose 4-epimerases required for L-arabinose biosynthesis are predicted in these genomes. Other than arabinose, predicted nucleotide sugar metabolism is generally in agreement with the sugar compositions of polysaccharides and glycans isolated from diatoms. Figure 7. Predicted nucleotide sugar metabolism in diatoms based on bioinformatics analyses of the genomes from Phaeodactylum tricornutum [129], Thalassiosira pseudonana [130], Fragilariopsis cylindrus [131] and Aureococcus anophagefferens [132].

Conclusions and Perspectives
The abundant literature cited in this review demonstrates that monosaccharide composition is highly variable in diatom EPSs. The variation in physiological conditions greatly influences their composition, suggesting that diatoms modulate their polysaccharide biosynthesis machinery to adapt to environmental conditions. EPSs have been shown to be heteropolysaccharides. Branched and sulfated glucuronomannans are thought to be ubiquitous and thus representative of diatom cell walls. β-chitin fibers have also been found in some diatom species. Chrysolaminarin is a common β(1,6) ramified β(1,3) polyglucan for food storage in diatoms. Although numerous efforts have been made to determine the structures of diatom polysaccharides, there is currently a lack of information regarding their biosynthesis pathways, as well as their cell localization and organization.
With regard to protein N-glycosylation, gene prediction analysis suggests that diatoms are equipped with most of the eukaryotic genes encoding ER-resident essential players, such as sugar transferases and chaperones. Some candidate genes involved in subsequent Golgi events of N-glycosylation have also been found in P. tricornutum, justifying further investigations and characterization of diatom N-glycosylation pathways. When looking at potential nucleotide sugar synthesis in diatom genomes, key enzyme genes were predicted for almost all monosaccharides, with the exception of those for arabinose synthesis, although this sugar was detected in some cell wall polysaccharides. In the coming years, the increasing demand for marine polysaccharides and for recombinant therapeutic glycoproteins produced in microalgae will lead the scientific community to carry out more research to better understand diatom polysaccharide and glycan structures, as well as their respective biosynthesis pathways and cell localizations.