Analyzing Plant Low-Molecular-Weight Polar Metabolites: A GC-MS Approach

Bilova, Tatiana; Frolova, Nadezhda; Orlova, Anastasia; Silinskaia, Svetlana; Mailov, Akif; Popova, Veronika; Frolov, Andrej

doi:10.3390/plants15030445

Open Access Review

by Tatiana Bilova ¹,², Nadezhda Frolova ²,*, Anastasia Orlova ², Svetlana Silinskaia ², Akif Mailov ³, Veronika Popova ¹ and Andrej Frolov ²,*

¹ Department of Plant Physiology and Biochemistry, Saint Petersburg State University, 199034 Saint Petersburg, Russia

² Laboratory of Analytical Biochemistry and Biotechnology, K.A. Timiryazev Institute of Plant Physiology, Russian Academy of Sciences, 127276 Moscow, Russia

³ Laboratory of Bioanalytical Chemistry, Higher School of Living Systems, Immanuel Kant Baltic Federal University, 236041 Kaliningrad, Russia

* Authors to whom correspondence should be addressed.

Plants 2026, 15(3), 445; https://doi.org/10.3390/plants15030445

Submission received: 17 December 2025 / Revised: 23 January 2026 / Accepted: 27 January 2026 / Published: 31 January 2026

(This article belongs to the Special Issue Advanced Research in Plant Analytical Chemistry)

Abstract

Decades ago, the introduction of GC-MS marked a significant advancement in primary plant metabolite studies. In this review, we examine critical aspects of the workflow, spanning the selection of an analytical platform, sample preparation, analytical acquisition, and data processing and interpretation. The exceptional separation capabilities of GC, characterized by remarkable chromatographic resolution, render it ideal for analysis of the complex plant metabolome, including the separation of isomeric compounds. The diversity of analytical platforms allows the investigation of plant metabolomes using targeted and non-targeted approaches. GC-MS, combined with efficient extraction methods and reliable derivatization protocols for semi- and non-volatile compounds, enables qualitative and quantitative analysis of these molecules. The stability of derivatives forms the foundation for the robustness and reproducibility of GC-MS methods, and their mass spectra provide characteristic fragments for confident identification and sensitive quantification of individual metabolites. There has been key progress in the advancement of GC-MS approaches to studying plant metabolism. However, the formation of artifacts during GC-MS analysis, particularly during derivatization, remains a challenge that requires careful validation and frequently necessitates additional investigations. The feasible solutions developed to overcome the limitations of GC-MS-based studies are a particular focus of the present discussion.

Keywords:

primary metabolites; GC-MS-based profiling; plant metabolomics

1. Introduction

More than two decades ago, GC-MS was introduced as a method for comprehensive analysis of plant primary metabolites. The initial studies of Roessner et al. (2000) [1] and Fiehn et al. (2000a) [2] utilized gas chromatography–quadrupole mass spectrometry with electron ionization (GC-EI-Q-MS) for untargeted metabolic profiling of polar and semi-polar metabolites in aqueous methanolic and chloroform–methanolic plant extracts, respectively. Roessner and colleagues identified about 150 analytes in polar extracts of potato tubers and showed that the relative contents of some metabolites (glycerol, sugar alcohols, and amino acids) varied with the plant growth conditions [1]. Similarly, Fiehn et al. analyzed semi-polar and polar metabolites from leaf extracts of different Arabidopsis thaliana genotypes. Using principal component analysis (PCA), they successfully distinguished the genotypes based on the relative abundances of 214 polar and 112 lipophilic annotated analytes [2]. These pioneering studies highlighted the potential of GC-MS-based metabolomics in bridging the gap between phenotypes and genotypes and unraveling molecular networks.

Indeed, the plant metabolome can be interpreted as the realization of genetic information, a kind of “bridge” between genotype and phenotype [3]. This understanding stems from the fact that metabolites always play a specific role in the cell and organism, representing the end result of gene function [4]. GC-MS allows the detection of quantitative differences in the metabolite content of plants, which, in combination with data analysis tools, can serve as a method for comprehensive characterization of plant genotypes in response to biotic and abiotic stresses [5,6,7,8], symbiotic interactions [9,10], and oxidative stress [11], as well as during growth, development and ageing [12]. It should be noted that GC-MS has many other cross-border applications, including the search for new biomarkers and bioactive metabolites [13,14], chemotyping [15], metabolomics-based selection and creating “next generations” of agricultural crops [16,17], and metabolic pathway engineering [18].

The wide range of applications of GC-MS is a clear sign of the technique’s high analytical potential, robustness and reproducibility. The method originated in the 1950s, when the chemists R. Gohlke and F. McLafferty first coupled a homemade gas chromatograph with a time-of-flight mass spectrometer (TOF-MS). The resulting GC-MS demonstrated a revolutionary enhancement in analytical power [19]. By combining the exceptional compound-separating strength of GC and the definitive identification capacity of MS, it became possible to analyze each component emerging from the gas chromatograph separately. The first GC-MS instruments revealed many technical challenges, which shaped the following 25 years of the method’s development [19]. By 1980, GC-MS had achieved significant advances due to innovations in quadrupole (Q)-MS and faster TOF-MS, the evolution of efficient and reproducible fused-silica capillary columns, affordable computers for data processing, and improved electronics. In the following years, further GC-MS-based technical innovations were developed, such as comprehensive two- and multi-dimensional gas chromatography (2DGC/MDGC)-TOF-MS, GC–time-of-flight tandem mass spectrometry (TOF-MS/MS), GC–ion mobility mass spectrometry (IMS), and GC coupled with a range of high-resolution mass spectrometers (HRMSs) [20,21,22,23]. These innovative technologies, together with improvements in raw data processing software, mass spectrum libraries, MS search and bioinformatics tools, and the compatibility of GC with a variety of sample preparation methods and derivatization strategies, have formed a complete GC-MS-based methodology appropriate for analyzing diverse low-molecular-weight compounds with markedly different physicochemical properties, such as non-polar and semi-polar volatiles and semi-polar and polar non-volatile metabolites.

Nevertheless, despite the rapid development of GC-MS-based innovative technologies, the canonical single-dimension GC-MS remains a routine method in every laboratory and a preferred method in applied and fundamental plant sciences due to its high efficiency in profiling of plant compounds, robustness, and the equipment being relatively inexpensive but reliable [24,25,26,27,28,29]. Thus, GC-MS has been widely used in research to identify metabolites responsible for the biological properties of medicinal plants, e.g., leaves of tropical plants such as Jacaranda mimosifolia [30], Aporosa cardiosperma [31], and Senecio scandens [32]. In this type of study, crude plant extracts usually are not subjected to derivatization. Their GC-MS analysis focuses on the identification of non- or semi-polar volatile secondary compounds (sesqui-, di- and triterpenoids; saturated fatty acid alcohols; and alkaloids) which may exhibit biological activities and appear to be plant-specific [33]. Usually such profiles cover just several dozen metabolites [30,31,32,33].

However, by implementing a derivatization strategy in the GC-MS workflow, the coverage can be significantly expanded to several hundred metabolites [5,34], as derivatization transforms most of the non-volatile semi-polar and polar molecules dominating in plant extracts into volatile targets for GC-MS. Most metabolites of these groups participate in major energy and biosynthetic pathways. Therefore, even though derivatization introduces complexity (potential incompleteness of reactions and formation of several derivatives) leading to the appearance of artifacts [29,35,36,37], this strategy has made GC-MS an appropriate method for analyzing the plant primary metabolome, significantly increasing the method’s versatility. Moreover, this progress in unravelling the plant primary metabolome using GC-MS has proven to be a significant contribution to multi-omics research. This is demonstrated by multiple studies in which GC-MS-based metabolite profiles helped in linking genomic and proteomic data to phenotypic traits of interest, enabling metabolomics-assisted breeding for crop improvement [38], and revealing mechanisms of development [39], growth protectors [40], and responses to a changing environment [41]. For example, by means of GC-MS, Acharjee and co-workers [38] identified metabolites associated with carotenoid biosynthesis contributing to potato tuber flesh color. In another work, Wang and co-authors [40] revealed that the growth-promoting effect of zaxinone on rice seedlings was accompanied by a marked increase in the contents of many sugars and by stimulation of central energy pathways. In a third example, Yun and colleagues [39] found 36 GC-MS-identified primary metabolites and 81 volatiles that contributed to the ripening of the peel in harvested bananas.

Thus, the main advantages of GC-MS for analyzing polar metabolites can be listed as follows: (i) The outstanding efficiency of GC separation, with a chromatographic resolution as high as 100,000 per run [21], is ideal for the extremely complex plant metabolome, often allowing the baseline separation of even isomeric analytes [21]. (ii) Efficient extraction methods, as well as reliable, well-established derivatization protocols compatible with analysis of polar compounds, allow the use of GC-MS for the qualitative and quantitative analysis of polar metabolites in plants [42,43]. (iii) The remarkable stability of the derivatives underlies the high robustness and reproducibility of GC-MS-based methods [44]. Moreover, the electron ionization (EI) spectra of derivatives often yield characteristic, highly intense fragments which give access to both (iv) confident identification and (v) sensitive quantification of individual metabolites with a good linear dynamic range (LDR) [45].

To simplify consideration of the complex methodology of GC-MS-based metabolomics, it can be addressed as a generalized workflow comprising three principal steps: (i) sample preparation, (ii) GC-MS analysis and (iii) processing of the acquired data. Since the overall performance and the final output of the metabolomics analysis critically depend on each of these three steps, we address them in more detail below, highlighting their challenges and achievable solutions. Special attention is given to sample preparation, including derivatization methods, because inaccuracies introduced at this step may distort the final results of the analysis. Numerous sample preparation protocols for GC-MS provide all the necessary information concerning harvesting, freezing, storage, and homogenizing, as well as metabolite extraction, concentration, derivatization, and resuspension [46]. In general, the details of the sample preparation protocol depend on the purpose and object of the study and the choice of the analytical platform [47].

2. Choosing an Analytical Platform

Choosing the right analytical platform is a challenge that researchers face at the experiment design stage. The choice depends on the experiment objectives: whether the target metabolites or their groups will be searched for, or whether a non-targeted approach will be used to analyze all metabolites in the sample. Depending on the choice of a targeted or non-targeted approach, both chromatographic separation conditions and mass spectrometry platforms are selected.

In plant metabolomics analysis, when coupling GC-based separations with mass spectrometry, careful consideration of the spectral acquisition rate is essential, as is an understanding of the unique capabilities and limitations of each platform, which can impact their suitability for targeted and untargeted metabolomics strategies (Table 1). To ensure reliable peak reconstruction and reproducible peak areas, a minimum of 7 to 10 data points across the chromatographic peak is recommended [48,49]. For conventional GC separations, a minimum acquisition rate of 2 spectra s⁻¹ is sufficient, while GC×GC separations require a minimum acquisition rate of 30 spectra s⁻¹ [21]. The choice of mass analyzer significantly impacts the quality of the mass spectrum and the sensitivity of the GC-MS method. Quadrupole mass analyzers are popular in metabolomics due to their versatility and sensitivity. They register the mass spectrum of a compound by scanning the m/z range (typically from 50 to 700 m/z), transmitting ions of one m/z value at a time, so that the ions pass through the quadrupole sequentially. However, quadrupoles, as transmission scanning instruments, can experience spectral skewing (i.e., distortion of the mass spectrum due to analyte concentration changes during the scan period, which occurs mainly on the back side of a chromatographic peak) if the acquisition rate is incompatible with the peak width [50]. Faster acquisition rates are necessary to avoid skewing, but they may result in some loss of sensitivity [51]. Non-scanning instruments, such as time-of-flight (TOF-MS) and Fourier transform (FT-MS) mass spectrometers, which acquire the entire mass spectrum simultaneously, do not exhibit spectral skewing [52,53]. TOF-MS instruments, which perform ion separation and mass assignment based on the velocity of ions as they travel through a flight tube, offer high sensitivity and resolution and operate at acquisition rates between 50 and 200 spectra s⁻¹. 
The Orbitrap mass analyzer separates ions by their axial oscillations in an electric field and detects these oscillations as an image current, which is subsequently Fourier-transformed into a mass spectrum. The Orbitrap provides ultrahigh resolution and accuracy, but its resolving power depends on the acquisition rate [23,54,55]. Untargeted metabolomics (metabolite profiling) requires the acquisition of full mass spectra for all features (compound peaks), while targeted metabolomics benefits from improved resolution and additional mass selectivity [45]. Metabolic profiling may also exploit precursor and product ion scans for group-type analyses using sequential mass spectrometers, such as triple quadrupole (QqQ-MS) or hybrid Q-TOF-MS. Other acquisition modes, like selected ion monitoring (SIM) and selected reaction monitoring (SRM), are employed for targeted analysis, offering exceptional selectivity [56]. Extracted ion chromatograms (XICs) can also enhance selectivity, especially with high-mass-accuracy data from TOF-MS and Orbitrap instruments (Table 1).
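The relationship between peak width, the recommended 7 to 10 data points per peak, and the minimum acquisition rate can be illustrated with a short calculation. The following Python sketch is only an illustration: the function names and the example peak widths (5 s for a conventional GC peak, 0.3 s for a GC×GC peak) are assumptions chosen for demonstration and are not taken from the cited studies.

```python
def min_acquisition_rate(peak_width_s: float, points_per_peak: int = 10) -> float:
    """Return the spectra per second needed to sample a peak of the given
    base width (in seconds) with the requested number of data points."""
    if peak_width_s <= 0:
        raise ValueError("peak width must be positive")
    if points_per_peak < 1:
        raise ValueError("at least one data point per peak is required")
    return points_per_peak / peak_width_s


def points_across_peak(peak_width_s: float, rate_hz: float) -> float:
    """Return how many spectra fall across a peak at a given acquisition rate."""
    if peak_width_s <= 0 or rate_hz <= 0:
        raise ValueError("peak width and acquisition rate must be positive")
    return peak_width_s * rate_hz


if __name__ == "__main__":
    # Assumed, illustrative peak widths (not values from the review).
    conventional_gc_peak_s = 5.0
    gcxgc_peak_s = 0.3

    print(min_acquisition_rate(conventional_gc_peak_s))  # 2.0 spectra per second
    print(round(min_acquisition_rate(gcxgc_peak_s), 1))  # above 30 spectra per second
    print(points_across_peak(conventional_gc_peak_s, 2.0))  # 10.0 points
```

Under these assumed peak widths, the sketch reproduces the order of magnitude quoted above: about 2 spectra s⁻¹ for conventional GC and more than 30 spectra s⁻¹ for GC×GC.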

3. Sample Preparation

3.1. Harvesting and Fixation of Plant Material

Harvesting and subsequent homogenization are the first steps of sample preparation (Figure 1) and are important for successful GC-MS analysis [57]. Harvesting of plant material needs to be accomplished rapidly, ideally within 15–30 s, and is followed by immediate fixation, e.g., by shock-freezing in liquid nitrogen [29,58,59]. A harvesting delay of up to 30 s is considered short enough to prevent the development of wounding signaling [60,61,62]. Shock-freezing in liquid nitrogen captures the actual metabolome state for each sample at any time point chosen according to the experiment design. Thus, shock-freezing provides a footprint of plant metabolism in the form of a “metabolic snapshot”, preventing any further modification of the sample, such as decomposition of metabolites or changes in their concentration or chemical or physical properties. Since metabolites are highly dynamic (in time and space), such a snapshot helps to clearly assess the current physiological state of the plant at a given time and helps to minimize errors associated with sample preparation [59].

Homogenization techniques serve the essential purpose of obtaining representative samples for analysis by disrupting plant cells. Several homogenization methods are commonly employed in plant metabolomics. One widely used method is mechanical disruption, which involves grinding or crushing plant material using a mortar and pestle, blender, or bead mill [63]. The mechanical methods are effective in breaking down plant tissues and releasing metabolites, but they may generate heat, potentially leading to metabolite degradation or enzymatic reactions [64]. To prevent this, liquid nitrogen or dry ice is used during grinding, as well as cooled mills. Liquid nitrogen freezing effectively preserves the metabolite composition by rapidly halting enzymatic activity and minimizing degradation [59]. However, it requires specialized equipment and poses safety considerations due to the handling of cryogenic substances. Additionally, care must be taken to prevent cross-contamination between samples and ensure consistent and reproducible homogenization [64,65]. For this purpose, separate containers (or tubes) are used for each sample with additionally placed special grinding balls, the main property of which should be inertness [66].

Besides shock-freezing and further grinding, other material fixation methods like microdissection [67] and direct drying [68], as well as fixation techniques combined with extraction methods like microextraction [69], ultrasound-assisted extraction [70], and microwave-assisted extraction [71], showed their excellence in many metabolomics studies, e.g., metabolic profiling [29,65] and identification of biomarkers [72,73] and compounds with potential antimicrobial and antioxidant activities [31,74].

Various microdissection techniques (manual microdissection, laser microdissection, and microsampling) enable the precise isolation of specific plant tissues or cells, facilitating the study of metabolites within targeted structures and cell types, thereby providing valuable insights into their metabolic profiles and functions [54,75]. In the first technique, microtools and a microscope are used; in the second, laser technology is used; and in the third, fine capillaries or microsampling devices are used to carefully isolate specific plant tissues, cell clusters or even individual cells [76,77]. However, the limitations of microdissection are that these procedures can be time-consuming and labor-intensive, especially when targeting small or delicate plant structures. Challenges also may arise in precisely isolating specific cells or tissues due to their complex anatomical arrangements or the potential loss of metabolites during the dissection process [78,79].

Plant samples can also be analyzed in their dried form, enabling the determination of specific components based on their dry mass and significantly mitigating issues related to the high water content in crude samples. Importantly, drying plant material does not completely eliminate water, and the term “dry mass” indicates that the material still contains a certain percentage of water (from several percent to a dozen percent or more) [63]. Drying of plant material is often carried out either by freeze-drying (also known as lyophilization), where water in the form of ice is removed from the material by sublimation at low pressure, or by hot air in ventilated ovens or in ovens with a flow of nitrogen (at 70–80 °C, or at lower temperatures such as 40–50 °C, which is particularly important for relatively volatile or subliming analytes). In cases where non-volatile and non-subliming substances need to be isolated, laboratory vacuum ovens are equipped with water absorption, adsorption, or freezing-out systems. However, vacuum systems are not suitable for highly or even medium-volatile substances.

Some types of the combined fixation–extraction and various extraction techniques applied in GC-MS-based metabolite analysis have already been discussed in our recent work [47] and are treated in other well thought-out reviews on this topic [78,80]. An overall workflow of the sample preparation steps in GC-MS analysis addressed here (harvesting, fixation and extraction) is illustrated in Figure 1, which summarizes the variety of applied methods.

3.2. Extraction of Primary Metabolites

High efficiency and selectivity are the main prerequisites for a successful extraction method. Thus, on the one hand, an efficient extraction procedure yields complete isolation of the desired metabolite groups; on the other, the co-extraction of proteins, polysaccharides, nucleic acids and lipids needs to be avoided. As plant primary metabolites vary essentially in their physicochemical properties (polarity, pKa, volatility, and thermostability) and relative abundances (by up to several orders of magnitude), their concerted and efficient extraction is challenging. Therefore, selection of the appropriate solvent systems (in terms of polarity and compatibility with the target analytes) is critical [81]. To isolate all metabolites containing polar groups, polar solvents like methanol [82,83,84], isopropanol [85], and alcohol-aqueous solutions [16,86,87] are commonly used in various studies [88,89,90,91]. In addition to single-stage extraction with one solvent or a mixture of solvents, multi-stage methods are often used to enhance the coverage and efficiency of metabolite extraction. These multi-stage extraction approaches involve a series of different solvents or solvent mixtures in sequential steps to target specific groups of metabolites and improve their overall yield [61]. In the first stage, less polar solvents (isopropanol, chloroform, and methanol) are typically added to the plant material, facilitating the extraction of lipids, waxes, and other non- or semi-polar metabolites. In the second stage, polar solvents like water are used to extract polar metabolites (sugars, organic acids, and amino acids). This two (or more)-stage procedure allows more comprehensive extraction of metabolites of varying polarities, which may be analyzed together as a combined extract or as separate extracts [61].

To achieve higher extraction efficiency, additional approaches are utilized to optimize the solubility and stability of metabolites, e.g., by manipulating the temperature conditions [81,92]. Temperature optimization is crucial to achieve a comprehensive and representative metabolite profile [57,58]. Although elevated temperatures (70–85 °C) provide better solubility for many polar compounds, lower temperatures can be advantageous for better recoveries of more volatile compounds. Additionally, lower extraction temperatures favor the stability of chemically labile metabolites, minimizing the potential for their degradation and side reactions during the extraction process. In this context, sample homogeneity should also be considered, as less homogeneous samples might require longer extraction times, which makes higher temperatures beneficial for efficient extraction [93].

To ensure optimal separation efficiency, the analyzed samples need to be free from metabolites that are incompatible with the selected chromatographic system. As such compounds cannot be efficiently retained in the stationary phase and/or eluted from it, their adequate quantification is impossible [21]. Moreover, these contaminants can overload or even damage the separation column, which might compromise the whole metabolomics experiment [81]. In particular, the use of methanol for extraction of polar metabolites implies that triglycerides and phospholipids are additionally extracted from the plant material with relatively high yields. Because of their relatively high molecular weight, these lipids and their trimethylsilylated derivatives (as well as high-molecular-weight products formed due to the thermal instability of the derivatives [35]) remain non-volatile under the high-temperature conditions used to introduce a liquid sample into a GC system. Therefore, they are retained on the walls of the glass liner and may bind irreversibly to the poly(dimethyldiphenylsiloxane) phases that are commonly used in GC analysis [21]. Moreover, the lipids accumulated on the liner may be subjected to at least partial pyrolysis under the high-temperature injection conditions [94]. This might increase the intensity of background signals related to fatty acids and their degradation products, which is often also associated with strong carry-over. Indeed, this phenomenon is often observed as contaminant signals of saturated fatty acids like palmitic and stearic acids [94].

To overcome the above-described issue and to ensure the adequate quality of the quantitative data, metabolites such as lipids, waxes and other high-molecular-weight non-polar compounds, which cannot be efficiently analyzed in a system designed to separate low-molecular-weight volatile compounds, need to be removed from the extract prior to the GC-MS analysis, e.g., by liquid–liquid extraction using non-polar solvents such as n-hexane or n-heptane. Indeed, Fiehn and co-workers showed that removal of lipids significantly increases the accuracy of the analysis [94]. Removal of lipids also essentially improves the efficiency of the subsequent derivatization (trimethylsilylation) of amino acids and polyamines [45,87].

As multiple factors might affect extraction performance, the reproducibility of this sample preparation step is important and requires special attention. Thus, in metabolomics, control of the extraction performance relies on internal standardization [45,87]. An internal standard (IS) is added to the extraction solvent to account for any potential analyte losses during the whole sample preparation procedure, which includes, besides extraction, such steps as extract drying, storage and derivatization [45,81]. Typically, non-natural compounds (at least those not found in the plants of interest) with a structure and analytical behavior similar to the target analytes are selected as ISs [45]. For analysis of primary metabolites, the following standards are often used: ribitol (adonitol) [1,86,88], lactose [2,95], or stable isotope-labeled compounds such as [1,2,3-¹³C₃]myristic acid [96].
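Internal standardization is usually applied at the data processing stage by expressing each analyte signal relative to the IS signal. The Python sketch below shows one possible form of such a normalization, assuming that peak areas have already been integrated. The function name, the additional normalization to sample mass, and all numerical values are illustrative assumptions rather than a procedure prescribed by the cited protocols.

```python
def normalize_to_internal_standard(
    peak_areas: dict[str, float],
    is_name: str,
    sample_mass_mg: float,
) -> dict[str, float]:
    """Return analyte peak areas divided by the internal standard (IS) area
    and by the sample mass; the IS itself is left out of the result."""
    if is_name not in peak_areas:
        raise KeyError(f"internal standard '{is_name}' not found in peak table")
    is_area = peak_areas[is_name]
    if is_area <= 0:
        raise ValueError("internal standard area must be positive")
    if sample_mass_mg <= 0:
        raise ValueError("sample mass must be positive")
    return {
        name: area / is_area / sample_mass_mg
        for name, area in peak_areas.items()
        if name != is_name
    }


if __name__ == "__main__":
    # Hypothetical integrated peak areas (arbitrary units) for one extract.
    areas = {"ribitol": 2000.0, "sucrose": 8000.0, "proline": 500.0}
    relative = normalize_to_internal_standard(areas, "ribitol", sample_mass_mg=50.0)
    for metabolite, value in relative.items():
        print(metabolite, value)
```

Because the IS is added before extraction, any proportional loss during drying, storage or derivatization affects the numerator and the denominator alike, which is why the ratio is more reproducible than the raw peak area.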

3.3. Derivatization

The volatility and thermal stability of the analyzed substances are the key prerequisites for successful GC-MS analysis [45,81]. Due to the presence of polar or even charged functional groups, primary plant metabolites are typically not volatile and therefore cannot be efficiently introduced into the GC by liquid injection [21]. To adjust the physical properties of these analytes to the requirements of the GC-MS methodology, the polar functional groups (amine, hydroxyl, carboxyl, and thiol groups) of the prospective analytes need to be “capped” by non-polar moieties. This chemical modification of polar functional groups can be achieved by a broad array of derivatization methods which have already been discussed in our previous review on sample preparation and derivatization methods [47] and in more detail in the comprehensive review of Evershed and Beale et al. [92].

If performed correctly, this derivatization of polar plant metabolites results in a quantitative transfer of the resultant derivatives to the gaseous phase in the liquid injector of the GC. The hydrophobicity of the resultant derivatives allows their efficient retention in non-polar stationary phases, such as the commonly used poly(dimethylsiloxane) and poly(dimethyldiphenylsiloxane), and their complete elution by reasonable temperature gradients [21].

When choosing the derivatization strategy, the chemical nature and structural diversity of the target analytes must be taken into account. Since trimethylsilylation [97] and its modification, tert-butyldimethylsilylation [98], exhibit high reactivity towards practically all polar functional groups of most plant metabolites, these derivatization strategies are recognized as the most universal and suitable in primary metabolomics [5,31,99,100]. Both methods rely on the reaction of silylation, i.e., the substitution of acidic protons of polar groups with organosilicon groups: trimethylsilyl (TMS, (CH₃)₃Si–) and tert-butyl(dimethyl)silyl (TBDMS) [35]. The resultant trimethylsilyl (TMS) derivatives are well soluble in non-polar organic solvents (chloroform, hexane, and dichloromethane) and demonstrate low reactivity, high volatility and thermal stability [35]. N-methyl-N-(trimethylsilyl)trifluoroacetamide (MSTFA) and N,O-bis(trimethylsilyl)trifluoroacetamide (BSTFA) are the most widely used and efficient among commercially available derivatization reagents due to their capacity to quickly derivatize polar groups of metabolites, leading to improved stability and better detection sensitivity for the resulting trimethylsilyl derivatives in GC-MS analysis [45]. Fortunately, the trimethylsilylation reaction is typically accomplished at moderate or even ambient conditions (temperature range: 30–90 °C; time range: 0.5–2 h) [29,97]. Other silylation agents, such as N,O-bis(trimethylsilyl)acetamide (BSA), N-trimethylsilylacetamide (TMSA), and N-trimethylsilylimidazole (TMSI), can also be used for derivatization in metabolomics studies. However, they are not widely used due to their lower reactivity and less efficient silylation process. Trimethylsilylation, despite its high efficiency and multiple advantages, has several intrinsic limitations. 
Trimethylsilylation agents and TMS derivatives are highly prone to hydrolytic degradation and are therefore extremely sensitive even to traces of water [21]. In the presence of moisture, the agents decompose, producing N-methyltrifluoroacetamide and hexamethyldisiloxane. Therefore, prior to derivatization, the extracts need to be thoroughly dried. Our experience indicates that for dried extracts, additional short (20 min) drying under reduced pressure directly prior to derivatization essentially improves the reproducibility of the analysis (unpublished observation). On the other hand, this high water sensitivity can be overcome by application of N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide (MTBSTFA) and N-methyl-bis(trifluoroacetamide) (MBTFA), due to their experimentally discovered lower susceptibility to hydrolysis, making them more suitable for samples with higher water contents or greater water sensitivity [101,102]. The increased stability of tert-butyldimethylsilyl (TBDMS) derivatives allows for a more accurate and reliable analysis of compounds, especially those prone to degradation in the presence of water. However, due to the larger number of carbon atoms introduced by the derivatization reagent, TBDMS derivatives have different retention times (t_R) and retention indices (RIs) relative to TMS derivatives, and the analysis time is increased [103].

One of the crucial steps in preparing samples for GC-MS analysis is the use of O-methylhydroxylamine hydrochloride (MeOX) for derivatization. This derivatization helps to reduce the number of spatial isomers of sugars from five to two and thereby increases the accuracy of sugar identification [47,104]. Sequential derivatization with MeOX and a trimethylsilylation reagent (i.e., combined in a two-step process) is a common practice in plant metabolomics, employed to achieve a comprehensive GC-MS analysis of metabolites. In the first step, MeOX dissolved in pyridine (which also serves as the catalyst of the subsequent silylation reaction) is used for the derivatization of selected targets such as sugars, increasing their volatility and stability. Then, trimethylsilylation further improves the stability and volatility of the sugars and a wide range of other polar metabolites, making them compatible with GC-MS analysis. This protocol for the combined derivatization of polar metabolites proved to be efficient in a wide range of plant metabolomics studies, including the metabolic profiling of a wide range of plant species (A. thaliana, R. sativus, A. caudatus, B. napus, P. sativum, etc.) [5,99,105,106], and in studies devoted to alterations of the primary metabolome caused by plant aging or stress [5,107], and to the in vitro formation of glycation reaction products [108].

However, the combined MeOX and TMS derivatizations are not without limitations. MeOX derivatization is typically conducted at an elevated temperature (30 °C) and requires an extended period of time (1–2 h), which may introduce artifacts in crude samples. Examples are increases in the relative contents of pyroglutamic acid (also known as 5-oxoproline) and phosphoric acid, most probably caused by spontaneous cyclization of glutamic acid and by decay of organic phosphates (sugar phosphates), respectively, which can lead to misinterpretation of the results [36]. TMS derivatization, in turn, has several limitations of its own. First, it is accompanied by the appearance of various by-products which might coelute with the analyzed metabolites. Detailed structural annotation and mass spectrum description of the silylation by-products are given by Little [109]. Second, the silylation reaction is more reactive towards carboxyl and hydroxyl groups than towards amino groups. This leads to incomplete derivatization of the amino groups of amino acids [35]. As a result, trimethylsilylation of amino acids usually yields several (2–4) derivatives with different numbers of TMS groups [29,37,97,110]. Moreover, TMS derivatization of certain amino acids can change their structure and produce a derivative of a different amino acid. This can be exemplified by arginine derivatization, which can lead to the formation of ornithine lactam 2TMS, ornithine 3TMS and citrulline 3TMS [29,37]. Third, the elevated temperatures at which silylation usually takes place can cause the destruction of thermally unstable compounds. Therefore, it should be kept in mind that some compounds detected on chromatograms may be method-related artifacts, i.e., breakdown products of thermolabile compounds. Finally, attention must be given to the derivatization of crude samples with high amounts of sugars and/or organic acids, because differences in their pKa values could result in differences in derivative yields [97]. Thus, careful validation and consideration of potential artifacts are essential when applying MeOX and TMS/TBDMS derivatization in plant metabolomics studies, and further investigation of this problem is needed.

Fatty acids, a vital metabolite class of plants, play essential roles in various cellular processes, including energy storage and membrane structure, and serve as precursors for signaling molecules and secondary metabolites, making their analysis of significant interest in plant metabolomics studies [111,112]. However, due to their low volatility and poor ionization efficiency, the direct analysis of fatty acids by GC-MS is challenging. To address this issue, in addition to silylation, another commonly used derivatization reaction is transesterification with methanol, which converts fatty acids into more volatile and easily detectable derivatives known as fatty acid methyl esters (FAMEs). The reaction requires a strong acid catalyst, such as boron trifluoride (BF₃) or sulfuric acid, and elevated temperatures to ensure the complete conversion of fatty acids into FAMEs [35,113,114]. Derivatization of fatty acids into FAMEs offers the following advantages: (i) increased volatility, which allows for shorter retention times and better chromatographic separation, leading to improved sensitivity and resolution [35]; (ii) enhanced ionization efficiency, which results in higher-quality mass spectra and reliable identification of fatty acid species [45]; (iii) a broader range of analyzable fatty acids, including those with varying chain lengths and degrees of unsaturation. This versatility is essential for profiling the diverse fatty acid composition and elucidating the roles of different fatty acids in the physiological processes of plants [114,115].

Fatty acid derivatization into FAMEs has some limitations. The derivatization may result in incomplete conversion of fatty acids into their corresponding derivatives [35], leading to the underestimation or misidentification of certain fatty acids, and may introduce artifacts or chemical modifications that could potentially affect the interpretation of the results [94]. The efficiency of the transesterification can be influenced by reaction time, temperature, and the catalyst concentration. To minimize the potential for incomplete derivatization and, therefore, to ensure reproducibility and consistency across samples, it is crucial to optimize the reaction conditions [113].

Overall, the derivatization of polar compounds greatly expands the analytical capabilities of GC-MS. Through derivatization, polar and non-volatile compounds gain volatility as well as chemical and thermal stability in the form of the resultant derivatives. Thus, derivatization enables GC-MS to perform robust profiling of primary metabolites in plant samples, enhancing our understanding of plant metabolism in general and the roles of specific metabolites in altering plant metabolism during various physiological processes.

4. Analytical Acquisition

GC-MS analytical acquisition can be performed in scanning, selected ion monitoring (SIM), scan/SIM, selected ion recording, selected ion storage, tandem mass spectrometry (MS/MS), multiple reaction monitoring (MRM), selected reaction monitoring (SRM), and multistage mass spectrometry (MSⁿ) modes [116]. In scan mode, which is commonly used in untargeted and targeted analyses of MeOX-TMS derivatives of polar plant metabolites, a range of ions such as m/z 50–700 is often selected, and continuous variation of the voltages on the quadrupole mass analyzer allows scanning over this predetermined range. As a result of a GC-MS analysis, a total ion chromatogram (TIC) is built. It is a plot of the summed ion intensity in the given m/z range as a function of time. The TIC itself is two-dimensional (intensity versus time), but each point on it contains a mass spectrum, i.e., a distribution of the relative intensities of the ions acquired during a scan. A mass spectrum acquired at the apex of the TIC peak, or an average of the mass spectra collected across the TIC peak, is often used to identify the analyte by comparing its spectrum and RI with the corresponding entries in an MS library. In SIM mode, which is often used in targeted analysis, only a few ions of interest are selected to greatly increase the sensitivity of the mass analyzer to them. This is achieved by setting the quadrupole voltages to transmit only one ion or a small group of ions. SIM mode improves the signal-to-noise ratio (S/N) because the analyzer spends more time collecting the specific ions while reducing noise. Selecting the specific ions to analyze in SIM mode requires prior knowledge of the mass spectrum of the compound, which in turn can be obtained by analyzing the standard of interest in scan mode. Any issues that can impact instrument performance, such as extract degradation, column contamination, and t_R shifts, would affect the data acquisition results.
Therefore, to ensure rigorous quality control in targeted and untargeted GC-MS metabolomics, the following recommendations should be followed. The condition of the chosen analytical platform should be evaluated regularly (ideally before every analysis of experimental samples) according to standard protocols described in the instrument documentation and/or the scientific literature [117]. This is necessary to verify that instrument sensitivity has not changed, as is usually observed during metabolomic profiling because of contamination of the injector, the analytical column and the ion source by non-volatiles, as well as column degradation, which can also cause shifts in analyte t_Rs. This evaluation involves analyzing quality control samples (QCs) or mixes of standards. Particular attention should be paid to the content of amino acid TMS derivatives because these compounds are very susceptible to degradation during GC-MS analysis. In amino acid TMS derivatives, the ratio of nitrogen–silicon bond formation and decomposition strongly depends on the cleanliness of the injector (liner, syringe needle, and gas line) and the column [21,45]. To assess the potential losses of the derivatives and, therefore, the cleanliness of the GC system as a whole, it is recommended to monitor daily the abundance of the amino acid TMS derivatives relative to carbohydrate TMS derivatives [45]. Additionally, to maintain chromatographic resolution, method development should also consider the maximum number of injections per GC column [21]. For large-scale plant metabolomics studies, it is important to replace columns and consumables before quality issues are encountered. It is also important to optimize the amount of sample loaded onto the GC capillary column to avoid column overloading. For that, a sequential GC-MS analysis of increasing sample doses is performed. Then, the optimal sample dose (i.e., the sample quantity that does not cause column overload) is determined from the linear relationship between the applied sample doses and the peak areas obtained from the corresponding chromatograms.
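The column-load check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the doses, peak areas, the number of low doses used for the fit (`n_fit`) and the 10% deviation tolerance are all hypothetical and are not taken from the cited protocols.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def max_linear_dose(doses, areas, n_fit=3, tolerance=0.10):
    """Largest dose whose peak area stays within `tolerance` (relative)
    of the line fitted to the `n_fit` lowest doses."""
    pairs = sorted(zip(doses, areas))
    slope, intercept = fit_line([d for d, _ in pairs[:n_fit]],
                                [a for _, a in pairs[:n_fit]])
    best = None
    for dose, area in pairs:
        predicted = slope * dose + intercept
        if abs(area - predicted) > tolerance * abs(predicted):
            break  # the response has left the linear range (overload)
        best = dose
    return best


# Hypothetical injected sample doses and the corresponding peak areas.
doses = [0.5, 1.0, 2.0, 4.0, 8.0]
areas = [50.0, 100.0, 200.0, 390.0, 600.0]
print(max_linear_dose(doses, areas))
```

In practice the check would be repeated for several abundant metabolites, since the most concentrated analytes overload the column first.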

Finally, several recommendations for designing sample sequences (batches) are as follows: (i) arrange hexanes, blanks, a mix of alkanes, metabolite standards and QCs in the same batch as the experimental samples; (ii) randomize the experimental samples across the batch; (iii) place up to six QCs in different positions across the batch (Figure 2).
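A batch built according to recommendations (i)–(iii) might be generated as in the following Python sketch. The exact order of the leading injections, the trailing blank, the even spacing of QCs and all sample names are assumptions made for illustration; they are not a transcription of Figure 2.

```python
import random


def build_sequence(sample_ids, n_qc=6, seed=0):
    """Return an injection order: system checks first, then randomized
    samples interleaved with up to `n_qc` pooled QC injections."""
    rng = random.Random(seed)          # fixed seed keeps the order reproducible
    samples = list(sample_ids)
    rng.shuffle(samples)               # (ii) randomize the experimental samples

    # (i) solvent, blank, alkane mix and standards share the batch with samples
    sequence = ["hexane", "blank", "alkane_mix", "standards_mix", "QC_1"]

    # (iii) spread the remaining QCs evenly between blocks of samples
    n_blocks = max(1, n_qc - 1)
    block = max(1, -(-len(samples) // n_blocks))   # ceiling division
    qc_number = 1
    for start in range(0, len(samples), block):
        sequence.extend(samples[start:start + block])
        qc_number += 1
        sequence.append(f"QC_{qc_number}")
    sequence.append("blank")
    return sequence


if __name__ == "__main__":
    for position, name in enumerate(build_sequence(
            [f"S{i:02d}" for i in range(20)]), start=1):
        print(position, name)
```

Recording the seed together with the sequence makes it possible to reconstruct the injection order later, which helps when signal drift has to be modeled against run order.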

Including blanks in a GC-MS sample sequence is needed to monitor possible contamination which might occur during GC-MS analysis. Blanks contain solvents used in sample preparation but without the analytes of interest. They are processed along with experimental samples (Figure 2). Contamination in blank chromatograms is indicated as so-called “ghost peaks”, i.e., peaks that are not expected to be in a chromatogram, and background (noise) signals [57]. Examining the mass spectra of the ghost peaks found in blanks helps to identify potential contamination sources such as reagents and solvents, vial septa, injection septa, liners, and columns [118]. Thus, monitoring contaminations with blanks helps to detect ghost peaks and exclude them from obtained sample-related GC-MS data, leading to more robust interpretations of metabolomic profiles. Additionally, the introduction of a derivatizing agent in a GC system before blanks allows column conditioning, during which residual contaminants are flushed out, making the column fit for reliable use [102]. Linear alkanes are included in GC-MS analysis to be used in t_R calibration, allowing for consistent alignment and matching of analyte t_Rs across different GC-MS runs [119].

QC samples, usually obtained by mixing equal aliquots of each biological sample, are essential for assessing within- and between-series repeatability and for removing features with excessive signal drift prior to statistical analysis. They are also used for equilibrating the analytical platform, checking metabolite profiles, calculating technical precision, signal correction, and standardization [57,87].
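One common way to use QCs for calculating technical precision is to compute the relative standard deviation (RSD) of each feature across the QC injections and to discard features that exceed a cut-off. The sketch below assumes a 30% cut-off and uses invented feature names and peak areas; the appropriate threshold depends on the study and is not prescribed by the text above.

```python
from statistics import mean, stdev


def qc_rsd(values):
    """Relative standard deviation (%) of one feature across QC injections."""
    return 100.0 * stdev(values) / mean(values)


def filter_by_qc_rsd(qc_areas, max_rsd=30.0):
    """Keep features whose QC RSD does not exceed `max_rsd` (assumed cut-off)."""
    return {feature: qc_rsd(values)
            for feature, values in qc_areas.items()
            if qc_rsd(values) <= max_rsd}


# Hypothetical peak areas of two features in five pooled QC injections.
qc_areas = {
    "alanine_2TMS": [100.0, 105.0, 95.0, 102.0, 98.0],
    "unknown_17": [100.0, 180.0, 40.0, 150.0, 60.0],
}
kept = filter_by_qc_rsd(qc_areas)
for feature, rsd in kept.items():
    print(f"{feature}: RSD = {rsd:.1f}%")
```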

Standards of expected metabolites are included in the sample batch and serve as reference compounds both for accurate identification of detected analytes and for their absolute quantification [57,87]. Ideally, the standards should cover a wide range of chemical properties and concentrations to match the diversity of the analytes present in the sample. The selection of standards usually relies on the literature data and the personal experience of a researcher in working with the biological object under study. Based on this information, a list of expected metabolites of the object under study is compiled.

Analyte Absolute and Relative Quantification

Quantification of the analytes in samples is based on the registration of quantitative chromatographic information such as peak areas or peak heights. To establish the relationship between the quantitative characteristics and analyte concentration, calibration curves are constructed by analyzing a series of standard solutions with known concentrations (Figure 3). Analysis of dilution series prepared for individual standards is laborious and time-consuming. Therefore, it is possible to prepare a mixture of known compounds, while observing an important condition for the preparation of such mixes—the t_Rs of the standards should not overlap [88,108]. In addition, when calibrating concentrations of the standards, to reduce possible risk of coelution of two or more standards and their co-occurrence in the same TIC peak, it is recommended to use not the area (height) of the TIC peak, but the peak parameters obtained from an extracted ion chromatogram (XIC), i.e., a chromatogram reconstructed for the analyte-specific m/z and t_R values (Figure 3a–c).
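The difference between integrating a TIC peak and an XIC peak can be illustrated with a small Python sketch. The scan data are invented: two hypothetical analytes coelute and are told apart only by their specific ions (m/z 319 and 174 are used as examples). The data layout, the ±0.5 m/z tolerance and the trapezoidal integration are assumptions of this sketch, not a description of any particular software.

```python
def tic(scans):
    """Total ion chromatogram: summed intensity of every scan."""
    return [(t, sum(spectrum.values())) for t, spectrum in scans]


def xic(scans, mz, tol=0.5):
    """Extracted ion chromatogram for one m/z (within +/- tol)."""
    return [(t, sum(i for m, i in spectrum.items() if abs(m - mz) <= tol))
            for t, spectrum in scans]


def peak_area(trace, t_start, t_end):
    """Trapezoidal peak area between two retention times."""
    pts = [(t, y) for t, y in trace if t_start <= t <= t_end]
    return sum((t2 - t1) * (y1 + y2) / 2
               for (t1, y1), (t2, y2) in zip(pts, pts[1:]))


# Hypothetical scans: (t_R in seconds, {m/z: intensity}).
scans = [
    (600, {73: 100, 319: 0, 174: 0}),
    (601, {73: 300, 319: 50, 174: 20}),
    (602, {73: 500, 319: 100, 174: 80}),
    (603, {73: 300, 319: 50, 174: 100}),
    (604, {73: 100, 319: 0, 174: 40}),
]

print("TIC area:        ", peak_area(tic(scans), 600, 604))
print("XIC m/z 319 area:", peak_area(xic(scans, 319), 600, 604))
print("XIC m/z 174 area:", peak_area(xic(scans, 174), 600, 604))
```

The TIC area mixes both analytes and the shared m/z 73 signal, whereas each XIC area reflects a single analyte, which is why XIC-based parameters are preferred for calibration.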

Calibration is a fundamental process used to establish a relationship between the instrumental response and an analyte concentration [120]. This relationship enables accurate quantification and identification of the target analyte. Two common calibration approaches are used for absolute quantitative analysis—external standardization and the standard addition method—each with its own advantages and limitations and suitable for different analytical scenarios [120,121].

In the case of external standardization, a series of calibration solutions with known concentrations of reference compounds (standards) corresponding to the target metabolites of interest is prepared, at least in triplicate. These calibration solutions are then analyzed in the same batch as the plant samples to confirm the identification of metabolites by coelution with the corresponding standards and to construct calibration curves establishing a linear relationship between the standard concentrations and the quantitative characteristics (area or height) of the corresponding chromatographic peaks. The calibration curve can be plotted using either the actual or the log-transformed values of peak area (height) and standard concentration (Figure 3d). While ordinary calibration is suitable for measuring metabolites within a narrow concentration range, log-transformed calibration allows for a more uniform coverage of a range spanning several orders of magnitude, covering both low- and high-abundance metabolites in biological samples. It should be noted that the points on the calibration curve should be equidistant from each other, which a dilution series plotted in log-transformed coordinates achieves. Based on the calibration curves, the LDR, limit of detection (LOD) and limit of quantification (LOQ) for each standard should be determined [117]. LDR indicates the maximum concentration range over which a calibration curve remains linear. The curve linearity is usually assessed by the coefficient of determination, R² [122]. LOD is the lowest concentration of an analyte at which its presence in a sample can be inferred, and LOQ is the lowest concentration at which the analyte can be reliably quantified in a sample [123,124]. Since mass spectrometers can detect metabolites over several orders of magnitude of concentration, a calibration curve plotted in log-transformed coordinates, in which the LDR typically spans several orders of magnitude, is often preferred. Log-transformed calibration thus helps to keep the results within the linear range, where the mass spectrometer operates most sensitively and accurately. In this way, it covers a wide concentration range and improves the accuracy and reliability of metabolite content measurements.
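A log-transformed external calibration can be sketched as follows. The ten-fold dilution series, the peak areas and the concentration unit are hypothetical; the fit is an ordinary least-squares line on log10-transformed values, and R² is computed as the usual coefficient of determination.

```python
from math import log10


def fit_line(xs, ys):
    """Least-squares line; returns (slope, intercept, r_squared)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return slope, intercept, 1.0 - ss_res / ss_tot


def concentration_from_area(area, slope, intercept):
    """Invert log10(area) = slope * log10(conc) + intercept."""
    return 10 ** ((log10(area) - intercept) / slope)


# Hypothetical ten-fold dilution series (arbitrary concentration units):
# the points are equidistant after log transformation.
conc = [0.01, 0.1, 1.0, 10.0, 100.0]
area = [2.0e2, 2.0e3, 2.0e4, 2.0e5, 2.0e6]

slope, intercept, r2 = fit_line([log10(c) for c in conc],
                                [log10(a) for a in area])
print(f"slope = {slope:.3f}, intercept = {intercept:.3f}, R2 = {r2:.4f}")
print("sample concentration:", concentration_from_area(5.0e3, slope, intercept))
```

Only sample areas that fall between the lowest and highest calibration points should be back-calculated this way; values outside the LDR require dilution or re-injection.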

For external standardization, standards are selected that represent the target metabolites of interest and maintain their structural stability at high temperatures under GC-MS conditions. This approach allows determination of the content of target metabolites in samples, and, therefore, comparison of the metabolite levels in samples collected from different experimental conditions, which provides valuable information about the metabolic responses of the object under study [5,99,107].

However, external standardization may not fully account for matrix effects present in the complex plant samples and associated instrument variations, particularly for minor metabolites, leading to potential inaccuracies [97]. To address this issue, the standard addition method is employed [2,102]. In this method, serial dilutions of known quantities of a standard are prepared similarly to the external calibration, but the solvent is an extract of the plant sample (matrix) (Figure 3c). Adding a standard of known concentration to the sample matrix allows a sample-specific calibration curve to be constructed, taking into account the influence of the matrix and associated instrumental variations [97,102]. The LDR obtained from the calibration curves is used to determine the concentrations of the targets in the plant samples (Figure 3e) [89].
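For orientation, the classical extrapolation form of standard addition is sketched below; it may differ in detail from the workflow shown in Figure 3. Known amounts of a standard are spiked into aliquots of the same extract, peak area is regressed on the added concentration, and the endogenous concentration is estimated as intercept divided by slope. All numbers are invented.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def standard_addition(added_conc, areas):
    """Endogenous concentration = intercept / slope, i.e. the magnitude
    of the x-intercept of the spiked-sample calibration line."""
    slope, intercept = fit_line(added_conc, areas)
    return intercept / slope


# Hypothetical spikes (same units as the result) and XIC peak areas.
added = [0.0, 1.0, 2.0, 4.0]
areas = [300.0, 500.0, 700.0, 1100.0]
print(standard_addition(added, areas))
```

Because the calibration line is built in the sample matrix itself, its slope already includes matrix-related signal suppression or enhancement.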

Relative quantification is based on a comparison of quantitative peak characteristics (areas or heights) obtained from samples subjected to experimental conditions with those of control samples. This type of quantification is usually used for compounds for which pure standards do not exist or are difficult to obtain, as well as for unknown compounds. Relative quantification aims to distinguish different groups of samples by monitoring changes in their metabolite levels and to identify groups of marker metabolites potentially related to the factor causing the group differentiation.

5. Data Interpretation

Owing to the impressive development of MS-based analytical platforms, the analysis of biological samples today generates massive amounts of raw data [125,126]. As each GC-MS run might contain a few hundred features, manual processing of such datasets is not merely impractical but effectively impossible. Therefore, adequate analysis of the acquired datasets requires software tools for proper data mining and interpretation. These tools need to consider the different hardware platforms potentially applicable in metabolomics. Currently, there is a broad selection of commercial and freely available programs that support batch-mode data interpretation with programming and automation of nearly all steps of data mining (pre-processing), processing and post-processing (statistical interpretation) (Table 2) [127,128,129].

Data mining in GC-MS-based metabolomics includes the following steps: baseline correction, mass spectrum deconvolution of analytes, peak recognition, peak picking and chromatogram alignment [130]. This workflow is typically followed by a more or less standardized data processing procedure which is finalized with structural annotation of the aligned features (this is often referred to as identification, although the correctness of this term in this case is questionable [57]) and their further quantification via integration of peak area. Basic steps of data mining, processing and post-processing are shown in Figure 4.

Alignment is the pre-processing procedure that matches corresponding peaks across the sample runs of a dataset, whether it comprises a small or a large number of runs. Chromatographic shifts (i.e., variations in t_R across the experiment), caused, for example, by changes in carrier gas velocity [131], can lead to misalignment of peaks and hinder accurate comparison of chromatographic peaks, often referred to as features. After completing the alignment, verification is required to confirm that corresponding peaks are aligned in a consistent manner [57,127]. The correctness of the alignment procedure is critically important for identification and quantification of metabolites across the samples and for adequate and reliable statistical analysis. Furthermore, baseline correction is performed to remove baseline drift or fluctuations that may be present in the chromatographic data due to column stationary-phase bleed, background ionization, and low-frequency variations in the detector and/or instrument-controlled parameters (e.g., temperature or gas flow rate) [130]. Baseline correction procedures are specifically designed to correct low-frequency noise and offsets without affecting higher-frequency variations, aiming to enhance the visibility of metabolites and improve the accuracy of peak integration and quantification [132].

There are different methods available to reduce high-frequency noise variations and enhance the quality of chromatograms by improving the S/N ratio. One commonly used technique is smoothing, which involves fitting a low-order polynomial to each data point and its neighbors using the Savitzky–Golay method [133]. Another approach is wavelet smoothing, which transforms the chromatogram into the frequency domain, removes high-frequency noise, and then converts it back to the time domain [134]. These noise reduction techniques play a crucial role in addressing analytical challenges and enabling the discovery of meaningful chemical differences between samples. However, it is important to select the appropriate parameters to avoid distorting the genuine chemical signals and introducing artifacts during the noise removal process [130].
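As a minimal illustration of Savitzky–Golay smoothing, the sketch below applies the standard 5-point quadratic/cubic convolution coefficients (−3, 12, 17, 12, −3)/35 to an intensity trace. The fixed window, the untouched edge points and the test signals are simplifications chosen for this sketch; general-purpose implementations such as `scipy.signal.savgol_filter` allow arbitrary window lengths and polynomial orders.

```python
# 5-point Savitzky-Golay coefficients for a quadratic (or cubic) local fit.
SG5_COEFFS = (-3, 12, 17, 12, -3)
SG5_NORM = 35


def savgol5(signal):
    """Smooth a trace with a 5-point Savitzky-Golay filter.
    The first and last two points are returned unchanged."""
    smoothed = list(signal)
    for i in range(2, len(signal) - 2):
        window = signal[i - 2:i + 3]
        smoothed[i] = sum(c * y for c, y in zip(SG5_COEFFS, window)) / SG5_NORM
    return smoothed


# A one-scan noise spike is strongly attenuated ...
spike = [0.0, 0.0, 0.0, 35.0, 0.0, 0.0, 0.0]
print(savgol5(spike))

# ... while a smooth quadratic peak shape passes through unchanged.
parabola = [float(x * x) for x in range(7)]
print(savgol5(parabola) == parabola)
```

The second example shows why the window and polynomial order matter: signals that are well described by the local polynomial are preserved, whereas a window that is too wide relative to the peak width would flatten genuine peaks.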

Another critical aspect of the pre-processing pipeline is data cleaning. This involves the detection, identification (if possible), and removal of artifacts, as well as instrumental noise, that may be present in the raw data. Artifacts can arise from various sources, such as sample preparation, instrument performance, or experimental conditions [57]. By carefully checking the data and applying appropriate filtering techniques, these unwanted features (ghost peaks) can be eliminated, thereby improving the accuracy and reliability of the subsequent analysis.

To date, the above-described pipeline is implemented in practice as a part of multiple software solutions. Thus, at least some steps of this workflow or the whole pre-processing pipeline can be accomplished within the XCMS platform [135,136], along with free-access tools like AMDIS (for qualitative analysis) [137], OpenChrom [138], MS-DIAL [139] (for qualitative and quantitative analysis), and others, including commercially available software (Table 2). However, the results need to be manually reviewed, and the instrument-related settings need to be adjusted if necessary. Ideally the processing method needs to be validated with the set of known standards and/or QCs, as described above.

After all data-mining steps have been completed, data processing is performed, starting with structural annotation (identification) of metabolites. It should be noted that the primary benefit of GC-EI-MS is the fact that electron ionization (EI) is a hard ionization technique [23]. Due to this, the fragmentation behavior of individual analytes (both in terms of m/z patterns and relative intensities of individual signals) is highly reproducible and depends only on electron energy, not on the mass analyzer type or instrument vendor [23,29]. Based on the ionization/fragmentation efficiency distribution of natural products, an electron energy of 70 eV was selected as the standard condition for metabolomics experiments [140]. This setting yields highly informative EI mass spectra, which provide rich structural information about the analyte. This information can be used for structural annotation and, in the best-case scenario, for unambiguous identification of the metabolite. In GC-EI-MS-based metabolomics, this task can be performed by two principal strategies. First, the mass spectra, acquired for the metabolite MeOX-TMS derivatives in the experimental samples, can be compared with those of the corresponding authentic standards (ideally in coelution experiments) to confirm the structural identity of detected analytes [141]. Alternatively, the structures behind the detected features can be annotated by a spectral similarity search, which typically relies on one or several databases—spectral libraries [141,142].

Despite its great impact on the structural annotation of metabolites, mass spectrometry data alone are often insufficient for the identification (i.e., exact and complete structure assignment) of detected compounds, particularly when just minor structural differences between isomers with identical mass spectra need to be distinguished. In this context, a proper annotation of epimeric monosaccharides (e.g., glucose, galactose and mannose, which are typically simultaneously present in plant extracts) is a challenging task, which, obviously, cannot be solved only by interpretation of the mass spectra. Therefore, annotation of such sugars (or other compound classes featuring pronounced isomerism) also relies on other criteria, specifically metabolite-specific t_Rs and Kovats or Fiehn RIs, which describe the retention of substances relative to the series of n-alkanes or FAMEs, respectively [143].

Although t_R provides important information for compound identification, it cannot be considered a reproducible parameter, as it is strongly susceptible to batch effects. Indeed, any instrument- and/or time-specific alteration in GC-MS conditions (capillary column parameters, column ageing, temperature gradient, etc.) changes the conditions of chromatographic separation and, therefore, results in t_R shifts. This might directly cause problems with accurate compound identification [144]. To compare the retention behavior of individual compounds observed in GC-MS experiments accomplished in different laboratories under different instrumental conditions, or even at different times within the same laboratory, RIs are usually employed.
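The conversion of a t_R into an RI against an n-alkane ladder can be sketched as below. The sketch assumes the linear, temperature-programmed form of the index (linear interpolation between the two bracketing alkanes, scaled so that an alkane with n carbons has RI = 100·n); the alkane retention times are invented.

```python
from bisect import bisect_right


def retention_index(t_r, alkanes):
    """Linear retention index of an analyte eluting at `t_r`.

    `alkanes` maps carbon number -> retention time of that n-alkane,
    measured in the same batch (same units as `t_r`)."""
    ladder = sorted((t, n) for n, t in alkanes.items())
    times = [t for t, _ in ladder]
    if not times[0] <= t_r <= times[-1]:
        raise ValueError("t_R lies outside the alkane ladder")
    i = min(bisect_right(times, t_r), len(ladder) - 1)
    (t_lo, n_lo), (t_hi, n_hi) = ladder[i - 1], ladder[i]
    return 100.0 * (n_lo + (n_hi - n_lo) * (t_r - t_lo) / (t_hi - t_lo))


# Hypothetical alkane ladder: carbon number -> t_R in minutes.
alkanes = {10: 5.0, 12: 8.0, 15: 12.5, 19: 18.0}
print(retention_index(6.5, alkanes))
```

If column ageing shifts both the analyte and the bracketing alkanes by a similar amount, the index stays nearly constant, which is what makes it transferable between batches and laboratories.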

As mentioned above, t_Rs, RIs and mass spectra together provide important chromatographic and spectral data (GC-MS information) for subsequent analyte annotation. However, to effectively utilize this information, match factors are used to quantify the accuracy of the analyte annotation based on the similarity of the experimental mass spectrum and/or RI to a library reference [141,145,146,147]. Most GC-MS data processing tools (Table 2) have in-built algorithms that determine the match factor, which allows for automatic annotation of analytes based on a comparison with the obtained mass spectra and comparison of RIs with the corresponding available information from in-house or open MS libraries. Among the publicly available MS libraries which contain spectral and RI data for annotation of metabolites, the most widely used are the National Institute of Standards and Technology (NIST; EI mass spectra of ~300,000 compounds, https://webbook.nist.gov/chemistry/ (accessed on 28 November 2025)), the Human Metabolome Data Base [148] (HMDB; ~200,000 entries with EI mass spectra of both lipid- and water-soluble compounds, https://hmdb.ca/ (accessed on 15 January 2022)), MassBank [149] (>90,000 unique EI mass spectra, https://massbank.eu (accessed on 25 November 2025)), and the Golm Metabolome Database [150] (GMD; EI mass spectra of >3000 reference substances and plant metabolites, http://gmd.mpimp-golm.mpg.de/ (accessed on 5 January 2025)); other databases are available as well (Table 2).
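How a match factor and an RI window work together can be shown with a deliberately simple Python sketch. The score used here is a plain cosine similarity between two spectra; it is an assumption of this sketch and is simpler than the weighted scoring schemes used by library search software. The library entries, their RI values, the 10-unit RI window and the 0.8 score cut-off are all hypothetical.

```python
from math import sqrt


def cosine_match(query, reference):
    """Cosine similarity (0..1) of two spectra given as {m/z: intensity}."""
    shared = set(query) & set(reference)
    dot = sum(query[m] * reference[m] for m in shared)
    norm_q = sqrt(sum(v * v for v in query.values()))
    norm_r = sqrt(sum(v * v for v in reference.values()))
    if norm_q == 0 or norm_r == 0:
        return 0.0
    return dot / (norm_q * norm_r)


def annotate(query, query_ri, library, ri_window=10.0, min_score=0.8):
    """Best library hit that passes both the RI window and the score cut-off."""
    hits = [(cosine_match(query, entry["spectrum"]), name)
            for name, entry in library.items()
            if abs(entry["ri"] - query_ri) <= ri_window]
    hits = [hit for hit in hits if hit[0] >= min_score]
    return max(hits) if hits else None


# Two hypothetical isomers with identical spectra but different RIs.
spectrum = {73: 1000, 205: 400, 217: 300, 319: 500}
library = {
    "hexose_A": {"ri": 1890.0, "spectrum": dict(spectrum)},
    "hexose_B": {"ri": 1905.0, "spectrum": dict(spectrum)},
}
print(annotate(spectrum, 1903.0, library))
```

The example mirrors the situation of epimeric sugars: the spectral score alone cannot separate the two candidates, and only the RI filter resolves the annotation.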

Despite the advantage of automatic annotation in rapid analyte identification, the results obtained can often be erroneous because the correctness of automatic annotation is highly dependent on the quality of the analyte mass spectrum. Therefore, to improve the reliability of automatic annotation, the analyte spectrum must first be deconvolved to remove noise signals. Then a presumptive annotation should be carefully checked based on the GC-MS information of coeluting reference compounds to establish reliable identification of the analyte. If the GC-MS information of an analyte does not match confidently any coeluted or library reference, the analyte structural annotation (i.e., analyte identification) cannot be established. In this case, it remains possible to perform an annotation to a specific chemical class. The extensive use of trimethylsilylation and EI at 70 eV, which provides the high reproducibility of GC-MS analyses, allowed us to relate the t_R of the analyte to its approximate molecular weight and to roughly estimate the possible chemical nature of the analyte depending on the retention time region (t_R window) in which the given primary metabolite is located in the chromatogram (Figure 5). To more accurately assign an analyte to a specific chemical group, mass spectrometric information is needed.

TMS derivatives are formed by compounds possessing various functional groups such as alcohols, carboxylic acids, carbonyls (enol-TMS ethers), phosphates, amines, thiols, etc. Elucidation of the fragmentation patterns of these compound groups, and of the mechanisms by which their fragments are formed, makes it possible to specify characteristic (diagnostic) fragments that appear in the mass spectra of each group as one or a few specific, relatively intense signals. For example, the diagnostic fragments for 1TMS and 2TMS amine derivatives are specific signals at m/z values of 102 and 174, respectively; for TMS esters of fatty acids, the m/z values are 132, 117, 145 and 129; for MeOX-TMS C6-aldosaccharides, the m/z values are 319, 217 and 205; for TMS glycerol-phosphates, the m/z values are 299, 315 and 445; while for TMS sugar-phosphates, the m/z values are 299, 315 and 387. The detailed mechanisms of formation of these and other diagnostic fragments characterized by specific m/z values were reviewed by Harvey and Vouros [151]. The information on diagnostic signals at specific m/z values, together with knowledge of chemical group-specific t_R windows, allows the analyte to be annotated according to its chemical class. However, if detected analytes cannot be annotated by reference spectra to an individual compound or even to a chemical class, they need to be regarded as unknowns. In this case, their annotation relies on the unique t_Rs, RIs, and EI mass spectra. The patterns of the most characteristic and intense MS signals (characteristic m/z values of the diagnostic ions) ideally complement RI data for primary annotation of unknowns [29,57,140].
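Class-level annotation from diagnostic ions can be sketched as a simple lookup. The m/z values are those listed in the paragraph above; the rule that all diagnostic ions of a class must be present, the 5% relative-intensity threshold and the example spectrum are assumptions made for illustration only.

```python
# Diagnostic m/z values per derivative class, as listed in the text.
DIAGNOSTIC_IONS = {
    "amine, 1TMS": (102,),
    "amine, 2TMS": (174,),
    "fatty acid, TMS": (132, 117, 145, 129),
    "C6 aldose, MeOX-TMS": (319, 217, 205),
    "glycerol phosphate, TMS": (299, 315, 445),
    "sugar phosphate, TMS": (299, 315, 387),
}


def candidate_classes(spectrum, min_rel_intensity=0.05):
    """Classes whose diagnostic ions are all present above the threshold.

    `spectrum` maps nominal m/z -> intensity; intensities are compared
    with the base peak (assumed 5% threshold)."""
    base_peak = max(spectrum.values())
    present = {mz for mz, intensity in spectrum.items()
               if intensity >= min_rel_intensity * base_peak}
    return [name for name, ions in DIAGNOSTIC_IONS.items()
            if all(mz in present for mz in ions)]


# Hypothetical spectrum of an unknown feature.
unknown = {73: 1000, 147: 300, 205: 400, 217: 350, 319: 500}
print(candidate_classes(unknown))
```

A hit from such a lookup is only a class-level hypothesis; in line with the text, it should be combined with the class-specific t_R window before an annotation is accepted.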

A typical GC-MS run delivers hundreds of annotated features, each of which corresponds to a chromatographic peak [2,21]. However, several features can represent one metabolite (for example, different MeOX-TMS derivatives of the same metabolite) [141]. The intensities of an analyte in different samples can be compared on a relative basis, assuming a linear dependence between peak area (or height) and compound concentration. Because of the high probability of coelution, TICs are not recommended for peak integration. Instead, XICs can be reconstructed for characteristic m/z values (nominal or exact, depending on the instrumental platform used), and the signals of individual metabolites can be integrated at characteristic t_R values [29]. The integrated peak areas are used in the further steps of the data processing workflow, which are statistical analysis and biological interpretation of the resultant statistically confirmed data [57,140,152].

When working with data obtained by either targeted or untargeted strategies, the processing step yields a so-called feature quantification matrix (FQM) or biomatrix, which contains the peak areas (heights) obtained from GC-MS data for all features in all experimental samples [57,140,153]. Values of these quantitative characteristics (i.e., peak areas or heights) may be missing because certain compounds may not be detected in some samples. Such missing values may affect further statistical analysis and interpretation of the results. Depending on the reasons for the missing values (biological or technical, random or not), various imputation algorithms are applied, such as replacement by mean or median, k-nearest neighbors (kNN), probabilistic PCA (PPCA), Bayesian PCA (BPCA), singular value decomposition (SVD), random forest (RF) and quantile regression imputation of left-censored data (QRILC). All these methods have their advantages and limitations, and the reader is referred to recent reviews on this topic [154,155]. Importantly, before an imputation algorithm is applied, the biomatrix has to be filtered based on the percentage of missing values. This should be accomplished in accordance with the so-called “80% rule”, which sets a threshold on the percentage of missing values allowed for a specific metabolite, such as 20%. If the proportion of missing values for a metabolite exceeds the threshold, imputation loses its statistical meaning because it would distort the data matrix and lead to inaccurate results [154]. Therefore, metabolites with a high percentage of missing values have to be excluded from further analysis, or additional experiments have to be conducted to fill in the gaps.
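The filtering and imputation steps can be sketched as follows. The sketch applies the 20% missing-value threshold across all samples and then uses median replacement, one of the simple options named above; some implementations apply the threshold within each experimental group instead, and the choice of imputation method should follow the reason for missingness. Feature names and values are hypothetical.

```python
from statistics import median


def apply_80_percent_rule(matrix, max_missing=0.20):
    """Keep features whose fraction of missing values (None) does not
    exceed `max_missing`. `matrix` maps feature -> list of peak areas."""
    kept = {}
    for feature, values in matrix.items():
        missing_fraction = sum(v is None for v in values) / len(values)
        if missing_fraction <= max_missing:
            kept[feature] = values
    return kept


def impute_median(values):
    """Replace missing values by the median of the observed values."""
    fill = median(v for v in values if v is not None)
    return [fill if v is None else v for v in values]


# Hypothetical biomatrix: five samples, None marks a missing value.
biomatrix = {
    "sucrose_8TMS": [10.0, 12.0, None, 11.0, 13.0],   # 20% missing: kept
    "unknown_3": [5.0, None, None, 6.0, None],        # 60% missing: dropped
}
filtered = apply_80_percent_rule(biomatrix)
imputed = {feature: impute_median(values) for feature, values in filtered.items()}
print(imputed)
```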

Before the imputation of missing values, the biomatrix is cleaned of the intensities of ghost peaks detected in blank samples, and normalization is performed. Various normalization strategies are employed in plant metabolomics. Fresh weight (FW) or dry weight (DW) normalization adjusts metabolite levels based on the sample’s fresh or dry weight, helping to account for variations in the amount of sample material. Total sum normalization (TSN) and median normalization (MN) are used to adjust the overall intensity of samples, ensuring that the total signal or median of the dataset is consistent across samples [156,157]. Additionally, probabilistic quotient normalization (PQN) accounts for systematic biases by scaling each sample by the median of the quotients between its metabolite levels and those of a reference profile [158]. After imputation of missing values, data scaling is required to adjust each variable by a factor computed from its dispersion, which helps to improve the performance of further statistical analysis [154,158].
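A minimal sketch of three of the operations named above (TSN, PQN and scaling) is given below. It assumes a complete matrix with samples as rows and features as columns. Using the feature-wise median across samples as the PQN reference profile and the standard deviation as the scaling factor (autoscaling) are assumptions made for illustration, and all function names and numbers are hypothetical.

```python
from statistics import mean, median, stdev


def total_sum_normalize(sample):
    """TSN: divide every intensity by the total signal of the sample."""
    total = sum(sample)
    return [x / total for x in sample]


def pqn_normalize(samples):
    """PQN: divide each sample by the median quotient to a reference profile.

    Assumption: the reference profile is the feature-wise median across samples.
    """
    n_features = len(samples[0])
    reference = [median(s[j] for s in samples) for j in range(n_features)]
    normalized = []
    for s in samples:
        quotients = [s[j] / reference[j] for j in range(n_features) if reference[j] > 0]
        factor = median(quotients)
        normalized.append([x / factor for x in s])
    return normalized


def autoscale(column):
    """Scale one feature to zero mean and unit standard deviation."""
    m, sd = mean(column), stdev(column)
    return [(x - m) / sd for x in column]


# Toy data: three samples with identical composition but different overall amounts.
samples = [
    [100.0, 200.0, 300.0],
    [200.0, 400.0, 600.0],
    [50.0, 100.0, 150.0],
]

tsn_first = total_sum_normalize(samples[0])
pqn = pqn_normalize(samples)
scaled_feature = autoscale([1.0, 2.0, 3.0])
```

In this toy case the three samples differ only by an overall factor, so PQN maps them onto the same profile. With real data, only the systematic component of the difference is removed.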

At the next post-processing step, univariate and multivariate statistical analysis, along with mathematical modeling, is applied to the multivariate data generated by metabolomics experiments to derive meaningful information and test for significant differences in individual metabolites between different groups [159,160,161]. Several univariate analysis methods are available for metabolomic data analysis. For example, parametric tests like ANOVA, Student’s t-tests, and z-tests are commonly applied when comparing differences between two or more groups, provided that the assumptions of normality and homogeneity of variances are met [159]. Normality can be verified using tests such as the Kolmogorov–Smirnov test, and homogeneity of variances using Bartlett’s test [161]. When normal distribution assumptions are not met, non-parametric methods like the Mann–Whitney U test or Kruskal–Wallis one-way analysis of variance can be employed [160]. Moreover, the analysis of metabolomic data requires correction for multiple testing. Since many metabolic features are examined at once in most investigations, there is a high likelihood of obtaining statistically significant results by chance. To address this problem, a variety of correction techniques are available, each of which strikes a balance between avoiding false-positive metabolite associations and preventing the elimination of real ones (false negatives) [161,162]. One commonly used multiple-test correction method is based on controlling the false discovery rate (FDR) [163], i.e., limiting the expected proportion of false positives among all positive results; it has been extensively reviewed by Broadhurst et al. and others [159,161,162].
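The FDR-based correction mentioned above can be illustrated with a short sketch. The Benjamini–Hochberg step-up procedure is chosen here as an assumption, since it is one widely used way of controlling the FDR. The function name, the p-values and the 0.05 cut-off are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values never decrease with increasing rank.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy p-values from five per-metabolite tests (invented numbers).
p_values = [0.001, 0.008, 0.039, 0.041, 0.60]
q_values = benjamini_hochberg(p_values)
significant = [q <= 0.05 for q in q_values]
```

In this toy case, four of the five raw p-values are below 0.05, but only the two smallest remain below that level after adjustment.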

Multivariate methods consider all features simultaneously, enabling the identification of relationship patterns between them, and can be divided into two groups: supervised and unsupervised methods [159]. Unsupervised methods focus on summarizing complex metabolomic data and identifying data patterns correlated with experimental or biological variables [161]. PCA is the most commonly used unsupervised method in metabolomics. It transforms metabolic features into linearly uncorrelated variables known as principal components, the first few of which capture most of the variability in the dataset. Other unsupervised methods like Hierarchical Clustering Analysis (HCA) and Self-Organizing Maps (SOMs) are used to visualize metabolic phenotypes and feature patterns. Supervised methods are usually applied once differences between groups have been indicated by unsupervised methods. Unlike the unsupervised approach, they use information about the structure of the experiment (i.e., which experimental variants the biological samples belong to) and thereby reduce the impact of additional potential sources of variability when identifying patterns of metabolites associated with the studied phenotypic manifestations. Partial least squares (PLS) is a widely used supervised method that can be employed for regression analysis or binary classification (PLS-DA) [164]. Unlike PCA, PLS components focus on the covariance between the variable of interest and the metabolomic data, making it useful for building classifiers based on metabolomic features. Orthogonal PLS (O-PLS) was developed to address the problem of unrelated metabolic features influencing results [164].
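To make the PCA step more concrete, the sketch below extracts the first principal component of a small autoscaled matrix. It uses power iteration on the covariance matrix as a dependency-free stand-in for the eigendecomposition or SVD routines of dedicated packages, which is an assumption made for brevity. The function names, the starting vector and the toy data (three control and three treated samples, three features) are hypothetical.

```python
from statistics import mean, stdev


def autoscale_columns(rows):
    """Center each feature (column) to zero mean and scale it to unit variance."""
    scaled_cols = []
    for col in zip(*rows):
        m, sd = mean(col), stdev(col)
        scaled_cols.append([(x - m) / sd for x in col])
    return [list(r) for r in zip(*scaled_cols)]


def first_principal_component(rows, n_iter=200):
    """Loadings, scores and explained variance fraction of PC1 (power iteration)."""
    n, p = len(rows), len(rows[0])
    # Covariance matrix of the already centered data.
    cov = [[sum(r[i] * r[j] for r in rows) / (n - 1) for j in range(p)]
           for i in range(p)]
    v = [1.0 / (j + 1) for j in range(p)]  # arbitrary non-zero starting vector
    for _ in range(n_iter):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    eigenvalue = sum(v[i] * cov[i][j] * v[j] for i in range(p) for j in range(p))
    explained = eigenvalue / sum(cov[i][i] for i in range(p))
    scores = [sum(r[j] * v[j] for j in range(p)) for r in rows]
    return v, scores, explained


# Rows are samples (3 control, then 3 treated); columns are features.
data = [
    [10.0, 20.0, 30.0], [11.0, 21.0, 29.0], [9.0, 19.0, 31.0],
    [20.0, 30.0, 15.0], [21.0, 31.0, 14.0], [19.0, 29.0, 16.0],
]

loadings, scores, explained = first_principal_component(autoscale_columns(data))
control_scores, treated_scores = scores[:3], scores[3:]
```

The sign of a principal component is arbitrary, so only the separation of the two groups along PC1 is meaningful, not which group has positive scores.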

The analysis of metabolite networks is the last step in GC-MS-based plant metabolomics studies, providing valuable insights into the dynamics and functional significance of the metabolome within the biological system under investigation [88,128,165]. This step involves the exploration and interpretation of interconnected relationships between metabolites using various computational and statistical approaches that draw on information about plant metabolic networks stored in databases. Techniques like correlation networks and metabolic pathway analysis are commonly employed at this step, and advanced methods like graph theory allow the importance and connectivity of individual metabolites to be quantified [166,167]. To assist with the data analysis process, platforms such as MetaboAnalyst [168] and Visual Analysis and Exploration of Networks containing Experimental Data (VANTED) [169] offer statistical and pathway analysis tools specifically designed for metabolomic data. These platforms enable normalization, clustering, and modeling using various algorithms. Additionally, there are other resources available for metabolomics data analysis, as indicated in Table 2. The application of machine learning has become increasingly relevant in interpreting metabolomics data, contributing to unraveling complex relationships, and it provides opportunities for targeted metabolic engineering, biomarker discovery, and crop improvement [170,171].
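A correlation network of the kind mentioned above can be sketched in a few lines. This is a minimal illustration, assuming Pearson correlation between metabolite profiles, an absolute correlation threshold of 0.8 for drawing an edge, and node degree as the simplest graph-theoretical measure of connectivity. The metabolite profiles, the threshold and the function names are hypothetical.

```python
from itertools import combinations
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equally long profiles."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def correlation_network(profiles, threshold=0.8):
    """Edges (a, b, r) for every metabolite pair with |r| at or above the threshold."""
    edges = []
    for a, b in combinations(sorted(profiles), 2):
        r = pearson(profiles[a], profiles[b])
        if abs(r) >= threshold:
            edges.append((a, b, r))
    return edges


def node_degree(edges):
    """Number of edges attached to each metabolite (degree centrality, unnormalized)."""
    degree = {}
    for a, b, _ in edges:
        degree[a] = degree.get(a, 0) + 1
        degree[b] = degree.get(b, 0) + 1
    return degree


# Toy relative abundances of four metabolites in five samples (invented numbers).
profiles = {
    "citrate": [1.0, 2.0, 3.0, 4.0, 5.0],
    "malate": [2.1, 3.9, 6.2, 8.0, 9.9],
    "proline": [5.0, 4.1, 2.9, 2.0, 1.1],
    "sucrose": [3.0, 1.0, 4.0, 1.0, 5.0],
}

edges = correlation_network(profiles)
degree = node_degree(edges)
```

Metabolites without any edge above the threshold (here, the weakly correlated toy sucrose profile) do not appear in `degree`. In practice, the correlation coefficients would also be tested for significance and corrected for multiple testing.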

Table 2. A few databases and tools for GC-MS data analysis in metabolomics studies.

Name of the Tool/Database	Brief Description	Website URL (Accessed on 1 September 2025)	Ref.
ADAP (Automatic Data Analysis Pipeline)	A tool that provides a range of advanced algorithms and statistical methods for data analysis.	http://www.du-lab.org/software.html	[172]
AMDIS (Automated Mass Spectral Deconvolution and Identification System)	A tool used for spectral deconvolution and metabolite identification from GC-MS data.	https://chemdata.nist.gov/dokuwiki/doku.php?id=chemdata:amdis	[173]
MeltDB 2.0	A comprehensive online platform for metabolomics data management and analysis. It offers a range of tools to facilitate the storage, processing, and interpretation of metabolomics data.	https://meltdb.cebitec.uni-bielefeld.de/cgi-bin/login.cgi	[174]
MetaboAnalyst	A comprehensive web-based tool for metabolomic data analysis and interpretation, offering various modules for statistical analysis, data preprocessing, pathway analysis, and visualization.	http://www.metaboanalyst.ca/	[168]
MetaboLights	A web-based repository and analysis platform for metabolomics data which serves as a comprehensive resource for researchers to store, share, and analyze metabolomics datasets.	https://www.ebi.ac.uk/metabolights/	[175]
MetaboliteDetector	The software offers a comprehensive and automated data analysis pipeline, starting from raw GC-MS data and culminating in principal component analysis.	https://md.tu-bs.de/	[176]
MetAlign	A tool offering a range of features and algorithms to facilitate accurate peak detection, alignment, and normalization of metabolite signals across multiple samples.	https://zenodo.org/record/7273832	[177]
metaX	A comprehensive tool for processing and post-processing mass spectrometry data.	https://metax.genomics.cn/	[178]
MS-DIAL	A comprehensive software tool developed for the analysis of mass spectrometry data, particularly in metabolomics studies.	http://prime.psc.riken.jp/compms/msdial/main.html	[139]
MET-IDEA (Metabolomic Identification by Database Enrichment Analysis)	A bioinformatics tool designed for the identification and annotation of metabolites in GC-MS-based metabolomics studies.	http://www.msea.ca	[179]
MZmine	A software tool for processing, visualization, and analysis of MS-based metabolomics data.	https://mzmine.github.io/	[167]
OpenChrom	An open-source software platform designed for GC-MS data analysis, providing a range of powerful features for processing and visualizing chromatographic data.	https://www.openchrom.net/	[138]
SIMAT	A comprehensive tool for analysis of GC-MS data acquired in SIM mode.	https://omics.georgetown.edu/tools#h.72n4e5rnc2eg	[180]
PyMassSpec	A Python (PyMass 2.5.0) library designed for mass spectrometry data analysis. It provides a comprehensive set of tools and functions for processing, analyzing, and visualizing mass spectrometry data.	https://pymassspec.readthedocs.io/en/master/	[181]
TagFinder	A specifically designed tool for the identification and quantification of volatile and semivolatile compounds in complex biological samples.	https://www.mpimp-golm.mpg.de/19405/Corrector_package_V1_91.zip	[182]
XCMS	A software package for processing and visualizing MS-based metabolomic data. It offers various algorithms and methods for peak picking, retention time alignment, background subtraction, and normalization.	https://xcmsonline.scripps.edu/	[135]
Commercially available tools
AnalyzerPro	A comprehensive software tool designed for the analysis of GC- and LC-MS data in metabolomics research, offering a range of advanced features to process, visualize, and interpret metabolomic data.	https://spectralworks.com/software/analyzerpro/
ChromaTOF	Software specifically designed to process and analyze raw GC-MS data, providing a range of features for data processing, peak detection, compound identification, and data visualization.	https://www.leco.com/product/chromatof-software
QuanLynx	The software provides advanced data processing capabilities, including peak picking, alignment, quantification, and statistical analysis.	https://www.waters.com/waters/library.htm?locale=en_US&lid=1545661
Progenesis QI	Software for peak detection, alignment, and normalization used to accurately quantify metabolites in complex samples that provides a range of statistical tools for differential analysis, multivariate analysis, and pathway analysis.	https://www.waters.com/waters/en_US/Progenesis-QI-Software/nav.htm?cid=134790655
Xcalibur	The software provides a comprehensive set of tools for data acquisition, instrument control, data processing, and analysis.	https://www.thermofisher.com/order/catalog/product/OPTON-30965
MassLynx	Offers a user-friendly interface with a range of features for data acquisition, instrument control, data processing, and analysis.	https://www.waters.com/waters/en_US/MassLynx-MS-Software/nav.htm?locale=en_US&cid=513662
Databases and pathway-related tools
BioCyc	A collection of curated databases that provides comprehensive information on the genomes and metabolic pathways of various organisms.	http://biocyc.org/	[183]
HMDB (Human Metabolome Database)	A comprehensive metabolomics and biochemistry database, enabling searches for metabolites, pathways, chemical structures and biological functions.	https://hmdb.ca/	[148]
KEGG (Kyoto Encyclopedia of Genes and Genomes)	A bioinformatics resource that integrates genomic, chemical, and systemic functional information. It provides a comprehensive collection of biological pathways, genomic information, and functional annotations for various organisms.	http://www.genome.jp/kegg/	[184]
MapMan	A tool designed for the visualization and analysis of omics data, with a particular focus on plant systems, providing a comprehensive mapping and annotation platform that allows interpretation and exploration of high-throughput data in the context of metabolic pathways and cellular processes.	http://mapman.gabipd.org/	[185]
MetaCrop	A comprehensive database focusing on the metabolism and pathways of crop plants.	http://metacrop.ipk-gatersleben.de	[186]
MetaMapp	A comprehensive bioinformatics tool designed for the integrated analysis of metabolomic and transcriptomic data.	http://metamapp.fiehnlab.ucdavis.edu/	[187]
MetNetDB	A comprehensive metabolomics database and analysis platform that integrates metabolite data with biochemical pathways and regulatory networks.	https://metnetweb.gdcb.iastate.edu/	[188]
MetiTree	Prototype repository of mass spectra of small chemical compounds for life sciences (<2000 Da).	http://www.metitree.nl/	[189]
MetScape	A bioinformatics tool designed for the visualization and interpretation of metabolomics and transcriptomics data in the context of metabolic pathways.	http://metscape.ncibi.org/	[190]
PathVisio	An open-source pathway visualization tool for the integrated analysis of omics data.	http://www.pathvisio.org/	[191]
PathWhiz	Web-based tool for visualizing and exploring biological pathways.	https://smpdb.ca/pathwhiz	[192]
PMN (Plant Metabolic Network)	Comprehensive resource for plant metabolomics research, including a curated collection of plant metabolic pathways and functional annotations.	http://www.plantcyc.org/databases	[193]
TAIR (The Arabidopsis Information Resource)	A comprehensive online database dedicated to the model plant species Arabidopsis thaliana.	https://www.arabidopsis.org/index.jsp	[194]
GMD (The Golm Metabolome Database)	A comprehensive resource that provides a repository of metabolite spectral profiles of a wide range of metabolite classes from various organisms.	http://gmd.mpimp-golm.mpg.de/Default.aspx	[150]
VANTED (Visual Analysis and Exploration of Networks containing Experimental Data)	A comprehensive software tool for the statistical analysis, visualization and analysis of biological networks, including metabolic pathways.	https://immersive-analytics.infotech.monash.edu/vanted/	[195]
RIKEN Plant Metabolome MetaDatabase (RIKEN PMM)	A comprehensive online resource focusing on plant metabolomics data, provides a curated collection of metabolite information, chemical structures, mass spectra, experimental data, metabolite profiles, metabolic pathways, and statistical analysis.	http://metabobank.riken.jp/pmm/db/plantMetabolomics	[196]

6. Concluding Remarks

Over two decades after the pioneering studies of Roessner and Fiehn [1,2], which bridged the gap between plant phenotypes and genotypes, GC-MS-based profiling of plant compounds has become the method of choice for investigating how plants respond to environmental factors and biotic interactions. Its exceptional separation capabilities, compatibility with complex plant metabolomes, and robustness due to stable derivatives have solidified its role in plant metabolomics. Furthermore, the electron ionization spectra of derivatives contribute to the success of this approach by providing confident identification and sensitive quantification of metabolites. Additionally, the emergence of more accurate analytical platforms, coupled with the power of bioinformatics tools and machine learning, has reduced the time required for data analysis and interpretation, deepening our insights into metabolic networks. The lack of a universal approach to the study of primary metabolites (due to their great diversity) leaves room for experimentation at the stages of extraction and separation of molecules. However, quality assessments must be considered at the experimental design stage, notably the validation of the matrix effect on yields in such groups of compounds as saccharides, sugar phosphates, and organic and amino acids, and of the impact of derivatization on artifact occurrence in the analysis of primary plant metabolites. As we move forward, continuous advancements in sample preparation, analytical techniques, and data processing will further enhance the potential of GC-MS in deciphering the intricate metabolic networks of plants. This methodology will undoubtedly continue to be an invaluable tool for enriching our comprehension of plant metabolism and the specific roles of metabolites in various physiological processes.

Author Contributions

A.F. and V.P. conceptualized the manuscript. V.P., N.F., S.S., A.M. and A.O. contributed to writing the first draft. A.F., N.F. and T.B. supervised the whole work and wrote the final draft of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the Russian Science Foundation (RSF), grants # 23-44-00101 (for Section 1, Section 2 and Section 3) and # 23-14-00383 (for Section 4, Section 5 and Section 6).

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

The infrastructural support from the Ministry of Science and Higher Education of the Russian Federation (theme # 122042700043-9) is acknowledged.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

(TMS)₃Si•	Tris(trimethylsilyl)silyl radical
BPCA	Bayesian PCA
BSA	N,O-bis(trimethylsilyl)acetamide
BSTFA	N,O-Bis(trimethylsilyl)trifluoroacetamide
DLLME	Dispersive liquid–liquid microextraction
DW	Dry weight
EI	Electron ionization
FAMEs	Fatty acid methyl esters
FDR	False discovery rate
FQM	Feature quantification matrix
FTMS	Fourier transform mass spectrometer
FW	Fresh weight
GC×GC	Comprehensive two-dimensional gas chromatography
GC-EI-MS	Gas chromatography–electron ionization mass spectrometer
GC-EI-Q-MS	Gas chromatography–electron ionization–quadrupole mass spectrometer
GC-MS	Gas chromatography–mass spectrometry
HCA	Hierarchical Clustering Analysis
IS	Internal standard
kNN	k-nearest neighbors
LDR	Linear dynamic range
LLE	Liquid–liquid extraction
LLME	Liquid–liquid microextraction
LOD	Limit of detection
LOQ	Limit of quantification
MAE	Microwave-assisted extraction
MBTFA	N-methyl-bis(trifluoroacetamide)
MeOX	O-methylhydroxylamine hydrochloride
MEPS	Microextraction by packed sorbent
MN	Median normalization
MRM	Multiple reaction monitoring
MS/MS	Tandem mass spectrometry
MSⁿ	Multistage mass spectrometry
MSPD	Matrix solid-phase dispersion
MSPE	Magnetic SPE
MSTFA	N-Methyl-N-(trimethylsilyl)trifluoroacetamide
MTBSTFA	N-tert-butyldimethylsilyl-N-methyltrifluoroacetamide
O-PLS	Orthogonal PLS
PCA	Principal component analysis
PLE	Pressurized liquid extraction
PLS	Partial least squares
PLS-DA	Partial least squares discriminant analysis
PPCA	Probabilistic PCA
PQN	Probabilistic quotient normalization
PT-SPE	Pipette-tip SPE
QCs	Quality control samples
QqQ-MS	Triple quadrupole–mass spectrometer
QRILC	Quantile regression imputation of left-censored data
Q-TOF-MS	Quadrupole–time-of-flight mass spectrometer
RF	Random forest
RIs	Retention indices
S/N	Signal-to-noise ratio
SBSE	Stir-bar sorptive extraction
SDME	Single-drop microextraction
SFE	Supercritical fluid extraction
SHWE	Super-heated water extraction
SIM	Selected ion monitoring
SLE	Solid–liquid extraction
SOMs	Self-Organizing Maps
SPE	Solid-phase extraction
SPME	Solid-phase microextraction
SRM	Selected reaction monitoring
SVD	Singular value decomposition
TBDMS	Tert-butyl(dimethyl)silyl
TIC	Total ion chromatogram
TMSA	N-trimethylsilylacetamide
TMSI	N-trimethylsilylimidazole
TOFMS	Time-of-flight mass spectrometer
TSN	Total sum normalization
UAE	Ultrasound-assisted extraction
XICs	Extracted ion chromatograms

References

Roessner, U.; Wagner, C.; Kopka, J.; Trethewey, R.N.; Willmitzer, L. Simultaneous analysis of metabolites in potato tuber by gas chromatography–mass spectrometry. Plant J. 2000, 23, 131–142.
Fiehn, O.; Kopka, J.; Dörmann, P.; Altmann, T.; Trethewey, R.N.; Willmitzer, L. Metabolite profiling for plant functional genomics. Nat. Biotechnol. 2000, 18, 1157–1161.
Fiehn, O. Metabolomics—The link between genotypes and phenotypes. Plant Mol. Biol. 2002, 48, 155–171.
Wei, W.; Li, S.; Wang, Y.; Wang, B.; Fan, G.; Zeng, Q.; Zhao, F.; Xu, C.; Zhang, X.; Tang, T.; et al. Metabolome-Based Genome-Wide Association Study Provides Genetic Insights into the Natural Variation of Foxtail Millet. Front. Plant Sci. 2021, 12, 665530.
Osmolovskaya, N.; Bilova, T.; Gurina, A.; Orlova, A.; Vu, V.D.; Sukhikh, S.; Zhilkina, T.; Frolova, N.; Tarakhovskaya, E.; Kamionskaya, A.; et al. Metabolic Responses of Amaranthus caudatus Roots and Leaves to Zinc Stress. Plants 2025, 14, 2119.
Rathod, V.; Rathod, K.; Tomar, R.S.; Tatamiya, R.; Hamid, R.; Jacob, F.; Munshi, N.S. Metabolic profiles of peanut (Arachis hypogaea L.) in response to Puccinia arachidis fungal infection. BMC Genom. 2023, 24, 630.
Gallon, M.E.; Silva-Junior, E.A.; Gobbo-Neto, L. GC-MS-based Metabolomics Unravels Metabolites across Larval Development and Diapause of a Specialist Insect. Chem. Biodivers. 2024, 21, e202301779.
Nokhsorov, V.V.; Protopopov, F.F.; Sleptsov, I.V.; Petrova, L.V.; Petrov, K.A. Metabolomic Profile and Functional State of Oat Plants (Avena sativa L.) Sown under Low-Temperature Conditions in the Cryolithozone. Plants 2024, 13, 1076.
Ranner, J.L.; Schalk, S.; Martyniak, C.; Parniske, M.; Gutjahr, C.; Stark, T.D.; Dawid, C. Primary and Secondary Metabolites in Lotus japonicus. J. Agric. Food Chem. 2023, 71, 11277–11303.
Mhlongo, M.I.; Piater, L.A.; Dubery, I.A. Profiling of Volatile Organic Compounds from Four Plant Growth-Promoting Rhizobacteria by SPME-GC-MS: A Metabolomics Study. Metabolites 2022, 12, 763.
Komati, A.; Anand, A.; Shaik, H.; Mudiam, M.K.R.; Suresh Babu, K.; Tiwari, A.K. Bombax ceiba (Linn.) calyxes ameliorate methylglyoxal-induced oxidative stress via modulation of RAGE expression: Identification of active phytometabolites by GC-MS analysis. Food Funct. 2020, 11, 5486–5497.
Xue, T.; Liu, S.; Liu, J.; Yuan, Y. Metabolomics based on GC-MS revealed hub metabolites of pecan seeds germinating at different temperatures. BMC Plant Biol. 2023, 23, 192.
Kaneria, M.; Rakholiya, K.; Bavaliya, K.R.; Pandya, M.H.; Sipai, T.N.; Vadher, S.A.; Patel, M.; Yadav, V.K.; Solanki, R.; Patel, S.; et al. Untargeted metabolomics-based identification of bioactive compounds from Mangifera indica L. seed extracts in drug discovery through molecular docking and assessment of their anticancer potential. J. Sci. Food Agric. 2024, 104, 5907–5920.
Umoh, S.D.; Bojase, G.; Masesane, I.B.; Majinda, R.T.; Sichilongo, K.F. Untargeted GC-MS metabolomics to identify and classify bioactive compounds in Combretum platypetalum subsp. oatesii (Rolfe) Exell (Combretaceae). Phytochem. Anal. 2023, 34, 127–138.
Mariano-Junior, D.R.; Alves, D.d.P.; Pereira, C.d.S.B.; Cavalcante, R.S.; Reichenbach, L.B.; Ribeiro, M.E.P.; Fontes, I.S.; Pinheiro, D.F.d.R.; Silva, M.E.; Pedro, L.B.; et al. Mapping and Chemical Diversity of Baccharis dracunculifolia De Candole (1836) Essential Oil Accessed in Rio de Janeiro, Brazil. Plants 2025, 14, 3443.
Zhao, Q.; Xi, J.; Xu, D.; Jin, Y.; Wu, F.; Tong, Q.; Yin, Y.; Xu, X. A comparative HS-SPME/GC-MS-based metabolomics approach for discriminating selected japonica rice varieties from different regions of China in raw and cooked form. Food Chem. 2022, 385, 132701.
Deng, P.; Yin, R.; Wang, H.; Chen, L.; Cao, X.; Xu, X. Comparative analyses of functional traits based on metabolome and economic traits variation of Bletilla striata: Contribution of intercropping. Front. Plant Sci. 2023, 14, 1147076.
Wang, Z.; Ahmad, W.; Zhu, A.; Geng, W.; Kang, W.; Ouyang, Q.; Chen, Q. Identification of volatile compounds and metabolic pathway during ultrasound-assisted kombucha fermentation by HS-SPME-GC/MS combined with metabolomic analysis. Ultrason. Sonochem. 2023, 94, 106339.
Gohlke, R.S.; McLafferty, F.W. Early gas chromatography/mass spectrometry. J. Am. Soc. Mass Spectrom. 1993, 4, 367–371.
Bartle, K.D.; Myers, P. History of gas chromatography. TrAC Trends Anal. Chem. 2002, 21, 547–557.
Paiva, A.C.; de Oliveira, A.M.; Crucello, J.; Facanali, R.; Hantao, L.W. Practical Considerations in Method Development for Gas Chromatography-Based Metabolomic Profiling. Adv. Exp. Med. Biol. 2021, 1336, 139–157.
Bahaghighat, H.D.; Freye, C.E.; Synovec, R.E. Recent advances in modulator technology for comprehensive two dimensional gas chromatography. TrAC Trends Anal. Chem. 2019, 113, 379–391.
Stettin, D.; Poulin, R.X.; Pohnert, G. Metabolomics Benefits from Orbitrap GC-MS-Comparison of Low- and High-Resolution GC-MS. Metabolites 2020, 10, 143.
Ghatak, A.; Chaturvedi, P.; Weckwerth, W. Metabolomics in Plant Stress Physiology. Adv. Biochem. Eng. Biotechnol. 2018, 164, 187–236.
Shulaev, V.; Cortes, D.; Miller, G.; Mittler, R. Metabolomics for plant stress response. Physiol. Plant. 2008, 132, 199–208.
Caldana, C.; Degenkolbe, T.; Cuadros-Inostroza, A.; Klie, S.; Sulpice, R.; Leisse, A.; Steinhauser, D.; Fernie, A.R.; Willmitzer, L.; Hannah, M.A. High-density kinetic analysis of the metabolomic and transcriptomic response of Arabidopsis to eight environmental conditions. Plant J. 2011, 67, 869–884.
Shim, M.; Jeong, Y.; Lee, D.-K. Identification and Profiling of Primary Metabolites Through GC-MS and Associated Data Processing. Methods Mol. Biol. 2025, 2895, 99–109.
Olmedo, P.; Zepeda, B.; Delgado-Rioseco, J.; Leiva, C.; Moreno, A.A.; Sagredo, K.; Blanco-Herrera, F.; Pedreschi, R.; Infante, R.; Meneses, C.; et al. Metabolite Profiling Reveals the Effect of Cold Storage on Primary Metabolism in Nectarine Varieties with Contrasting Mealiness. Plants 2023, 12, 766.
Abadie, C.; Lalande, J.; Tcherkez, G. Exact mass GC-MS analysis: Protocol, database, advantages and application to plant metabolic profiling. Plant Cell Environ. 2022, 45, 3171–3183.
Naz, R.; Roberts, T.H.; Bano, A.; Nosheen, A.; Yasmin, H.; Hassan, M.N.; Keyani, R.; Ullah, S.; Khan, W.; Anwar, Z. GC-MS analysis, antimicrobial, antioxidant, antilipoxygenase and cytotoxic activities of Jacaranda mimosifolia methanol leaf extracts and fractions. PLoS ONE 2020, 15, e0236319.
Abdul, U.; Manikandan, D.B.; Arumugam, M.; Alomar, S.Y.; Manoharadas, S.; Ramasamy, T. GC-MS based metabolomic profiling of Aporosa cardiosperma (Gaertn.) Merr. leaf extracts and evaluating its therapeutic potential. Sci. Rep. 2024, 14, 16010.
Wang, L.; Liang, J.; Xie, X.; Liu, J.; Shen, Q.; Li, L.; Wang, Q. Direct formation of the sesquiterpeonid ether liguloxide by a terpene synthase in Senecio scandens. Plant Mol. Biol. 2021, 105, 55–64.
Deng, H.; He, R.; Huang, R.; Pang, C.; Ma, Y.; Xia, H.; Liang, D.; Liao, L.; Xiong, B.; Wang, X.; et al. Optimization of a static headspace GC-MS method and its application in metabolic fingerprinting of the leaf volatiles of 42 citrus cultivars. Front. Plant Sci. 2022, 13, 1050289.
Xu, Y.; Yang, M.; Yang, T.; Yang, W.; Wang, Y.; Zhang, J. Untargeted GC-MS and FT-NIR study of the effect of 14 processing methods on the volatile components of Polygonatum kingianum. Front. Plant Sci. 2023, 14, 1140691.
Blau, K.; Halket, J.M. Handbook of Derivatives for Chromatography, 2nd ed.; Wiley: New York, NY, USA, 1993; ISBN 978-0-471-92699-3.
Bekele, E.A.; Annaratone, C.E.P.; Hertog, M.L.A.T.M.; Nicolai, B.M.; Geeraerd, A.H. Multi-response optimization of the extraction and derivatization protocol of selected polar metabolites from apple fruit tissue for GC-MS analysis. Anal. Chim. Acta 2014, 824, 42–56.
Domergue, J.-B.; Lalande, J.; Abadie, C.; Tcherkez, G. Compound-Specific 14N/15N Analysis of Amino Acid Trimethylsilylated Derivatives from Plant Seed Proteins. Int. J. Mol. Sci. 2022, 23, 4893.
Acharjee, A.; Kloosterman, B.; Visser, R.G.F.; Maliepaard, C. Integration of multi-omics data for prediction of phenotypic traits using random forest. BMC Bioinform. 2016, 17, 180.
Yun, Z.; Li, T.; Gao, H.; Zhu, H.; Gupta, V.K.; Jiang, Y.; Duan, X. Integrated Transcriptomic, Proteomic, and Metabolomics Analysis Reveals Peel Ripening of Harvested Banana under Natural Condition. Biomolecules 2019, 9, 167.
Wang, J.Y.; Alseekh, S.; Xiao, T.; Ablazov, A.; Perez de Souza, L.; Fiorilli, V.; Anggarani, M.; Lin, P.-Y.; Votta, C.; Novero, M.; et al. Multi-omics approaches explain the growth-promoting effect of the apocarotenoid growth regulator zaxinone in rice. Commun. Biol. 2021, 4, 1222.
Strenkert, D.; Schmollinger, S.; Gallaher, S.D.; Salomé, P.A.; Purvine, S.O.; Nicora, C.D.; Mettler-Altmann, T.; Soubeyrand, E.; Weber, A.P.M.; Lipton, M.S.; et al. Multiomics resolution of molecular events during a day in the life of Chlamydomonas. Proc. Natl. Acad. Sci. USA 2019, 116, 2374–2383.
Szablińska-Piernik, J.; Lahuta, L.B. Polar Metabolites Profiling of Wheat Shoots (Triticum aestivum L.) under Repeated Short-Term Soil Drought and Rewatering. Int. J. Mol. Sci. 2023, 24, 8429.
de Falco, B.; Grauso, L.; Fiore, A.; Bonanomi, G.; Lanzotti, V. Metabolomics and chemometrics of seven aromatic plants: Carob, eucalyptus, laurel, mint, myrtle, rosemary and strawberry tree. Phytochem. Anal. 2022, 33, 696–709.
Poole, C.F. Alkylsilyl derivatives for gas chromatography. J. Chromatogr. A 2013, 1296, 2–14.
Fiehn, O. Metabolomics by Gas Chromatography-Mass Spectrometry: Combined Targeted and Untargeted Profiling. Curr. Protoc. Mol. Biol. 2016, 114, 30.4.1–30.4.32.
Dagar, R.; Gautam, A.; Priscilla, K.; Sharma, V.; Gupta, P.; Kumar, R. Sample Preparation from Plant Tissue for Gas Chromatography-Mass Spectrometry (GC-MS)we. Methods Mol. Biol. 2024, 2788, 19–37.
Frolova, N.; Orlova, A.; Popova, V.; Bilova, T.; Frolov, A. Gas Chromatography–Mass Spectrometry (GC-MS) in the Plant Metabolomics Toolbox: Sample Preparation and Instrumental Analysis. Biomolecules 2026, 16, 16.
Parkinson, D.-R.; Warren, J.M.; Pawliszyn, J. Analysis of ergosterol for the detection of mold in soils by automated on-fiber derivatization headspace extraction–SPME-GC/MS. Anal. Chim. Acta 2010, 661, 181–187.
González, A.; Clavijo, S.; Cerdà, V. Estrogens determination exploiting a SIA-LOV system prior in-port derivatization-large volume injection-programmable temperature vaporization-gas chromatography. Talanta 2019, 194, 852–858.
Samokhin, A. Spectral skewing in gas chromatography–mass spectrometry: Misconceptions and realities. J. Chromatogr. A 2018, 1576, 113–119.
Rocha, S.M.; Costa, C.P.; Martins, C. Aroma Clouds of Foods: A Step Forward to Unveil Food Aroma Complexity Using GC×GC. Front. Chem. 2022, 10, 820749.
Mondello, L. (Ed.) Comprehensive Chromatography in Combination with Mass Spectrometry, 1st ed.; Wiley: Hoboken, NJ, USA, 2011; ISBN 978-0-470-43407-9.
Watson, J.T.; Sparkman, O.D. Introduction to Mass Spectrometry: Instrumentation, Applications and Strategies for Data Interpretation, 1st ed.; Wiley: Hoboken, NJ, USA, 2007; ISBN 978-0-470-51634-8.
Misra, B.B.; Assmann, S.M.; Chen, S. Plant single-cell and single-cell-type metabolomics. Trends Plant Sci. 2014, 19, 637–646.
He, L.; Hu, Q.; Zhang, J.; Xing, R.; Zhao, Y.; Yu, N.; Chen, Y. An integrated untargeted metabolomic approach reveals the quality characteristics of black soybeans from different geographical origins in China. Food Res. Int. 2023, 169, 112908.
Zoccali, M.; Cappello, S.; Mondello, L. Multilevel characterization of marine microbial biodegradation potentiality by means of flow-modulated comprehensive two-dimensional gas chromatography combined with a triple quadrupole mass spectrometer. J. Chromatogr. A 2018, 1547, 99–106.
Alseekh, S.; Aharoni, A.; Brotman, Y.; Contrepois, K.; D’Auria, J.; Ewald, J.; Ewald, J.C.; Fraser, P.D.; Giavalisco, P.; Hall, R.D.; et al. Mass spectrometry-based metabolomics: A guide for annotation, quantification and best reporting practices. Nat. Methods 2021, 18, 747–756.
Fernie, A.R.; Aharoni, A.; Willmitzer, L.; Stitt, M.; Tohge, T.; Kopka, J.; Carroll, A.J.; Saito, K.; Fraser, P.D.; DeLuca, V. Recommendations for reporting metabolite data. Plant Cell 2011, 23, 2477–2482.
Rodrigues, A.M.; Ribeiro-Barros, A.I.; António, C. Experimental Design and Sample Preparation in Forest Tree Metabolomics. Metabolites 2019, 9, 285.
Balcke, G.U.; Handrick, V.; Bergau, N.; Fichtner, M.; Henning, A.; Stellmach, H.; Tissier, A.; Hause, B.; Frolov, A. An UPLC-MS/MS method for highly sensitive high-throughput analysis of phytohormones in plant tissues. Plant Methods 2012, 8, 47.
Salem, M.A.; Yoshida, T.; Perez De Souza, L.; Alseekh, S.; Bajdzienko, K.; Fernie, A.R.; Giavalisco, P. An improved extraction method enables the comprehensive analysis of lipids, proteins, metabolites and phytohormones from a single sample of leaf tissue under water-deficit stress. Plant J. 2020, 103, 1614–1632.
Chen, Y.; Wang, Y.; Liang, X.; Zhang, Y.; Fernie, A.R. Mass spectrometric exploration of phytohormone profiles and signaling networks. Trends Plant Sci. 2023, 28, 399–414.
Romanik, G.; Gilgenast, E.; Przyjazny, A.; Kamiński, M. Techniques of preparing plant material for chromatographic separation and analysis. J. Biochem. Biophys. Methods 2007, 70, 253–261. [Google Scholar] [CrossRef]
Markert, B. Sample preparation (cleaning, drying, homogenization) for trace element analysis in plant matrices. Sci. Total Environ. 1995, 176, 45–61. [Google Scholar] [CrossRef]
Choudhury, F.K.; Pandey, P.; Meitei, R.; Cardona, D.; Gujar, A.C.; Shulaev, V. GC-MS/MS Profiling of Plant Metabolites. Methods Mol. Biol. 2022, 2396, 101–115. [Google Scholar] [CrossRef]
Balcke, G.U.; Bennewitz, S.; Bergau, N.; Athmer, B.; Henning, A.; Majovsky, P.; Jiménez-Gómez, J.M.; Hoehenwarter, W.; Tissier, A. Multi-Omics of Tomato Glandular Trichomes Reveals Distinct Features of Central Carbon Metabolism Supporting High Productivity of Specialized Metabolites. Plant Cell 2017, 29, 960–983. [Google Scholar] [CrossRef]
Cui, Z.; Li, M.; Han, X.; Liu, H.; Li, C.; Peng, H.; Liu, D.; Huang, X.; Zhang, Z. Morphogenesis, ultrastructure, and chemical profiling of trichomes in Artemisia argyi H. Lév. & Vaniot (Asteraceae). Planta 2022, 255, 102. [Google Scholar] [CrossRef] [PubMed]
Caputo, L.; Amato, G.; de Bartolomeis, P.; De Martino, L.; Manna, F.; Nazzaro, F.; De Feo, V.; Barba, A.A. Impact of drying methods on the yield and chemistry of Origanum vulgare L. essential oil. Sci. Rep. 2022, 12, 3845. [Google Scholar] [CrossRef] [PubMed]
Krause, S.T.; Frey, M. Extraction of Essential Oils and Terpene Volatiles from Plants and Identification by GC-MS-Based Techniques. Methods Mol. Biol. 2025, 2895, 73–82. [Google Scholar] [CrossRef] [PubMed]
Kumari, B.; Tiwari, B.K.; Hossain, M.B.; Rai, D.K.; Brunton, N.P. Ultrasound-assisted extraction of polyphenols from potato peels: Profiling and kinetic modelling. Int. J. Food Sci. Technol. 2017, 52, 1432–1439. [Google Scholar] [CrossRef]
Villa, C.; Robustelli Della Cuna, F.S.; Russo, E.; Ibrahim, M.F.; Grignani, E.; Preda, S. Microwave-Assisted and Conventional Extractions of Volatile Compounds from Rosa x damascena Mill. Fresh Petals for Cosmetic Applications. Molecules 2022, 27, 3963. [Google Scholar] [CrossRef]
Sharma, S.; Kumar, M.; Sircar, D.; Prasad, R. Metabolic profiling and biomarkers identification in cluster bean under drought stress using GC-MS technique. Metabolomics 2024, 20, 80. [Google Scholar] [CrossRef]
Purdy, S.J.; Fuentes, D.; Ramamoorthy, P.; Nunn, C.; Kaiser, B.N.; Merchant, A. The Metabolic Profile of Young, Watered Chickpea Plants Can Be Used as a Biomarker to Predict Seed Number under Terminal Drought. Plants 2023, 12, 2172. [Google Scholar] [CrossRef]
Nisar, R.; Ahmad, S.; Khan, K.-U.-R.; Sherif, A.E.; Alasmari, F.; Almuqati, A.F.; Ovatlarnporn, C.; Khan, M.A.; Umair, M.; Rao, H.; et al. Metabolic Profiling by GC-MS, In Vitro Biological Potential, and In Silico Molecular Docking Studies of Verbena officinalis. Molecules 2022, 27, 6685. [Google Scholar] [CrossRef]
Gong, Z.-G.; Hu, J.; Wu, X.; Xu, Y.-J. The Recent Developments in Sample Preparation for Mass Spectrometry-Based Metabolomics. Crit. Rev. Anal. Chem. 2017, 47, 325–331. [Google Scholar] [CrossRef] [PubMed]
Nelson, N.; Perzov, N.; Cohen, A.; Hagai, K.; Padler, V.; Nelson, H. The cellular biology of proton-motive force generation by V-ATPases. J. Exp. Biol. 2000, 203, 89–95. [Google Scholar] [CrossRef]
Bevilacqua, C.; Ducos, B. Laser microdissection: A powerful tool for genomics at cell level. Mol. Asp. Med. 2018, 59, 5–27. [Google Scholar] [CrossRef]
Fang, J.; Schneider, B. Laser microdissection: A sample preparation technique for plant micrometabolic profiling. Phytochem. Anal. 2014, 25, 307–313. [Google Scholar] [CrossRef]
Datta, S.; Malhotra, L.; Dickerson, R.; Chaffee, S.; Sen, C.K.; Roy, S. Laser capture microdissection: Big data from small samples. Histol. Histopathol. 2015, 30, 1255–1269. [Google Scholar] [CrossRef]
Nelson, T.; Tausta, S.L.; Gandotra, N.; Liu, T. Laser microdissection of plant tissue: What you see is what you get. Annu. Rev. Plant Biol. 2006, 57, 181–201. [Google Scholar] [CrossRef] [PubMed]
Sumner, L.W. Metabolome Analysis: An Introduction. Silas G. Villas-Bôas, Ute Roessner, Michael A. E. Hansen, Jørn Smedsgaard, Jens Nielsen. Wiley-Interscience Series in Mass Spectrometry; Series Editors Dominic M. Desiderio and Nico M.M. Nibbering. J. Am. Soc. Mass Spectrom. 2007, 18, R1–R2. [Google Scholar] [CrossRef]
Lisec, J.; Schauer, N.; Kopka, J.; Willmitzer, L.; Fernie, A.R. Gas chromatography mass spectrometry–based metabolite profiling in plants. Nat. Protoc. 2006, 1, 387–396. [Google Scholar] [CrossRef]
Sánchez-Parra, B.; Frerigmann, H.; Pérez Alonso, M.-M.; Carrasco Loba, V.; Jost, R.; Hentrich, M.; Pollmann, S. Characterization of Four Bifunctional Plant IAM/PAM-Amidohydrolases Capable of Contributing to Auxin Biosynthesis. Plants 2014, 3, 324–347. [Google Scholar] [CrossRef] [PubMed]
Lehmann, T.; Janowitz, T.; Sánchez-Parra, B.; Alonso, M.-M.P.; Trompetter, I.; Piotrowski, M.; Pollmann, S. Arabidopsis NITRILASE 1 Contributes to the Regulation of Root Growth and Development through Modulation of Auxin Biosynthesis in Seedlings. Front. Plant Sci. 2017, 8, 36. [Google Scholar] [CrossRef]
Das, A.B.; Goud, V.V.; Das, C. Extraction of phenolic compounds and anthocyanin from black and purple rice bran (Oryza sativa L.) using ultrasound: A comparative analysis and phytochemical profiling. Ind. Crops Prod. 2017, 95, 332–341. [Google Scholar] [CrossRef]
Niehaus, T.D.; Nguyen, T.N.D.; Gidda, S.K.; ElBadawi-Sidhu, M.; Lambrecht, J.A.; McCarty, D.R.; Downs, D.M.; Cooper, A.J.L.; Fiehn, O.; Mullen, R.T.; et al. Arabidopsis and maize RidA proteins preempt reactive enamine/imine damage to branched-chain amino acid biosynthesis in plastids. Plant Cell 2014, 26, 3010–3022. [Google Scholar] [CrossRef]
Cajka, T.; Fiehn, O. Toward Merging Untargeted and Targeted Methods in Mass Spectrometry-Based Metabolomics and Lipidomics. Anal. Chem. 2016, 88, 524–545. [Google Scholar] [CrossRef]
Leonova, T.; Popova, V.; Tsarev, A.; Henning, C.; Antonova, K.; Rogovskaya, N.; Vikhnina, M.; Baldensperger, T.; Soboleva, A.; Dinastia, E.; et al. Does Protein Glycation Impact on the Drought-Related Changes in Metabolism and Nutritional Properties of Mature Pea (Pisum sativum L.) Seeds? Int. J. Mol. Sci. 2020, 21, 567. [Google Scholar] [CrossRef]
Chantseva, V.; Bilova, T.; Smolikova, G.; Frolov, A.; Medvedev, S. 3D-clinorotation induces specific alterations in metabolite profiles of germinating Brassica napus L. seeds. Biol. Commun. 2019, 64, 55–74. [Google Scholar] [CrossRef]
t’Kindt, R.; Morreel, K.; Deforce, D.; Boerjan, W.; Van Bocxlaer, J. Joint GC-MS and LC-MS platforms for comprehensive plant metabolomics: Repeatability and sample pre-treatment. J. Chromatogr. B Anal. Technol. Biomed. Life Sci. 2009, 877, 3572–3580. [Google Scholar] [CrossRef]
Bilova, T.; Lukasheva, E.; Brauch, D.; Greifenhagen, U.; Paudel, G.; Tarakhovskaya, E.; Frolova, N.; Mittasch, J.; Balcke, G.U.; Tissier, A.; et al. A Snapshot of the Plant Glycated Proteome. J. Biol. Chem. 2016, 291, 7621–7636. [Google Scholar] [CrossRef] [PubMed]
Beale, D.J.; Pinu, F.R.; Kouremenos, K.A.; Poojary, M.M.; Narayana, V.K.; Boughton, B.A.; Kanojia, K.; Dayalan, S.; Jones, O.A.H.; Dias, D.A. Review of recent developments in GC-MS approaches to metabolomics-based research. Metabolomics 2018, 14, 152. [Google Scholar] [CrossRef] [PubMed]
Smolikova, G.N.; Shavarda, A.L.; Alekseichuk, I.V.; Chantseva, V.V.; Medvedev, S.S. The metabolomic approach to the assessment of cultivar specificity of Brassica napus L. seeds. Russ. J. Genet. Appl. Res. 2016, 6, 78–83. [Google Scholar] [CrossRef]
Fiehn, O.; Wohlgemuth, G.; Scholz, M.; Kind, T.; Lee, D.Y.; Lu, Y.; Moon, S.; Nikolau, B. Quality control for plant metabolomics: Reporting MSI-compliant studies. Plant J. 2008, 53, 691–704. [Google Scholar] [CrossRef]
Fiehn, O.; Kopka, J.; Trethewey, R.N.; Willmitzer, L. Identification of uncommon plant metabolites based on calculation of elemental compositions using gas chromatography and quadrupole mass spectrometry. Anal. Chem. 2000, 72, 3573–3580. [Google Scholar] [CrossRef]
Gullberg, J.; Jonsson, P.; Nordström, A.; Sjöström, M.; Moritz, T. Design of experiments: An efficient strategy to identify factors influencing extraction and derivatization of Arabidopsis thaliana samples in metabolomic studies with gas chromatography/mass spectrometry. Anal. Biochem. 2004, 331, 283–295. [Google Scholar] [CrossRef]
Tarakhovskaya, E.; Marcillo, A.; Davis, C.; Milkovska-Stamenova, S.; Hutschenreuther, A.; Birkemeyer, C. Matrix Effects in GC-MS Profiling of Common Metabolites after Trimethylsilyl Derivatization. Molecules 2023, 28, 2653. [Google Scholar] [CrossRef]
Topolewska, A.; Czarnowska, K.; Haliński, Ł.P.; Stepnowski, P. Evaluation of four derivatization methods for the analysis of fatty acids from green leafy vegetables by gas chromatography. J. Chromatogr. B Anal. Technol. Biomed. Life Sci. 2015, 990, 150–157. [Google Scholar] [CrossRef]
Frolova, N.; Gorbach, D.; Ihling, C.; Bilova, T.; Orlova, A.; Lukasheva, E.; Fedoseeva, K.; Dodueva, I.; Lutova, L.A.; Frolov, A. Proteome and Metabolome Alterations in Radish (Raphanus sativus L.) Seedlings Induced by Inoculation with Agrobacterium tumefaciens. Biomolecules 2025, 15, 290. [Google Scholar] [CrossRef] [PubMed]
Shumilina, J.; Kiryushkin, A.S.; Frolova, N.; Mashkina, V.; Ilina, E.L.; Puchkova, V.A.; Danko, K.; Silinskaya, S.; Serebryakov, E.B.; Soboleva, A.; et al. Integrative Proteomics and Metabolomics Analysis Reveals the Role of Small Signaling Peptide Rapid Alkalinization Factor 34 (RALF34) in Cucumber Roots. Int. J. Mol. Sci. 2023, 24, 7654. [Google Scholar] [CrossRef] [PubMed]
Schoene, K.; Bruckert, H.-J.; Steinhanses, J.; König, A. Two stage derivatization with N-(tert.-butyldimethylsilyl)-N-methyl-trifluoroacetamide (MTBSTFA) and N-methyl-bis-(trifluoroacetamide) (MBTFA) for the gas-chromatographic analysis of OH-, SH- and NH-compounds. Fresenius J. Anal. Chem. 1994, 348, 364–370. [Google Scholar] [CrossRef]
Birkemeyer, C.; Kolasa, A.; Kopka, J. Comprehensive chemical derivatization for gas chromatography-mass spectrometry-based multi-targeted profiling of the major phytohormones. J. Chromatogr. A 2003, 993, 89–102. [Google Scholar] [CrossRef] [PubMed]
Caban, M.; Stepnowski, P.; Kwiatkowski, M.; Migowska, N.; Kumirska, J. Determination of β-blockers and β-agonists using gas chromatography and gas chromatography–mass spectrometry—A comparative study of the derivatization step. J. Chromatogr. A 2011, 1218, 8110–8122. [Google Scholar] [CrossRef]
Laine, R.A.; Sweeley, C.C. Analysis of trimethylsilyl O-methyloximes of carbohydrates by combined gas-liquid chromatography-mass spectrometry. Anal. Biochem. 1971, 43, 533–538. [Google Scholar] [CrossRef]
Shishova, M.; Puzanskiy, R.; Gavrilova, O.; Kurbanniazov, S.; Demchenko, K.; Yemelyanov, V.; Pendinen, G.; Shavarda, A.; Gavrilenko, T. Metabolic Alterations in Male-Sterile Potato as Compared to Male-Fertile. Metabolites 2019, 9, 24. [Google Scholar] [CrossRef] [PubMed]
Shtark, O.; Puzanskiy, R.; Avdeeva, G.; Yemelyanov, V.; Shavarda, A.; Romanyuk, D.; Kliukova, M.; Kirpichnikova, A.; Tikhonovich, I.; Zhukov, V.; et al. Metabolic Alterations in Pisum sativum Roots during Plant Growth and Arbuscular Mycorrhiza Development. Plants 2021, 10, 1033. [Google Scholar] [CrossRef]
Paudel, G.; Bilova, T.; Schmidt, R.; Greifenhagen, U.; Berger, R.; Tarakhovskaya, E.; Stöckhardt, S.; Balcke, G.U.; Humbeck, K.; Brandt, W.; et al. Osmotic stress is accompanied by protein glycation in Arabidopsis thaliana. J. Exp. Bot. 2016, 67, 6283–6295. [Google Scholar] [CrossRef]
Milkovska-Stamenova, S.; Schmidt, R.; Frolov, A. GC-MS Method for the Quantitation of Carbohydrate Intermediates in Glycation Systems. J. Agric. Food Chem. 2015, 63, 5911–5919. [Google Scholar] [CrossRef] [PubMed]
Little, J.L. Artifacts in trimethylsilyl derivatization reactions and ways to avoid them. J. Chromatogr. A 1999, 844, 1–22. [Google Scholar] [CrossRef]
Miyagawa, H.; Bamba, T. Comparison of sequential derivatization with concurrent methods for GC/MS-based metabolomics. J. Biosci. Bioeng. 2019, 127, 160–168. [Google Scholar] [CrossRef] [PubMed]
Costa, G.d.O.; Germano, A.T.; Bretanha, L.C.; Micke, G.A.; Siwe-Noundou, X.; Sandjo, L.P. GC-MS comparison of fatty acids profile of oils extracted from viscera of Tainha (Mugil liza) and Tambaqui (Colossoma macropomum). Nat. Prod. Res. 2024, 38, 2780–2785. [Google Scholar] [CrossRef]
Abdnim, R.; Lafdil, F.Z.; Elrherabi, A.; El Fadili, M.; Kandsi, F.; Benayad, O.; Legssyer, A.; Ziyyat, A.; Mekhfi, H.; Bnouham, M. Fatty acids characterisation by GC-MS, antiglycation effect at multiple stages and protection of erythrocytes cells from oxidative damage induced by glycation of albumin of Opuntia ficus-indica (L.) Mill seed oil cultivated in Eastern Morocco: Experimental and computational approaches. J. Ethnopharmacol. 2024, 329, 118106. [Google Scholar] [CrossRef]
Liu, R.-L.; Zhang, J.; Mou, Z.-L.; Hao, S.-L.; Zhang, Z.-Q. Microwave-assisted one-step extraction-derivatization for rapid analysis of fatty acids profile in herbal medicine by gas chromatography-mass spectrometry. Analyst 2012, 137, 5135–5143. [Google Scholar] [CrossRef]
Valim, M.F.; Killiny, N. Occurrence of free fatty acids in the phloem sap of different citrus varieties. Plant Signal. Behav. 2017, 12, e1327497. [Google Scholar] [CrossRef]
Guelette, B.S.; Benning, U.F.; Hoffmann-Benning, S. Identification of lipids and lipid-binding proteins in phloem exudates from Arabidopsis thaliana. J. Exp. Bot. 2012, 63, 3603–3616. [Google Scholar] [CrossRef]
Bouchonnet, S. Introduction to GC-MS Coupling; CRC Press: Boca Raton, FL, USA, 2013; ISBN 978-0-429-10086-4. [Google Scholar]
Meng, Z.; Fan, S.; Yuan, X.; Li, Q.; Huang, Y.; Niu, L.; Shi, G.; Zhang, Y. Rapid Screening of 22 Polycyclic Aromatic Hydrocarbons Residues in Vegetable Oils by Gas Chromatography-Electrostatic Field Orbitrap High Resolution Mass Spectrometry. Front. Nutr. 2022, 9, 949025. [Google Scholar] [CrossRef]
Kumar, A.; Sharma, C. Recent update of the various sources originating ghost peaks in gas chromatography: A review. J. Chromatogr. A 2022, 1685, 463625. [Google Scholar] [CrossRef]
Kováts, E. Gas-chromatographische Charakterisierung organischer Verbindungen. Teil 1: Retentionsindices aliphatischer Halogenide, Alkohole, Aldehyde und Ketone. Helv. Chim. Acta 1958, 41, 1915–1932. [Google Scholar] [CrossRef]
Koek, M.M.; Jellema, R.H.; van der Greef, J.; Tas, A.C.; Hankemeier, T. Quantitative metabolomics based on gas chromatography mass spectrometry: Status and perspectives. Metabolomics 2011, 7, 307–328. [Google Scholar] [CrossRef] [PubMed]
Rontani, J.-F. Use of Gas Chromatography-Mass Spectrometry Techniques (GC-MS, GC-MS/MS and GC-QTOF) for the Characterization of Photooxidation and Autoxidation Products of Lipids of Autotrophic Organisms in Environmental Samples. Molecules 2022, 27, 1629. [Google Scholar] [CrossRef]
Kirch, W. (Ed.) Pearson’s Correlation Coefficient. In Encyclopedia of Public Health; Springer: Dordrecht, The Netherlands, 2008; pp. 1090–1091. ISBN 978-1-4020-5613-0. [Google Scholar]
Currie, L.A. Nomenclature in evaluation of analytical methods including detection and quantification capabilities (IUPAC Recommendations 1995). Pure Appl. Chem. 1995, 67, 1699–1723. [Google Scholar] [CrossRef]
Indrayanto, G. Validation of Chromatographic Methods of Analysis: Application for Drugs That Derived from Herbs. Profiles Drug Subst. Excip. Relat. Methodol. 2018, 43, 359–392. [Google Scholar] [CrossRef] [PubMed]
Misra, B.B. New software tools, databases, and resources in metabolomics: Updates from 2020. Metabolomics 2021, 17, 49. [Google Scholar] [CrossRef] [PubMed]
Bean, H.D.; Hill, J.E.; Dimandja, J.-M.D. Improving the quality of biomarker candidates in untargeted metabolomics via peak table-based alignment of comprehensive two-dimensional gas chromatography-mass spectrometry data. J. Chromatogr. A 2015, 1394, 111–117. [Google Scholar] [CrossRef]
Tsugawa, H.; Rai, A.; Saito, K.; Nakabayashi, R. Metabolomics and complementary techniques to investigate the plant phytochemical cosmos. Nat. Prod. Rep. 2021, 38, 1729–1759. [Google Scholar] [CrossRef]
Carroll, A.J.; Salek, R.M.; Arita, M.; Kopka, J.; Walther, D. Editorial: Metabolome Informatics and Statistics: Current State and Emerging Trends. Front. Bioeng. Biotechnol. 2016, 4, 63. [Google Scholar] [CrossRef]
Ma, A.; Qi, X. Mining plant metabolomes: Methods, applications, and perspectives. Plant Commun. 2021, 2, 100238. [Google Scholar] [CrossRef] [PubMed]
Pierce, K.M.; Parsons, B.A.; Synovec, R.E. Pixel-Level Data Analysis Methods for Comprehensive Two-Dimensional Chromatography. In Data Handling in Science and Technology; Elsevier: Amsterdam, The Netherlands, 2015; Volume 29, pp. 427–463. [Google Scholar]
Rood, D. Gas Chromatography Problem Solving and Troubleshooting. J. Chromatogr. Sci. 1997, 35, 239–240. [Google Scholar] [CrossRef]
Chau, F.; Kai-man Leung, A. Chapter 9—Application of Wavelet Transform in Processing Chromatographic Data. In Data Handling in Science and Technology; Walczak, B., Ed.; Wavelets in Chemistry; Elsevier: Amsterdam, The Netherlands, 2000; Volume 22, pp. 205–223. [Google Scholar]
Stevenson, P.G.; Mnatsakanyan, M.; Guiochon, G.; Shalliker, R.A. Peak picking and the assessment of separation performance in two-dimensional high performance liquid chromatography. Analyst 2010, 135, 1541. [Google Scholar] [CrossRef] [PubMed]
Soggiu, A.; Marullo, O.; Roncada, P.; Capobianco, E. Empowering Spot Detection in 2DE Images by Wavelet Denoising. In Silico Biol. 2009, 9, 125–133. [Google Scholar] [CrossRef]
Smith, C.A.; Want, E.J.; O’Maille, G.; Abagyan, R.; Siuzdak, G. XCMS: Processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal. Chem. 2006, 78, 779–787. [Google Scholar] [CrossRef]
Dos Santos, E.K.P.; Canuto, G.A.B. Optimizing XCMS parameters for GC-MS metabolomics data processing: A case study. Metabolomics 2023, 19, 26. [Google Scholar] [CrossRef]
Mastrangelo, A.; Ferrarini, A.; Rey-Stolle, F.; García, A.; Barbas, C. From sample treatment to biomarker discovery: A tutorial for untargeted metabolomics based on GC-(EI)-Q-MS. Anal. Chim. Acta 2015, 900, 21–35. [Google Scholar] [CrossRef]
Wenig, P.; Odermatt, J. OpenChrom: A cross-platform open source software for the mass spectrometric analysis of chromatographic data. BMC Bioinform. 2010, 11, 405. [Google Scholar] [CrossRef]
Lai, Z.; Tsugawa, H.; Wohlgemuth, G.; Mehta, S.; Mueller, M.; Zheng, Y.; Ogiwara, A.; Meissen, J.; Showalter, M.; Takeuchi, K.; et al. Identifying metabolites by integrating metabolome databases with mass spectrometry cheminformatics. Nat. Methods 2018, 15, 53–56. [Google Scholar] [CrossRef]
Fiehn, O.; Robertson, D.; Griffin, J.; van der Werf, M.; Nikolau, B.; Morrison, N.; Sumner, L.W.; Goodacre, R.; Hardy, N.W.; Taylor, C.; et al. The metabolomics standards initiative (MSI). Metabolomics 2007, 3, 175–178. [Google Scholar] [CrossRef]
Kind, T.; Fiehn, O. Advances in structure elucidation of small molecules using mass spectrometry. Bioanal. Rev. 2010, 2, 23–60. [Google Scholar] [CrossRef]
Tsugawa, H. Advances in computational metabolomics and databases deepen the understanding of metabolisms. Curr. Opin. Biotechnol. 2018, 54, 10–17. [Google Scholar] [CrossRef]
Lucero, M.; Estell, R.; Tellez, M.; Fredrickson, E. A retention index calculator simplifies identification of plant volatile organic compounds. Phytochem. Anal. 2009, 20, 378–384. [Google Scholar] [CrossRef]
Tsugawa, H.; Tsujimoto, Y.; Arita, M.; Bamba, T.; Fukusaki, E. GC/MS based metabolomics: Development of a data mining system for metabolite identification by using soft independent modeling of class analogy (SIMCA). BMC Bioinform. 2011, 12, 131. [Google Scholar] [CrossRef]
Stein, S.E.; Scott, D.R. Optimization and testing of mass spectral library search algorithms for compound identification. J. Am. Soc. Mass Spectrom. 1994, 5, 859–866. [Google Scholar] [CrossRef]
Mihaleva, V.V.; Verhoeven, H.A.; De Vos, R.C.H.; Hall, R.D.; Van Ham, R.C.H.J. Automated procedure for candidate compound selection in GC-MS metabolomics based on prediction of Kovats retention index. Bioinformatics 2009, 25, 787–794. [Google Scholar] [CrossRef] [PubMed]
Koo, I.; Kim, S.; Zhang, X. Comparative Analysis of Mass Spectral Matching-based Compound Identification in Gas Chromatography Mass Spectrometry. J. Chromatogr. A 2013, 1298, 132–138. [Google Scholar] [CrossRef] [PubMed]
Wishart, D.S.; Guo, A.; Oler, E.; Wang, F.; Anjum, A.; Peters, H.; Dizon, R.; Sayeeda, Z.; Tian, S.; Lee, B.L.; et al. HMDB 5.0: The Human Metabolome Database for 2022. Nucleic Acids Res. 2022, 50, D622–D631. [Google Scholar] [CrossRef]
Horai, H.; Arita, M.; Kanaya, S.; Nihei, Y.; Ikeda, T.; Suwa, K.; Ojima, Y.; Tanaka, K.; Tanaka, S.; Aoshima, K.; et al. MassBank: A public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 2010, 45, 703–714. [Google Scholar] [CrossRef] [PubMed]
Hummel, J.; Strehmel, N.; Bölling, C.; Schmidt, S.; Walther, D.; Kopka, J. Mass Spectral Search and Analysis Using the Golm Metabolome Database. In The Handbook of Plant Metabolomics; Weckwerth, W., Kahl, G., Eds.; Wiley: Hoboken, NJ, USA, 2013; pp. 321–343. ISBN 978-3-527-32777-5. [Google Scholar]
Harvey, D.J.; Vouros, P. Mass spectrometric fragmentation of trimethylsilyl and related alkylsilyl derivatives. Mass Spectrom. Rev. 2020, 39, 105–211. [Google Scholar] [CrossRef] [PubMed]
Sumner, L.W.; Amberg, A.; Barrett, D.; Beale, M.H.; Beger, R.; Daykin, C.A.; Fan, T.W.-M.; Fiehn, O.; Goodacre, R.; Griffin, J.L.; et al. Proposed minimum reporting standards for chemical analysis: Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics 2007, 3, 211–221. [Google Scholar] [CrossRef]
Alonso, A.-M.; Reyes-Maldonado, O.K.; Puebla-Pérez, A.M.; Arreola, M.P.G.; Velasco-Ramírez, S.F.; Zúñiga-Mayo, V.; Sánchez-Fernández, R.E.; Delgado-Saucedo, J.-I.; Velázquez-Juárez, G. GC/MS Analysis, Antioxidant Activity, and Antimicrobial Effect of Pelargonium peltatum (Geraniaceae). Molecules 2022, 27, 3436. [Google Scholar] [CrossRef]
Wei, R.; Wang, J.; Su, M.; Jia, E.; Chen, S.; Chen, T.; Ni, Y. Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data. Sci. Rep. 2018, 8, 663. [Google Scholar] [CrossRef]
Flores, J.E.; Claborne, D.M.; Weller, Z.D.; Webb-Robertson, B.-J.M.; Waters, K.M.; Bramer, L.M. Missing data in multi-omics integration: Recent advances through artificial intelligence. Front. Artif. Intell. 2023, 6, 1098308. [Google Scholar] [CrossRef]
Dieterle, F.; Ross, A.; Schlotterbeck, G.; Senn, H. Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. Anal. Chem. 2006, 78, 4281–4290. [Google Scholar] [CrossRef] [PubMed]
Bylesjö, M.; Cloarec, O.; Rantalainen, M. Normalization and Closure; Elsevier: Amsterdam, The Netherlands, 2009; pp. A109–A127. [Google Scholar]
Noonan, M.J.; Tinnesand, H.V.; Buesching, C.D. Normalizing Gas-Chromatography-Mass Spectrometry Data: Method Choice can Alter Biological Inference. Bioessays 2018, 40, e1700210. [Google Scholar] [CrossRef]
Broadhurst, D.I.; Kell, D.B. Statistical strategies for avoiding false discoveries in metabolomics and related experiments. Metabolomics 2006, 2, 171–196. [Google Scholar] [CrossRef]
Goodacre, R.; Broadhurst, D.; Smilde, A.K.; Kristal, B.S.; Baker, J.D.; Beger, R.; Bessant, C.; Connor, S.; Capuani, G.; Craig, A.; et al. Proposed minimum reporting standards for data analysis in metabolomics. Metabolomics 2007, 3, 231–241. [Google Scholar] [CrossRef]
Alonso, A.; Marsal, S.; Julià, A. Analytical Methods in Untargeted Metabolomics: State of the Art in 2015. Front. Bioeng. Biotechnol. 2015, 3, 23. [Google Scholar] [CrossRef]
Scheubert, K.; Hufsky, F.; Petras, D.; Wang, M.; Nothias, L.-F.; Dührkop, K.; Bandeira, N.; Dorrestein, P.C.; Böcker, S. Significance estimation for large scale metabolomics annotations by spectral matching. Nat. Commun. 2017, 8, 1494. [Google Scholar] [CrossRef] [PubMed]
Benjamini, Y.; Hochberg, Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995, 57, 289–300. [Google Scholar] [CrossRef]
Pomerantsev, A.L.; Rodionova, O.Y. New trends in qualitative analysis: Performance, optimization, and validation of multi-class and soft models. TrAC Trends Anal. Chem. 2021, 143, 116372. [Google Scholar] [CrossRef]
Tarakhovskaya, E.; Lemesheva, V.; Bilova, T.; Birkemeyer, C. Early Embryogenesis of Brown Alga Fucus vesiculosus L. is Characterized by Significant Changes in Carbon and Energy Metabolism. Molecules 2017, 22, 1509. [Google Scholar] [CrossRef] [PubMed]
Vinay, C.M.; Udayamanoharan, S.K.; Prabhu Basrur, N.; Paul, B.; Rai, P.S. Current analytical technologies and bioinformatic resources for plant metabolomics data. Plant Biotechnol. Rep. 2021, 15, 561–572. [Google Scholar] [CrossRef]
Schmid, R.; Heuckeroth, S.; Korf, A.; Smirnov, A.; Myers, O.; Dyrlund, T.S.; Bushuiev, R.; Murray, K.J.; Hoffmann, N.; Lu, M.; et al. Integrative analysis of multimodal mass spectrometry data in MZmine 3. Nat. Biotechnol. 2023, 41, 447–449. [Google Scholar] [CrossRef]
Chong, J.; Wishart, D.S.; Xia, J. Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis. Curr. Protoc. Bioinform. 2019, 68, e86. [Google Scholar] [CrossRef]
Rohn, H.; Junker, A.; Hartmann, A.; Grafahrend-Belau, E.; Treutler, H.; Klapperstück, M.; Czauderna, T.; Klukas, C.; Schreiber, F. VANTED v2: A framework for systems biology applications. BMC Syst. Biol. 2012, 6, 139. [Google Scholar] [CrossRef]
Misra, B.B. Advances in high resolution GC-MS technology: A focus on the application of GC-Orbitrap-MS in metabolomics and exposomics for FAIR practices. Anal. Methods 2021, 13, 2265–2282. [Google Scholar] [CrossRef]
Sakurai, N. Recent applications of metabolomics in plant breeding. Breed. Sci. 2022, 72, 56–65. [Google Scholar] [CrossRef] [PubMed]
Aksenov, A.A.; Laponogov, I.; Zhang, Z.; Doran, S.L.F.; Belluomo, I.; Veselkov, D.; Bittremieux, W.; Nothias, L.F.; Nothias-Esposito, M.; Maloney, K.N.; et al. Auto-deconvolution and molecular networking of gas chromatography-mass spectrometry data. Nat. Biotechnol. 2021, 39, 169–173. [Google Scholar] [CrossRef]
Stein, S.E. An integrated method for spectrum extraction and compound identification from gas chromatography/mass spectrometry data. J. Am. Soc. Mass Spectrom. 1999, 10, 770–781. [Google Scholar] [CrossRef]
Kessler, N.; Neuweger, H.; Bonte, A.; Langenkämper, G.; Niehaus, K.; Nattkemper, T.W.; Goesmann, A. MeltDB 2.0–advances of the metabolomics software system. Bioinformatics 2013, 29, 2452–2459. [Google Scholar] [CrossRef]
Haug, K.; Cochrane, K.; Nainala, V.C.; Williams, M.; Chang, J.; Jayaseelan, K.V.; O’Donovan, C. MetaboLights: A resource evolving in response to the needs of its scientific community. Nucleic Acids Res. 2019, 48, gkz1019. [Google Scholar] [CrossRef]
Hiller, K.; Hangebrauk, J.; Jäger, C.; Spura, J.; Schreiber, K.; Schomburg, D. MetaboliteDetector: Comprehensive Analysis Tool for Targeted and Nontargeted GC/MS Based Metabolome Analysis. Anal. Chem. 2009, 81, 3429–3439. [Google Scholar] [CrossRef]
Lommen, A. MetAlign: Interface-driven, versatile metabolomics tool for hyphenated full-scan mass spectrometry data preprocessing. Anal. Chem. 2009, 81, 3079–3086. [Google Scholar] [CrossRef] [PubMed]
Wen, B.; Mei, Z.; Zeng, C.; Liu, S. metaX: A flexible and comprehensive software for processing metabolomics data. BMC Bioinform. 2017, 18, 183. [Google Scholar] [CrossRef]
Lei, Z.; Li, H.; Chang, J.; Zhao, P.X.; Sumner, L.W. MET-IDEA version 2.06; improved efficiency and additional functions for mass spectrometry-based metabolomics data processing. Metabolomics 2012, 8, 105–110.
Nezami Ranjbar, M.R.; Poto, C.D.; Wang, Y.; Ressom, H.W. SIMAT: GC-SIM-MS data analysis tool. BMC Bioinform. 2015, 16, 259.
Gressling, T. 76 Gas chromatography-mass spectrometry (GC-MS). In Data Science in Chemistry; De Gruyter: Berlin, Germany, 2020; pp. 373–376. ISBN 978-3-11-062945-3.
Luedemann, A.; Strassburg, K.; Erban, A.; Kopka, J. TagFinder for the quantitative analysis of gas chromatography-mass spectrometry (GC-MS)-based metabolite profiling experiments. Bioinformatics 2008, 24, 732–737.
Karp, P.; Billington, R.; Holland, T.; Kothari, A.; Krummenacker, M.; Weaver, D.; Latendresse, M.; Paley, S. Computational Metabolomics Operations at BioCyc.org. Metabolites 2015, 5, 291–310.
Kanehisa, M. KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 2000, 28, 27–30.
Usadel, B.; Poree, F.; Nagel, A.; Lohse, M.; Czedik-Eysenberg, A.; Stitt, M. A guide to using MapMan to visualize and compare Omics data in plants: A case study in the crop species, Maize. Plant Cell Environ. 2009, 32, 1211–1229.
Schreiber, F.; Colmsee, C.; Czauderna, T.; Grafahrend-Belau, E.; Hartmann, A.; Junker, A.; Junker, B.H.; Klapperstück, M.; Scholz, U.; Weise, S. MetaCrop 2.0: Managing and exploring information about crop plant metabolism. Nucleic Acids Res. 2012, 40, D1173–D1177.
Barupal, D.K.; Haldiya, P.K.; Wohlgemuth, G.; Kind, T.; Kothari, S.L.; Pinkerton, K.E.; Fiehn, O. MetaMapp: Mapping and visualizing metabolomic data by integrating information from biochemical pathways and chemical and mass spectral similarity. BMC Bioinform. 2012, 13, 99.
Wurtele, E.S.; Li, J.; Diao, L.; Zhang, H.; Foster, C.M.; Fatland, B.; Dickerson, J.; Brown, A.; Cox, Z.; Cook, D.; et al. MetNet: Software to Build and Model the Biogenetic Lattice of Arabidopsis. Comp. Funct. Genom. 2003, 4, 239–245.
Rojas-Chertó, M.; Van Vliet, M.; Peironcely, J.E.; Van Doorn, R.; Kooyman, M.; Te Beek, T.; Van Driel, M.A.; Hankemeier, T.; Reijmers, T. MetiTree: A web application to organize and process high-resolution multi-stage mass spectrometry metabolomics data. Bioinformatics 2012, 28, 2707–2709.
Karnovsky, A.; Weymouth, T.; Hull, T.; Tarcea, V.G.; Scardoni, G.; Laudanna, C.; Sartor, M.A.; Stringer, K.A.; Jagadish, H.V.; Burant, C.; et al. Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data. Bioinformatics 2012, 28, 373–380.
Martens, M.; Ammar, A.; Riutta, A.; Waagmeester, A.; Slenter, D.N.; Hanspers, K.; Miller, R.A.; Digles, D.; Lopes, E.N.; Ehrhart, F.; et al. WikiPathways: Connecting communities. Nucleic Acids Res. 2021, 49, D613–D621.
Ramirez-Gaona, M.; Marcu, A.; Pon, A.; Grant, J.; Wu, A.; Wishart, D.S. A Web Tool for Generating High Quality Machine-readable Biological Pathways. J. Vis. Exp. 2017, 120, 54869.
Hawkins, C.; Ginzburg, D.; Zhao, K.; Dwyer, W.; Xue, B.; Xu, A.; Rice, S.; Cole, B.; Paley, S.; Karp, P.; et al. Plant Metabolic Network 15: A resource of genome-wide metabolism databases for 126 plants and algae. J. Integr. Plant Biol. 2021, 63, 1888–1905.
Pujar, A.; Jaiswal, P.; Kellogg, E.A.; Ilic, K.; Vincent, L.; Avraham, S.; Stevens, P.; Zapata, F.; Reiser, L.; Rhee, S.Y.; et al. Whole-Plant Growth Stage Ontology for Angiosperms and Its Application in Plant Biology. Plant Physiol. 2006, 142, 414–428.
Sommer, B.; Schreiber, F. Integration and virtual reality exploration of biomedical data with CmPI and VANTED. IT - Inf. Technol. 2017, 59, 181–190.
Fukushima, A.; Takahashi, M.; Nagasaki, H.; Aono, Y.; Kobayashi, M.; Kusano, M.; Saito, K.; Kobayashi, N.; Arita, M. Development of RIKEN Plant Metabolome MetaDatabase. Plant Cell Physiol. 2022, 63, 433–440.

Figure 1. Workflow of sample harvesting, fixation and extraction for GC-MS analysis: (a) experimental plant object; (b) harvesting; (c) freeze drying; (d) freezing by liquid nitrogen and grinding by ball mill; (e) steam distillation; (f) weighing samples; (g,h) extraction techniques: (g) SLE—solid–liquid extraction, LLE—liquid–liquid extraction, MAE—microwave-assisted extraction, UAE—ultrasound-assisted extraction, SFE—supercritical fluid extraction, SHWE—super-heated water extraction, PLE—pressurized liquid extraction; (h) SPE—solid-phase extraction, SPME—solid-phase microextraction, MSPD—matrix solid-phase dispersion, MSPE—magnetic SPE, PT-SPE—pipette-tip SPE, MEPS—microextraction by packed sorbent, LLME—liquid–liquid microextraction, DLLME—dispersive liquid–liquid microextraction, SDME—single-drop microextraction, SBSE—stir-bar sorptive extraction.

Figure 2. Typical sample sequence (batch) for GC-MS analysis. Mixes of alkanes or FAMEs are used for RI calibration. ^a Experimental samples (randomly arranged) and a mix of standards, which can serve as reference compounds or for external or standard addition calibration (n ≥ 3); ^b at least six separate QC samples, each in duplicate, during the analysis [45].
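
RI calibration against an alkane mix, as mentioned in the caption, is commonly based on the linear (temperature-programmed) retention index, in which the analyte retention time is interpolated between the two bracketing n-alkanes. The Python sketch below illustrates that calculation under this assumption; the function name `linear_retention_index` and the alkane retention times are hypothetical and are not taken from the review.

```python
from bisect import bisect_right


def linear_retention_index(t_x, alkanes):
    """Linear (temperature-programmed) retention index of an analyte.

    t_x     -- analyte retention time (min)
    alkanes -- dict {carbon number: retention time (min)} for the alkane mix
    """
    carbons = sorted(alkanes)
    times = [alkanes[c] for c in carbons]
    if times != sorted(times):
        raise ValueError("alkane retention times must increase with carbon number")
    if not times[0] <= t_x <= times[-1]:
        raise ValueError("analyte elutes outside the calibrated alkane range")
    # Index of the alkane eluting just before (or together with) the analyte.
    i = min(bisect_right(times, t_x) - 1, len(times) - 2)
    n, t_n, t_next = carbons[i], times[i], times[i + 1]
    step = carbons[i + 1] - n  # allows mixes that skip carbon numbers
    return 100 * (n + step * (t_x - t_n) / (t_next - t_n))


# Hypothetical alkane ladder (illustration only, not measured data).
ladder = {10: 8.0, 12: 12.0, 14: 16.0}
print(linear_retention_index(10.0, ladder))
```

An analyte eluting halfway between decane and dodecane in this made-up ladder is assigned an index halfway between 1000 and 1200. A FAME-based scale would need its own reference values and is not covered by this sketch.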

Figure 3. Example of external calibration and standard addition results for rhamnose, which is usually a low-abundance metabolite in plant extracts. (a) EI mass spectrum of rhamnose. (b,c) Peak areas (n = 3), shown for each standard concentration, were integrated from extracted ion chromatograms reconstructed for the characteristic m/z 277 within t_R = 23.60–23.80 min (external calibration) and t_R = 23.50–23.60 min (standard addition calibration). In the external calibration table, the orange and red frames mark the rhamnose concentrations corresponding to the LOD (S/N ≥ 3) and LOQ (S/N ≥ 10), respectively. (d) External calibration curves built from peak areas versus standard concentrations and from their log-transformed values. (e) Standard addition calibration curve. EIC (or XIC): extracted ion chromatogram.
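
Both calibration modes in this caption come down to a least-squares line through peak areas. External calibration inverts the line to read a concentration, whereas standard addition extrapolates it to zero signal, so that the endogenous concentration equals intercept divided by slope. The Python sketch below illustrates this arithmetic together with the S/N thresholds for LOD and LOQ. All function names and numbers are hypothetical illustration values, not the rhamnose data of the figure.

```python
def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def external_calibration(area, std_conc, std_area):
    """Concentration read off a calibration line built from pure standards."""
    slope, intercept = fit_line(std_conc, std_area)
    return (area - intercept) / slope


def standard_addition(added_conc, area):
    """Endogenous concentration from aliquots spiked with known amounts.

    The fitted line is extrapolated to zero signal; the magnitude of the
    x-intercept (intercept / slope) is the concentration in the unspiked sample.
    """
    slope, intercept = fit_line(added_conc, area)
    return intercept / slope


def lowest_conc_with_sn(sn_by_conc, threshold):
    """Lowest standard concentration whose S/N meets the threshold, else None."""
    passing = [c for c, sn in sn_by_conc.items() if sn >= threshold]
    return min(passing) if passing else None


# Hypothetical numbers for illustration only (arbitrary units).
std_conc = [1.0, 2.0, 4.0, 8.0]
std_area = [100.0, 200.0, 400.0, 800.0]
print(external_calibration(300.0, std_conc, std_area))

spiked = [0.0, 1.0, 2.0, 3.0]
spiked_area = [150.0, 250.0, 350.0, 450.0]
print(standard_addition(spiked, spiked_area))

sn_by_conc = {0.1: 1.2, 0.5: 4.0, 1.0: 11.0, 2.0: 25.0}
print(lowest_conc_with_sn(sn_by_conc, 3), lowest_conc_with_sn(sn_by_conc, 10))
```

The sketch fits unweighted lines on the raw scale. Fitting log-transformed values, as in panel (d), would only require transforming both inputs before calling `fit_line`.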

Figure 4. GC-MS data mining and processing workflow. Data mining: (a) deconvolution of mass spectra; (b) chromatogram alignment by analyte retention times (t_R); (c) peak picking and analyte identification, RI—retention index; (d) peak area integration (XIC—extracted ion chromatogram reconstructed for characteristic m/z values and t_Rs of individual metabolites). Data processing: (e) building a matrix of integrated analyte peak areas. Data post-processing: (f) statistical analysis; (g) elucidation of metabolic pathways.
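
Step (d) of this workflow, peak area integration from an XIC, can be illustrated with a short Python sketch. It assumes scans are available as (retention time, spectrum) pairs and uses a plain trapezoid rule without baseline correction or deconvolution, which dedicated software would normally add. The function name `xic_peak_area`, the tolerance `mz_tol`, and the scan data are hypothetical.

```python
def xic_peak_area(scans, target_mz, t_start, t_end, mz_tol=0.5):
    """Trapezoidal area of an extracted ion chromatogram within a t_R window.

    scans -- list of (retention_time_min, {m/z: intensity}) in time order
    """
    # Reconstruct the XIC: summed intensity of ions within mz_tol of target_mz.
    xic = [
        (t, sum(inten for mz, inten in spectrum.items()
                if abs(mz - target_mz) <= mz_tol))
        for t, spectrum in scans
        if t_start <= t <= t_end
    ]
    # Trapezoid rule over consecutive scans (no baseline correction).
    return sum(
        (t2 - t1) * (i1 + i2) / 2
        for (t1, i1), (t2, i2) in zip(xic, xic[1:])
    )


# Hypothetical scans (illustration only); other ions are ignored by the XIC.
scans = [
    (23.5, {277: 0.0, 73: 500.0}),
    (23.6, {277: 100.0}),
    (23.7, {277: 200.0, 147: 50.0}),
    (23.8, {277: 100.0}),
    (23.9, {277: 0.0}),
]
area = xic_peak_area(scans, 277.0, 23.5, 23.9)
print(round(area, 3))
```

Repeating this call for each metabolite and sample yields the entries of the peak area matrix in step (e).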

Figure 5. A typical total ion current (TIC) chromatogram obtained by GC-EI-Q-MS analysis of 7-week-old Arabidopsis thaliana plants, with three marked retention time windows (t_R1–t_R3) indicating where some chemical classes of primary metabolites elute. t_R1: C3–5 organic acids, amino acids (Gly, Ala, Val, Ser, Thr, Pro, Asp, Glu, Asn, Gln, Met, Phe), C3–5 carbohydrates and their derivatives; t_R2: C6–7 monosaccharides and their derivatives, amino acids (Lys, Tyr, Trp), C16–18 fatty acids; t_R3: di- and oligosaccharides and their derivatives, sterols, lysolipids.

Table 1. Types of mass analyzers applied in GC-MS-based plant metabolomics.

Analytical Platform ^a	Basic Principle of Mass Analysis	Mass Accuracy (a.m.u.) ^b	Main Field of Application	Field of Potential Application
Quadrupole	Selective mass filtering based on applied RF and DC voltages	0.1–0.01	Analysis of specific compounds or classes of metabolites	Quantification and confirmation of known metabolites
Quadrupole Ion Trap (QIT)	Ion trapping and mass analysis based on the stability of ions in a quadrupole field	0.1–0.01	Structural elucidation and fragmentation analysis	Screening for unknown metabolites, natural product discovery
Linear Ion Trap (LIT)	Ion trapping and mass analysis based on the stability of ions in a linear RF field	0.1–0.01	MSⁿ experiments, enhanced structural characterization	Identification of isomeric compounds
Quadrupole–Linear Ion Trap (QLIT)	Combination of quadrupole and linear ion trap analyzers for improved selectivity and sensitivity	0.1–0.01	Analysis with enhanced sensitivity and dynamic range	Metabolite pathway analysis, drug metabolite profiling
Time-of-Flight (TOF)	Measurement of ion flight times to determine their m/z	0.01–0.001	Untargeted metabolomics, comprehensive profiling	Screening for unknown metabolites, biomarker discovery
Quadrupole–Time-of-Flight (Q-TOF)	Combination of quadrupole and TOF analyzers for precursor ion selection and accurate mass measurement	0.01–0.001	Metabolite identification and characterization	Comparative metabolomics, pathway analysis
Magnetic Sector	Deflection of ions based on their m/z in a magnetic field	0.001–0.0001	Accurate quantification and structural elucidation of metabolites	Isomer differentiation, metabolite profiling
Triple Quadrupole (QqQ)	Selective mass filtering and fragmentation in multiple stages	0.001–0.0001	Quantitative analysis with high sensitivity and selectivity	Metabolite quantification, trace-level analysis
Orbitrap	Detection of ion oscillations in a high-resolution electrostatic field	0.0001–0.00001	High-resolution accurate mass analysis, metabolite profiling	Discovery of unknown metabolites, metabolite annotation
Quadrupole–Orbitrap (Q-Orbitrap)	Combination of quadrupole and Orbitrap analyzers for precursor ion selection and high-resolution accurate mass measurement	0.0001–0.00001	Comprehensive metabolite profiling and identification	Metabolomics, metabolite biomarker discovery

^a Platforms are ranked by their resolving power; ^b the mass accuracy values are approximate ranges and may vary depending on the specific instrument setup, calibration, and experimental conditions.
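
Table 1 gives mass accuracy as an absolute error in mass units, whereas instrument specifications often quote a relative error in parts per million (ppm). The two are related by ppm = (absolute error / m/z) × 10^6. The Python sketch below converts the table's order-of-magnitude values at m/z 277, the characteristic ion from Figure 3 chosen here only as an example; the function name `mass_error_ppm` is hypothetical.

```python
def mass_error_ppm(abs_error, mz):
    """Relative mass error in ppm for an absolute error at a given m/z."""
    return abs_error / mz * 1e6


# Orders of magnitude from Table 1, evaluated at an example ion of m/z 277.
for err in (0.1, 0.01, 0.001, 0.0001, 0.00001):
    print(f"{err:g} at m/z 277 -> {mass_error_ppm(err, 277.0):.2f} ppm")
```

Because the relative error scales inversely with m/z, the same absolute accuracy corresponds to a larger ppm value for the low-mass fragments typical of EI spectra.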
