Next Article in Journal
Cognitive–Affective Negotiation Process in Green Food Purchase Intention: A Qualitative Study Based on Grounded Theory
Previous Article in Journal
Accurate Classification of Multi-Cultivar Watermelons via GAF-Enhanced Feature Fusion Convolutional Neural Networks
Previous Article in Special Issue
Increased Oxygen Treatment in the Fermentation Process Improves the Taste and Liquor Color Qualities of Black Tea
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Characterization and Exploration of the Flavor Profiles of Green Teas from Different Leaf Maturity Stages of Camellia sinensis cv. Fudingdabai Using E-Nose, E-Tongue, and HS-GC-IMS Combined with Machine Learning

College of Food and Pharmaceutical Engineering, Guizhou Institute of Technology, Guiyang 550025, China
*
Authors to whom correspondence should be addressed.
Foods 2025, 14(16), 2861; https://doi.org/10.3390/foods14162861
Submission received: 14 July 2025 / Revised: 13 August 2025 / Accepted: 14 August 2025 / Published: 18 August 2025
(This article belongs to the Collection Advances in Tea Chemistry)

Abstract

Understanding how leaf maturity affects the flavor attributes of green tea is crucial for optimizing harvest timing and processing strategies. This study comprehensively characterized the flavor profiles of Fudingdabai green teas at three distinct leaf maturity stages—single bud (FDQSG), one bud + one leaf (FDMJ1G), and one bud + two leaves (FDTC2G)—using a multimodal approach integrating electronic nose, electronic tongue, HS-GC-IMS, relative odor activity value (rOAV) evaluation, and machine learning algorithms. A total of 85 volatile compounds (VOCs) were identified, of which 41 had rOAV > 1. Notably, 2-methylbutanal, 2-ethyl-3,5-dimethylpyrazine, and linalool exhibited extremely high rOAVs (>1000). FDQSG was enriched with LOX (lipoxygenase)-derived fresh, grassy volatiles such as (Z)-3-hexen-1-ol and nonanal. FDMJ1G showed a pronounced accumulation of floral and fruity compounds, especially linalool (rOAV: 7400), while FDTC2G featured Maillard- and phenylalanine-derived volatiles like benzene acetaldehyde and 2,5-dimethylfuran, contributing to roasted and cocoa-like aromas. KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis revealed significant enrichment in butanoate metabolism and monoterpenoid biosynthesis. Random forest–SHAP analysis identified 20 key flavor markers, mostly VOCs, that effectively discriminated samples by tenderness grade. ROC–AUC validation further confirmed their diagnostic performance (accuracy ≥ 0.8). These findings provide a scientific basis for flavor-driven harvest management and the quality-oriented grading of Fudingdaibai green tea.

1. Introduction

Green tea, a globally consumed non-fermented tea product, is prized for its distinctive fresh, floral, and umami sensory attributes [1,2], as well as its significant health benefits, including antioxidant [3,4], anti-inflammatory [5], neuroprotective [6], anticancer [7], cardioprotective [8], antihyperglycemic, and anti-obesity effects [4]. These nutritional and sensory qualities are closely linked to the cultivar and maturity stage of the tea leaves used for processing. Among the diverse tea cultivars in China, Camellia sinensis cv. Fudingdabai stands out as a high-quality variety due to its favorable bud and leaf morphology [9,10]. It is widely cultivated not only in its native Fujian Province but also in regions such as Guizhou, where distinct geographical conditions contribute to unique flavor signatures and nutritional profiles of processed teas derived from this cultivar [11,12]. The Guizhou region, characterized by its low latitude, high altitude, and limited sunshine, provides an ideal ecological niche for premium tea production. Under this terroir, Fudingdaibai green tea (FDG) develops distinctive sensory attributes shaped by the synergy between cultivar and processing techniques [12,13,14].
A critical agronomic and quality-related factor influencing green tea is leaf maturity stage, typically categorized as single bud, one bud with one leaf, and one bud with two leaves. The maturation stage of tea shoots directly affects the chemical composition of flavor precursors and secondary metabolites [15,16,17]. Prior studies have demonstrated that teas made from more tender shoots generally exhibit enhanced sweetness and umami, along with lower bitterness, which are associated with the accumulation of metabolites such as maltitol and L-aspartic acid [18]. Similarly, in yellow tea, tender buds have been shown to possess higher antioxidant and α-glucosidase inhibitory activity, while mature leaves exhibit greater levels of Maillard reaction-derived heterocyclic volatiles [19]. Recent research also suggests that the distribution of chiral aroma compounds in green tea varies systematically with leaf maturity [20]. Additionally, Xu et al. [17] found that green teas made from tender shoots had higher amino acid and polyphenol content, yielding fresher and sweeter taste profiles, whereas mature leaf teas exhibited intensified bitterness and astringency. Similar maturity-dependent changes in flavor-active compounds and bioactivity have been documented in Lithocarpus litseifolius (sweet tea) [21], mulberry leaf tea [22], black tea [23], and white tea [24], suggesting that leaf maturity is a general determinant of tea flavor and functional properties.
Despite growing interest in the relationship between leaf maturity and tea quality, a systematic flavoromic investigation of how varying leaf maturity stages affect the flavor characteristics, volatile profiles, sensory attributes, and related metabolic pathways in Fudingdabai green tea (FDG)—especially from the Guizhou region—remains lacking. Clarifying these relationships is critical for identifying quality markers and establishing rapid grading frameworks, with important implications for harvest scheduling and processing control.
Recent advancements in flavoromics have enabled more comprehensive tea flavor evaluation. Headspace gas chromatography–ion mobility spectrometry (HS-GC-IMS) has emerged as a powerful analytical platform owing to its high sensitivity, fast analysis time, and minimal sample preparation [2,25,26,27,28]. This technique excels in detecting key volatiles such as aldehydes and ketones, and has been successfully applied to characterize the aroma of green, white, oolong, black, and dark teas, as well as baijiu and meat products [2,25,26,27,28,29,30,31,32]. In parallel, electronic nose (E-nose) and electronic tongue (E-tongue) systems enable rapid quantification of overall aroma and taste features, thereby overcoming the limitations of traditional sensory evaluation [31,32]. Moreover, relative odor activity value (rOAV) analysis provides a quantitative estimation of the contribution of each volatile to the overall aroma profile [25,33]. However, the standalone application of these techniques is often insufficient to fully capture the complexity of food flavor systems.
The integration of machine learning (ML) algorithms presents an effective solution for decoding such multidimensional datasets. Algorithms such as random forest (RF), support vector machine (SVM), k-nearest neighbor (KNN), Gaussian naïve Bayes (GNB), logistic regression (LR), and decision tree (DT) have shown strong performance in modeling nonlinear and high-dimensional flavor data, while also enabling the selection of key discriminative features [34]. These tools have been increasingly used to evaluate and predict flavor quality in baijiu, tea, and other food products [25,26,34,35,36]. Notably, ensemble modeling strategies combining multiple algorithms can further enhance prediction accuracy and reduce model bias [34].
Against this backdrop, the present study aimed to investigate the flavor characteristics of FDG processed from leaves of three different maturity stages—single bud (FDQSG), one bud plus one leaf (FDMJ1G), and one bud plus two leaves (FDTC2G)—harvested from Guizhou Province. By integrating HS-GC-IMS, E-nose, E-tongue, rOAV calculation, and multiple machine learning models, we sought to (1) characterize how tenderness affects the overall flavor profile of FDG, (2) identify key flavor markers associated with leaf maturity, and (3) elucidate potential biosynthetic pathways of differential volatile compounds. The findings offer novel insights into the flavor basis of tea grading and provide a theoretical foundation and technical support for optimized harvest timing and tailored processing of FDG.

2. Materials and Methods

2.1. Tea Samples and Chemical Reagents

Three green tea samples representing different leaf maturity stages were processed from the fresh leaves of Camellia sinensis ‘Fudingdabai’. The harvested fresh shoots included single bud (FDQSG, Fuding Que She), one bud with one leaf (FDMJG, Fuding Maojian), and one bud with two leaves (FDTC2G, Fuding Taicha). All raw tea materials were collected in March and April 2023 from nine-year-old tea plants grown at the Hongfengshanyun Tea Farm in Qingzhen, Guizhou Province, China (106°22′ E, 26°31′ N). To ensure consistency and representativeness, all samples were harvested from equivalent positions on the tea bushes and then thoroughly homogenized prior to analysis. The processed green tea samples were stored at –4 °C until further evaluation. For instrumental analysis, each group was tested in triplicate for the electronic nose (E-nose) and HS-GC-IMS, and in quintuplicate (n = 5) for the electronic tongue (E-tongue) to ensure measurement accuracy and repeatability.
All chemical reagents were of chromatographic or analytical grade. Ethyl decanoate (≥98%, GC grade, TCI, Shanghai, China) was used as the internal standard for volatile compound analysis. Standard n-ketones for retention index (RI) calibration—2-butanone (99.5%), 2-pentanone (99.5%), 2-hexanone (99.5%), 2-heptanone (98%), 2-octanone (99%), and 2-nonanone (99%)—were purchased from Sinopharm Chemical Reagent Co., Ltd. (Shanghai, China). Sodium chloride (NaCl) and absolute ethanol were obtained from Jinshan Chemical Co., Ltd. (Chengdu, China), and distilled water was provided by Wahaha Group Co., Ltd. (Hangzhou, China).

2.2. Electronic Tongue Sensory Analysis

Taste attributes of green tea infusions were assessed using a taste sensing system (SA402B Electronic Tongue, INSENT Inc., Kanagawa, Japan), which is equipped with five test sensors (AAE, CTO, CAO, COO, AE1) and two reference electrodes. These sensors detect the intensity of umami, saltiness, sourness, bitterness, and astringency, as well as aftertaste characteristics such as aftertaste-bitterness (aftertaste-B), aftertaste-astringency (aftertaste-A), and richness.
Each tea sample (1.5 g) was extracted with 75 mL of boiling distilled water for 5 min. The resulting infusion was filtered through gauze and cooled to room temperature prior to measurement. Five independent replicates were prepared and analyzed for each tenderness type. The operating conditions and measurement procedure of the electronic tongue are detailed in Table S1.

2.3. Electronic Nose Sensory Analysis

The aroma profiles of green tea samples with varying leaf maturity stages were analyzed using a PEN3 electronic nose system (Airsense Analytics GmbH, Schwerin, Germany), which is equipped with a sensor array of ten metal oxide gas sensors. These sensors are sensitive to different classes of volatile compounds, including aromatic benzenes, nitrogen oxides, amines, hydrogen, short/long-chain alkanes, sulfur organics, alcohols, aldehydes, and ketones (Table S2).
For sample preparation, 0.5 g of liquid nitrogen-ground tea powder was accurately weighed into a 20 mL headspace vial. After adding 5 mL of boiling distilled water, the vial was immediately sealed to allow headspace extraction. The sealed vials were equilibrated in a water bath at 65 °C for 50 min before analysis.
The PEN3 system was operated under the following conditions: sampling interval of 1 s, pre-sampling time of 5 s, measurement time of 150 s, and flushing time of 60 s. Zero-point calibration was performed automatically with clean air for 10 s before each measurement. Both the chamber and injection flow rates were set at 400 mL/min. The measurement lasted 120 s. The chamber was purged with filtered air between runs to restore baseline sensor responses.

2.4. HS-GC-IMS Analytical Method

The analysis of volatile organic compounds (VOCs) in green tea samples was conducted using a FlavourSpec® GC-IMS instrument (G.A.S., Dortmund, Germany), equipped with an MXT-5 capillary column (15 m × 0.53 mm × 1.0 μm, Restek, Bellefonte, PA, USA), a headspace autosampler (CTC-PAL3, CTC Analytics AG, Zwingen, Switzerland), and an ion mobility spectrometry detector. For sample preparation, 1.0 g of finely ground tea sample was accurately weighed into a 20 mL headspace vial, to which 20 μL of ethyl decanoate (100 ppm) was added as the internal standard. The vial was incubated at 80 °C for 15 min with agitation at 500 rpm. A 500 μL aliquot of the headspace was injected into the GC-IMS system using a heated syringe (85 °C) in splitless mode.
The GC-IMS operating parameters were as follows: the column temperature was maintained at 60 °C; the IMS drift tube was held at 45 °C; the carrier gas (nitrogen, ≥99.999%) flow rate was programmed as 2 mL/min (0–2 min), increasing linearly to 100 mL/min over 18 min. The drift gas (nitrogen, ≥99.999%) was supplied at a constant flow rate of 150 mL/min. The total analysis time was 20 min. Retention indices (RI) were calculated using a homologous series of n-ketones (C4–C9) under identical chromatographic conditions. All measurements were performed in triplicate to ensure accuracy and reproducibility.

2.5. Qualitative and Quantitative Analysis

The identification of volatile compounds was performed using VOCal software (version 0.4.07, rev.# 346; G.A.S., Dortmund, Germany), integrated with both the National Institute of Standards and Technology (NIST 17) mass spectral database and the proprietary GC-IMS library. Retention indices (RI) were calculated by analyzing a homologous series of n-alkanes (C4–C9) under identical chromatographic conditions as the tea samples. VOCs were identified by comparing both their RI values and drift times (Dt) with those in the reference databases, ensuring accurate compound assignment.
For semi-quantitative analysis, the internal standard method was applied, using ethyl decanoate (100 ppm) as the reference. The concentrations of individual VOCs were expressed as relative values based on the ratio of the peak area of each compound to that of the internal standard.
Additionally, rOAVs were calculated to evaluate the sensory relevance of each volatile compound. The rOAV was defined as the ratio of the compound’s relative concentration to its reported odor threshold in water. By integrating both chemical concentration and sensory detection threshold, rOAV provides a more reliable indication of human olfactory perception than concentration alone. In food, fragrance, and environmental odor analysis, this metric enables the prioritization of high-rOAV odorants (even at low concentrations) over abundant but sensorially negligible compounds with low rOAVs. VOCs with rOAV values equal to or greater than 1 were considered to contribute significantly to the overall aroma profile of the green tea samples [25]. To assess the contribution of individual VOCs to the aroma profiles of FDG with different maturity stages, those meeting this criterion were extracted and summarized in Table 1 for further discussion.

2.6. Feature Selection and Extraction of Key Flavor Attributes in Green Teas of Different Leaf Maturity Stages

To identify key sensory and chemical markers responsible for the quality differentiation among green tea samples of varying leaf maturity stages, multivariate statistical analysis and machine learning algorithms were employed for feature selection and extraction from datasets obtained through electronic nose, electronic tongue, and HS-GC-MS analyses.
In the domain of machine learning, feature engineering methods are generally classified into two categories: feature extraction and feature selection. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were applied as typical feature extraction techniques to reduce data dimensionality while preserving variance and class separation. For feature selection, three methodological frameworks were considered: filter methods, wrapper methods, and embedded methods. Among these, the random forest (RF)-based wrapper method was used to rank the importance of individual features according to their contribution to classification performance.
The most informative features identified were further subjected to classification modeling using six commonly adopted machine learning algorithms: k-nearest neighbors (KNN), random forest (RF), support vector machine (SVM), Gaussian naïve Bayes (GNB), logistic regression (LR), and decision tree (DT). Model performance was evaluated based on receiver operating characteristic (ROC) curves, accuracy, sensitivity, and specificity [34,35,36].
Prior to classification modeling, three complementary feature selection strategies were employed to identify key variables. First, the entire dataset was utilized without prior selection to assess the predictive performance of all available features. Second, feature importance was evaluated using Shapley Additive Explanations (SHAP) derived from the random forest model, and the top 20 features with the highest SHAP values were selected to construct reduced models. Third, each of the top 20 SHAP-ranked features was individually used to train classification models, allowing for the assessment of the independent predictive power of each variable [34,36].
To optimize model performance, hyperparameter tuning was conducted for all classification algorithms using K-fold cross-validation, with the area under the receiver operating characteristic curve (AUC) serving as the primary evaluation metric. Features associated with AUC values between 0.80 and 0.90 were considered moderately discriminative, while those with AUC values above 0.90 were deemed to exhibit strong predictive ability [25].

2.7. Data Analysis

Volatile compounds detected by the electronic nose and HS-GC-IMS were analyzed in triplicate, while taste attributes measured using the electronic tongue were conducted in five replicates. All results were expressed as mean ± standard deviation (SD). Statistical significance was assessed using one-way analysis of variance (ANOVA) via SPSS Statistics 26.0 (IBM, Armonk, NY, USA), with a threshold of p < 0.05 considered significant.
To visualize GC-IMS results, two-dimensional top-view plots, three-dimensional topographic plots, and differential spectral fingerprints were generated using Reporter and Gallery Plot plug-ins (G.A.S., Dortmund, Germany). These visualizations facilitated comparison of volatile profiles across green tea samples of different leaf maturity stages.
Multivariate statistical analyses were performed using MetaboAnalyst 6.0 [37]. Principal component analysis (PCA) and partial least squares discriminant analysis (PLS-DA) were applied to examine overall clustering and sample discrimination. Additional data visualization methods, including hierarchical cluster analysis (HCA), heatmaps, and volcano plots, were also employed within the MetaboAnalyst platform to identify and interpret significantly different variables. Pie charts and chord diagrams were created using OriginPro 2021 (OriginLab Corporation, Northampton, MA, USA) for the graphical presentation of selected variables and compound associations. K-means clustering and supervised machine learning analyses were conducted using the Biodeep Cloud Platform provided by Panomix (Suzhou, China), enabling feature discrimination and predictive model construction across green tea samples.

3. Results and Discussion

3.1. Electronic Tongue Analysis

Taste profiles across green tea samples of differing tenderness were characterized using an electronic tongue (Figure 1). Principal component analysis (PCA) revealed that PC1 and PC2 explained 85.5% and 13.9% of variance, respectively, cumulatively capturing 99.6% of the sensory data (Figure 1A). This demonstrates that the first two principal components effectively capture the majority of taste-related variation among samples. Clear separation in the PCA score plot indicates that leaf tenderness significantly influences flavor attributes. Specifically, the single-bud sample (FDQSG) was strongly associated with “umami” characteristics (Figure 1B), likely reflecting enriched amino acid content [38]. The one-bud, one-leaf tea (FDMJ1G) was marked by heightened sourness, astringency, bitterness, and pronounced bitter aftertaste (aftertaste-B), potentially attributable to increased caffeine levels in moderately mature leaves (Figure 1B,C). By contrast, the one-bud, two-leaf tea (FDTC2G) correlated with saltiness, richness, and aftertaste astringency (Aftertaste-A), which may correspond to increased levels of tea polyphenols and catechins in fully developed leaves [39]. These results imply that a comprehensive investigation into the distribution of non-volatile compounds is necessary to elucidate the correlation between taste profiles and chemical constituents in green teas of different tenderness grades.

3.2. Electronic Nose Analysis

A supervised PLS-DA model was constructed to classify the aroma profiles of the different samples. The model’s first two latent variables accounted for 78.2% and 20.0% of variance (totaling 98.2%) and clearly separated tenderness groups in the score plot (Figure 2A), indicating strong discriminative capacity. Model robustness was confirmed by cross-validated Q2 = 0.988 and R2 = 0.992, with 100% classification accuracy; permutation testing (n = 2000) yielded p < 5 × 10−4, confirming statistical significance (Figure 2C,D).
The bi-plot (Figure 2B,F) revealed clear sensor–odor relationships: FDQSG correlated with sensors W2W, W5S, and W1W; FDTC2G with W5C, W1C, W1S, and W2S; and FDMJ1G overlapped with both but displayed stronger sensitivity to W3C. VIP analysis (VIP > 1) identified W3C (aromatic amines), W3S (long-chain alkanes), W1S (short-chain alkanes), and W6S (hydrogen-related compounds) as key discriminators (Figure 2E, Table S3).

3.3. HS-GC-IMS Analysis

3.3.1. Qualitative and Quantitative VOC Profiling

The overall volatile compound profiles of green teas with different leaf maturity stages were systematically analyzed using HS-GC-IMS. Compound identification was based on retention time, drift time, and retention index values derived from IMS data. Using the Reporter plug-in, both three-dimensional (Figure 3A) and two-dimensional (Figure 3B) topographic visualizations were generated to comprehensively characterize the volatile fingerprints of the tea samples. In the 3D representation, the X-axis corresponds to ion drift time, the Y-axis to gas chromatographic retention time, and the Z-axis to the signal intensity of each volatile compound. The 2D topographic plot offers a more intuitive overview of VOC distribution across the samples, effectively illustrating characteristic fingerprints for green teas of varying leaf tenderness. As shown in Figure 3B, the spectra were normalized and aligned using the reactive ion peak (RIP), which appears at a drift time of 1.0 (marked by a vertical reference line). Each feature point to the right of the RIP represents an individual volatile component detected in the sample. The color gradient from light blue to deep red indicates increasing compound concentration, with deeper red denoting higher relative abundance.
Figure 3 demonstrates the presence of multiple characteristic peaks, indicating that green tea samples of different maturity levels contain a diverse array of volatile organic compounds (VOCs). To further elucidate differences in volatile profiles among the teas, a differential comparison method was employed (Figure 3C), with the single-bud sample (FDQSG) set as the reference. Relative differences in VOC intensity between FDQSG and the other two samples (FDMJ1G and FDTC2G) were computed to generate visual differential plots. In these maps, white regions indicate no significant difference; red regions reflect relative enrichment of specific compounds in the compared sample, whereas blue regions indicate reduced abundance relative to the reference. The differential plots revealed a substantially greater number of blue regions in both FDMJ1G and FDTC2G compared with FDQSG, suggesting that increased leaf maturity is associated with a general decline in the concentration of many volatile aroma compounds. Further identification and visualization of these decreasing compounds require detailed examination of the VOC fingerprint maps.
In total, 85 VOCs were detected (Table S4), comprising 8 terpenes, 19 aldehydes, 13 ketones, 9 alcohols, 10 esters, 6 furans, 3 aromatics, 3 sulfur compounds, 3 pyrazines, 2 acids, 1 heterocyclic (2-acetyl-1-pyrroline), and 8 unidentified peaks. Notably, five VOCs—nonanal-D, heptanal-D, octanal-D, n-hexanol-D, and pentan-1-ol-D—were identified as forming dimers, a phenomenon attributed to proton-bound dimerization (2M+H)+ at elevated concentrations [27]. The dominant compound classes were aldehydes (22.4%), ketones (15.3%), alcohols (11.8%), and esters (11.8%), with terpenes at 8.2% (Figure 4A).
Quantitative analysis revealed the highest total VOC content in FDMJ1G (22.03 mg/L), followed by FDQSG and FDTC2G. FDQSG was notable for its aldehyde, sulfur compound, pyrazine, and acid content. FDMJ1G was enriched in terpenes, ketones, furans, aromatics, and unknown compounds, whereas FDTC2G had higher levels of alcohols, esters, heterocyclics (e.g., 2-acetyl-1-pyrroline), and sulfides—including compounds associated with “popcorn”, “meaty”, and “coffee” aromas (Table S5, Figure 4B). These compositional differences suggest the involvement of distinct metabolic pathways across different leaf maturity levels, which may have important implications for optimizing processing strategies and guiding product classification and grading in green tea production.
To further identify characteristic volatile compounds and distinguish aroma-active substances among green tea samples with different leaf maturity stages, fingerprint profiles were constructed to visualize the concentration changes of individual VOCs (Figure 5). Specifically, regions I, III, and IV correspond to the potential characteristic VOCs of FDQSG, FDMJ11G, and FDTC2G, respectively. Notably, region I, representing FDQSG (single bud), featured the highest number of discriminative compounds (21 in total), predominantly composed of C5–C9 short-chain aldehydes and ketones that impart fresh and green notes. Additionally, short-chain esters (e.g., butyl propanoate) and unsaturated aldehydes (e.g., (E)-2-octenal) contributed fruity and ester-like aromas, while unsaturated alcohols such as oct-1-en-3-ol provided mushroom, earthy, and green sensory characteristics. These compounds are recognized as key odor-active constituents in Nen Xiang (NX), Li Xiang (LX), and Qing Xiang (QX) style green teas [40]. Furthermore, pyrazines (e.g., 2-ethyl-6-methylpyrazine) and furans (e.g., 2-n-butylfuran), common products of Maillard reactions, contributed roasty and nutty aromas [41]. Collectively, these volatiles define the fresh and green aroma characteristics typical of single-bud green teas.
In contrast, Region III, associated with FDMJ11G (one bud with one leaf), contained 12 representative VOCs, including methyl salicylate, linalool, 2-butoxyethanol, n-propyl acetate, 2-methyl-2-pentenal, beta-pinene, 2-acetylfuran, furfural, 3-methylthiopropanal, and three unknown compounds. As an intermediate maturity stage, FDMJ11G exhibited a volatile profile suggestive of a metabolic transition between young and mature leaves. This was characterized by terpenoid dominance (e.g., linalool, beta-pinene) and a balance between esters and aldehydes, reflecting a dynamic conversion from alcohols to aldehydes and subsequently to esters. Linalool and methyl salicylate contributed complex floral attributes (citrus, floral, rose, wintergreen, minty), which distinguished this profile from the fresh-green aroma of FDQSG. Additionally, furfural and 2-acetylfuran provided roasted undertones, mitigating excessive grassy notes. Intriguingly, Chen-Yang Shao’s study [20] on baked green teas with varying tenderness reported a significant increase in methyl salicylate with leaf maturity, which differs from our findings—likely due to varietal differences.
Region IV represented the characteristic VOCs of FDTC2G (one bud with two leaves), which included 3-pentanol, 2-acetyl-1-pyrroline, 2,5-dimethylfuran, benzene acetaldehyde, 2-butanone, and ethylsulfide. The VOC profile of this group, derived from more mature leaves, reflected pronounced Maillard reaction activity during processing, facilitating the formation of compounds such as 2-acetyl-1-pyrroline (popcorn-like) and 2,5-dimethylfuran meaty, gravy, roasted beef juice). The accumulation of 3-pentanol and 2-butanone suggested elevated lipid oxidation in FDTC2G compared to the less mature counterparts. 3-Pentanol (herbal) and benzene acetaldehyde (green, floral, sweet) co-occurred with 2-acetyl-1-pyrroline (meaty, roasted) to create a layered and mellow flavor profile for two-leaf green tea. Previous research has identified dimethyl sulfide (cooked corn-like) as a key contributor to the fresh and delicate aroma of green tea [28,42], while ethylsulfide—another key VOC in FDTC2G—conveyed coffee- and meat-like notes. The conversion pathway between dimethyl sulfide and ethylsulfide may be critical for modulating the aroma type and quality of FDG, warranting further investigation.
Furthermore, Regions II and V reflected common VOCs shared by all three green tea types. Based on their concentrations and aroma characteristics, six major shared compounds were identified with levels exceeding 1000 µg/L: linalool (citrus, floral, rose), heptanal-D (green), 2-methylbutanal (cocoa), 2-propanone (ethereal, apple, pear), mesityl oxide (vegetal), and methyl acetate (ethereal sweet, fruity). Among these, linalool and heptanal are well-established as key aroma-active compounds in green tea [2,43,44]. Additionally, 2-methylbutanal (VIP = 2.70) and 3-methylbutanal (VIP = 1.30) have been reported as differential markers between yellow teas of different tenderness (LYT vs. BYT/SYT) [19], further suggesting that leaf maturity strongly influences the accumulation of 2-methylbutanal.
Additionally, region IV contained compounds commonly expressed in both FDMJ11G and FDTC2G, notably benzaldehyde (sharp sweet, almond, cherry) and beta-ocimene (floral). Benzaldehyde has been cited as a marker for chestnut-like aroma in green tea [45], while beta-ocimene, with OAV > 1, is a key floral contributor in flower-scented green teas [40].
Overall, fingerprint-based VOC profiling provided valuable insights into the compositional and sensory differentiation among green teas with varying tenderness. However, it should be noted that aroma perception is influenced not only by compound concentration but also by the odor threshold in a given matrix. Thus, high-concentration compounds may not necessarily dominate aroma perception, and low-concentration compounds could still play key roles. Quantitative verification of these contributions requires further rOAV (relative Odor Activity Value) analysis.

3.3.2. Analysis of VOCs Variation Among FDG of Different Tenderness

Hierarchical cluster analysis (HCA) of all detected VOCs effectively distinguished the green tea samples by tenderness level (Figure 6A). Notably, the two more mature samples, FDMJ1G and FDTC2G, clustered together on the same branch, while the most tender sample, FDQSG, formed a separate clade, indicating clear differentiation based on VOC composition. Subsequent K-means clustering divided the complete set of VOCs into four distinct clusters (Figure 6B), each exhibiting unique trends associated with leaf tenderness. Specifically, Cluster 2 contained VOCs whose concentrations increased as leaf tenderness decreased; Cluster 4 showed the opposite trend, with a decline in compound abundance as tenderness decreased. Meanwhile, Cluster 1 exhibited a U-shaped pattern (initial decline followed by an increase), and Cluster 3 demonstrated an inverted U-shape (initial increase followed by a decrease).
Focusing on Cluster 2, which included 12 compounds that increased with decreasing tenderness (see Table S4), this group exhibited a wide spectrum of sensory attributes such as citrus, cocoa, fruity, roasted, coffee, and popcorn-like aromas. Several compounds also possessed functional properties beyond aroma: for example, Octanal-M is known for its antioxidant and antimicrobial activity [46], while 3-Pentanol has been reported to stimulate plant immune responses [47]. In contrast, Cluster 4, comprising 37 VOCs that decreased in abundance with increasing leaf maturity, was characterized predominantly by fresh, green, fatty, fruity, and earthy notes. This suggests a substantial loss of freshness and green-earthy qualities as tenderness diminishes.
Specifically, Cluster 1, with 11 VOCs—primarily aldehydes, alcohols, and ketones—displayed a pattern of decreasing and then increasing concentrations across the tenderness gradient. These compounds were associated with floral, green, and citrusy aromas. Whereas, Cluster 3, encompassing 25 VOCs, exhibited greater structural diversity and included terpenes, short-chain aldehydes/ketones, and aromatic compounds. This cluster was dominated by floral-terpenic, woody, and bready notes, with key compounds such as linalool (citrus, floral, woody), cis-linalool oxide (earthy, floral, woody), β-ocimene (citrus, tropical, green), and β-pinene (pine, woody, resinous). Preliminary analysis suggested that the FDMJ1G sample exhibited notably higher expression levels of terpene-related compounds. These dynamic clustering patterns offer valuable insights into the transformation of volatile profiles across different maturity levels of green tea, highlighting both flavor transitions and potential bio-functional implications.
To further elucidate the intergroup differences in VOCs among green teas of varying leaf tenderness, volcano plots were constructed based on fold change (FC) and statistical significance (adjusted p < 0.05, |log2FC| ≥ 1). A total of 17 VOCs showed significant differences between FDMJ1G and FDQSG, of which 8 were upregulated and 9 were downregulated (Figure 7A). The upregulated compounds primarily included terpenes such as linalool and β-pinene, which are associated with enhanced plant defense and antimicrobial activity [48,49], contributing floral and dry woody notes. Aromatic aldehydes (e.g., benzaldehyde) and furans (e.g., furfural) impart bitter almond, bready, and baked aromas, while esters (e.g., ethyl acetate, n-propyl acetate) and ketones (e.g., 2-butanone) enrich fruity characteristics. In contrast, the downregulated VOCs were predominantly green leaf volatiles responsible for fresh-green notes, including (E)-2-pentenal, (E)-hept-2-enal, 4,5-dihydro-3(2H)-thiophenone, and several fruity, earthy, herbal, and spicy compounds such as 2-methylbutanol acetate, butyl propanoate, 2-octanone, and 1-penten-3-one.
Similarly, in the FDTC2G vs. FDQSG comparison (Figure 7B), 10 VOCs were significantly upregulated and 10 were downregulated. The upregulated compounds—linalool, methyl isobutyl ketone, benzaldehyde, 2,5-dimethylfuran, 2-butanone, ethyl acetate, 3-pentanol, 2-acetyl-1-pyrroline, ethylsulfide, and benzene acetaldehyde—collectively contributed floral, almond, bready, sweet fruity, toasted grain, coffee, meaty, and honey-like aromas. On the other hand, downregulated compounds included 2-methylbutanol acetate, 1-penten-3-one, 2-octanone, (E)-2-pentenal, hexanal, 4,5-dihydro-3(2H)-thiophenone, butyl propanoate, 2-ethyl-6-methylpyrazine, and (E)-hept-2-enal, which are associated with green, herbal, fatty, and roasted potato–like aromas. The reduction of these compounds indicates a decline in the fresh, fruity, and green sensory qualities.
By contrast, relatively fewer differential compounds were observed between the two more mature samples (FDTC2G vs. FDMJ1G). Only four VOCs showed significant upregulation in FDTC2G (2,5-dimethylfuran, 3-pentanol, 2-acetyl-1-pyrroline, and benzene acetaldehyde), while five VOCs (including (E)-2-pentenal, furfural, n-propyl acetate, and 2-methyl-2-pentenal) were significantly downregulated (Figure 7C). The upregulated compounds contributed roasted, nutty, and fermented notes, indicating a metabolomic shift characteristic of more mature tea leaves. The downregulated compounds were associated with fresh and fruity aromas, echoing the trend observed between FDQSG and the more mature samples.
In summary, comprehensive analysis revealed distinct volatile compound signatures across the three tenderness grades of green tea, demonstrating clear metabolic stratification associated with leaf maturity. In greater detail, the volatile profile of FDQSG (single bud) is predominantly characterized by high levels of C6 aldehydes and alcohols (e.g., hexanal, (Z)-3-hexenol) and monoterpenes (e.g., nerol), which jointly contribute to the fresh, green, fruity, and floral notes typically associated with cut grass and citrus aromas. By contrast, FDMJ1G (one bud plus one leaf) represents an intermediate stage of leaf maturity, where the aroma profile reflects a transition in metabolic activity. The relative balance between terpenes and esters, along with the emergence of Maillard reaction–derived compounds such as benzaldehyde and furfural, results in a more complex and enriched aroma bouquet. Notably, compounds like linalool (floral), β-pinene (woody), and benzaldehyde (almond-like) are markedly elevated, while the content of C6 aldehydes shows a progressive decline. In the case of FDTC2G (one bud plus two leaves), the volatile composition is dominated by Maillard reaction products (e.g., 2,5-dimethylfuran, 2-acetyl-1-pyrroline), fermentation-related esters (e.g., ethyl acetate), and microbial metabolites (e.g., 3-pentanol), collectively imparting roasted, nutty, and savory characteristics. Interestingly, while 2-ethyl-6-methylpyrazine—a key compound contributing to roasted potato–like aromas—is known to enhance roasted notes, its concentration decreases with increasing leaf maturity, implying the involvement of a more intricate regulatory mechanism that warrants further investigation.
Overall, the findings highlight that FDQSG is particularly rich in fresh, green, and fruity aromatic attributes, making it highly suitable for low-temperature fixation processes that aim to retain delicate floral volatiles. This observation is consistent with previous reports on large-leaf black teas with varying degrees of tenderness [18], although the specific compounds contributing to freshness and fruitiness may vary depending on cultivar and processing techniques. FDMJ1G exhibits a more balanced and multidimensional aroma profile, integrating both floral and fruity characteristics, indicative of an intermediate metabolic state. In contrast, FDTC2G is enriched in nutty, roasted, and fermented sweet–meaty notes, reflecting greater metabolic transformation during processing. These findings are also in agreement with previous research on yellow teas processed from leaves of different maturity levels [19], where teas made from more mature leaves were associated with enhanced roasted aromas, while bud-only teas demonstrated higher intensities of fresh and floral sensory attributes.
These results suggest that leaf maturity markedly influences volatile metabolite composition and aroma profile. Notably, some VOCs with lower concentrations may still significantly impact scent if their odor activity (rOAV) is high, warranting further sensory threshold validation.

3.3.3. Multivariate Statistical Analysis

To further investigate the differential VOCs and their evolution across green teas of varying tenderness, a partial least squares discriminant analysis (PLS-DA) model based on VOCs was employed (Figure 8). The PLS-DA model yielded high explanatory and predictive power, with R2X = 0.909, R2Y = 0.999, and Q2 = 0.998. Additionally, a 200-time permutation test was conducted to assess model robustness, showing a Q2 regression intercept of −0.57 on the Y-axis, which is below zero (Figure 8B), indicating that the model is reliable and not overfitted. As shown in the score plot (Figure 8A), PC1 accounted for 61.6% of the variance, while PC2 contributed 29.3%, together explaining 90.9% of the total variance. This demonstrates that the first two principal components sufficiently represent the original dataset. The three green tea samples—FDQSG, FDMJ1G, and FDTC2G—were well-separated in the model without any overlap, indicating that the HS-GC-IMS method for VOC acquisition and detection provided strong classification ability for tea samples of different leaf maturity stages. Moreover, the strong clustering of VOC profiles within each group reflects high repeatability and stability in VOC detection for samples of the same tenderness level.
The loading plot (Figure 8C) illustrates the contribution of highly correlated VOCs to the principal components, further clarifying the differences in volatile profiles among the three green tea samples of varying tenderness. The plot highlights the VOCs most strongly associated with each sample group—FDQSG, FDMJ1G, and FDTC2G—offering additional insight into their respective aroma characteristics. As shown in Figure 8C, FDQSG exhibits a larger number of highly correlated VOCs compared to the other two groups. These findings are consistent with the characteristic compounds identified in Figure 5. Specifically, FDQSG is predominantly associated with C5–C9 short-chain aldehydes and ketones, short-chain esters, which contribute to its fresh and green aroma. FDMJ1G features a balanced profile of esters and aldehydes, reflecting an intermediate metabolic state. In contrast, FDTC2G is characterized by Maillard reaction products and 3-pentanol, indicating a shift toward roasted, nutty, and fermented flavor attributes as leaf maturity increases.
To further identify key VOCs responsible for differentiating among green tea samples, variable importance in projection (VIP) scores were calculated. VIP scores reflect the contribution and explanatory power of each variable to the classification and discrimination of the sample groups. Compounds with VIP scores > 1 were considered significant discriminators. As shown in Table S6, 31 VOCs with VIP > 1 were identified as key differential volatile markers in the PLS-DA model based on HS-GC-IMS. These VOCs serve as candidate markers for distinguishing green teas of different leaf maturity stages. These VOCs serve not only as markers for distinguishing green teas of different leaf maturity stages but also as foundational features for building subsequent machine learning models aimed at identifying key characteristic parameters.

3.3.4. rOAV Analysis

To assess the contribution of individual volatile compounds (VOCs) to the overall aroma profile of FDG with different tenderness grades, the relative odor activity value (rOAV) of each compound was calculated (Table S5). Compounds with rOAVs greater than 1 are generally considered to have a perceptible impact on aroma [33], and thus VOCs meeting this criterion were extracted and summarized in Table 1 for further discussion.
In total, 41 volatile compounds exhibited rOAV values exceeding 1. Among them, FDQSG contained 33 such compounds, FDMJ1G had 32, and FDTC2G had 31. Notably, 30 VOCs with OAV > 1 were common across all three groups, indicating that the majority of the identified volatiles contribute meaningfully to the aroma profile of FDG. Among these, compounds with extremely high rOAV (rOAV > 1000) were primarily aldehydes (e.g., 2-methylbutanal), terpenes (linalool), and pyrazines (2-ethyl-3,5-dimethylpyrazine), which serve as key odorants. Specifically, 2-methylbutanal (chocolate/cocoa-like) and 3-methylbutanal (chocolate, peach) demonstrated remarkably high rOAVs (>1000 and 580–730, respectively), confirming their potent sensory effects. Linalool, a floral terpene, exhibited the highest rOAV across all samples (ranging from 3800 to 7400), despite its relatively low concentration due to its ultra-low odor threshold (OT = 0.22 μg/L), making it a major aroma-contributing compound. Likewise, 2-ethyl-3,5-dimethylpyrazine, with its nutty character and extremely low OT (0.040 μg/L), also showed high rOAVs (1350–1440), indicating its persistent and potent contribution.
Interestingly, linalool peaked in the intermediate maturity sample FDMJ1G, suggesting a metabolic surge at this developmental stage, whereas 2-methylbutanal increased progressively with maturity and reached its maximum in the most mature sample FDTC2G. Pyrazines, in contrast, remained relatively stable across all tenderness stages, indicating their sustained contribution to nutty and roasted attributes.
Furthermore, a clear maturity-dependent shift in dominant aroma-active compounds was observed across the three leaf maturity stages of green tea. In early maturity (FDQSG, single bud), the aroma profile was primarily characterized by fresh, green, floral, and citrus-like attributes. This was largely driven by high rOAVs of terpenes such as linalool (3800) and trans-linalool oxide (4.2), along with C6 aldehydes and alcohols, including hexanal (170) and (Z)-3-hexen-1-ol (67), which contributed grassy and leafy freshness. In addition, the presence of 2-methylbutanal (1100), a cocoa-like aldehyde, imparted a subtle sweet and nutty nuance. These volatiles collectively defined the characteristic fresh and green sensory quality typical of bud-only green teas.
At the intermediate maturity stage (FDMJ1G, one bud plus one leaf), the aroma complexity increased significantly, reflecting a balance between floral, fruity, and nutty notes. Notably, linalool reached its highest rOAV (7400), indicating a floral surge likely linked to transient metabolic activation. Meanwhile, 2-ethyl-3,5-dimethylpyrazine (1400) provided a stable nutty backbone. A substantial decline in hexanal (from 170 to 71; −58% vs. FDQSG) marked the attenuation of green freshness. Concurrently, elevated levels of Maillard- and fermentation-associated compounds such as 2-acetyl-1-pyrroline and ester volatiles (e.g., 2-methylbutanol acetate) contributed toasted, sweet, and fruity nuances, indicating active secondary metabolic transitions.
In contrast, late maturity (FDTC2G, one bud plus two leaves) was marked by a pronounced shift toward roasted, nutty, and savory sensory attributes. The sharp increase in Maillard-derived 2-acetyl-1-pyrroline (from 66 to 160; +142% vs. FDMJ1G) contributed intense popcorn-like notes. The persistent high levels of 2-ethyl-3,5-dimethylpyrazine (~1350) ensured continuity in nutty perception. These changes collectively reflect a maturity-dependent transformation in aroma composition, likely governed by both enzymatic and non-enzymatic reactions during leaf development and subsequent processing.
To further validate the contributions of aroma-active compounds (rOAV > 1) listed in Table 1 to the overall flavor perception of green tea infusions, Pearson correlation analysis was performed between these volatiles and the responses of 10 E-nose sensors (W1C, W3C, W5C, W1S, W2S, W3S, W5S, W6S, W1W, W2W), as well as 8 E-tongue taste attributes (sourness, bitterness, astringency, aftertaste-A, aftertaste-B, umami, richness, saltiness) (Table S7). Correlation pairs meeting the criteria of |r| ≥ 0.90 and p < 0.05 were considered statistically significant.
Notably, 2-acetyl-1-pyrroline showed extremely strong positive correlations with W1S, W2S, and W5C sensors, establishing it as a core marker for “sweet/popcorn-like” aroma recognition by the electronic nose. Simultaneously, it significantly suppressed bitterness, aftertaste-B, and sourness responses, suggesting its synergistic role in reducing bitterness and sourness. Likewise, methyl acetate exhibited positive correlations with richness and saltiness, and a negative correlation with aftertaste-B, indicating its role in reinforcing umami and mellow taste characteristics. In addition, C6–C9 aldehydes (e.g., hexanal, (E)-2-hexenal, pentanal) consistently showed strong positive correlations with the W5S channel, enabling rapid discrimination of tenderness grades. Moreover, ethyl sulfide demonstrated robust positive correlations with multiple sensors (W1S/W2S/W5C/W1C), confirming its distinct “coffee-like” characteristics in E-nose detection. In the taste dimension, ethyl sulfide exhibited strong negative correlations with aftertaste-B and bitterness, indicating that beyond contributing to characteristic coffee-like notes, this compound may also play a modulatory role in suppressing bitterness and lingering astringency, thereby enhancing the overall smoothness and roundness of the tea infusion.
In summary, aroma-active VOCs such as 2-acetyl-1-pyrroline, methyl acetate, C6–C9 aldehydes, and acetate esters exhibited highly significant correlations with key E-nose channels (e.g., W5S, W2W, W1S) and E-tongue taste indices (e.g., richness and aftertaste-B). These strong chemical–sensor couplings highlight their potential as core diagnostic markers for the rapid classification of green tea leaf maturity levels. Previous studies on food flavor have indicated that volatile aroma compounds can modulate taste perception through cross-modal interactions [50,51]. The specific mechanisms by which these volatiles influence taste attributes—such as bitterness, umami, and aftertaste—merit further in-depth investigation in future research.

3.3.5. KEGG Functional Annotation and Enrichment of VOCs

To further investigate the potential biosynthetic pathways associated with differential VOCs in FDG of varying tenderness, the identified VOCs were cross-referenced with the KEGG database to retrieve detailed information on their involved metabolic pathways. Enrichment analysis was subsequently performed to identify pathways, and the annotated results are presented in Figure 9 and Table S8. A total of eight key VOCs were mapped to four metabolic pathways: butanoate metabolism, monoterpenoid biosynthesis, zeatin biosynthesis, and phenylalanine metabolism. Among them, butanoate metabolism and monoterpenoid biosynthesis were significantly enriched (Figure 9A), indicating their critical roles in the metabolic differentiation of aroma profiles among green teas of different maturity stages. As illustrated in Figure 9B–D and detailed in Table S8, the butanoate metabolism pathway was associated with 2-propanone, butanal, and methyl acetate; monoterpenoid biosynthesis was linked to nerol, β-pinene, and 1,8-cineole. In contrast, zeatin biosynthesis involved 3-methyl-2-butenal, and phenylalanine metabolism was associated with benzene acetaldehyde.
Pairwise comparison of VOC accumulation patterns revealed that 2-propanone (apple, pear aroma), butanal (chocolate), β-pinene (dry woody), and 3-methyl-2-butenal (sweet, fruity, nutty, almond) were predominantly enriched in FDMJ1G (one bud + one leaf), suggesting enhanced activity of butanoate metabolism and monoterpene biosynthesis at the intermediate maturity stage. In contrast, methyl acetate and benzene acetaldehyde, which contribute sweet, fruity, floral, honey, and cocoa-like aromas, were more highly expressed in FDTC2G (one bud + two leaves). Meanwhile, nerol, a floral monoterpene, was specifically enriched in FDQSG (single bud), supporting its strong floral character.
The KEGG-based enrichment results, when integrated with previous aroma and rOAV analyses, provide a coherent metabolic explanation for the VOC variations observed among the three tenderness grades. Specifically, FDMJ1G was characterized by a notable burst in monoterpenoid biosynthesis and active butanoate metabolism, consistent with its complex and balanced aroma profile. FDQSG was dominated by C6 aldehydes/alcohols (e.g., hexanal, (Z)-3-hexenol), mainly derived from the lipoxygenase (LOX) pathway, contributing to its fresh and grassy aroma [52]. In contrast, FDTC2G showed enrichment in Maillard reaction products, esterification-derived volatiles, and phenylalanine-derived sweet-aromatic compounds such as benzene acetaldehyde, reflecting a maturity-dependent shift toward roasted and sweet-fruity notes.
Taken together, these findings suggest that the differential accumulation of aroma-related VOCs across leaf maturity stages is closely tied to the dynamic modulation of key biosynthetic pathways, particularly butanoate metabolism and monoterpenoid biosynthesis. These pathways may serve as biochemical markers for evaluating and optimizing green tea quality in relation to raw material tenderness.

3.4. Identification of Key Flavor Markers of Green Teas of Different Tenderness Based on Six Machine Learning Models

To identify key flavor markers differentiating FDG at distinct maturity stages, data from three complementary analytical platforms—electronic tongue (E-tongue), electronic nose (E-nose), and HS-GC-MS—were comprehensively integrated. The feature set comprised 8 E-tongue sensors, 4 E-nose gas sensors, 31 VOCs with VIP > 1 identified through the PLS-DA model, and 41 aroma-active compounds with rOAV > 1.
These selected features were subjected to classification analysis using six supervised machine learning algorithms: KNN, RF, SVM, GNB, LR, and DT. To enhance model robustness and mitigate overfitting, a 10-fold cross-validation strategy was employed. Classification performance was evaluated using receiver operating characteristic–area under the curve (ROC–AUC), where AUC values between 0.8–0.9 and 0.9–1.0 indicate good and excellent discrimination, respectively.
As illustrated in Figure 10A, five models—KNN, RF, GNB, LR, and DT—achieved perfect classification accuracy (AUC = 1.0), underscoring their high reliability for this task. SVM showed comparatively lower performance, suggesting model-specific limitations for this dataset. Among all models, the RF classifier was prioritized for subsequent analysis due to its well-documented resistance to overfitting and high prediction accuracy in high-dimensional biological datasets [53].
Feature contribution was assessed using Shapley Additive Explanations (SHAP) values derived from the RF model. The top 20 ranked features (Figure 10B) were identified as the most discriminative indicators of tea maturity. Notably, 19 of these features were VOCs detected via HS-GC-IMS, while one corresponded to an E-tongue taste sensor (bitterness). In contrast, no E-nose variables ranked among the top contributors, likely reflecting the broader classification granularity of gas sensors, which may be less sensitive to subtle within-cultivar aroma differences than compound-specific VOCs or taste-related attributes.
Boxplot visualization of the top 20 features (Figure 10C) elucidated their abundance patterns across tea maturity groups. The FDQSG group (single bud) exhibited the greatest number of elevated features (n = 10), including (Z)-3-hexen-1-ol, (E)-2-octenal, propyl isovalerate, n-hexanol-D, octanal-D, 2-ethyl-6-methylpyrazine, n-hexanol-M, nonanal-D, butyl propanoate, and two unidentified VOCs. These compounds conferred a characteristic flavor profile rich in fresh, green, citrusy, herbal, and subtly roasted-potato-like notes, emphasizing the sensory freshness typical of bud-only green tea. In the FDMJ1G group (one bud + one leaf), seven features were enriched, including bitterness (E-tongue), methyl salicylate, 2-methyl-2-pentenal, 2-propanone, n-hexanol-M, 3-methylthiopropanal, and linalool. Collectively, these contributed to a complex and layered aroma composed of floral, fruity, minty, herbal, and vegetable-like elements, reflecting an intermediate degree of leaf maturity and associated biochemical transitions. Conversely, the FDTC2G group (one bud + two leaves) demonstrated elevated levels of only three VOCs—benzene acetaldehyde, 2,5-dimethylfuran, and octanal-M—resulting in a sensory profile characterized by sweet-floral, roasted, cocoa, and mildly herbal notes. These compounds are consistent with enhanced Maillard reaction products and phenylalanine-derived volatiles typically associated with more mature tea leaves.
To assess the discriminative power of individual features, single-variable classification analyses were conducted across all six models, and corresponding ROC curves were generated (Table S9). Among them, RF and LR models exhibited superior classification performance, indicating their robustness and suitability for feature prioritization. All markers, except for (Z)-3-hexen-1-ol and n-Hexanol-D, achieved individual prediction accuracies above 0.8, confirming their reliability as discriminative features. The slightly lower performance of these two compounds may be attributed to their relatively broad distribution across samples or potential co-elution effects.
In summary, the machine learning-based integration of multimodal sensor data effectively identified robust flavor biomarkers associated with different leaf maturities in FDG. VOCs were the most influential feature group, and the RF and LR models proved most reliable for classification and marker selection. These findings provide a methodological foundation for flavor-targeted quality control and harvest timing optimization in green tea processing.

4. Conclusions

This study provides a comprehensive multi-platform analysis of the flavor differences among FDG made from raw materials of three distinct tenderness grades—single bud (FDQSG), one bud plus one leaf (FDMJ1G), and one bud plus two leaves (FDTC2G). Multimodal sensory evaluation using E-tongue, E-nose, and HS-GC-IMS revealed clear differences in taste and volatile compound profiles among the three sample types. FDQSG exhibited higher umami and stronger fresh, green, and herbal notes, with key volatiles such as (Z)-3-hexen-1-ol, (E)-2-octenal, and nonanal reflecting high LOX pathway activity. FDMJ1G presented a complex transitional profile, marked by a peak in linalool (rOAV: 7400), along with floral, fruity, and minty volatiles like methyl salicylate and 2-propanone. FDTC2G, in contrast, was enriched in Maillard-derived and phenylalanine metabolism–related volatiles such as 2-acetyl-1-pyrroline, 2,5-dimethylfuran, and benzene acetaldehyde, contributing to roasted, nutty, and cocoa-like notes. Through KEGG pathway enrichment, four metabolic routes were associated with key volatile formation, notably butanoate metabolism and monoterpenoid biosynthesis, both significantly enriched. Specific markers were also identified for each tenderness grade: Nerol in FDQSG, beta-pinene and 3-methyl-2-butenal in FDMJ1G, and methyl acetate and benzene acetaldehyde in FDTC2G. Machine learning modeling, particularly random forest combined with SHAP analysis, identified 20 top-ranking discriminant markers, with HS-GC-IMS-derived VOCs playing a dominant role. Among them, FDQSG was associated with the highest number of enriched markers (n = 10), reinforcing its sensory profile dominated by freshness and vibrancy. Single-feature ROC–AUC validation confirmed high diagnostic accuracy (≥0.8) for most key indicators, supporting their robustness as maturity-dependent flavor biomarkers.
While this study offers a comprehensive framework for understanding maturity-related flavor shifts in FDG, several limitations remain. Interannual variability in agronomic and climatic factors—such as rainfall patterns, sunlight exposure, and temperature regimes—may influence the biosynthesis and accumulation of both volatile and non-volatile metabolites, thereby affecting flavor expression. These environmental effects should be systematically evaluated in future multi-year studies to consolidate the generalizability of the findings. Moreover, potential sensor cross-sensitivity and matrix effects inherent in E-nose and E-tongue systems may influence the precision of sensor responses and should be carefully considered in future validation efforts. In addition, while the identified volatile compounds with high rOAV values and significant sensor correlations offer valuable insights into the aroma landscape, their direct contributions to perceived flavor require further verification through aroma recombination and omission experiments, as well as human sensory evaluation. To further elucidate the mechanistic basis of taste differences (e.g., umami, bitterness), integration of non-volatile metabolite profiling—such as amino acids and polyphenols—will be essential. Finally, the identity and aroma contribution of unknown VOCs warrant future elucidation via GC–O–MS sniffing techniques. Collectively, these findings provide a molecular basis for understanding the impact of leaf maturity on flavor in FDG. The results offer practical guidance for harvest timing, raw material selection, and grade-specific processing, facilitating more refined quality control and targeted flavor design in the green tea industry.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/foods14162861/s1, Table S1: Electronic tongue system operation procedure; Table S2: Electronic nose sensor array: compound class detection profile; Table S3: Gas sensor VIP matrix from e-nose PLS-DA analysis; Table S4: K-means cluster results of VOCs detected by HS-GC-IMS; Table S5: The information of volatile compounds identified using HS-GC-IMS; Table S6: PLS-DA VIP matrix for discriminant VOCs; Table S7. Pearson correlation analysis between aroma-active VOCs and E-nose/E-tongue sensor responses; Table S8. KEGG-enriched pathways across three green tea tenderness grades compared to Arabidopsis; Table S9: Predictive performance of six machine learning algorithms ranked by SHAP feature importance (Top 20 features). References [54,55,56] are cited in the Supplementary Materials.

Author Contributions

Conceptualization, X.L. (Xiaohui Liu), F.D., and M.W.; methodology, X.L. (Xiaohui Liu), M.H., W.T., F.D., and M.W.; software, X.L. (Xiaohui Liu) and F.D.; validation, X.L. (Xiaohui Liu), F.D., Y.L., and J.X.; formal analysis, X.L. (Xiaohui Liu), Y.L., and F.D.; investigation, X.L. (Xiaohui Liu), F.D., Y.L., J.X., and X.L. (Xiangdong Li); resources and data curation, X.L. (Xiaohui Liu), M.H., W.T., Y.L., L.L., J.X., and X.L. (Xiangdong Li).; writing—original draft preparation, X.L. (Xiaohui Liu); writing—review and editing, X.L. (Xiaohui Liu) and F.D.; visualization, X.L. (Xiaohui Liu), L.L., and F.D.; supervision, F.D. and M.W.; project administration, X.L. (Xiaohui Liu), F.D., and M.W.; funding acquisition, X.L. (Xiaohui Liu). All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Technology Plan Project of Guizhou province, grant number QKHJC-ZK[2022] YB180 and QKHJC MS[2025] 196, the Young Scientists Fund of the National Natural Science Foundation of China, grant number 32001463, the Guizhou Institute of Technology High-Level Talent Scientific Research Startup Project, grant number XJGC20190924, the Science and Technology Plan Project of Guizhou province, grant number QKHZ[2023] YB064, Guizhou Fruit Wine Brewing Engineering Research Center, grant number [2022]050, and Guizhou Provincial Science and Technology Department, grant number KXJZ[2024]021. The APC was funded by the Science and Technology Plan Project of Guizhou province, grant number QKHJC-ZK[2022] YB180.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials; further inquiries can be directed to the corresponding authors.

Acknowledgments

We express our gratitude to Weimin Zhang, the legal representative of Hongfengshanyun Tea Farm Co., Ltd., Qingzhen, Guizhou province, for providing the experimental site and tea samples for this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
FDGFudingdabai green tea
FDQSGFuding Que She
FDMJ1GFuding Maojian
FDTC2GFuding Taicha
MLmachine learning
RFrandom forest
SVMsupport vector machine
KNNk-nearest neighbor
GNBGaussian naïve Bayes
LRlogistic regression
DTdecision tree
VOCsvolatile organic compounds
ROCreceiver operating characteristic
SHAPShapley Additive Explanations
AUCthe area under the receiver operating characteristic curve
rOAVrelative odor activity value

References

  1. Zhou, X.; Hu, J.; Tang, J.; Wang, Q.; Jiang, Y.; Li, J.; Yuan, H.; Yang, Y. A multi-platform approach uncovers the effects of different shaping techniques on the sensory quality and non-volatile metabolites of green tea. Future Foods 2025, 11, 100675. [Google Scholar] [CrossRef]
  2. Liu, N.; Shen, S.; Huang, L.; Deng, G.; Wei, Y.; Ning, J.; Wang, Y. Revelation of volatile contributions in green teas with different aroma types by GC–MS and GC–IMS. Food Res. Int. 2023, 169, 112845. [Google Scholar] [CrossRef]
  3. Teixeira Oliveira, J.; Machado da Costa, F.; Gonçalvez da Silva, T.; Dotto Simões, G.; dos Santos Pereira, E.; Quevedo da Costa, P.; Andreazza, R.; Cavalheiro Schenkel, P.; Pieniz, S. Green tea and kombucha characterization: Phenolic composition, antioxidant capacity and enzymatic inhibition potential. Food Chem. 2023, 408, 135206. [Google Scholar] [CrossRef]
  4. Xiao, X.; Erukainure, O.L.; Guo, Y.; Msomi, N.Z.; Chu, M.; Islam, M.S. Jasmine green tea improves glucose homeostasis and antioxidant activities with concomitant hypolipidemic activity in type 2 diabetic rats. Food Biosci. 2025, 69, 107011. [Google Scholar] [CrossRef]
  5. Lan, S.; Wang, R.; Gao, H.; Zhao, M.; Li, H.; Liu, F.; Geng, R.; Wang, Y.; Liu, X.; Song, C.; et al. Anti-inflammatory and hepatoprotective effects of galactose-rich heteropolysaccharides from Laoshan green tea and its physicochemical properties. Nat. Prod. Res. 2025; 1–7, Advance online publication. [Google Scholar] [CrossRef]
  6. Schimidt, H.L.; Garcia, A.; Martins, A.; Mello-Carpes, P.B.; Carpes, F.P. Green tea supplementation produces better neuroprotective effects than red and black tea in Alzheimer-like rat model. Food Res. Int. 2017, 100, 442–448. [Google Scholar] [CrossRef] [PubMed]
  7. Singh, N.; Sahjlan, P.; Yadav, S.S. Phytochemistry and anticancer therapeutics of Camellia sinensis (Green tea). Pharmacol. Res. Mod. Chin. Med. 2024, 12, 100484. [Google Scholar] [CrossRef]
  8. Ibrahim, M.A.; Bakhaat, G.A.; Tammam, H.G.; Mohamed, R.M.; El-Naggar, S.A. Cardioprotective effect of green tea extract and vitamin E on Cisplatin-induced cardiotoxicity in mice: Toxicological, histological and immunohistochemical studies. Biomed. Pharmacother. 2019, 113, 108731. [Google Scholar] [CrossRef]
  9. Jiang, L.; Xie, S.; Zhou, C.; Tian, C.; Zhu, C.; You, X.; Chen, C.; Lai, Z.; Guo, Y. Analysis of the Genetic Diversity in Tea Plant Germplasm in Fujian Province Based on Restriction Site-Associated DNA Sequencing. Plants 2024, 13, 100. [Google Scholar] [CrossRef] [PubMed]
  10. Liu, C.; Ren, M.; Ning, M.; Liao, Y.; Du, X.; Qin, L.; Chen, W.; Liu, X.; Wu, A.; Feng, D.; et al. Cultivar-dependent variation in metabolomic profiles and sensory characteristics of Zhuyeqing green tea (Camellia sinensis). Food Biosci. 2025, 71, 107023. [Google Scholar] [CrossRef]
  11. Ma, C.; Ma, B.; Wang, J.; Wang, Z.; Chen, X.; Zhou, B.; Li, X. Geographical origin identification of Chinese white teas, and their differences in tastes, chemical compositions and antioxidant activities among three production regions. Food Chem. X 2022, 16, 100504. [Google Scholar] [CrossRef] [PubMed]
  12. Jiao, Y.; Cai, M.; Zhang, X.; Feng, Z.; Zhang, Q.; Li, L.; Jin, G.; Fan, S.; Lu, L. Impact of spreading time on flavor quality in Duyun Maojian summer green tea. LWT 2024, 214, 117103. [Google Scholar] [CrossRef]
  13. Xu, S.; Song, L.; Shi, D.; Wu, S.; Ma, F.; Chen, H.; Meng, Q.; Fei, Q.; Meng, L.; Wu, W.; et al. Dynamic variations in flavor profiles of Guizhou high-mountain white tea produced by Eurotium cristatum using solid-state fermentation. Food Biosci. 2025, 68, 106725. [Google Scholar] [CrossRef]
  14. Wang, C.-M.; Du, X.; Nie, C.-N.; Zhang, X.; Tan, X.-Q.; Li, Q. Evaluation of sensory and safety quality characteristics of “high mountain tea”. Food Sci. Nutr. 2022, 10, 3338–3354. [Google Scholar] [CrossRef]
  15. Qingyang, W.; Ziwei, Z.; Jihang, H.; Suhui, Z.; Shuling, R.; Xiaochun, L.; Shuirong, Y.; Yun, S. Analysis of aroma precursors in Jinmudan fresh tea leaves and dynamic change of fatty acid volatile during black tea processing. Food Chem. X 2024, 21, 101155. [Google Scholar] [CrossRef]
  16. Zhang, J.; Jiang, Y.; Zhang, Y.; Li, T.; Zhao, W.; Yu, T.; Wu, Y.; Lu, M.; Ning, J.; Zhang, Z. In vitro bioactivity and metabolomics of green tea processed from raw materials of different. Curr. Res. Food Sci. 2025, 10, 101076. [Google Scholar] [CrossRef]
  17. Xu, C.; Liang, L.; Li, Y.; Yang, T.; Fan, Y.; Mao, X.; Wang, Y. Studies of quality development and major chemical composition of green tea processed from tea with different shoot maturity. LWT 2021, 142, 111055. [Google Scholar] [CrossRef]
  18. Chen, R.; Lai, X.; Wen, S.; Li, Q.; Cao, J.; Lai, Z.; Zhang, Z.; Hao, M.; Li, Q.; Sun, S.; et al. Analysis of tea quality of large-leaf black tea with different harvesting tenderness based on metabolomics. Food Control 2024, 163, 110474. [Google Scholar] [CrossRef]
  19. Qin, C.; Han, Z.; Jiang, Z.; Ke, J.-P.; Li, W.; Zhang, L.; Li, D. Chemical profile and in-vitro bioactivities of three types of yellow teas processed from different tenderness of young shoots of Huoshanjinjizhong (Camellia sinensis var. sinensis). Food Chem. X 2024, 24, 101809. [Google Scholar] [CrossRef]
  20. Shao, C.-Y.; Zhang, Y.; Lv, H.-P.; Zhang, Z.-F.; Zeng, J.-M.; Peng, Q.-H.; Zhu, Y.; Lin, Z. Aromatic profiles and enantiomeric distributions of chiral odorants in baked green teas with different picking tenderness. Food Chem. 2022, 388, 132969. [Google Scholar] [CrossRef] [PubMed]
  21. Liu, G.; Wang, L.; Wang, J.; Zheng, H.; Yang, S.; Zhou, Q.; Fu, L.; Cai, Z.; Zhang, S.; Wang, C.; et al. Temporal dynamics of bioactive compounds in sweet tea (Lithocarpus litseifolius (Hance) Chun): Linking harvest stages to flavor and health benefits. Food Res. Int. 2025, 218, 116918. [Google Scholar] [CrossRef]
  22. Deng, J.; Lin, W.; Li, T.; Wei, J.; Cui, J.; Chen, H.; Zhang, W. Maturity-dependent variations in metabolites and taste quality of Hainan mulberry leaf green tea by using metabolomics and e-tongue analysis. Food Biosci. 2025, 68, 106413. [Google Scholar] [CrossRef]
  23. Deka, H.; Sarmah, P.P.; Nath, B.; Gogoi, M.; Datta, S.; Sabhapondit, S. Unveiling the effect of leaf maturity on biochemical constituents and quality of CTC black tea: Insights from Northeast India’s commercial cultivars. J. Food Meas. Charact. 2024, 18, 9921–9937. [Google Scholar] [CrossRef]
  24. Feng, J.; Zhuang, J.; Chen, Q.; Lin, H.; Chu, Q.; Chen, P.; Wang, F.; Yu, B.; Hao, Z. The effect of maturity of tea leaves and processing methods on the formation of milky flavor in white tea—A metabolomic study. Food Chem. 2024, 447, 139080. [Google Scholar] [CrossRef] [PubMed]
  25. Duan, Y.; Gao, X.; Shi, B.; Yu, M.; Raza, J.; Sun, H.; Wang, Y.; Song, H.; Xu, Y. Characterize the dynamic changes of volatile compounds during the roasting process of Wuyi rock tea (Shuixian) integrating GC-IMS and GC × GC-O-MS combined with machine learning. Food Chem. 2025, 489, 144931. [Google Scholar] [CrossRef] [PubMed]
  26. Niu, J.; Liu, R.; Li, W.; Lang, Y.; Li, X.; Sun, W.; Sun, B. Characterize and explore the dynamic changes in the volatility profiles of sauce-flavor baijiu during different rounds by GC-IMS, GC–MS and GC×GC–MS combined with machine learning. Food Res. Int. 2025, 213, 116568. [Google Scholar] [CrossRef]
  27. Wu, H.; Yao, W.; Ma, S.; Liu, D.; Zhang, M. Evolution of key flavor compounds in sonit sheep lamb during grilling process using HS-GC-IMS, E-nose, E-tongue, and physicochemical techniques. Int. J. Gastron. Food Sci. 2025, 40, 101158. [Google Scholar] [CrossRef]
  28. Zhang, H.; Zhang, J.; Liu, S.; Li, T.; Wei, Y.; Gu, Z.; Su, Z.; Ning, J.; Wang, Y.; Hou, Z. Characterization of the key volatile compounds in longjing tea (Camellia sinensis) with different aroma types at different steeping temperatures by GC–MS and GC–IMS. LWT 2024, 200, 116183. [Google Scholar] [CrossRef]
  29. Gong, S.; Zhang, Z.; Chen, J.; Wu, H.; Jiang, H.; Teng, C.; Dai, Z. Enhanced understanding of dark tea quality through integrated GC-IMS and E-Nose analysis. LWT 2025, 224, 117806. [Google Scholar] [CrossRef]
  30. Guo, X.; Schwab, W.; Ho, C.-T.; Song, C.; Wan, X. Characterization of the aroma profiles of oolong tea made from three tea cultivars by both GC–MS and GC-IMS. Food Chem. 2022, 376, 131933. [Google Scholar] [CrossRef] [PubMed]
  31. Liu, H.; Fu, M.; Wu, J.; Yu, Y.; Si, W.; Xu, Y.; Yang, J. Flavor characterization of aged Citri Reticulatae Pericarpium from core regions: An integrative approach utilizing GC-IMS, GC–MS, E-nose, E-tongue, and chemometrics. Food Chem. 2025, 490, 144995. [Google Scholar] [CrossRef] [PubMed]
  32. Zhang, X.; Lu, X.; He, C.; Chen, Y.; Wang, Y.; Hu, L.; Qing, Q.; Zhu, M.; Liu, Z.; Xiao, Y. Characterizing and decoding the dynamic alterations of volatile organic compounds and non-volatile metabolites of dark tea by solid-state fermentation with Penicillium polonicum based on GC–MS, GC-IMS, HPLC, E-nose and E-tongue. Food Res. Int. 2025, 209, 116279. [Google Scholar] [CrossRef]
  33. Chen, G.; Zhu, G.; Xie, H.; Zhang, J.; Huang, J.; Liu, Z.; Wang, C. Characterization of the key differential aroma compounds in five dark teas from different geographical regions integrating GC–MS, ROAV and chemometrics approaches. Food Res. Int. 2024, 194, 114928. [Google Scholar] [CrossRef] [PubMed]
  34. Li, S.; Han, Y.; Wang, L.; Zhang, Y.; Wang, F.; Ou, Y.; Li, H.; Yang, L.; Qiu, S.; Lu, J. Machine learning-enhanced flavoromics: Identifying key aroma compounds and predicting sensory quality in sauce-flavor baijiu. Food Chem. 2025, 475, 143328. [Google Scholar] [CrossRef]
  35. Hu, Y.; Sheng, W.; Adade, S.Y.-S.S.; Wang, J.; Li, H.; Chen, Q. Comparison of machine learning and deep learning models for detecting quality components of vine tea using smartphone-based portable near-infrared device. Food Control 2025, 174, 111244. [Google Scholar] [CrossRef]
  36. Zhu, M.; Wang, M.; Gu, J.; Deng, Z.; Zhang, W.; Pan, Z.; Luo, G.; Wu, R.; Qin, J.; Gomi, K. Machine learning-assisted aroma profile prediction in Jiang-flavor baijiu. Food Chem. 2025, 478, 143661. [Google Scholar] [CrossRef]
  37. MetaboAnalyst. MetaboAnalyst 6.0—From Raw Spectra to Biomarkers, Patterns, Functions and Systems Biology. Available online: https://www.metaboanalyst.ca (accessed on 25 June 2025).
  38. Shan, X.; Jiang, Y.; Zhang, S.; Chen, L.; Niu, L.; Zhang, Q.; Zhou, Q.; Wang, Y.; Yuan, H.; Li, J. Key umami taste contributors in Longjing green tea uncovered by integrated means of sensory quantitative descriptive analysis, metabolomics, quantification analysis and taste addition experiments. Food Chem. 2024, 453, 139628. [Google Scholar] [CrossRef]
  39. Wang, J.J.; Yuan, H.B.; Hua, J.J.; Yang, Y.Q.; Shen, S.; Pang, D.D.; Jiang, Y.W.; Zhu, J.Y.; Yu, L.Y. Comparison on Main Flavoring Substances in New Shoots of Large Leaf Tea Cultivars with Different Varieties and Tenderness Based on Multivariate Statistical Analysis. Sci. Technol. Food Ind. 2022, 43, 81–92. [Google Scholar] [CrossRef]
  40. Kun, J.; Yang, Y.; Sun, J.; Dai, H.; Luo, Z.; Tong, H. Characterization of potential aroma compounds in five aroma types of green tea using the sensomics approach. LWT 2025, 215, 117177. [Google Scholar] [CrossRef]
  41. Yang, Y.; Zhang, M.; Hua, J.; Deng, Y.; Jiang, Y.; Li, J.; Wang, J.; Yuan, H.; Dong, C. Quantitation of pyrazines in roasted green tea by infrared-assisted extraction coupled to headspace solid-phase microextraction in combination with GC-QqQ-MS/MS. Food Res. Int. 2020, 134, 109167. [Google Scholar] [CrossRef]
  42. Liao, X.; Yan, J.; Wang, B.; Meng, Q.; Zhang, L.; Tong, H. Identification of key odorants responsible for cooked corn-like aroma of green teas made by tea cultivar ‘Zhonghuang 1’. Food Res. Int. 2020, 136, 109355. [Google Scholar] [CrossRef] [PubMed]
  43. Yu, J.; Ho, C.-T.; Lin, Z.; Zhu, Y.; Feng, Z.; Ni, D.; Zeng, S.; Zeng, X.; Wang, Y.; Ning, J.; et al. Sensomics-Assisted Characterization of Key Flowery Aroma Compounds in Lu’an Guapian Green Tea Infusion (Camellia sinensis). J. Agric. Food Chem. 2023, 71, 6120–6132. [Google Scholar] [CrossRef] [PubMed]
  44. Ho, C.-T.; Zheng, X.; Li, S. Tea aroma formation. Food Sci. Hum. Wellness 2015, 4, 9–27. [Google Scholar] [CrossRef]
  45. Zhu, Y.; Lv, H.-P.; Shao, C.-Y.; Kang, S.; Zhang, Y.; Guo, L.; Dai, W.-D.; Tan, J.-F.; Peng, Q.-H.; Lin, Z. Identification of key odorants responsible for chestnut-like aroma quality of green teas. Food Res. Int. 2018, 108, 74–82. [Google Scholar] [CrossRef] [PubMed]
  46. Qiao, L.; Zhang, M.; Zhang, C. Vascular macrophages sense octanal and drive athero-inflammation. Cell. Mol. Immunol. 2022, 19, 1077–1078. [Google Scholar] [CrossRef]
  47. Kour, D.; Negi, R.; Khan, S.S.; Kumar, S.; Kaur, S.; Kaur, T.; Sharma, B.; Dasila, H.; Kour, H.; Ramniwas, S.; et al. Microbes mediated induced systemic response in plants: A review. Plant Stress 2024, 11, 100334. [Google Scholar] [CrossRef]
  48. El Hachlafi, N.; Benkhaira, N.; Al-Mijalli, S.H.; Mrabti, H.N.; Abdnim, R.; Abdallah, E.M.; Jeddi, M.; Bnouham, M.; Lee, L.-H.; Ardianto, C.; et al. Phytochemical analysis and evaluation of antimicrobial, antioxidant, and antidiabetic activities of essential oils from Moroccan medicinal plants: Mentha suaveolens, Lavandula stoechas, and Ammi visnaga. Biomed. Pharmacother. 2023, 164, 114937. [Google Scholar] [CrossRef]
  49. Oliveira-Fernandes, J.; Oliveira-Pinto, P.R.; Mariz-Ponte, N.; Sousa, R.M.O.F.; Santos, C. Satureja montana and Mentha pulegium essential oils’ antimicrobial properties against Pseudomonas syringae pv. actinidiae and elicitor potential through the modulation of kiwifruit hormonal defenses. Microbiol. Res. 2023, 277, 127490. [Google Scholar] [CrossRef]
  50. Hu, Z.; Bakry, A.M.; Shi, L.; Zhan, P.; He, W.; Eid, W.A.M.; Ferweez, H.; Hamed, Y.S.; Ismail, H.A.; Tian, H.; et al. Mechanistic insights into cross-modal aroma-taste interactions mediating sweetness perception enhancement in Fu brick tea. Food Chem. 2025, 489, 144933. [Google Scholar] [CrossRef] [PubMed]
  51. Chen, C.; Zhou, Q.; Tian, T.; Yu, H.; Xu, Z.; Tian, H.; Yuan, H. Interactions among key aroma compounds and the influence of taste substances on aroma perception in Gouda cheese. LWT 2024, 207, 116670. [Google Scholar] [CrossRef]
  52. Wang, B.-H.; Huang, P.-H.; Lo, C.-Y.; Chang, W.-C. Metabolomic analysis elucidates the dynamic changes in aroma compounds and the milk aroma mechanism across various portions of tea leaves during different stages of Oolong tea processing. Food Res. Int. 2025, 209, 116203. [Google Scholar] [CrossRef] [PubMed]
  53. Zhou, X.; Ren, J.; An, J.; Yan, D.; Shi, X.; Jin, X. Predicting open-plan office window operating behavior using the random forest algorithm. J. Build. Eng. 2021, 42, 102514. [Google Scholar] [CrossRef]
  54. The Good Scents Company Information System. Available online: https://www.thegoodscentscompany.com/ (accessed on 9 June 2025).
  55. Flavor Ingredient Library. Available online: https://www.femaflavor.org/flavor-library (accessed on 10 June 2025).
  56. Van Gemert, L.J. Odour Thresholds: Compilations of Odour Threshold Values in Air, Water and Other Media, 2011st ed.; Oliemans Punter & Partners BV: Zeist, The Netherlands, 2011. [Google Scholar]
Figure 1. Electronic tongue (e-tongue) analysis of green tea samples with different leaf maturity stages. (A) Principal component analysis (PCA) score plot demonstrating sample clustering. (B) Biplot integrating sample distribution and variable contributions. (C) Heatmap visualizing taste attribute intensities across samples.
Figure 1. Electronic tongue (e-tongue) analysis of green tea samples with different leaf maturity stages. (A) Principal component analysis (PCA) score plot demonstrating sample clustering. (B) Biplot integrating sample distribution and variable contributions. (C) Heatmap visualizing taste attribute intensities across samples.
Foods 14 02861 g001
Figure 2. E-nose analysis of taste characteristics in green tea with different leaf maturity stages. (A) PLS-DA score plot demonstrating sample clustering based on tenderness grade. (B) Biplot with 95% confidence ellipses showing variable contributions to sample discrimination. (C) Cross-validation (CV) results indicate model prediction accuracy. Q2 values (predictive ability) with asterisks (*) denote statistical significance (p < 0.05), indicating the model outperforms random chance. (D) Permutation test statistics (n = 2000) validating model significance (p < 0.05). (E) Variable importance in projection (VIP) scores plot highlighting key discriminant compounds. (F) Heatmap visualization of normalized sensor response patterns across samples (red: high intensity; blue: low intensity).
Figure 2. E-nose analysis of taste characteristics in green tea with different leaf maturity stages. (A) PLS-DA score plot demonstrating sample clustering based on tenderness grade. (B) Biplot with 95% confidence ellipses showing variable contributions to sample discrimination. (C) Cross-validation (CV) results indicate model prediction accuracy. Q2 values (predictive ability) with asterisks (*) denote statistical significance (p < 0.05), indicating the model outperforms random chance. (D) Permutation test statistics (n = 2000) validating model significance (p < 0.05). (E) Variable importance in projection (VIP) scores plot highlighting key discriminant compounds. (F) Heatmap visualization of normalized sensor response patterns across samples (red: high intensity; blue: low intensity).
Foods 14 02861 g002
Figure 3. Volatile organic compound profiling of three tenderness-grade green teas using HS-GC-IMS. (A) Three-dimensional topographic plots showing ion mobility drift time (x-axis), GC retention time (y-axis), and normalized signal intensity (z-axis, a.u.). (B) Two-dimensional topographic plots displaying VOC fingerprint patterns, with RIP (Reaction Ion Peak) normalized to 1.0 (vertical line) and compound signals color-coded by concentration (red: high; blue: low). (C) Differential comparison plots generated by spectral subtraction (reference: FDQSG), where red indicates upregulated and blue indicates downregulated VOCs relative to the reference.
Figure 3. Volatile organic compound profiling of three tenderness-grade green teas using HS-GC-IMS. (A) Three-dimensional topographic plots showing ion mobility drift time (x-axis), GC retention time (y-axis), and normalized signal intensity (z-axis, a.u.). (B) Two-dimensional topographic plots displaying VOC fingerprint patterns, with RIP (Reaction Ion Peak) normalized to 1.0 (vertical line) and compound signals color-coded by concentration (red: high; blue: low). (C) Differential comparison plots generated by spectral subtraction (reference: FDQSG), where red indicates upregulated and blue indicates downregulated VOCs relative to the reference.
Foods 14 02861 g003
Figure 4. Qualitative and quantitative analysis of volatile organic compounds (VOCs) in green tea samples. (A) Proportional distribution of VOC chemical classes (pie chart with percentage annotations). (B) Chord diagram illustrating compound-specific correlations between tenderness grades and VOC abundances (band width corresponds to relative concentration).
Figure 4. Qualitative and quantitative analysis of volatile organic compounds (VOCs) in green tea samples. (A) Proportional distribution of VOC chemical classes (pie chart with percentage annotations). (B) Chord diagram illustrating compound-specific correlations between tenderness grades and VOC abundances (band width corresponds to relative concentration).
Foods 14 02861 g004
Figure 5. Characteristic VOC fingerprints of FDG across different tenderness grades. Roman numerals indicate clusters of VOCs: I, predominantly enriched in FDQSG; III, predominantly enriched in FDMJ1G; VI, predominantly enriched in FDTC2G; II and V, co-enriched across all FDG grades; IV, predominantly co-enriched in FDMJ1G and FDTC2G.
Figure 5. Characteristic VOC fingerprints of FDG across different tenderness grades. Roman numerals indicate clusters of VOCs: I, predominantly enriched in FDQSG; III, predominantly enriched in FDMJ1G; VI, predominantly enriched in FDTC2G; II and V, co-enriched across all FDG grades; IV, predominantly co-enriched in FDMJ1G and FDTC2G.
Foods 14 02861 g005
Figure 6. Cluster analysis of VOCs in three tenderness grades of green tea. (A) Hierarchical clustering analysis (HCA) dendrogram showing similarity relationships. (B) K-means clustering plot demonstrating VOC grouping patterns.
Figure 6. Cluster analysis of VOCs in three tenderness grades of green tea. (A) Hierarchical clustering analysis (HCA) dendrogram showing similarity relationships. (B) K-means clustering plot demonstrating VOC grouping patterns.
Foods 14 02861 g006
Figure 7. Comparative analysis of VOCs among three tenderness-grade green teas using volcano plots. (A) FDMJ1G vs. FDQSG (thresholds: |log2FC| > 1.0, p < 0.05). (B) FDTC2G vs. FDQSG (thresholds: |log2FC| > 1.0, p < 0.05). (C) FDTC2G vs. FDMJ1G (thresholds: |log2FC| > 1.0, p < 0.05).
Figure 7. Comparative analysis of VOCs among three tenderness-grade green teas using volcano plots. (A) FDMJ1G vs. FDQSG (thresholds: |log2FC| > 1.0, p < 0.05). (B) FDTC2G vs. FDQSG (thresholds: |log2FC| > 1.0, p < 0.05). (C) FDTC2G vs. FDMJ1G (thresholds: |log2FC| > 1.0, p < 0.05).
Foods 14 02861 g007
Figure 8. Multivariate analysis of green tea tenderness grades using partial least squares-discriminant analysis (PLS-DA). (A) PLS-DA score plot demonstrating group separation. (B) Permutation test results (n = 200) validating model robustness. (C) Loading plot highlighting highly correlated compounds contributing to group discrimination. Both grey and black circles represent VOCs, with the black circles marking the top 20 VOCs with the highest explanatory contribution to the original variables.
Figure 8. Multivariate analysis of green tea tenderness grades using partial least squares-discriminant analysis (PLS-DA). (A) PLS-DA score plot demonstrating group separation. (B) Permutation test results (n = 200) validating model robustness. (C) Loading plot highlighting highly correlated compounds contributing to group discrimination. Both grey and black circles represent VOCs, with the black circles marking the top 20 VOCs with the highest explanatory contribution to the original variables.
Foods 14 02861 g008
Figure 9. KEGG pathway enrichment analysis of green tea metabolites across leaf maturity stages. (A) Scatter plot of enriched pathways. (BD) Pathway-metabolite networks (blue: pathways; color gradient: log2FC).
Figure 9. KEGG pathway enrichment analysis of green tea metabolites across leaf maturity stages. (A) Scatter plot of enriched pathways. (BD) Pathway-metabolite networks (blue: pathways; color gradient: log2FC).
Foods 14 02861 g009
Figure 10. Machine learning-driven identification of flavor markers in FDG of different tenderness grades. (A) Receiver operating characteristic (ROC) curves of six algorithms (KNN, RF, SVM, GNB, LR, DT) using all features. (B) Bee swarm plot of top 20 discriminant features ranked by SHAP importance. (C) Box plots of selected markers’ relative abundance.
Figure 10. Machine learning-driven identification of flavor markers in FDG of different tenderness grades. (A) Receiver operating characteristic (ROC) curves of six algorithms (KNN, RF, SVM, GNB, LR, DT) using all features. (B) Bee swarm plot of top 20 discriminant features ranked by SHAP importance. (C) Box plots of selected markers’ relative abundance.
Foods 14 02861 g010
Table 1. Key aroma-active compounds in three tenderness grades of green tea identified using HS-GC-IMS-OAV analysis (rOAV > 1).
Table 1. Key aroma-active compounds in three tenderness grades of green tea identified using HS-GC-IMS-OAV analysis (rOAV > 1).
NO.Volatile CompoundsOdor Type Chemical ClassesOT
(μg/L)
rOAV
FDQSGFDMJ1GFDTC2G
1LinaloolfloralTerpenes0.223.8 × 1037.4 × 1036.3 × 103
2trans-linalool oxidefloralTerpenes604.23.54.0
3Nonanal-MaldehydicAldehydes1.19.1 × 1028.1 × 1028.6 × 102
4Nonanal-DaldehydicAldehydes1.12.1 × 1021.7 × 1022.0 × 102
5(E)-2-octenalfattyAldehydes3.0351621
6Octanal-MaldehydicAldehydes0.587389343422
7Heptanal-MgreenAldehydes2.80135166171
8Heptanal-DgreenAldehydes2.80736514423
9(E)-hept-2-enalgreenAldehydes135.60.950.89
10Octanal-DaldehydicAldehydes0.58752.728.644.8
11PentanalfermentedAldehydes12664525
122-methylbutanalcocoaAldehydes1.01.1 × 1031.3 × 1031.4 × 103
133-methylbutanalaldehydicAldehydes1.15.8 × 1027.3 × 1026.9 × 102
14ButanalchocolateAldehydes2.0699445
15HexanalgreenAldehydes2.41.7 × 1027141
16(E)-2-hexenalgreenAldehydes88.75.963.432.62
17methyl-5-hepten-2-onecitrusKetones688.58.87.6
182-OctanoneearthyKetones50.22.740.710.92
193-hydroxybutan-2-onebutteryKetones149.79.57.0
202-PropanonesolventKetones8324.285.334.53
211-penten-3-onespicyKetones235.91.41.1
223-PentanoneetherealKetones403.01.91.5
23n-Hexanol-MherbalAlcohols5.642.346.139.4
24(Z)-3-hexen-1-olgreenAlcohols3.9675962
25n-Hexanol-DherbalAlcohols5.67.67.76.5
261.8-CineoleherbalAlcohols1.1011.711.412.5
27oct-1-en-3-olearthyAlcohols1.5332217
28pentan-1-ol-MfermentedAlcohols150.22.302.402.41
29pentan-1-ol-DfermentedAlcohols150.21.561.661.66
302-Methylbutanol acetatefruityEsters5.0025.92.581.83
31Propyl isovaleratefruityEsters8.701.611.210.96
32Ethyl AcetateetherealEsters5.0054.595.992.3
33methyl acetateetherealEsters15001.0801.0701.430
342-pentyl furanfruityFurans5.8019.013.811.6
352-n-ButylfuranspicyFurans5.05.83.43.0
36Methyl SalicylatemintyAromatics403.85.34.1
37benzene acetaldehydegreenAromatics6.39.27.614
383-methylthiopropanalvegetableSulfur-containing Compounds0.458.59.35.9
39Ethylsulfidecoffee, meatSulfur-containing Compounds4.83.63.75.2
402-Ethyl-3,5-dimethylpyrazinenuttyPyrazines0.0401.42 × 1031.44 × 1031.35 × 103
412-acetyl-1-pyrrolinepopcornOther Heterocyclics0.1257661.6 × 102
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, X.; Huang, M.; Tang, W.; Li, Y.; Li, L.; Xie, J.; Li, X.; Dong, F.; Wang, M. Characterization and Exploration of the Flavor Profiles of Green Teas from Different Leaf Maturity Stages of Camellia sinensis cv. Fudingdabai Using E-Nose, E-Tongue, and HS-GC-IMS Combined with Machine Learning. Foods 2025, 14, 2861. https://doi.org/10.3390/foods14162861

AMA Style

Liu X, Huang M, Tang W, Li Y, Li L, Xie J, Li X, Dong F, Wang M. Characterization and Exploration of the Flavor Profiles of Green Teas from Different Leaf Maturity Stages of Camellia sinensis cv. Fudingdabai Using E-Nose, E-Tongue, and HS-GC-IMS Combined with Machine Learning. Foods. 2025; 14(16):2861. https://doi.org/10.3390/foods14162861

Chicago/Turabian Style

Liu, Xiaohui, Mingzheng Huang, Weiyuan Tang, Yucai Li, Lun Li, Jinyi Xie, Xiangdong Li, Fabao Dong, and Maosheng Wang. 2025. "Characterization and Exploration of the Flavor Profiles of Green Teas from Different Leaf Maturity Stages of Camellia sinensis cv. Fudingdabai Using E-Nose, E-Tongue, and HS-GC-IMS Combined with Machine Learning" Foods 14, no. 16: 2861. https://doi.org/10.3390/foods14162861

APA Style

Liu, X., Huang, M., Tang, W., Li, Y., Li, L., Xie, J., Li, X., Dong, F., & Wang, M. (2025). Characterization and Exploration of the Flavor Profiles of Green Teas from Different Leaf Maturity Stages of Camellia sinensis cv. Fudingdabai Using E-Nose, E-Tongue, and HS-GC-IMS Combined with Machine Learning. Foods, 14(16), 2861. https://doi.org/10.3390/foods14162861

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop