Next Article in Journal
One-Pot Synthesis of Melamine Formaldehyde Resin-Derived N-Doped Porous Carbon for CO2 Capture Application
Next Article in Special Issue
Antioxidant Guided Fractionation of Blackberry Polyphenols Show Synergistic Role of Catechins and Ellagitannins
Previous Article in Journal
A Novel Insight into the Ullmann Homocoupling Reactions Performed in Heterogeneous Catalytic Systems
Previous Article in Special Issue
Using HPLC with In-Column Derivatization to Authenticate Coffee Samples
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

NMR-Based Metabolomics: A New Paradigm to Unravel Defense-Related Metabolites in Insect-Resistant Cotton Variety through Different Multivariate Data Analysis Approaches

by
Anam Amin Shami
1,2,
Muhammad Tayyab Akhtar
2,*,
Muhammad Waseem Mumtaz
3,
Hamid Mukhtar
2,
Amna Tahir
4,
Syed Shahzad-ul-Hussan
5,*,
Safee Ullah Chaudhary
4,
Bushra Muneer
2,
Hafsa Iftikhar
5 and
Marios Neophytou
6,*
1
Department of Biotechnology, University of Central Punjab, Lahore 54000, Pakistan
2
Institute of Industrial Biotechnology, GC University Lahore, Lahore 54000, Pakistan
3
Department of Chemistry, University of Gujrat, Gujrat 50700, Pakistan
4
Biomedical Informatics and Engineering Research Laboratory, Department of Life Sciences, Syed Babar Ali School of Science and Engineering, Lahore University of Management Sciences, Lahore 54792, Pakistan
5
Department of Biology, Syed Babar Ali School of Science and Engineering, Lahore University of Management Sciences, Lahore 54792, Pakistan
6
KAUST Solar Center (KSC), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia
*
Authors to whom correspondence should be addressed.
Molecules 2023, 28(4), 1763; https://doi.org/10.3390/molecules28041763
Submission received: 30 October 2022 / Revised: 12 December 2022 / Accepted: 5 January 2023 / Published: 13 February 2023
(This article belongs to the Special Issue Natural Polyphenols: Isolation, Characterization and Health Benefits)

Abstract

:
Cotton (Gossypium hirsutum) is an economically important crop and is widely cultivated around the globe. However, the major problem of cotton is its high vulnerability to biotic and abiotic stresses. It has been around three decades since the cotton plant was genetically engineered with genes encoding insecticidal proteins (mainly Cry proteins) with an aim to protect it against insect attack. Several studies have been reported on the impact of these genes on cotton production and fiber quality. However, the metabolites responsible for conferring resistance in genetically modified cotton need to be explored. The current work aims to unveil the key metabolites responsible for insect resistance in Bt cotton and also compare the conventional multivariate analysis methods with deep learning approaches to perform clustering analysis. We aim to unveil the marker compounds which are responsible for inducing insect resistance in cotton plants. For this purpose, we employed 1H-NMR spectroscopy to perform metabolite profiling of Bt and non-Bt cotton varieties, and a total of 42 different metabolites were identified in cotton plants. In cluster analysis, deep learning approaches (linear discriminant analysis (LDA) and neural networks) showed better separation among cotton varieties compared to conventional methods (principal component analysis (PCA) and orthogonal partial least square discriminant analysis (OPLSDA)). The key metabolites responsible for inter-class separation were terpinolene, α-ketoglutaric acid, aspartic acid, stigmasterol, fructose, maltose, arabinose, xylulose, cinnamic acid, malic acid, valine, nonanoic acid, citrulline, and shikimic acid. The metabolites which regulated differently with the level of significance p < 0.001 amongst different cotton varieties belonged to the tricarboxylic acid cycle (TCA), Shikimic acid, and phenylpropanoid pathways. Our analyses underscore a biosignature of metabolites that might involve in inducing insect resistance in Bt cotton. Moreover, novel evidence from our study could be used in the metabolic engineering of these biological pathways to improve the resilience of Bt cotton against insect/pest attacks. Lastly, our findings are also in complete support of employing deep machine learning algorithms as a useful tool in metabolomics studies.

1. Introduction

The development of plants with improved agronomic traits has revolutionized the cash crop (e.g., wheat, cotton, and rice) production through genetic engineering. Moreover, over the years genetic engineering has proven to play a substantial role in the development of crops with enhanced traits yielding better productivity and resistant characteristics against insects and herbicides. Resultantly, these genetically engineered crops generate significant revenues and yields [1,2].
Cotton (Gossypium hirsutum) is one of the major cash crops in the world. It is included among the important agricultural contributors towards economic growth for both developed and underdeveloped countries, and is found to be ranked amongst the major agricultural contributors which produce a significant impact on the economy of both developed and middle-income countries. However, according to an evaluation. About 87% of cotton is provided by developing countries [3]. Further, a retrospective study showed that cotton being a major stakeholder in the world’s economy brought about an economic impact of $600 billion in the world’s leading textile industries [4]. In addition to the economic impact, cotton shows a high vulnerability to both biotic and abiotic stresses, as reported by numerous studies about the key role of these stress factors as key players in the deterioration of cotton production [5,6]. For this very reason, cotton was genetically engineered with genes encoding insecticidal proteins. The commercialization of Bt cotton has revolutionized the cotton industry, it has raised the farmer’s benefit by 261–438 US$ per hectare [7]. Therefore, Bt-engineered cotton is safeguarding crops against major devastators like Helicoverpa armigera, pink bollworm, and diamond black moth [8]. Studies have been reported on the impact of these genes on cotton production and fiber quality [9]. However, the metabolites responsible for conferring resistance in genetically modified cotton need to be explored.
Plants produce a large number of metabolites of heterogeneous nature, which play important roles in plant growth, development, and their response to the external environment [10]. These small molecules (metabolites) are valuable nutritional sources not only for plants but also for humans and livestock. According to an estimate, nearly 200,000 compounds have been found in the plant kingdom and out of 200,000, only 50,000 metabolites are known so far [11]. Plant metabolites, particularly secondary metabolites, play a prominent role in plants’ defense mechanisms. Secondary metabolites include alkaloids, terpenoids, flavonoids, isoprenoids, phenylpropanoids, and nitrogen- and sulfur-based phenolic compounds [12]. Summarily, these phytochemicals serve as a preliminary line of defense against biotic stresses [13]. Terpenoids such as iridoids and benzoxazinoids play an active role in plant defense [14]. Gossypol, a sesquiterpenoid aldehyde was found to be phytoalexin in nature. Similarly, hemigossypolene showed anti-fungal activities against A. flavus [15]. Thus, the diversity of plant metabolites and their complicated regulatory mechanisms have stressed upon the need to explore them in depth. Consequently, making plant metabolomics an emerging field of science for researchers to explore different tangents in order to get a dynamic snapshot of their underlying biochemical pathways, identification of plant disease biomarkers, and studying plants’ response to biotic and abiotic stresses [16].
Plant metabolomics refers to the qualitative and quantitative study of small molecules (<1500 Da). Therefore, this field of science is equipped with the provision of highly accurate information related to the physiological state of an organism [17]. Different analytical techniques like gas chromatography–mass spectrometry (GC-MS), liquid chromatography–mass spectrometry (LC-MS), nuclear magnetic resonance spectroscopy (NMR), liquid chromatography nuclear magnetic resonance (LC-NMR), and liquid chromatography solid phase extraction nuclear magnetic resonance (LC-SPE-NMR) have been employed in plant metabolomics [18,19]. LCMS is a more sensitive approach than GCMS and NMR though the major disadvantage of this platform is ion suppression. Therefore, NMR is used as a complementary technique to GCMS and LCMS [20]. NMR spectroscopy is robust, non-invasive, and highly reproducible including simple sample preparation [21]. It is an ideal technique to study real-time metabolite profiling in living systems [22].
Previously, Ren et al. [23] examined the metabolic profile of wild-type and transgenic Arabidopsis plants through NMR spectroscopy. Similar work was done to study host plant resistance against thrips by using 1H-NMR spectroscopy [24]. Likewise, Becerra–Martinez et al. [25] deployed NMR-based metabolomics to investigate differences in the metabolic composition of transgenic tobacco plants expressing sequisterpene cyclase. Herein, the present study we have used NMR spectroscopy for the metabolomic analysis of Bt and non-Bt cotton varieties. The present work aimed to unveil marker compounds responsible for insect resistance in Bt cotton plants. Furthermore, it explores the impact of Bt genes on the expression level of the key metabolites.

2. Results

2.1. 1H-NMR Identification of Metabolites in Cotton Varieties

1H-NMR was used to unravel the metabolic dynamics between FH-142 (Bt cotton), Mac-07 (pest resistant), and 496 (wild) cotton varieties. The representative 1H-NMR spectra are shown in Figure 1. The identification of metabolites was done by consulting previously published literature [26,27,28] and metabolite databases [29,30]. The detailed metabolites along with their multiplicity pattern are tabulated in Table 1. The assignment of metabolites was validated through 2D J-resolved experiment (Supplementary Materials Figure S1). The 1H-NMR spectra of different G. hirsutum varieties can be divided into three spectral regions i.e., a high field or aliphatic (δ 0.5–3.0 ppm), saccharide (δ 3.0–5.5 ppm), and aromatic region (δ 5.5–10.00 ppm). The major constituents identified were amino acids, organic acids, lipids, terpenoids, and flavonoids. In the aliphatic region, characteristic signals of four non-essential amino acids. Namely, alanine, γ-amino butyric acid (GABA), asparagine, citrulline, and one essential-amino acid (valine) were identified. Additionally, successful detection of succinic acid at δ 2.45 (s), aspartic acid (δ 2.68, dd), α-Ketoglutaric acid (2.43, t), and terpenoids (limonene and terpinolene) were also noted. Regardless of overlapping in the carbohydrate region, anomeric proton signals of arabinose, fructose, xylulose, maltose, and sucrose were observed. Likewise, the anomeric proton signal of sugar moieties of choline and sycloinositol was also pointed at δ 3.20 (s) and δ 3.21 (s), respectively. The diagnostic signals of organic acids namely aconitic acid (δ 3.43, d) shikimic acid (δ 4.01), and tartaric acid (δ 4.35) were also observed. In the low field of 1H-NMR spectra (Figure 1), characteristic signals of uridine (δ 5.95, d) were detected. The signals for phenylpropanoid derivative (cinnamic acid) were found at δ 6.52 (d) and δ 7.60 (m). The 2H signal of fumarate was identified at δ 6.5 (s). However, the characteristic signals of dibutyl phthalate (δ 7.7, s), formate (δ 8.47, s), and trigonelline (δ 9.14, s) were also annotated. The visual inspection revealed marginal metabolic differences between the wild (496) and pest-resistant (FH-142 and MAC-07) varieties of cotton. The most prominent difference observed among cotton varieties is the presence of stigmasterol in non-transgenic pest-resistant varieties (MAC-07). Moreover, the signal intensity of valine, limonene, succinic acid, terpinolene, γ-aminobutyric acid, sycloinositol, proline, shikimic acid, chlorogenic acid, ferulic acid, arabinose, xylulose, trigonelline, and formate was found higher in FH-142. In addition to visual inspection, we performed a visual representative analysis (heat map) of the NMR data to find out metabolic differences among cotton varieties.

2.2. Heat map Analysis

Heat maps assisted in visualizing the variations in the metabolic contents with respect to three different cotton varieties. Further, in order to attain a clear picture of clustering between the metabolites as well as different types of cotton, a data refinement step was performed, where a single data encounter represented the geometric mean of the 6 replicates of each cotton variety calculated for each expression of metabolite. Figure 2 illustrates a heat map along with dendrograms showing clustering along both axes, i.e., between cotton varieties and the level of expression of various metabolites. To understand the expression level, the red and blue color code was used where a red color represents a higher expression level, while blue color depicts a lower expression and white signifies no change. As a result, Figure 2 evidently distinguishes the FH-142 variety to be different from MAC-07 and 496. This could be validated by the outcome in form of a heat map and dendrogram which is illustrating an entirely different pattern of expression for FH-142. Furthermore, a separate branching from the cluster of MAC-07 and 496 varieties reveals that these two varieties share a somewhat similar metabolite profile in comparison to FH-142. Summarily, primary metabolites such as tryptophan, tyrosine, arabinose, citrulline, maltose, fumarate, and GABA were highly expressed in the Bt cotton variety (FH-142). Likewise, the secondary metabolites cinnamic acid, succinic acid, stigmasterol, limonene, linoleic acid, and E-β-ocimene were also found to have increased expression in Bt cotton. This inferred that induction of Cry genes in cotton variety resulted in increased production of primary and secondary metabolites in cotton plants.

2.3. Multivariate Data Analysis

The variation in the metabolite content of cotton varieties was also evaluated by using multivariate data analysis (MvDA). The principal component analysis (PCA) was applied to apprehend the clustering features of three different cotton varieties. In PCA, five components contributed 84% total variance. PC1 explains 53% variation while PC2 accounts for 13% of the variation in data. It can be seen in Figure 3 that PCA was unable to show any meaningful separation among the data. The PCA results did not exhibit notable differences among samples so we applied Orthogonal partial latent square discriminant analysis (OPLSDA) to the data.
The OPLSDA model shows a complete discrimination among the cotton varieties. The model depicts high goodness of fit and low predictability with an R2X value of 0.888 and a Q2 value of 0.362. The model was further validated through a 100 permutations test and a Q2 value of −0.361 was found as shown in Figure 4B. The OPLSDA score plot shown in Figure 4A clearly separates FH-142 (Bt cotton) from MAC-07 and 496. FH-142 was separated by OPLS1 and OPLS2 from the other two varieties, whereas Mac-07 and 496 were projected together on the positive side of OPLS1. A detailed examination of the loading plot (Figure 4C) shows that terpinolene, α-ketoglutaric acid, aspartic acid, cinnamic acid, stigmasterol, fructose, maltose, arabinose, xylulose, cinnamic acid, and Shikimic acid were responsible for separating FH-142 from MAC-07 and 496. Consequently, malic acid and valine correspond more to 496 and MAC-07.
Although OPLSDA clearly separated FH-142 from MAC-07 and 496, MAC-07 and 496 were unable to be discriminated against. Hence, we performed the latest deep learning approaches such as LDA and neural networks on NMR data as a feature reduction approach. Initially, LDA failed to discriminate against the cotton varieties and resultantly most of the predictors given were in the unidentified region as a solution. We further filtered the data by excluding the unidentified region and performed LDA and neural network analyses. Consequently, there was a notable improvement in the results, as shown in Figure 5A. The score plot of LDA accounted for a 100% group classification among the cotton varieties. The discriminant analysis produced fourteen potential predictors classified the types of cotton at a significance level of 0.001, as elucidated in Figure 5B. The Fisher’s discriminant functions revealed that nonanoic acid, valine, alanine, citrulline, and arginine were major contributors in distinguishing between cotton varieties. Coefficients with the highest scores were attributed more toward the classification of a particular group. The classification results as presented in Figure 5C showed that 100% of the originally grouped data was correctly classified and showed a 100% predicted group membership. The fourteen significantly shortlisted predictors from LDA were further subjected to multilayer perceptron neural network analysis, which is a supervised deep learning approach deployed to test the efficiency of the data. The data consisting of independent variables were randomly divided into training and testing sets. The classification results (Figure 6A) from the multilayer perceptron model provided 100% correct prediction calculations for the training and testing data sets. The individual importance of variables in the prediction of a particular variety is elucidated in Figure 6B,C, which showed that malic acid, arginine, citrulline, and valine were the most important metabolites in distinguishing each cotton variety in the present model. Our results showed that there is concordance in the metabolites identified by conventional methods (PCA and OPLSDA) and deep learning methods (LDA and neural networks). However, OPLSDA has given more discriminating metabolites than LDA but LDA showed a clear separation between cotton varieties on the basis of 14 predictor metabolites, depicting clear discrimination of cotton variety to a specific class. Moreover, relative quantification results (Table and graphs (Supplementary Figure S2A,B)) also displayed a comparative level of differentiating metabolites among the studied cotton groups (Supplementary Table S1).

2.4. Metabolic Pathway Analysis

The annotated metabolites were characterized under general biochemical pathways such as tricarboxylic acid cycle (TCA), Shikimic acid pathway, glutamine synthetase/synthase (GS/GOGAT) cycle, and amino acid biosynthetic metabolism based on the search results in KEGG. The induction of the Cry gene regulated the concentration of organic acids, terpenoids, amino acids, and flavonoids. In the TCA cycle, the concentration of aconitic acid, α-ketoglutaric acid, succinic acid, and fumarate has increased in FH-142 (Bt cotton) as highlighted in green color in Figure 7A. The expression of Cry genes was shown to cause an incremental effect in their expression on a number of major intermediates of Shikimic acid pathways. The levels of Shikimic acid and quinic acid were significantly enhanced in FH-142 compared to other varieties. Furthermore, the upregulation of tryptophan (0.103) and tyrosine (0.0085) was observed in the FH cotton variety (Supplementary Table S1, Figure S2A,B). Similarly, the content of cinnamic acid and ferulic acid was also upregulated in FH-142. As these metabolites, along with shikimic acid, are important intermediates of plants’ secondary metabolism, the enhanced production of these compounds would result in a better defense mechanism in Bt cotton. Figure 7B illustrates the biological schema of the secondary metabolites and their flow of interrelationships. In GS/GOGAT biosynthetic pathway, transamination-related metabolites were also upregulated in FH-142 (Bt cotton).

3. Discussion

Recently, plant metabolomics has been extensively used to assess metabolic perturbations in plants caused by biotic or abiotic stresses [31] as well as for risk assessment of genetically modified crops [32]. Thereby, the present work was majorly focused on the identification of biomarkers responsible for insect resistance in pest-resistant cotton varieties. The metabolite characterization results depicted that cotton varieties are enriched in bioactive compounds. Among the identified compounds, the amount of cinnamic acid, E-β-ocimene, tryptophan, valine, uridine, dibutyl phthalate, GABA, succinic acid, fumarate, shikimic acid, arabinose, xylulose, trigonelline, glycine, di-allylic methylene, tyrosine, and malic acid was found to be high in FH-142 (Bt cotton) as compared to other varieties. Levande et al. [33] employed capillary electrophoresis to compare metabolic profiles of three Bt transgenic maize varieties with wild maize plants, the results revealed a significant difference in overall metabolite content. Likewise, Ning et al. [34] used GC–MS and UPLC–MS/MS to examine the cambium metabolomes of multi-gene stress resistant transgenic lines. The study concluded that a notable difference in the relative abundances of sucrose, arginine, uridine diphosphate glucose, glutamate, and catechol was observed in multi-gene stress-resistant transgenic lines of popular plants in comparison to non-transgenic ones. Further, we can state that the findings of our metabolomic analyses are in agreement with the past studies.
Amino acids are key products of primary metabolism, they are involved in multiple functions in plants such as growth promoters, cell wall biosynthesis, osmoregulator, and intermediates of secondary metabolites [27]. Previous data have shown variation in amino acid content among transgenic lines as compared to wild type [34]. In the present study, a significant increase in the amount of valine, GABA, tryptophan, tyrosine, glycine, and arabinose was observed in FH-142. Additionally, aromatic amino acids such as tyrosine and tryptophan are not only the major components of plant proteins, but are also involved in the upregulation of growth hormones and secondary metabolites [27]. Tyrosine is a precursor of various secondary metabolites that serve as attractants and defensive compounds [35]. Tryptophan, an essential amino acid, plays an important role in the biosynthesis of precursors involved in plant growth, defense against biotic and abiotic stresses, and plant-insect interaction [36]. Valine is involved in the biosynthesis of glucosinolates which has a deterring effect on micro-organisms and herbivory [37]. A previous study showed that by exogenous application of amino acids on flowers of Bt cotton the concentration of Bt proteins could be increased [38], therefore, proposing that the concentration of Bt proteins could be increased with the upregulation of amino acids. GABA, a non-protein amino acid act as a signaling molecule in plants [39]. It works either by having a direct inhibitory effect by regulating a defense mechanism or a combination of both [39]. Shelp et al. [40] reported the indirect role of GABA in plant growth as well as the regulation of defense mechanisms against biotic and abiotic stresses. Scholz et al. [41] investigated the impact of insect feeding on the mutants of Arabidopsis thaliana. The results demonstrated that accumulation of GABA occurred upon insect infestation. Furthermore, a decline in the development of Spodoptera littoralis larvae was observed when fed on GABA enriched artificial diet. Bown et al. [42] observed that non-wounding insects crawling on leaves induced the production of GABA, thus inferring the role of GABA in the defense mechanism of plants. The enhanced production of GABA in Bt cotton could possibly be responsible for conferring resistance against Helicoverpa armigera, pink bollworm, and diamond black moth.
Carbohydrates are major energy bearers in plants [43]. Carbohydrates such as sucrose, glucose, fructose, and raffinose are notable compatible solutes in plants [44]. In our work, elevation in the concentration of arabinose and xylulose was observed in Bt cotton. Arabinose, which is a constituent of plant cell walls, plays a pivotal role in the synthesis of cell wall intermediates, flavonoids, and signaling peptides [45]. Zayed [46] studied the role of arabinose in plants under salt stress. He found that arabinose-based glycoproteins help plants in tolerating salt stress. Therefore, a high concentration of arabinose in leaves of FH-142 might play an important role in tolerating abiotic stress.
Among organic acids, fumarate, malic acid, succinic acid, shikimic acid, and malic acid were found in high concentrations in FH-142. These metabolites are intermediates of primary and secondary metabolism. Fumarate, malic acid, and succinate are intermediates of the tricarboxylic acid cycle (TCA), and are also involved in regulating pH in plants [47]. A study conducted by Zhou et al. [48] demonstrated irregularity in the levels of TCA intermediates in Bt-transgenic rice grown under insecticidal stress. After an early decrease, a significant increase in the concentration of TCA intermediates was reported. These findings are in good agreement with our results. Further, Fahnenstich et al. [49] proposed that malate and fumarate play key roles in the primary metabolism of A. thaliana. Reactive oxygen species (ROS) are normally produced in plants. An increase in the concentration of ROS species occurring during the stress conditions leads to oxidative stress [50], which resultantly assists in the release of hydroxyl radicals which damages plant macromolecules especially proteins [51]. However, Succinic acid and malic acid have scavenging effects on ROS species [52]. The upregulation of these metabolites possibly protects FH-142 against abiotic stress. Shikimic acid is an indispensable component of the Shikimate and phenylpropanoid pathways [53]. The elevated level of shikimic acid is the result of the biosynthesis of secondary metabolites which are involved in the plant’s defense mechanisms such as flavonoids and lignins [48]. Thus, a high level of shikimate might lead to the upregulation of secondary metabolites in FH-142 which can lead to better defense against pest attacks.
In the current study, a few terpenoids including limonene, terpinolene, stigmasterol, and E-β-ocimene were also identified. E-β-ocimene, a monoterpene, is known to help plants in pollination. β-ocimene is a major component of floral scents [54]. In addition to playing the role of pollinator attractor, E-β-ocimene was found to be involved in direct insect defense in M. truncatula [55]. Kang et al. [56] proposed that β-ocimene was responsible for activating the defense mechanism of Chinese cabbage against M. persicae. In cotton plants, β-ocimene was found to be synthesized in response to insect attacks [6]. We have also found a high level of E-β-ocimene in FH-142, which may also be responsible for inducing insect resistance in Bt cotton.
Plants use nucleotides to induce defense mechanisms against predators and pathogens [57]. We observed an elevated level of uridine in FH-142, which can also be an important factor in inducing the cotton plant’s defense mechanism. Furthermore, we also observed different levels of cinnamic acid in pest-resistant varieties as compared to non-pest-resistant varieties.
Multivariate data analysis (MvDA) has extensively been used in metabolomic studies [58]. It is primarily used to process massive data obtained through analytical approaches, albeit yielding useful information [59]. The two most commonly used MvDA platforms are the principal component analysis (PCA) and the partial least square regression (PLS) method [60]. PCA is an unsupervised approach, often used as a conventional method in plant metabolomics [61]. Although, PCA is a beneficial platform in MvD; however, occasionally it fails to analyze data obtained from multifactorial experiments [62]. In the current work, PCA was found unable to differentiate the cotton varieties. Therefore, we applied different algorithms like OPLSDA, LDA, and neural networks to differentiate cotton varieties. OPLSDA has given more discriminating metabolites, in comparison to LDA which presented a model with lesser predictors. However, LDA showed a clear segregation between the cotton varieties compared to OPLSDA. The difference in the discriminating metabolites given by OPLSDA and LDA is mainly due to the dimensions in which these tools project the data for feature reduction. LDA projects data in a low dimensional space i.e., K-1 [63]. While PCA projects data in K- dimensions [64]. Balakrishnama and Ganapathiraju, Ref. [65] reported that PCA does more feature classification, while LDA does more data separation which is in accordance with our results. Likewise, previously reported data by Alves et al. [66] applied nine different algorithms to find the best identification tool for fibromyalgia. They inferred that SPA-LDA is a reliable tool in the clinical diagnosis of fibromyalgia. It concluded that NMR-based metabolomics conjoined with multivariate data analysis. In particular, LDA is a powerful platform in differentiating plant samples. Moreover, the elevated levels of defensive metabolites in FH-142 might be responsible for conferring resistance against insect herbivory.

4. Materials and Methods

4.1. Chemicals

All the chemicals and solvents utilized in the present work were of analytical grade and purchased from Cambridge Isotope Laboratories and Acros. The chemicals used were included deutrium oxide (D2O) (Cat-No: 14D-099), sodium deuterium oxide (NaOD) (Cat-No.: DLM-45-PK), methanol-d4 (Cat No. DLM-24-PK), 2,2,3,3-D4 (D,98%) sodium-3-trimethylsilylpropionate (TMSP) (Cat-No: I-18625), potassium dihydrogen phosphate (KH2PO4) (Cat No. AC424200250), and liquid nitrogen.

4.2. Sample Collection

Three different varieties of cotton plant namely MAC-07 (non-transgenic insect = resistant American variety), FH-142 (transgenic insect-resistant variety having Cry proteins, Bt cotton), and 496 (non-insect resistant variety) were procured from the Institute of Agriculture sciences, Punjab University Lahore. The leaf samples from six replicates of each variety were harvested 6 months after the sowing of cotton seeds, labeled, and plunged directly into liquid nitrogen. The samples were quenched to stop enzymatic degradation. Samples were then brought to the research lab of IIB, GCU Lahore. Afterward, the leaves were cryogenically dried using a mortar and pestle, and lyophilized for 48–72 h. The lyophilized plant samples were stored in a −80 °C freezer (Model: MDF-594, Ultra-low, SANYO, Osaka, Japan) till further use.

4.3. NMR Sample Preparation

The plant extract was prepared by following the previously reported protocol (Hussain et al., 2018 [17]) with minor modifications. A sample of 50 mg lyophilized plant material was transferred to an Eppendorf tube, to which methanol d4 (500 µL) and 500 µL of KH2PO4 buffer (in D2O containing 0.01% TMSP, pH 6.0) were added. To ensure proper mixing of the constituents, the mixture was vortexed for 1 min followed by ultrasonication for 20 min. Later, the extract was centrifuged (SIGMA laboratory centrifuges 3K30) for 5 min at 13,000 rpm at 4 °C. After centrifugation, the supernatant of each sample (600 µL) was transferred to a 5 mm NMR tube for NMR analysis.

4.4. NMR Acquisition

NMR acquisition was performed by following the parameters as mentioned in (Hussain et al., 2018 [17]) with few modifications. 1H-NMR spectra were obtained by using Avance neo 600 MHz spectrophotometer. Methanol-d4 was used as an internal lock. The spectra were recorded with 128 scans and 64K data points using Bruker’s pulse program “ZG” without solvent suppression. The spectra were Fourier transformed with line broadening (Lb = 0.3). However, no zero filling was added.

4.5. Data Processing

The 1H-NMR spectra were referenced to TMSP (at δ 0.0 ppm), manually phase and baseline corrected by using MestreNova (14.0 version software). Later, all the spectra were reduced to the ASCII file prior to MvDA. The spectral intensities were normalized to the total area. The spectral region from δ 0.5–9.0 ppm was bucketed into bins of equal width i.e., 0.01 thus yielding 995 variables for each spectrum. The residual water (δ 4.7–4.8) and solvent peak (δ 3.30) was excluded from the data. The resultant file was imported to Simca P (14.0 version) for multivariate data analysis.

4.6. Multivariate Data Analysis (MvDA)

Heat map analysis was performed in order to have a systemic overview and clustering of metabolic distribution, by graphically representing the color-coded values among three groups. R platform was used to generate heat maps after the transformation of the raw data to z-score.
In total, 18 processed NMR files were subjected to PCA using SimcaP (14.0 version). Pareto scaling was utilized to normalize the data. PCA was performed to analyze intrinsic variation between the samples. In order to analyze the maximum variance between samples supervised approaches such as OPLSD-DA (by using SimcaP) were carried out. The quality of the models was described by R2 and Q2 values. R2 is defined as the proportion of variance in the data explained by the models and indicates the goodness-of-fit, and Q2 is defined as the proportion of variance in the data predictable by the model, and indicates predictability [67]. The model was validated by performing 100 permutations randomly.
Later, metabolic pathway analysis was performed with the help of the Kegg pathway database (https://www.genome.jp/kegg/pathway.html (accessed on 30 November 2021)).

4.7. Statistical Analysis

Graphpad prism software (8.44 version), International Business Machine (IBM)—Statistical Package for Social Sciences (SPSS) version 23, SimcaP (14.0 version), and R programming were used for the data analysis. We performed a one-way analysis of variance (ANOVA) and Tukey’s post hoc test of multiple comparisons to check the level of significance of marker metabolites among three varieties. The statistical analysis was performed with a 95% of confidence level and probabilistic value (p < 0.05) indicates statistical significance. LDA (Linear Discriminant Analysis) was done to classify the metabolites based on different cotton varieties. Multilayer Perceptron neural network was trained to test for key metabolites for different cotton varieties. Analyses were declared significant for p-value < 0.05.

5. Conclusions

1H-NMR spectroscopy coupled with multivariate data analysis is a useful method to characterize the metabolic profiles of pest-resistant (FH-142 & MAC-07) and non-pest-resistant (496) cotton varieties. A wide range of bioactive compounds primarily amino acids, carbohydrates, organic acids, terpenoids, fatty acids, and phenylpropanoid derivatives was identified in cotton varieties. We found LDA is a more effective technique than PCA and OPLSDA in classifying cotton varieties on the basis of metabolites. The major discriminating metabolites among cotton varieties are shikimic acid, malic acid, xylulose, maltose, terpinolene, α-ketoglutaric acid, aspartic acid, cinnamic acid, stigmasterol, proline, valine, and fructose. FH-142 (Bt cotton) showed an increase in the concentration of defense-related metabolites, which is probably due to the presence of Cry genes. Furthermore, these metabolites might be responsible for inducing resistance against insect attack in Bt cotton. The bioengineering of defense-related metabolic pathways might be helpful in enhancing the resistance of Bt cotton against insect herbivory.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/molecules28041763/s1, Figure S1: (A) 2D J-resolve of FH-142; (B) 2D J-resolve of 496; (C) 2D J-resolve of MAC; Figure S2: (A) Relative quantification of metabolites, (B) Relative quantification of metabolites, Table S1: Relative quantification of metabolites in different cotton varieties.

Author Contributions

Conceptualization: M.T.A. and A.A.S.; methodology: A.A.S. and M.W.M.; software: A.T.; validation, A.A.S., M.T.A., A.T. and M.W.M.; formal analysis: S.S.-u.-H. and H.I.; investigation: M.T.A. and A.A.S.; resources: H.M., M.N. and S.S.-u.-H. data curation: M.T.A. and S.U.C.; writing—original draft preparation: M.T.A., M.N. and A.A.S.; writing—review and editing: B.M. and H.M.; visualization: S.S.-u.-H. and S.U.C.; supervision: M.T.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available within the article.

Acknowledgments

The authors gratefully acknowledge the support of the Government College University, Lahore for the lab facilities and Lahore University of Management Sciences (LUMS) for their support in the completion of the project.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

All the data processed and used during the current study are available from the Institute of Industrial Biotechnology, Government College University Lahore on request.

References

  1. Malik, C.P.; Sanadhya, D. Genetic Engineering and GM Crops. J. Plant Sci. Res. 2018, 34, 251–258. [Google Scholar] [CrossRef]
  2. Julia, B.S.; Parker, J.E.; Ainsworth, E.A.; Oldroyd, G.E.D.; Schroeder, J.I. Genetic strategies for improving crop yields. Nature 2019, 575, 109–118. [Google Scholar]
  3. Nix, A.; Paull, C.C.; Colgrave, M. Flavonoid profile of the cotton plant, Gossypium hirsutum: A review. Plants 2017, 6, 43. [Google Scholar] [CrossRef]
  4. Ashraf, S.; Sangi, A.H.; Hassan, Z.Y.; Luqman, M. Future of cotton sector in Pakistan: A 2025 Outlook. Pak. J. Agric. Sci. 2018, 31, 145. [Google Scholar] [CrossRef]
  5. Nehra, V.; Saharan, B.S.; Choudhary, M. Evaluation of Brevibacillus brevis as a potential plant growth promoting rhizobacteria for cotton (Gossypium hirsutum) crop. SpringerPlus 2016, 5, 948. [Google Scholar] [CrossRef]
  6. Egbuta, M.A.; McIntosh, S.; Waters, D.L.E.; Vancov, T.; Liu, L. Biological importance of cotton by-products relative to chemical constituents of the cotton plant. Molecules 2017, 22, 93. [Google Scholar] [CrossRef] [PubMed]
  7. Tokel, D.; Genc, B.N.; Ozyigit, I.J. Economic impacts of Bt (Bacillus thuringiensis) cotton. J. Nat. Fibers 2021, 19, 4622–4639. [Google Scholar] [CrossRef]
  8. Bravo, A.; Likitvivatanavong, S.; Gill, S.S.; Soberón, M. Bacillus thuringiensis: A story of a successful bioinsecticide. Insect Biochem. Mol. Biol. 2011, 41, 423–431. [Google Scholar] [CrossRef] [PubMed]
  9. Bilal, M.F.; Saleem, M.F.; Wahid, M.A.; Shakeel, A.; Maqbool, M. Adoption of Bt cotton: Threats and challenges. Chil. J. Agric. Res. 2012, 72, 419. [Google Scholar] [CrossRef]
  10. Pichersky, E.; Gang, D.R. Genetics and biochemistry of secondary metabolites in plants: An evolutionary perspective. Trends Plant Sci. 2000, 5, 439–445. [Google Scholar] [CrossRef] [PubMed]
  11. Deborde, C.; Moing, A.; Roch, L.; Jacob, D.; Rolin, D.; Giraudeau, P. Plant metabolism as studied by NMR spectroscopy. Prog. Nucl. Mag. Res. Sp. 2017, 102, 61–97. [Google Scholar] [CrossRef] [PubMed]
  12. Bloem, E.; Haneklaus, S.; Schnug, E. Significance of Sulfur Compounds in the Protection of Plants against Pests and Diseases. J. Plant Nutr. 2005, 28, 763–784. [Google Scholar] [CrossRef]
  13. Mazid, M.; Khan, T.A.; Mohammad, F. Role of secondary metabolites in defense mechanisms of plants. Biol. Med. 2011, 3, 232–249. [Google Scholar]
  14. Divekar, P.A.; Narayana, S.; Divekar, B.A.; Kumar, R.; Gadratagi, B.G.; Ray, A.; Singh, A.K.; Rani, V.; Singh, V.; Singh, A.K.; et al. Plant secondary metabolites as defense tools against herbivores for sustainable crop protection. Int. J. Mol. Sci. 2022, 23, 2690. [Google Scholar] [CrossRef] [PubMed]
  15. Tian, C.; Chikayama, E.; Tsuboi, Y.; Kuromori, T.; Shinozaki, K.; Kikuchi, J.; Hirayama, T. Top-down phenomics of Arabidopsis thaliana metabolic profiling by one-and two-dimensional nuclear magnetic resonance spectroscopy and transcriptome analysis of albino mutants. J. Biol. Chem. 2007, 282, 18532–18541. [Google Scholar] [CrossRef]
  16. Hong, J.; Yang, L.; Zhang, D.; Shi, J. Plant metabolomics: An indispensable system biology tool for plant science. Int. J. Mol. Sci. 2016, 17, 767. [Google Scholar] [CrossRef] [PubMed]
  17. Hussain, R.M.F.; Kim, H.K.; Khurshid, M.; Akhtar, M.T.; Linthorst, H.J. Overexpression of AtWRKY50 is correlated with enhanced production of sinapic derivatives in Arabidopsis. Metabolomics 2018, 14, 25. [Google Scholar] [CrossRef]
  18. Brennan, L. NMR-based metabolomics: From sample preparation to applications in nutrition research. Prog. Nucl. Magn. Reson. Spectrosc. 2014, 83, 42–49. [Google Scholar] [CrossRef]
  19. Moco, S. Studying metabolism by NMR-based metabolomics. Front. Mol. Biosci. 2022, 372, 882487. [Google Scholar] [CrossRef]
  20. David, A.; Rostkowski, P. Analytical techniques in metabolomics. In Environmental Metabolomics; Elsevier: Amsterdam, The Netherlands, 2020; pp. 35–64. [Google Scholar]
  21. Antonio, J.G.M.; Brasil, J.M.; Cruz, G.C.F.; de Souza, R.N.; Tasic, L. NMR-based metabolomics strategies: Plants, animals and humans. Anal. Methods 2017, 9, 1078–1096. [Google Scholar]
  22. Emwas, A.H.; Roy, R.; McKay, R.T.; Tenori, L.; Saccenti, E.; Gowda, G.N.; Raftery, D.; Alahmari, F.; Jaremko, L.; Jaremko, M.; et al. NMR spectroscopy for metabolomics research. Metabolites 2019, 9, 123. [Google Scholar] [CrossRef] [PubMed]
  23. Ren, Y.; Wang, T.; Peng, Y.; Xia, B.; Qu, L.J. Distinguishing transgenic from non-transgenic Arabidopsis plants by 1H NMR-based metabolic fingerprinting. JGG 2009, 36, 621–628. [Google Scholar] [CrossRef]
  24. Leiss, K.A.; Choi, Y.H.; Abdel-Farid, I.B.; Verpoorte, R.; Klinkhamer, P.G.L. NMR metabolomics of thrips (Frankliniella occidentalis) resistance in Senecio hybrids. J. Chem. Ecol. 2009, 35, 219–229. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Becerra-Martinez, E.; Hernandez, Y.P.; Gloria, E.L.; Jimenez, M.G.B.; Martinez, D.H.; Vallejo, L.G.Z.; Ruano, N.V. 1H-NMR metabolomics profiling of recombinant tobacco plants holding a promoter of a sesquiterpene cyclase. Phytochem. Anal. 2020, 31, 480–487. [Google Scholar] [CrossRef] [PubMed]
  26. Bisht, H.; Bhatnagar, M.; Bisht, S.; Murthy, A.K. Study on variation of polar metabolites in control and water stressed Gossypium hirsutum L. using NMR spectroscopy. Asian J. Appl. Chem. 2018, 1, 1–12. [Google Scholar] [CrossRef]
  27. Yang, Q.; Zhao, D.; Liu, Q. Connections between Amino Acid Metabolisms in Plants: Lysine as an Example. Front. Plant Sci. 2020, 11, 928. [Google Scholar] [CrossRef]
  28. Taylor, R.E.; French, A.D.; Gamble, G.R.; Himmelsbach, D.S.; Stipanovic, R.D.; Thibodeaux, D.P.; Wakelyn, P.J.; Dybowski, C. 1H and 13C solid-state NMR of Gossypium barbadense (Pima) cotton. J. Mol. Struct. 2008, 878, 177–184. [Google Scholar] [CrossRef]
  29. NMRFAM. Available online: http://mmcd.nmrfam.wisc.edu/ (accessed on 10 June 2020).
  30. HMDB. Available online: https://hmdb.ca/ (accessed on 31 November 2019).
  31. Boonchaisri, S.; Rochfort, S.; Stevenson, T.; Dias, D.A. Recent developments in metabolomics-based research in understanding transgenic grass metabolism. Metabolomics 2019, 15, 47. [Google Scholar] [CrossRef] [PubMed]
  32. Bastien, C.; Pluskal, T.; Aubry, S.; Weng, J.K. Contribution of untargeted metabolomics for future assessment of biotech crops. Trends Plant Sci. 2018, 23, 1047–1056. [Google Scholar]
  33. Levandi, T.; Leon, C.; Kaljurand, M.; Garcia-Cañas, V.; Cifuentes, A. Capillary electrophoresis time-of-flight mass spectrometry for comparative metabolomics of transgenic versus conventional maize. Anal. Chem. 2008, 80, 6329–6335. [Google Scholar] [CrossRef]
  34. Ning, K.; Ding, C.; Zhu, W.; Dong, Y.; Shen, Y.; Su, X. Comparative metabolomic analysis of the cambium tissue of non-transgenic and multi-gene transgenic poplar (Populus × euramericana ‘Guariento’). Front. Plant Sci. 2018, 9, 1201. [Google Scholar] [CrossRef] [PubMed]
  35. Schenck, C.A.; Maeda, H.A. Tyrosine biosynthesis, metabolism, and catabolism in plants. Phytochemistry 2018, 149, 82–102. [Google Scholar] [CrossRef]
  36. Zemanová, V.; Pavlík, M.; Pavlíková, D.; Tlustoš, P. The significance of methionine, histidine and tryptophan in plant responses and adaptation to cadmium stress. Plant Soil Environ. 2014, 60, 426–432. [Google Scholar] [CrossRef]
  37. Mikkelsen, M.D.; Halkier, B.A. Metabolic engineering of valine-and isoleucine-derived glucosinolates in Arabidopsis expressing CYP79D2 from cassava. Plant Physiol. 2003, 131, 773–779. [Google Scholar] [CrossRef] [Green Version]
  38. Tambel, L.I.M.; Zhou, M.; Chen, Y.; Zhang, X.; Chen, Y.; Chen, D. Amino acids application enhances flowers insecticidal protein content in Bt cotton. J. Cotton Sci. 2019, 2, 7. [Google Scholar] [CrossRef]
  39. Ramos-Ruiz, R.; Martinez, F.; Knauf-Beiter, G. The effects of GABA in plants. Cogent Food Agric. 2019, 5, 1670553. [Google Scholar] [CrossRef]
  40. Shelp, B.J.; Aghdam, M.S.; Flaherty, E.J. γ-Aminobutyrate (GABA) regulated plant defense: Mechanisms and opportunities. Plants 2021, 10, 1939. [Google Scholar] [CrossRef]
  41. Scholz, S.S.; Reichelt, M.; Mekonnen, D.W.; Ludewig, F.; Mithöfer, A. Insect herbivory-elicited GABA accumulation in plants is a wound-induced, direct, systemic, and jasmonate-independent defense response. Front. Plant Sci. 2015, 6, 1128. [Google Scholar] [CrossRef]
  42. Bown, A.W.; Hall, D.E.; MacGregor, K.B. Insect footsteps on leaves stimulate the accumulation of 4-aminobutyrate and can be visualized through increased chlorophyll fluorescence and superoxide production. Plant Physiol. 2002, 129, 1430–1434. [Google Scholar] [CrossRef]
  43. Trouvelot, S.; Héloir, M.C.; Poinssot, B.; Gauthier, A.; Paris, F.; Guillier, C.; Combier, M.; Trdá, L.; Daire, X.; Adrian, M. Carbohydrates in plant immunity and plant protection: Roles and potential application as foliar sprays. Front. Plant Sci. 2014, 5, 592. [Google Scholar] [CrossRef]
  44. Krasavina, M.S.; Burmistrova, N.A.; Raldugina, G.N. The role of carbohydrates in plant resistance to abiotic stresses. In Emerging Technologies and Management of Crop Stress Tolerance; Academic Press: Cambridge, MA, USA, 2014; pp. 229–270. [Google Scholar]
  45. Rautengarten, C.; Birdseye, D.; Pattathil, S.; McFarlane, H.E.; Saez-Aguayo, S.; Orellana, A.; Persson, S.; Hahn, M.G.; Scheller, H.V.; Heazlewood, J.L.; et al. The elaborate route for UDP-arabinose delivery into the Golgi of plants. Proc. Natl. Acad. Sci. USA 2017, 114, 4261–4266. [Google Scholar] [CrossRef] [PubMed]
  46. Zayed, O.M. The Role of Plant Cell Wall Arabinose in Salt Stress Sensing and Adaptation. Ph.D. Thesis, Purdue University Graduate School, West Lafayette, IN, USA, 2019. [Google Scholar]
  47. Araujo, W.L.; Nesi, A.N.; Fernie, A.R. Fumarate: Multiple functions of a simple metabolite. Phytochemistry 2011, 72, 838–843. [Google Scholar] [CrossRef] [PubMed]
  48. Zhou, J.; Zhang, L.; Chang, Y.; Lu, X.; Zhu, Z.; Xu, G. Alteration of leaf metabolism in Bt-transgenic rice (Oryza sativa L.) and its wild type under insecticide stress. J. Proteome Res. 2012, 11, 4351–4360. [Google Scholar] [CrossRef] [PubMed]
  49. Fahnenstich, H.; Saigo, M.; Andreo, C.; Drincovich, M.F.; Flügge, U.; Maurino, V.G. Malate and Fumarate Emerge as Key Players in Primary Metabolism: Arabidopsis thaliana Overexpressing C 4-NADP-ME Offer a Way to Manipulate the Levels of Malate and to Analyse the Physiological Consequences. In Photosynthesis: Energy from the Sun; Springer: Dordrecht, The Netherlands, 2008; pp. 971–975. [Google Scholar]
  50. Dumont, S.; Rivoal, J. Consequences of oxidative stress on plant glycolytic and respiratory metabolism. Front. Plant Sci. 2019, 10, 166. [Google Scholar] [CrossRef] [PubMed]
  51. Sies, H.; Berndt, C.; Jones, D.P. Oxidative stress. Annu. Rev. Biochem. 2017, 86, 715–748. [Google Scholar] [CrossRef] [PubMed]
  52. Guo, R.; Shi, L.X.; Yang, C.W.; Yan, C.R.; Zhong, X.L.; Liu, Q.; Xia, X.; Li, H.R. Comparison of ionomic and metabolites response under alkali stress in old and young leaves of cotton (Gossypium hirsutum L.) seedlings. Front. Plant Sci. 2016, 7, 1785. [Google Scholar] [CrossRef] [PubMed]
  53. Dixon, R.A.; Achnine, L.; Kota, P.; Liu, C.J.; Reddy, M.S.; Wang, L.J. The phenylpropanoid pathway and plant defence—A genomics perspective. Mol. Plant Pathol. 2002, 3, 371–390. [Google Scholar] [CrossRef] [PubMed]
  54. Farré-Armengol, G.; Filella, I.; Llusià, J.; Peñuelas, J. β-Ocimene, a key floral and foliar volatile involved in multiple interactions between plants and other organisms. Molecules 2017, 22, 1148. [Google Scholar] [CrossRef]
  55. Navia-Giné, W.G.; Yuan, J.S.; Mauromoustakos, A.; Murphy, J.B.; Chen, F.; Korth, K.L. Medicago truncatula (E)-β-ocimene synthase is induced by insect herbivory with corresponding increases in emission of volatile ocimene. Plant Physiol. Biochem. 2009, 47, 416–425. [Google Scholar] [CrossRef]
  56. Kang, Z.W.; Liu, F.H.; Zhang, Z.F.; Tian, H.G.; Liu, T.X. Volatile β-ocimene can regulate developmental performance of peach aphid Myzus persicae through activation of defense responses in Chinese cabbage Brassica pekinensis. Front. Plant Sci. 2018, 9, 708. [Google Scholar] [CrossRef]
  57. Park, S.H.; Scheffler, J.; Scheffler, B.; Cantrell, C.L.; Pauli, C.S. Chemical defense responses of upland cotton, Gossypium hirsutum L. to physical wounding. Plant Direc. 2019, 3, e00141. [Google Scholar] [CrossRef] [PubMed]
  58. Bhalla, R.; Narasimhan, K.; Swarup, S. Metabolomics and its role in understanding cellular responses in plants. Plant Cell Rep. 2005, 24, 562–571. [Google Scholar] [CrossRef] [PubMed]
  59. Okada, T.; Afendi, F.M.; Altaf-Ul-Amin, M.; Takahashi, H.; Nakamura, K.; Kanaya, S. Metabolomics of medicinal plants: The importance of multivariate analysis of analytical chemistry data. Curr. Comput. Aided Drug Des. 2010, 6, 179–196. [Google Scholar] [CrossRef]
  60. Møller, S.F.; von Frese, J.; Bro, R. Robust methods for multivariate data analysis. J. Chemom. 2005, 19, 549–563. [Google Scholar] [CrossRef]
  61. Choi, H.K.; Choi, Y.H.; Verberne, M.; Lefeber, A.W.M.; Erkelens, C.; Verpoorte, R. Metabolic fingerprinting of wild type and transgenic tobacco plants by 1H NMR and multivariate analysis technique. Phytochemistry 2004, 65, 857–864. [Google Scholar] [CrossRef]
  62. Johnson, H.E.; Lloyd, A.J.; Mur, L.A.J.; Smith, A.R.; Causton, D.R. The application of MANOVA to analyse Arabidopsis thaliana metabolomic data from factorially designed experiments. Metabolomics 2007, 3, 517–530. [Google Scholar] [CrossRef]
  63. Tharwat, A.; Gaber, T.; Ibrahim, A.; Hassanien, A.E. Linear discriminant analysis: A detailed tutorial. AI Commun. 2017, 30, 169–190. [Google Scholar] [CrossRef]
  64. Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments, Philosophical Transactions of the Royal Society A: Mathematical. Phys. Eng. Sci. 2016, 374, 20150202. [Google Scholar]
  65. Balakrishnama, S.; Ganapathiraju, A.; Picone, J. Linear discriminant analysis for signal processing problems. In Proceedings of the IEEE Southeastcon’99, Technology on the Brink of 2000 (Cat. No. 99CH36300), Lexington, KY, USA, 25–28 March 1999; pp. 78–81. [Google Scholar]
  66. Alves, M.V.S.; Maciel, L.I.L.; Ramalho, R.R.F.; Lima, L.A.S.; Vaz, B.G.; Morais, C.L.M.; Passos, J.O.S.; Pegado, R.; Lima, K.M.G. Multivariate classification techniques and mass spectrometry as a tool in the screening of patients with fibromyalgia. Sci. Rep. 2021, 11, 22625. [Google Scholar] [CrossRef]
  67. Mahadevan, S.; Shah, S.L.; Marrie, T.J.; Slupsky, C.M. Analysis of metabolomic data using supporting vector machines. Anal. Chem. 2008, 80, 7562–7570. [Google Scholar] [CrossRef]
Figure 1. Annotated bioactive constituents in G. hirsutum cultivars (NB-01, non-pest-resistant variety); PR-01 (pest-resistant American variety); PR-02 (transgenic pest-resistant variety having Cry proteins, Bt cotton). (1) nonanoic acid, (2) valine, (3) alanine, (4) citrulline, (5) arginine, (6) limonene, (7) linoleic acid, (8) γ-aminobutyric acid; (9) malic acid, (10) succinic acid, (11) terpinolene, (12) di-allylic methylene; (13) asparagine; (14) tryptophan; (15) choline; (16) sycloinositol; (17) proline; (18) glycine; (19) Shikimic acid; (20) arabinose; (21) fructose; (22) xylulose; (23) tartaric acid; (24) trigonelline; (25) e-β-ocimene; (26) stigmasterol; (27) maltose; (28) sucrose; (29) uridine; (30) maleic acid; (31) fumarate; (32) cinnamic acid; (33) tyrosine; (34) dibutyl phthalate; (35) formate; (36) aspartic acid; (37) aconitic acid; (38) α-ketoglutaric acid; (39) chlorogenic acid; (40) ferulic acid; (41) quinic acid.
Figure 1. Annotated bioactive constituents in G. hirsutum cultivars (NB-01, non-pest-resistant variety); PR-01 (pest-resistant American variety); PR-02 (transgenic pest-resistant variety having Cry proteins, Bt cotton). (1) nonanoic acid, (2) valine, (3) alanine, (4) citrulline, (5) arginine, (6) limonene, (7) linoleic acid, (8) γ-aminobutyric acid; (9) malic acid, (10) succinic acid, (11) terpinolene, (12) di-allylic methylene; (13) asparagine; (14) tryptophan; (15) choline; (16) sycloinositol; (17) proline; (18) glycine; (19) Shikimic acid; (20) arabinose; (21) fructose; (22) xylulose; (23) tartaric acid; (24) trigonelline; (25) e-β-ocimene; (26) stigmasterol; (27) maltose; (28) sucrose; (29) uridine; (30) maleic acid; (31) fumarate; (32) cinnamic acid; (33) tyrosine; (34) dibutyl phthalate; (35) formate; (36) aspartic acid; (37) aconitic acid; (38) α-ketoglutaric acid; (39) chlorogenic acid; (40) ferulic acid; (41) quinic acid.
Molecules 28 01763 g001
Figure 2. Heat map showing an overall metabolite content in cotton varieties. M376 refers to Arabinose M716 to tyrosine, M348 to tryptophan, M241 to succinic acid, M648 to cinnamic acid, M674 to E-β-ocimene, M515 to maltose, M512 to stigmasterol, M200 to linoleic acid, M646 to fumarate, M296 to GABA, M278 to di-allylic methylene, M186 to limonene respectively. Red color shows an increased expression level while blue color displays decreased expression level. Hierarchical classification was performed to group metabolites from each group. Dendrograms show clustering between metabolites which were closely related based on their estimated Euclidean distance.
Figure 2. Heat map showing an overall metabolite content in cotton varieties. M376 refers to Arabinose M716 to tyrosine, M348 to tryptophan, M241 to succinic acid, M648 to cinnamic acid, M674 to E-β-ocimene, M515 to maltose, M512 to stigmasterol, M200 to linoleic acid, M646 to fumarate, M296 to GABA, M278 to di-allylic methylene, M186 to limonene respectively. Red color shows an increased expression level while blue color displays decreased expression level. Hierarchical classification was performed to group metabolites from each group. Dendrograms show clustering between metabolites which were closely related based on their estimated Euclidean distance.
Molecules 28 01763 g002
Figure 3. PCA score plot of cotton varieties.
Figure 3. PCA score plot of cotton varieties.
Molecules 28 01763 g003
Figure 4. (A) OPLSDA score plot of different cotton varieties; (B) validation by 100 permutations; (C) loading scatter plot of OPLSDA.
Figure 4. (A) OPLSDA score plot of different cotton varieties; (B) validation by 100 permutations; (C) loading scatter plot of OPLSDA.
Molecules 28 01763 g004
Figure 5. (A) Canonical score plot of discriminant analysis showing maximum variance in three varieties of cotton. (B) Fisher’s discriminant functions from LDA. M78, M79 are nonanoic acid; M90–M92 are valine; M144–M146 refer to alanine; M150–M151 refer to citrulline; M160–M162 refer to arginine; M233 is malic acid. (C) Classification model of the different cotton varieties.
Figure 5. (A) Canonical score plot of discriminant analysis showing maximum variance in three varieties of cotton. (B) Fisher’s discriminant functions from LDA. M78, M79 are nonanoic acid; M90–M92 are valine; M144–M146 refer to alanine; M150–M151 refer to citrulline; M160–M162 refer to arginine; M233 is malic acid. (C) Classification model of the different cotton varieties.
Molecules 28 01763 g005
Figure 6. (A) Multilayer perceptron neural network results. (B) Independent variable importance. M78, M79 are nonanoic acid; M90–M92 are valine; M144–M146 refer to alanine; M150–M151 refer to citrulline; M160–-M162 refer to arginine; M233 is malic acid. (C) Fancier versions of importance table.
Figure 6. (A) Multilayer perceptron neural network results. (B) Independent variable importance. M78, M79 are nonanoic acid; M90–M92 are valine; M144–M146 refer to alanine; M150–M151 refer to citrulline; M160–-M162 refer to arginine; M233 is malic acid. (C) Fancier versions of importance table.
Molecules 28 01763 g006
Figure 7. (A) Tricarboxylic acid cycle; (B) an overview of the secondary metabolites biosynthetic pathways.
Figure 7. (A) Tricarboxylic acid cycle; (B) an overview of the secondary metabolites biosynthetic pathways.
Molecules 28 01763 g007
Table 1. List of identified metabolites from G. hirsutum.
Table 1. List of identified metabolites from G. hirsutum.
Sr. No.MetabolitesChemical Shift (ppm)MultiplicityAssignment Methods
1Nonanoic acid0.82DoubletJRES/1D-HNMR
2Valine0.95DoubletJRES/1D-HNMR
3Alanine1.48DoubletJRES/1D-HNMR
4Citrulline1.56MultipletJRES/1D-HNMR
5Arginine/myristic acid1.68MultipletJRES/1D-HNMR
6Limonene1.91MultipletJRES/1D-HNMR
7Linoleic acid2.06MultipletJRES/1D-HNMR
8γ-aminobutyric acid2.30
3.01
triplet
triplet
JRES/1D-HNMR
9Malic acid2.39doublet of doubletJRES/1D-HNMR
10Succinic acid2.45SingletJRES/1D-HNMR
11Terpinolene2.71SingletJRES/1D-HNMR
12di-allylic methylene2.80MultipletJRES/1D-HNMR
13Asparagine2.94MultipletJRES/1D-HNMR
14Tryptophan3.15singletJRES/1D-HNMR
3.50doublet
7.53doublet
15Choline3.20SingletJRES/1D-HNMR
16Scyloinositol3.21SingletJRES/1D-HNMR
17Proline3.40triplet of doubletJRES/1D-HNMR
18Glycine3.54SingletJRES/1D-HNMR
19Shikimic acid4.01MultipletJRES/1D-HNMR
20Arabinose3.82
4.58
doublet of doublet
doublet
JRES/1D-HNMR
21Fructose3.85DoubletJRES/1D-HNMR
22Xylulose4.28doublet of doubletJRES/1D-HNMR
23Tartaric acid4.35SingletJRES/1D-HNMR
24Trigonelline4.45singletJRES/1D-HNMR
9.14singlet
25E-β-ocimene5.12singletJRES/1D-HNMR
6.79doublet of doublet
26Stigmasterol5.15doublet of doubletJRES/1D-HNMR
27Maltose5.20DoubletJRES/1D-HNMR
28Sucrose5.45DoubletJRES/1D-HNMR
29Uridine5.95DoubletJRES/1D-HNMR
30Maleic acid6.05SingletJRES/1D-HNMR
31Fumarate6.50SingletJRES/1D-HNMR
32Cinnamic acid6.52doubletJRES/1D-HNMR
7.60multiplet
33Tyrosine7.20DoubletJRES/1D-HNMR
34Dibutyl phthalate7.77SingletJRES/1D-HNMR
35Formate8.47SingletJRES/1D-HNMR
36Aspartic acid2.68Doublet of doubletJRES/1D-HNMR
37Aconitic acid3.43DoubletJRES/1D-HNMR
38α-Ketoglutaric acid2.43tripletJRES/1D-HNMR
39Chlorogenic acid7.12Doublet of doubletJRES/1D-HNMR
40Ferulic acid7.07
6.91
Doublet of doublet
Doublet
JRES/1D-HNMR
41Quinic acid3.56Doublet of doubletJRES/1D-HNMR
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shami, A.A.; Akhtar, M.T.; Mumtaz, M.W.; Mukhtar, H.; Tahir, A.; Shahzad-ul-Hussan, S.; Chaudhary, S.U.; Muneer, B.; Iftikhar, H.; Neophytou, M. NMR-Based Metabolomics: A New Paradigm to Unravel Defense-Related Metabolites in Insect-Resistant Cotton Variety through Different Multivariate Data Analysis Approaches. Molecules 2023, 28, 1763. https://doi.org/10.3390/molecules28041763

AMA Style

Shami AA, Akhtar MT, Mumtaz MW, Mukhtar H, Tahir A, Shahzad-ul-Hussan S, Chaudhary SU, Muneer B, Iftikhar H, Neophytou M. NMR-Based Metabolomics: A New Paradigm to Unravel Defense-Related Metabolites in Insect-Resistant Cotton Variety through Different Multivariate Data Analysis Approaches. Molecules. 2023; 28(4):1763. https://doi.org/10.3390/molecules28041763

Chicago/Turabian Style

Shami, Anam Amin, Muhammad Tayyab Akhtar, Muhammad Waseem Mumtaz, Hamid Mukhtar, Amna Tahir, Syed Shahzad-ul-Hussan, Safee Ullah Chaudhary, Bushra Muneer, Hafsa Iftikhar, and Marios Neophytou. 2023. "NMR-Based Metabolomics: A New Paradigm to Unravel Defense-Related Metabolites in Insect-Resistant Cotton Variety through Different Multivariate Data Analysis Approaches" Molecules 28, no. 4: 1763. https://doi.org/10.3390/molecules28041763

Article Metrics

Back to TopTop