Next Article in Journal
Sarcopenia as a Multisystem Disorder—Connections with Neural and Cardiovascular Systems—A Related PRISMA Systematic Literature Review
Previous Article in Journal
Visual Large Language Models in Radiology: A Systematic Multimodel Evaluation of Diagnostic Accuracy and Hallucinations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Preliminary Identification of Putative Terpene Synthase Genes in Caryocar brasiliense and Chemical Analysis of Major Components in the Fruit Exocarp

by
Helena Trindade
1,
Bruno Nevado
1,
Raquel Linhares Bello de Araújo
2,
Viviane Dias Medeiros Silva
2,3,
Lara Louzada Aguiar
2,
Ana Ribeiro
4,5,
Julio Onesio-Ferreira Melo
3,4,*,† and
Paula Batista-Santos
6,*,†
1
Centre for Ecology, Evolution and Environmental Change (CE3C), Global Change and Sustainability Institute, (CHANGE), Faculty of Sciences, University of Lisbon (FCUL), 1749-016 Lisbon, Portugal
2
Departamento de Alimentos, Faculdade de Farmácia, Campus Belo Horizonte, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, MG, Brazil
3
Departamento de Ciências Exatas e Biológicas, Universidade Federal de São João del-Rei (UFSJ), Campus Sete Lagoas, Sete Lagoas 35701-970, MG, Brazil
4
LEAF—Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017 Lisboa, Portugal
5
Faculdade Farmácia, Universidade de Lisboa, Av. Prof. Gama Pinto, 1649-003 Lisboa, Portugal
6
Centro de Estudos Florestais, Laboratório Associado TERRA, Instituto Superior de Agronomia, Universidade de Lisboa, Tapada da Ajuda, 1349-017 Lisboa, Portugal
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Life 2026, 16(1), 67; https://doi.org/10.3390/life16010067 (registering DOI)
Submission received: 26 October 2025 / Revised: 24 December 2025 / Accepted: 24 December 2025 / Published: 1 January 2026
(This article belongs to the Section Plant Science)

Abstract

Background: Caryocar brasiliense Camb. Caryocaraceae is a typical tree from the Brazilian Cerrado with commercial importance due to its edible fruit, known as pequi. This native plant holds significant economic value and is a key candidate for cropping systems. Rich in phytochemicals, such as phenolics, flavonoids, and terpenoids, it has shown notable health benefits. Methods: Considering the importance of terpenes and their biological properties, and based on the first draft genome of C. brasiliense, this study aimed to identify putative terpene synthase genes and classify them into the phylogenetic subfamilies previously identified across all plant lineages. The presence of terpenes was also verified in samples of the outer portion of the fruit by solid-phase microextraction gas chromatography mass-spectrometry. Results: Analysis of genome completeness showed that over 90% of genes were identified despite a highly fragmented assembly, with 71% containing complete gene sequences. Twenty-two genes were retained as putative terpene synthase genes considering their homology with the terpene synthase Hidden Markov Model (HMM) profiles in the Pfam-A database. Ten sequences with a minimum length of 298 amino acids were used for phylogenetic inference. In the resulting phylogenetic tree, C. brasiliense terpene synthase genes clustered within the different previously identified Angiosperm clades and allowed us to classify each gene into different phylogenetic subfamilies: six genes belonged to the h/d/a/b/g, three to the c, and one to the e/f. The headspace solid-phase microextraction technique, in conjunction with gas chromatography mass-spectrometry, has allowed for the identification of eleven chemical compounds, including a terpene. Conclusions: This initial identification of putative terpene synthase genes in pequi, together with the chemical analysis of the outer fruits, lays the groundwork for future studies aimed at optimizing terpene biosynthesis for both biological and commercial applications.

1. Introduction

The Cerrado biome is the second-largest biome of Brazil and is considered a hotspot due to the rapid loss of biological diversity caused by agriculture, livestock, and urbanization [1]. Pequi (Caryocar brasiliense Camb.) is a typical edible fruit from the Brazilian Cerrado, with great occurrence and economic importance in this region. Pequi trees play a valuable ecological role in these ecosystems, since they are a source of food and habitat for fauna [2]. These fruits are rich in nutrients, have unique sensory characteristics, and are used in regional cuisine, for the preparation of flours, for the extraction of oils for cosmetics, and for their therapeutic properties [3,4,5].
Pequi has recently been reported to have analgesic and anti-inflammatory properties [6], reducing oxidative stress, inflammation, and anemia associated with aging in Swiss mice [7]. It is also referred to have anticholinesterase and antioxidant activities, as well as to prevent memory loss in mice caused by aluminum consumption and brain lipid peroxidation [8]. Other studies with the ethanolic extract of pequi bark have shown very low toxicity in vitro and in vivo [9,10] as well as a protective effect against oxidative stress in human coronary artery endothelial cells [11], supporting its medicinal potential.
The compounds associated with positive health impacts include a list of specialized metabolites with active principles, ranging from phenolics, terpenes, and alkaloids [12,13]. Terpenoids are the largest group of specialized metabolites, and tens of thousands of terpenoid compounds have been identified in higher plants [14]. Chemically, they are polymeric isoprene units (C5) that can be arranged in various lengths of backbone polymers. Terpene synthase enzymes can give rise to a multitude of molecules that have been pivotal to the survival and evolution of higher plants. Furthermore, these compounds have been associated with beneficial health effects, such as anti-aggregatory, antiallergic, anti-coagulation, anti-inflammatory, neuroprotective, sedative, analgesic [15], and other biological properties [16], including antimicrobial and antifungal activities [15,17]. Some terpenes can be considered as ecological pesticides, and essential oils rich in terpenes have proven activity against several fungi [18,19]. Studies based on individual terpenes have also shown fungicide activity against Botrytis cinerea, a plant pathogen that affects cultures [20]. Terpenes have been previously identified from pequi fruits [21,22,23] and include several monoterpenes, e.g., α-phellandrene, β-myrcene, β-ocimene, and the diterpene geranyllinalool, just to mention a few. The technique of headspace solid-phase microextraction with gas chromatography—mass spectrometry analysis has been used to identify terpenes and other volatile compounds present in different fruits of the Cerrado, such as Eugenia dysenterica [24], Eugenia brasiliensis [25], Eugenia klotzschiana [26], and pequi peel [27]. Fruit peels have been used to extract a variety of bioactive compounds, including terpenes [28].
Considering the published genome sequencing data for Caryocar brasiliense [29], mining this information will allow the identification of genes responsible for important biological characteristics. In this study, we focused on the identification of putative genes involved in terpene biosynthesis in pequi, laying the foundation for future biotechnological approaches to improve terpene synthesis, several of which rely on yeast systems [30,31,32]. Our preliminary study should be extended, and functional gene characterization needs to be performed for full validation of gene function. These biotechnological tools are considered of utmost importance, considering that terpene extraction from natural sources, as performed in the past, nowadays raises environmental concerns and is no longer considered a viable option.

2. Materials and Methods

2.1. Sequence Retrieval and Identification of Putative Terpene Synthase Genes

In this study, genomic sequences from C. brasiliense [29] were obtained from GenBank under accession number GCA_004918865.1 (Table 1). To infer the genome completeness, BUSCO v. 4.1 (Benchmarking Universal Single-Copy Orthologs) [33] with the Viridiplantae database was used. To identify the Terpene synthase genes, the genome of C. brasiliense was annotated using the MAKER pipeline v2.31 [34]. Both ab initio and homology-based evidence were used and obtained from the proteomes of related species in the Malpighiales order (Table 2), available from the 1KP database [35]. The resulting protein-coding genes against the Pfam-A database were searched using interproscan v 5.61 [36]. We classified as putative Terpene-synthase genes all genes with the best hit against the Terpene_synth_C (PF03936) or the Terpene synthase N-terminal domain (PF01397) profiles [37].

2.2. Phylogenetic Analyses

To classify the putative Terpene synthase genes of C. brasiliense into the phylogenetic subfamilies identified in previous studies [37], we obtained the unaligned sequence data containing all Terpene synthase genes (longer than 350 amino acids) previously identified across green plants [37]. The newly identified Terpene synthase genes from C. brasiliense (minimum length: 298 amino acids) were added to this dataset. All data were aligned using mafft v 7.5 [38] with 1000 iterations of improvement. The best-fitting protein evolution model was identified with modeltest-ng v.0.1.7 [39,40]. The phylogenetic inference was performed using raxml-ng v.1.1 [41] with 10 random starting trees and 100 bootstrap replicates, using the best-fitting protein evolution model (JTT + G + F).

2.3. Solid-Phase Microextraction Gas Chromatography–Mass Spectrometry

Polydimethylsiloxane/Divinylbenzene (PDMS/DVB, 65 μm) fibers were employed for the solid-phase microextraction (SPME) and gas chromatography-mass spectrometry (GC-MS) analysis. Samples of the outer portion of three fruits of C. brasiliense weighing 1.0 g were transferred to 20 mL headspace vials, in triplicate, which were then sealed. The samples were taken to a heating plate, on which an aluminum block with a cylindrical bore was placed, in order to place the headspace vials for sample heating. Samples were pre-heated for 5 min, after which the PDMS/DVB fiber holder was inserted into the vial, and the fiber was exposed with the temperature kept at 50 °C for 10 min. The PDMS/DVB fiber was then retracted, transferred to the GC-MS injector, and exposed, where it remained in the equipment for 5 min and was retracted for the remainder of the run [42].
A gas chromatograph coupled with a mass spectrometer (Shimadzu Scientific Instruments, Kyoto, Japan). A split/splitless injector in splitless mode was used as an ion-trap type analyzer, and it was maintained for 5 min at a temperature of 250 °C. Helium gas (1 mL min−1 flow) was used with a HP-5, 30 m × 0.25 mm × 0.25 μm, MS capillary column (5% phenyl and 95% methylpolysiloxane) (Agilent Technologies Inc., Munich, Germany). The column was held at 40 °C for 1 min, and then, the temperature was increased at a rate of 12 °C min−1 up to 120 °C, maintaining it for 2 min, followed by an increase of 15 °C min−1 up to 150 °C and at 20 °C min−1 to 245 °C, held for 2 min [26].
Mass spectrometry was set to fragment ions between 35 and 300 m/z in 70 eV electron impact ionisation mode; the transference line temperature was 275 °C, and the ion source temperature was 200 °C. Volatile compounds were identified based on the mass-to-charge ratio (m/z) of the sample ion fragments corresponding to each peak generated by the chromatogram. The mass spectra of the analytes found were compared with the mass spectra data obtained from the NIST library (National Institute of Standards and Technology), using the 2011 version of the NIST/EPA/NIH Mass Spectral Database (NIST 11), using Xcalibur software version 2.1 (Thermo Scientific, San Jose, CA, USA), and considering the level of similarity (reverse lookup index, RSI) greater than 600. The RSI index consists of a numerical comparison factor where the higher its value, the closer the compound is to the finding in the NIST library literature. However, only peaks with a value above 600, a relative standard intensity (RSI) and a signal-to-noise ratio (S/N) above 50 decibels were selected.

3. Results and Discussion

Analysis of genome completeness using BUSCO showed that, despite a highly fragmented assembly, over 90% of BUSCO genes were found in the genome assembly of C. brasiliense, with 71% containing complete gene sequences and an additional 21% of genes present but fragmented (Table 1). We identified 33,767 protein-coding genes using the MAKER pipeline. Of these, 22 genes had homology with either (or both) of the Terpene synthase Pfam-A profiles and were thus retained as putative Terpene synthase genes (Table 3). Search on NCBI Conserved Domain [43] allowed us to find several motifs and domains that validated the sequences as partial putative terpene synthases. Larger sequences, CbTPS19, CbTPS20, CbTPS21, and CbTPS22, with respectively 458, 497, 561, and 729 amino acids, showed at least four (or five) of the total five conserved domains characteristic of these terpene synthases (Table 3).
Of the 22 genes identified, we used the ten longest (minimum length: 298 aa; Table 3) for phylogenetic inference. In the resulting phylogenetic tree, C. brasiliense terpene synthase genes clustered within the different angiosperm clades identified in previous work [37] and allowed us to classify each gene into the different phylogenetic subfamilies (Figure 1). Six genes belonged to the h/d/a/b/g subfamily: CbTPS14, CbTPS19, and CbTPS20 clustered together with accessions from Arabidopsis thaliana, while CbTPS15, CbTPS16, and CbTPS21 clustered together with accessions from Oryza sativa. The genes forming the group TPS-h/d/a/b/g are apparently involved in secondary (specialized) metabolism [37].
Considering the remaining four genes, three genes belonged to the c subfamily, namely CbTPS13, CbTPS17, and CbTPS18, and one gene, CbTPS22, was assigned to the e/f subfamilies. Both TPS-c and TPS-e/f subfamilies can be involved in gibberellin biosynthesis, but they can also give rise to numerous proteins involved in secondary metabolism.
The partial sequences we identified here can be used for primer design that will allow gene amplification, followed by heterologous gene expression. This will allow the obtention of protein that will be used to assay enzymatic activity. The final outcome will be the identification of the respective terpene synthases [44,45].
Gas chromatography-mass spectrometry analyses revealed a diverse phytochemical profile (Table 4) comprising five carboxylic acid esters (butanoic acid ethyl ester, (E)-2-butenoic acid ethyl ester, methyl hexanoate, ethyl hexanoate, and ethyl octanoate), one ethyl ester of carboxylic acids (all identified as ethyl acetate), one α-amino acid (alanine), one α,β-unsaturated aldehyde ((E)-2-hexenal), one α,β-unsaturated carboxylic acid ester (ethyl 2-hexenoate), one monoterpene hydrocarbon ((Z)-β-ocimene), and one formate ester (ethenyl formate). Figure 2 shows the resulting chromatogram of the HS-SPME-GC-MS analyses of samples of C. brasiliensis.
The α,β-unsaturated aldehyde (E)-2-hexenal (RT 5.995) functions as a critical green leaf volatile (GLV) rapidly synthesized via the lipoxygenase pathway following tissue damage [46]. The conjugated double bond system confers significant antimicrobial and antifungal properties, serving as part of the plant’s chemical defense arsenal against a broad spectrum of phytopathogens. Recent research has demonstrated its efficacy against economically important pathogens, including Botrytis cinerea and various Colletotrichum species, with minimum inhibitory concentrations in the low ppm range [47,48]. Agricultural applications have expanded to include (E)-2-hexenal-based biopesticides and plant elicitors that activate systemic acquired resistance mechanisms, potentially reducing conventional fungicide requirements by 30–40% when incorporated into integrated pest management programs [49].
Alanine (RT 1.420), an α-amino acid, plays fundamental roles in primary metabolism beyond its function as a protein building block. This compound participates centrally in transamination reactions and the alanine-glucose cycle that regulates carbon and nitrogen flux between plant tissues [50]. During environmental stress conditions, particularly drought and hypoxia, alanine accumulation serves as both a biochemical stress indicator and adaptive response, functioning as a compatible solute that maintains cellular osmotic balance without disrupting enzyme function [51].
The monoterpene hydrocarbon (Z)-β-ocimene (RT 10.150) represents a significant volatile compound derived from the isoprenoid pathway through the MEP pathway in plastids [52]. This acyclic terpene functions prominently in tritrophic plant-herbivore-predator interactions, serving as both a herbivore deterrent and an attractant for natural enemies, including parasitoid wasps and predatory insects. Studies have demonstrated that plants under herbivore attack can increase β-ocimene emissions by up to 1000-fold, triggering defense responses in neighboring plants through volatile signaling networks [53]. The compound’s conjugated diene structure confers notable antioxidant properties, with radical-scavenging activity comparable to vitamin E analogs in some assay systems [54]. β-ocimene serves as a key component of essential oil-based formulations targeting agricultural pest management, particularly in organic production systems where conventional pesticides are restricted [55].

4. Conclusions

In conclusion, this analysis based on the pequi draft genome allowed us to recover the majority of BUSCO genes, including most complete sequences, and to identify 22 putative terpene synthase genes. Phylogenetic analysis of ten well-supported sequences further revealed their distribution across established Angiosperm clades, enabling classification into distinct terpene synthase subfamilies and providing insight into their evolutionary relationships. These preliminary results lay the foundation for further studies to fully characterize terpene synthase gene sequences in pequi and to explore their potential applications. providing a basis for future investigations that may include quantification and functional validation. Furthermore, solid-phase microextraction gas chromatography mass-spectrometry has allowed the identification of significant chemical compounds, including a terpene that plays a key role in this species metabolism and putatively displays significant applications in the food, health, and agricultural industries.

Author Contributions

Conceptualization: H.T., B.N. and J.O.-F.M.; Data analysis: B.N., V.D.M.S., and L.L.A.; Writing original draft: H.T., B.N., and R.L.B.d.A.; Funding acquisition: P.B.-S.; Project administration: P.B.-S.; Review and editing: H.T., B.N., R.L.B.d.A., V.D.M.S., L.L.A., A.R., J.O.-F.M., and P.B.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Universidade Federal de São João del-Rei (UFSJ—PROPE), and the Universidade Federal de Minas Gerais (UFMG). This work was funded by Portuguese funds through FCT—Fundação para a Ciência e a Tecnologia, I.P., under the projects UID/00239/2025 of Centro de Estudos Florestais, UID/04129/2025 of LEAF-Linking Landscape, Environment, Agriculture and Food, LA/P/0092/2020 of Associate Laboratory TERRA, UID/00329/2025 of cE3c—Centre for Ecology, Evolution and Environmental Changes and by funds of the Tropical College of the University of Lisbon—CTROP-ULisboa. B.N. is supported by funds from Fundação para a Ciência e a Tecnologia (CEECIND/00229/2018 and 2024.14346.CPCA.A3); the Coordination for the Improvement of Higher Education Personnel (CAPES)/code 001), the National Council for Scientific and Technological Development (CNPq) (research productivity grant 132217/2023-6, 307787/2022-2 and 404432/2024-7), Minas Gerais State Research Support Foundation (FAPEMIG)- Finance Code APQ-04336-23, BPD-00858-22, and 5.308/15 and the Teaching, Research and Extension Group in Chemistry and Pharmacognosy (GEPEQF).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The coding sequences of the Terpene synthase genes presented in this article are not readily available because of time limitations. Requests to access the datasets should be directed to the coauthors Helena Trindade (htrindade@fc.ul.pt) or Bruno Nevado (bnevado@ciencias.ulisboa.pt). The remaining Terpene sequences were kindly provided by Qidong Jia [32].

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Damasco, G.; Fontes, C.; Françoso, R.; Haidar, R. The Cerrado biome: A forgotten biodiversity hotspot. Front. Young Minds 2018, 6, 22. [Google Scholar] [CrossRef]
  2. Araujo, F.D. A review of Caryocar brasiliense (Caryocaraceae)—An economically valuable species of the central Brazilian cerrado. Econ. Bot. 1995, 49, 40–48. [Google Scholar] [CrossRef]
  3. De Carvalho, L.S.; Pereira, K.F.; de Araújo, E.G. Botanical features, therapeutic effects and active ingredients present in pequi (Caryocar brasiliense). Arq. Ciênc. Saúde UNIPAR 2015, 19, 147–157. [Google Scholar] [CrossRef]
  4. Pinto, L.C.L.; Morais, L.M.O.; Guimarães, A.Q.; Almada, E.D.; Barbosa, P.M.; Drumond, M.A. Traditional knowledge and uses of the Caryocar brasiliense Cambess. (Pequi) by “quilombolas” of Minas Gerais, Brazil: Subsidies for sustainable management. Braz. J. Biol. 2016, 76, 511–519. [Google Scholar] [CrossRef] [PubMed]
  5. Santos, B.O.; Tanigaki, M.; Silva, M.R.; Ramos, A.L.C.C.; Labanca, R.A.; Augusti, R.; Melo, J.O.F.; Takahashi, J.A.; de Araújo, R.L.B. Development and Chemical Characterization of Pequi Pericarp Flour (Caryocar brasiliense Camb.) and Effect of in vitro Digestibility on the Bioaccessibility of Phenolic Compounds. J. Braz. Chem. Soc. 2022, 33, 1058–1068. [Google Scholar] [CrossRef]
  6. Junior, A.J.; Leitão, M.M.; Bernal, L.P.T.; dos Santos, E.; Kuraoka-Oliveira, Â.M.; Justi, P.; Argandoña, E.J.S.; Kassuya, C.A.L. Analgesic and Anti-inflammatory Effects of Caryocar brasiliense. Antiinflamm. Antiallergy Agents Med. Chem. 2020, 19, 313–322. [Google Scholar] [CrossRef]
  7. Roll, M.M.; Miranda-Vilela, A.L.; Longo, J.P.F.; da Agostini-Costa, T.S.; Grisolia, C.K. The pequi pulp oil (Caryocar brasiliense Camb.) provides protection against aging-related anemia, inflammation and oxidative stress in Swiss mice, especially in females. Genet. Mol. Biol. 2018, 41, 858–869. [Google Scholar] [CrossRef]
  8. De Oliveira, T.S.; Thomaz, D.V.; Neri, H.F.D.S.; Cerqueira, L.B.; Garcia, L.F.; Gil, H.P.V.; Pontarolo, R.; Campos, F.R.; Costa, E.A.; Dos Santos, F.C.A.; et al. Neuroprotective effect of Caryocar brasiliense Camb. leaves is associated with anticholinesterase and antioxidant properties. Oxid. Med. Cell. Longev. 2018, 2018, 9842908. [Google Scholar] [CrossRef]
  9. Roesler, R.; Lorencini, M.; Pastore, G. Fontes de antioxidantes do cerrado brasileiro: Citotoxicidade e fototoxicidade in vitro. Ciênc. Tecnol. Aliment. 2010, 30, 814–821. [Google Scholar] [CrossRef]
  10. Souza, M.R.; de Carvalho, R.K.; de Carvalho, L.S.; de Sá, S.; Andersen, M.L.; de Araújo, E.G.; Mazaro-Costa, R. Effects of subchronic exposure to Caryocar brasiliense peel ethanolic extract on male reproductive functions in Swiss mice. Reprod. Toxicol. 2019, 87, 118–124. [Google Scholar] [CrossRef]
  11. Braga, K.M.S.; Araujo, E.G.; Sellke, F.W.; Abid, M.R. Pequi Fruit Extract Increases Antioxidant Enzymes and Reduces Oxidants in Human Coronary Artery Endothelial Cells. Antioxidants 2022, 11, 474. [Google Scholar] [CrossRef]
  12. Gallardo, E.; Seca, A.M.L. Secondary Metabolites and Their Applications. Appl. Sci. 2022, 12, 2317. [Google Scholar] [CrossRef]
  13. Melo, J.O.F.; Conchinhas, B.; Leitão, A.E.B.; Ramos, A.L.C.C.; de Sousa, I.M.N.; Ferreira, R.M.d.S.B.; Ribeiro, A.C.; Batista-Santos, P. Phenolic Compounds Characterization of Caryocar brasiliense Peel with Potential Antioxidant Activity. Plants 2024, 13, 2016. [Google Scholar] [CrossRef]
  14. Tetali, S.D. Terpenes and isoprenoids: A wealth of compounds for global use. Planta 2019, 249, 1–8. [Google Scholar] [CrossRef]
  15. Masyita, A.; Sari, R.M.; Astuti, A.D.; Yasir, B.; Rumata, N.R.; Emran, T.B.; Nainu, F.; Simal-Gandara, J. Terpenes and terpenoids as main bioactive compounds of essential oils, their roles in human health and potential application as natural food preservatives. Food Chem. X 2022, 13, 100217. [Google Scholar] [CrossRef] [PubMed]
  16. de Queiroz, J.C.E.; Leite, J.R.S.A.; Vasconcelos, A.G. Prospecting Plant Extracts and Bioactive Molecules with Antimicrobial Activity in Brazilian Biomes: A Review. Antibiotics 2023, 12, 427. [Google Scholar] [CrossRef] [PubMed]
  17. Tohidi, B.; Rahimmalek, M.; Trindade, H. Review on essential oil, extracts composition, molecular and phytochemical properties of Thymus species in Iran. Ind. Crops Prod. 2019, 134, 89–99. [Google Scholar] [CrossRef]
  18. Gucwa, K.; Milewski, S.; Dymerski, T.; Szweda, P. Investigation of the antifungal activity and mode of action of Thymus vulgaris, Citrus limonum, Pelargonium graveolens, Cinnamomum cassia, Ocimum basilicum, and Eugenia caryophyllus essential oils. Molecules 2018, 23, 1116. [Google Scholar] [CrossRef] [PubMed]
  19. Moazeni, M.; Davari, A.; Shabanzadeh, S.; Akhtari, J.; Saeedi, M.; Mortyeza-Semnani, K.; Abastabar, M.; Nabili, M.; Moghadam, F.H.; Roohi, B.; et al. In vitro antifungal activity of Thymus vulgaris essential oil nanoemulsion. J. Herb. Med. 2021, 28, 100452. [Google Scholar] [CrossRef]
  20. Pedroso, M.B.; Scariot, F.J.; Rocha, R.K.M.; Echeverrigaray, S.; Delamare, A.P.L. Antifungal activity and mechanism of action of monoterpenes against Botrytis cinerea. Ciênc. Agrotec. 2024, 48, e018823. [Google Scholar] [CrossRef]
  21. Belo, R.F.C.; Augusti, R.; Lopes, P.S.N.; Junqueira, R.G. Characterization and classification of pequi trees (Caryocar brasiliense Camb.) based on the profile of volatile constituents using headspace solid-phase microextraction–gas chromatography–mass spectrometry and multivariate analysis. Ciênc. Tecnol. Aliment. 2013, 33, 116–124. [Google Scholar] [CrossRef]
  22. Geőcze, K.; Barbosa, L.; Fidêncio, P.; Silvério, F.; Lima, C.; Barbosa, M.; Ismail, F.M. Essential oils from pequi fruits from the Brazilian Cerrado ecosystem. Food Res. Int. 2013, 54, 1–8. [Google Scholar] [CrossRef]
  23. da Costa, C.A.R.; do Nascimento, S.V.; da Silva Valadares, R.B.; da Silva, L.G.M.; Machado, G.G.L.; da Costa, I.R.C.; Nahon, S.M.R.; Rodrigues, L.J.; Vilas Boas, E.V.B. Proteome and metabolome of Caryocar brasiliense Camb. fruit and their interaction during development. Food Res. Int. 2024, 191, 114687. [Google Scholar] [CrossRef]
  24. Silva, M.; Bueno, G.; Araújo, R.; Lacerda, I.; Freitas, L.; Morais, H.; Augusti, R.; Melo, J. Evaluation of the Influence of Extraction Conditions on the Isolation and Identification of Volatile Compounds from Cagaita (Eugenia dysenterica) using HS-SPME/GC-MS. J. Braz. Chem. Soc. 2019, 30, 379–387. [Google Scholar] [CrossRef]
  25. Ramos, A.L.C.C.; Mendes, D.D.; Silva, M.R.; Augusti, R.; Melo, J.O.F.; de Araújo, R.L.B.; Lacerda, I.C.A. Chemical profile of Eugenia brasiliensis (Grumixama) pulp by PS/MS Paper Spray and SPME-GC/MS solid-phase microextraction. Res. Soc. Dev. 2020, 9, e318974008. [Google Scholar] [CrossRef]
  26. Mariano, A.P.X.; Ramos, A.L.C.C.; Júnior, A.H.d.O.; García, Y.M.; de Paula, A.C.C.F.F.; Silva, M.R.; Augusti, R.; de Araújo, R.L.B.; Melo, J.O.F. Optimization of Extraction Conditions and Characterization of Volatile Organic Compounds of Eugenia klotzschiana O. Berg Fruit Pulp. Molecules 2022, 27, 935. [Google Scholar] [CrossRef]
  27. Santos, B.O.; Augusti, R.; Melo, J.O.F.; Takahashi, J.A.; de Araújo, R.L.B. Optimization of extraction conditions of volatile compounds from pequi peel (Caryocar brasiliense Camb.) using HS-SPME. Res. Soc. Dev. 2020, 9, e919974893. [Google Scholar] [CrossRef]
  28. Takahashi, J.A.; Melo, J.O.; de Araújo, R.L.; Pimenta, L.P.; Mazzinghy, A.C.D.C.; Ramos, A.L.; Silva, V.D. Economic, nutritional, and innovative aspects of non-conventional Brazilian fruits in the international novel foods market. Food Res. Int. 2024, 197, 115223. [Google Scholar] [CrossRef]
  29. Nunes, R.; Gonçalves, A.R.; Telles, M.P.C. Data on the draft genome sequence of Caryocar brasiliense Camb. (Caryocaraceae): An important genetic resource from Brazilian savannas. Data Brief 2019, 26, 104543. [Google Scholar] [CrossRef]
  30. Couillaud, J.; Leydet, L.; Duquesne, K.; Iacazio, G. The Terpene Mini-Path, a New Promising Alternative for Terpenoids Bio-Production. Genes 2021, 12, 1974. [Google Scholar] [CrossRef]
  31. Fordjour, E.; Mensah, E.O.; Hao, Y.; Yang, Y.; Liu, X.; Li, Y.; Liu, C.-L.; Bai, Z. Toward improved terpenoids biosynthesis: Strategies to enhance the capabilities of cell factories. Bioresour. Bioprocess. 2022, 9, 6. [Google Scholar] [CrossRef]
  32. Ma, Y.; Zu, Y.; Huang, S.; Stephanopoulos, G. Engineering a universal and efficient platform for terpenoid synthesis in yeast. Proc. Natl. Acad. Sci. USA 2022, 120, e2207680120. [Google Scholar] [CrossRef] [PubMed]
  33. Seppey, M.; Manni, M.; Zdobnov, E.M. BUSCO: Assessing Genome Assembly and Annotation Completeness. Methods Mol. Biol. 2019, 1962, 227–245. [Google Scholar] [CrossRef] [PubMed]
  34. Cantarel, B.L.; Korf, I.; Robb, S.M.C.; Parra, G.; Ross, E.; Moore, B.; Holt, C.; Alvarado, A.S.; Yandell, M. MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2008, 18, 188–196. [Google Scholar] [CrossRef]
  35. One Thousand Plant Transcriptomes Initiative. One thousand plant transcriptomes and the phylogenomics of green plants. Nature 2019, 574, 679–685. [Google Scholar] [CrossRef]
  36. Jones, P.; Binns, D.; Chang, H.-Y.; Fraser, M.; Li, W.; McAnulla, C.; McWilliam, H.; Maslen, J.; Mitchell, A.; Nuka, G.; et al. InterProScan 5: Genome-scale protein function classification. Bioinformatics 2014, 30, 1236–1240. [Google Scholar] [CrossRef]
  37. Jia, Q.; Brown, R.; Köllner, T.G.; Chen, F. Origin and early evolution of the plant terpene synthase family. Proc. Natl. Acad. Sci. USA 2022, 119, e2100361119. [Google Scholar] [CrossRef]
  38. Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. MAFFT: A novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef] [PubMed]
  39. Darriba, D.; Posada, D.; Kozlov, A.M.; Stamatakis, A.; Morel, B.; Flouri, T. ModelTest-NG: A new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol. 2020, 37, 291–294. [Google Scholar] [CrossRef]
  40. Flouri, T.; Izquierdo-Carrasco, F.; Darriba, D.; Aberer, A.; Nguyen, L.-T.; Minh, B.; Von Haeseler, A.; Stamatakis, A. The Phylogenetic Likelihood Library. Syst. Biol. 2015, 64, 356–362. [Google Scholar] [CrossRef]
  41. Kozlov, A.M.; Darriba, D.; Flouri, T.; Morel, R.; Stamatakis, A. RAxML-NG: A fast, scalable, and user-friendly tool for maximum likelihood phylogenetic inference. Bioinformatics 2019, 35, 4453–4455. [Google Scholar] [CrossRef] [PubMed]
  42. Mazzinghy, A.C.D.C.; Silva, V.D.M.; Ramos, A.L.C.C.; de Oliveira, C.P.; de Oliveira, G.B.; Augusti, R.; de Araújo, R.L.B.; Melo, J.O.F. Influence of the Different Maturation Conditions of Cocoa Beans on the Chemical Profile of Craft Chocolates. Foods 2024, 13, 1031. [Google Scholar] [CrossRef]
  43. NCBI Conserved Domain. 2025. Available online: https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi (accessed on 25 October 2025).
  44. Mendes, M.D.; Barroso, J.G.; Oliveira, M.M.; Trindade, H. Identification and characterization of a second isogene encoding γ-terpinene synthase in Thymus caespititius. J. Plant Physiol. 2014, 171, 1017–1027. [Google Scholar] [CrossRef]
  45. Lima, A.S.; Schimmel, J.; Lukas, B.; Novak, J.; Barroso, J.G.; Figueiredo, A.C.; Pedro, L.G.; Degenhardt, J.; Trindade, H. Genomic characterization, molecular cloning and expression analysis of two terpene synthases from Thymus caespititius (Lamiaceae). Planta 2013, 238, 191–204. [Google Scholar] [CrossRef]
  46. Hao, X.; Wang, S.; Fu, Y.; Liu, Y.; Shen, H.; Jiang, L.; McLamore, E.S.; Shen, Y. The WRKY46–MYC2 module plays a critical role in E-2-hexenal-induced anti-herbivore responses by promoting flavonoid accumulation. Plant Commun. 2024, 5, 100734. [Google Scholar] [CrossRef]
  47. Zhang, X.; Li, D.; Luo, Z.; Xu, Y. (E)-2-hexenal fumigation control the gray mold on fruits via consuming glutathione of Botrytis cinerea. Food Chem. 2024, 432, 137146. [Google Scholar] [CrossRef] [PubMed]
  48. Sdiri, Y.; Lopes, T.; Rodrigues, N.; Silva, K.; Rodrigues, I.; Pereira, J.A.; Baptista, P. Biocontrol ability and production of volatile organic compounds as a potential mechanism of action of olive endophytes against Colletotrichum acutatum. Microorganisms 2022, 10, 571. [Google Scholar] [CrossRef] [PubMed]
  49. Paradza, V.M.; Khamis, F.M.; Yusuf, A.A.; Subramanian, S.; Akutse, K.S. Efficacy of Metarhizium anisopliae and (E)–2–hexenal combination using autodissemination technology for the management of the adult greenhouse whitefly, Trialeurodes vaporariorum Westwood (Hemiptera: Aleyrodidae). Front. Insect Sci. 2022, 2, 991336. [Google Scholar] [CrossRef]
  50. Heinemann, B.; Hildebrandt, T.M. The role of amino acid metabolism in signaling and metabolic adaptation to stress-induced energy deficiency in plants. J. Exp. Bot. 2021, 72, 4634–4645. [Google Scholar] [CrossRef]
  51. Bouillaud, F.; Hammad, N.; Schwartz, L. Warburg effect, glutamine, succinate, alanine, when oxygen matters. Biology 2021, 10, 1000. [Google Scholar] [CrossRef]
  52. Chen, S.; Xie, P.; Li, Y.; Wang, X.; Liu, H.; Wang, S.; Han, W.; Wu, R.; Li, X.; Guan, Y.; et al. New insights into stress-induced β-ocimene biosynthesis in tea (Camellia sinensis) leaves during oolong tea processing. J. Agric. Food Chem. 2021, 69, 11656–11664. [Google Scholar] [CrossRef] [PubMed]
  53. Zhang, N.X.; Andringa, J.; Brouwer, J.; Alba, J.M.; Kortbeek, R.W.J.; Messelink, G.J.; Janssen, A. The omnivorous predator Macrolophus pygmaeus induces production of plant volatiles that attract a specialist predator. J. Pest Sci. 2022, 95, 1343–1355. [Google Scholar] [CrossRef]
  54. Rushendran, R.; Singh, A.; Siva, K.B.; Ilango, K. Chemical composition of essential oils–fatty acids. In Essential Oils: Sources, Production and Applications; Padalia, R.C., Verma, D.K., Arora, C., Mahish, P.K., Eds.; De Gruyter: Berlin, Germany, 2023; pp. 65–88. [Google Scholar] [CrossRef]
  55. Liu, G.; Wang, Q.; Chen, H.; Wang, Y.; Zhou, X.; Bao, D.; Wang, N.; Sun, J.; Huang, F.; Yang, M.; et al. Plant-derived monoterpene S-linalool and β-ocimene generated by CsLIS and CsOCS-SCZ are key chemical cues for attracting parasitoid wasps for suppressing Ectropis obliqua infestation in Camellia sinensis L. Plant Cell Environ. 2024, 47, 913–927. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Phylogeny of Terpene synthase genes, including the ten longest genes identified in C. brasiliense (denoted in orange). Classification into families (TPS-e/f, TPS-c, and TPS-h/d/a/b/g) follows from figure 3 in reference [37].
Figure 1. Phylogeny of Terpene synthase genes, including the ten longest genes identified in C. brasiliense (denoted in orange). Classification into families (TPS-e/f, TPS-c, and TPS-h/d/a/b/g) follows from figure 3 in reference [37].
Life 16 00067 g001
Figure 2. C. brasiliense exocarp extract chromatogram. The Y-axis shows abundance (AU), and the X-axis shows run time in minutes.
Figure 2. C. brasiliense exocarp extract chromatogram. The Y-axis shows abundance (AU), and the X-axis shows run time in minutes.
Life 16 00067 g002
Table 1. Summary statistics of the C. brasiliense genome used in this study.
Table 1. Summary statistics of the C. brasiliense genome used in this study.
Statistics
Total sequence length (bp)212,172,521
Number of scaffolds55,248
Scaffold N506005
Scaffold L5010,533
Complete BUSCOs302 (71.1%)
Fragmented BUSCOs88 (20.7%)
Missing BUSCOs35 (8.2%)
Table 2. Malpighiales proteomes used for genome annotation.
Table 2. Malpighiales proteomes used for genome annotation.
Species Name
Bischofia javanica
Chrysobalanus icaco
Croton tiglium
Drypetes deplanchei
Erythroxylum coca
Galphimia gracilis
Garcinia oblongifolia
Hypericum perforatum
Licania michauxii
Linum bienne
Malesherbia fasciculata
Mammea americana
Ochna serrulata
Passiflora edulis
Rhizophora mangle
Salix acutifolia
Viola tricolor
Table 3. Detailed information on each putative Terpene synthase gene, including the conserved domains and motifs found.
Table 3. Detailed information on each putative Terpene synthase gene, including the conserved domains and motifs found.
Conserved DomainsConserved Motif
IdentificationSize (aa)Active Site Lid ResiduesSubstrate Binding PocketSubstrate-Mg2+ Binding SiteAspartate-Rich Region 1Aspartate-Rich Region 2DDxxD
CbTPS0150
CbTPS0252
CbTPS0353
CbTPS0457 x X
CbTPS0574
CbTPS0683
CbTPS07109
CbTPS08123 x
CbTPS09124
CbTPS10229 x X
CbTPS11287 xx X
CbTPS12294
CbTPS13 *298 xxx
CbTPS14 *325 xxx X
CbTPS15 *391 x X
CbTPS16 *394 x X
CbTPS17 *443 x
CbTPS18 *448 x
CbTPS19 *458 xxxxX
CbTPS20 *497 xxxx
CbTPS21 *561xxxxxX
CbTPS22 *729 xxxxX
* genes with a minimum coding region corresponding to 298 amino acids that were used for phylogenetic analysis. DDxxD corresponds to a highly conserved aspartate-rich sequence, D: aspartate; x: any amino acid.
Table 4. Qualitative results of solid-phase microextraction gas chromatography-mass spectrometry of C. brasiliensis exocarp analysis.
Table 4. Qualitative results of solid-phase microextraction gas chromatography-mass spectrometry of C. brasiliensis exocarp analysis.
Peak N.Retention Time (min)CompoundFormulaCASChemical Class
11.420AlanineC3H7NO256-41-7α-Amino acid
21.515Formic acid, ethenyl ester C3H4O2692-45-5Formate ester
32.200Ethyl Acetate C4H8O2141-78-6Ethyl ester of a carboxylic acid
44.825Butanoic acid, ethyl ester C6H12O2105-54-4Carboxylic acid ester
55.800(E)-2-Butenoic acid, ethyl ester, C6H10O2623-70-1Carboxylic acid ester
65.9952-Hexenal, (E)-C6H10O6728-26-3Unsaturated aldehyde
77.500Hexanoic acid, methyl esterC7H14O2106-70-7Carboxylic acid ester
89.105Hexanoic acid, ethyl esterC8H16O2123-66-0Carboxylic acid ester
910.1002-Hexenoic acid, ethyl esterC8H14O21552-67-6α,β-Unsaturated carboxylic acid ester
1010.1501,3,6-Octatriene, 3,7-dimethyl-, (Z)-C10H163338-55-4Monoterpene hydrocarbon
1113.210Octanoic acid, ethyl esterC10H20O2106-32-1Carboxylic acid ester
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Trindade, H.; Nevado, B.; de Araújo, R.L.B.; Silva, V.D.M.; Aguiar, L.L.; Ribeiro, A.; Melo, J.O.-F.; Batista-Santos, P. Preliminary Identification of Putative Terpene Synthase Genes in Caryocar brasiliense and Chemical Analysis of Major Components in the Fruit Exocarp. Life 2026, 16, 67. https://doi.org/10.3390/life16010067

AMA Style

Trindade H, Nevado B, de Araújo RLB, Silva VDM, Aguiar LL, Ribeiro A, Melo JO-F, Batista-Santos P. Preliminary Identification of Putative Terpene Synthase Genes in Caryocar brasiliense and Chemical Analysis of Major Components in the Fruit Exocarp. Life. 2026; 16(1):67. https://doi.org/10.3390/life16010067

Chicago/Turabian Style

Trindade, Helena, Bruno Nevado, Raquel Linhares Bello de Araújo, Viviane Dias Medeiros Silva, Lara Louzada Aguiar, Ana Ribeiro, Julio Onesio-Ferreira Melo, and Paula Batista-Santos. 2026. "Preliminary Identification of Putative Terpene Synthase Genes in Caryocar brasiliense and Chemical Analysis of Major Components in the Fruit Exocarp" Life 16, no. 1: 67. https://doi.org/10.3390/life16010067

APA Style

Trindade, H., Nevado, B., de Araújo, R. L. B., Silva, V. D. M., Aguiar, L. L., Ribeiro, A., Melo, J. O.-F., & Batista-Santos, P. (2026). Preliminary Identification of Putative Terpene Synthase Genes in Caryocar brasiliense and Chemical Analysis of Major Components in the Fruit Exocarp. Life, 16(1), 67. https://doi.org/10.3390/life16010067

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop