Next Article in Journal
Coupled Impact of Anthocyanin and Mineral Concentrations in Cranberry Juice on Gut Microbiota and Function Modulation: A First Demonstration
Previous Article in Journal
Cucurbitane Glycosides from Siraitia Grosvenorii and Their Hepatoprotective Activities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of Thymoquinone Content in Black Cumin Seeds Using Near-Infrared Reflectance Spectroscopy

by
Óscar Ballesteros
and
Leonardo Velasco
*
Instituto de Agricultura Sostenible (IAS-CSIC), Alameda del Obispo s/n, 14004 Córdoba, Spain
*
Author to whom correspondence should be addressed.
Molecules 2025, 30(19), 3985; https://doi.org/10.3390/molecules30193985
Submission received: 4 September 2025 / Revised: 29 September 2025 / Accepted: 2 October 2025 / Published: 4 October 2025
(This article belongs to the Topic Research on Natural Products of Medical Plants)

Abstract

Thymoquinone (TMQ) is the main therapeutic constituent in black cumin (Nigella sativa L.) seeds. Conventional quantification by high-performance liquid chromatography (HPLC) is accurate but unsuitable for large-scale screening. This study evaluated the potential of near-infrared reflectance spectroscopy (NIRS) as a rapid and non-destructive alternative. A multi-year dataset of 780 seed samples was analyzed, and robust calibration models were developed using modified partial least squares regression. Independent validation of a two-year calibration equation using samples from a third year yielded a high predictive performance (r2 = 0.85; SEP = 1.18 mg g−1). Adding part of the samples from the third year to the calibration contributed to further improvement in the prediction of the remaining samples, demonstrating the benefits of continuous equation updates. The calibration equation proved effective for selecting genotypes with high TMQ content, particularly when expanded with samples from the third year. Spectral analysis identified key wavelengths associated with TMQ content, with wavelengths around 2106 nm and 2254 nm being the most relevant. This work demonstrates the applicability of NIRS for rapid phenotyping of TMQ content in black cumin seeds.

1. Introduction

Thymoquinone (2-isopropyl-5-methyl-1,4-benzoquinone—TMQ; Figure 1) is a molecule in plants with exceptional biological activity, including antioxidant, anti-inflammatory, and anticancer activity [1]. Therapeutic applications encompass a wide range of diseases, including atherosclerosis [2], diabetes [3], various types of cancer [4], and others [5].
TMQ is mainly found in the aerial parts of Monarda spp., Satureja spp., and Thymus spp. (Lamiaceae) and the seeds of black cumin (Nigella sativa, Ranunculaceae) [6]. The TMQ content in black cumin seeds varies significantly depending on the genotype, environmental factors, and agricultural practices [7,8]. There is also a substantial effect of the analytical procedures used to quantify TMQ contents [9]. Accordingly, reports in the literature on TMQ content in black cumin germplasm are very variable. For example, ranges of variation from non-detected to 2.48 mg g−1 seed [7], 8.96 to 17.28 mg g−1 [10], and 6.76 to 11.19 mg g−1 [8] have been reported.
The analysis of TMQ in black cumin seeds and oils is most commonly performed using high-performance liquid chromatography (HPLC) [11,12], although more powerful analytical techniques, such as liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis, are also employed [13]. The process is expensive, time-consuming, and involves the use of chemical reagents [11]. An alternative analytical method is necessary for processes involving the analysis of large numbers of samples, such as breeding programs aimed at selecting genotypes with high TMQ content.
Near-infrared reflectance spectroscopy (NIRS) is an analytical procedure that utilizes chemometric statistics to correlate spectral information in the near-infrared region (from 780 nm to 2500 nm) with chemical data obtained through a primary analytical method [14]. The technique was initially developed for the analysis of agricultural commodities, but its application has since expanded to various other areas, including the analysis of nutraceutical and pharmaceutical products [15].
A previous study showed the capacity of NIRS to discriminate different concentrations of TMQ prepared from pure TMQ [16]. The objective of the present study was to evaluate the feasibility of estimating TMQ content in intact black cumin seeds using NIRS and its applicability to a breeding program for selecting seeds with high TMQ content.

2. Results

2.1. Development of a Two-Year Calibration Equation

The initial calibration set, comprising seed samples harvested in 2021 from individual black cumin plants, consisted of 288 samples with an average TMQ content of 7.75 mg g−1 and extreme values of 0.68 and 14.46 mg g−1 (Figure 2). The TMQ levels spanned the entire distribution range for this compound in the set. However, there were higher frequencies in the zone of high TMQ content, as expected for a breeding program in which increasing the seed TMQ content is one of the objectives. The calibration equation showed an SEC of 0.84 mg g−1 and an R2 of 0.96. The SECV was slightly higher, 0.97 mg g−1, whereas the R2 in cross-validation was 0.94 (Table 1).
Independent validation of the 2021 calibration equation with a set of 313 samples harvested in 2022 provided an acceptable detection of TMQ, with a SEP of 1.28 mg g−1 and an R2 of 0.78 (Table 1). As shown in Figure 3, a strong bias largely affected the validation.
Calibration with the combined sets of 2021 and 2022, totaling 601 samples, yielded R2 values in calibration and cross-validation of 0.92 in both cases, a SEC of 1.10 mg g−1, and a SECV of 1.15 mg g−1 (Table 1).

2.2. Validation of the Two-Year Calibration Equation

Three approaches were followed to validate the two-year calibration equation using samples from a third environment. The first one was a completely independent validation with a set of 179 seed samples harvested in 2023. In this case, the SEP was 1.18 mg g−1 seed, and the R2 was 0.85 (Table 2). Expanding the calibration set with 25% of the 2023 samples and validating it with the remaining 75% resulted in a SEP of 1.08 mg g−1 and an R2 of 0.86. Finally, using 50% of the samples to expand the calibration set resulted in a SEP of 1.01 mg g−1 and an R2 value of 0.88 in the validation of the remaining 50% of the 2023 samples (Table 2; Figure 4).

2.3. Predictability of NIRS for Selection of High Thymoquinone Content

The evaluation of the capacity of the NIRS equation, developed using samples from 2021 and 2022, to select samples with high TMQ content was conducted by assessing the benefits of updating the equation with part of the 2023 samples. Without the addition of samples from 2023, the use of a 5% selection criterion, i.e., 5% of the samples with the highest TMQ content as determined by NIRS, resulted in a match of 67% of the target population samples, i.e., 5% of the samples with the highest TMQ content as determined by HPLC. The 10% and 15% cutoffs resulted in 56% and 67% selection matches, respectively (Table 3). When the calibration set was expanded with 25% of the samples of 2023, the selection criteria of 5%, 10%, and 15% resulted in matches of 89%, 89%, and 81%, respectively. In the case in which 50% of the samples had been added to the calibration set, the 5%, 10%, and 15% selection cutoffs provided the actual selection of the 100%, 89%, and 85% of the samples according to the rank based on HPLC data (Table 3).

2.4. Significant Wavelength Regions

The NIRS spectrum of TMQ, in its second derivative form that has enhanced peak resolution and corrected baseline effects, shows three main absorption areas from around 1660 to 1700 nm, 2038 to 2114 nm, and 2226 to 2418 nm (Figure 5). The comparison of average spectra of seed samples with low and high TMQ contents showed marked differences between them (Figure 6A). The maximum differences, as shown in the graphic of standard deviations (Figure 6B), were observed at around 2106 nm and 2254 nm. High standard deviation values were observed in a broad spectral range from 2224 nm to 2390 nm. Around 2106 nm, the average spectra showed particularly marked differences, with peaks at 2090 nm and 2106 nm in the high-TMQ average spectrum that were not observed in the low-TMQ one (Figure 6C). Spectral differences between the high-TMQ and low-TMQ average spectra at around 2254 nm are shown in Figure 6D, with peaks of the high-TMQ average spectrum at 2228, 2256, 2272, and 2294 nm differing from the low-TMQ average spectrum.

3. Discussion

NIRS has become an indispensable tool for the rapid, non-destructive estimation of bioactive compounds in seeds with pharmaceutical and nutraceutical value [17]. Its capacity for high-throughput screening is essential for both plant breeding and quality control, facilitating the identification of genotypes with enhanced health-promoting properties. NIRS has been successfully applied to quantify a wide array of bioactive molecules in intact seeds and fruits, such as silymarin in milk thistle (Silybum marianum) [18], lignans in sesame [19], phenolics and flavonoids in peanut seeds [20], glucosinolates in Brassicaceae [21], and phenolics and theobromine in cacao beans [22], among others [15]. These successes are primarily due to NIRS’s ability to detect spectral signals associated with functional chemical groups (e.g., –OH, –CH, –NH) that are characteristic of secondary metabolites [23]. Given these capabilities, NIRS is particularly suitable for the analysis of TMQ content in black cumin seeds, offering a rapid and sustainable alternative to conventional chromatographic techniques in both breeding and industrial applications.
Developing robust NIRS calibration models for seed quality traits necessitates the inclusion of samples that represent broad genetic diversity and environmental variability. These models, which associate spectral data with reference analytical values, benefit from calibration sets that encompass the full range of variation influencing trait expression. Incorporating genetically and environmentally diverse samples not only improves model accuracy and predictive power but also ensures the inclusion of rare or extreme phenotypes, ultimately enhancing model robustness [24,25,26]. In the present study, this principle was confirmed: calibration equations developed using seed samples from 2021 showed strong performance in cross-validation but lower accuracy when tested against independent 2022 samples. Similarly, adding a subset of 2023 samples to the 2021 + 2022 calibration set significantly improved the model’s predictive robustness, demonstrating the value of multi-environmental data in enhancing the calibration performance. Our results suggest that NIRS prediction of TMQ content would benefit from expanding the calibration set with samples from additional environments, i.e., the calibration equation included in this manuscript cannot be considered final, but it will need to be periodically expanded to improve its predictive performance. Similar benefits of composite calibration sets have been reported in previous NIRS studies involving seed analysis across multiple growing seasons [27,28,29].
In the present research, calibration equations were developed using chemometrics, specifically a multivariate partial least squares approach that incorporates information from the entire spectrum. Furthermore, the study showed that specific wavelengths, particularly around 2106 nm and 2254 nm, have a discriminant value between seeds with low and high TMQ contents. The wavelength at 2106 nm is associated with a combination of C-H bending and C=O stretching [30]. The wavelength at 2254 nm corresponds to a methylene combination band [31]. Both wavelengths belong to spectral areas identified in the present research as being associated with strong reflectance of the TMQ molecule (Figure 5). Banerjee et al. [16] utilized NIRS to differentiate between TMQ samples at varying concentrations within the NIRS region, spanning 900–1700 nm. The authors found a peak for TMQ concentration at around 1650 nm, which has also been observed in the present research (Figure 5). Other interesting wavelengths identified in the present research fell outside the scope of the mentioned research [16]. Our findings regarding the significant wavelengths involved in detecting quantitative differences in TMQ content not only reinforce the potential of NIRS as a powerful tool for quantifying TMQ in black cumin seeds but they also open new avenues for the development of more accessible, cost-effective analytical filter-based NIRS instruments and sensors [15,32]. The identification of specific wavelength regions strongly associated with TMQ content suggests that future instruments might be tailored to monitor only these key regions of the spectrum, facilitating quality control in black cumin seed production, processing, and breeding.

4. Materials and Methods

4.1. Plant Materials

The study used individual black cumin plants from the breeding program conducted by the authors, which included segregating populations and breeding lines. As part of a dynamic breeding program, the plant material varies from year to year. The plants included in the study corresponded to the 2020/2021, 2021/2022, and 2022/2023 growing seasons, with 288, 313, and 179 individual plants, respectively.
Every year, the seeds were sown in the first fortnight of November in 6 L pots filled with a soil mixture prepared with sand, silt, and peat in a 2:1:1 proportion by volume and 8 g of time-release fertilizer Nutricote® 15-9-10 (2MgO)+ME. The pots were maintained in an insect-proof mesh cage. They were irrigated on demand and not supplemented with additional fertilization. Mature seeds were harvested in the second fortnight of June from individual plants.

4.2. Thymoquinone Extraction

Approximately 30 mg of seeds were weighed into a 10 mL polypropylene tube. After 2 mL of 70% methanol was added, the seeds were finely crushed and homogenized in a homogenizer (Heidolph RC 500, Kelheim, Germany) equipped with a stainless-steel rod of 8 mm diameter at a speed of 5000 rpm for approximately 15 s. The rod was washed with 1 mL of 70% methanol and collected in the tube. The samples were stirred and kept overnight at room temperature in the dark (extraction time about 16 h). After extraction, the samples were stirred again and centrifuged at 3500 rpm for 20 min. 1 mL of the supernatant was filtered through 0.45 μm 96-well filter plates using a Multiscreen HTS Manifold Deep Well Collar vacuum system (Millipore Corporation, Burlington, MA, USA). The filtered extract was used for HPLC analysis.

4.3. Analysis of Thymoquinone Content by HPLC

The HPLC system consisted of a Waters 1525 binary pump with an oven maintained at 40 °C, a Waters 2707 autosampler, and a Waters UV/Visible 2489 detector set at a wavelength of 254 nm (Waters Corporation, Milford, MA, USA). The column was a C18 Kinetex 150 × 4.6 mm with 5 μm of particle size (Phenomex, Torrance, CA, USA). The mobile phase was composed of acetonitrile and water in an isocratic mode with a 50:50 (v/v) ratio at a flow rate of 1 mL min−1. The duration of the analysis was 15 min.
Quantification of TMQ was performed using calibration with external standards prepared from TMQ reference standard (>98%; product number 274666; Sigma Aldrich, Saint Louis, MO, USA). The stock solution was prepared by weighing 10 mg and diluting it to 10 mL with 70% methanol. Two standards, 20 mg L−1 and 100 mg L−1, were prepared by dilution, and a calibration curve was generated using injections of 3, 5, 10, 20, and 30 µL of both standards, which resulted in the following injected amounts of TMQ: 0.06, 0.10, 0.20, 0.30, 0.40, 0.50, 0,6, 1.0, 2.0, and 3.0 µg.

4.4. Acquisition of NIRS Spectra

About 300 mg of black cumin seeds from individual plants were scanned on a NIRS monochromator instrument (NIR Systems model 6500, Foss A/S, Hillerød, Denmark). Absorbance spectra (log 1/R; R = reflectance) were recorded at 2 nm intervals from 400 to 2500 nm. For NIRS analysis of pure TMQ, approximately 1 mg of the reference standard powder was impregnated into a glass fiber disk (47 mm diameter, 475 μm thickness, product number AP4004705, MilliporeSigma, Burlington, MA, USA), which was then scanned using the standard cup used for black cumin seed analysis.

4.5. NIRS Calibration and Validation

A calibration equation for TMQ content was developed using 288 samples harvested in 2021. Mathematical procedures on the spectral information were performed using WinISI II software (Infrasoft International, LLC., Port Matilda, PA, USA). The absorbance spectra were modified by applying a standard normal variate (SNV) transformation, de-trend scatter correction, and second derivative transformation at gaps of 5 data points (10 nm) and smoothing over segments of 5 data points (2, 5, 5, 1). This combination of spectra treatments was selected after a round of evaluation of the effect of different math treatments on cross-validation statistics obtained during the calibration process. Similarly, the 400–1100 nm region was excluded based on a comparison between using it and excluding it. Calibration equations were developed using modified partial least squares (MPLS). In cross-validation, the software computes the statistic “one minus variance ratio,” which is an estimate of the coefficient of multiple determination R2. The SEP in external validation is provided after correction for bias [24].
The 313 samples harvested in 2022 were first used to conduct an independent validation of the 2021 calibration equation. Then, the sample sets from 2021 and 2022 were combined to develop a new calibration equation. The samples harvested in 2023 were used to validate the 2022 equation, including samples harvested in 2021 and 2022, following three strategies: (1) using 100% of the samples for validation; (2) using 25% of the samples for expanding the 2022 calibration and the remaining 75% of the samples for independent validation, and (3) using 50% of the samples for expanding the 2022 calibration and the remaining 50% of the samples for independent validation. In all cases, the selection of the validation set was performed randomly.

4.6. Predictability of NIRS to the Selection of High Thymoquinone Content

The three external validation strategies conducted with the samples harvested in 2023 were paralleled with a study on the validity of the calibration equations to select for high TMQ content. To that end, arbitrary selection cutoffs of 5%, 10%, and 15% of the samples with the highest TMQ content, based on NIRS analyses, were defined. A study was conducted to determine the percentage of samples corresponding to the selected percentile, based on HPLC analyses. In cases where the calibration equation was updated, including 25% or 50% of the 2023 samples, the study on the selection predictability of the equations used only the remaining 75% and 50% of the samples, respectively, that had not been included in the calibration set.

4.7. Study of Significant Wavelength Regions

The average spectrum of 50 black cumin seed spectra from seeds with low TMQ content (0.71 to 2.01 mg g−1) and the average spectrum of 50 black cumin seed spectra from seeds with high TMQ content (14.16 to 16.84 mg g−1) were computed and compared to study wavelength regions significantly associated with seed TMQ content. To facilitate the study, the standard deviation between the average spectra was computed. All the spectra used in the study were previously subjected to SNV and detrend correction, as well as second-derivative transformation (2,5,5,1).

5. Conclusions

The results of the present study demonstrated that NIRS is a reliable and powerful alternative to HPLC for determining TMQ content in black cumin seeds. In contrast to HPLC, which is labor-intensive, time-consuming, and requires solvent extraction and sample destruction, NIRS offers a rapid, non-destructive, and environmentally friendly approach with minimal sample preparation. Its suitability for high-throughput multi-trait analysis makes it particularly advantageous for breeding programs and industrial quality control, where large numbers of samples must be screened efficiently. Furthermore, the development of robust calibration models across multiple environments underscores the potential of NIRS to provide accurate predictions of TMQ content under diverse conditions. The identification of specific wavelengths strongly associated with TMQ content opens the possibility of designing specific NIRS-based devices and sensors for TMQ content analysis in black cumin seeds.

Author Contributions

Conceptualization, L.V.; methodology, Ó.B. and L.V.; formal analysis, Ó.B. and L.V.; resources, L.V.; writing—original draft preparation, L.V.; writing—review and editing, Ó.B. and L.V.; supervision, L.V.; project administration, L.V.; funding acquisition, L.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Consejería de Transformación Económica, Industria, Conocimiento y Universidades, Junta de Andalucía, and FEDER, grant number QUAL21_023 IAS.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available upon request to the first author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
TMQThymoquinone
NIRSNear-infrared Reflectance Spectroscopy
SECStandard Error of Calibration
SECVStandard Error of Cross-validation
SEPStandard Error of Prediction

References

  1. Gupta, B.; Ghosh, K.K.; Gupta, R.C. Thymoquinone. In Nutraceuticals: Efficacy, Safety and Toxicity; Gupta, R.C., Ed.; Academic Press: London, UK, 2016; pp. 541–550. [Google Scholar] [CrossRef]
  2. Majdalawieh, A.F.; Yousef, S.M.; Abu-Yousef, I.A. Thymoquinone, a major constituent in Nigella sativa seeds, is a potential preventative and treatment option for atherosclerosis. Eur. J. Pharmacol. 2021, 909, 174420. [Google Scholar] [CrossRef]
  3. Shaukat, A.; Zaidi, A.; Anwar, H.; Kizilbash, N. Mechanism of the antidiabetic action of Nigella sativa and thymoquinone: A review. Front. Nutr. 2023, 10, 1126272. [Google Scholar] [CrossRef] [PubMed]
  4. Almajali, B.; Al-Jamal, H.A.N.; Taib, W.R.W.; Ismail, I.; Johan, M.F.; Doolaanea, A.A.; Ibrahim, W.N. Thymoquinone, as a novel therapeutic candidate of cancers. Pharmaceuticals 2021, 14, 369. [Google Scholar] [CrossRef]
  5. Darakhshan, S.; Pour, A.B.; Colagar, A.H.; Sisakhtnezhad, S. Thymoquinone and its therapeutic potentials. Pharmacol. Res. 2015, 95–96, 138–158. [Google Scholar] [CrossRef]
  6. Taborsky, J.; Kunt, M.; Kloucek, P.; Lachman, J.; Zeleny, V.; Kokoska, L. Identification of potential sources of thymoquinone and related compounds in Asteraceae, Cupressaceae, Lamiaceae, and Ranunculaceae families. Cent. Eur. J. Chem. 2012, 10, 1899–1906. [Google Scholar] [CrossRef]
  7. Ravi, Y.; Vethamoni, I.P.; Saxena, S.N.; Velmurugan, S.; Santanakrishnan, V.P.; Raveendran, M.; Bariya, H.; Harsh, M. Guesstimate of thymoquinone diversity in Nigella sativa L. genotypes and elite varieties collected from Indian states using HPTLC technique. Open Life Sci. 2023, 18, 20220536. [Google Scholar] [CrossRef] [PubMed]
  8. Thani, P.R.; Johnson, J.B.; Bhattarai, S.; Trotter, T.; Walsh, K.; Broszczak, D.; Naiker, M. An in-depth examination into how genotype, planting density, and time of sowing affect key phytochemical constituents in Nigella sativa seed. Seeds 2024, 3, 357–380. [Google Scholar] [CrossRef]
  9. Habib, N.; Choudhry, S. HPLC Quantification of thymoquinone extracted from Nigella sativa L. (Ranunculaceae) seeds and antibacterial activity of its extracts against Bacillus species. Evid.-Based Complement. Altern. Med. 2021, 2021, 6645680. [Google Scholar] [CrossRef]
  10. Mani, J.S.; Johnson, J.B.; Bhattarai, S.; Trottera, T.; Naiker, M. Phytochemistry and therapeutical potential of new Nigella sativa genotypes from Australia. J. Herbs Spices Med. Plants 2023, 29, 229–249. [Google Scholar] [CrossRef]
  11. Avula, B.; Wang, Y.H.; Ali, Z.; Khan, I.A. Quantitative determination of chemical constituents from seeds of Nigella sativa L. using HPLC-UV and identification by LC-ESI-TOF. J. AOAC Int. 2010, 93, 1778–1787. [Google Scholar] [CrossRef]
  12. Khaikin, E.; Chrubasik-Hausmann, S.; Kaya, S.; Zimmermann, B.F. Screening of thymoquinone content in commercial Nigella sativa products to identify a promising and safe study medication. Nutrients 2022, 14, 3501. [Google Scholar] [CrossRef] [PubMed]
  13. Shrivastava, A.K.; Shrestha, L.; Pokhrel, B.R.; Joshi, B.; Lamichhane, G.; Vidović, B.; Koirala, N. LC-MS based metabolite profiling, in-vitro antioxidant and in-vivo antihyperlipidemic activity of Nigella sativa extract. eFood 2023, 4, e107. [Google Scholar] [CrossRef]
  14. Osborne, B.G. Near-Infrared Spectroscopy in Food Analysis. In Encyclopedia of Analytical Chemistry, Online; Meyers, R.A., McGorrin, R.J., Eds.; John Wiley & Sons: Hoboken, NJ, USA, 2006; pp. 1–14. [Google Scholar] [CrossRef]
  15. Nagy, M.M.; Wang, S.; Farag, M.A. Quality analysis and authentication of nutraceuticals using near IR (NIR) spectroscopy: A comprehensive review of novel trends and applications. Trends Food Sci. Technol. 2022, 123, 290–309. [Google Scholar] [CrossRef]
  16. Banerjee, S.; Ghatak, B.; Naskar, H.; Ghosh, S.; Dhara, M.; Ali, B.; Das, N.; Tudu, B.; Bandyopadhyay, R. Discrimination of thymoquinone using near infrared spectroscopy technique. In Proceedings of the 2nd IEEE International Conference on Emerging Frontiers in Electrical and Electronic Technologies, Patna, India, 24–25 June 2022; pp. 462–463. [Google Scholar]
  17. Pasquini, C. Near infrared spectroscopy: A mature analytical technique with new perspectives—A review. Anal. Chim. Acta 2018, 1026, 8–36. [Google Scholar] [CrossRef]
  18. Mayr, S.; Strasser, S.; Kirchler, C.G.; Meischl, F.; Stuppner, S.; Beć, K.B.; Grabska, J.; Sturm, S.; Stuppner, H.; Popp, M.A.; et al. Quantification of silymarin in Silybi mariani fructus: Challenging the analytical performance of benchtop vs. handheld nir spectrometers on whole seeds. Planta Med. 2022, 88, 20–32. [Google Scholar] [CrossRef]
  19. Jiao, W.C.; Li, Y.K.; Jia, M.; Wang, D.M.; Qi, K.; Wang, X.D. Quick determination of sesame lignans in sesame oil using a portable near-infrared spectrometer. Food Sci. Technol. Camp. 2023, 43, e104422. [Google Scholar] [CrossRef]
  20. Haruna, S.A.; Li, H.; Wei, W.; Geng, W.; Luo, X.; Zareef, M.; Adade, S.Y.S.A.; Ivane, N.M.A.; Isa, A.; Chen, Q. Simultaneous quantification of total flavonoids and phenolic content in raw peanut seeds via NIR spectroscopy coupled with integrated algorithms. Spectrochim. Acta A Mol. Biomol. Spectrosc. 2023, 285, 121854. [Google Scholar] [CrossRef]
  21. Redha, A.A.; Torquati, L.; Langston, F.; Nash, G.R.; Gidley, M.J.; Cozzolino, D. Determination of glucosinolates and isothiocyanates in glucosinolate-rich vegetables and oilseeds using infrared spectroscopy: A systematic review. Crit. Rev. Food Sci. Nutr. 2024, 64, 8248–8264. [Google Scholar] [CrossRef]
  22. Hernández-Hernández, C.; Fernández-Cabanás, V.M.; Rodríguez-Gutiérrez, G.; Fernández-Prior, A.; Morales-Sillero, A. Rapid screening of unground cocoa beans based on their content of bioactive compounds by NIR spectroscopy. Food Control 2022, 131, 108347. [Google Scholar] [CrossRef]
  23. Workman, J. Spectral interpretation. In Handbook of Near-Infrared Analysis, 4th ed.; Ciurczak, E.W., Igne, B., Workman, J., Burns, D.A., Eds.; CRC Press: Boca Raton, FL, USA, 2021; pp. 45–67. [Google Scholar]
  24. Williams, P.C.; Sobering, D.C. How do we do it: A brief summary of the methods we use in developing near-infrared calibration. In Near Infrared Spectroscopy: The Future Waves; Davis, A.M.C., Williams, P., Eds.; NIR Publications: Chichester, UK, 1996; pp. 185–188. [Google Scholar]
  25. Towett, E.K.; Alex, M.; Shepherd, K.D.; Polreich, S.; Aynekulu, E.; Maas, B.L. Applicability of near-infrared reflectance spectroscopy (NIRS) for determination of crude protein content in cowpea (Vigna unguiculata) leaves. Food Sci. Nutr. 2013, 1, 45–53. [Google Scholar] [CrossRef] [PubMed]
  26. Cabrera-Bosquet, L.; Molero, G.; Bort, J.; Nogués, S.; Araus, J.L. High-throughput phenotyping and genomic selection: The frontiers of crop breeding converge. J. Integr. Plant Biol. 2012, 54, 312–320. [Google Scholar] [CrossRef] [PubMed]
  27. Greenwood, C.F.; Allen, J.A.; Leong, A.S.; Pallot, T.N.; Golder, T.M.; Golebiowski, T. An investigation of the stability of NIRS calibrations for the analysis of oil content in whole seed canola. In Proceedings of the 10th International Rapeseed Congress, Canberra, Australia, 26–29 September 1999. [Google Scholar]
  28. Elfadl, E.; Reinbrecht, C.; Claupein, W. Development of near infrared reflectance spectroscopy (NIRS) calibration model for estimation of oil content in a worldwide safflower germplasm collection. Int. J. Plant Prod. 2010, 4, 259–270. [Google Scholar]
  29. Sileoni, V.; Marconi, O.; Perretti, G.; Fantozzi, P. Evaluation of different validation strategies and long term effects in NIR calibration models. Food Chem. 2013, 141, 2639–2648. [Google Scholar] [CrossRef]
  30. Workman, J.; Weyer, L. Practical Guide to Interpretive Near-Infrared Spectroscopy; CRC Press: Boca Raton, FL, USA, 2007. [Google Scholar]
  31. Miller, C.E.; Eichinger, B.E. Determination of crystallinity and morphology of fibrous and bulk poly(ethylene terephthalate) by near-infrared diffuse reflectance spectroscopy. Appl. Spectros. 1990, 44, 496–504. [Google Scholar] [CrossRef]
  32. Rodriguez-Macadaeg, F.; Armstrong, P.R.; Maghirang, E.B.; Scully, E.D.; Brabec, D.L.; Arthur, F.H.; Adviento-Borbe, A.D.; Yaptenco, K.F.; Suministrado, D.C. Developing a multi-spectral NIR LED-based instrument for the detection of pesticide residues containing chlorpyrifos-methyl in rough, brown, and milled rice. Sensors 2024, 24, 4055. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Chemical structure of thymoquinone.
Figure 1. Chemical structure of thymoquinone.
Molecules 30 03985 g001
Figure 2. Histogram of seed thymoquinone content (mg g−1 seed) in 288 samples of the 2021 calibration set.
Figure 2. Histogram of seed thymoquinone content (mg g−1 seed) in 288 samples of the 2021 calibration set.
Molecules 30 03985 g002
Figure 3. Scatter plot of the independent validation of the 2021 calibration equation for thymoquinone content with a set of 313 seed samples harvested in 2022.
Figure 3. Scatter plot of the independent validation of the 2021 calibration equation for thymoquinone content with a set of 313 seed samples harvested in 2022.
Molecules 30 03985 g003
Figure 4. Scatter plot of the independent validation of the 2021 + 2022 calibration equation for thymoquinone content with a set of 179 seed samples of 2023 through three approaches: (A) validation with 100% of the samples; (B) using 25% of the samples to expand the equation and 75% for validation; (C) using 50% of the samples to expand the equation and 50% for validation.
Figure 4. Scatter plot of the independent validation of the 2021 + 2022 calibration equation for thymoquinone content with a set of 179 seed samples of 2023 through three approaches: (A) validation with 100% of the samples; (B) using 25% of the samples to expand the equation and 75% for validation; (C) using 50% of the samples to expand the equation and 50% for validation.
Molecules 30 03985 g004
Figure 5. NIRS spectrum shown as absorbance (log(1/R); R = reflectance) and its second derivative in the wavelength range from 1100 to 2500 nm. The second derivative is presented in arbitrary units.
Figure 5. NIRS spectrum shown as absorbance (log(1/R); R = reflectance) and its second derivative in the wavelength range from 1100 to 2500 nm. The second derivative is presented in arbitrary units.
Molecules 30 03985 g005
Figure 6. (A) Average NIRS spectra (second derivative, 1100 to 2500 nm) of 50 black cumin seed samples with low thymoquinone contents (TMQ; 0.07 to 0.20 mg g−1 seed) and high TMQ contents (1.42 to 1.68 mg g−1 seed); (B) Standard deviations between both average spectra; (C) Expanded area between 2050 and 2150 nm: (D) Expanded area between 2200 and 2300 nm. The second derivative spectra are presented in arbitrary units.
Figure 6. (A) Average NIRS spectra (second derivative, 1100 to 2500 nm) of 50 black cumin seed samples with low thymoquinone contents (TMQ; 0.07 to 0.20 mg g−1 seed) and high TMQ contents (1.42 to 1.68 mg g−1 seed); (B) Standard deviations between both average spectra; (C) Expanded area between 2050 and 2150 nm: (D) Expanded area between 2200 and 2300 nm. The second derivative spectra are presented in arbitrary units.
Molecules 30 03985 g006
Table 1. Calibration and validation statistics for thymoquinone content (mg g−1 seed) developed with seeds harvested in 2021 and expanded with seeds harvested in 2022.
Table 1. Calibration and validation statistics for thymoquinone content (mg g−1 seed) developed with seeds harvested in 2021 and expanded with seeds harvested in 2022.
TypeYearnMeanRangeSE aR2 b
Calibration20212887.750.68–14.460.840.96
Cross-validation20212887.750.68–14.460.970.94
Validation202231312.001.74–17.301.280.78
Calibration2021 + 202260110.040.68–17.301.100.92
Cross-validation2021 + 202260110.040.68–17.301.150.92
a Standard error of calibration (SEC), standard error of cross-validation (SECV), or standard error of prediction (SEP) in the case of external validation. b In cross-validation, the software computes the statistic “one minus variance ratio”, which is an estimation of R2.
Table 2. Validation statistics for the 2021 + 2022 calibration equation with seeds harvested in 2023 following three strategies: (a) validation with 100% of the samples; (b) using 25% of the samples to expand the equation and 75% for validation; (c) using 50% of the samples to expand the equation and 50% for validation.
Table 2. Validation statistics for the 2021 + 2022 calibration equation with seeds harvested in 2023 following three strategies: (a) validation with 100% of the samples; (b) using 25% of the samples to expand the equation and 75% for validation; (c) using 50% of the samples to expand the equation and 50% for validation.
Type% 2023 SamplesNMeanRangeSE aR2 b
(a) Validation1001798.751.08–13.491.180.85
(b) Cross-validation256469.920.68–17.301.120.92
(b) Validation751348.831.26–13.491.080.86
(c) Cross-validation506919.850.68–17.301.110.92
(c) Validation50898.701.08–13.491.010.88
a Standard error of cross-validation (SECV), or standard error of prediction (SEP) in the case of external validation. b In cross-validation, the software computes the statistic “one minus variance ratio”, which is an estimation of R2.
Table 3. Evaluation of the calibration equations’ predictability for selecting high thymoquinone content based on arbitrary selection cutoffs of 5%, 10%, and 15% of the samples based on NIRS analyses and compared with HPLC analyses, following three strategies: A: No addition of samples of 2023 in the calibration set; B: using 25% of the 2023 samples for expanding the calibration set; C: using 50% of the 2023 samples to enlarge the calibration set.
Table 3. Evaluation of the calibration equations’ predictability for selecting high thymoquinone content based on arbitrary selection cutoffs of 5%, 10%, and 15% of the samples based on NIRS analyses and compared with HPLC analyses, following three strategies: A: No addition of samples of 2023 in the calibration set; B: using 25% of the 2023 samples for expanding the calibration set; C: using 50% of the 2023 samples to enlarge the calibration set.
Selection IntensityABC
5%67%89%100%
10%56%89%89%
15%67%81%85%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ballesteros, Ó.; Velasco, L. Analysis of Thymoquinone Content in Black Cumin Seeds Using Near-Infrared Reflectance Spectroscopy. Molecules 2025, 30, 3985. https://doi.org/10.3390/molecules30193985

AMA Style

Ballesteros Ó, Velasco L. Analysis of Thymoquinone Content in Black Cumin Seeds Using Near-Infrared Reflectance Spectroscopy. Molecules. 2025; 30(19):3985. https://doi.org/10.3390/molecules30193985

Chicago/Turabian Style

Ballesteros, Óscar, and Leonardo Velasco. 2025. "Analysis of Thymoquinone Content in Black Cumin Seeds Using Near-Infrared Reflectance Spectroscopy" Molecules 30, no. 19: 3985. https://doi.org/10.3390/molecules30193985

APA Style

Ballesteros, Ó., & Velasco, L. (2025). Analysis of Thymoquinone Content in Black Cumin Seeds Using Near-Infrared Reflectance Spectroscopy. Molecules, 30(19), 3985. https://doi.org/10.3390/molecules30193985

Article Metrics

Back to TopTop