The Potential of Spectroscopic Techniques in Coffee Analysis—A Review

: This review provides an overview of recent studies on the potential of spectroscopy techniques (mid-infrared, near infrared, Raman, and ﬂuorescence spectroscopy) used in coffee analysis. It speciﬁcally covers their applications in coffee roasting supervision, adulterants and defective beans detection, prediction of specialty coffee quality and coffees’ sensory attributes, discrimination of coffee based on variety, species, and geographical origin, and prediction of coffees chemical composition. These are important aspects that signiﬁcantly affect the overall quality of coffee and consequently its market price and ﬁnally quality of the brew. From the reviewed literature, spectroscopic methods could be used to evaluate coffee for different parameters along the production process as evidenced by reported robust prediction models. Nevertheless, some techniques have received little attention including Raman and ﬂuorescence spectroscopy, which should be further studied considering their great potential in providing important information. There is more focus on the use of near infrared spectroscopy; however, few multivariate analysis techniques have been explored. With the growing demand for fast, robust, and accurate analytical methods for coffee quality assessment and its authentication, there are other areas to be studied and the ﬁeld of coffee spectroscopy provides a vast opportunity for scientiﬁc investigation.


Introduction
Coffee is one of the most widely traded commodities worldwide and is produced in over 50 countries.According to ICO [1], world green coffee production reached 10.52 million tonnes in 2020, a 6.3% increase compared to 2019.Generally, production has been increasing year by year, clearly indicating an upward trend in coffee consumption.This can be attributed to coffees' desirable sensory properties, the stimulant effects of caffeine, as well as its benefits on human health.More than 100 species in the genus Coffea have been identified; however, only two species, namely, Coffea canephora and Coffea arabica, commonly known as robusta and arabica, respectively, are traded commercially.Arabica coffee accounts for more than 60% of the world's coffee production with a higher commercial value than robusta coffee due to its superior aroma characteristics and lower caffeine content [2].
Assessment of coffee along its supply chain, i.e., from farm to cup, is crucial in ensuring consistent supply of quality products to consumers.The overall quality of a cup of coffee is directly linked to careful control of a number of parameters including the quality of cherries, raw materials (species, origin, presence of adulterants and defective beans, type of processing method (wet or dry), chemical composition etc.), roasting process, storage conditions, and preparation parameters and methods.In the past years, different analytical techniques have been developed and used to authenticate and assess the quality of coffee.They may involve target analyses, in which chemical profiling is done and the identified compounds are assessed against a set limit established by law or stated on the label.Non-target analyses, in contrast, provide a fingerprint of the whole sample that can then be utilized to detect possible adulteration.Physicochemical and chromatographic analytical methods are the predominant methods in the evaluation of coffee species, origin, and chemical composition [3,4].Although very objective, these methods usually require experienced personnel, and are labor intensive, expensive, and time consuming.Visual coffee inspection is also a common practice that is done to check for the presence of defective beans, adulterants, degree of roast as well as to distinguish coffee species based on the colour, shape, and size of the bean.This is a subjective method that depends mainly on the experience and skills of the inspector and hence could be subject to erroneous conclusions.
The global coffee market is increasingly stressing the importance of the quality and chemical characteristics of coffee, which calls for appropriate analytical methods.Spectroscopy is a promising technique in this industry because it is rapid, non-destructive, requires no/minimal sample preparation, and integrates easily into other processes.Infrared spectral techniques have gained considerable interest in evaluation of coffee with respect to its chemical composition, species, geographical origin, presence of adulterants and defective beans, roast degree, sensory attributes, etc. [5][6][7][8][9][10].The potential of Raman and fluorescence spectroscopy in coffee analysis have also been investigated [11][12][13][14].
Potential application of spectroscopy techniques in the food industry has gained interest over the last few years.Therefore, the aim of this review is to provide a summary of the recent studies on applications of near infrared, fluorescence, mid infrared, and Raman spectral techniques in coffee assessment.It is hoped that this will provide an overview to interested parties on what can be achieved using spectroscopy techniques in coffee analysis.Additionally, aspects that still require clarification or further investigations are identified.However, this review does not address all the aspects related to coffee assessment.Important topics such as prediction of coffee processed using different methods, differentiation of organic from inorganic coffee, monitoring of coffee quality during storage, among others, were not considered.Limited information is available, and therefore these are areas that could be considered in the future given their significant impact on the quality of the beverage.In this review, first a brief introduction to the basic concepts of the aforementioned spectroscopic methods is provided followed by discussion of the latest research carried out to develop new approaches based on these methods.

Overview of Spectroscopic Techniques
Spectroscopic techniques have gained interest in the food industry over the past years due to their potential in assessing the chemical composition and quality of foods.These techniques involve interaction between matter and electromagnetic radiation, which provides useful information about food samples and can be used for both qualitative and quantitative analysis.These methods present some advantages over conventional analytical methods given their rapidity, provision of on-line analysis, minimal or no sample preparation, and the fact that they are non-destructive [15].Fluorescence spectroscopy techniques utilizes a beam of light, which excites the electrons in molecules of certain compounds commonly known as fluorophores, causing them to emit light.At a specific wavelength, fluorophores will absorb energy in the form of light and later liberate this energy in the form of light emission at a longer wavelength.
The processes that occur between light absorption and emission are clearly explained in Figure 1 for fluorescence, NIR, and Raman spectroscopy.In fluorescence (Figure 1a) visible light is involved, where the electronic state of a molecule is changed.In the figure, S 0 and S 1 denote the singlet ground and first excited electronic state, respectively.The fluorescence process starts with the excitation where a molecule absorbs light and gets excited from ground state (S 0 ) to a higher vibrational level in S 1 .This is followed by rapid relaxation of molecules from an upper vibrational level to the lowest level in the first electronic excited state S 1 , without any radiation.This process is known as internal conversion and normally happens within 10 −12 s or less.Finally, emission occurs usually 10 −8 s after excitation, when the molecule goes back to any vibrational state of its electronic ground state, S 0 .In modern food analysis, the fluorescence spectroscopy method has been extensively used in the evaluation of food components, adulterants, contaminants, and additives, thanks to its high specificity and sensitivity [16,17].

Raman NIR Fluorescence
Virtual level 200 -4000 cm -1 4000 -12,000 cm -1 800 -2500 nm 270 -600 nm Infrared (IR) spectroscopy is among the most common vibrational spectroscopic methods used in the food industry.It is usually divided into three regions of wavelength: far infrared (15 µm-1 mm or 400-10 cm −1 ), mid infrared (MIR: 4000-400 cm −1 or 2500-25,000 nm), and near infrared (NIR:13,333-4000 cm −1 or 800-2500 nm) [18].IR spectroscopy is based on the principle that every chemical bond vibrates with specific frequency, which corresponds to certain amount of energy.These vibrations induce absorption bands, useful in the qualitative and quantitative analyses of many molecules in samples.Vibrations of certain functional groups, for instance C=O, NH 2 , -OH, -CH 3 , C 6 H 5 -etc., always produce bands in the infrared spectrum within frequency ranges that are well-defined irrespective of the molecule having the functional group.Identification and designation of these bands to particular chemical groups provide specific information about the product being investigated [17,19].

Mostly ground oscillation
Broad bands rising from combinations and overtones of the fundamental vibrations relating to O-H, N-H, and C-H chemical bonds characterize spectra in the NIR region (Figure 1b).Samples with different chemical bonds, i.e., chemical composition, will record different spectra, which are unique to them acting as a 'fingerprint'.NIR spectroscopy has been widely used for analysing the quality and chemical composition of foods including meat, fish, milk, cereals, fruits, cereals, wine, beer, eggs, and their related products [17,19,20].Despite its great potential in food analysis, the interpretation of the spectra in NIR analysis is challenging due to its broadband nature, which consists of overlapping overtone and combination bands.Specific compounds are not as well determined as they are in the MIR region.This has however, been addressed by data processing using chemometric methods to relate spectral information to properties of the samples [21].
Mid-infrared spectroscopy provides information on the related rotational-vibrational structure as well as fundamental vibrations of the chemical bonds in the samples.Through technological advances, original MIR spectrophotometers, which measured the amount of absorbed energy based upon the fact that the infrared light frequency varies when passing through a monochromator, have been replaced by Fourier transform infrared (FTIR) spectrophotometers.FTIR equipment uses interferometers instead of monochromators, yielding advantages such as high sensitivity and precision, fast analysis, and a high signalto-noise ratio.Consequently, the term "FTIR spectroscopy" and "MIR spectroscopy" are sometimes interchangeably used [22], although FTIR is also applied for NIR.The midinfrared spectrum is comprised of four regions: the triple bond, the X-H stretching, the fingerprint, and the double bond regions.Fundamental vibrations from O-H, C-H, and N-H stretching characterize the X-H stretching region.The triple bond stretching region is generally due to C≡N and C≡C bonds vibrations, whereas in the double bond region, absorption bands characterized by C=N, C=C, and C=O occur.Bending and skeletal vibrations characterize the fingerprint region [17].Utilization of MIR spectroscopy has been used as an effective analytical method in the food industry [10,23].The far infrared (far-IR) spectral region (10-400 cm −1 ) is the range of wavenumbers characterized by largeamplitude anharmonic vibrations.The technique based on the far-IR has shown to be more suitable in providing information on dynamics of proteins and analyzing structures that are highly ordered, for instance, the formation of fibrillar, due to its sensitivity to vibrational modes resulting from hydrogen bonds and peptide skeletons [24].
In Raman spectroscopy (Figure 1c), a virtual excited energy state is involved.Comparing Raman spectroscopy to IR spectroscopy, the information provided by Raman is due to inelastic light scattering, as opposed to IR, which relies on elastic scattering.In contrast to IR, Raman spectra are obtained from symmetric and non-polar molecules.Raman scattering results from inelastic scattering of the incident photons through which energy is received from or transferred to the molecule owing to the changes in the vibrational or rotational modes of sample molecules.This causes a change in the energy, and thus the frequency of the scattered light [25].
A Raman spectrum is a plot of scattering intensity vs. wavenumber or wavelength consisting of bands, which corresponds to a Raman shift that occurs because of the difference in excitation and emission light energy (see Figure 1c).So-called Stokes-Raman lines are due to smaller emission energy than excitation, and for Anti-Stokes-Raman lines this behavior is reversed, but with much smaller intensity.The obtained Raman bands characterize specific functional groups and/or chemical bonds (C-X (X = Cl, I, F, or Br), C-S, C-NO 2 , S-S, C=N, C=C) of materials and by using them, it is possible to obtain fingerprints of materials with Raman spectroscopy.As the bands' intensity is directly proportional to the analyte concentration of the materials, quantitative analyses using Raman spectroscopy is possible [26].Analyses of safety, quality, and composition of foods are among the applications of Raman spectroscopy in the food industry [11,27,28].
In summary, the potential of spectroscopic methods in the food industry is enormous and broad.With regard to coffee, there is much literature on the utilization of these techniques in determining coffees authenticity, quality, geographical origin, species, and composition and detecting misrepresentation and adulteration.Table 1 shows reported absorption bands of coffees chemical components in the IR and Raman region.

Coffee Roasting and Monitoring
Roasting is a significant step in coffee processing where physical, chemical, and structural transformations occur.Moreover, there is production of melanoidins and thousands of flavour compounds, all of which are accountable for the characteristic colour, aroma, and flavour of the beverage [57].Therefore, roasting is a step with a significant impact on the quality of the final product, thus warranting its control.Several studies on monitoring and controlling the coffee roasting process using spectroscopic methods have been conducted (Table 2).Bertone et al. [58] studied the ability of Fourier transform (FT-NIR) spectroscopy to determine roast degree (best on colour) and varietal composition of coffee simultaneously.In the study, 130 blends of roasted-ground coffee containing both arabica and robusta species at different levels were used.The partial least square regression (PLS) algorithm was used, and validation was done with a completely independent external set.The developed model had a root mean square error of prediction (RMSEP) of 1.28 Au for colour and 4.34% (w/w) for arabica content in the blends, thus demonstrating a fast and reliable assessment of important coffee characteristics.Online monitoring of acidity during coffee roasting has also been studied using FT-NIR techniques [38].Coffee samples (arabica and robusta) were roasted using fast and slow roasting designs at 220 • C for 17 min and 183 • C for 25 min, respectively.Online acquisition of the spectra was done in the 10,000 and 4000 cm −1 range.In order to develop a calibration model between titratable acidity and online-acquired NIR spectra, important wavelength regions were selected (5150-5950 cm −1 , 4000-4760 cm −1 , 6320-7080 cm −1 , and 7470-9400 cm −1 ) and used in the development of a PLS model for acidity.These selected regions are characteristic of 5-caffeoylquinic acid and were highlighted by Ribeiro et al. [40], who aimed to link perceived acidity of beverages with arabica coffees chemical composition.The proposed model had a RMSEP and range-error-ration (RER) of 0.16 and 11, respectively, indicating its fairness in acidity prediction during coffee roasting.Yergenson and Aston [7] carried out a similar study with the aim of determining coffee roast degree online by controlling acidity.Coffee was roasted using four different temperature-time treatments and NIR spectra acquired online in the 4000-7405 cm −1 range.Measurement of titratable acidity over a varied range of roast times and temperatures followed by modeling the findings allowed for the quantification of the relationships between degree of roast, acidity, and roast time.
In another study, Santos et al. [31] explored the ability of FT-NIR spectroscopy in in-line monitoring of coffee roasting process by measuring sucrose and colour.PLS regression was utilized to develop predictive models by regressing spectral data against content of sucrose and L*, a*, and b* colour values.For the case of sucrose, important wavenumbers with significant contribution included 4980-5000, 5385-5410, 5325, 5075, and 5140 cm −1 .The C=O second overtone region, first overtone and combination band region of O-H and C=O bonds characterize the selected wavenumbers.Thus, the region could be associated with carbohydrates, organic acids, and proteins.Colour characterization of coffee beans was based on L* parameters where wavenumbers 6100, 5300, 5200, and 4965 cm −1 were selected as having the highest influence.The regions capture vibrations from C=O (5300 cm −1 ), combination band vibrations from O-H (5200 cm −1 ), first overtone vibrations of C-H (6100 cm −1 ), and combination vibrations of N-H (4965 cm −1 ).The authors concluded that free amino acids, carbohydrates, and chlorogenic acids could be the main compounds associated with colour during coffee roasting because they are involved in important chemical reactions to form colour pigments.The developed PLS models showed a strong prediction ability, with a co-efficient of determination of 0.85 and RER higher than 10.
Yergenson and Aston [6] explored the use of in-situ NIR spectroscopy in the prediction of cracking events (start and end) during coffee roasting to provide a more vigorous technique for controlling degree of roast based on the cracks.During coffee roasting, two sets of popping sounds occur (first and second cracks) that are important roast degree indicators for determining the roast endpoint.In this study, coffee samples were roasted using different time-temperature profiles.Based on the PLS regression (PLSR) with audio recordings from coffee roasting, in-situ NIR spectroscopy proved to be a reliable method in predicting the start and end times of first and second crack events.Models exhibited a good relationship between predicted and reference values with RMSEP values of 0.0068, 0.0091, 0.0041, and 0.0070 Au for the start of the first crack, end of the first crack, start of the second crack, and end of the second crack, respectively.
Pires et al. [59] successfully employed multivariate calibration and NIR spectroscopy as a substitute to the Agtron method to predict roast degrees in ground coffees and coffee beans.With PLS the method, the development of mathematical models was based on the relationship between data sets of NIR spectra and Agtron reference results to predict Agtron values of new coffee samples.In order to build representative models, all profiles of Agtron roasting were considered.The following NIR spectra regions were important for modeling in the whole bean coffee model: 6770-6992, 7482-8000, 5774-5992, 4136-4391, and 4806-5257 cm −1 .For the ground coffee model, the NIR spectra regions included 7447-8000, 6783-6922, 5317-5445, 5034-5219, and 4173-4380 cm −1 .The developed models presented promising results for roasting profiles prediction in roasted whole coffee beans as well as ground coffees with RMSEP values of 4.48 and 3.67, respectively.
Real-time detection of faults during the coffee roasting process by NIR spectroscopy and multivariate statistical process control (MSPC) based on principal component analysis (PCA) has also been evaluated.In one study, five batches with enforced disturbances (non-nominal batches) were prepared to mimic anomalous conditions of roasting and to authenticate the detection capability of the MSPC method.The non-nominal batches had deviations from nominal batches, i.e., higher amount of robusta coffee in the blend, roasting with lower and higher roasting power, and use of higher and lower initial mass of green coffee beans.PCA analysis of the acquired in-line NIR spectra demonstrated clear differences among non-nominal batches and between nominal batches.The best result was provided by a modelling method based on a time sliding window in terms of differentiating batches with and without disturbances, resourcing to typical MSPC charts: squared predicted error and Hotelling's T 2 statistics.In addition, a PCA model including a four minute time window with three principal components efficiently detected all the disturbances studied [60].

Prediction of Specialty Coffee Quality and Sensory Attributes
Sensory analysis of coffee is a complex process owing to the disparities of aromas, flavours, as well as diversity of compounds produced during coffee roasting.The cupping method is by far the most common professional technique for evaluating coffee, which entails professional tasters evaluating coffee based on physical quality (size, odor, colour, and shape of the beans) and cup quality (fragrance, aroma, astringency, acidity, flavour, body, bitterness, aftertaste, and overall quality) [83].However, this method is somewhat subjective and requires professional personnel.Additionally, it is associated with various shortcomings: it is time consuming, there is disparity in sensory perception amongst evaluators, a lack of reproducibility, and inadequate conclusions about samples, etc.In view of this, spectroscopy techniques may be used as alternative tools in the sensory evaluation of coffee as they are rapid and have the capability to reproduce results (Table 2).
Baqueta et al. [67] studied the ability of NIR spectroscopy in association with the PLS method to determine coffees sensory characteristics.Coffee samples with different variations including species, production region, variety, drying conditions, transport, postharvest process, storage times, coffee blend, coffee composition, and roasting process were used.Professional cuppers evaluated the samples based on the fragrance, acidity, aroma, bitterness, body, flavour, astringency, overall quality, and aftertaste.NIR spectra of the same samples were also taken.The performance of PLS models for each parameter were validated using the following merit parameters: sensitivity, accuracy, linearity, residual prediction deviation, fit, quantification, and detection limits.The developed models were suitable to quantify, detect, differentiate, and predict sensory characteristics of coffee samples, as all sensory characteristics were predicted with acceptable values consistent with the merit parameters.
Similarly, Ribeiro et al. [40] carried out a study using NIR in conjunction with PLS with the aim of developing predictive models for a reproducible and objective sensory analysis of coffee.Additionally, the correlation between beverage sensory characteristics and chemical composition of the green coffee bean was determined.Experts assessed the coffee samples based on flavour, acidity, bitterness, body, overall quality, and cleanliness, assigning a score to each attribute.PLS models were constructed for each attribute and they exhibited good correlation between estimated values and those supplied by the experts with RMSEP values of 0.37, 0.30, 0.25, 0.37, 0.42, and 0.30 for bitterness, acidity, flavour, cleanliness, overall quality, and body, respectively.The researchers observed that, in roasted beans, lipids and proteins were closely related to body attribute of the beverage, chlorogenic acids to acidity, caffeine and chlorogenic acids to bitterness, and finally caffeine, trigonelline, chlorogenic acid, polysaccharides, sucrose, and protein to flavour, cleanliness, and overall quality.
The market for specialty coffee is growing constantly due to the changing preferences of consumers.Specialty coffees have a higher quality than commercial coffees.They are characterized by unique flavours, have a known geographical origin, and commonly comprise coffee beans certified as organic, or with rainforest alliance and fair-trade certifications, etc.On the global market, specialty coffees are traded with a price premium of approximately 20-50% when compared to regular coffees [84].Therefore, there is a need for more objective and reliable techniques for authenticating such coffees.Tolessa et al. [69] developed a model to predict the quality of specialty coffee based on green beans using NIR spectroscopy in conjunction with PLS analysis.Samples consisted of coffee from different environments (low, mid, and high altitudes in Ethiopia) and processed differently (washed, dry, and semi-washed methods).Professional coffee testers evaluated the samples based on body, acidity, overall cup preference, aroma, uniformity, flavour, aftertaste, cup cleanness, balance, and sweetness.The proposed models provided reliable predictions for the quality of a specialty cup as well as its different quality characteristics with RMSEP values of 1.04, 0.27, 0.22, 0.27, and 0.24 for total quality, acidity, overall preference for the cup, aftertaste, and body, respectively.NIR spectroscopy has also been explored as an alternative to traditional methods of sensory evaluation (cupping tests) to predict flavours of specialty coffees.Models were developed using deep convolutional neural networks (DCNN) and the support vector machine (SVM) method.Training was then done and used for the prediction of several specialty coffee flavours (as presently being done by professional evaluators) utilizing readings from NIR spectra as the inputs.Descriptions of flavour were classified into nine flavour classes: fruity, floral, fermented/sour, vegetable/green, spices, roasted, sweet, cocoa/nutty, and other.The two methods yielded comparable performance, having the recall and accuracy of 70-73% for SVM and 75-77% for DCNN.This study offered an innovative and objective method in the prediction of intricate flavours present in specialty coffees [68].A study by Arboleda [70] discriminated civet coffee from ordinary processed coffee utilizing NIR spectroscopy with the help of feed forward back propagation artificial neural networks.The following wavelengths showed major differences among the two sample groups: 1088 nm, 907 nm, 1650 nm, and 1540 nm.A good model was developed with classification scores of 95-100%.
De-Araújo et al. [33] proposed a new non-destructive methodology based on NIR spectroscopy and CACHAS (chemometrics-assisted colour histogram-based analytical systems) for the verification of roasted-ground gourmet coffees (specialty coffee) with no sample preparation.NIR spectra of the samples (36 traditional, 10 superior, and 44 gourmet) were acquired in the 4000-10,000 cm −1 range.Digital images of the samples were made using a scanner module from an HP Deskjet multifunctional printer.Histograms in the hue-saturation-intensity (HSI), red-green-blue (RGB), and grayscale (GS) colour systems from each sample were acquired from a circular region of interest (ROI).In this study, one-class partial least squares (OC-PLS) and data-driven soft independent modeling of class analogy (DD-SIMCA) were employed as one-class classifiers (OCC).Models were developed using different pre-processing techniques and combinations of colour histograms for NIR spectroscopy and CACHAS, respectively.Performance of the constructed models were assessed with regard to sensitivity and specificity.DD-SIMCA using RGB histogram for CACHAS and offset correction for the NIR spectra reported the best results as they both correctly recognized 100% of the studied samples in the training set and test set.
The potential of MIR spectroscopy in association with chemometrics in predicting the cup quality of coffees roasted to different roast degrees were examined (Table 2) [45].Coffee samples were first classified based on cup quality by skilled cuppers from worst to best.The samples were then roasted to dark, medium, and light roast degrees.PCA was used to show variability among the data as well as to detect probable outliers, whereas classification models were developed using PLS discriminant analysis (PLS-DA).Different preprocessing techniques of the raw data were compared and, with respect to PCA, the best clustering of the samples was achieved with second derivative application to the wavenumber ranges of 1680-1800, 2800-2960, and 1130-1249 cm −1 .Models were developed based on a two-level PLS-DA hierarchical strategy.In the first level, coffee was classified as of low or high quality followed by separation based on quality of the cup in the second level.Spectral regions 2918, 1749, 1038, and 806 cm −1 were the most important for the differentiation between coffees of low or high quality, whereas in the classification based on cup quality there was no particular band region that characterized one particular class.However, there were differences among the intensities.Similarly, Belchior et al. [8] investigated FTIR potential for quantitative valuation of the sensory attributes of specialty coffees.Coffee samples were evaluated by professional cuppers according to the Specialty Coffee Association of America (SCAA) protocol for sensory analysis of coffee.Based on the scores, PLS regression models were then constructed to establish and predict a sensory profile.The models showed a good relationship between predicted and experimental data with low RMSEP values of 0.23.However, there were no specific peaks significant for the coffee classification determination when correlated to the SCAA scores for the entire spectra.This can be explained by the fact that different compounds affecting the sensory profile of coffee absorb throughout the entire spectrum.
Abreu et al. [44] compared the performance of ultraviolet-visible spectroscopy (UV-Vis), high-performance liquid chromatography (HPLC), and MIR to discriminate between traditional and specialty coffees using exploratory factor analysis (FA).Specialty and traditional roasted ground coffee samples were extracted using the best solvents (water and dichloromethane), i.e., those that produced the most divergent metabolic fingerprints.Among the methods investigated, HPLC and UV-Vis spectral fingerprints of the water extracts provided more relevant discriminatory information than did MIR.Fatty acids, chlorogenic acids, organic acids, trigonelline, and lipids were the most important compounds for this discrimination.A method to discriminate espresso coffees based on their sensory characteristics using attenuated total reflectance Fourier transform infrared spectroscopy (ATR-FTIR) and PLS-DA models was developed by Belchior et al. [71].Samples were evaluated based on their flavour, aroma, acidity, aftertaste, and body, and models were developed for each attribute.The models classified the samples based on these attributes with good values of specificity and sensitivity for both calibration and validation sets.Proteins, lipids, trigonelline, caffeine, carbohydrates, chlorogenic acids, and carboxylic acids were linked to the body, flavour, and aftertaste.The acidity was linked to the presence of chlorogenic acid, carboxylic acids, and alcohols, whereas esters, ketones, overall acids, and aldehydes contributed to the aroma of the beverage.
Suhandy and Yulia [12] studied the feasibility of fluorescence spectroscopy in discriminating Indonesian specialty coffees (Pea berry, Pagar Alam, and Civet).Soft independent modeling of class analogy (SIMCA) and PCA were used as chemometric methods.The PCA approach allowed grouping of the samples into three clusters where an excitation of 370 nm was associated with this observation.In contrast, SIMCA models showed a high degree of specificity and sensitivity for calibration and prediction data sets.

Detection of Defective Beans and Adulterants in Coffee
Coffee has a high market value and as a result, its adulteration has become an extensive practice for economic gain.It can include the addition of spent coffee grounds, chicory, coffee husks, cereals (e.g., barley, corn, wheat, and rice) as well as the substitution of arabica coffee, which is more expensive, with robusta coffee [85].It is important to emphasize that some countries have allowed the use of specific adulterants up to a certain level.However, it is illegal if the information is not declared on the label.Therefore, fast and reliable techniques are needed to protect consumers from fraudulent practices, and this has necessitated development of new techniques based on spectroscopy (Table 2).Correia et al. [32] proposed a methodology for identification and quantification of adulterants in roasted coffee based on NIR spectroscopy and multivariate calibration by PLS and PCA. Green coffee beans were roasted to different roast levels (dark, medium, and light), ground, and mixed with adulterants (robusta coffee, corn, and peels/sticks) in a range of 1 to 100% w/w.The PCA approach allowed for recognition of adulterated and pure samples with first and second components accounting for over 90% of the total spectral variance.PLS models with good sensitivity were constructed.However, the model for quantification of corn in coffee samples was the best with RMSEP and coefficient of determination (R 2 ) of 4% and 0.9788, respectively, compared to that of the peels/sticks.In regard to roast levels, light roasted samples gave the best RMSEP values (2.8%).
Ebrahimi-Najafabadi et al. [61] studied the adulteration of ground-roasted coffee with barley utilizing NIR spectroscopy and chemometrics methods.A widely applicable model was constructed using nine varieties of coffee samples with their mixtures adulterated with four types of barley, all, roasted to different roast degrees.Barley was mixed at levels ranging from 2 to 20% w/w.A PLS regression was developed to obtain a quantitative model for prediction of adulteration levels.The model proposed provided dependable predictions of barley adulteration at levels as low as 2% w/w with low values of root mean square errors (RMSE).Selection of variables was done using genetic algorithms, and the following spectral regions were selected as the most significant in identifying coffee adulterated with barley: 6032, 5748, 4788-4880, 4628-4688, 4336, and 4276 cm −1 .The regions are ascribed to the first overtone of N-H, C=O, C-H, O-H, and S-H functional groups of ROH, ArOH, H 2 O, RNH 2, and CONHR.In a similar approach, Winkler-Moser et al. [62] evaluated NIR spectroscopy in conjunction with PLS regression analysis for corn adulteration detection in ground-roasted coffee samples.Different samples were prepared by mixing corn with coffee at concentrations of 0, 1, 5, 10, 15, and 20% w/w.Based on the confidence intervals of predicted percent corn at specific levels of adulteration, the developed model seemed not to be reliable in the detection of all coffee samples adulterated at levels below 4%.This contrasts with the finding of Ebrahimi-Najafabadi et al. [61], whose model was able to detect coffee adulteration with barley at 2% levels.
The capability of NIR hyperspectral imaging in combination with multivariate curve resolution-alternating least squares (MCR-ALS) to detect and quantify adulterants in roasted-ground coffee has also been reported [86].Four coffee adulterants were studied, i.e., coffee husks, wood sticks, soil, and corn kernels.They were added to the roastedground coffee in levels of 1.0 to 40% (w/w).Quantification of adulterants was based on the utilization of the MCR correlation constraint, whereas detection was achieved through comparison of the reference spectrum of each adulterant with the spectra recovered by the MCR model utilizing only the adulterated samples.For detection, the values of the coefficient of correlation of the studied adulterants ranged from 0.90 to 0.99.However, the absolute errors for the quantification of adulterants in the developed models were below 4%.The findings pointed out the practicability in the application of the methodologies proposed.Flores-Valdez et al. [47] successfully employed FT-NIR spectroscopy coupled with chemometrics for identification and quantification of adulterants in coffee.Coffee samples were adulterated with soy, corn, barley, oat, coffee husks, and rice, in proportions ranging from 1 to 30% in increments of 1%.They developed a SIMCA model based on the percentage rejection (selectivity) and recognition (sensitivity), which allowed the discrimination of adulterated and unadulterated samples with an accuracy of 100%.The quantitative model with the highest performance corresponded to the PLS1 algorithm with an R 2 ≥ 0.99, standard error of prediction (SEP) of 0.45-0.94,and standard error of calibration (SEC) of 0.39-0.82.To build this model, the most significant spectral range considered were 800-1800 and 2800-3500 cm −1 because they showed the highest variability of the spectra, signifying the change in absorbance as well as increase in the levels of adulterants in the samples.In another study, Brondi et al. [63] compared differential scanning calorimetry (DSC) and attenuated total reflectance-Fourier transform infrared (ATR-FTIR) spectroscopy methodologies together with chemometric analysis for detection and quantification of corn in roasted-ground coffee.Adulterated samples were prepared mixing coffee roasted to light, medium, and dark roasts with ground roast corn in levels ranging from 0.5 to 40% (m/m).PLS and PCA models were constructed to quantify adulteration levels and to distinguish non-adulterated samples from adulterated ones, respectively.PCA models built with FTIR and DSC data were able to detect adulteration of coffee with corn in concentrations below 1%.Conversely, PLS models demonstrated a good correlation between reference concentrations and estimated values with RMSEP of 1.53% for FTIR and 3.94% for DSC.
The presence of defective beans (i.e., insect damaged, broken, sour, black, mouldy, or immature) significantly affects the price and quality of the batch as well as the beverage.According to Franca et al. [87], defective beans roast differently, acquiring a different colour after roasting and imparting undesirable organoleptic characteristics to the beverage, which decreases cup quality.Therefore, it is vital to develop techniques for rapid and accurate assessment of coffee quality.Craig et al. [41] successfully developed a technique based on FTIR and NIR to quantify defects in roasted coffees.Quantitative models based on PLS regression were developed with the aim of predicting defective bean percentage in mixtures with non-defective beans.The mixtures were prepared by mixing each type of defective bean (immature, black, light sour, and dark sour) with non-defective ones, with the defect amounts ranging from 3 to 30% in steps of 3%.PLS regression models provided accurate predictive results with correlation coefficients and root mean square error of validation (RMSEV) of 0.891 and 0.032, respectively, for FTIR, and 0.953 and 0.026 for the NIR technique.Comparing the two methods, quantitative models from NIR were considerably more robust as compared to the ones based on FTIR for quantifying defective beans in roasted coffees.With the employment of a new statistical approach, Elastic net, the same authors evaluated the performance of NIR and ATR-FTIR for the discrimination of roasted defective (immature, black, dark sour, and light sour) and non-defective coffees.The developed models based on Elastic net allowed for the correct classification of non-defective and defective coffees with amino acids/proteins, carbohydrates, lipids, chlorogenic acids, and caffeine being the chief chemical descriptors that characterized the samples.Generally, both NIR and ATR-FTIR proved to be reliable methods for the discrimination of roasted defective and non-defective coffees.
In another study, Santos et al. [64] assessed the ability of NIR spectroscopy in discriminating defective beans (sour, immature, and black) from non-defective ones using PCA and PLSR analysis.Although PCA modelling did not allow the separation of all the studied defects, PLS regression linking the NIR spectrum to the mass fraction of non-defective and defective beans showed relative errors of about 5%.This is an indication that with the proposed methodology, it may be possible to do quantification of non-defective quality against other qualities in a batch.Chen et al. [65] studied the capability of VISIBLE-NEAR INFRARED (Vis-NIR) hyperspectral imaging for insect-damaged beans detection.The experimental samples included 570 and 569 defective and healthy beans, respectively.A Vis-NIR push-broom hyperspectral imaging camera was used to attain the beans images, a hyperspectral insect damage detection algorithm (HIDDA) was used for selection of bands, and a convolutional neural network (CNN) and constrained energy minimizationsupport vector machine (CEM-SVM) was used for identification.Classification accuracy of defective beans was 93% and that of healthy beans was 96.4%.It was observed that the wavelength range of 850-950 nm was important in accurately identifying insect damaged and healthy beans.
Application of diffuse reflectance (DR) and attenuated total reflectance (ATR) Fourier transform infrared spectroscopy for discrimination of mature and immature green coffees has also been studied.Agglomerative hierarchical clustering (AHC) and PCA analysis of the normalized ATR and DR spectra clearly separated the samples into two groups, mature and immature.Spectral ranges that significantly contributed to sample clustering were as follows 2074-3100, 1800-1995, 1477-800, and 1725-1762 cm −1 for DR spectra and 3030-3080 and 1533-1535 cm −1 for ATR spectra.Linear discriminant analysis (LDA) classification models based on the absorbance readings after normalization at four wavenumbers (1141, 1741, 2852, and 2921 cm −1 ) were developed.The results provided by the models presented recognition as well as prediction capabilities of 100%, for both ATR and DR data [49].The same research group also employed similar techniques with an aim of characterizing and discriminating between non-defective and defective (sour, immature, black) coffee beans prior to roasting.Similar observations were reported with PCA and AHC of the reflectance spectra showing discrimination between non-defective and defective beans [66].
Another study focused on the application of Fourier transform infrared-photoacoustic spectroscopy (FTIR-PAS) in a quantitative assessment of defects in roasted ground coffee [51].Mixtures of different defects (broken, sour, black, peels, and woods) were mixed with non-defective coffee of robusta and arabica species in precise ratios to form different classes of blends.All samples were medium roasted and ground.With PCA, it was possible to predict the fraction/amount as well as the defects nature in blends.PLS-DA provided information on the similarities between the blends.The developed model classified samples into four classes with 100% accuracy and specificities of higher than 0.9.

Discrimination of Coffee Based on Species, Variety, and Geographical Origin
Coffee identification has become an important aspect of the global coffee market considering the great variability in quality and selling price based on its geographic origin, species, and variety.This has necessitated the development of fast and reliable methods for discrimination of coffee based on the aforementioned parameters to avoid fraudulence.Arabica coffees are usually considered to be of better quality than robusta coffees in terms of flavour and taste.NIR spectroscopy in combination with the direct orthogonal signal correction (DOSC) preprocessing technique was utilized to develop an analytical technique to discriminate between pure robusta and arabica species and their blends.Classification models from raw and corrected NIR spectra were developed using the potential function class-modelling technique.It was concluded that the combination of DOSC, NIR spectroscopy, and the potential functions method could be utilized as an ideal approach to distinguish between pure robusta and arabica coffee varieties and to differentiate between blends and pure varieties of the two species [72].In another study, Zhang et al. [48] sought to establish the best pattern recognition method that could be employed in conjunction with MIR spectroscopy for coffee variety identification.PCA successfully categorized each coffee variety with the first four principal components (PCs), explaining over 99% of the total variance.Twenty-nine best wavenumbers were selected by the loadings of the first four PCs and used in the classification models development utilizing ten different pattern recognition techniques.The extreme learning machine (ELM), radial basis function neural network (RBFNN), back propagation neural network (BPNN), relevance vector machine (RVM), SIMCA, and SVM models were categorized as highly effective techniques, with classification accuracies of more than 95% in the test and training set.In contrast, naive Bayes classifier and PLS-DA, K-nearest neighbors, and random forest were categorized as techniques of low effectiveness and medium effectiveness with classification accuracy below 80% and above 80% in the test and training, respectively.
Marquetti et al. [37] evaluated the potential of NIR spectroscopy and PLS-DA in distinguishing green arabica coffee samples based on their geographical origin and genotypic characteristics.Coffee samples of four genotypes (IA 59, IPR 106, IPR 99, and IPR 105) and from four different cities were studied.The best PLS-DA models were developed from the pretreated spectra (Savitzky-Golay second derivative and multiplicative scatter correction) and were capable of distinguishing coffee samples both genotypically and geographically.However, the specificity and sensitivity for prediction and calibration sets for geographic discrimination were greater than for the genotypic one.The best model for both parameters correctly identified 94.4% of validation samples.Similarly, Giraudo et al. [36] used NIR spectroscopy and multi-variate data analysis to classify coffee cultivated in different countries and continents.Interval PLS-DA and laboratory-independent partial least square-discriminant analysis models were developed by following a ranked method, i.e., by first putting into consideration the continent followed by the country of origin as the rule of discrimination.For the continent and country of origin, the best classification models correctly identified over 98% and 100% of validation samples, respectively.Medina et al. [73] compared the capability of nuclear magnetic resonance ( 1 H-NMR), NIR, and ATR-MIR spectroscopic methods in discriminating coffee based on its species (arabica and robusta) and geographic origin (Colombian coffee versus other origins).Considering species, samples were successfully discriminated as expected by all the techniques using the built PLS classification models.However, for the case of origin determination, ATR-MIR and 1 H-NMR exhibited comparable capability to discriminate Colombian coffee samples, but weak results were observed with the NIR spectroscopy technique.In contrast, Bona et al. [52] evaluated the ability of NIR and FTIR (MIR) spectroscopy in conjunction with support vector machines (SVM) for geographical classification of different arabica coffee genotypes and reported better results for the NIR method.The technique exhibited greater performance as compared to FTIR, with a specificity and sensitivity of 100% in discriminating coffee samples from different regions.Okubo and Kurata [34] proposed an approach for geographical classification of different species of coffee (arabica and robusta) exploiting NIR spectroscopy and SIMCA as the classification methods.Models were developed and their distance from SIMCA evaluated to establish if they were different.Although some samples were partially classified into several categories, a good classification result was obtained with over 73% correct classification rate.Model distance values from SIMCA were relatively similar among samples, which could explain the misclassification.
Wongsaipun et al. [5] applied an artificial neural network technique (a self-organizing map (SOM)) with the aim of tracing geographical origin of arabica coffee beans grown in three provinces in northern Thailand using NIR spectroscopy.The three provinces included Chiang Mai, Lampang, and Mae Hong Son.The self-organizing map discrimination index (SOMDI) was applied to identify important parameters of the spectroscopic data.SOM analysis of the NIR spectra separated the samples from the three different regions.Based on the SOMDI results, the coffee samples from Chiang Mai could be well discriminated using the following NIR spectral ranges: 2260-2498, 1896-2180, 1254-1326, and 880-1182 nm.Luna et al. [74] evaluated different chemometric methods for the classification of robusta coffee cultivars using FT-NIR spectroscopy.Five coffee cultivars were studied.SOM, SIMCA, PLS-DA, and SVM were explored for the cultivars classification, whereas PCA was used for identification of the clusters.PCA analysis of the pre-processed spectra discriminated samples into different groups, which were correlated to the presence of water, caffeine, lipids, sugars, chlorogenic acids, carbohydrates, and proteins.SOM presented the best results, providing 100% correct identification of the validation samples, whereas SIMCA, PLS-DA, SVM using 4 PCs and SVM using 3 PCs provided 99.6, 82.9, 99.6, and 82.9%, respectively.
A study by Link et al. [50] distinguished coffee samples of different geographical origins and genetic make-up using FTIR spectroscopy in tandem with radial-basis function networks (RBFs).Optimization of some parameters was done using sequential simplex in order to select the best neural network.The optimized RBFs correctly classified the samples both genotypically (94.44%) and geographically (100%).This chemometric method exhibited superior performance compared to multilayer perceptron (MLP) and SIMCA developed for classification of coffee as reported by the same research group.A feasibility study using ATR-FTIR spectroscopy and chemometric analysis with the aim of discriminating coffees from different topographical origins as well as of different roast degrees was conducted by Wang et al. [75].Coffee samples from different origins (Kenya, Colombia, Ethiopia, and Costa Rica) were roasted to two roast degrees (medium and dark) and extracted using six organic solvents (hexane, ethyl acetate, dichloromethane, acetic acid, ethanol, and acetone) and a mixture of an equal volume of the solvent and water.Extraction using water and the solvent resulted in clean extracts, which provided good spectral information important for sample discrimination.Developed classification models based on SIMCA and of dark roasted coffee correctly classified samples (100%) based on their origin when ethyl acetate solvent was used.
The potential of Raman spectroscopy in coffee species and cultivar discrimination has been investigated.A study by Rubayiza and Meurens [53] discriminated arabica, liberica, and robusta coffees based on their lipid fraction using Fourier transform (FT) Raman spectroscopy.Raman measurements of the lipid fraction from roasted and green coffee beans was taken using a laser emissions at a wavelength of 1064 nm and resolution of 4 cm −1 .PCA analysis of the spectra revealed clustering of the samples based on the species with the first principal component (PC1), explaining 93% of the spectral variance.This was associated with the concentration of kahweol, which manifests at two specific scattering bands at 1478 and 1567 cm −1 .Keidel et al. [54] reported similar findings after analyzing green coffee beans belonging to robusta and arabica species from different geographical origins (South America, Asia, and Africa).Measurements were taken using a 1064 nm laser and a spectral resolution of 4 cm −1 .It was possible to classify coffee samples based on the specific kahweol content.In a similar approach, Wermelinger et al. [55] evaluated Raman spectroscopy for quantification of robusta coffee fraction in blends by analyzing the lipid fraction.Pure robusta and arabica and their mixtures with contents of robusta of 5, 10, 25, 33, 50, and 75 wt.%were used.Lipids were extracted from roasted coffee beans and their Raman spectra collected at 532 nm.It was possible to determine the robusta content in studied mixtures by examining the intensity ratio between the Raman peaks of fatty acids at 1665 or 1460 cm −1 and the kahweol peak at 1570 cm −1 .
Luna et al. [11] compared different chemometric approaches that could be used to classify robusta coffee cultivars using Raman spectroscopy.Raman spectra of five coffee cultivars were collected at 785 nm with a laser excitation power of 100 mW.The following classification methods were studied: quadratic discriminant analysis (QDA), mixture discriminant analysis (MDA), PLS-DA with Bayesian inference, SIMCA, regularized discriminant analysis (RDA), and LDA.Two preprocessing methods, multiplicative scatter correction (MSC) and mean centering (MC), were also compared, with MSC yielding more accurate results for all the classification methods.Using multiplicative scatter correction, RDA, MDA, QDA, SIMCA, and PLS-DA methods correctly classified 100% of the samples, whereas LDA correctly classified 98.7% of the samples.Conversely, using MC, correct classification of the samples was 70.7%, 62.7%, 62.7%, 62.7%, 61.3%, and 97.3% for MDA, LDA, RDA, QDA, PLS-DA, and SIMCA, respectively.A study by Figueiredo et al. [56] differentiated green coffee beans of different genotypes using Raman spectroscopy in conjunction with chemometric methods (PLS-DA and PCA).Acquisition of the Raman spectra were made using an excitation laser with λ = 1064 nm at a spectral resolution of 4 cm −1 .It was possible to distinguish the coffee genotypes based on spectral bands that are typical of kahweol (1567 and 1479 cm −1 ) and fatty acids (1442 and 1302 cm −1 ) using PCA.The developed PLS-DA models demonstrated a good association between the training and test set of the studied samples indicating its suitability in performing the task.
El-Abassy et al. [14] proposed a method capable of discriminating between robusta and arabica coffee species based on their lipid and chlorogenic acid (CGA) contents using Raman spectroscopy.Measurements of the samples were made using a 514.5 nm laser with an excitation power of 10 mW.PCA analysis of the whole Raman spectra presented a clear discrimination between robusta and arabica coffee with 93% of the total spectral variation.The most important spectral range that contributed to this observation was 1000-1750 cm −1 , an area dominated with Raman bands of CGA.Discrimination of the two coffee species was also observed when PCA analysis was restricted to the spectral range between 2700-3050 cm −1 , which is dominated by bands of lipids.In this respect, the first two PCs constituted 85% of explained variation in the spectra.Studies on fluorescence spectroscopy as a valuable technique to discriminate coffee based on their geographical origin and phenotypic characteristics are also present in the literature.Botelho et al. [13] developed a supervised classification technique using chemometrics tools with the ability to discriminate coffee of different origins (four) using fluorescence spectroscopy.Discrimination models were constructed by employing N-way partial least squares (NPLS-DA), parallel factor analysis (PARAFAC), and unfolded partial least squares (UPLS-DA) methods.Among the three methods studied, UPLS-DA models showed the best results, with f-scores of coffees from two regions above 0.8, for both test and training sets.In another study, Dankowska et al. [76] compared the ability of UV-Vis spectroscopies, fluorescence, and the low-and mid-level data fusion of both spectroscopies to quantify concentrations of roasted robusta and arabica in coffee blends.Classification models were developed using multiple linear regression (MLR) and LDA.The best prediction capability of the MLR models was achieved with application of the mid-level data fusion model of fluorescence and UV-Vis intensities at 60 nm wavelength interval with RMSEV of 7.9% and RMSEC of 3.6%.Better discrimination ability was obtained with data fusion with the highest classification accuracy (above 96.0%) being achieved for the low-level LDA model with fluorescence intensities at a 60 nm wavelength interval.

Prediction of Coffee Chemical Composition
Coffees chemical composition affects the quality of the beverage, making its analysis a necessity for coffee processors.The use of known conventional methods to determine these compounds in coffee is laborious and expensive due to sample preparation.Spectroscopic analysis of ground or whole coffee beans is a potential alternative to overcome these limitations.Studies on the application of NIR spectroscopy for the quantification of the chemical components of green coffee beans have been done (Table 2).Tugnolo et al. [30] compared the potential of NIR spectroscopy and thermogravimetric analysis in measuring moisture content of roasted beans and their grounds.The developed PLS models from NIR data demonstrated a good relationship between estimated values and reference concentrations with errors in prediction below 0.15% and coefficients of determination higher than 0.95.To compare the measurements of the two techniques, the passing-Bablok regression method was performed.Even though there were significant differences, the proposed residual dispersion index (RDI%) showed higher predictive accuracy of NIR-based predictions (RDI% = 5.93) with regard to thermogravimetric analysis measurements (RDI% = 9.68).Escobar et al. [77] developed a rapid method for moisture content prediction in unroasted coffee beans using NIR spectroscopy.Among the samples analyzed (two accessions of liberica coffee and four varieties of arabica coffee), the calibration model for typica coffee (arabica variety) exhibited the best performance in predicting the moisture content of studied samples as evidenced by its high cross-validated coefficient of determination (0.8556) and cross-validated root mean square error of prediction (0.6355).
Kyaw et al. [78] reported promising results in relation to the moisture content prediction of ground unroasted coffee beans using NIR spectroscopy.Good accuracy for the prediction of moisture content was obtained from the spectral data pretreated with second derivative and Kubelka-Munk (K/S) data transformation (correlation coefficient (r) = 0.87 and accuracy = 99%).Zhu et al. [35] proposed a rapid method for lipid and protein content determination in unroasted coffee beans using NIR spectroscopy and chemometrics.Orthogonal signal correction (OSC) and different spectral pretreatment methods (standard normal variate (SNV), MSC, 1st or 2nd derivative, and Savitzky-Golay smoothing) were compared during the process of building a PLS regression model.Model quality was enhanced by the MSC, SNV, and OSC pretreatment methods.On the contrary, the 1st and 2nd derivative reduced the model quality.Important variable selection significantly enhanced the PLS models, with OSC-PLS models standing out as the most robust for prediction of lipids and proteins with an RMSEP of less than 0.106 and a coefficient of determination for prediction of more than 0.982.In a similar approach, Macedo et al. [79] proposed a methodology based on the use of NIR spectroscopy associated with PLS regression to estimate some chemical properties (soluble solids, total and reducing sugars, and moisture content) in intact green coffee samples.The highest R 2 obtained for the samples in the validation set included 0.781, 0.810, 0.694, and 0.516 for reducing sugars, moisture content, total sugar, and soluble solids, respectively.
The literature also contains investigations on the feasibility of FTIR/MIR spectroscopy in the prediction of chemical components in coffee.Liang et al. [80] used ATR-FTIR spectroscopy together with PLS regression with the aim of developing a rapid method to determine antioxidant activity and CGA isomer composition in roasted and unroasted coffee beans.FTIR spectral data were processed (i.e., baseline corrected and mean centered) prior to model development.For the case of CGA isomer concentrations, the models presented a good relationship between HPLC reference values and estimated values with R cv of more than 0.92 for all the isomers.Good models were also constructed with the ability to predict antioxidant capacities in coffee beans.In another study, Weldegebreal et al. [81] successfully developed rapid methods for direct caffeine content determination in aqueous solution of unroasted coffee beans using fluorescence, NIR, and ATR-FTIR spectroscopy.Caffeine content of the samples was determined using the three types of equipment and the results were compared to the standard method (UV-Vis spectroscopy) for validation.The results for the newly developed techniques were similar to the results achieved by the standard method.In a similar approach, Yisak et al. [82] developed a reliable and sensitive technique for simultaneous trigonelline and caffeine determination in the aqueous extract of unroasted coffee beans with relative standard deviations below 4%.Evaluation of the developed analytical method was done by spiking green coffee beans with trigonelline and caffeine standards.Average recoveries of 99 ± 2% for both the alkaloids was reported.

Future Perspectives and Conclusions
The information presented in this review clearly confirms the potential of spectroscopic techniques in coffee assessment.However, most of these studies are purely academic.Mendes and Duarte [23] highlighted issues that must be addressed for the developed spectroscopic analytical methods to be applied as official methods of analysis in the coffee industry.They include guidelines on the validation of the methods, use of samples from credible sources (known origin) for method development, use of a satisfactory number of representative samples in order to cover all sample variances for development of a robust model, and choosing the most appropriate chemometric technique for a particular objective.The findings of this review demonstrate the potential of spectroscopic methods to evaluate coffee for different parameters.However, there are other aspects that need further investigation, including prediction of coffees processed using different methods, differentiation of organic from inorganic coffee, and monitoring of coffee quality during storage, of which limited scientific information is available.Additionally, Raman and fluorescence spectroscopy received very little attention, with most applications being focused on NIR spectroscopy.Available information in the field of coffee spectroscopy primarily focused on IR spectroscopy.To broaden this scope, this review includes recent studies based on IR, Raman, and fluorescence spectroscopy.
In conclusion, coffee composition and quality assessment are an important practice in the global coffee trade market.Conventional methods that are most frequently used to perform these tasks are often time-consuming, complex, expensive, and require several steps in sample preparation.Spectroscopic techniques (NIR, MIR/FTIR, Raman, and fluorescence), as evidenced by the reviewed literature, could be used to evaluate coffee for different parameters along the production process and in the near future replace the conventional methods.The vast majority of applications are based on NIR spectroscopy; only a few examples with Raman and fluorescence spectroscopy are available in the literature.In the future, applications using these methods will likely become more common, as valuable information can also be obtained from these spectra.The major advantages of these techniques are that they are rapid, require no or simple sample preparation, are nondestructive, and can be easily integrated into processes.Nevertheless, it is important for laboratory studies to upscale to industrial applications if these advantages are to be realized.

Figure 1 .
Figure 1.Simplified potential curves to illustrate different spectroscopic methods discussed here; just vibrational states are presented: (a) fluorescence, (b) NIR, (c) Raman.The red arrow indicates the energy involved in the corresponding radiation.

Table 1 .
Reported absorption bands of coffees chemical components in the IR and Raman region.

Table 2 .
Summary of reported studies on potential applications of NIR, FTIR/MIR, fluorescence and Raman spectroscopy in the coffee industry.