A Pre-Trial Study to Identify Species of Origin in Halloumi Cheese Utilising Chemometrics with Near-Infrared and Hyperspectral Imaging Technologies

: Halloumi cheese has recently gained a Protected Designation of Origin (PDO) indicator, which is related to the place (Cyprus) in which halloumi cheese is made. The PDO label is linked with several requirements, e.g., milk species, quantities, etc.; thus, it is important to study this product regarding authenticity. The utility of using two spectroscopic techniques, hyperspectral imaging (HSI) (400–1000 nm) and conventional near-infrared spectroscopy (NIR) (800–2500 nm) were assessed for the discrimination of 17 Cypriot halloumi cheese types, which could be categorized as of cow or goat–sheep origin. The aim of this study was to obtain spectral information for halloumi cheese using other promising infrared and imaging spectroscopic techniques as a comparison to a previously acquired mid-infrared (MIR) spectroscopy dataset. NIR and HSI are both fast and easy techniques in application, both of which provide significant information in food analysis. Chemometric analysis was crucial for interpreting the spectroscopic data by applying the unsupervised methods: principal component analysis (PCA) and hierarchical cluster analysis (HCA). The HSI model was found to be based intuitively on the appearance of cheese samples after freeze-drying (e.g., color; yellow/white, and texture; oily/dry), while the NIR grouping of samples was determined to be based on composition, mainly fat, protein and lactose content of the cheese samples. The HSI model returned distinct clusters of the two halloumi cheese types, cow and goat–sheep origin, with one outlier (16/17 accuracy; 94%), while the NIR model proved less accurate (13/17; 76%).


Introduction
There is emerging interest in products that are made using goat milk, as this type of milk has the potential to improve digestibility and enhance nutritional characteristics or value whilst reducing symptoms from exposure to allergens commonly found in cow milk [1,2].The total protein content of milk consists of true proteins (e.g., casein and albumin) and non-protein nitrogenous compounds.Possible differences in the content or physical structure of true proteins and differences in the ratio of true protein and nonprotein nitrogen in milk are usually due to samples that have been collected from different animals [3].Amino acid composition, the secondary structure of proteins, alongside fat globules, are the main factors influencing the differences between milk of goat and cow origin [4][5][6].The milk used to produce halloumi is usually a combination of goat-sheep or cow-goat-sheep milk.Dairy products of goat origin are more expensive than those of cow origin due to the above characteristics as well as the lower production of goat-origin milk due to lower yield.Therefore, the economically motivated adulteration (EMA) of goatorigin products with water, whey, urea, cow-origin milk, starch [7][8][9], and milk powder [10] has been reported.
To assess quality or adulteration issues, chemical analyses are often conducted by using laboratory-based techniques such as chromatography (e.g., gas chromatography, liquid chromatography) and mass spectrometry (MS) to precisely identify and quantify specific components or contaminants in a sample.Physical attributes of products, such as texture, viscosity, density, and color, are measured using techniques such as rheology, viscometry, densitometry, and colorimetry.In terms of microbiological analysis, to detect the presence of harmful microorganisms or assess the microbial quality of products, industries often use techniques such as agar plate culturing and polymerase chain reaction (PCR).In addition, techniques such as adulterant-specific chemical tests, microscopy, and elemental analysis have been used to identify the presence of contaminants, diluents, or substitute materials in products.They are valuable tools in food analysis due to their versatility, and high separation capabilities, and can augment the ability of hyphenated MS techniques to detect and quantify a wide range of analytes.However, these techniques, especially MS, have not always performed optimally resulting from sample complexity (containing a wide range of compounds, including fats, proteins, carbohydrates, vitamins, minerals, and various additives), co-elution of compounds, matrix interferences, and difficulty in achieving baseline separation.Furthermore, the especially complex matrix of food samples can impact chromatographic separations by causing ion suppression or enhancement, interfering with analyte detection, or affecting column performance.Thus, extensive sample preparation steps are often required, such as extraction, purification, and concentration, before MS analysis.Sample preparation can be time-consuming and labor-intensive, and may introduce potential sources of error or loss of analytes.Matrix effects can lead to decreased sensitivity, reduced chromatographic resolution, or inaccurate quantification.In addition, some food components, such as vitamins, flavors, or volatile compounds, can be sensitive to heat, light, or oxidative degradation [11,12].
Chromatographic techniques that involve high temperatures or prolonged analysis times may result in analyte degradation, leading to inaccurate quantification or loss of valuable information.Moreover, food samples encompass a wide variety of analytes, including polar and nonpolar compounds with different chemical properties and chromatographic techniques may not offer universal selectivity, requiring the use of different columns or methods to analyze different classes of compounds and this is time-consuming and expensive.Furthermore, chromatographic methods used in food analysis require rigorous validation to ensure accuracy, precision, selectivity, and robustness.Method validation can be time-consuming, and resource-intensive, and necessitates the availability of appropriate reference materials and standards [12][13][14].
A challenge of recent times is in the development of non-destructive, low-cost, rapid, and automated analytical techniques to determine the various adulterations in cheese products that can cause discrepancies in the products compared to their label [5,7,9,10].Automations are now available due to new technological achievements, particularly the use of chemometrics to facilitate the interpretation of analytical data [5,9,[15][16][17].Currently, the fingerprinting and profiling methods including spectroscopic techniques and e-sensors, in combination with chemometrics, are deemed more suitable for determining authenticity, with marker-based methods such as MS techniques reserved for more precise adulteration detection [13].However, the advancement of spectroscopic techniques, their instrumentation, and algorithmic means of analyses continues apace and offers the portable capability and, in some circumstances, even a viable lab-based alternative to MS. Vibrational spectroscopy techniques, particularly near-infrared spectroscopy (NIR), a popular technique for many years in food analysis, and combination with chemometric tools have given rise to many recent research studies [5,[18][19][20][21][22][23].
Several studies that have been performed to assess quality issues or adulteration in cheese using NIR technology are presented in Table 1.Ripening with a rind content of a maximum 18% in Parmigiano Reggiano cheese is well classified where the subregion 4243-4566 cm −1 was found to be important for both research groups: Alinovi et al. [20] and Cevoli et al. [19].Karoui et al. [24] found that the region of 8000-10000 cm −1 was important when classifying Emmental cheese samples regarding geographical origin, which is commonly based on chemical properties, such as fat and total nitrogen.Ayvaz et al. [18] studied chemical properties such as protein, fat, salt, dry matter, moisture, and ash in Ezine cheese by applying NIR, and they found the spectrum from 10000 to 4000 cm −1 to be critical for accurate classification.It has also been reported previously that the determination of cow milk adulteration in goat cheese exploits the significant peaks at 9043, 9286, 9353, and 9632 cm −1 [5].However, to determine other adulterants, unrelated to milk greater subregions should be used, such as water, urea, and bovine whey as in the findings of da Paixão Teixeira et al. [9], alongside cellulose, silicon dioxide, wheat flour, wheat semolina, and sawdust as documented by Visconti et al. [22].Similarly, by using a pocket-sized NIR system, Manuelian et al. [21] studied chemical parameters such as total nitrogen, soluble nitrogen, ripening index, major minerals content (Ca, K, Mg, Na, and P), and fatty acid profile, proving that the performance was comparable to benchtop devices and confirming the potential for a breakthrough of NIR technology in the dairy industry.Commercial grated cheeses Detection of adulterants, i.e., cellulose, silicon dioxide, wheat flour, wheat semolina, and sawdust in grated cheese 4000-5000 and 6000-7000 cm −1 / 2500-2000, 1667-1429 nm [22] Soft and semi-hard cheeses (Caciotta, Mozzarella, Mozzarella Pizza, Primo sale, Ricotta siero, and Scarmorza) Total nitrogen, soluble nitrogen, ripening index, major minerals content (Ca, K, Mg, Na, and P), and fatty acid profile - [21] Several studies which have been performed using HSI technology are highlighted in Table 2. Calvini et al. [25] focused on the quantification of rind amount in grated Parmigiano Reggiano cheese by taking into consideration spectral information at wavelengths of 1195-1225, 1330-1340, and 1400 nm.In a similar study, Calvini et al. [26], quantified the rind amount (8, 18, 28%), and investigated the influence of fat content and grater type in grated Parmigiano Reggiano cheese.Due to the lipids' presence from 1320 to 1390 nm and at 1070 nm, the highest percentage of rind, 28%, was determined.The lowest percentage of rind, 8%, was detectable at 1420-1520 nm due to protein content and bound water and proteins, respectively.The study by Darnay et al. [27] was related to ripening and used wavelengths for the detection of enzyme treatment (1387 nm) and for fat content (1190 nm, 1234 nm).Similarly, Vásquez et al. [17] studied a Swiss-type cheese regarding ripening based on hardness, and the region 650-1000 nm was studied.Malegori et al. [28] used various Italian-type cheese samples to study ripening through proteolytic maturation, lipolytic reactions, and surface dehydration by observing bands at 1690 nm (due to the secondary structure of proteins), 2140 nm (due to the ester bond in lipids) and 1360-1500 nm (due to O-H stretching of water), respectively.In terms of adulterants, Barreto et al. [7] used HSI to identify starch in cheese where the most significant wavelengths for this investigation were at 584 nm and between 976 and 1000 nm.The subregion between 976 and 1000 nm is correlated to water content in cheese since starch has a high-water absorption capacity.Fresh cheese Adulterant: starch based on water content 17123, 10276-10000 cm −1 /584, 976-1000 nm [7] This study aimed to evaluate two spectroscopic techniques, i.e., HSI and NIR.In a previous study [16], the same sample set was also measured using MIR spectroscopy (4000-400 cm −1 ).Thus, the overall aim of this study was to compare the results from all three spectroscopic techniques (e.g., MIR, NIR and HSI) which have been used to analyze the same halloumi cheese sample set.To the best of our knowledge, there is no other publication that characterizes halloumi cheese using these spectroscopy-based measurements.The cheeses in this study were grated, as also studied previously by Calvini et al. [26], since grated type cheese may be easily adulterated with lower-value cheese.

Sample Collection
In total, 17 commercial halloumi cheese samples purchased across Cyprus were used in this study.Their physical appearance is presented in Table 3.All samples contain milk from a known origin (i.e., cow and goat-sheep) due to a study conducted previously by Tarapoulouzi et al. [16].Nine of the samples were from goat or sheep, and the remaining eight were from cows.The samples were chosen because they were produced during the same season of the year to avoid any discrepancies or possible influences from environmental conditions on the production of the milk.For the same reason, the samples had the same geographical origin and aging time.

Preparation of Samples
The samples were freeze-dried, as described by Tarapoulouzi et al. [16].A Christ Alpha 1-2 freeze drier was used, with the condenser temperature at 233 K; the final pressure in the drying chamber was 3 Pa.The samples were all frozen using liquid nitrogen.Five grams of each halloumi cheese sample was treated for 5 hr.After freeze-drying, the samples were all grated with a drum grater (Ghizzoni mod.GS electric, Retsch, Haan, Germany).One sample was grated using a knife mill (Grindomix GM 200, Retsch, Haan, Germany).The reason was to find the best way to treat the samples, and treatment with a drum grater was more efficient; eventually, the use of a knife mill was abandoned.After freeze-drying, the appearance of the samples was not homogeneous, regarding colour and texture; this was due to different milk species.Ground samples were kept in sealed plastic containers at room temperature.

NIR Analysis
NIR measurements were performed using an iS50 benchtop NIR (Thermo Fisher Scientific, Dublin, Ireland) operated in the spectral range of 833-2500 nm (12,000-4000 cm −1 ) with 32 scans and a resolution of 8 cm −1 in triplicate.The samples were poured into the sample cup (diameter: 3.2 cm, height: 1.5 cm) spinner on the integrating sphere module of the instrument prior to analysis.Background measurements were taken before each sample.

HSI Analysis
Measurements were performed in triplicate using a HinaLea 4250 VNIR HSI camera (HinaLea Imaging, Emeryville, CA, USA) with a spectral resolution of 4 nm (full width half maximum) and 2.3 MegaPixel color sensor with 300 spectral channels (wavelengths) analyzed via a linear variable filter across a 400-1000 nm range.The HSI camera was mounted on a stand with two adjacent halogen light bulbs.Prior to analysis, on HinaLea TruScope software, v. 1.1.17,background (dark) and reference (white) spectra were acquired.The camera gap, exposure, gain, and focus were all optimized prior to measurement.Approximately 10 g of each sample was placed onto a neutral background for measurement.Point spectra were studied before averaged spectral rectangular regions were delineated and output as csv files for chemometric analysis.

Chemometric Analysis
The development of chemometric models took place using SIMCA 17 (Sartorius, Umeå, Sweden).Unit variance (UV) scaling was used throughout.Several pre-processing strategies such as multiplicative scatter correction (MSC), standard normal variate (SNV), first-order derivatives (1DER), and Savitzky-Golay (SG) were first evaluated.Subsequently, the combination of SNV and SG were used based on internal cross-validation results conducted in SIMCA.Following this, the unsupervised technique principal component analysis (PCA) was used to obtain a general overview of the samples, as well as to obtain the PCA loadings, which highlight the most important variables.In addition, another unsupervised technique, hierarchical cluster analysis (HCA), was applied for clustering analysis.

Spectral Analysis of Halloumi Cheese Using HSI Spectroscopy
The particle size and distribution of grated halloumi cheese were influenced by the texture of each sample, as shown in Figure 1A.Samples with an oilier texture produced smaller particles, whilst a dry texture produced more heterogeneous particles; thus, different particle sizes were obtained after grating.In addition, the appearance, colour, and texture of the samples are related to the milk species' origin; thus, goat or goat-sheep-origin samples have a yellow-oily appearance compared to cow-origin samples which are whitedry.The region 400-1000 nm has been covered during HSI measurements, as presented in Figure 1C.
Vásquez et al. [17] noted that publications that have used HSI in cheese are scarce; however, the studies that were found are presented in Table 2.The paper by Vásquez et al. [17] is the closest to our study, as it took measurements within the range of 400-1000 nm.Other available studies have measured above 1000 nm either to detect an adulterant or to quantify ripening in cheese.

Spectral Analysis of Halloumi Cheese Using NIR Spectroscopy
The overtone and combination bands in the NIR spectra occur because of the C-H, N-H, and O-H bonds.Weak water absorption (due to freeze-drying application) takes place at around 6800 and 5200 cm −1 due to the first overtone stretching of the free O-H group and OH combination bands from the water, respectively.Lactose is present at 4740 cm −1 [3,29].Goat milk contains less lactose than cow milk [30], as shown in Figure 2A.The presence of CH 2 group gives two bands in the 4600-4200 cm −1 region, and in lipids and/or protein at 4250 and 4600 cm −1 , respectively.The absorptions which are present at 4260 and 4330 cm −1 are derived due to the combination bands of C-H and C-O stretching vibrations in fats, and at 4980 cm −1 due to the amides of proteins.The first overtone of the C-H stretching vibration of fats gives the absorptions at around 5670 and 5780 cm −1 as well and the second overtone of the C-H stretching vibration of fats is responsible for the absorptions between 8700 and 8100 cm −1 , as shown in Figure 2B.These findings agree with Karoui et al. [18] and Ayvaz et al. [24].Goat milk has more medium-chain fatty acids than cow milk, and, as Silanikove et al. [30] stated, they provide the "goaty" odour in cheese products.In addition, it is very important that the differences in the spectral region between 8700 and 7350 cm −1 are identified as being related to the N-H second overtone, and this is directly linked with the fact that the protein contents of cow and goat milk are different [5].
cm −1 [3,29].Goat milk contains less lactose than cow milk [30], as shown in Figure 2A.The presence of CH2 group gives two bands in the 4600-4200 cm −1 region, and in lipids and/or protein at 4250 and 4600 cm −1 , respectively.The absorptions which are present at 4260 and 4330 cm −1 are derived due to the combination bands of C-H and C-O stretching vibrations in fats, and at 4980 cm −1 due to the amides of proteins.The first overtone of the C-H stretching vibration of fats gives the absorptions at around 5670 and 5780 cm −1 as well and the second overtone of the C-H stretching vibration of fats is responsible for the absorptions between 8700 and 8100 cm −1 , as shown in Figure 2B.These findings agree with Karoui et al. [18] and Ayvaz et al. [24].Goat milk has more medium-chain fatty acids than cow milk, and, as Silanikove et al. [30] stated, they provide the "goaty" odour in cheese products.In addition, it is very important that the differences in the spectral region between 8700 and 7350 cm −1 are identified as being related to the N-H second overtone, and this is directly linked with the fact that the protein contents of cow and goat milk are different [5]. Figure 3 shows that the spectra of samples 2 and 6 and 4 and 10 are similar and the four spectral subregions showing the important differences are related to protein, fat, and lactose.Additionally, it is difficult to determine the similarity by eye, hence the reason we need chemometrics.Spectral features appeared between 8620 and 4250 cm −1 due to C-H bond absorption of proteins and/or lipids; at 4740 cm −1 due to O-H bond in lactose; and at 8400-4385 cm −1 due to N-H absorption of proteins.As the NIR technique is well-known for focusing on protein, fat, and lactose composition in the dairy sector [3,5,18], small changes in the proportions of milk species (i.e., ovine-to-caprine ratio or bovine-to-caprine-ovine ratio) may affect the spectral subregions of these nutrients as well as cheese composition, as stated in Tarapoulouzi et al. [16].
bond absorption of proteins and/or lipids; at 4740 cm −1 due to O-H bond in lactose; and a 8400-4385 cm −1 due to N-H absorption of proteins.As the NIR technique is well-know for focusing on protein, fat, and lactose composition in the dairy sector [3,5,18], sma changes in the proportions of milk species (i.e., ovine-to-caprine ratio or bovine-to caprine-ovine ratio) may affect the spectral subregions of these nutrients as well as chees composition, as stated in Tarapoulouzi et al. [16].Moreover, the observed differences in the four spectral regions shown in Figure could take place due to the effect of milk proteins on milk fat determination, as protein are included in the native fat globule membrane that surrounds the lipid droplets.Diffus reflection occurred because of the uneven cheese surface.In addition, individual differ ences in the milk proteins of each animal have affected similar studies due to the ratio o true protein and nonprotein nitrogen in the milk.This is observed in differences occurrin in the spectral region 5130-4760 cm −1 , which is influenced by nitrogenous components i the milk [3].

Chemometric Modelling
Data analysis of HSI measurements revealed that samples 4, 5, 8, 7, 12, 10, 3, and 1 are grouped together and in the left-hand side group (white appearance and dry texture by the chemometric method HCA.Similarly, samples 6, 1, 2, 15, 16, 17, 13, 9, and 14 fall i the right-hand side group (yellow appearance and oily texture), as seen in Figure 4B.Moreover, the observed differences in the four spectral regions shown in Figure 3 could take place due to the effect of milk proteins on milk fat determination, as proteins are included in the native fat globule membrane that surrounds the lipid droplets.Diffuse reflection occurred because of the uneven cheese surface.In addition, individual differences in the milk proteins of each animal have affected similar studies due to the ratio of true protein and nonprotein nitrogen in the milk.This is observed in differences occurring in the spectral region 5130-4760 cm −1 , which is influenced by nitrogenous components in the milk [3].

Chemometric Modelling
Data analysis of HSI measurements revealed that samples 4, 5, 8, 7, 12, 10, 3, and 11 are grouped together and in the left-hand side group (white appearance and dry texture) by the chemometric method HCA.Similarly, samples 6, 1, 2, 15, 16, 17, 13, 9, and 14 fall in the right-hand side group (yellow appearance and oily texture), as seen in Figure 4B.PCA was used to obtain a general overview of the samples.The HCA chemome method revealed that samples 7, 6, 12, 3, 5, 2, 8, and 11 are grouped together and in left-hand side group (cow origin) by the chemometric method HCA, and this result is same as results found using PCA clustering.Similarly, samples 15, 4, 13, 9, 1, 17, 16 and 14 fall in the right-hand side group (goat-sheep origin), as seen in Figure 4A.A t of 4 out of 17 samples have the "wrong grouping": 2 from cow origin (nos. 2 and 6) an from goat-sheep origin (nos.4 and 10), as indicated in Figure 4B, thus resulting in a c sification accuracy of 13/17 (76%).Sample 6 had the smallest dimensions of particles PCA was used to obtain a general overview of the samples.The HCA chemometric method revealed that samples 7, 6, 12, 3, 5, 2, 8, and 11 are grouped together and in the left-hand side group (cow origin) by the chemometric method HCA, and this result is the same as results found using PCA clustering.Similarly, samples 15, 4, 13, 9, 1, 17, 16, 10, and 14 fall in the right-hand side group (goat-sheep origin), as seen in Figure 4A.A total of 4 out of 17 samples have the "wrong grouping": 2 from cow origin (nos. 2 and 6) and 2 from goat-sheep origin (nos.4 and 10), as indicated in Figure 4B, thus resulting in a classification accuracy of 13/17 (76%).Sample 6 had the smallest dimensions of particles due to grating with a knife mill.However, sample 6 was not an outlier in the NIR model, and the exclusion of sample 6 from the model did not improve the grouping of the other three samples (nos.2, 4, and 10).More investigation is needed regarding the influence of particle size in NIR analysis.To investigate the "wrong grouping" of the four samples, more research must be conducted and maybe another sample preparation method should be implemented, for instance, Raman spectroscopy with an emphasis on fats.

Conclusions
The samples used in this study come from an authentic database built using the spectral information obtained by MIR in Tarapoulouzi et al. [16].The main aim of this study was to compare MIR and HSI data using these two techniques, based largely on correctly identifying the milk species' origin in 16 out of 17 of the samples by using the HSI technique.In terms of comparing MIR and NIR data, 4 samples out of 17 analyzed by NIR were incorrectly grouped, which was unexpected.This challenge requires further studies on halloumi cheese using NIR spectroscopy to increase the size of the sample set and build more robust models, as the number of samples tested was small.The appearance of the samples after freeze-drying was associated with the milk species; thus, color and texture influenced the groupings within the model.The appearance of the samples was found to be an important parameter that drives the clustering of the samples when measured using HSI.Calvini et al. [26] concluded that different grater types gave different shapes and dimensions of particles in their study with HSI measurements, and the dimension of particles in grated cheese samples may be an important factor for the HSI model, as sample 6 appeared to be an outlier.This pre-trial study to identify species of origin in halloumi cheese utilising chemometrics with NIR and HSI technologies seems to be an important addition in halloumi cheese research.Further experimentation is required to elucidate the potential impact of cheese particle size on spectroscopic response.On the other hand, the NIR model gave the wrong clustering position regarding sample 6, alongside three other incorrectly classified cheese samples.

Figure 1 .
Figure 1.HSI measurements: representative samples of goat-sheep origin; sample no. 15 (left) texture, yellow color, and the representative sample of cow origin; sample no. 5 (right)-dry te white color.(A) Samples under white light, (B) samples under dark light and the regions of in (ROIs) we selected (10 individual points) to obtain the corresponding spectral information in HSI point spectra are raw data and averaged internally , and (D) all HSI spectra.

Figure 1 .
Figure 1.HSI measurements: representative samples of goat-sheep origin; sample no. 15 (left)-oily texture, yellow color, and the representative sample of cow origin; sample no. 5 (right)-dry texture, white color.(A) Samples under white light, (B) samples under dark light and the regions of interest (ROIs) we selected (10 individual points) to obtain the corresponding spectral information in (C); (C) HSI point spectra are raw data and averaged internally, and (D) all HSI spectra.

Figure 4 .
Figure 4. (A) PCA score scatter plot where right-hand side group (goat-sheep origin) and left-hand side group (cow origin), and (B) HCA dendrogram, for the HSI data.

Table 1 .
Overview of near-infrared (NIR) spectroscopy techniques for the determination of quality and adulteration in cheese.

Table 2 .
Overview of hyperspectral imaging (HSI) techniques for the determination of quality and adulteration in cheese.

Table 3 .
Physical appearance and nutritional content of the halloumi cheese samples.
* Yellow colour samples usually have an oily appearance, while white colour samples are always dry.