Next Article in Journal
Functional Reduced-Fat Mozzarella Cheese from “Essential Oil-Fed” Milk and Inulin Fortification
Next Article in Special Issue
High-Accuracy Prediction of Chunmee Tea Grade via DeepSpectra Model and Near-Infrared Spectroscopy
Previous Article in Journal
Valorization and Functional Enhancement of Mature Assam Tea Leaves Through Indigenous Filamentous Fungi-Based Fermentation for Functional Drink Development
Previous Article in Special Issue
Nondestructive Detection of Soluble Solids Content in Apples Based on Multi-Attention Convolutional Neural Network and Hyperspectral Imaging Technology
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Visualization of Three-Dimensional SSC (Soluble Solids Content) Across the Entire Surface of Strawberries Using Near-Infrared Hyperspectral Imaging

1
Institute of Agricultural Machinery, National Agricultural and Food Research Organization, 1-40-2, Nisshin-Cho, Kita-Ku, Saitama 331-8537, Japan
2
Graduate School of Bioagricultural Sciences, Nagoya University, Furo-Cho, Chikusa, Nagoya 464-8601, Japan
*
Author to whom correspondence should be addressed.
Foods 2026, 15(9), 1563; https://doi.org/10.3390/foods15091563
Submission received: 8 March 2026 / Revised: 23 April 2026 / Accepted: 25 April 2026 / Published: 1 May 2026

Abstract

Near-infrared hyperspectral imaging (NIR-HSI) is widely used as a non-destructive technique for evaluating internal fruit quality; however, reliable pixel-wise visualization remains challenging due to geometry-induced spectral distortions and the lack of statistically interpretable validation criteria. This study proposes an integrated framework for three-dimensional visualization of soluble solids content (SSC) across the entire surface of strawberries using NIR-HSI combined with shape-aware spectral correction and pixel-level reliability assessment. Two complementary imaging systems—a line-scan system and a rotation-scan system—were used to acquire hyperspectral and 3D shape data. Fruit height and surface orientation were incorporated into spectral preprocessing to reduce illumination and curvature effects. Partial least squares regression (PLSR) models were developed using region-of-interest-averaged spectra and applied to pixel-wise SSC mapping. To assess the statistical validity of pixel-level predictions, an imaging reliability index based on the Mahalanobis distance in the PLS score space was introduced. The results show that models with high sample-level accuracy do not necessarily produce reliable SSC maps, whereas reliability-based model selection improves image interpretability. This framework enables consistent three-dimensional SSC visualization and is applicable to hyperspectral imaging of internal fruit attributes.

1. Introduction

Fruit is one of the most widely consumed foods worldwide, and the fruit industry plays a vital role in many regions. Post-harvest quality inspection and grading support the distribution of fruit with standardized quality, thereby increasing profitability and enhancing market competitiveness. Fruit quality comprises both external and internal attributes. While external traits such as size, color, and shape are important, consumers often place greater emphasis on internal attributes because they directly influence taste [1].
In particular, the strawberry is a kind of very popular fruit with its attractive appearance, unique flavor, and nutritional compositions. Global strawberry production has expanded in acreage over recent decades due to rising demand, driving research efforts to address threats to production while improving fruit quality to meet consumer preferences [2]. Consequently, considerable effort is devoted to developing reliable methods for evaluating strawberry quality.
Non-destructive and high-speed image-based approaches have been widely investigated for evaluating external strawberry quality. Numerous systems have been developed for grading and maturity classification based on geometric and color features extracted from 2D images [3,4,5,6]. However, because these approaches fundamentally rely on surface appearance, they are unable to capture internal quality attribute.
For internal quality assessment, Near-infrared (NIR) spectroscopy has been widely used for non-destructive prediction of soluble solids content (SSC) in strawberries based on point-based spectral measurements [7,8,9]. Hyperspectral imaging (HSI) further extends this approach by enabling pixel-wise analysis and spatial visualization of soluble solids content and other internal quality attributes in strawberries [10]. Strawberries exhibit pronounced spatial heterogeneity in chemical composition: SSC and free sugar levels are substantially higher in the upper fruit region and decrease toward the peduncle, whereas organic acids remain relatively uniform throughout the flesh [11]. Consequently, accurate characterization of internal quality attributes requires region-specific evaluation, which has motivated increasing interest in HSI-based visualization of internal variability. Recent studies have demonstrated that hyperspectral imaging combined with deep learning has been increasingly applied to strawberry quality assessment, including maturity and soluble solids content determination as well as early detection and spatial visualization of quality defects [12,13,14]. HSI has also been applied to the non-destructive evaluation of various strawberry quality attributes, including postharvest quality monitoring under different processing and storage conditions [15].
However, HSI measurements are strongly influenced by fruit geometry—including curvature, variations in optical path length, scattering behavior, and shading—which can substantially degrade prediction accuracy if geometric effects are not accounted for. For small or strongly curved objects, reflectance intensity diminishes toward the edges due to angular effects described by the Lambert cosine law, resulting in pronounced pixel-level inconsistencies. Delwiche et al. [16] identified curvature-induced reflectance variation as a dominant error source in cereal-seed-sized objects and demonstrated that Lambertian correction can partially reduce spatial non-uniformity. Building on this concept, Li et al. [17,18] showed that integrating 3D shape profiles into NIR-HSI improves reflectance uniformity and enhances bruise-detection accuracy in apples by correcting both height- and surface-normal-dependent distortions. Their findings underscore the substantial impact of geometric factors on pixel-wise spectral fidelity. Along this line, the combination of three-dimensional shape measurement and hyperspectral imaging has been proposed as a four-dimensional framework for fruit grading, enabling simultaneous assessment of external geometry and subsurface defects, though the reliability of pixel-wise quantitative imaging has not been explicitly addressed [19].
In parallel, spectral variability within strawberries also arises from region-dependent differences in curvature, scattering properties, and local composition. Chun et al. [20] demonstrated that region-wise segmentation yields markedly different noise characteristics and model performance, with peripheral regions exhibiting larger errors. Moreover, Seki et al. [21] showed that HSI can successfully visualize SSC distribution even in white strawberries, where external coloration provides limited information, further highlighting the potential of NIR-HSI for internal quality mapping. Despite these advances, methodological challenges remain in ensuring the reliability of pixel-wise SSC mapping.
First, neither Li et al. [17] nor Seki et al. [21] quantitatively evaluated whether pixel-wise spectra—either after geometric correction or during visualization—are statistically consistent with the calibration manifold of the predictive model.
Second, conventional performance metrics such as R 2 and RMSEP are derived from averaged spectra and therefore cannot capture the pixel-level reliability of SSC maps, despite the well-recognized risks of spectral outliers and noise amplification during image generation. Consequently, ensuring the statistical reliability of pixel-wise predictions remains an essential and unresolved challenge in HSI-based internal quality evaluation. Because PLSR models are typically constructed using averaged spectra but applied on a pixel-wise basis, rigorous evaluation of pixel-level reliability is indispensable for robust SSC visualization.
To address these limitations, the purpose of this study is to establish a unified framework for reliable three-dimensional visualization of soluble solids content (SSC) in strawberries using near-infrared hyperspectral imaging.
Unlike previous studies that primarily evaluated model performance using sample-averaged spectra and conventional accuracy metrics, this study focuses on the statistical validity of pixel-wise predictions applied to hyperspectral images. The originality of this work lies in the integration of shape-aware spectral correction with a pixel-level reliability assessment based on the Mahalanobis distance in the partial least squares (PLS) score space, enabling quantitative evaluation of whether imaging spectra are statistically consistent with the calibration manifold. Furthermore, the proposed framework is validated under two different acquisition schemes—a line-scan system and a rotation-scan system—to demonstrate its robustness across measurement geometries. By explicitly addressing both geometry-induced spectral distortion and pixel-wise reliability, this study provides a principled approach for interpretable and reliable hyperspectral visualization of internal fruit quality.

2. Materials and Methods

2.1. Sample Preparation

In this study, two datasets were prepared using different hyperspectral imaging systems. Dataset I was acquired using a line-scan system, whereas Dataset II was obtained using a rotation-scan system. A total of 193 strawberries for Dataset I and 130 strawberries for Dataset II, all belonging to the “Tochigi i37” cultivar grown in Tochigi Prefecture, Japan, were collected. The samples were transported under refrigerated conditions and stored at 5 °C until one hour before measurement. Before imaging, the strawberries were equilibrated at 23 °C in the laboratory. Some samples exhibited minor damage during transportation, although this did not affect their inclusion in the study. The strawberries used in this study were commercially sourced and measured within two days after transportation. Samples were predominantly at a market-ripe stage with red surface coloration. Because the near-infrared wavelength range employed in this study is not influenced by visible color, surface coloration was not considered a major confounding factor.

2.2. Data Acquisition

Figure 1 and Figure 2 illustrate the conceptual workflows for hyperspectral data acquisition and SSC imaging for Datasets I and II, respectively. Dataset I was measured using a line-scan system, whereas Dataset II employed a rotation-scan system to capture hyperspectral images together with corresponding shape information.
Near-infrared hyperspectral images and 3D shape data are acquired during a full rotation of the fruit on a turntable, enabling full-surface coverage. After background removal and flesh-ROI extraction, geometric correction (height and angle) and spectral preprocessing are applied. PLSR models constructed from ROI-averaged spectra are then applied to pixel-wise spectra to generate SSC maps, which are fused with the reconstructed shape to produce three-dimensional SSC visualization over the entire fruit surface.

2.2.1. Systems

The measurement system consisted of a near-infrared hyperspectral imaging (NIR-HSI) system (push-broom line scanning type; Compovision, Sumitomo Electric Industries, Ltd., Tokyo, Japan), a light source, and a laser displacement meter (LJ-X8200, KEYENCE, Ltd., Osaka, Japan). For the line-scan system, sample movement was achieved using a slide table integrated into the NIR-HSI system. In the rotation-scan system, a turntable (OSMS-40YAW, Sigumakoki, Co., Ltd., Tokyo, Japan) was employed. The NIR-HSI system was equipped with a spectroscope and a two-dimensional InGaAs/GaAsSb photosensitive element (256 pixels for wavelength × 320 pixels for position), capable of detecting NIR light in the range of 913–2519 nm at a spectral interval of 6.2 nm. A wavelength ranging from 913 to 2166 nm (i.e., 200 wavelength bands) was selected here because reflectance over 2166 nm has a low signal-to-noise ratio. Illumination was provided by a tube-shaped light source with four halogen lamps positioned on both sides, and the irradiation angle was set to 45°. The laser displacement meter utilized a 405 nm laser, with a profile data count of 3200 points and Z-axis repeatability of 1 μm.

2.2.2. Hyperspectral Images Data and 3D Scan Data Measurement

In the line scan system, the NIR HSI frame rates were set to 150 frames s−1 for sample images and 320 frames s−1 for both dark and white reference images. Samples were placed on precisely machined aluminum blocks (60 mm × 70 mm) to ensure accurate alignment between the hyperspectral data obtained from the NIR HSI system and the shape data acquired by the laser displacement meter. A black sheet was attached to the block surface to improve background segmentation.
The field of view of both the NIR HSI system and the laser displacement meter was calibrated using a reference rectangular block or landmark. The laser displacement meter operated at a scanning speed of 2.5 mm s−1. Dark reference images were acquired with the lens cap closed to prevent light interference. For reference measurements, a white calibration board was placed above the block. The white board was 10mm thick.
In the rotation scan system, the NIR HSI frame rates were set to 100 frames s−1 for sample images, 100 and 320 frames s−1 for dark images, and 320 frames s−1 for white reference images. The NIR HSI system and the laser displacement meter were used to acquire hyperspectral images and shape data during one full rotation of the turntable, which was operated at 30° s−1. Reference data were obtained by measuring a white calibration board at height intervals of 2 mm from 0 to 30 mm, using the center of the turntable as the rotational origin. Dark reference images were recorded with the lens cap closed to prevent external light from entering the detector.

2.2.3. SSC Measurement

After hyperspectral imaging, each measurement surface was divided into two regions corresponding to the fruit apex (top) and the fruit base (bottom). Each section was wrapped in a nonwoven cloth, manually squeezed, and pressed to extract juice. The extracted juice was gently stirred, and the soluble solids content (SSC) was measured as Brix using a digital Brix meter (PR-101α, ATAGO Co., Ltd., Tokyo, Japan).

2.3. Data Analysis

2.3.1. Geometric Correction and Spectral Preprocessing

Shape correction is an important preprocessing step in hyperspectral imaging of fruits and other agricultural products, as sample geometry strongly influences the measured reflectance spectra. A widely used approach for shape correction is Lambertian surface correction, which assumes that the sample surface behaves as an ideal diffuse reflector. According to the Lambertian model, the reflected intensity decreases in proportion to the cosine of the angle between the surface normal and the observation direction. This model and its variants have been extensively applied to compensate for curvature-related effects and non-uniform illumination in hyperspectral imaging of spherical fruits [16,17].
This study evaluated a preprocessing method for hyperspectral data using 3D profile data and its effectiveness. The data was first cut out based on the measurement block to match the coordinates of the measured hyperspectral data and shape data. In addition, the shape data was resized based on the data size of the hyperspectral data.
The reflectance of each pixel i , j at each wavelength was calculated using a general formula without employing geometric shape correction by Equation (1).
R λ i , j = I λ i , j D λ i W λ i D λ i
where λ , i and j represent the wavelength and pixel index variables; R λ i , j represents the standardized reflectance intensity at wavelength λ and pixel; I λ i , j , W λ i and D λ i represent sample, white reference images and dark images, respectively.
In Dataset I, the height-corrected sample intensity I h , λ i , j data for each pixel was calculated from Equation (2), employing a simplified geometric attenuation model. This model approximates distance-dependent intensity variation by referencing the inverse square law, without explicitly modeling complex light scattering and illumination geometry. The diffuse light source was therefore treated as providing vertically downward illumination.
I h , λ i , j * = I λ i , j D λ i H 1 + H s i , j H 1 2 · H 2 + H s i , j H 2 2
where H 1 is the distance from the light source to the white panel surface, H 2 is the distance from the camera lens to the white panel surface, and H s is the height data of each pixel with the white panel surface height of 0 mm.
The distance H 1 was 100 mm, and the distance H 2 was 200 mm. For Dataset II, the brightness values of the white plate measured at discrete heights were used to calculate * I h , λ i , j . The height of each pixel was determined from the 3D scan information, and the corresponding white reference values were obtained by linear interpolation along the height direction based on the measured white plate data. Equation (3) was used to calculate the height-corrected relative reflectance R h , λ .
R h , λ i , j = I h , λ i , j W λ i D λ i
The angle(θ) between the surface normal vector and the vertical vector was calculated using the height information of the pixels in the width direction. Equation (4) was applied to obtain the intensity data of the corrected sample I a , λ by applying an angle correction using Lambertian.
I a , λ i , j = I λ i , j D λ i cos θ i , j
The relative reflectance corrected by Lambertian R a , λ was calculated using Equation (5).
R a , λ i , j = I a , λ i , j W λ i D λ i
Equation (6) was used to calculate the intensity data for each pixel with height and angle corrections I h _ a , λ .
I h _ a , λ i , j = I h , λ i , j cos θ i , j
The relative reflectance corrected height and angle corrections R h _ a , λ was calculated using Equation (7).
R h _ a , λ i , j = I h _ a , λ i , j W λ i D λ i
These preprocessing steps were applied uniformly to both calibration and imaging data to reduce geometry-induced spectral variability, thereby providing a consistent basis for subsequent ROI averaging, model construction, and pixel-wise reliability evaluation.

2.3.2. Region of Interest (ROI) Determination

To create an SSC estimation model, it is necessary to determine the region of interest (ROI) and calculate the average spectrum. ROI was determined by excluding pixels containing achenes from the fruit surface pixels, using A method combining PCA and image processing [21]. Specifically, we first created an ROI mask by removing the background outside the fruit using thresholding. We then calculated loading vectors for each pixel within the ROI range using principal component analysis. Using these loading vectors, we computed the PC1 score for each pixel. Finally, we created an ROI mask by removing pixels outside the fruit surface from the entire image through binarization using Otsu’s method [22]. For Dataset II, the ROI for the fruit region was determined using a reflectance image at 1204 nm with a threshold of 0.2 for background processing. For Dataset II, the ROI was determined using a reflectance intensity image at 1123 nm with a threshold of 12,000. For PCA, data within the wavelength range 1160–2168 nm was used for Dataset II and data within the wavelength range 1476–1917 nm was used for Dataset II. These parameters were determined through preliminary analysis. The average spectra of the fruit top and fruit bottom for each sample within this defined ROI were calculated.

2.3.3. Spectral Preprocessing

Based on the ROI-averaged spectra obtained as described above, spectral preprocessing was applied prior to PLSR model construction. Spectral processing is generally considered when developing quantitative models to remove noise and disturbances from spectral data and improve prediction accuracy. In this study, spectral processing methods included no processing, the representative Savitzky–Golay smoothing [23], and eight patterns combining first-order differentiation, second-order differentiation (window size: 9, polyorder: 2), and SNV (Standard Normalize variate). These preprocessed spectra were subsequently used for PLSR model development and evaluation, as described in the following section.

2.3.4. SSC Estimation Model

A model for estimating SSC from the average spectrum using PLSR [24] was created by combining geometric spectral correction and spectral preprocessing to generate 32 patterns. The dataset was split 7:3 into training and test sets for model building and evaluation. The split was performed using SSC-stratified sampling to keep the SSC distributions comparable between the training and test sets. The model construction optimized the number of PLS factors by 5-hold-out cross-validation for the training set. The optimal LVs were selected for the maximum root-mean-square error (RMSE) for cross-validation (RMSECV) within the global minimum + 1 standard deviation range. We used 5-fold cross-validation on the training set to select the optimal number of latent variables (LVs) by the “one-standard-error” rule and reported external test performance on the held-out set. The quality of the PLSR model was assessed using the determination coefficient R 2 and RMSE for calibration ( R 2 c and RMSEC) and prediction ( R 2 p and RMSEP). A good model possesses a low RMSEC, RMSEP, and high determination coefficient ( R 2 c ,   R 2 p ) such that calibration and confirmation results do not diverge.

2.3.5. SSC Imaging

PLSR model described above, pixel-wise SSC imaging was performed for the test dataset. Pixels corresponding to the flesh ROI in the hyperspectral images of the test data were used for sugar content visualization by applying the PLSR model constructed to estimate the SSC values. Because the hyperspectral images and the 3D shape data were acquired simultaneously and subsequently resized into a unified spatial grid, pixel-wise correspondence between spectral and geometric information was preserved, allowing direct fusion of SSC maps and shape data to generate three-dimensional visualizations.

2.3.6. SSC Estimation Model Reliability

Generally, hyperspectral imaging performance is evaluated using global metrics such as the coefficient of determination ( R 2 ), standard error, residual predictive deviation (RPD), and bias, which are derived from calibration or validation datasets. These metrics are effective for assessing the predictive performance of regression models built from averaged spectra. However, in hyperspectral imaging, the estimated SSC at each pixel is calculated by applying a model developed from the mean spectrum of a region of interest (ROI) to pixel-wise spectra, whose statistical properties may differ substantially from those of the calibration spectra. Furthermore, pixel-level spectra are strongly affected by factors such as local illumination conditions, surface geometry, signal-to-noise ratio, and spectral preprocessing. As a result, different preprocessing strategies may lead to substantial differences in spatial noise patterns and apparent heterogeneity in SSC maps, even when global prediction accuracy metrics remain similar. Therefore, conventional regression metrics alone are insufficient to evaluate the reliability of hyperspectral SSC mapping. To address this limitation, we propose a statistical evaluation framework based on the distribution of pixel-wise PLS scores, which enables quantitative assessment of the consistency between imaging spectra and the calibration dataset.
PLSR decomposes the spectral matrix X and the response variable y as Equation (8) [25]:
X = T P + E ,   y = T q + f
where represents the latent score matrix. Let X t r a i n R m × λ denote the preprocessed spectral matrix of the calibration dataset, where m is the number of calibration samples and λ is the number of wavelengths. The PLS score matrix of the calibration dataset is given by Equation (9).
T t r a i n = X t r a i n W *
where W * = W ( P W ) 1 and W and P are the PLS weight and loading matrices, respectively. For hyperspectral imaging, each pixel spectrum x j R p within the ROI is projected onto the same PLS latent space as Equation (10).
t j = x j W *
where t j R A is the PLS score vector of pixel j .
To evaluate whether a pixel-wise spectrum is statistically consistent with the calibration data, the squared Mahalanobis distance of each pixel score vector is computed in the PLS score space. The mean vector μ and covariance matrix Σ of the calibration score matrix T t r a i n are defined as Equation (11).
μ = 1 m i = 1 m t i ,   Σ = 1 m 1 i = 1 m t i μ t i μ
The squared Mahalanobis distance of pixel j is then calculated as Equation (12).
D j 2 = t i μ Σ 1 t i μ
Assuming that the PLS scores of the calibration dataset approximately follow a multivariate normal distribution, D j 2 follows a chi-square distribution with A degrees of freedom as Equation (13).
D j 2 ~ χ 2 A
To define a statistically interpretable threshold, a k σ criterion is introduced. The corresponding cumulative probability of a standard normal distribution is given by Equation (14).
α k = P r Z < k = e r f k 2
where Z N ( 0 ,   1 ) . In this study, k = 3 was adopted, corresponding to α 0.997 .
The Mahalanobis distance threshold is therefore defined as Equation (15).
τ = χ 2 A ,   a
are classified as inliers, while pixels exceeding the threshold are regarded as outliers in the PLS score space. Let N denote the total number of pixels within the ROI. The imaging reliability index is defined as the proportion of inlier pixels (Equation (16)):
R e l i a b i l i t y = 1 N j = 1 N 1 D j 2 τ
where Reliability is the indicator function.
This index represents the fraction of pixel-wise spectra that are statistically consistent with the calibration dataset under the selected preprocessing and model complexity. A higher Reliability value indicates a more reliable SSC map with reduced influence of spectral distortion and noise amplification. Unlike conventional regression metrics that evaluate prediction accuracy at the sample level, the proposed reliability index directly assesses the statistical validity of pixel-wise predictions in hyperspectral imaging. The pixel-wise reliability index was defined using a χ2-based boundary in the PLS score space. A 99% χ 2 threshold was adopted to define the calibration manifold, providing a conservative yet not overly restrictive criterion for pixel-wise evaluation. This choice avoids arbitrary tuning while preventing excessive rejection of pixel spectra when comparing pixel-wise spectra with the calibration mean spectrum. For intuitive reference, this threshold is approximately comparable to 3 σ range in a univariate normal distribution, although the formulation itself is strictly multivariate. The model’s reliability metric was defined as the average value across the test set samples.
All data analyses were conducted using MATLAB 2025a, a computer analysis software package. PLSR were conducted using libPLSR_1.98 [24].

3. Results and Discussions

3.1. Spectral Analysis

Figure 3 shows the average reflectance spectra (R) and the average spectra corrected for geometric shape including height R h , angle R a , height and angle R h _ a for each dataset. The average reflectance spectra had absorption peaks at 970, 1165, 1420, 1780, and 1900 nm. The peaks at 970, 1420, and 1900 nm corresponded to O–H-related water content, those at 1165 and 1780 nm corresponded to C–H-related sugar [26,27]. In the height-corrected spectrum, the standard deviation tended to decrease, particularly in the wavelength band below 1500 nm, suggesting that the effect of distance was reduced for each sample. Regarding angle correction, reflectance tended to be higher overall. This was considered to result from the correction of the angled edges of the fruit. For both height and angle corrections, the characteristics of the effects of both height correction and angle correction were confirmed. Although the height-correction implementations differ between Dataset I and Dataset II, the similar spectral trends observed after correction indicate that both approaches compensate for height-dependent intensity variation. These results suggest that a comparable correction effect can be achieved using Equation (2) without measuring the white reference at multiple heights. Nevertheless, further systematic validation using additional datasets will be required to fully assess reproducibility and model generalizability, which is left for future work.

3.2. SSC Models

Table 1 summarizes the dataset. The dataset was split into training and test sets at a 7:3 ratio. To avoid imbalance in SSC distribution between the two sets, samples were stratified based on their reference SSC values and assigned so that the mean and standard deviation of SSC were comparable between the training and test sets. The training set exhibited a wider SSC range and more distributed values. Fruit tops tended to have higher SSC values and larger standard deviations than fruit bottoms, consistent with the general observation that the top of strawberries is sweeter. This trend was consistent across both Dataset I and Dataset II. These dataset characteristics provide the basis for the SSC model evaluation presented below.
Table 2 summarizes the results of the PLSR model performance of the Dataset I. Comparing the coefficients of determination for the test set, the model with smoothing after height correction showed the best prediction accuracy with a coefficient of determination of 0.813 and RMSEP of 0.687 for the test set. This model demonstrates higher estimation accuracy than the uncorrected model, suggesting that geometric shape correction is an effective preprocessing method for the data. Table 3 summarizes the results of the PLSR model performance of Dataset II. The models for all conditions (shape correction and spectral pretreatment) had good prediction accuracy with R 2 p greater than 0.850. Comparing the coefficients of determination for the test set, the model with SNV-2nd derivative after height correction showed the best prediction accuracy with a coefficient of determination of 0.919 and PMSEP of 0.436 for the test set. Liu et al. [28] demonstrated that suppressing the contribution of spectral components strongly affected by distance can improve prediction accuracy. In the present study, models incorporating distance and angle corrections were likewise selected, supporting the notion that geometric factors play an important role in SSC estimation using NIR-HSI, while the reliability analysis indicates that such geometric considerations do not necessarily guarantee stable pixel-wise predictions under complex acquisition conditions.

3.3. SSC Imaging and Reliability

Figure 4 shows the imaging results for geometric correction and spectral processing in Dataset I, while Figure 5 shows the results for Dataset II. The HSI method often employs smoothing filters to reduce noise and improve visibility, but in this case, no filter was used. Both Dataset I and Dataset II demonstrate that the combination of geometric shape correction and spectral processing alters the appearance of the imaging results. In particular, for Dataset I, it can be confirmed that using second-order differentiation processing results in noisy imaging. These visual differences motivate a quantitative evaluation of pixel-wise consistency using the proposed reliability index.
When comparing models using Reliability, the proposed evaluation metric for imaging in this study, the model with the highest coefficient of determination did not necessarily exhibit the highest reliability. In Dataset I, the model using first-derivative spectral processing for height correction achieved 0.554, while in Dataset II, the model using SNV processing and second-derivative processing for both height and angle correction achieved a high value of 0.417. Meanwhile, the reliability of the model with the highest coefficient of determination in Dataset I was 0.012, while the reliability of the model with the highest coefficient of determination in Dataset II was 0.161. This indicates that even with a high model coefficient of determination, many individual pixel spectra predicted from the dataset’s average spectrum are statistically outliers. This reliability metric is likely to be higher with fewer latent variables. However, in Dataset I, with the same number of latent variables, the reliability of the first-order derivative model after height correction was 0.554, while the reliability of the second-order derivative model was 0.290, showing a difference. Second-derivative preprocessing enhanced linear fit at the sample level but also amplified high-frequency noise at the pixel level, widening the score-space dispersion relative to the calibration manifold; this explains the low reliability despite a high test-set R 2 . SNV tends to normalize individual spectra and reduce pixel-wise dispersion relative to the calibration data, which can be advantageous in some datasets. The reliability index is intended to provide a relative measure for comparing pixel-wise consistency among different preprocessing strategies, rather than to define an absolute threshold of image validity. Although Dataset II exhibited higher model-level accuracy, the lower pixel-wise reliability suggests that acquisition geometry, particularly scan direction and fruit rotation, may increase pixel-level spectral dispersion and affect imaging stability.
In hyperspectral imaging, calibration models are typically constructed using averaged spectra, whereas imaging involves pixel-wise spectra acquired under heterogeneous noise and illumination conditions. Consequently, spectral variability arising from complex fruit geometry and measurement conditions may be suppressed during averaging at the model-construction stage, while becoming critical for image-level interpretation. In agricultural products with complex shapes, it is therefore common to examine a wide range of preprocessing strategies. In this study, representative and commonly used preprocessing methods were systematically evaluated. The results indicate that combining geometric correction with appropriate spectral preprocessing improves the proposed reliability index in both datasets, suggesting that such preprocessing can mitigate spectra that deviate substantially from the calibration manifold. These findings imply that pixel-wise application of geometric correction and spectral preprocessing is effective for preserving image validity, as sample-derived spectral variability associated with complex geometry may otherwise be lost during spectral averaging.
Figure 6 and Figure 7 show the relationship between the predicted values and actual measured values for the SSC models of Dataset I and Dataset II, respectively. The left panel shows the model with the maximum coefficient of determination in the test set, while the right panel shows the model with the maximum imaging reliability index (Reliability). Both Dataset I and Dataset II showed that the model with the highest coefficient of determination provided the best linear fit. However, even the model with the highest reliability fitted the linear trend sufficiently well, suggesting it could predict values from low to high sugar content. The achieved prediction performance is comparable to that reported in previous HSI-based SSC studies on strawberries [10,20,21]. Therefore, we decided to adopt the model with the highest reliability metric as the imaging model.
Figure 8 shows a scatter plot of the PLS scores (PC1 and PC2) for the training dataset of the model with the highest reliability for both Dataset I and Dataset II. The calibration scores formed compact distributions without strong directional bias or anomalous clustering, supporting the practical use of a distance-based criterion in the latent score space. To verify the robustness of the proposed reliability-based model selection, we confirmed that the selected model remained unchanged even when the threshold (corresponding to levels of approximately 2σ, 3σ, and 4σ) was altered. These results indicate that the proposed framework is not sensitive to reasonable variations in the threshold and that model selection is driven by intrinsic consistency within the score space rather than arbitrary parameter choices.

3.4. 3D SSC Imaging Results

Figure 9 shows the imaging results for the strawberry samples in the test set of Dataset I, and Figure 10 shows the imaging results for the strawberry samples in the test set of Dataset II. The sample alphabet is arranged in descending order of measured SSC, from A to P.
In the imaging data for Dataset I, the SSC values at the fruit top were high for most samples, with those of A, B, C, and D being particularly high and clearly visible. The fruit bottom showed less variation in SSC values compared to the fruit top and tended to be lower. Sample P, in particular, exhibited low sugar content overall; while appearing red in the image, it exhibited consistently low SSC values across the fruit. These findings align with actual measurement results indicating higher sugar content in the strawberry fruit top, lower sugar content in the fruit bottom, and smaller standard deviations. Furthermore, by synthesizing shape data with sugar content imaging, shape can also be evaluated simultaneously. This imaging technique can be applied as a line sorting system for strawberries placed on a container without overlapping fruits. In the imaging data of Dataset II, differences in SSC values were observed between the top and bottom of the fruit, as seen in A (high overall SSC values) and P (low overall SSC values). It should be noted that the reference SSC measurements in this study were limited to the top and bottom regions of the fruit, which restricts direct validation of pixel-wise spatial accuracy across the entire surface. While the high correlations between these reference values and the corresponding ROI-averaged imaging results (Table 2 and Table 3) support the consistency of the SSC maps at the sample-average level, they do not provide direct evidence for fine-scale spatial accuracy within the maps. Validation experiments using samples with known internal heterogeneity would enable a more rigorous assessment of spatial fidelity. Such investigations are considered important for further strengthening the reliability and generalizability of the proposed 3D SSC visualization framework.
By first excluding achene regions and focusing on the flesh ROI, clearer SSC distribution patterns can be obtained, as shown in our previous work [21]. Building on this improvement in two-dimensional visualization, the present study further integrates shape information to enable three-dimensional SSC mapping over the entire fruit surface. This technology, which can measure the sugar content of the entire fruit rather than just one side, holds promise for use in new variety development and as a quality evaluation technique for sorting.
While the present study focuses on soluble solids content as a representative internal quality attribute, comprehensive quality indices integrating multiple physicochemical and sensory-related parameters have been proposed to better represent overall strawberry quality and can be predicted using hyperspectral imaging [29].

4. Conclusions

To address the challenge of ensuring reliable pixel-wise predictions in hyperspectral fruit quality evaluation, this study demonstrates that combining geometric correction with pixel-wise reliability assessment improves the interpretability of hyperspectral SSC imaging. Unlike conventional approaches that rely solely on sample-level accuracy metrics, the proposed framework explicitly evaluates the statistical consistency of imaging spectra. By incorporating fruit shape information—specifically height and tilt—into spectral preprocessing, the proposed approach enables reliable three-dimensional visualization of SSC distributions. The introduction of a pixel-level reliability index based on the Mahalanobis distance in the PLS score space provides a robust measure for assessing spatial noise and outliers, with practical implications for fruit quality evaluation, breeding, and sorting systems.
In summary, the proposed framework provides a practical foundation for reliable 3D SSC visualization and offers a promising direction for advanced non-destructive fruit quality assessment. While the present study demonstrates the feasibility and interpretability of the proposed approach, several limitations should be acknowledged. Although pixel-wise SSC prediction is discussed, the reference SSC values were obtained from region-averaged measurements of halved fruits, which limits the ability to claim pixel-level spatial accuracy of the estimated SSC distributions. Additional limitations include the use of instrument-specific acquisition configurations. Future work will consider the extension of this framework toward practical imaging systems, improved spatial validation strategies, and more comprehensive quality evaluation by integrating multiple internal attributes.

Author Contributions

Conceptualization, H.S. and T.I.; methodology, H.S. and B.L.; software, H.S. and B.L.; validation, H.S., T.I. and T.M.; formal analysis, H.S.; investigation, H.S. and T.K.; resources, S.T.; data curation, H.S., T.K. and B.L.; writing—original draft preparation, H.S. and T.I.; writing—review and editing, H.S., B.L., T.K., T.M., S.T. and T.I.; visualization, T.I.; supervision, S.T.; project administration, T.I.; funding acquisition, H.S. and T.I. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by JSPS KAKENHI Grant Number JP25K18336.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and analyzed during the current study are not publicly available due to restrictions related to instrument-specific configurations but are available from the corresponding author on reasonable request.

Acknowledgments

During the preparation of this manuscript, the authors used Copilot for the purposes of language editing and grammatical improvement. The AI tools were not used for data analysis, interpretation, or generation of scientific content. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Yang, X.; Zhu, L.; Huang, X.; Zhang, Q.; Li, S.; Chen, Q.; Wang, Z.; Li, J. Determination of the Soluble Solids Content in Korla Fragrant Pears Based on Visible and Near-Infrared Spectroscopy Combined With Model Analysis and Variable Selection. Front. Plant Sci. 2022, 13, 938162. [Google Scholar] [CrossRef]
  2. Hernández-Martínez, N.R.; Blanchard, C.; Wells, D.; Salazar-Gutiérrez, M.R. Current State and Future Perspectives of Commercial Strawberry Production: A Review. Sci. Hortic. 2023, 312, 111893. [Google Scholar] [CrossRef]
  3. Oo, L.M.; Aung, N.Z. A Simple and Efficient Method for Automatic Strawberry Shape and Size Estimation and Classification. Biosyst. Eng. 2018, 170, 96–107. [Google Scholar] [CrossRef]
  4. He, J.Q.; Harrison, R.J.; Li, B. A Novel 3D Imaging System for Strawberry Phenotyping. Plant Methods 2017, 13, 93. [Google Scholar] [CrossRef]
  5. Yuan, S.-Q.; Cao, Y.; Cheng, X. Research on Strawberry Quality Grading Based on Object Detection and Stacking Fusion Model. IEEE Access 2023, 11, 137475–137484. [Google Scholar] [CrossRef]
  6. Jovanović, R.; Djordjevic, A.; Stefanovic, M.; Eric, M.; Pajić, N. Enhanced Defect Management in Strawberry Processing Using Machine Vision: A Cost-Effective Edge Device Solution for Real-Time Detection and Quality Improvement. Appl. Sci. 2024, 14, 7771. [Google Scholar] [CrossRef]
  7. Qiao, Y.; Wang, C.; Zhu, W.; Sun, L.; Bai, J.; Zhou, R.; Zhu, Z.; Cai, J. Online Assessment of Soluble Solids Content in Strawberries Using a Developed Vis/NIR Spectroscopy System with a Hanging Grasper. Food Chem. 2025, 478, 143671. [Google Scholar] [CrossRef] [PubMed]
  8. Grabska, J.; Beć, K.B.; Ueno, N.; Huck, C.W. Analyzing the Quality Parameters of Apples by Spectroscopy from Vis/NIR to NIR Region: A Comprehensive Review. Foods 2023, 12, 1946. [Google Scholar] [CrossRef] [PubMed]
  9. Jaywant, S.A.; Singh, H.; Arif, K.M. Sensors and Instruments for Brix Measurement: A Review. Sensors 2022, 22, 2290. [Google Scholar] [CrossRef]
  10. Weng, S.; Yu, S.; Guo, B.; Tang, P.; Liang, D. Non-Destructive Detection of Strawberry Quality Using Multi-Features of Hyperspectral Imaging and Multivariate Methods. Sensors 2020, 20, 3074. [Google Scholar] [CrossRef]
  11. Ikegaya, A.; Toyoizumi, T.; Ohba, S.; Nakajima, T.; Kawata, T.; Ito, S.; Arai, E. Effects of Distribution of Sugars and Organic Acids on the Taste of Strawberries. Food Sci. Nutr. 2019, 7, 2419–2426. [Google Scholar] [CrossRef]
  12. Su, Z.; Zhang, C.; Yan, T.; Zhu, J.; Zeng, Y.; Lu, X.; Gao, P.; Feng, L.; He, L.; Fan, L. Application of Hyperspectral Imaging for Maturity and Soluble Solids Content Determination of Strawberry With Deep Learning Approaches. Front. Plant Sci. 2021, 12, 736334. [Google Scholar] [CrossRef]
  13. Sun, C.; Zhang, L.; Zhai, L.; Shen, T.; Cai, J.; Zou, X.; Guo, Z. Automatic Early Bruise Detection in Strawberry Fruit by Hyperspectral Imaging and Deep Learning Techniques. Postharvest Biol. Technol. 2026, 232, 113966. [Google Scholar] [CrossRef]
  14. Liu, Y.; Zhou, S.; Wu, H.; Han, W.; Li, C.; Chen, H. Joint Optimization of Autoencoder and Self-Supervised Classifier: Anomaly Detection of Strawberries Using Hyperspectral Imaging. Comput. Electron. Agric. 2022, 198, 107007. [Google Scholar] [CrossRef]
  15. Feng, C.-H. Hyperspectral Imaging Combined with Chemometrics Technique for Monitoring the Quality of Strawberries Under Various Pre-Cooling Treatments. Processes 2026, 14, 983. [Google Scholar] [CrossRef]
  16. Delwiche, S.R.; Baek, I.; Kim, M.S. Effect of Curvature on Hyperspectral Reflectance Images of Cereal Seed-Sized Objects. Biosyst. Eng. 2021, 202, 55–65. [Google Scholar] [CrossRef]
  17. Li, B.; Ma, T.; Bai, L.; Inagaki, T.; Seki, H.; Tsuchikawa, S. Three-Dimensional Visualization and Detection of Early Bruise in Apple Based on near-Infrared Hyperspectral Imaging Coupled with Geometrical Influence Correction. Postharvest Biol. Technol. 2024, 210, 112753. [Google Scholar] [CrossRef]
  18. Li, B.; Ma, T.; Inagaki, T.; Tsuchikawa, S. Enhanced Detection of Early Bruises in Apples Using Near-Infrared Hyperspectral Imaging with Geometrical Influence Correction for Universal Size Adaptation. Postharvest Biol. Technol. 2025, 219, 113282. [Google Scholar] [CrossRef]
  19. Naqvi, L.H.; Balasubramaniam, B.; Li, J.; Liu, L.; Li, B. Four-Dimensional Hyperspectral Imaging for Fruit and Vegetable Grading. Agriculture 2025, 15, 1702. [Google Scholar] [CrossRef]
  20. Chun, S.-W.; Lee, H.-G.; Lee, J.-E.; Yu, W.-H.; Hwang, I.G.; Mo, C. Region-Based Hyperspectral Imaging and Lightweight CNN Model for Nondestructive Prediction of Soluble Solid Content in Strawberries. Agriculture 2026, 16, 321. [Google Scholar] [CrossRef]
  21. Seki, H.; Ma, T.; Murakami, H.; Tsuchikawa, S.; Inagaki, T. Visualization of Sugar Content Distribution of White Strawberry by Near-Infrared Hyperspectral Imaging. Foods 2023, 12, 931. [Google Scholar] [CrossRef]
  22. Otsu, N. A Threshold Selection Method from Gray-Level Histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
  23. Savitzky, A.; Golay, M.J.E. Smoothing and Differentiation of Data by Simplified Least Squares Procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
  24. Li, H.-D.; Xu, Q.-S.; Liang, Y.-Z. libPLS: An Integrated Library for Partial Least Squares Regression and Linear Discriminant Analysis. Chemom. Intell. Lab. Syst. 2018, 176, 34–43. [Google Scholar] [CrossRef]
  25. Andersson, M. A Comparison of Nine PLS1 Algorithms. J. Chemom. 2009, 23, 518–529. [Google Scholar] [CrossRef]
  26. Golic, M.; Walsh, K.; Lawson, P. Short-Wavelength Near-Infrared Spectra of Sucrose, Glucose, and Fructose with Respect to Sugar Concentration and Temperature. Appl. Spectrosc. 2003, 57, 139–145. [Google Scholar] [CrossRef] [PubMed]
  27. Liu, Q.; Wei, K.; Xiao, H.; Tu, S.; Sun, K.; Sun, Y.; Pan, L.; Tu, K. Near-Infrared Hyperspectral Imaging Rapidly Detects the Decay of Postharvest Strawberry Based on Water-Soluble Sugar Analysis. Food Anal. Methods 2019, 12, 936–946. [Google Scholar] [CrossRef]
  28. Liu, H.; Zhang, K.; Lu, M.; Li, Y.; Tian, S.; Hu, J.; Fang, Y. Detecting Soluble Solids Content of Individual Berries Using Image Depth Correction in Hyperspectral Imaging of Grape Clusters. Food Control 2025, 178, 111473. [Google Scholar] [CrossRef]
  29. Sun, M.; Jiang, F.; Wang, C.; Zhang, A.; Tian, Y. Development of a Comprehensive Quality Index for Strawberry and Its Non-Destructive Prediction Using Hyperspectral Imaging. Food Control 2026, 181, 111754. [Google Scholar] [CrossRef]
Figure 1. Conceptual workflow of hyperspectral data acquisition and SSC imaging for Dataset I (line-scan system). Near-infrared hyperspectral images and 3D shape data are acquired simultaneously. After background removal and flesh-ROI extraction, geometric correction (height and angle) and spectral preprocessing are applied. PLSR models are constructed using ROI-averaged spectra and subsequently applied to pixel-wise spectra to generate SSC maps, which are finally fused with shape data for three-dimensional visualization. In the SSC imaging results, differences in color correspond to differences in Brix.
Figure 1. Conceptual workflow of hyperspectral data acquisition and SSC imaging for Dataset I (line-scan system). Near-infrared hyperspectral images and 3D shape data are acquired simultaneously. After background removal and flesh-ROI extraction, geometric correction (height and angle) and spectral preprocessing are applied. PLSR models are constructed using ROI-averaged spectra and subsequently applied to pixel-wise spectra to generate SSC maps, which are finally fused with shape data for three-dimensional visualization. In the SSC imaging results, differences in color correspond to differences in Brix.
Foods 15 01563 g001
Figure 2. Conceptual workflow of hyperspectral data acquisition and SSC imaging for Dataset II (rotation-scan system). Near-infrared hyperspectral images and 3D shape data are acquired during a full rotation of the fruit on a turntable, enabling full-surface coverage. After background removal and flesh-ROI extraction, geometric correction (height and angle) and spectral preprocessing are applied. PLSR models constructed from ROI-averaged spectra are then applied to pixel-wise spectra to generate SSC maps, which are fused with the reconstructed shape to produce three-dimensional SSC visualization over the entire fruit surface. In the SSC imaging results, differences in color represent differences in Brix.
Figure 2. Conceptual workflow of hyperspectral data acquisition and SSC imaging for Dataset II (rotation-scan system). Near-infrared hyperspectral images and 3D shape data are acquired during a full rotation of the fruit on a turntable, enabling full-surface coverage. After background removal and flesh-ROI extraction, geometric correction (height and angle) and spectral preprocessing are applied. PLSR models constructed from ROI-averaged spectra are then applied to pixel-wise spectra to generate SSC maps, which are fused with the reconstructed shape to produce three-dimensional SSC visualization over the entire fruit surface. In the SSC imaging results, differences in color represent differences in Brix.
Foods 15 01563 g002
Figure 3. Average reflectance spectra (mean ± standard deviation) of strawberries for Dataset I (line-scan system) and Dataset II (rotation-scan system), shown under four geometric-correction conditions: uncorrected (A), height-corrected (B), angle-corrected (C), and combined height–angle correction (D). The shaded areas represent the standard deviation.
Figure 3. Average reflectance spectra (mean ± standard deviation) of strawberries for Dataset I (line-scan system) and Dataset II (rotation-scan system), shown under four geometric-correction conditions: uncorrected (A), height-corrected (B), angle-corrected (C), and combined height–angle correction (D). The shaded areas represent the standard deviation.
Foods 15 01563 g003
Figure 4. Examples of SSC imaging results for Dataset I acquired using the line-scan system, illustrating the effects of different shape-correction and spectral-processing methods. Each panel displays pixel-wise SSC predictions generated by applying PLSR models under varying preprocessing conditions, highlighting differences in spatial noise, contrast, and image stability.
Figure 4. Examples of SSC imaging results for Dataset I acquired using the line-scan system, illustrating the effects of different shape-correction and spectral-processing methods. Each panel displays pixel-wise SSC predictions generated by applying PLSR models under varying preprocessing conditions, highlighting differences in spatial noise, contrast, and image stability.
Foods 15 01563 g004
Figure 5. Examples of SSC imaging results for Dataset II acquired using the rotation-scan system, illustrating the effects of different shape-correction and spectral-processing methods. Each panel shows pixel-wise SSC predictions produced by PLSR models under varying preprocessing conditions, highlighting differences in spatial uniformity, noise behavior, and the overall stability of full-surface SSC visualization.
Figure 5. Examples of SSC imaging results for Dataset II acquired using the rotation-scan system, illustrating the effects of different shape-correction and spectral-processing methods. Each panel shows pixel-wise SSC predictions produced by PLSR models under varying preprocessing conditions, highlighting differences in spatial uniformity, noise behavior, and the overall stability of full-surface SSC visualization.
Foods 15 01563 g005
Figure 6. Relationships between predicted SSC values and reference SSC measurements for Dataset I. The left panel shows the model with the highest coefficient of determination ( R 2 ) on the test set; the right panel shows the model with the highest pixel-wise reliability index.
Figure 6. Relationships between predicted SSC values and reference SSC measurements for Dataset I. The left panel shows the model with the highest coefficient of determination ( R 2 ) on the test set; the right panel shows the model with the highest pixel-wise reliability index.
Foods 15 01563 g006
Figure 7. Relationships between predicted SSC values and reference SSC measurements for Dataset II. The left panel illustrates the model with the highest test-set R 2 , while the right panel displays the model with the highest imaging-reliability index.
Figure 7. Relationships between predicted SSC values and reference SSC measurements for Dataset II. The left panel illustrates the model with the highest test-set R 2 , while the right panel displays the model with the highest imaging-reliability index.
Foods 15 01563 g007
Figure 8. PLS score distributions (LV1 and LV2) of the training datasets for the models with the highest reliability in Dataset I and Dataset II. The ellipse represents the χ2-based 99% boundary estimated from the calibration scores, defining the calibration manifold used for pixel-wise reliability evaluation. This figure is provided as a sanity check to confirm that the PLS score distributions do not exhibit strong directional bias or anomalous clustering.
Figure 8. PLS score distributions (LV1 and LV2) of the training datasets for the models with the highest reliability in Dataset I and Dataset II. The ellipse represents the χ2-based 99% boundary estimated from the calibration scores, defining the calibration manifold used for pixel-wise reliability evaluation. This figure is provided as a sanity check to confirm that the PLS score distributions do not exhibit strong directional bias or anomalous clustering.
Foods 15 01563 g008
Figure 9. 3D SSC imaging results for strawberry samples in the Dataset I test set. Samples are arranged from high to low measured SSC (AP). The values in the image indicate the measured Brix values for top and bottom.
Figure 9. 3D SSC imaging results for strawberry samples in the Dataset I test set. Samples are arranged from high to low measured SSC (AP). The values in the image indicate the measured Brix values for top and bottom.
Foods 15 01563 g009
Figure 10. 3D SSC imaging results for Dataset II (rotation-scan) test samples. Samples are arranged from high to low measured SSC (AP). Note: Panel (C) was excluded due to imaging artifacts during acquisition. The values in the image indicate the measured Brix values for top and bottom.
Figure 10. 3D SSC imaging results for Dataset II (rotation-scan) test samples. Samples are arranged from high to low measured SSC (AP). Note: Panel (C) was excluded due to imaging artifacts during acquisition. The values in the image indicate the measured Brix values for top and bottom.
Foods 15 01563 g010
Table 1. Overview of the datasets used for PLSR model development. Dataset I was measured using a line scan system, while Dataset II was measured using a rotational scan system. The n value indicates the total number of strawberry samples in the dataset and the number of samples in the split training and test sets. SSC represents the distribution of the range and mean ± standard deviation at the top and bottom of the fruit.
Table 1. Overview of the datasets used for PLSR model development. Dataset I was measured using a line scan system, while Dataset II was measured using a rotational scan system. The n value indicates the total number of strawberry samples in the dataset and the number of samples in the split training and test sets. SSC represents the distribution of the range and mean ± standard deviation at the top and bottom of the fruit.
Dataset nFruit SectionRange
(Brix%)
Mean ± SD
(Brix%)
ITotal193Bottom5.8–10.18.3 ± 0.91
Top7.5–13.510.7 ± 1.08
Train136Bottom5.8–10.18.3 ± 0.91
Top7.5–13.510.8 ± 1.08
Test57Bottom5.8–10.18.1 ± 0.91
Top7.9–13.310.6 ± 1.08
IITotal130Bottom7.1–11.08.9 ± 0.78
Top9.2–14.611.5 ± 1.19
Train93Bottom7.1–11.08.9 ± 0.82
Top9.2–14.611.6 ± 1.23
Test37Bottom7.7–10.28.9 ± 0.66
Top9.3–14.211.4 ± 1.08
Table 2. Summary of PLSR performance and pixel-wise imaging reliability for Dataset I acquired using the line-scan system, comparing models under different shape-correction and spectral-processing conditions. The highlights indicate the conditions with the maximum coefficient of determination and the maximum reliability index, respectively.
Table 2. Summary of PLSR performance and pixel-wise imaging reliability for Dataset I acquired using the line-scan system, comparing models under different shape-correction and spectral-processing conditions. The highlights indicate the conditions with the maximum coefficient of determination and the maximum reliability index, respectively.
SpectraSpectral ProcessingLVRMSECVRMSECRMSEPR2CVR2CR2PReliabilityr*
R-60.7100.6910.7380.7950.8070.7850.0290.891
RSmoothing60.6800.6610.7360.8120.8230.7860.0310.893
R1st derivative40.6820.6580.7930.8120.8240.7510.3090.872
R2nd derivative40.6440.6140.6990.8320.8470.8060.1040.916
RSNV60.6740.6450.7780.8160.8310.7610.2170.887
RSNV-Smoothing50.7050.6760.8200.7990.8140.7340.2270.867
RSNV-1st derivative30.7050.6840.8270.7990.810.7290.4480.849
RSNV-2nd derivative30.6540.6260.7340.8260.8410.7870.2110.872
Rh-70.6940.6520.7410.8050.8280.7830.0270.896
RhSmoothing70.6670.6310.7270.820.8390.7910.0240.900
Rh1st derivative30.7230.7000.7920.7880.8010.7510.5540.874
Rh2nd derivative30.7540.7420.7550.770.7770.7750.2900.894
RhSNV60.6490.6230.7390.8290.8430.7840.1960.900
RhSNV-Smoothing60.6890.6580.7910.8080.8240.7520.2020.883
RhSNV-1st derivative40.7060.6270.7260.7980.8410.7910.3230.896
RhSNV-2nd derivative30.6660.6360.7320.820.8360.7880.2220.902
Ra-60.7050.6830.7150.7980.8110.7980.0250.899
RaSmoothing60.6790.6570.7070.8130.8250.8020.0280.901
Ra1st derivative40.6740.6490.7640.8160.8290.7690.3110.882
Ra2nd derivative30.7840.7630.770.7510.7640.7650.2520.890
RaSNV60.6530.6260.7510.8270.8410.7760.2020.894
RaSNV-Smoothing50.7140.6850.8230.7930.8100.7320.2290.870
RaSNV-1st derivative30.7190.6940.8360.7910.8040.7230.4700.862
RaSNV-2nd derivative20.7580.7440.7730.7670.7750.7640.4630.886
Rh_a-70.6700.6350.7200.8180.8370.7950.0140.904
Rh_aSmoothing70.6300.6000.6870.8390.8540.8130.0120.911
Rh_a1st derivative30.7190.6960.7680.790.8030.7670.4480.883
Rh_a2nd derivative30.7380.6790.710.7790.8130.8010.2090.909
Rh_aSNV60.6320.5980.7110.8380.8550.80.1780.907
Rh_aSNV-Smoothing60.6820.6550.7870.8110.8260.7550.210.886
Rh_aSNV-1st derivative30.7280.7030.8470.7850.80.7160.5010.861
Rh_aSNV-2nd derivative30.6700.6390.7380.8180.8340.7840.2350.903
r*: The correlation coefficient represents the linear relationship between the reference SSC values and the imaging-based SSC values calculated as the average of pixel-wise predictions in the top and bottom regions.
Table 3. Summary of PLSR performance and pixel-wise imaging reliability for Dataset II acquired using the rotation-scan system, comparing models under different shape-correction and spectral-processing conditions. The highlights indicate the conditions with the maximum coefficient of determination and the maximum reliability index, respectively.
Table 3. Summary of PLSR performance and pixel-wise imaging reliability for Dataset II acquired using the rotation-scan system, comparing models under different shape-correction and spectral-processing conditions. The highlights indicate the conditions with the maximum coefficient of determination and the maximum reliability index, respectively.
SpectraSpectral ProcessingLVRMSECVRMSECRMSEPR2CVR2CR2PReliabilityr*
R-70.5080.4600.5090.9070.9240.8890.0260.941
RSmoothing70.4870.4450.4870.9150.9290.8990.0280.946
R1st derivative60.4990.4620.5200.9100.9230.8850.2270.949
R2nd derivative50.5310.4750.4640.8990.9190.9080.2200.955
RSNV80.4710.4300.4850.9200.9330.9000.0580.960
RSNV-Smoothing80.4820.4400.4980.9170.9300.8940.0590.959
RSNV-1st derivative70.4470.4110.4750.9280.9390.9040.1620.961
RSNV-2nd derivative40.5310.5470.4580.8990.8920.9110.3540.957
Rh-80.4980.4560.5220.9110.9250.8840.0300.945
RhSmoothing80.4860.4490.5030.9150.9270.8920.0270.949
Rh1st derivative70.4900.4510.4740.9140.9270.9040.1520.955
Rh2nd derivative40.6020.5450.4740.8700.8930.9040.2670.943
RhSNV70.5440.5400.5890.8930.8950.8520.1450.933
RhSNV-Smoothing80.4870.4460.5440.9150.9290.8740.0800.949
RhSNV-1st derivative70.4580.4260.4900.9250.9350.8980.1730.950
RhSNV-2nd derivative40.5060.4030.4360.9080.9420.9190.1610.958
Ra-70.5420.4950.4880.8940.9120.8980.0430.946
RaSmoothing70.5150.4720.4790.9050.9200.9020.0420.949
Ra1st derivative70.5110.4620.5000.9060.9230.8930.1380.953
Ra2nd derivative50.5710.5290.4880.8830.8990.8980.2400.952
RaSNV80.4800.4380.4970.9170.9310.8950.0590.958
RaSNV-Smoothing80.4900.4470.5060.9140.9280.8910.0590.958
RaSNV-1st derivative70.4520.4160.4820.9270.9380.9010.1600.961
RaSNV-2nd derivative40.5570.5500.4550.8880.8910.9120.3630.959
Rh_a-80.5120.4490.5160.9060.9280.8860.0690.940
Rh_aSmoothing70.5460.4480.4970.8930.9280.8950.0640.944
Rh_a1st derivative70.5200.4920.4970.9030.9130.8950.1880.944
Rh_a2nd derivative40.6320.5950.5280.8560.8720.8810.3300.938
Rh_aSNV70.5850.5670.5770.8770.8840.8580.1490.932
Rh_aSNV-Smoothing80.4950.4510.5560.9120.9270.8680.0800.942
Rh_aSNV-1st derivative70.4660.4330.4940.9220.9320.8960.1750.944
Rh_aSNV-2nd derivative30.6110.5750.4970.8660.8810.8950.4170.947
r*: The correlation coefficient represents the linear relationship between the reference SSC values and the imaging-based SSC values calculated as the average of pixel-wise predictions in the top and bottom regions.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Seki, H.; Li, B.; Kawaide, T.; Ma, T.; Tsuchikawa, S.; Inagaki, T. Visualization of Three-Dimensional SSC (Soluble Solids Content) Across the Entire Surface of Strawberries Using Near-Infrared Hyperspectral Imaging. Foods 2026, 15, 1563. https://doi.org/10.3390/foods15091563

AMA Style

Seki H, Li B, Kawaide T, Ma T, Tsuchikawa S, Inagaki T. Visualization of Three-Dimensional SSC (Soluble Solids Content) Across the Entire Surface of Strawberries Using Near-Infrared Hyperspectral Imaging. Foods. 2026; 15(9):1563. https://doi.org/10.3390/foods15091563

Chicago/Turabian Style

Seki, Hayato, Bin Li, Tetsuo Kawaide, Te Ma, Satoru Tsuchikawa, and Tetsuya Inagaki. 2026. "Visualization of Three-Dimensional SSC (Soluble Solids Content) Across the Entire Surface of Strawberries Using Near-Infrared Hyperspectral Imaging" Foods 15, no. 9: 1563. https://doi.org/10.3390/foods15091563

APA Style

Seki, H., Li, B., Kawaide, T., Ma, T., Tsuchikawa, S., & Inagaki, T. (2026). Visualization of Three-Dimensional SSC (Soluble Solids Content) Across the Entire Surface of Strawberries Using Near-Infrared Hyperspectral Imaging. Foods, 15(9), 1563. https://doi.org/10.3390/foods15091563

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop