Next Article in Journal
Mitigating Soil Compaction in Sugarcane Production: A Systems Approach Integrating Controlled Traffic Farming and Strip Soil Tillage
Previous Article in Journal
Economic Impact of Optical Sensors and Deep Learning in Smart Agriculture: A Scientometric Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Assessment of Physicochemical Properties of Cashew Apple Through Computer Vision

1
ICAR-CCARI, Ela, Old Goa 403402, Goa, India
2
Department of Agricultural and Biosystems Engineering, North Dakota State University, Fargo, ND 58102, USA
3
ICAR-Directorate of Cashew Research, Dakshina Kannada, Puttur 574202, Karnataka, India
4
Department of Agricultural and Biosystems Engineering, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
5
Food Science & Technology, University of Georgia, Athens, GA 30602, USA
*
Author to whom correspondence should be addressed.
AgriEngineering 2025, 7(12), 398; https://doi.org/10.3390/agriengineering7120398 (registering DOI)
Submission received: 20 October 2025 / Revised: 22 November 2025 / Accepted: 24 November 2025 / Published: 28 November 2025

Abstract

Cashew apples, a byproduct of the cashew nut industry with an estimated global production of 38 million tonnes, are rich in several essential nutrients and are widely processed into juice, syrup, wine, pickles, and other value-added products. However, their morphological and physicochemical properties vary significantly across varieties, complicating in-field characterization, maturity assessment, and biochemical analysis. These challenges originate from the reliance on costly chemicals, skilled manpower, limited time, and sophisticated equipment. This study employed a user-developed computer vision-based ImageJ 1.x batch processing plugin to assess 15 physicochemical properties across six diverse cashew apple varieties from the images of slices and whole samples. Five methodologies—color grid, surface morphology, gray level co-occurrence matrix, local binary pattern, and color indices—generated image-based metrics rapidly ( 2.87 ± 0.79  s/image). The correlation of wet chemistry with image-based parameters, linear modeling, and wet chemistry parameters prediction with an independent dataset were successfully performed, and the successfully modeled properties include acidity, antioxidants, carbohydrates, carotenoids, crude fat, flavonoids, pH, phenolics, proteins, tannins, vitamin C, and total soluble solids. The results demonstrated the feasibility of predicting 11 out of 15 physicochemical properties of cashew apples ( R 2 > 0.5 ). This methodology offers a faster, safer, and cost-effective alternative to wet chemistry and can be extended to other horticultural crops.

1. Introduction

Cashew (Anacardium occidentale L.) apple is a pseudofruit formed from the pedicel of the flower. The cashew nut, a highly sought after commercial product component, is attached to the cashew apple (Figure 1). Cashew apples are elongated, round, or pear-shaped fibrous fruits that naturally occur in yellow, orange, and bright red colors [1]. They weigh between 40 and 80 g and have lengths between 60 and 100 mm [2]. The cashew apple is sometimes used directly for table purposes [3]; however, it is primarily utilized in various processed forms.
Although the cashew tree originated in Brazil, it was spread to many of its colonies through Portuguese traders and is now globally distributed in approximately thirty countries between 25° N and 30° S latitude of the equator. Currently, the Ivory Coast has emerged as the largest producer of cashew, followed by India and Vietnam [4,5]. However, just a few years ago, India was the top producer, followed by Vietnam, Brazil, and East Africa [6]. The production of cashew apples has been indirectly estimated to be 9 to 10 times that of cashew nuts [7,8]. The global production of cashew nuts in 2023 was 4.2 × 106 t [5], which amounts between 37.8 × 106 t and 42 × 106 t of cashew apples.
Cashew apple is a storehouse of vitamins, minerals (potassium, magnesium, sodium, and iron), amino acids, carotenoids, phenolics, flavonoids (myricetin and quercetin), organic acids, and antioxidants [9,10,11,12,13,14,15]. Cashew apple has several proven health benefits, such as being an instant energy booster [16], having immunobooster-enhancing bioactive compounds functionality [13,16,17], being anti-inflammatory, and being anti-microbial [18,19]. Beyond its nutritional value, cashew apple has been traditionally used in various medicinal practices [19,20]. Its bagasse, the byproduct obtained after juice extraction—also known as pomace—is rich in cellulose, lignin, and hemicellulose [14,21,22].
Numerous processes and products have been developed from the cashew apple, including its juice and bagasse/pomace. The fruit is commonly processed into candies, pickles, jams, jellies, preserves, and syrup, and is also used as animal feed [23,24,25]. Cashew apple juice contains five times more vitamin C than orange juice and has been utilized in the production of ready-to-serve beverages, squash, wine, feni, and bioethanol [26,27,28]. The pomace serves various applications, including as a flour substitute in bakery products, alternatives to meat and dairy, and as a raw material for nanocomposites, polylactic acid, and other bioproducts [29,30]. Despite these diverse and well-documented uses, the cashew apple remains underutilized commercially due to its highly perishable nature [2] and the presence of undesirable flavors and compounds [8,31,32].
Rapid and timely quality assessment immediately after harvest is essential for optimal utilization and processing. However, the short harvesting season limits the feasibility of traditional wet chemistry methods and the generation of useful results. The wet chemistry procedures require sophisticated equipment like HPLC, GCMS, Raman, NIR, MIR, UV, Vis/SWIR spectroscopy, texture analyzers, NMR, X-ray imaging, etc., which are time-consuming, expensive, and require skilled personnel. There is currently a lack of rapid, affordable, and labor-efficient alternatives for estimating the physicochemical properties of cashew apples.
Developing such alternatives is crucial not to replace wet lab analyses but to support them by identifying optimal harvest stages and enabling in situ rapid screening. This approach would minimize the number of samples requiring full laboratory analysis. Such innovations would be globally beneficial: they could alleviate skilled labor shortages in developed countries and compensate for the limited research infrastructure in developing regions. Moreover, the methodology could be adapted for use with minimal training, enabling even unskilled workers to assess cashew apple quality, potentially extendable to other fruits and vegetables.
Open-source tools, being free and accessible to all, could play an impactful role in developing these alternative methodologies. In the domain of image analysis, “ImageJ”, a free and open-source image analysis system, has been widely used to estimate various physical and chemical properties of fruits, vegetables, and other food products [33,34,35,36]. Numerous studies have demonstrated a correlation between surface texture and color and functional compounds, biochemical, and physicochemical properties [37,38,39]. The integration of ImageJ with computer vision technologies enables rapid, non-destructive, and repeatable assessments of cashew apple quality in both field and laboratory settings. These approaches not only reduce reliance on traditional wet lab analyses but also make quality evaluation more accessible, thus supporting the scalability and commercial viability of cashew apple processing.
Therefore, it is hypothesized that the cashew apples’ image-based parameters, derived from various formulas, are correlated to the physicochemical properties, which are estimated through standard wet chemistry methods, and consequently, simplified models for physicochemical properties can be developed using image-based parameters as independent variables. Therefore, this study aims to correlate image-based parameters derived from the images (whole and sliced) of different varieties of cashew apples, varying in color, shape, and size, to the physicochemical properties estimated through the standard laboratory protocols, and develop rapid tools (plugins and models) for in-field estimation of cashew apple physicochemical properties.
The specific objectives of this study are to (i) derive image-based parameters for whole fruits and cut slices of cashew apples; (ii) correlate the derived parameters to the actual wet chemistry physicochemical property results; and (iii) develop models for the estimation of physicochemical properties of cashew apples based on image analysis parameters and their validation.
The results of this study present a rapid, cost-effective, and user-friendly methodology for estimating the physicochemical properties of cashew apple. This methodology will serve as a valuable tool for researchers, breeders, food processors, technicians, farmers, and other stakeholders, thereby reducing project budgeting costs, requiring minimal training, supporting data security, enhancing product quality, enabling the analysis of more samples, and minimizing the time required for evaluation.

2. Material and Methods

The methodology followed in the study consisted of four stages (Figure 2). First was the sample collection from the field; their image capture, as a whole and sliced; processing; and storage for further analysis. The second stage was the wet chemistry in the laboratory for estimation of the physicochemical and biochemical properties (dependent variables) and simultaneously preprocessing and processing of the images acquired using a user-developed ImageJ plugin to derive various parameters (independent variables) for estimation of the physicochemical properties. The third stage was the determination of the correlation of the dependent and independent variables. In the ultimate stage, based on the best-ranked parameters, models were developed and validated.

2.1. Samples Description

Six varieties of cashew apples of varying flesh and skin colors viz. orange (Bhaskara, VRI-3), red (Vengurla-2, Madakkathara-MDK), and yellow (NRCC-1, VTH-174) were harvested from orchards of the ICAR-Directorate of Cashew Research (12°44′44″ N, 75°13′54″ E) and (12°44′24″ N, 75°13′50″ E), Puttur, Karnataka, India (Figure 3). Cashew apples were harvested based on a phenological scale developed by Adiga et al. [40], namely, modified BBCH (Biologische Bundesanstalt, Bundessortenamt, and Chemische Industrie). This scale has five nut and fruit maturity stages: 811, 813, 815, 817, and 819. The final two stages, ‘817’ representing approximately 70% ripened and ‘819’ representing fully ripened cashew apples, were chosen for the study.
In this study, images of whole and sliced cashew apples were utilized as input images to assess the external and internal characteristics and their effect on the determination potential of physicochemical properties. After harvesting, whole cashew apples and their slices (images captured shortly after slicing to prevent discoloration) were photographed using a Nikon D5600 camera (Nikon Corporation, Tokyo, Japan) fitted with a Nikon F lens (maximum resolution: 6000 × 4000 , 24 MP). The apples were then pulped and stored at −20 °C until use for physicochemical properties analysis. All analyses were performed in triplicate. Original images of the cashew apple samples used in the study, along with samples wet chemistry physicochemical properties measurements, can be found in the “Mendeley Data” repository (https://doi.org/10.17632/67kn43rj5c.1; accessed on 16 October 2025) [41].

2.2. Wet Chemistry

The traditional wet chemistry methods were used for estimation of the physicochemical, namely, moisture content, fat, protein, total carbohydrates, total sugar, reducing sugar, vitamin C, and biochemical properties like total phenolic content, total flavonoids content, antioxidant activity, tannin content, and carotene content of the cashew apple varieties at the two maturity stages using AOAC standard methods. For the physicochemical properties analysis, the cashew apples were pulped using a mortar and pestle, and three replications were employed. All chemicals and solvents used were of analytical grade and obtained from Sigma (Sigma-Aldrich Co., Bangalore, India) and Merck (Darmstadt, Germany). A total of 15 wet-chemistry-based physicochemical properties was measured, and they were considered dependent variables in the data analysis. Details of the wet chemistry protocols for physicochemical properties considered are presented in Appendix A.

2.3. Computer Vision Methodology

The computer vision methodology, in essence, involves acquiring images of whole and sliced cashew apple varieties at two maturity stages, followed by preprocessing using the ImageJ software (version 1.54q), and subsequent image-based parameter extraction using a custom-developed ImageJ plugin (Figure 2). The selection of image-based parameters is guided by a logical framework that ensures its relevance to the measured physicochemical properties. Finally, the correlation between the extracted parameters and the physicochemical properties is analyzed to derive the desired insights and model development.

2.3.1. Image Acquisition

Cashew apples harvested from the field were cleaned, and their images were captured in two forms—whole and sliced—to study how these forms correlate with physicochemical properties. The samples (whole and slices) were placed on a uniform colored background with better contrast, and photographed using a Nikon D5600 camera equipped with an F-mount lens (maximum resolution: 6000 × 4000 , 24 MP). The images were saved in *.jpg format and organized into specific folders for subsequent analysis.

2.3.2. Image Preprocessing

The captured images were preprocessed to make them suitable for image-based parameter extraction using the user-developed ImageJ plugin. The sequence of preprocessing operations followed is illustrated in Figure 4. The original images are collections of many samples (slices or whole fruits) on the original background (Figure 4A) that need to be suppressed for further image processing. Hence, the collections are split into individual samples, and their backgrounds are eliminated. Preprocessed images were made to have a uniform black background, enabling the automatic extraction of objects of interest by the plugin, thereby facilitating the determination of analysis parameters.
Initially, the images were opened in ImageJ, and each slice or whole fruit was selected (rectangle selection tool) and cropped (Figure 4B). Each of these cropped images serves as a replication for the analyses. Segmentation of the samples from the background is a prerequisite for the extraction of the sample features. Usually, the segmentation of color images is performed via the conversion of grayscale images, from which the final segmented binary images are obtained. However, this method was not applicable, as the images acquired on-field, as well as in this study, may lack a contrasting background. Therefore, the application of color-based segmentation was the most suitable option. Although several color spaces are available, the selection of the appropriate color space facilitates effective segmentation. For the selection of the color space, the presence of distinct and separated histogram peaks in the component channels serves as a discriminating criterion. Preliminary studies showed that “color thresholding” in the YUV (luminance, chrominance-blue, and chrominance-red) color space (Figure 4C) produced the best segmented images for both cashew apple slices and whole fruits. The ‘U’ channel of the YUV color space exhibited distinct and separated histogram peaks, assisting the segmentation process.
The ‘U’ channel was utilized to select the region of interest (polygonal), which was converted to a binary image (Figure 4D) employing the “make binary” command. The binary images were then further refined by filling holes (Figure 4D), as necessary. A median filter was employed to smooth (Figure 4E) the edges (10 pixels) and eliminate minor particle artifacts. Finally, the preprocessed binary image (mask) was overlaid onto the corresponding cropped image sample, utilizing the image calculator command with the ‘AND’ operator (Figure 4F).
Any issues, such as the presence of skin of the fruits appearing in slice images or blemishes or pedicel cavities in whole fruits, had a darker color and required rectification, as they would misrepresent the natural color of the samples analyzed. This correction was performed using the polygon tool, followed by fit spline to smooth the sharp edges and corners. In slices, the skin portion was manually selected using the smoothed polygon and filled with a black background (Figure 4G–I). For whole fruits, the blemishes or pedicel cavities were removed by copying and pasting the nearby similar-colored patches (Figure 4J–L). For the ease of processing multiple images, shortcuts were created for the often-used operations, such as color threshold, make binary, fill holes, median filter, convert to mask, save as gif, image calculator, fit spline, and save as jpeg.

2.3.3. Image-Based Parameters

The preprocessed images (Figure 2) were analyzed to extract color, texture, and surface morphology descriptors using various methodologies described below, to study how they relate to wet chemistry-based physicochemical properties. A total of 15 wet chemistry parameters (dependent variables) and 144 image-based parameters (independent variables) derived from various methodologies are listed in Table 1. These parameters were assessed separately for slice and whole fruit images due to their distinct surface properties in terms of color and overall surface texture.
(i)
Color-based analysis
The overall color profiles of each image (slices or whole fruits; e.g., Figure 5A1) based on their pixel color intensities (R—red, G—green, and B—blue) were captured for the data analysis. These values were collected after identifying several regions (color grid) inside the sample outlines, and the descriptive statistics were obtained for further analysis. For this, the images were systematically partitioned into rectangular regions of interest (ROIs) along the length and breadth of the sample outline (Figure 5A2). The number of ROIs employed in the color analysis was based on the user-specified inputs (Section 2.3.4). For each ROI, the pixels’ R, G, and B intensity values were extracted, and statistical descriptors, including mean, median, geometric mean, standard deviation, maximum, minimum, skewness, and kurtosis, were computed (Table 1). The total number of independent variables in the color-based analysis was 27.
(ii)
Surface morphology—waveness and roughness
Surface morphology or texture was evaluated in terms of waveness and roughness. The waveness represents large-scale surface deviations, while roughness refers to the fine irregularities on the surface. Surface morphology method operates on grayscale images (Figure 5B1). Before surface morphology parameter extraction, the following preprocessing operations were applied to minimize artifacts: (i) Leveling—the surface was leveled to correct tilt using a polynomial least squares plane subtraction, (ii) Filtering—a Gaussian filter was applied to reduce noise while preserving edge features, and (iii) ROI selection—a consistent ROI was selected across the samples to ensure comparability.
To distinguish between waveness and roughness, the surface profile was decomposed using the waveness and roughness functions. A cutoff wavelength-based filter was selected to separate waveness from roughness. The 2D waveness (Figure 5B4) and roughness (Figure 5B7) profiles were extracted using the plugin following the reported methodology [42] and the reported plugin generating various parameters [43]. The profile data (Figure 5B3,B6) was also used to generate descriptors such as mean, median, mode, maximum, minimum, kurtosis, and skewness, employing descriptive statistics. With 14 parameters each for waveness and roughness, the total number of independent variables considered in the surface morphology analysis was 28 (Table 1). These surface morphological parameters of waveness and roughness are influenced by various inputs (Section 2.3.4). Where the “Cutoff value (pixel)” is the cutoff radius for the Gaussian filter to be applied, it has to have a minimum value of 4, but the maximum value depends on the size of the image, and the offset variables of the line ROI (ROI x1, ROI y1, ROI x2, and ROI y2) decide the position (%) of the line transect, along which the waveness and roughness are determined (Figure 5B2, B5).
(iii)
Gray level co-occurrence matrix (GLCM) method
The GLCM is a second-order statistical texture analysis method, originally developed by Haralick [44,45], that uses the spatial relationship between the gray values of pairs of pixels. Therefore, the preprocessed images were initially converted to 8-bit grayscale. The GLCM is a matrix with gray levels in the rows and columns. Each entry ( i , j ) in the matrix represents the probability or frequency with which two pixels separated by a distance defined by the user have intensity values of i and j. The parameters that can influence these values are considered as user-defined inputs (Section 2.3.4), which will be the distance between the pixel pairs referred to as the “Step size” in pixels (1, 2, 3, etc.) and the “Step direction”, which can be 0°, 90°, 180°, and 270°, and the number of gray levels employed (256 to 8 are used in the study; other values such as 16 are also possible).
The GLCM is normalized by dividing each element by the total number of pairs considered before obtaining the statistical features. Each normalized GLCM represents the joint probability distribution of gray-level intensity pairs. From these matrices, the contrast, correlation, angular second moment, inverse difference moment, and entropy were calculated. Directional values were averaged to obtain a single representative measurement per parameter for each sample. The commonly used GCLM extracted features are as follows: (i) contrast—measures the local intensity variation, where higher contrast implies higher local variations; (ii) correlation—measures the correlation between a pixel and its neighboring pixel; (iii) angular second moment (energy)—measures textural uniformity, where higher energy means less textural complexity; (iv) inverse difference moment (homogeneity)—measures the closeness of distribution of the GLCM to its diagonal, where, again, higher homogeneity implies a more uniform texture; and (v) entropy—measures randomness, where higher value signifies the complexity of texture (Table 1). The total number of GLCM-based parameters derived was five.
(iv)
Local binary pattern (LBP) method
The LBP method (Figure 5C1–C8) operates on grayscale images by comparing the intensity of each central pixel with that of its neighboring pixels within a defined radius [46]. For each neighbor, a binary value of 0 is assigned if its intensity is lower than the central pixel, and 1 if it is equal to or higher. The resulting binary sequence is concatenated in a predefined order to form a LBP code, which is then converted into its decimal equivalent. A histogram of these LBP codes is generated to represent the frequency distribution of distinct local texture patterns within the image. Since the colors in the RGB spectra may be correlated with physicochemical properties, a modified approach of calculating the intensities separately in the three spectra was developed in this study. For this, the RGB channels of the preprocessed color images were extracted, and corresponding images were obtained utilizing the “RGB Stack” conversion and “Stack to Images” ImageJ commands. Descriptive statistics were then used to generate parameters such as minimum, maximum, mean, geometrical mean, quadratic mean, standard deviation, etc. (Table 1). No specific user-input variables are necessary. The total number of LBP-based parameters derived was 40.
(v)
Color-based indices (CBI)
Numerous researchers have successfully utilized vegetative indices (VIs) derived from RGB color data of target images to assess crop yield, chlorophyll content, and various other qualitative parameters of crops. Building upon the success of VIs applications, this study dealing with fruits (both slice and whole) employed 27 existing VIs, hereafter referred to as color indices. These color indices were computed based on the overall mean R, G, and B values using relevant formulas (Table 2). Given that cashew apple samples showing the prominence of red and yellow hues in their sample images (both slice and whole), 17 additional color indices focusing mostly on ‘R’ component and its interaction with ‘G’ and ‘B’ components, extending the VIs formulas, were newly developed in this study (Table 1). Therefore, the total number of color-based indices (27-existing, 17-new) was 44.

2.3.4. The Plugins Input Panel

The developed plugin, titled “Cashew Apple and Slices Analysis”, features an input panel depicted in Figure 6. The panel provides the user interface for various methodologies and enables the input of various parameters for the analysis. The input panel is designed to be self-explanatory with brief instructions, distinct checkboxes, and input fields categorized by types of analysis. Users interact with the panel to generate analysis and output. This plugin operates in a “batch-mode”, where all the preprocessed images stored in the input folder are automatically analyzed, results are appended with timestamps and stored in a ‘*.csv’ file format for record and further analysis, and visualize the results progress of batch analysis as a pop-up window (Figure 5D1) and on the ImageJ progress bar.
The plugin (Figure 6) provides user-friendly prompts to ensure accurate input. The first checkbox, “Opening the session for the first time?”, establishes the default folder for storing input images during batch processing, a task that is necessary only once. When the ‘csv’ output is desired, the second box should be checked. The type of analysis desired can be chosen from the group of checkboxes: All, Color, Wave, GLCM, LBP, and CBI. Users can choose the analysis individually or in combination, or ‘All’ (ignores individual analysis and generates all analysis results). The other inputs (input boxes) are related to the different analyses and are explained in Section 2.3.3. Upon clicking the “OK” button, the program executes based on the selected analysis and generates images, log files, and CSV data. The “Cancel” stops the execution, bypassing it, and makes the plugin ready for the next session.

2.3.5. The Cashew Apples and Slices Analysis Plugin

The “Cashew Apples and Slices Analysis” ImageJ plugin has a total of 2050 lines coded in Java using Fiji (Ver. ImageJ 1.54q). The plugin’s functionality is encapsulated within various Java methods (functions), including reading inputs, performing different analyses, and generating graphical and textual outputs, which play well with the batch processing strategy used in the plugin. Some of the advanced features used in the plugin are derived from packages including Apache Commons DescriptiveStatistics; ImageJ’s ImagePlus, ImageProcessor, GenericDialog, GaussianBlur, and WindowManager; and Java’s Stream, Date, File, BufferedWriter, and FileWriter.

2.4. Data Analysis for Predicting the Wet Chemistry from Image-Based Parameters

Data analysis is aimed to (i) determine the strength of association between the wet chemistry physicochemical properties (dependent) and the image-based parameters (independent) variables and graphically visualize the correlation, (ii) evaluate the range and ranking of image-based parameters correlation to wet chemistry parameters, (iii) develop the top three linear prediction models of wet chemistry based on the image-based parameters, and (iv) perform prediction and validation with visualization, employing the top-ranked prediction model developed and applied to unseen images. To accomplish these aims, an R (Ver. 4.5.1, 2025) program was developed utilizing the RStudio IDE (Ver. 2025.09.1, Posit Software, PBC). Major R packages involved in the R program development are tidyverse, dplyr, readr, corrplot, and ggplot2. The R program was coded as an assembly of several “functions”, each designed to perform specific tasks. These functions were appropriately invoked in the program to execute the analysis and visualization. A brief description of the various data analysis procedures is presented subsequently, and the R script containing the data is uploaded to an online data repository [41].

2.4.1. Correlation Analysis and Visualization

The data obtained from the measured wet chemistry parameters and generated by the computer vision plugin (Section 2.3.4) as an *.csv file forms the input for the correlation analysis. The program reads the entire csv file and converts it into a “dataframe”. The data should be preprocessed by subsetting to extract the relevant variables, discarding other informative (e.g., time and image file name) and non-varying (e.g., certain minimum and maximum values) parameters (Section 3.2). The subset data contains categorical variables, namely, cashew apple varieties (6), maturity stages (2), and sample types (2), along with their replications. The replications should be combined into mean values associated with unique combinations of the categorical variables. The analysis is performed separately for the slice and the whole samples (as their images exhibit distinct differences). These operations were carried out through group_by, filter, and select operations.
A matrix of correlation coefficients (r) was generated using the cor() command, and this object, after being converted to a dataframe, was output as a csv file for further analysis using the write_excel_csv commands. The corrplot() command with method of “square” and color options created the r values visualization through a color-coded correlation diagram (Figure 7). Correlation diagrams were created separately for the slice, whole, and combined samples to study their effect on the association.

2.4.2. Range and Ranking of Correlation Coefficients

Although feasible with R, the MS Excel spreadsheet program was utilized to create the range data (Section 3.5) and ranking of r values. In addition, code for generating a LATEX ranking table was created (Section 3.6). Utilizing the MIN() and MAX() functions on the slice and whole samples *.csv files separately, the r range data was generated, excluding the r values of wet chemistry parameters (self-correlation). The generated range data was then read into the R program, reshaped using pivot_longer, and plotted using ggplot commands. The ggplot utilized geom_bar with the “stack” option to draw the range maximum and minimum r values, facet_warp, and other commands to depict both slice and whole samples.
For developing the ranking table, the actual r values (both +ve and −ve) were (i) converted to absolute values using the ABS(), (ii) for each wet chemistry parameter, the image-based parameters were ranked using the RANK(), (iii) the top 10 image-based parameters for each wet chemistry parameter were collected after sorting, and (iv) those top ten image-based parameters as text were combined using the CONCATENATE() with other symbols (e.g., & and \\) to generate the LATEX table code.

2.4.3. Wet Chemistry Prediction Models Development

The process of creating the top three (3) models for each wet chemistry parameter (15) with maturity stages (2) and sample types (2) from the single master subset data (total = 180 models; Section 3.7) involves a series of intricate operations. The data processing requirements to achieve this as a “repetitive” function are as follows: select the specific wet chemistry variable while dropping the other 14; select all of the image-based parameters; filter the rows that contain the desired maturity stage and sample type; create a set of linear models using the lm() function for all the image-based parameters by generating a formula for the dependent with all the image-based parameters predictor variables using the map() function that generates a “list” of models; extract the intercept, slope, R 2 , number of observation, and individual formula strings using the sapply() function; generate a dataframe with these values; sort the models based on R 2 in descending order using arrange(-()); and select the top three models using slice_head(n = 3). This function was invoked 60 times (15 wet chemistry × 2 stages × 2 sample types), and each time it produced the top three models (3 rows). These three rows of output were appended to generate the 180 model result rows using bind_rows and finally written to the file using the write_excel_csv() command.

2.4.4. Wet Chemistry Prediction and Validation

For prediction, the top-performing models of wet chemistry in each combination of stage and sample type (60) are utilized across the test set of unseen images. The predictions are validated with the known wet chemistry measurements for the six varieties of cashew apple. The image-based parameters are generated by the developed plugin with the test images as inputs. The top-performing models in each category specify the independent variable involved, and the value of that variable is obtained as a mean from the image-based parameters plugin results. The mean value of the independent variable when input into the linear model with specific slope and intercept values will generate the prediction of wet chemistry physicochemical properties. The performance of prediction will be evaluated as the ratio (predicted to actual), where the ideal prediction will have a ratio of 1.0, while underestimation will have ratios < 1.0 and overestimation will have ratios > 1.0.
The prediction and validation can be visualized as a dodged bar graph with maturity stages for all the wet chemistry properties evaluated for each sample type. The six varieties will be depicted as facets. The dodged bar plot function takes the data (actual, predicted, and ratio, specific to slice and whole), and y-axis limits (min and max) inputs, and generates the plot. The plot function filters the ratios, the geom_bar() command takes “dodge” as argument, passes the y-axis inputs into the ylim() command, draws a horizontal line at y = 1.0 using geom_hline with yintercept = 1.0, and makes facets using facet_wrap() with ∼Variety as the argument. The online data repository R script provides additional details [41].

3. Results and Discussion

3.1. Results: Variation in Wet Chemistry Results Within Samples and Varieties

Significant differences were observed in the physicochemical properties of cashew apples between maturity stages 817 and 819 across all varieties (mean ± std shown in Table 3; wet chemistry measurement protocols are given in Appendix A). With the progress of maturity stages, the acidity decreased while the pH and TSS increased consistently ( p < 0.05 ). Antioxidant activity, flavonoids, phenolic content, and tannins showed significant reductions ( p < 0.01 ), indicating a decline in bioactive compounds after maturation. Carbohydrate content, reducing sugars, and total sugars increased significantly at stage 819 ( p < 0.01 ), suggesting an increase in sweetness and energy value. Carotenoids and vitamin C also increased sharply, particularly in Bhaskara and VRI-3 varieties ( p < 0.001 ). Crude fat content and proteins declined across all varieties, with VTH-174 and MDK showing the most pronounced reductions. The variety VTH-174 exhibited the highest antioxidant activity at stage 817. Bhaskara recorded the highest vitamin C content at stage 819 (181.71 mg/100 g). Variety VRI-3 showed a threefold increase in carotenoids from stage 817 to 819, and NRCC-1 had the highest sugar and total carbohydrate levels at stage 819.
Overall, significant fluctuations in the biochemical composition of cashew apples were observed between maturity stages 817 and 819, reflecting the typical ripening dynamics (Table 3). The decline in phenolics and flavonoids suggests reduced antioxidant activity with maturity (negative correlation between these compounds and fruit ripening) as reported in previous studies [65]. Conversely, the increase in sugar and vitamin C content indicates that with ripening, the palatability and nutritional appeal of cashew apples are improved. This is particularly evident in varieties Bhaskara and VRI-3. Therefore, it is expected that any visual or textural differences identified at these stages could be correlated with these biochemicals.
The selected quality parameters are essential for assessing the quality of the fruit; however, these methods are destructive in nature. All parameters require special instruments and an elaborate extraction procedure of the samples for analysis. Total phenols, tannins, flavonoids, antioxidant activity, sugars, carbohydrates, vitamin C, and carotenoids require solvent-based extraction of the targeted compounds followed by spectrophotometric or titrimetric estimation. Extraction and estimation process demand additional time and the use of expensive solvents and chemicals for analysis, especially for total phenols, tannins, flavonoids, antioxidant activity, sugars, carbohydrates, and carotenoids. Parameters such as carbohydrates, protein, and fat estimation involve extended extraction and thus require more time for estimation. Additionally, the estimation of many of these quality parameters requires the use of chemicals which are corrosive and harmful to skin and eyes, which may also lead to systemic injury.

3.2. Independent Image-Based Variables Considered for Analysis

By executing the developed ImageJ plugin for a set of user inputs (Figure 6) with the training (75%) batch of images of cashew apple samples (6 varieties; 2 sample types; 2 sample stages; 7–13 slice replications/variety; 4–6 whole replications/variety; 104 slices + 60 wholes; 164 total), the exhaustive 144 image-based parameters output was generated. Upon analyzing the variations of these parameters across the 15 wet chemistry physicochemical properties variables (dependent), several of the image-based variables (independent) exhibited no or insignificant variation. Therefore, the following set of 28 parameters was excluded (abbreviated names provided and expanded form presented in Table 1 for further analysis and model development: Ov-mxR, Ov-mxG, Ov-mxB, Ov-miR, Ov-miG, Ov-miB, Rn-Gmn, Ov-noR, Ov-noG, Ov-noB, Wn-no, Rn-no, L-pxGr, L-pxR, L-pxG, L-pxB, L-miGr, L-mxGr, L-medGr, L-miR, L-mxR, L-medR, L-miG, L-mxG, L-medG, L-miB, L-mxB, and L-medB. It should be noted that these parameters, when considered, would have resulted in the least correlation and have not generated practical models or those with the lowest performance. Thus, only the remaining 116 image-based parameters ( 144 28 = 116 ) were used for further analysis.

3.3. Plugin Input Variables Combination Selection

Given that user inputs and their combinations (Figure 6) influence the image-based parameters evaluated, the best inputs must be selected based on the collective performance of the prediction models. Even though five sets of image-based parameters, including color, waveness and roughness, GLCM, LBP, and color indices, were evaluated (Table 3), the input variables were only from the three sets (Figure 6), while other inputs were internally configured within the plugin. A systematic approach was employed to evaluate and identify the most suitable combination of input variables (color: columns and rows; waveness and roughness: cutoff value; and GLCM: step size and step direction) entered through the plugin’s input panel (Figure 6). Every combination of the input parameters for the 4 categorical variables (types and stages), 15 dependent wet chemistry variables, with the 116 independent parameters resulted in the generation of a total of 6960 ( 2 × 2 × 15 × 116 ) simple linear models generated internally. However, only the top three, totaling 180 ( 2 × 2 × 15 × 3 ), models within each category were utilized for further analysis. For the sum of the coefficient of determination ( R 2 ) of the top three 180 models ( Σ R 2 ), the largest value serves as an indicator for selecting the most suitable combination of input variables (Table 4).
Therefore, initially the color inputs, namely the grid size, were varied to 3 × 3 , 5 × 5 , 7 × 7 , 10 × 10 , 12 × 12 , and 15 × 15 , while keeping the other inputs at their starting values. It was observed that the Σ R 2 values attained their peak at 7 × 7 and subsequently declined. Now keeping 7 × 7 as a constant input for color inputs, the waveness and roughness Gaussian filter radius were gradually varied to 4, 5, 10, 15, 20, 25, 30, 35, 40, and 50 pixels. The effects of these variations were recorded, and the 35 pixels yielded the best outcome (Table 4). Keeping these two inputs constant, the GLCM step size was varied gradually to 1, 2, 3, 5, and 10 pixels, which revealed that the 1 pixel step size was the best. Proceeding similarly, a combination of 4 GLCM step direction angles (0°, 90°, 180°, and 270°) was studied, and it was found that 270° was the best (Table 4). To further validate the effectiveness of the color grid size in conjunction with the selected input combination, the models were subjected to testing for 5 × 5 , 7 × 7 , 10 × 10 , a step above and below the considered 7 × 7 , which reaffirmed their effectiveness with the highest value of Σ R 2 of 76.437 (Table 4). Consequently, the best plugin input combination that is used for further analysis is as follows: a color grid size of 7 × 7 , a waveness and roughness cutoff of 35 pixels, a GLCM step size of 1 pixel, and a GLCM step direction of 270°. The CPU processing time for this combination was 2.87 ± 0.79   s / image (HP ZBook Fury 16-inch G11 Mobile Workstation PC; Intel® Core™ i9 13950HX 5.5 GHz, 32 GB RAM, 1 TB storage).
The results underline the importance of input selection in modeling with multiple variables. The optimization strategy—based on cumulative Σ R 2 values—proved effective in isolating input combinations that maximized the predictive performance across the multiple dependent variables. This approach is particularly valuable when dealing with complex systems where interactions among variables are non-linear or context dependent. This methodology offers a framework for model refinement. Thus, by iteratively evaluating input combinations, one could converge on combinations that yield the most reliable and generalizable models. However, caution should be exercised when using Σ R 2 as a heuristic for input selection, as it may not fully capture nuances such as overfitting, multicollinearity, or model interpretability. Therefore, further research on additional criteria such as adjusted Σ R 2 , or cross-validation error, etc., is necessary. Additionally, applying a larger dataset (with more varieties and maturity stages) and non-linear modeling techniques may enhance the predictive capacity where linear assumptions are insufficient.

3.4. Visualizing Correlation—Correlation Diagram and Absolute Correlation Coefficients

Pearson’s correlation coefficient (r) quantifies the association of strength and direction of 15 wet chemistry physicochemical properties with the 116 image-based parameters, and was evaluated using the input combination ( 7 × 7 ; 35 pix; 1 pix; 270°; Section 3.3) and visualized in the form of a correlation diagram (Figure 7). The correlations were calculated separately for (i) all sample types combined, (ii) slice images only, and (iii) whole fruit images only to determine the effect of the sample type category during model development. Table 5 presents the sum and mean absolute correlation coefficient ( r ¯ ) values, which facilitate the comparison of the performance of sample types and groups of parameters. The correlation results (Figure 7 and Table 5) indicate that combining both sample types is not an effective strategy ( r ¯ = 0.225 ). Although keeping the types separate yielded a better correlation, the slice ( r ¯ = 0.288 ) performance was superior to that of the entire sample ( r ¯ = 0.274 ).

3.4.1. Interrelationships Among Wet Chemistry Parameters

Overall, the correlation diagram (Figure 7; separated by a solid horizontal line) shows better intercorrelation among wet chemistry variables compared to the image-based parameters, evident from the darker-colored tiles with wet chemistry. This observation is corroborated by the absolute mean r ¯ of 0.433, which is greater than the range of 0.128–0.324 observed for the image-based parameters (Table 5). Strong interrelationships were observed among key physicochemical properties, specifically between crude fat, vitamin C, antioxidants, and total phenol content; carbohydrates and total sugars; carotenoids and total sugars; and tannins and antioxidants. These robust associations also suggest, apart from serving as direct predictors, that certain physicochemical properties may serve as indirect predictors for others, which can be beneficial in gaining insights into other parameters. For example, if vitamin C shows poor correlation with texture features but is strongly linked to crude fat, then crude fat, if reliably predicted from image-derived parameters, could act as a proxy for estimating vitamin C levels. Despite these specific advantages of the direct prediction of physicochemical properties from others, the wet chemistry determination is time consuming and highly expensive.
This strategy of leveraging internal biochemical correlations to enhance predictive modeling aligns with previous studies. Hasanzadeh et al. [66] demonstrated that hyperspectral imaging combined with regression analysis could accurately estimate apple quality traits such as pH, soluble solids, and titratable acidity, with correlation coefficients exceeding 0.99. Similarly, Paulo et al. [67] demonstrated that visual descriptors extracted from fruit images captured in uncontrolled environments could be mapped to physicochemical parameters using non-linear regression models, thereby effectively estimating fruit maturity.

3.4.2. Correlation Between Wet Chemistry and Image-Based Parameters

The five groups of image-based parameters collectively had a lower correlation with wet chemistry compared to the intercorrelation of wet chemistry. While all sample images of image-based parameters were consistently low, the slice and whole samples were the best for certain groups (Table 5). For example, slices exhibited better correlation with waveness and roughness, LBP, and color indices, whereas wholes exhibited better correlation with the color grid and GLCM.
Among the image-based parameters group methodologies, slice sample images exhibited the highest correlation with surface morphology (waveness and roughness; r ¯ = 0.324 ), followed by color indices ( r ¯ = 0.292 ), and whole samples color grid ( r ¯ = 0.267 ), followed by GCLM ( r ¯ = 0.234 ) and LBP ( r ¯ = 0.206 ) were the best (Table 5). It is interesting to note that the color indices, which are easy to obtain, as they are derived solely from R, G, and B values, also exhibit favorable correlation for slice and whole samples ( 0.272 r ¯ 0.292 ). This observation can be visualized in the correlation diagram (Figure 7) with methodology groups separated by dotted horizontal lines and darker color patches. Correlations derived from the whole fruit sample images indicated that, although associations were generally stronger than the combined dataset, they were consistently weaker than those observed for slice images. Notably, several wet chemistry parameters, including vitamin C, TSS, total sugars, crude fat, and moisture content, exhibited moderately higher correlations with GLCM contrast and roughness-based texture metrics.
Overall, the r ¯ for image-based parameters for slice, whole, and combined samples were 0.268, 0.252, and 0.197, respectively, which is lower than the intercorrelation of wet chemistry value of 0.443 as observed previously. Collectively, these findings reinforce the potential of image-based parameters employing computer vision analysis not only for the direct prediction of physicochemical properties but also for inferring quality parameters through correlated biochemical proxies. Thus, it expands the scope for the development of non-destructive quality assessment tools in postharvest and processing operations.

3.5. Correlation Analysis of Wet Chemistry and Image-Based Parameters—Value Ranges

The range of r for all the 116 image-based parameters (independent variables) against the 15 wet chemistry (dependent variables) for the finalized input combination, grouped by sample types (slice and whole) is presented in Figure 8. These categories are distinct from the image viewpoint, and their correlations are treated separately (Table 5). For slice samples, r ranged from 0.86 to 0.87, indicating both strong negative and positive associations. In contrast, for whole samples, the range was slightly lower and narrower, from r = 0.72 to 0.73, suggesting that the independent parameters computed based on slice images exhibited enhanced sensitivity to the image-based parameters.
Contrasting the association of wet chemistry parameters with image-based parameters between sample types (slice and whole), the parameters derived from slice images revealed stronger correlations for most parameters and in “internal quality” traits, particularly those related to biochemical composition (e.g., flavonoids, crude fat, proteins, total phenolics content, acidity, and moisture content). This supports our hypothesis that exposing internal tissues (through slice images) enhances the accuracy of indirect measurements, especially for parameters not uniformly distributed across the fruit. Whole fruit images favored parameters linked to “external features” (e.g., antioxidant activity, proteins, phenolics, crude fat, and vitamin C) but underperformed for most internal chemical traits. The reduced correlation range suggests that whole fruit assessments may require more advanced imaging or hybrid sensing techniques to improve predictive power. The observed correlation patterns underscore the potential of using image-based data to predict wet chemistry traits, especially with images of sliced fruits. For improved correlation, a larger sample size for training may account for better results.
For slice sample images, the most strongly correlated dependent parameters ( r > 0.7 ) estimated were flavonoids (0.866), crude fat ( 0.859 ), proteins (0.772), total phenolic content ( 0.768 ), acidity (0.749), moisture content (0.736), and TSS ( 0.705 ). The least correlated parameters were pH, tannins, reducing sugars, and total sugars, with coefficients of 0.476 , 0.535 , 0.553, and 0.566, respectively. For whole fruit sample images, the most strongly correlated dependent parameters were antioxidant activity ( r = 0.729 ), proteins ( 0.720 ), and total phenolics (0.706). The moderately correlated parameters were crude fat (0.696), vitamin C (0.691), TSS (0.666), moisture content ( 0.662 ), and pH (0.623). The least correlated variables were total carbohydrates, tannins, total sugars, reducing sugars, and carotenoids, with correlation coefficients of 0.450 , 0.468 , 0.530 , 0.587 , and 0.587 , respectively.
Overall, among the wet chemistry parameters, flavonoids and crude fat consistently exhibited high correlation ( r > 0.8 ) with the plugin image-based parameters. Proteins, total phenolic content, acidity, moisture content, antioxidant activity, carotenoids, and TSS demonstrated moderate to strong correlations ( r = 0.6 to 0.77) with texture and color-based parameters, especially where internal tissue exposure improves detection. Total carbohydrates, tannins, total sugars, and reducing sugars showed weak correlations ( r < 0.6 ) across most independent variables, suggesting limited predictability from image-based non-destructive measurements. Furthermore, the correlation coefficients between dependent and independent variables were consistently higher for slice-derived independent parameters compared to those obtained from whole-fruit images. Focusing on analyzing the dynamics of the maturity stages, varieties, and collecting more samples could contribute to improving the association between the independent and dependent parameters.

3.6. Top Ranking Image-Based Parameters Correlation Coefficients

To identify the most influential image-based parameters predictors for each wet chemistry physicochemical properties trait, they were ranked based on their Pearson correlation coefficients, and the top ten features of wet chemistry categorized by sample type (slice and whole), are presented in Table 6. This ranking framework enabled the identification of dominant image patterns and feature categories associated with specific chemical and physical attributes of the fruits. In general, slice and whole samples had a distinctly different set of top-ranking parameters for the same fruit physicochemical properties due to differences in color, texture, and morphology.
Among the 150 top 10 ranked image-based parameters for slice samples (15 wet chemistry × 10 ranks; Table 6), the surface methodology covered 52.0% followed by color indices (25.3%), color grid (14.0%), GLCM (4.7%), and LBP (4.0%); and for whole samples surface methodology covered 38.0% followed by color grid (32.6%), color indices (14.7%), LBP (8.7%), and GLCM (6.0%). Across the 15 physicochemical properties, surface morphology descriptors (waveness and roughness metrics) consistently ranked among the top predictors, followed by color-based indices or grids, and finally GLCM and LGB methodologies with similarly low performance. Among these top-ranked parameters, the color indices developed in this study (17; Table 2) performed well. The total number of color indices featured in the top-ranking 150 parameters was 22 for each slice and whole sample (Table 6). Out of these, the number of indices developed in this study was 9 for the slice and 3 for the whole samples. This, out of the total 150, was 6.0% (slice) and 2.0% (whole), and out of color-based indices was 40.9% (slice); 13.6% (whole), indicating the significant performance of the indices developed in this study.
Color-based indices methodology, which is easy to derive, also showed strong associations with biochemical properties such as phenolics and carotenoids, indicating that reflectance captured in RGB images can serve as a proxy for phenolics and carotenoids profiling. Color grid methodology parameters (e.g., mean, median, STD of blue, green, and blue channels) were particularly informative for sugar-related traits such as TSS, total sugars, and reducing sugars. Acidity and pH had dominating associations with surface morphology-based parameters and color-based indices. These results agree with earlier studies [68,69] that have demonstrated a direct correlation between color values and textural properties to various physicochemical properties such as acidity, pH, and phenolics.

3.7. Wet Chemistry Physicochemical Properties Prediction Model Development

The Table 7 presents the top three models, assessed based on the model performance parameter R 2 , for each wet chemistry physicochemical properties parameter across all sample types and maturity stages (total = 180 models; Section 3.3), from the complete set of 116 image-based parameters independent variables. It is important to note that image-based parameters featuring in the top models (based on R 2 ; Table 7) may differ from the top ten highly correlated image-based parameters (based on r; Table 6). This is because there are differences in the linear model fitting and Pearson’s correlation evaluation. While the correlation coefficient evaluates the strength of association between the independent and dependent variables, the model development provides the actual tool used for making the final prediction. Therefore, the latter (Table 7) is more suitable for the objective of the study.
Across all the wet chemistry physicochemical properties parameters, the R 2 values, which represent the models’ performance, ranged from as high as 0.936 to as low as 0.167 for both maturity stages, indicating varying degrees of predictive strength depending on the dependent parameter, sample type, and maturity stage categories. For slice samples, the mean R 2 values for stage 817, stage 819, and both are 0.42 ± 0.18 , 0.48 ± 0.18 , and 0.45 ± 0.17 ; and for whole samples are 0.37 ± 0.12 , 0.35 ± 0.12 , and 0.36 ± 0.12 , respectively. The mean R 2 for the samples and stages combined is 0.408 ± 0.149 . Considering the better-performing models ( R 2 > 0.5 ), it was found that slices covered 24.4% with stage 817 and 51.1% with stage 819, and for wholes 6.7% and 13.3% coverage, respectively. These results indicate that the slice images and ultimate maturity stage of 819 are the most effective for evaluating the wet chemistry, hence being recommended for the computer vision analysis of physicochemical properties. This combination will also yield 24.4% of high-performance models ( R 2 > 0.6 ). When whole samples are required, the ultimate stage of 819 should be chosen again.
The models of slice and the maturity stage 819 combination ( R 2 > 0.5 ) will cover 11 out of 15 (73.3%) wet chemistry physicochemical properties. These wet chemistry properties that can be successfully modeled are as follows: acidity, antioxidant, carbohydrates, carotenoids, crude fat, flavonoids, pH, phenolics, proteins, tannins, and vitamin C. The wet chemistry properties covered by whole samples were only five (acidity, carbohydrates, carotenoids, proteins, and vitamin C), which is a subset of the “slice + 819” combination. The “slice + 817” combination covered again five wet chemistry properties (carotenoids, crude fat, pH, TSS and vitamin C), where the property TSS was the unique addition to the list of slice and 819 combination. From the effort and complexity viewpoint, TSS is a relatively simple and quick measurement using a refractometer. Therefore, the recommendation of the “slice + 819” combination, along with more sampling closer to the ultimate maturity stage, will effectively deliver the wet chemistry physicochemical properties through the simpler image-based computer vision methodology developed.
Several studies have validated the accuracy of predictive models based on surface color and morphological parameters. These studies have demonstrated a strong association between specific phenotypes and the color or surface morphological properties of fruits and foods as observed in the present research. Carotenoids are important contributors to fruit coloration, imparting hues in the yellow to red range, and also enhancing fruit nutritional value [70]. Consequently, the surface color characteristics of cashew apple predict well the carotenoid content ( R 2 = 0.930 –0.936; Table 7). Phenolic compounds are known to play a principal role in the coloration of fruits [71], substantiating the observed next strong correlation between the phenolics of the cashew apples and their surface color properties ( R 2 = 0.633 –0.686). Tannins contribute significantly to the sensory and nutritional characteristics of fruits, nuts, and vegetables, influencing flavor, color, and nutrient absorption [72], demonstrating the best prediction of tannin content by a color-based index VEG ( R 2 = 0.678 ). The presence of certain genes (proteins) has been expressed as a color change during maturation and in freshly cut fruit slices, as cashew apples whole and slices also change colors quickly due to their highly perishable nature [73], the proteins could influence the appearance of the slices as evidenced by the good prediction by the models using Ov-kuB ( R 2 = 0.674 ). Expression of color changes in fruits has been monitored by pH-based sensors [74], which supports the observed good prediction of acidity ( R 2 = 0.651 –0.669) and pH ( R 2 = 0.606 –0.612) of the cashew apple slices by color-based WI, RBDR, and Ov-medG parameters. Surface characteristics from infrared hyperspectral images have been used for the rapid detection of vitamin C content in strawberries [75], which corroborates the good prediction of vitamin C in slices as well as whole cashew fruits ( R 2 = 0.591 –0.625) based on roughness parameters.
However, when physicochemical properties are more influenced by intrinsic features, the models of the present machine vision methodology derived from the surface characteristics cannot produce acceptable performance. The total and reducing sugars, crude fat, and moisture content have been estimated by quantitative methods, such as light refraction and gravimetry, using the extracted samples that are inherently intrinsic [76,77]. Their influence on the cashew fruits is more intrinsic than external, which could be the reason for their below-average prediction by the models ( R 2 = 0.318 –0.477; Table 7).

3.8. Wet Chemistry Physicochemical Properties Prediction Models Validation

A systematic validation strategy is employed using approximately 25% percent of the reserved (test) data to assess the performance of developed models (based on 104 slices + 60 wholes = 164 images; Figure 6 and Section 3.7). The testing batch (unseen) of images of cashew apple samples (6 varieties; 2 sample types; 2 sample stages; 2–3 slice replications/variety; 2 whole replications/variety; 30 slices + 24 wholes; 54 total) are processed through the ImageJ plugin with the finalized inputs to generate the image-based parameters, and the wet chemistry physicochemical properties are predicted using the topmost model (Table 7). Predicted wet chemistry parameters are compared with their actual values (Table 3), and the deviations expressed in terms of ratios (predicted/actual) are visualized as a paneled dodged bar graph across wet chemistry parameters, with maturity stages and cashew apple varieties (Figure 9).
Ideally, a ratio close to 1.0 indicates high predictive fidelity, while deviations highlight (<1.0) under- or (>1.0) over-estimations (horizontal dotted lines in Figure 9). Extreme values <0.75 or >1.25 depict poor prediction accuracy or model bias. For the 360 predictions (15 wet chemistry × 2 stages × 2 types × 6 varieties), the mean ratio for slice ( 1.081 ± 0.553 ) and whole ( 1.079 ± 0.556 ) were comparable, indicating similar prediction performance. Within the acceptable prediction ratio range of 0.75 to 1.25, a 66.7% for slice and 60.6% for whole samples demonstrate that a predominant number of predictions fall in this range, and slices perform better than whole samples. The under- and over-estimations by the models beyond this range are 14.4% and 18.9% for slice and 18.3% and 21.1% for whole samples, respectively.
Upon comparing the effects of maturity stages, the mean ratio of stage 819 (slice: 1.069; whole: 1.029) is closer to the ideal ratio of 1.0 than that of stage 817 (slice: 1.092; whole: 1.129) from 90 predictions each (Figure 9). This prediction result supports the idea that the ultimate maturity stage of 819 will be more favorable for the methodology, which is already anticipated from the analysis of the developed prediction models (Table 7).
Within the high prediction range of the ratio between 0.85 and 1.15, the wet chemistry physicochemical properties of moisture content and TSS have a perfect count of 12 for both maturity stages and sample types. The subsequent five wet chemistry parameters are carbohydrates, pH, proteins, and total sugars, featuring in both stages and sample types (6–11 out of 12). This result suggests a strong model generalization for these traits. The lowest prediction performance is exhibited by phenolics, antioxidants, crude fat, flavonoids, tannins, and vitamin C (2–3 out of 12), both for slices and wholes. It is interesting to note that carotenoids can be predicted using slices (5/12) but cannot be predicted by whole samples (1/12) based on the analysis results.

3.9. Study’s Limitations and Suggestions for Future Work

Several limitations were identified and will be encountered by users when reproducing this research in terms of experimentation and computer vision methodology development. Obtaining a suite of wet chemistry data for several varieties of cashew apple is expensive, time intensive, and sometimes not affordable. The study utilized only six varieties at two maturity stages, resulting in the utilization of only six data points for each model developed (180 total). Therefore, this should be considered when applying the models. The study utilized only natural lighting during the image data acquisition, and this created some variation and affected the prediction accuracy. Developing new methodologies beyond those presented in the study requires a high level of computer vision and image analysis programming skills.
In fact, these limitations present opportunities and guidance for future research in this knowledge domain. Expanding the dataset can assist in identifying and correcting outliers prior to training, thereby reducing the risk of skewed predictions. A larger dataset could also support more robust calibration across different varieties and development stages, ultimately enhancing the models’ performance. Incorporating non-linear modeling approaches, along with increased data volume, may further enhance models’ predictive performance. Color calibration of the input images will address and mitigate the variations induced by ambient lighting, potentially improving the computer vision analysis methodology. The current approach can be further optimized by integrating additional analytical techniques (e.g., unexplored textural features, color indices, and other surface morphologies) as well as other relevant wet chemistry properties into the computer vision ImageJ plugin.
The developed methodology can serve as a rapid tool for predicting physicochemical properties and have application in breeding programs for the assessment of physicochemical properties of a high volume of fruits and vegetables [78], the labor-intensive monitoring of large-scale orchards [79], and the advanced modeling and assessment of the postharvest quality of horticultural produce [80,81]. The methodology can be extended to the evaluation of the wet chemistry physicochemical properties of other horticultural crops. As the results revealed that the optimal maturity stage is 819, it is feasible to develop an intermediate maturity stage, such as 818 (between stages 817 and 819), to expand the time window available for harvest, measurement, and analysis. Also, integration of the methodology with machine learning (ML) and artificial intelligence (AI) could enable universal, in vitro tools for rapid fruit analysis and postharvest automation. These aspects will constitute a great potential for future research and can contribute to the enhancement of the overall performance of the computer vision methodology for indirect wet chemistry physicochemical properties measurements.

4. Conclusions

This study successfully developed a computer vision-based methodology for assessing 15 physicochemical and biochemical properties of cashew apples, which are conventionally evaluated through wet chemistry protocols, across six diverse varieties and two maturity stages from the images of whole samples and slices. From the wet chemistry evaluation of physicochemical properties, significant changes were observed between maturity stages 817 and 819 across all varieties, which reflect variety-dependent ripening dynamics and support their use for quality assessment. The alternative methodology of a developed user-coded and user-friendly computer vision ImageJ plugin, employing five image-based methodologies (color grid, surface morphology, GLCM, LBP, and color indices), analyzed the input images of slices and whole cashew apple samples, in batch processing mode, and generated 144 image-based parameters for further analysis (correlations, linear models development, and predictions). Optimal input configuration for the plugin was color grid: 7 × 7; waveness and roughness cutoff radius: 35 pixels; GLCM sampling size: 1 pixel; and GLCM angle: 270°. The plugin execution was rapid, clocking a CPU time of about 2.87 ± 0.79   s / image (Intel® Core™ i9 13950HX 5.5 GHz, 32 GB RAM).
The correlation between the image-based parameters (independent) with wet chemistry (dependent) traits, exhibited stronger correlations for slice-based image variables ( r = 0.86 to 0.87 ; r ¯ = 0.268 ) compared to that of whole fruit images ( r = 0.72 to 0.73 ; r ¯ = 0.252 ), indicating enhanced sensitivity to internal quality features. Strong intercorrelations among wet chemistry properties ( r = 0.85 to 0.96 ; r ¯ = 0.443 ) indicate that reliably predicted parameters can effectively serve as proxies for those with weaker image associations, thereby reinforcing the potential of image-based analysis and biochemical surrogates for non-destructive quality assessment. Surface morphology (52% and 38%), color indices (25.3% and 14.7%), and color grid (14% and 32.7%) features consistently ranked as top predictors across physicochemical properties, with slice and whole fruit images revealing distinct yet complementary patterns. The color indices developed in the study contributed significantly, featuring among the top-ranking 150 parameters (6.0% and 2.0%), and among the 22 color-based indices (40.9% and 13.6%). This ranking framework highlights the potential of easy-to-evaluate RGB descriptors for their non-destructive estimation.
The developed models for the slice at the maturity stage 819 ( R 2 > 0.5 ) covered 73.3% (11 out of 15) wet chemistry physicochemical properties and successfully modeled acidity, antioxidant, carbohydrates, carotenoids, crude fat, flavonoids, pH, phenolics, proteins, tannins, and vitamin C. In contrast, the wet chemistry properties covered by whole samples were 33.3% (5 out of 15; acidity, carbohydrates, carotenoids, proteins, and vitamin C). The combination “slice + 819”, along with more sampling closer to the ultimate maturity stage, is recommended to effectively deliver the wet chemistry properties through the developed simpler image-based computer vision methodology.
From the prediction and validation of the developed model using the unseen image data, 66.7% for slice and 60.6% for whole samples belonged to the prediction ratio range of 0.75 to 1.25, demonstrating model validity. Models of the slice samples performed better than the whole samples. The wet chemistry properties predicted by the models in the decreasing order of performance were moisture content, TSS, carbohydrates, pH, proteins, and total sugars. Those with the lowest prediction performance were phenolics, antioxidants, crude fat, flavonoids, tannins, and vitamin C. Suggestions for future research include a larger dataset across different varieties and maturity stages; exploring non-linear modeling approaches; performing color calibration of the input images; incorporating additional wet chemistry and analytical techniques; developing an intermediate maturity stage; extending this methodology to other horticultural crops; and developing robust practical tools specifically for breeding programs, orchard-scale monitoring, and high-volume postharvest quality measurement, and integrating them with modern ML and AI techniques.

Author Contributions

Conceptualization, M.J.G., C.I., and J.N.; methodology, M.J.G., C.I., and J.N.; formal analysis, M.J.G., C.I., J.N., A.J., H.T., S.S., A.M., J.D.A., and P.K.; investigation, M.J.G., C.I., J.N. A.J., H.T., S.S., A.M., J.D.A., and P.K.; resources, M.J.G., J.N., J.D.A., P.K., and C.I.; data curation, M.J.G., and C.I.; writing—original draft preparation, M.J.G., J.N., and C.I.; writing—review and editing, M.J.G., J.N., C.I., A.J., H.T., S.S., A.M., P.K., and J.D.A. visualization, M.J.G., C.I., and J.N.; supervision, M.J.G., J.N., and C.I.; project administration, J.N., J.D.A., M.J.G., and P.K.; funding acquisition, J.N., and J.D.A. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Indian Council for Agricultural Research, Project ID. No: IXX20122, Project title: Enhancing the shelf life of cashew apple to increase the market potential and development of functional foods.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The image data used in the manuscript is available from Mendeley Data, V1, at https://doi.org/10.17632/67kn43rj5c.1 (accessed on 21 November 2025). This dataset is cited in the manuscript as [41]. This dataset contains three files, namely, (1) Raw input images, (2) Wet chemistry data of cashew apples, and (3) R script used for analysis in the study.

Acknowledgments

The facilities extended by the NGPRL, USDA-ARS, Mandan, ND, USA, during this study are gratefully acknowledged by the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. Physicochemical Properties

Appendix A.1.1. Moisture, Protein, and Fat

Moisture content in cashew apples was determined by following the ISO 712:1998 method with a gravimetric principle using a hot air oven. The protein was estimated by following the Kjeldahl method as described by the AOAC method (920.151) [82], where a Kjeldahl apparatus (DX VATS, Pelican, Kelplusclassic, 2015) was used for digestion, distillation, and titration. For crude fat estimation, a Soxhlet apparatus (KSEA-1B, KEMI) was used, and samples were subjected to fat extraction using n-hexane. The extraction was carried out for approximately 3 h, and the percentage of fat was calculated as per the AOAC method (920.39).

Appendix A.1.2. Total Soluble Solids, pH, and Titratable Acidity Determination

Total soluble solids (TSSs) were analyzed based on the AOAC method (932.12) using a portable hand refractometer corrected for 20 °C. The AOAC method (981.12) was also used for determining the pH of the fruit. The AOAC method (942.15) was followed for assessing the titratable acidity (TA) of cashew apples, and the results were expressed as percentage acidity in terms of citric acid.

Appendix A.1.3. Total Carbohydrate

The total carbohydrate was estimated spectrophotometrically using the anthrone method [83], where the sample was hydrolyzed using 2.5N HCl for 3 h in a boiling water bath. After cooling to room temperature, it was neutralized with Na2CO3 until the effervescence ceased. The solution was centrifuged after adding 10 mL of distilled water. The supernatant was collected, and the volume was made up to 100 mL with distilled water. Total carbohydrate was analyzed by adding 4 mL of anthrone reagent to a mixture of 0.1 mL aliquot and 0.9 mL distilled water. The mixture was incubated in a boiling water bath for 8 min and cooled to room temperature. The green to dark green color developed was measured using a spectrophotometer at 630 nm. With spectrophotometry methods, the sample values are measured indirectly with reference to a standard solution of known concentration. A standard curve was prepared by plotting the absorbance values of standard glucose solutions. The concentration of total carbohydrate is calculated from the following equation:
Total carbohydrate , g / 100 g = Absorbance of sample y - intercept of standard curve Slope of standard curve

Appendix A.1.4. Total Sugar

Total sugar content in cashew apple samples was determined using the Hedge and Hofreiter [84] spectrophotometric method. Sugars were extracted by mixing the samples with 80% hot ethanol and were refluxed for 1–2 h. The mixture was filtered, and the ethanol was removed using a rotary evaporator. The volume of the filtrate was made up to 100 mL. An aliquot (0.1 mL) was mixed with 0.9 mL distilled water and 5 mL anthrone reagent. The solution was then incubated at 100 °C for 11 min and the absorbance was read at 630 nm. D-glucose was used as a standard, and the results were expressed as a percentage on a fresh weight basis.

Appendix A.1.5. Reducing Sugar

Reducing sugar was estimated using the dinitro salicylic acid (DNSA) method [85]. Briefly reducing sugars were extracted from the samples as described for total sugars using 80% hot ethanol. A 3 mL DNSA reagent was added to 1 mL aliquot and incubated in a boiling water bath for 16 min. The mixture was then cooled to room temperature, and the volume was filled to 25 mL with distilled water. The absorbance of the sample was read at 575 nm using a spectrophotometer. The reducing sugar content was calculated from the calibration curve of standard D-glucose, and the results were expressed as a percentage on a wet basis.

Appendix A.1.6. Vitamin C (Ascorbic Acid) Estimation

Vitamin C was determined using the 2,6-dichlorophenolindophenol (DCPIP) method (AOAC Method, 967.21). The samples were extracted using 3% metaphosphoric acid to obtain a solution of vitamin C. The extract was titrated with a standardized DCPIP solution until a pale pink color appeared. The results were expressed as mg/100 g of ascorbic acid on a wet basis.

Appendix A.2. Biochemical Properties

For the estimation of biochemical properties, polyphenolics were extracted as described by Ramful et al. [86] with some modifications. Approximately 1 g of pulp was extracted using 10 mL of 80% ethanol. The extract was sonicated for 30 min at 25 °C and then centrifuged at 10,000 × g for 10 min at 4 °C and the extract was decanted. The residue was extracted twice following the aforementioned procedure. The combined extracts were adjusted to 30 mL and were filtered with a 0.4 μ m nylon filter. The filtered extract was stored at −20 °C and used for the determination of total phenol, total flavonoids, antioxidant activity, and tannins.

Appendix A.2.1. Total Phenolic Content (TPC)

TPC was estimated spectrophotometrically using the Folin–Ciocalteu reagent (FCR) [87]. The aliquot (0.1 mL) was mixed with 2.9 mL of deionized water, 0.5 mL of Folin–Ciocalteu reagent, and 2.0 mL of 20% Na2CO3 solution. The mixture was allowed to stand for 20 min, and absorption was measured at 750 nm against a reagent blank in a UV-Vis spectrophotometer. Results were expressed as gallic acid equivalent (mg GAE/100 g fresh weight [FW]).

Appendix A.2.2. Total Flavonoids Content (TFC)

TFC in cashew apple was measured spectrophotometrically using the method of Zhishen et al. [88]. A known volume (0.3 mL) of sample extract in acidified ethanol (2.1 mL) was taken, and mixed with 0.3 mL of 5% w/v NaNO2. After 5 min, 0.3 mL of 10% w/v AlCl3 was added to it followed by the addition of 2 mL of NaOH (1M) after 6 min. The solution was thoroughly mixed, resulting in a pink to yellow color. The absorbance was read at 510 nm against a reagent blank using 80% aqueous ethanol instead of the sample in a UV-Vis spectrophotometer. The results were expressed as quercetin equivalent ( μ g QE/g, wet basis).

Appendix A.2.3. Antioxidant Activity Using DPPH Assay

DPPH (2,2-diphenyl-1-picrylhydrazyl) scavenging assay was performed as described by Brand-Williams et al. [89]. A 3.9 mL aliquot of a 0.0634 m mol / L of DPPH solution in methanol (70%) was added to 0.1 mL of each extract and vigorously shaken. Change in the absorbance of the sample extract was measured at 517 nm for 30 min. The percentage inhibition of DPPH by the test sample and known solutions was calculated by the following formula, and results were expressed in terms of Trolox equivalent per gram fresh weight ( μ mol TE/g FW):
Inhibition , % = A 0 A A 0 × 100
where, A 0 was the initial absorbance of DPPH solution at 517 n m and A was the final absorbance of the sample extract added with DPPH solution. Methanol (70%) was used as a blank.

Appendix A.2.4. Tannin Content

Tannin content was measured using Folin’s Denis method [90]. An ethanolic extract (1 mL) was added to 5 mL of Folin-Denis reagent and 10 mL of saturated sodium carbonate solution. The absorbance of the mixture was measured at 700 nm after 30 min incubation in the dark. Tannic acid was used as the standard, and distilled water as the blank. The results were expressed as mg TaE (tannic acid equivalent)/100 g on a wet basis.

Appendix A.2.5. Carotene Estimation

Carotene in cashew apples was estimated by following the method described by Rodriguez [91]. Exactly 5 g of the sample was repeatedly extracted using acetone in a pestle and mortar until the residue became colorless. The acetone extract was collected in a separating funnel containing 20 mL of petroleum ether and mixed gently. Next, 20 mL of a 5% sodium sulfate solution was added, and the mixture was gently shaken. The top petroleum ether layer was separated, and the lower aqueous phase was extracted again with another 20 mL of petroleum ether. Both petroleum ether extracts were combined and rinsed with a small amount of distilled water. Then, 10 g of anhydrous sodium sulfate was added to the petroleum ether extract, which was allowed to stand for 30 min. The petroleum ether extract was decanted into a 100 mL volumetric flask, and the volume was filled with petroleum ether. The absorbance in the spectrophotometer was measured at 452 nm using petroleum ether as a blank. The contents of carotene and lycopene were expressed as mg per 100 g wet basis.

References

  1. Maciel, M.L.; Hansen, T.J.; Aldinger, S.B.; Labows, J.N. Flavor chemistry of cashew apple juice. J. Agric. Food Chem. 1986, 34, 923–927. [Google Scholar] [CrossRef]
  2. Das, I.; Arora, A. Post-harvest processing technology for cashew apple—A review. J. Food Eng. 2017, 194, 87–98. [Google Scholar] [CrossRef]
  3. Pushpalatha, P.; Sobhana, A.; Mini, C. Processing and product diversification in cashew apple. In Advances in Cashew Production Technology; Indian Council of Agricultural Research: New Delhi, India, 2015; pp. 109–116. [Google Scholar]
  4. Sri Lakshmi, C.; Prema, A. Resource-use efficiency in raw cashew nut production in Kerala, India. Asian J. Agric. Ext. Econ. Sociol. 2025, 43, 92–98. [Google Scholar]
  5. FAOSTAT. Food and Agriculture Organization of the United Nations—FAOSTAT. Crops and Livestock Database. 2025. Available online: https://www.fao.org/faostat/en/#data/QCL (accessed on 23 November 2025).
  6. Ganesh, S.; Kannan, M.; Jawaharlal, M. Cashew Industry—An outlook. Acta Hortic. 2015, 1080, 89–95. [Google Scholar] [CrossRef]
  7. Anoopkumar, A.; Gopinath, C.; Annadurai, S.; Abdullah, S.; Tarafdar, A.; Hazeena, S.H.; Rajasekharan, R.; Kuriakose, L.L.; Aneesh, E.M.; de Souza Vandenberghe, L.P.; et al. Biotechnological valorisation of cashew apple: Prospects and challenges in synthesising wide spectrum of products with market value. Bioresour. Technol. Rep. 2024, 25, 101742. [Google Scholar] [CrossRef]
  8. Gnagne, A.A.G.B.; Soro, D.; Ouattara, Y.; Koui, E.; Koffi, E. A literature review of cashew apple processing. Afr. J. Food Agric. Nutr. Dev. 2023, 23, 22452–22469. [Google Scholar] [CrossRef]
  9. Trevisan, M.; Pfundstein, B.; Haubner, R.; Würtele, G.; Spiegelhalder, B.; Bartsch, H.; Owen, R. Characterization of alkyl phenols in cashew (Anacardium occidentale) products and assay of their antioxidant capacity. Food Chem. Toxicol. 2006, 44, 188–197. [Google Scholar] [CrossRef]
  10. Honorato, T.L.; Rabelo, M.C.; Gonçalves, L.R.B.; Pinto, G.A.S.; Rodrigues, S. Fermentation of cashew apple juice to produce high added value products. World J. Microbiol. Biotechnol. 2007, 23, 1409–1415. [Google Scholar] [CrossRef]
  11. dos Santos Lima, F.C.; da Silva, F.L.H.; Gomes, J.P.; da Silva Neto, J.M. Chemical composition of the cashew apple bagasse and potential use for ethanol production. Adv. Chem. Eng. Sci. 2012, 2, 519–523. [Google Scholar] [CrossRef]
  12. Dedehou, E.; Dossou, J.; Anihouvi, V.; Soumanou, M.M. A review of cashew (Anacardium occidentale L.) apple: Effects of processing techniques, properties and quality of juice. Afr. J. Biotechnol. 2016, 15, 2637–2648. [Google Scholar]
  13. Pascal, A.D.C.; Virginie, G.; Diane, B.F.T.; Estelle, K.R.; Félicien, A.; Valentin, W.D.; Dominique, S.K.C. Nutritional profile and chemical composition of juices of two cashew apple’s varieties of Benin. Chem. J. 2018, 4, 91–96. [Google Scholar]
  14. Reina, L.J.C.; Durán-Aranguren, D.D.; Forero-Rojas, L.F.; Tarapuez-Viveros, L.F.; Durán-Sequeda, D.; Carazzone, C.; Sierra, R. Chemical composition and bioactive compounds of cashew (Anacardium occidentale) apple juice and bagasse from Colombian varieties. Heliyon 2022, 8, e09528. [Google Scholar] [CrossRef] [PubMed]
  15. Akyereko, Y.G.; Yeboah, G.B.; Wireko-Manu, F.D.; Alemawor, F.; Mills-Robertson, F.; Odoom, W. Nutritional value and health benefits of cashew apple. JSFA Rep. 2023, 3, 110–118. [Google Scholar] [CrossRef]
  16. Prasertsri, P.; Roengrit, T.; Kanpetta, Y.; Tong-Un, T.; Muchimapura, S.; Wattanathorn, J.; Leelayuwat, N. Cashew apple juice supplementation enhanced fat utilization during high-intensity exercise in trained and untrained men. J. Int. Soc. Sport. Nutr. 2013, 10, 13. [Google Scholar] [CrossRef]
  17. Aidoo, R.; Kwofie, E.M.; Ngadi, M.O. Circularity of cashew apples: Examining the product-process pathways, techno-functional, nutritional/phytomolecular qualities for food applications. ACS Food Sci. Technol. 2022, 2, 1051–1066. [Google Scholar] [CrossRef]
  18. da Silveira Vasconcelos, M.; Gomes-Rochette, N.F.; de Oliveira, M.L.M.; Nunes-Pinheiro, D.C.S.; Tome, A.R.; Maia de Sousa, F.Y.; Pinheiro, F.G.M.; Moura, C.F.H.; Miranda, M.R.A.; Mota, E.F.; et al. Anti-inflammatory and wound healing potential of cashew apple juice (Anacardium occidentale L.) in mice. Exp. Biol. Med. 2015, 240, 1648–1655. [Google Scholar] [CrossRef]
  19. Salehi, B.; Gültekin-Özgüven, M.; Kirkin, C.; Özçelik, B.; Morais-Braga, M.F.B.; Carneiro, J.N.P.; Bezerra, C.F.; Silva, T.G.d.; Coutinho, H.D.M.; Amina, B.; et al. Antioxidant, antimicrobial, and anticancer effects of anacardium plants: An ethnopharmacological perspective. Front. Endocrinol. 2020, 11, 295. [Google Scholar] [CrossRef]
  20. Attri, B. Effect of initial sugar concentration on the physico-chemical characteristics and sensory qualities of cashew apple wine. Nat. Prod. Radiance 2009, 8, 374–379. [Google Scholar]
  21. Kouassi, E.K.A.; Soro, Y.; Vaca-Garcia, C.; Yao, K.B. Chemical composition and specific lipids profile of the cashew apple bagasse. Rasayan J. Chem. 2018, 11, 386–391. [Google Scholar]
  22. Okpanachi, U.; Attah, S.; Shaahu, D. A comparative study between vitamins and amino acid profile of sun-dried red and yellow cashew pulp. Int. J. Anim. Biol. 2015, 1, 23. [Google Scholar]
  23. Runjala, S.; Kella, L. Cashew apple (Anacardium occidentale L.) therapeutic benefits, processing and product development: An overview. Pharma Innov. 2017, 6, 260. [Google Scholar]
  24. Sivaprakasam, S.; Chinnaiyan, U.; Varatharajan, S.; Uthayasuryan, E.; Chandrasekaran, M.; Paramasivam, S.; Ramasamy, P. Unveiling the physicochemical attributes and antioxidant potential of cashew apple juice varieties. Appl. Fruit Sci. 2025, 67, 74. [Google Scholar] [CrossRef]
  25. Kiatti, D.d.; Serrapica, F.; Musco, N.; Di Palo, R.; Calabrò, S. Potential use of tropical and subtropical fruits by-products in pig diet: In vitro two-step evaluation. Animals 2025, 15, 1454. [Google Scholar] [CrossRef] [PubMed]
  26. Preethi, P.; Rajkumar, A.; Shamsudheen, M.; Nayak, M. Prospects of Cashew Apple—A Compilation Report; Technical Bulletin; ICAR-Directorate of Cashew Research Puttur: Karnataka, India, 2019; Volume 2, pp. 1–28. [Google Scholar]
  27. Codjia, F.S.; Dabadé, D.S.; Agbobatinkpo, P.; Collombel, I.; Achir, N.; Azokpota, P.; Dossou, J. Fermented cashew apple beverages: Current state of knowledge and prospects. Beverages 2025, 11, 49. [Google Scholar] [CrossRef]
  28. Mtashobya, L.A.; Mgeni, S.T.; Emmanuel, J.K. Potential contributions of cashew apple juice to nutrition, medicine and bioethanol generation: An outlook. Nat. Prod. Commun. 2025, 20, 1934578X251357414. [Google Scholar] [CrossRef]
  29. Levy, H.M.; Rodrigues, T.H.S.; de Albuquerque, T.L.; Sant’Ana, H.B.; Feitosa, F.X.; Rocha, M.V.P. Biosynthesis of biolubricant catalyzed by lipase B from Candida antarctica immobilized on nanocomposite of cashew apple bagasse lignin. Renew. Energ. 2025, 243, 122452. [Google Scholar] [CrossRef]
  30. Osei, E.D.; Amotoe-Bondzie, A.; Ataa Pokuah, A.; Laar, W.S.; Afoakwah, N.A.; Ivanišová, E. Cashew apple pomace: Chemical composition and applications in functional food product development—A review. Food Sci. Nutr. 2025, 13, e70185. [Google Scholar] [CrossRef]
  31. Aluko, A.; Makule, E.; Kassim, N. Effect of clarification on physicochemical properties and nutrient retention of pressed and blended cashew apple juice. Food Sci. Nutr. 2023, 11, 1891–1903. [Google Scholar] [CrossRef]
  32. Ogunjobi, M.; Ogunwolu, S. Physicochemical and sensory properties of cassava flour biscuits supplemented with cashew apple powder. J. Food Technol. 2010, 8, 24–29. [Google Scholar]
  33. Mohammadzadeh, Z.; Shojaeiyan, A.; Mahfeli, M.; Ayyari, M.; Tohidfar, M.; Mokhtassi-Bidgoli, A.; Atighi, M.R. Predictive modeling of CIELAB color parameters in okra accessions based on phytochemical composition and antioxidant activity: A non-destructive ImageJ and RSM approach. LWT—Food Sci. Technol. 2025, 228, 118080. [Google Scholar] [CrossRef]
  34. Sun, Z.; Tian, H.; Hu, D.; Yang, J.; Xie, L.; Xu, H.; Ying, Y. Integrating deep learning and data fusion for enhanced oranges soluble solids content prediction using machine vision and Vis/NIR spectroscopy. Food Chem. 2025, 464, 141488. [Google Scholar] [CrossRef]
  35. Wang, X.; Feng, Y.; Wang, Y.; Zhu, H.; Song, D.; Shen, C.; Luo, Y. Enhancing optical non-destructive methods for food quality and safety assessments with machine learning techniques: A survey. J. Agric. Food Res. 2025, 19, 101734. [Google Scholar] [CrossRef]
  36. Jinorose, M.; Prachayawarakorn, S.; Soponronnarit, S. A novel image-analysis based approach to evaluate some physicochemical and cooking properties of rice kernels. J. Food Eng. 2014, 124, 184–190. [Google Scholar] [CrossRef]
  37. Wu, X.; Wu, K.; Xu, D.; Ren, D. Investigation into the biochemical properties of loquat fruits and their optical properties of hyperspectral imaging spectral correlation during the progression of postharvest fungal infection. Food Chem. 2025, 487, 144688. [Google Scholar] [CrossRef] [PubMed]
  38. Cheng, X.; Zhou, Y.; Huo, Z.; Li, R.; Xu, S.; Qi, H.; Zhu, J.; Wang, F.; Bi, Y. Predicting postharvest weight loss and texture changes in table grapes using fruit color and machine learning. J. Future Foods 2025, 12, 100703. [Google Scholar] [CrossRef]
  39. Sabzi, S.; Nadimi, M.; Abbaspour-Gilandeh, Y.; Paliwal, J. Non-destructive estimation of physicochemical properties and detection of ripeness level of apples using machine vision. Int. J. Fruit Sci. 2022, 22, 628–645. [Google Scholar] [CrossRef]
  40. Adiga, J.D.; Muralidhara, B.M.; Preethi, P.; Savadi, S. Phenological growth stages of the cashew tree (Anacardium occidentale L.) according to the extended BBCH scale. Ann. Appl. Biol. 2019, 175, 246–252. [Google Scholar] [CrossRef]
  41. Gupta, M.J.; Nishad, J.; Igathinathane, C. Assessment of physicochemical properties of cashew apple through computer vision—Original images, data, and R script. Mendeley Data, Version 1; Elsevier Inc.: London, UK, 2025. [Google Scholar] [CrossRef]
  42. Pawlus, P.; Reizer, R.; Wieczorowski, M. Functional importance of surface texture parameters. Materials 2021, 14, 5326. [Google Scholar] [CrossRef]
  43. Chinga, G. Waveness Roughness. In ImageJ Plugins; 2018. Available online: https://imagej.net/ij/plugins/waveness-roughness.html (accessed on 23 November 2025).
  44. Haralick, R.M.; Shanmugam, K. Computer classification of reservoir sandstones. IEEE Trans. Geosci. Electron. 1973, 11, 171–177. [Google Scholar] [CrossRef]
  45. Mohanaiah, P.; Sathyanarayana, P.; GuruKumar, L. Image texture feature extraction using GLCM approach. Int. J. Sci. Res. Publ. 2013, 3, 1–5. [Google Scholar]
  46. Song, Y.; Sa, J.; Luo, Y.; Zhang, Z. A comprehensively improved local binary pattern framework for texture classification. Multimed. Tools Appl. 2025, 84, 22043–22068. [Google Scholar] [CrossRef]
  47. Woebbecke, D.M.; Meyer, G.E.; Von Bargen, K.; Mortensen, D.A. Color indices for weed identification under various soil, residue, and lighting conditions. Trans. ASAE 1995, 38, 259–269. [Google Scholar] [CrossRef]
  48. Bendig, J.; Yu, K.; Aasen, H.; Bolten, A.; Bennertz, S.; Broscheit, J.; Gnyp, M.L.; Bareth, G. Combining UAV-based plant height from crop surface models, visible, and near infrared vegetation indices for biomass monitoring in barley. Int. J. Appl. Earth Obs. 2015, 39, 79–87. [Google Scholar] [CrossRef]
  49. Yuan, H.; Zhang, J.; Zhang, H.; Xu, W.; Peng, J.; Wang, X.; Chen, P.; Li, P.; Lu, F.; Yan, J.; et al. Monitoring autumn phenology in understory plants with a fine-resolution camera. Remote Sens. 2025, 17, 1025. [Google Scholar] [CrossRef]
  50. Li, J.; Wu, W.; Zhao, C.; Bai, X.; Dong, L.; Tan, Y.; Yusup, M.; Akelebai, G.; Dong, H.; Zhi, J. Effects of solar elevation angle on the visible light vegetation index of a cotton field when extracted from the UAV. Sci. Rep. 2025, 15, 18497. [Google Scholar] [CrossRef]
  51. Beniaich, A.; Silva, M.L.N.; Avalos, F.A.P.; de Menezes, M.D.; Cândido, B.M. Determination of vegetation cover index under different soil management systems of cover plants by using an unmanned aerial vehicle with an onboard digital photographic camera. Semin. Ciências Agrárias 2019, 40, 49–66. [Google Scholar] [CrossRef]
  52. Ju, Z.; Liu, C.; Yuan, Y.; Wang, Y.; Liu, G. Coloration potential, anthocyanin accumulation, and enzyme activity in fruit of commercial apple cultivars and their F1 progeny. Sci. Hortic. 1999, 79, 39–50. [Google Scholar] [CrossRef]
  53. Guijarro, M.; Pajares, G.; Riomoros, I.; Herrera, P.; Burgos-Artizzu, X.; Ribeiro, A. Automatic segmentation of relevant textures in agricultural images. Comput. Electron. Agr. 2011, 75, 75–83. [Google Scholar] [CrossRef]
  54. Banerjee, K.; Dutta, S.; Das, B.; Roy, D.; Sen, S.; Mandal, B.P.; Chatterjee, A. Crop type discrimination through low cost proximal RGB imaging and multivariate analysis. Arab. J. Geosci. 2025, 18, 31. [Google Scholar] [CrossRef]
  55. Meyer, G.E.; Neto, J.C. Verification of color vegetation indices for automated crop imaging applications. Comput. Electron. Agr. 2008, 63, 282–293. [Google Scholar] [CrossRef]
  56. Strati, V.; Albéri, M.; Barbagli, A.; Boncompagni, S.; Casoli, L.; Chiarelli, E.; Colla, R.; Colonna, T.; Elek, N.I.; Galli, G.; et al. Advancing grapevine disease detection through airborne imaging: A pilot study in Emilia-Romagna (Italy). Remote Sens. 2025, 17, 2465. [Google Scholar] [CrossRef]
  57. Jiménez-Muñoz, J.C.; Sobrino, J.A.; Plaza, A.; Guanter, L.; Moreno, J.; Martínez, P. Comparison between fractional vegetation cover retrievals from vegetation indices and spectral mixture analysis: Case study of PROBA/CHRIS data over an agricultural area. Sensors 2009, 9, 768–793. [Google Scholar] [CrossRef]
  58. Woebbecke, D.M.; Meyer, G.E.; Von Bargen, K.; Mortensen, D.A. Plant species identification, size, and enumeration using machine vision techniques on near-binary images. In Proceedings of the Optics in Agriculture and Forestry; SPIE: St Bellingham, WA, USA, 1993; Volume 1836, pp. 208–219. [Google Scholar]
  59. Hussain, S.; Teshome, F.T.; Tulu, B.B.; Awoke, G.W.; Hailegnaw, N.S.; Bayabil, H.K. Leaf area index (LAI) prediction using machine learning and UAV based vegetation indices. Eur. J. Agron. 2025, 168, 127557. [Google Scholar] [CrossRef]
  60. Ritchie, G.; Sullivan, D.; Vencill, W.; Bednarz, C.; Hook, J. Sensitivities of normalized difference vegetation index and a green/red ratio index to cotton ground cover fraction. Crop Sci. 2010, 50, 1000–1010. [Google Scholar] [CrossRef]
  61. Aynalem, H.M.; Righetti, T.L.; Reed, B.M. Non-destructive evaluation of in vitro-stored plants: A comparison of visual and image analysis. Vitr. Cell. Dev. Biol. Plant 2006, 42, 562–567. [Google Scholar] [CrossRef]
  62. Gamon, J.A.; Surfus, J.S. Assessing leaf pigment content and activity with a reflectometer. New Phytol. 1999, 143, 105–117. [Google Scholar] [CrossRef]
  63. Gitelson, A.A.; Kaufman, Y.J.; Stark, R.; Rundquist, D. Novel algorithms for remote estimation of vegetation fraction. Remote Sens. Environ. 2002, 80, 76–87. [Google Scholar] [CrossRef]
  64. Xiaoqin, W.; Miaomiao, W.; Shaoqiang, W.; Yundong, W. Extraction of vegetation information from visible unmanned aerial vehicle images. Trans. Chin. Soc. Agric. Eng. 2015, 31, 152–159. [Google Scholar]
  65. Song, H.; Asghari, M.; Zahedipour-Sheshglani, P.; Aljanabi, S.M.A.K.; Diao, E.; Xiang, X.; Qian, S.; Liang, X. Impact of exogenous cinnamic acid on some quality indices, phenolic compounds, main phenolic biosynthesis enzymes and senescence rate of strawberry fruit. Food Chem. 2025, 487, 144726. [Google Scholar] [CrossRef]
  66. Hasanzadeh, B.; Abbaspour-Gilandeh, Y.; Soltani-Nazarloo, A.; Hernández-Hernández, M.; Gallardo-Bernal, I.; Hernández-Hernández, J.L. Non-destructive detection of fruit quality parameters using hyperspectral imaging, multiple regression analysis and artificial intelligence. Horticulturae 2022, 8, e8070598. [Google Scholar] [CrossRef]
  67. Paulo, D.J.; Neves, C.M.; Wessel, D.F.; Neves, J.C. WildFruiP: Estimating fruit physicochemical parameters from images captured in the wild. In Proceedings of the Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications; Springer: Cham, Switzerland, 2023; Volume 14469, pp. 314–326. [Google Scholar]
  68. Andrés-Bello, A.; Barreto-Palacios, V.; García-Segovia, P.; Mir-Bel, J.; Martínez-Monzó, J. Effect of pH on color and texture of food products. Food Eng. Rev. 2013, 5, 158–170. [Google Scholar] [CrossRef]
  69. Akkaya, E. Characterization of physicochemical, colour and textural properties of Turkish type cheeses. Dicle Üniversitesi Vet. Fakültesi Derg. 2024, 17, 137–142. [Google Scholar] [CrossRef]
  70. Amaya, I.; Roldán-Guerra, F.J.; Ordóñez-Díaz, J.L.; Torreblanca, R.; Wagner, H.; Waurich, V.; Olbricht, K.; Moreno-Rojas, J.M.; Sánchez-Sevilla, J.F.; Castillejo, C. Differential expression of CCD4(4B) Drives Nat. Var. Fruit Carotenoid Content Strawb. (Fragaria Spp.). Plant Biotechnol. J. 2025, 23, 679–691. [Google Scholar] [CrossRef]
  71. Todorov, S.D.; de Almeida, B.M.; Lima, E.M.F.; Fabi, J.P.; Lajolo, F.M.; Hassimotto, N.M.A. Phenolic compounds and bacteriocins: Mechanisms, interactions, and applications in food preservation and safety. Mol. Nutr. Food Res. 2025, 69, e202400723. [Google Scholar] [CrossRef] [PubMed]
  72. Cosme, F.; Aires, A.; Pinto, T.; Oliveira, I.; Vilela, A.; Gonçalves, B. A comprehensive review of bioactive tannins in foods and beverages: Functional properties, health benefits, and sensory qualities. Molecules 2025, 30, 800. [Google Scholar] [CrossRef] [PubMed]
  73. Cao, Q.y.; Yuan, X.; Zhang, C.; Deng, X.; Jiang, Y.y.; Wang, B. An opinion on the roles of phenylalanine ammonia-lyase in the browning of fresh-cut fruit and vegetables. Front. Nutr. 2025, 12, 1561620. [Google Scholar] [CrossRef] [PubMed]
  74. Tang, T.; Zhang, M.; Jia, H.; Adhikari, B.; Guo, Z. Real-time freshness monitoring of fruits and vegetables integrating 3D-printed alginate-based colorimetric sensors with deep convolutional neural networks. Chem. Eng. J. 2025, 520, 166387. [Google Scholar] [CrossRef]
  75. Baca-Bocanegra, B.; García, A.I.E.; Hernández-Hierro, J.M.; Escudero-Gilete, M.L.; Nogales-Bueno, J. Feasibility assessment on the use of near infrared hyperspectral imaging for the screening of vitamin C content and total soluble solids in strawberries. Spectrochim. Acta Part A 2025, 343, 126542. [Google Scholar] [CrossRef]
  76. Medlicott, A.P.; Thompson, A.K. Analysis of sugars and organic acids in ripening mango fruits (Mangifera Indica L. var Keitt) by high performance liquid chromatography. J. Sci. Food Agric. 1985, 36, 561–566. [Google Scholar] [CrossRef]
  77. Lam, H.H.; Nguyen, T.-M.-T.; Do, T.-A.-S.; Dinh, T.H.; Dang-Bao, T. Quantification of total sugars and reducing sugars of dragon fruit-derived sugar-samples by UV-Vis spectrophotometric method. IOP Conf. Ser. Earth Environ. Sci. 2021, 947, 012041. [Google Scholar] [CrossRef]
  78. Prichko, T.; Ulyanovskaya, E.; Droficheva, N. Evaluation of biochemical indicators of apple fruits quality for the complex selection of the valuable source material for breeding. BIO Web Conf. 2020, 25, 02019. [Google Scholar] [CrossRef]
  79. Hu, Z.; Chang, C.Y.; Shen, R.; Yang, W. Multi-parameter orchard monitoring and control system. In Proceedings of the 2025 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-Taiwan), Kaohsiung, Taiwan, 16–18 July 2025; IEEE: New York, NY, USA, 2025; pp. 609–610. [Google Scholar]
  80. Salehi, F. Recent advances in the modeling and predicting quality parameters of fruits and vegetables during postharvest storage: A review. Int. J. Fruit Sci. 2020, 20, 506–520. [Google Scholar] [CrossRef]
  81. Júnior, M.R.B.; Dos Santos, R.G.; de Azevedo Sales, L.; Vargas, R.B.S.; Deltsidis, A.; de Oliveira, L.P. Image-based and ML-driven analysis for assessing blueberry fruit quality. Heliyon 2025, 11, e42288. [Google Scholar] [CrossRef]
  82. AOAC. Official Methods of Analysis of the Association of Official Analytical Chemists; The Association of Official Analytical Chemists: Washington, DC, USA, 2000; Volume 11. [Google Scholar]
  83. Sadasivam, S.; Manickam, A. Biochemical Methods, 3rd ed.; New Age International: New Delhi, India, 2010. [Google Scholar]
  84. Hedge, J.; Hofreiter, B. Methods of estimating starch and carbohydrates. Carbohydr. Chem. 1962, 17, 163–201. [Google Scholar]
  85. Miller, G.L. Use of dinitrosalicylic acid reagent for determination of reducing sugar. Anal. Chem. 1959, 31, 426–428. [Google Scholar] [CrossRef]
  86. Ramful, D.; Tarnus, E.; Aruoma, O.I.; Bourdon, E.; Bahorun, T. Polyphenol composition, vitamin C content and antioxidant capacity of Mauritian citrus fruit pulps. Food Res. Int. 2011, 44, 2088–2099. [Google Scholar] [CrossRef]
  87. Singleton, V.L.; Orthofer, R.; Lamuela-Raventós, R.M. Analysis of total phenols and other oxidation substrates and antioxidants by means of folin-ciocalteu reagent. Methods Enzymol. 1999, 299, 152–178. [Google Scholar]
  88. Zhishen, J.; Mengcheng, T.; Jianming, W. The determination of flavonoid contents in mulberry and their scavenging effects on superoxide radicals. Food Chem. 1999, 64, 555–559. [Google Scholar] [CrossRef]
  89. Brand-Williams, W.; Cuvelier, M.E.; Berset, C. Use of a free radical method to evaluate antioxidant activity. LWT—Food Sci. Technol. 1995, 28, 25–30. [Google Scholar] [CrossRef]
  90. Smit, C.; Joslyn, M.; Lukton, A. Determination of tannins and related polyphenols in foods. Anal. Chem. 1955, 27, 1159–1162. [Google Scholar] [CrossRef]
  91. Rodriguez-Amaya, D.B.; Raymundo, L.; Lee, T.; Simpson, K.; Chichester, C. Carotenoid pigment changes in ripening Momordica charantia Fruits. Ann. Bot. 1976, 40, 615–624. [Google Scholar] [CrossRef]
Figure 1. Orange-colored cashew apple on the tree with a nut of the Bhaskara variety, developed in ICAR-DCR, Puttur, Karnataka, India.
Figure 1. Orange-colored cashew apple on the tree with a nut of the Bhaskara variety, developed in ICAR-DCR, Puttur, Karnataka, India.
Agriengineering 07 00398 g001
Figure 2. Flowchart elucidating the research methodology of cashew apple physicochemical properties assessment through computer vision.
Figure 2. Flowchart elucidating the research methodology of cashew apple physicochemical properties assessment through computer vision.
Agriengineering 07 00398 g002
Figure 3. Six varieties of cashew apples with cut slices, harvested at two stages of maturity. The numbers indicated by 817 and 819 are the two stages of maturity. The complete set of original images is available from the uploaded data [41].
Figure 3. Six varieties of cashew apples with cut slices, harvested at two stages of maturity. The numbers indicated by 817 and 819 are the two stages of maturity. The complete set of original images is available from the uploaded data [41].
Agriengineering 07 00398 g003
Figure 4. Steps followed in the preprocessing of cashew apple slice samples. (A) Original image showing the layout of several cashew apple slice samples before image cropping; (B) cropped individual sample; (C) sample segmentation utilizing ‘U’ channel histogram and selection; (D) sample binary image showing unsegmented regions as holes; (E) binary sample image mask after filling holes and median filter; (F) original color image after overlaying with mask; (G) selection of skin areas with polygon tool and smoothening with fit spline; (H) background black filling of selected area; (I) final segmented sample image; (J) cropped whole fruit original image; (K) preprocessed whole fruit showing pedicel cavity and blemishes showing dark spots; and (L) dark regions rectified by pasting similar colored patches giving the final segmented sample image.
Figure 4. Steps followed in the preprocessing of cashew apple slice samples. (A) Original image showing the layout of several cashew apple slice samples before image cropping; (B) cropped individual sample; (C) sample segmentation utilizing ‘U’ channel histogram and selection; (D) sample binary image showing unsegmented regions as holes; (E) binary sample image mask after filling holes and median filter; (F) original color image after overlaying with mask; (G) selection of skin areas with polygon tool and smoothening with fit spline; (H) background black filling of selected area; (I) final segmented sample image; (J) cropped whole fruit original image; (K) preprocessed whole fruit showing pedicel cavity and blemishes showing dark spots; and (L) dark regions rectified by pasting similar colored patches giving the final segmented sample image.
Agriengineering 07 00398 g004
Figure 5. Processing of images based on different methodologies (AC). (A1) Preprocessed slice of cashew apple processed in color-based analysis; (A2) ROIs generated for color extraction; (B1) grayscale image for surface morphology analysis; (B2) waveness image with line ROI for sampling; (B3) waveness line ROI profile plot; (B4) 3D-waveness surface plot of image; (B5) roughness image with line ROI for sampling; (B6) roughness line ROI profile plot; (B7) 3D-roughness surface plot; (C1) grayscale image for LBP; (C2C4) red, green, and blue component channels, respectively; (C5C8) texture plots of LBP analysis of gray, red, green, and blue channels, respectively; and (D1) analysis progress indicated during the batch processing of images. The output image colors are different because each method uses different input image types such as color and grayscale.
Figure 5. Processing of images based on different methodologies (AC). (A1) Preprocessed slice of cashew apple processed in color-based analysis; (A2) ROIs generated for color extraction; (B1) grayscale image for surface morphology analysis; (B2) waveness image with line ROI for sampling; (B3) waveness line ROI profile plot; (B4) 3D-waveness surface plot of image; (B5) roughness image with line ROI for sampling; (B6) roughness line ROI profile plot; (B7) 3D-roughness surface plot; (C1) grayscale image for LBP; (C2C4) red, green, and blue component channels, respectively; (C5C8) texture plots of LBP analysis of gray, red, green, and blue channels, respectively; and (D1) analysis progress indicated during the batch processing of images. The output image colors are different because each method uses different input image types such as color and grayscale.
Agriengineering 07 00398 g005
Figure 6. Input panel of the developed “Cashew Apple and Slices Analysis” batch processing plugin for calculating various image-based parameters used for estimating physicochemical properties of cashew apples. The panel shows brief instructions, various method choices for analysis, input parameters for the methodologies, and buttons to operate the plugin. The main inputs (5 total) are color subimage column and rows (2 inputs), waveness and roughness cutoff pixel (1 input), and GLCM step size pixel and direction degrees (2 inputs).
Figure 6. Input panel of the developed “Cashew Apple and Slices Analysis” batch processing plugin for calculating various image-based parameters used for estimating physicochemical properties of cashew apples. The panel shows brief instructions, various method choices for analysis, input parameters for the methodologies, and buttons to operate the plugin. The main inputs (5 total) are color subimage column and rows (2 inputs), waveness and roughness cutoff pixel (1 input), and GLCM step size pixel and direction degrees (2 inputs).
Agriengineering 07 00398 g006
Figure 7. Correlation diagram for cashew apple sample types demonstrating the correlation of wet chemistry and image-based parameters. Solid horizontal line separates interrelationships among wet chemistry, and dotted horizontal lines separate different parameter groups of methodologies. All—combined slice and whole sample types, Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Figure 7. Correlation diagram for cashew apple sample types demonstrating the correlation of wet chemistry and image-based parameters. Solid horizontal line separates interrelationships among wet chemistry, and dotted horizontal lines separate different parameter groups of methodologies. All—combined slice and whole sample types, Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Agriengineering 07 00398 g007
Figure 8. Range of image-based parameters correlation coefficients with physicochemical properties wet chemistry parameters. Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Figure 8. Range of image-based parameters correlation coefficients with physicochemical properties wet chemistry parameters. Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Agriengineering 07 00398 g008
Figure 9. Prediction and validation showing the ratio change of the predicted to actual physicochemical properties for cashew apple varieties from test image data. The gray dotted horizontal lines across the ratio of 1.0 indicate the ideal prediction. Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Figure 9. Prediction and validation showing the ratio change of the predicted to actual physicochemical properties for cashew apple varieties from test image data. The gray dotted horizontal lines across the ratio of 1.0 indicate the ideal prediction. Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type.
Agriengineering 07 00398 g009
Table 1. Dependent wet chemistry-based physicochemical properties variables and independent image-based variables employed in the study.
Table 1. Dependent wet chemistry-based physicochemical properties variables and independent image-based variables employed in the study.
MethodologyApplicable Parameter VariablesTotal: 159
Wet chemistry *Acidity (titrable), Antioxidant activity, Carbohydrates (total), Carotene (content), Fat crude, Flavonoids (total), Moisture (content), pH, Phenolic (total content), Protein, Reducing sugars, Tannin (content), Total soluble solids, Total sugars, and Vitamin C (refer Section 2.2).Total: 15
Color grid **,Mean R, Mean G, Mean B, Standard deviation of R, Standard deviation of G, Standard deviation of B, Median R, Median G, Median B, Geometric mean R, Geometric mean G, Geometric mean B, Maximum R, Maximum G, Maximum B, Minimum R, Minimum G, Minimum B, Number of R values, Number of G values, Number of B values, Skewness R, Skewness G, Skewness B, Kutosis R, Kutosis G, and Kurtosis B (refer Section 2.3.3.i).Total: 27
Waveness and roughness **Descriptive statistics parameters of waveness plot (Number of waveness plot data points, Minimum waveness, Maximum waveness, Mean waveness, Geometric mean waveness, Standard deviation waveness, Median waveness, Geometric mean waveness, Standard deviation of waveness), Waveness parameters calculated based on formulae from references (Arithmetic mean waveness, Root mean square waveness, Maximum valley depth of waveness plot, Maximum peak height of waveness plot, Average valley depth of waveness plot, Average peak height of waveness plot, Maximum peak to valley height of waveness plot, Descriptive statistics parameters of roughness plot (Number of roughness plot data points, Minimum roughness, Maximum roughness, Mean roughness, Geometric mean roughness, Standard deviation roughness, Median roughness), Waveness parameters calculated based on formulas from references (Arithmetic mean roughness, Root mean square roughness, Maximum valley depth of roughness plot, Maximum peak height of roughness plot, Average valley depth of roughness plot, Average peak height of roughness curve, and Maximum peak to valley height of roughness plot) (refer Section 2.3.3.ii).Total: 28
GLCM **Angular second moment, Contrast, Correlation, Inverse difference moment, and Entropy (refer Section 2.3.3.iii).Total: 5
LBP **,Number of gray pixels analyzed, Minimum gray, Maximum gray, Mean gray, Geometric mean gray, Quartile mean gray, Standard deviation gray, Median gray, Skewness gray, Kurtosis gray, Number of R pixels analyzed, Minimum R, Maximum R, Mean R, Geometric mean R, Quartile mean R, Standard deviation R, Median R, Skewness R, Kurtosis R, Number of G pixels analyzed, Minimum G, Maximum G, Mean G, Geometric mean G, Quartile mean G, Standard deviation G, Median G, Skewness G, Kurtosis G, Number of B pixels analyzed, Minimum B, Maximum B, Mean B, Geometric mean B, Quartile mean B, Standard deviation B, Median B, Skewness B, and Kurtosis B (refer Section 2.3.3.iv).Total: 40
Color indices **Color indices derived from overall R, G, and B values (27 existing + 17 new; Table 2; refer Section 2.3.3.v).Total: 44
* Dependent variables (total 15); ** Independent variables (total 144); † Overall values derived from the pixels of the selected ROIs based on user inputs; ‡ Set of values analyzed by splitting the color image into the component R ,   G ,   and B channels; GLCM—gray level co-occurrence matrix; LBP—local binary pattern; R—red, B—blue, and G—green pixel intensity values.
Table 2. Comprehensive list of existing and newly developed RGB-based indices.
Table 2. Comprehensive list of existing and newly developed RGB-based indices.
Vegetation IndexEquationReference
Blue chromatic coordinate (BCC) B R + G + B  [47]
Green-blue difference index (GBDI) G B  [48]
Blue red vegetation index (BRVI) B R B + R  [49]
Color index of vegetation (CIVE) 0.441 R 0.881 G + 0.385 B + 18.787  [50,51]
Coloration index (CI) R B R  [52]
Combination1 (COMB1) 0.25 ExG + 0.3 EXGR + 0.33 CIVE + 0.12 VEG  [53]
Combination2 (COMB2) 0.36 ExG + 0.47 CIVE + 0.17 VEG  [53]
Excess green (ExG) 2 G B R  [47,54]
Excess green minus excess red (ExGR) ExG 2 ExR  [50,55]
Excess red (ExR) 1.4 R G R + G + B  [50]
Green-blue vegetation index (GBVI) G B G + B  [56,57]
Green chromatic coordinate (GCC) G R + G + B  [47]
Green-red difference (GRD) G R  [48]
Green-red ratio index (GRRI) G R  [54]
Kawashima index (IKAW) R B R + B  [49,50]
Modified green-red vegetation index (MGRVI) G 2 R 2 G 2 + R 2  [48]
Normalized difference index (NDI) 128 × G R G + R + 1  [58]
Normalized excess green (ExG2) 2 G B R R + G + B  [47]
Normalized green-red difference index (NGRDI) G R G + R  [59,60]
Red chromatic coordinate (RCC) R R + G + B  [47]
Red blue ratio index (RBRI) R B  [61]
Red green ratio index (RGRI) R G  [62]
Red green blue vegetation index (RGBVI) G 2 B R G 2 + B R  [48]
Vegetativen (VEG) G R 0.667 + B 0.333  [48]
Visible atmospherically resistant index (VARI) G R G + R B  [63]
Visible-band difference vegetation index (VDVI) 2 G R B 2 G + R + B  [64]
Woebbecke index (WI) G B R G  [47]
Excess modified red (ExmR) 2 R G B
Modified coloration index (MCI) R G R
Modified red-blue vegetation index (MRBVI) R 2 B 2 R 2 + B 2
Modified red-green-blue vegetation index (MRGBVI) R 2 G B R 2 + G B
Modified red-green vegetation index (MRGVI) R 2 G 2 R 2 + G 2
Normalized excess red (ExR2) 2 R G B R + G + B
Normalized red-blue difference index (NRBDI) 128 × R B R + B + 1
Normalized red-green difference index (NRGDI) 128 × R G R + G + 1
Red-blue difference ratio (RBDR) R B G R
Red color index (RCI) 2 R G B 2 R + G + B
Red-green difference ratio (RGDR) R G B R
Red-blue difference (RBD) R B
Red-green difference (RGD) R G
Red-green vegetation index (RGVI) R G R + G
Triangular redness index (TRI) R 0.39 G 0.61 B
Visible atmospherically resistant red-blue index (VARRBI) R B R + B G
Visible atmospherically resistant red-green index (VARRGI) R G R + G B
Red (R), blue (B), and green (G) bands recorded from the satellite platforms. † Developed in the present study to capture red-tint-dominated samples.
Table 3. Cashew apple wet chemistry data for various varieties and maturity stages.
Table 3. Cashew apple wet chemistry data for various varieties and maturity stages.
Wet ChemistryCashew Apple Varieties and Maturity Stages
Bhaskara MDK NRCC-1 Vengurla-2 VRI-3 VTH-174
817 819 817 819 817 819 817 819 817 819 817 819
Acidity (%)0.14 ± 0.00.09 ± 0.010.12 ± 0.00.08 ± 0.00.16 ± 0.00.12 ± 0.00.25 ± 0.00.18 ± 0.00.16 ± 0.00.12 ± 0.00.160 ± 0.00.11 ± 0.0
Antioxidants ( μ mol / 100 g ) 28.59 ± 1.1320.85 ± 0.2225.96 ± 0.458.94 ± 0.1124.39 ± 0.4722.69 ± 0.9126.01 ± 0.832.88 ± 0.284.21 ± 0.1211.83 ± 0.8730.47 ± 0.7222.57 ± 0.87
Carbohydrates ( g / 100 g ) 14.09 ± 0.9316.35 ± 0.7114.67 ± 0.4217.2 ± 0.6916.94 ± 0.1120.77 ± 0.3516.81 ± 0.2419.24 ± 0.4814.16 ± 1.115.78 ± 0.4114.67 ± 0.6218.07 ± 0.29
Carotenes (mg / 100 g ) 1.5 ± 0.011.4 ± 0.011.41 ± 0.020.82 ± 0.01.29 ± 0.00.34 ± 0.00.74 ± 0.01 ± 0.00.42 ± 0.00.86 ± 0.00.57 ± 0.00.78 ± 0.0
Crude fat (%)2.02 ± 0.111.78 ± 0.042.73 ± 0.072.46 ± 0.095.61 ± 0.044.59 ± 0.192.46 ± 0.042.32 ± 0.042.23 ± 0.111.68 ± 0.075.44 ± 0.014.66 ± 0.11
Flavonoids (mg / g ) 64.33 ± 13.9106.04 ± 0.0101.49 ± 21.630.84 ± 2.825.87 ± 0.416.78 ± 0.2171.30 ± 56.771.52 ± 5.780.24 ± 8.412.98 ± 3.5218.94 ± 31.828.47 ± 5.7
MC (% w.b.)87.57 ± 0.4985.81 ± 0.5686.42 ± 185.92 ± 0.887 ± 0.6585.92 ± 0.3287.31 ± 0.2586 ± 0.2188.89 ± 1.4786.93 ± 0.3286.91 ± 0.2885.26 ± 0.17
pH4.56 ± 0.014.73 ± 0.024.87 ± 0.025.23 ± 0.154.49 ± 0.024.78 ± 0.014.29 ± 0.094.45 ± 0.024.53 ± 0.014.69 ± 0.014.49 ± 0.04.85 ± 0.04
Phenolics (mg / 100 g ) 311.41 ± 5.1191.77 ± 4.3308.66 ± 6.5134.33 ± 4.0161 ± 7.0124.33 ± 3.8374.7 ± 10.7308.33 ± 6.8318.55 ± 3.5170.49 ± 4.5405.84 ± 4.3243.5 ± 4.8
Proteins (%)1.7 ± 0.071.2 ± 0.042.16 ± 0.061.12 ± 0.032.05 ± 0.071.18 ± 0.032.26 ± 0.091.26 ± 0.041.78 ± 0.141.26 ± 0.052.24 ± 0.091.45 ± 0.03
Reducing sugars ( g / 100 g ) 7.89 ± 0.176.75 ± 0.389.33 ± 0.5110.84 ± 0.2914.7 ± 0.5917.68 ± 0.5514.11 ± 0.4116.2 ± 1.349.04 ± 0.0710.4 ± 0.5812.22 ± 0.719.7 ± 0.38
Tannins (mg / 100 g ) 2.41 ± 0.111.65 ± 0.052.25 ± 0.280.88 ± 0.030.91 ± 0.020.76 ± 0.052.57 ± 0.072.36 ± 0.062.04 ± 0.061.11 ± 0.032.53 ± 0.141.86 ± 0.14
Total sugars ( g / 100 g ) 8.62 ± 0.1512.05 ± 0.0810.4 ± 1.2713.07 ± 1.4715.81 ± 0.519.88 ± 0.69.92 ± 0.5911.19 ± 0.2515.26 ± 1.2617.81 ± 0.4213.63 ± 0.2317.05 ± 0.26
TSS (°brix)2.41 ± 0.111.65 ± 0.052.25 ± 0.280.88 ± 0.030.91 ± 0.020.76 ± 0.052.57 ± 0.072.36 ± 0.062.04 ± 0.061.11 ± 0.032.53 ± 0.141.86 ± 0.14
Vitamin C (mg / 100 g ) 98.86 ± 5.1181.71 ± 1.5154.62 ± 3.1252.2 ± 17.427.5 ± 2.391.12 ± 0.29197.8 ± 12.3305.5 ± 11.5150.86 ± 2.6305 ± 2.340.62 ± 2.369.24 ± 7.6
Number of replications employed = 3 and measurement protocols in Appendix A; MC—moisture content; TSS—total soluble solids; and values shown are mean ± 1 standard deviation.
Table 4. Image analysis input parameters selection based on the sum of R 2 based on the top three sets of 180 models.
Table 4. Image analysis input parameters selection based on the sum of R 2 based on the top three sets of 180 models.
Selected InputInput Combination Σ R 2 Σ R 2 Change from
* Reference (%)
Waveness cutoff value 7 × 7 ; 4 pix; 1 pix; 0°73.900 *0.00
7 × 7 ; 5 pix; 1 pix; 0°73.9000.00
7 × 7 ; 10 pix; 1 pix; 0°75.0031.49
7 × 7 ; 15 pix; 1 pix; 0°75.5052.17
7 × 7 ; 20 pix; 1 pix; 0°75.8052.58
7 × 7 ; 25 pix; 1 pix; 0°75.9412.76
7 × 7 ; 30 pix; 1 pix; 0°76.0142.86
7 × 7 ; 35 pix ***; 1 pix; 0°76.043 **2.90 **
7 × 7 ; 40 pix; 1 pix; 0°75.9662.80
7 × 7 ; 50; pix 1 pix; 0°75.9662.80
GCLM step size 7 × 7 ; 35 pix; 1 pix ***; 0°76.043 **2.90 **
7 × 7 ; 35 pix; 2 pix; 0°76.0232.87
7 × 7 ; 35 pix; 3 pix; 0°76.0242.87
7 × 7 ; 35 pix; 5 pix; 0°76.0312.88
7 × 7 ; 35 pix; 10 pix; 0°75.9332.75
GCLM step direction 7 × 7 ; 35 pix; 1 pix; 0°76.0432.90
7 × 7 ; 35 pix; 1 pix; 90°76.4343.43
7 × 7 ; 35 pix; 1 pix; 180°76.0562.92
7 × 7 ; 35 pix; 1 pix; 270° ***76.437 **3.43 **
Color grid size 5 × 5 ; 35 pix; 1 pix; 270°74.6701.04
7 × 7 ***; 35 pix; 1 pix; 270°76.437 **3.43 **
10 × 10 ; 35 pix; 1 pix; 270°73.751 0.20
* Reference Σ R 2 value from 180 models; ** maximum value at each input block; *** selected combination of image analysis input variables at different evaluation stages; and † final selected input variable combination used for further analysis.
Table 5. Effect of sample types and analysis groups in correlation analysis of wet chemistry with image-based parameters through absolute and mean Pearson’s correlation coefficient (r and r ¯ ).
Table 5. Effect of sample types and analysis groups in correlation analysis of wet chemistry with image-based parameters through absolute and mean Pearson’s correlation coefficient (r and r ¯ ).
Parameters GroupNAll SamplesSlice SamplesWhole Samples
Sum Mean Sum Mean Sum Mean
Wet chemistry22599.60.44399.60.44399.60.443
Color grid27061.00.22667.80.25172.20.267
Waveness and roughness37556.70.151121.60.32495.10.254
GLCM759.60.12816.00.21317.50.234
LBP36059.30.16568.20.18974.00.206
Color indices660156.50.237193.00.292179.70.272
Total image-based1740343.10.197466.50.268438.60.252
Total all methods1965442.70.225566.10.288538.20.274
Data used represent absolute r values with stages (817 and 819) combined; N is the number of parameters in each group; GLCM is gray level co-occurrence matrix; LBP is local binary pattern; Total represents all the combinations of r values of wet chemistry (15) with both wet chemistry and image-based (15 + 116) parameters.
Table 6. Slice and whole cashew apple top-10 ranking for the wet chemistry properties based on correlation coefficients of image-based independent parameters.
Table 6. Slice and whole cashew apple top-10 ranking for the wet chemistry properties based on correlation coefficients of image-based independent parameters.
Wet ChemistryRanks of Image-Based Independent Variables Based on Correlation (r) Values
Properties12345678910
Slice
AcidityWn-avPHWn-mxPHWn-amWnRn-amRnRn-xpvhRn-avPHRn-medWn-GmnOv-stdRRn-mxVD
AntioxidantsRn-medWn-medRn-avPHWn-mxVDWn-mnGl-corrRn-avVDWn-GmnOv-medGWn-mxPH
CarbohydratesOv-skBWn-xPtRn-mnRn-avPHRn-xpvhWn-medGl-corrGl-contRn-avVDWn-avPH
CarotenesVDVIGCCExG2VEGCOMB1COMB2ExGCIVERGDRWn-avPH
Crude fatGl-contRn-mxPHtRn-mxL-GmnRL-GmnGL-GmnGrGl-IDML-GmnBRn-miRn-mxVD
FlavonoidsRn-mxPHRn-mxGl-contRn-medWn-GmnL-GmnRGl-IDMRGBVIRn-miL-GmnG
MCWn-avPHWn-mxPHRn-medWn-amWnRn-amRnOv-stdROv-stdBRn-xpvhOv-medROv-GmnG
pHWn-medGRDRGDOv-medGRBRIExmRTRIRCCExR2BRVI
PhenolicsRn-mxRn-mxPHRn-miRn-mxVDOv-mnGRn-xpvhRn-amRnRn-stdVEGOv-GmnG
ProteinsRn-mxRn-mxPHRn-medWn-GmnOv-mnGRn-avPHWn-mxVDOv-GmnGRn-xpvhWn-mi
Reducing sugarsWn-avPHWn-mxPHWn-GmnOv-medBRn-xpvhExGROv-medGVARIVARRGIRGRI
TanninsWn-avPHWn-mxPHWn-miRn-xpvhWn-amWnRn-amRnOv-skBWn-rsWnWn-stdWn-Gmn
Total SugarsOv-medGRBRIBRVIIKAWNRBDIBCCCIMRBVIMRGBVIRCC
TSSWn-avPHWn-amWnRn-amRnWn-GmnVEGVDVIGCCExG2COMB1COMB2
Vitamin CWn-avVDWn-amWnRn-amRnOv-stdRWn-avPHOv-GmnRWn-rsWnWn-stdOv-mnROv-stdB
Whole
AcidityGl-contOv-skROv-skGOv-skBRn-mnOv-medROv-stdROv-stdBVEGGCC
AntioxidantsGl-contOv-GmnBGl-IDML-QmnBL-mnBL-skBL-kuBL-stdBWn-avVDRGBVI
CarbohydratesRn-mxPHRn-mxGl-IDMWn-miWn-mxVDWn-avVDWn-amWnRn-amRnL-GmnGL-GmnGr
CarotenesWn-xpvhOv-mnGOv-skBWn-rsWnWn-stdOv-GmnBWn-amWnRn-amRnWn-mxWn-avVD
Crude fatL-GmnGL-GmnGrGl-corrRn-medWn-mnWn-mxWn-GmnGl-ASML-GmnRL-GmnB
FlavonoidsRn-rsRnRn-stdRn-medRn-mxPHRn-mxRn-avVDGl-contOv-medRWn-avPHWn-mn
MCGl-contOv-skROv-medROv-stdBOv-stdROv-GmnROv-mnRGCCExG2VDVI
pHOv-medGOv-mnBCIVEExGOv-GmnBWIRBDRCOMB1COMB2RGDR
PhenolicsOv-skBOv-mnGOv-GmnGWn-avPHOv-GmnBOv-stdRWn-rsWnWn-stdWn-amWnRn-amRn
ProteinsGl-contOv-mnGOv-GmnGOv-GmnBRn-mxPHRn-mxOvr-medRRn-avVDRn-stdRn-rsRn
Reducing sugarsWn-xpvhOv-GmnBWn-mxVDWn-mxOv-skGOv-mnGOv-mnBCIVEExGWn-rsWn
TanninsWn-xpvhRn-mxRn-mxPHWn-mxVDOv-GmnBWn-mxOv-medBCIVEOv-mnBExG
Total SugarsOv-GmnBOv-mnBCIVEExGL-kuBOv-medGCOMB2COMB1L-skBVDVI
TSSOv-skBOv-skGWn-xpvhWn-rsWnWn-stdWn-avVDWn-amWnRn-amRnOv-mnGWn-mx
Vitamin COv-skRRn-mnOv-stdROv-skBRn-miRn-mxVDOv-GmnROv-mnROv-medROv-stdB
Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type. Short forms for the image-based independent parameters were used. They can be combined to make the complete image-based independent property names. Table 1 list the full form. Short forms utilized are as follows: am—arithmetic mean, av—average, B—blue, cont—contrast, corr—correlation, G—green, Gl—gray level co-occurrence matrix, Gmn—geometric mean, Gr—gray, IDM—inverse difference moment, kurt—kurtosis, L—local binary pattern, med—median, mi—minimum, mx—maximum, Ov—overall, PH—peak height, pvh—peak to valley height, rs—root mean square, R—red, Rn—roughness. sk—skewness, std—standard deviation, VD—valley depth, and Wn—waveness; color-based indices: BCC, CIVE, COMB1, COMB2, ExG, ExG2, GCC, GRD, MRGBVI, RBDR, RGD, RGDR, RGBVI, VDVI, VEG, and WI are available from Table 2; MC—moisture content; and TSS—total soluble solids.
Table 7. Top-three ranked linear models of wet chemistry parameters with image-based parameters as independent variables from the training image dataset for cashew apple slices and whole fruits.
Table 7. Top-three ranked linear models of wet chemistry parameters with image-based parameters as independent variables from the training image dataset for cashew apple slices and whole fruits.
Wet Chem. (y)xModel ( R 2 )xModel ( R 2 )Wet Chem. (y)xModel ( R 2 )xModel ( R 2 )
Slice
Stage = 817Stage = 819 Stage = 817Stage = 819
AcidityOv-medR 0.0043 x + 0.9284 ; ( R 2 = 0.223 ) WI 0.229 x 0.235 ; ( R 2 = 0.669 ) AntioxidantVDVI 481.89 x 0.2153 ; ( R 2 = 0.429 ) VEG 170.16 x 17.33 ; ( R 2 = 0.505 )
Ov-mnR 0.003 x + 0.6866 ; ( R 2 = 0.206 ) RBDR 0.229 x 0.464 ; ( R 2 = 0.669 ) GCC 1044.72 x 347.99 ; ( R 2 = 0.426 ) Ov-mnG 1.9564 x 233.88 ; ( R 2 = 0.394 )
Ov-GmnR 0.0028 x + 0.6437 ; ( R 2 = 0.206 ) Ov-medG 0.0154 x 2.0769 ; ( R 2 = 0.651 ) ExG2 348.24 x + 0.2505 ; ( R 2 = 0.426 ) COMB2 12.68 x 120.77 ; ( R 2 = 0.385 )
CarbohydratesOv-stdR 0.1099 x + 13.42 ; ( R 2 = 0.240 ) Ov-skG 6.5953 x + 2.4957 ; ( R 2 = 0.672 ) CarotenoidsVEG 3.8925 x + 0.0719 ; ( R 2 = 0.936 ) Rn-xpH 0.0206 x + 2.1379 ; ( R 2 = 0.502 )
Ov-GmnB 0.0583 x + 18.42 ; ( R 2 = 0.232 ) Ov-kuB 0.0009 x + 18.59 ; ( R 2 = 0.216 ) COMB1 0.5129 x 2.9892 ; ( R 2 = 0.930 ) Rn-mx 0.0205 x + 2.147 ; ( R 2 = 0.501 )
Ov-stdB 0.0722 x + 13.33 ; ( R 2 = 0.204 ) Ov-stdR 0.1938 x + 14.22 ; ( R 2 = 0.167 ) COMB2 0.3559 x 3.0132 ; ( R 2 = 0.930 ) Rn-mpvh 0.0214 x + 2.5627 ; ( R 2 = 0.428 )
Crude fatVEG 12.76 x + 5.994 ; ( R 2 = 0.542 ) Gl-cont 0.1355 x + 0.255 ; ( R 2 = 0.518 ) FlavonoidsExmR 2.0486 x + 424.89 ; ( R 2 = 0.195 ) VEG 550.93 x 67.64 ; ( R 2 = 0.509 )
COMB2 1.121 x + 15.60 ; ( R 2 = 0.497 ) Rn-mx 0.0608 x 0.5068 ; ( R 2 = 0.503 ) TRI 3.9558 x + 443.18 ; ( R 2 = 0.193 ) COMB2 44.00 x 434.37 ; ( R 2 = 0.446 )
COMB1 1.6147 x + 15.53 ; ( R 2 = 0.496 ) Rn-xpH 0.061 x 0.4769 ; ( R 2 = 0.503 ) RBD 3.3652 x + 481.89 ; ( R 2 = 0.181 ) COMB1 62.76 x 426.48 ; ( R 2 = 0.444 )
MCOv-kuB 0.0006 x + 80.00 ; ( R 2 = 0.430 ) Rn-mx 0.0218 x + 87.18 ; ( R 2 = 0.385 ) pHRBD 0.0166 x + 2.7368 ; ( R 2 = 0.509 ) Ov-GmnB 0.07 x + 8.1414 ; ( R 2 = 0.612 )
RBD 0.0604 x + 93.88 ; ( R 2 = 0.358 ) Rn-mxPH 0.0219 x + 87.17 ; ( R 2 = 0.385 ) TRI 0.0186 x + 2.9997 ; ( R 2 = 0.495 ) CIVE 0.1321 x + 4.0447 ; ( R 2 = 0.608 )
TRI 0.0685 x + 92.98 ; ( R 2 = 0.356 ) Rn-mpvh 0.0224 x + 87.60 ; ( R 2 = 0.318 ) ExmR 0.0095 x + 3.1075 ; ( R 2 = 0.483 ) ExG 0.0658 x + 6.2548 ; ( R 2 = 0.606 )
PhenolicsGRD 6.4798 x + 586.02 ; ( R 2 = 0.273 ) VEG 1840.87 x 152.75 ; ( R 2 = 0.686 ) ProteinsOv-GmnB 0.0147 x + 2.8153 ; ( R 2 = 0.426 ) Ov-kuR 0.0017 x + 0.9997 ; ( R 2 = 0.674 )
RGD 6.4798 x + 586.02 ; ( R 2 = 0.273 ) COMB2 151.10 x 1421.89 ; ( R 2 = 0.635 ) Ov-stdB 0.0191 x + 1.5097 ; ( R 2 = 0.414 ) Ov-kuG 0.0009 x + 1.1169 ; ( R 2 = 0.570 )
ExmR 2.8428 x + 741.30 ; ( R 2 = 0.255 ) COMB1 215.56 x 1395.25 ; ( R 2 = 0.633 ) Ov-mnB 0.0216 x + 3.4392 ; ( R 2 = 0.308 ) Ov-medR 0.015 x + 3.8487 ; ( R 2 = 0.470 )
R-sugarsOv-stdR 0.3289 x + 5.6368 ; ( R 2 = 0.463 ) Ov-skG 12.74 x 17.62 ; ( R 2 = 0.477 ) TanninsVDVI 33.54 x + 0.509 ; ( R 2 = 0.466 ) VEG 16.07 x 1.5984 ; ( R 2 = 0.678 )
Ov-stdB 0.2153 x + 5.3745 ; ( R 2 = 0.391 ) Ov-stdR 0.6897 x 0.9805 ; ( R 2 = 0.401 ) GCC 72.71 x 23.70 ; ( R 2 = 0.463 ) COMB2 1.2671 x 12.11 ; ( R 2 = 0.579 )
Ov-GmnG 0.1451 x + 29.32 ; ( R 2 = 0.379 ) Ov-stdB 0.6667 x 8.1091 ; ( R 2 = 0.332 ) ExG2 24.24 x + 0.5414 ; ( R 2 = 0.463 ) COMB1 1.8024 x 11.86 ; ( R 2 = 0.574 )
T-sugarsOv-stdR 0.3531 x + 6.2683 ; ( R 2 = 0.478 ) Ov-skG 10.73 x 9.88 ; ( R 2 = 0.491 ) TSSVEG 3.61 x + 10.12 ; ( R 2 = 0.547 ) Rn-mx 0.0184 x + 8.63 ; ( R 2 = 0.393 )
Ov-stdB 0.2345 x + 5.89 ; ( R 2 = 0.416 ) Ov-medR 0.3703 x + 79.45 ; ( R 2 = 0.246 ) COMB1 0.4595 x + 12.84 ; ( R 2 = 0.507 ) Rn-mxPH 0.0185 x + 8.6394 ; ( R 2 = 0.393 )
Ov-GmnG 0.1575 x + 31.91 ; ( R 2 = 0.401 ) Ov-stdB 0.4693 x + 0.9376 ; ( R 2 = 0.239 ) COMB2 0.3189 x + 12.86 ; ( R 2 = 0.507 ) Rn-mpvh 0.0181 x + 8.3368 ; ( R 2 = 0.298 )
Vitamin CL-GmnR 2.4462 x 255.55 ; ( R 2 = 0.512 ) Rn-mpvh 6.1257 x + 650.14 ; ( R 2 = 0.625 )
L-GmnGr 2.4646 x 257.40 ; ( R 2 = 0.508 ) Rn-mx 5.2789 x + 499.37 ; ( R 2 = 0.592 )
L-GmnB 2.4933 x 262.33 ; ( R 2 = 0.507 ) Rn-mxPH 5.2914 x + 496.49 ; ( R 2 = 0.591 )
Whole
Stage = 817Stage = 819 Stage = 817Stage = 819
AcidityWI 0.1894 x 0.1214 ; ( R 2 = 0.423 ) Ov-medG 0.0162 x 2.1898 ; ( R 2 = 0.564 ) AntioxidantOv-skG 21.74 x + 74.51 ; ( R 2 = 0.401 ) Ov-skG 18.44 x + 58.08 ; ( R 2 = 0.355 )
RBDR 0.1894 x 0.3108 ; ( R 2 = 0.423 ) Ov-GmnB 0.0092 x 0.3184 ; ( R 2 = 0.538 ) Wn-mxPH 0.3435 x + 39.66 ; ( R 2 = 0.291 ) Wn-avVD 1.0144 x + 27.94 ; ( R 2 = 0.279 )
RGDR 1.1605 x + 0.6284 ; ( R 2 = 0.405 ) Ov-mnB 0.0069 x 0.3073 ; ( R 2 = 0.517 ) Wn-mx 0.2186 x + 53.37 ; ( R 2 = 0.289 ) Wn-mxVD 0.2122 x + 27.25 ; ( R 2 = 0.230 )
CarbohydratesOv-skG 4.4682 x + 4.8171 ; ( R 2 = 0.758 ) Gl-cont 0.2345 x + 21.48 ; ( R 2 = 0.292 ) CarotenoidsOv-GmnG 0.0498 x 5.0505 ; ( R 2 = 0.638 ) Ov-skG 1.1342 x + 3.6336 ; ( R 2 = 0.440 )
Ov-kuB 0.0006 x + 15.73 ; ( R 2 = 0.214 ) COMB1 2.5718 x 1.1543 ; ( R 2 = 0.254 ) Ov-stdR 0.0584 x + 2.0017 ; ( R 2 = 0.606 ) Wn-mxVD 0.0139 x + 1.7822 ; ( R 2 = 0.322 )
Ov-stdR 0.1121 x + 13.11 ; ( R 2 = 0.167 ) COMB2 1.8006 x 1.4471 ; ( R 2 = 0.254 ) Ov-stdG 0.0362 x + 2.1947 ; ( R 2 = 0.495 ) Wn-xpvh 0.0083 x + 1.8279 ; ( R 2 = 0.192 )
Crude fatOv-mnG 0.2689 x + 37.55 ; ( R 2 = 0.337 ) VEG 17.06 x 0.3989 ; ( R 2 = 0.339 ) FlavonoidsWn-xpvh 1.3864 x + 258.44 ; ( R 2 = 0.258 ) Ov-kuB 0.0114 x + 24.51 ; ( R 2 = 0.282 )
Ov-GmnG 0.1594 x + 22.36 ; ( R 2 = 0.322 ) L-GmnG 0.0806 x + 15.78 ; ( R 2 = 0.32 ) Ov-kuR 0.602 x + 36.06 ; ( R 2 = 0.246 ) VEG 272.13 x 12.52 ; ( R 2 = 0.235 )
Gl-cont 0.2158 x + 7.0035 ; ( R 2 = 0.308 ) L-GmnGr 0.0798 x + 15.68 ; ( R 2 = 0.307 ) WI 220.86 x 217.40 ; ( R 2 = 0.243 ) Wn-med 0.6225 x + 92.67 ; ( R 2 = 0.218 )
MCGl-cont 0.1424 x + 85.00 ; ( R 2 = 0.456 ) RGBVI 28.13 x + 91.99 ; ( R 2 = 0.389 ) pHWI 0.8142 x + 5.7736 ; ( R 2 = 0.443 ) Ov-GmnB 0.0693 x + 8.0592 ; ( R 2 = 0.591 )
Ov-kuB 0.0006 x + 86.96 ; ( R 2 = 0.418 ) Gl-cont 0.0718 x + 84.89 ; ( R 2 = 0.285 ) RBDR 0.8142 x + 6.5879 ; ( R 2 = 0.443 ) Ov-medG 0.0957 x + 18.36 ; ( R 2 = 0.377 )
Ov-medG 0.1892 x + 60.53 ; ( R 2 = 0.235 ) Ov-GmnB 0.0992 x + 81.30 ; ( R 2 = 0.229 ) RGDR 5.0703 x + 2.5177 ; ( R 2 = 0.438 ) Ov-mnB 0.0414 x + 7.312 ; ( R 2 = 0.358 )
PhenolicsWn-mx 2.1458 x + 609.55 ; ( R 2 = 0.341 ) Ov-skG 167.97 x + 588.64 ; ( R 2 = 0.374 ) ProteinsGl-cont 0.0391 x + 2.7014 ; ( R 2 = 0.421 ) Ov-kuB 0.0001 x + 1.1598 ; ( R 2 = 0.701 )
Wn-xpvh 1.8216 x + 500.79 ; ( R 2 = 0.318 ) Ov-kuB 0.0333 x + 161.26 ; ( R 2 = 0.316 ) Ov-stdR 0.0265 x + 1.5324 ; ( R 2 = 0.257 ) Ov-kuR 0.0018 x + 0.9572 ; ( R 2 = 0.690 )
Ov-kuR 0.7722 x + 211.25 ; ( R 2 = 0.289 ) Wn-mxVD 2.1042 x + 317.09 ; ( R 2 = 0.287 ) Ov-stdB 0.0262 x + 1.2445 ; ( R 2 = 0.257 ) WI 0.5206 x + 0.4554 ; ( R 2 = 0.510 )
R-sugarsOv-skG 7.5294 x 6.3176 ; ( R 2 = 0.458 ) Ov-stdR 0.6574 x 0.7915 ; ( R 2 = 0.376 ) TanninsWn-mx 0.0151 x + 4.2023 ; ( R 2 = 0.318 ) Ov-skG 1.3642 x + 4.6378 ; ( R 2 = 0.313 )
Ov-stdR 0.3561 x + 4.5304 ; ( R 2 = 0.357 ) RBRI 8.1523 x + 34.40 ; ( R 2 = 0.303 ) Wn-mxPH 0.0218 x + 3.1684 ; ( R 2 = 0.270 ) Wn-avVD 0.0756 x + 2.4145 ; ( R 2 = 0.249 )
Ov-medR 0.335 x + 69.41 ; ( R 2 = 0.314 ) Ov-medG 1.3929 x 186.01 ; ( R 2 = 0.292 ) Ov-mnG 0.0834 x 8.4526 ; ( R 2 = 0.252 ) Wn-mxVD 0.0171 x + 2.4307 ; ( R 2 = 0.239 )
T-sugarsOv-skG 7.6094 x 5.4483 ; ( R 2 = 0.417 ) VEG 49.18 x + 5.7874 ; ( R 2 = 0.407 ) TSSGl-cont 0.0642 x + 10.46 ; ( R 2 = 0.344 ) Gl-cont 0.0509 x + 10.42 ; ( R 2 = 0.217 )
Ov-stdR 0.3869 x + 5.0093 ; ( R 2 = 0.376 ) COMB1 6.2159 x 30.77 ; ( R 2 = 0.405 ) Ov-mnG 0.0765 x + 19.10 ; ( R 2 = 0.343 ) Ov-skG 0.8118 x + 7.7602 ; ( R 2 = 0.205 )
Ov-medR 0.3685 x + 76.27 ; ( R 2 = 0.338 ) COMB2 4.3502 x 31.46 ; ( R 2 = 0.405 ) Ov-GmnG 0.0439 x + 14.61 ; ( R 2 = 0.308 ) Ov-medR 0.0417 x + 16.82 ; ( R 2 = 0.203 )
Vitamin CRn-avVD 63.05 x 26.61 ; ( R 2 = 0.602 ) Wn-Gmn 3.7138 x + 535.69 ; ( R 2 = 0.442 )
Wn-mn 2.7542 x + 361.94 ; ( R 2 = 0.425 ) Wn-mn 3.7273 x + 543.16 ; ( R 2 = 0.437 )
Wn-Gmn 2.7725 x + 357.73 ; ( R 2 = 0.423 ) Wn-mxVD 3.7988 x + 416.03 ; ( R 2 = 0.425 )
Wet Chem.—Wet Chemistry. Slice—cut cashew apples sample type, and Whole—whole cashew apples sample type. Short forms for the image-based independent parameters were used. They are combined to make the image-based independent properties; refer to Table 1 to obtain the full form. Short forms: am—arithmetic mean, av—average, B—blue, cont—contrast, corr—correlation, G—green, Gl—gray level co-occurrence matrix, Gmn—geometric mean, Gr—gray, IDM—inverse difference moment, kurt—kurtosis, L—local binary pattern, med—median, mi—minimum, mx—maximum, Ov—overall, PH—peak height, pvh—peak to valley height, rs—root mean square, R—red, Rn—roughness. sk—skewness, std—standard deviation, VD—valley depth, and Wn—waveness; color-based indices: CIVE, COMB1, COMB2, ExG, ExG2, GBVI, GCC, GRD, RBD, RBDR, RGD, RGDR, RGBVI, VDVI, VEG, and WI are available from Table 2; MC—moisture content; and TSS—total soluble solids.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gupta, M.J.; Igathinathane, C.; Nishad, J.; Tazeen, H.; Joice, A.; Sunoj, S.; Mohan, A.; Kumar, P.; Adiga, J.D. Assessment of Physicochemical Properties of Cashew Apple Through Computer Vision. AgriEngineering 2025, 7, 398. https://doi.org/10.3390/agriengineering7120398

AMA Style

Gupta MJ, Igathinathane C, Nishad J, Tazeen H, Joice A, Sunoj S, Mohan A, Kumar P, Adiga JD. Assessment of Physicochemical Properties of Cashew Apple Through Computer Vision. AgriEngineering. 2025; 7(12):398. https://doi.org/10.3390/agriengineering7120398

Chicago/Turabian Style

Gupta, Mathala Juliet, C. Igathinathane, Jyoti Nishad, Humeera Tazeen, Astina Joice, S. Sunoj, Anand Mohan, Parveen Kumar, and Jamboor Dinakara Adiga. 2025. "Assessment of Physicochemical Properties of Cashew Apple Through Computer Vision" AgriEngineering 7, no. 12: 398. https://doi.org/10.3390/agriengineering7120398

APA Style

Gupta, M. J., Igathinathane, C., Nishad, J., Tazeen, H., Joice, A., Sunoj, S., Mohan, A., Kumar, P., & Adiga, J. D. (2025). Assessment of Physicochemical Properties of Cashew Apple Through Computer Vision. AgriEngineering, 7(12), 398. https://doi.org/10.3390/agriengineering7120398

Article Metrics

Back to TopTop