Next Article in Journal
Species-Specific and Altitude-Induced Variation in Karst Plants’ Use of Soil Dissolved Inorganic Carbon
Previous Article in Journal
Seed-Germination Ecology of Vicia villosa Roth, a Cover Crop in Orchards
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Non-Destructive Estimation of Fruit Weight of Strawberry Using Machine Learning Models

by
Jayanta Kumar Basak
1,2,
Bhola Paudel
3,
Na Eun Kim
3,
Nibas Chandra Deb
3,
Bolappa Gamage Kaushalya Madhavi
3 and
Hyeon Tae Kim
3,*
1
Institute of Smart Farm, Gyeongsang National University, Jinju 52828, Korea
2
Department of Environmental Science and Disaster Management, Noakhali Science and Technology University, Noakhali 3814, Bangladesh
3
Department of Bio-Systems Engineering, Institute of Smart Farm, Gyeongsang National University, Jinju 52828, Korea
*
Author to whom correspondence should be addressed.
Agronomy 2022, 12(10), 2487; https://doi.org/10.3390/agronomy12102487
Submission received: 2 September 2022 / Revised: 26 September 2022 / Accepted: 6 October 2022 / Published: 12 October 2022

Abstract

:
Timely monitoring of fruit weight is a paramount concern for the improvement of productivity and quality in strawberry cultivation. Therefore, the present study was conducted to introduce a simple non-destructive technique with machine learning models in measuring fruit weight of strawberries. Nine hundred samples from three strawberry cultivars, i.e., Seolhyang, Maehyang, and Santa (300 samples in each cultivar), in six different ripening stages were randomly collected for determining length, diameter, and weight of each fruit. Pixel numbers of each captured fruit’s image were calculated using image processing techniques. A simple linear-based regression (LR) and a nonlinear regression, i.e., support vector regression (SVR) models were developed by using pixel numbers as input parameter in modeling fruit weight. Findings of the study showed that the LR model performed slightly better than the SVR model in estimating fruit weight. The LR model could explain the relationship between the pixel numbers and fruit weight with a maximum of 96.3% and 89.6% in the training and the testing stages, respectively. This new method is promising non-destructive, time-saving, and cost-effective for regularly monitoring fruit weight. Hereafter, more strawberry samples from various cultivars might need to be examined for the improvement of model performance in estimating fruit weight.

1. Introduction

Strawberry (Fragaria x ananassa) is one of the most high-value berry fruits across the world, having distinctive flavor, appearance, firmness, and chemical composition (anti-inflammatory properties, vitamin C, antioxidants, flavonoids, sugar, and organic acids) [1,2,3]. In recent times, its consumption has increased widely due to the high level of nutritional content and health benefit [4]. Being non-climateric, faster fruit-bearing, smaller plant size, shorter reproductive stage, and earlier maturation compared to other berries, it has been extensively cultivated all over the world [5]. Due to the value of these quality attributes of strawberries, the global production of it has doubled in the last 20 years to 8.8 million tons [4]. Hundreds of strawberry cultivars have been grown under numerous breeding projects worldwide for increasing the production level as well as for improving the fruit qualities, mainly color, weight, size, and internal compositions of fruits.
In strawberries, among the fruit quality parameters, weight is one of the most important factors in understanding water and nutrient use efficiency, biomass composition, fruit consistency, identification of cultivar, determining crop values, and fruit acceptance by consumers [6]. Moreover, fruit weight is highly sensitive to environmental conditions and management practices; therefore, most of the strawberry cultivars are grown in a greenhouse, where the microclimate and management processes can be precisely maintained [7,8]. In this circumstance, frequently monitoring fruit weight while considering environmental conditions and growth techniques is a paramount concern in strawberry cultivation [9]. Knowing fruit weight in advance helps growers to make more informed decisions regarding irrigation and nutrient management, harvest scheduling, and to optimize profit. Therefore, developing a constructive as well as comprehensible technique for measuring the weight of strawberries regularly using a machine vision technique is essential to improve the productivity as well as the quality of fruits.
The weight of strawberries is commonly estimated by instrumental assessment using a measuring scale. The main advantages of this technique are being user friendly and producing reliable results [10]. This technique can measure a large number of strawberries at the same time [10,11]. However, the traditional procedures are considered destructive, labor-demanding, and time-consuming, and they sometimes give low accuracy due to measurement errors [12,13]. Moreover, frequently monitoring strawberry weight in field conditions may be troublesome when using a measuring scale procedure [14]. Thus, the fruit weight prediction models based on machine learning algorithms and non-destructive measurements are useful. Machine learning algorithms develop models that are used to carry out a particular task based on a provided dataset [15]. In machine learning processes, data are fed to its algorithms, where they learn from the data to build models through a training process for making predictions and to optimize their operations to improve performance by developing intelligence over time [16]. Concerning processing ability to deal with complex data mapping relations, machine learning algorithms have been widely applied in agriculture fields over the past two decades, such as in plant breeding [15], in vitro culture [17], stress phenotyping [18], plant system biology [19], plant identification [20] and pathogen identification [21]. In particular, machine learning algorithms have undergone substantial development and have been applied along with non-destructive methods to estimate fruit weight. They have also received increasing attention as they outweigh the traditional destructive procedures with several advantages, including high throughput measurement, simultaneous multiple assessments and real-time decision making [22].
Recently, few papers have focused on machine learning techniques combined with digital image processing to estimate fruit weight [23,24,25]. Zhang et al. [26] estimated the weight and volume of apples using a 3D reconstruction technique which includes noncontact measuring methods involving computer vision, and the results showed that the least squares support vector machine model was found to have a higher correlation between the projected and actual weight of apples with a determination coefficient (R2) equal to 0.860. Teoh and Syaifudin [27] also described an image processing technique for predicting the weight of mango where the procedure can explain a maximum of 0.934 of the changes in measured and projected data in the training stage. In the case of strawberry, a number of studies have been conducted to develop models based on machine learning algorithms and non-destructive techniques for predicting volume, firmness, and internal quality parameters (total soluble solids content, acidity, sugar, and moisture content) [3,28,29] and the classification of ripeness level of strawberries, which is related to grading and sorting of fruits [30,31,32,33]. However, estimation of fruit weight of strawberries using non-destructive techniques is limited in the available literature. Hence, in this study, we have introduced a non-destructive, time-saving, and cost-effective method where the pixel numbers of strawberry images and machine learning algorithms have been used to develop models for measuring fruit weight.
Pixel numbers acquired by image processing techniques have been widely applied for the analysis of plants and animals research [34,35,36,37]. Images are first captured using a digital camera; subsequently, the image processing technique is applied for pixel numbers calculation. A number of research studies have been carried out to employ pixel number and linear and nonlinear-based machine learning models in measuring the volume of oranges [38], apples [9], tomatoes [39], and strawberries [40]. The effectiveness of these models in estimating volume of fruit is two-fold. Initially, the models can be employed to find the changes in volume in various ripening periods of fruit, which may give rise to enhanced knowledge. Subsequently, they are useful for monitoring fruit quality to explain the relations between the volume and pixel numbers. The main purpose of these models is to establish a relationship between the explanatory and response variables [41]. The most popular type of regression-based model is the simple linear form, which is widely applied in diverse research areas [41,42,43]. However, for nonlinear data, a linear-based regression model (LR) is difficult to perform well. Hence, machine learning-based nonlinear models such as support vector regression (SVR) are commonly used [44,45,46]. Therefore, in this study, we have developed both linear and nonlinear-based machine learning models to predict fruit weight and to compare their performance. The main objectives of this present study are three-fold: first, to determine the biometrical characteristics, i.e., length, diameter, and weight at the six ripening periods of three strawberry cultivars; second, to acquire pixel numbers of strawberry images using image processing techniques; and finally, to develop LR and SVR-based models using the pixel numbers for estimating the fruit weight of strawberries.

2. Materials and Methods

2.1. Sample Collection of Strawberries

Three strawberry cultivars, i.e., Seolhyang, Maehyang, and Santa, the most popular and commonly grown in South Korea, were selected for this experiment. Each variety of strawberry was separately grown in three greenhouses at Smart Farm Systems Laboratory of Gyeongsang National University during the period of September 2021 to January 2022. Five hundred strawberry plants of each cultivar were cultivated in individual greenhouses under the combination of Hoagland solution and bio plus compost soil (Figure 1). The bio plus compost soil includes zeolite (9.00%), peat moss (11.00%), perlite (11.00%) and cocopeat (68.86),which is a well-known medium for strawberry cultivation of different varieties inside greenhouses in Korea [46]. Daily application of irrigation water ranged from 20–30 mL for each plant during the early stages of strawberry growth to 30–50 mL for each plant for overall ripening stage [47]. During the whole growing stages of strawberry, the inside air temperature of all three greenhouses was maintained using Farm-link control system (Farmlink™ v 3.0, Jinju, Gyeongnam, Republic of Korea), while the illuminance, humidity and CO2 were monitored through MCH 383SD (Lutron Co. Ltd., Taipei, Taiwan) sensor unit.
For collecting the fruit samples of each strawberry cultivar, harvesting was performed at six distinct ripening stages, i.e., Dark red, Bright red, Three-quarter red, Half red, Turning and Whiting. The ripening stages were identified and divided according to the development of color on the skin, which ranged from dark red (final stage) to white (initial stage) [6]. For the measurement of biometrical parameters, i.e., length, diameter, and fresh weight, 50 fruit samples were collected in each ripening stage of strawberry cultivars.

2.2. Measurement of Biometrical Characteristics

The fruit sample was cleaned with distilled water, drained and followed by cleaning with the use of paper towels to remove extra water from their surface before the measurement of length, diameter, and fresh weight. A total of 300 samples of each cultivar were used to determine their length, diameter and fresh weight. The fresh weight of each strawberry (in g) was measured using a digital balance (FX-300iWP, A&D Company Ltd., Tokyo, Japan), whereas the diameter and stem length (in mm) were measured by using a digital Vernier caliper (Model: E03-150 122–521, Datac Co., Ltd., Seoul, Korea).

2.3. Image Acquisition of Strawberry Fruits

Each strawberry sample’s image was acquired using a laboratory-based imaging system (Figure 2). A rectangular light chamber of dimension (80 × 80 × 80 cm) was used as studio to capture an image of each fruit sample using a camera setup (SONY DSC-RX100 vii, Sony, Seoul, Korea). The acquired image was in RGB format with resolution of 5472 × 3648 pixels. The light chamber consists of dual light-emitting diodes (LED) strips, where each strip of lamps had a 10 W total light emitting capacity [3]. In order to prevent shadows on the strawberry and to obtain a high-quality image, the interior of the light chamber (with the exception of the bottom surface) was made of an aluminum surface. To create a uniform background, the bottom surface of the light chamber was made of a black surface. The sample was placed at a fixed distance of 80 cm from the camera lens. Estimation of pixel numbers from spherical or quasi-spherical images is quite easy because they have a strong correlation with some dimensional parameters of the 2D projected region [48]; however, it is very hard to calculate for strawberries due to its natural irregularities in shape. In order to minimize this limitation, we captured images of each strawberry on both sides of the horizontal plane. The same procedure was followed during the image collection period for all 900 strawberry samples.

2.4. Pixel Numbers Calculation from Strawberry Image

Open-source Python libraries (Python 3.7.0) were used to process each image for calculating their number of pixels. Since our study focused only on the fruit area, both the opposite sides of each strawberry image were segmented from their background by employing remove.bg platform, and it was subsequently resized by 500 × 500 pixels. The background removal images were converted to Red, Green, Blue (RGB) and binary images consequently for estimating pixel numbers by using image processing algorithms. Figure 3 shows the typical strawberry sample with background removal (BR), RGB, and binary image in each cultivar.

2.5. Data Pre-Processing for Prediction Models Development

In this study, overall datasets, i.e., pixel numbers of each image and fruit weight, were obtained in all three cultivars of 900 strawberries (Table 1). Data preprocessing techniques commonly included four steps, i.e., missing data analysis, feature extraction, data normalization, and partitioning training and testing data. Since there were no missing values in the measured data; therefore, no method for imputing the missing data was taken into consideration in the current study. Throughout the model development stage, only one variable (pixel numbers) was considered as an input parameter; hence, feature extraction and data normalization methods were not applied during the data preprocessing time [44]. Additionally, the performance of machine learning models also depends on the size of data partition during the training and testing periods [3]. Moreover, sources of uncertainty in machine learning models arise when the testing and training data are mismatched due to the presence of noise in the dataset [49]. A number of studies used different portions of their data in training and testing stages such as 70% and 30% (training and testing), 80% and 20%, and 90% and 10% to reduce the uncertainty [50,51,52]. After evaluating the model’s performance with the three data subsets (70:30, 80:20, and 90:10), we used 80% of the collected data for training purpose and the remaining 20% of data for testing purposes during the model development.

2.6. Development of Linear Regression (LR) Model

Linear regression is the initial form of regression analysis where the relationship between the dependent and independent variables is considered in the linear approach. It is a modeling technique where the relationship is modeled using linear predictor functions [53]. The LR model has a wide range of applications in three major areas, i.e., for formulating numerical predictions, for examining the relationship between the variables and for time series modeling. It is commonly applied in various sectors such as crop yield prediction, weather forecasting, electricity demand estimation, and business projection, among others [43,54]. The LR model is represented by the following Equation (1).
Y i = β o + β 1 X i + ε i
where the dependent variable Y i is represented of a linear function β o + β 1 X i of the explanatory variable X i together with an error term ε i . β o and β 1 are the intercept parameter (bias term) and slope parameter (coefficient), respectively. The basic structure of LR is presented in Figure 4a.

2.7. Development of Support Vector Regression (SVR)

Support vector machine is a powerful non-linear technique that makes a hyperplane or set of hyperplanes in a high-dimensional space for classification called as support vector classification (SVC), and the support vector regression (SVR) is developed on the same principle of SVC [55,56]. The performance of the SVR model is influenced by the basic kernel functions, since selection of the kernel functions is important for handling non-linear relationships more efficiently [57]. After testing on a number of SVR structures applying the three kernel functions, i.e., polynomial, sigmoid functions and radial basis function (RBF) with gamma (γ), penalty parameter of the error term (C) and epsilon (ε), in this study, as a kernel type, we decided to use a radial basis function (RBF) with γ = 0.5, C = 50 and ε = 0.1. The predicted fruit weight of strawberries was followed by the SVR operated Equation (2).
Y ^ 0 = i = 1 n K ( X i , X 0 ) ( α i α i * ) ; K ( X i , X 0 ) = exp ( | X i X j | ) 2 Y
In the equation, α i and α i * represent support vectors; n represents the numbers of datas; K ( X i , X 0 ) denotes the radial basis function. The fundamental structure of the one-dimensional SVR is shown in Figure 4b.

2.8. Application Methodology and Model Performance Metrics

Python open source libraries (Python 3.7.0) have been employed to develop LR and SVR models in this study. Python, a high-level programming language, is widely applied in diverse research areas. In the Python environment, NumPy, Pandas, and Matplotlib were used for data management, processing, and presentation [58,59]. Two performance metrics, i.e., root mean square error (RMSE) (Equation (3)), and coefficient of determination (R2) (Equation (4)) have been used to evaluate the models’ performance. Statistical analysis was conducted using Statistical Package for the Social Sciences (IBM, SPSS Statistics 22.0.0.0, New York, NY, USA), and the data are represented as figures using Origin Pro 9.5.5. (OriginLab, Northampton, MA, USA). Figure 5 demonstrates the summative workflow of the current study methodology.
R M S E = 1 n t = 1 n   ( y t   a c t u a l   y t   p r e d i c t e d ) 2
R 2 = 1 t = 1 n ( y t   a c t u a l   y t   p r e d i c t e d ) 2 t = 1 n ( y t   a c t u a l   y t   m e a n ) 2

3. Results and Discussion

3.1. Changes in Biometrical Characteristics and Pixel Numbers

For analyzing biometrical characteristics (length, diameter, and weight) and pixel numbers of fruit images, a total number of 900 strawberries in three cultivars, i.e., Seolhyang, Maehyang, and Santa were collected from experimental greenhouses. The summary statistics of each measurement of length, diameter, weight, and pixel numbers of fruit images are shown in Table 2. Length, diameter, weight, and pixel numbers of strawberry images changed significantly in Maehyang and Santa cultivars (p < 0.05); however, the variations were not statistically significant in Seolhyang with the other two cultivars (Maehyang and Santa) for diameter, weight and pixel numbers.
Biometrical data of strawberries showed that shape of Santa cultivar was larger compared to Seolhyang, and Maehyang. Length, diameter, and weight in tested Maehyang (Length = 35.36–50.37 mm; diameter = 26.10–38.66 mm; weight = 11.21–36.67 g), Seolhyang (Length = 29.42–51.42 mm; diameter = 25.47–40.34 mm; weight = 10.88–38.45 g) and Santa (Length = 34.51–53.58 mm; diameter = 27.52–42.72 mm; weight = 12.10–40.24 g) was into the range mentioned in literature for the three strawberry cultivars [60]. ‘Pajaro,’, ‘Chandler,’, and ‘Selva’ strawberry cultivars showed similar biometrical characteristics [61,62,63]. Similar to biometrical parameters, pixel numbers also obtained the highest for the Santa cultivar, followed by Seolhyang and Maehyang. This result indicated that the pixel numbers of strawberry images increased with the increase in the shape and weight of fruit. Similar findings were also reported for ‘Oso Grande’ and ‘Sweet Charly’ strawberries [64]. In this study, fruit length and diameter of each strawberry are scarcely described. Moreover, in general, fruits collected in three strawberry cultivars showed good biometrical characteristics.

3.2. LR Model Performance for Fruit Weight Prediction

The performance of the linear regression model mainly depends on the existence of a linear relationship between the explanatory and response variables [42]. A simple linear model is widely applied in diverse research fields due to its simplicity of nature along with an easy interpretation of its outcome [41,46,65]. In the current study, we applied a linear regression model, i.e., simple linear regression (LR) in predicting the fruit weight of strawberries. The pixel numbers of images in three strawberry cultivars have been used as an input parameter for developing LR model. The performances of LR in predicting fruit weight are shown in Table 3.
The study results showed that the highest R2 (0.963 and 0.896) and lowest RMSE (0.712 and 1.054) were observed for the D1 dataset, indicating that the LR model can explain a maximum of 96.3% and 89.6% of the variations of actual and predicted data during the training and testing stages, respectively. Contrary to that, the lowest performance was observed for the D3 dataset, where the lowest R2 (0.947 and 0.860) and the maximum RMSE (0.856 and 1.207) were found in the training and testing stages, respectively. The LR model developed by the D1 dataset could predict fruit weight with a 1.70% and 4.19% increase in R2 and a reduction of 16.82% and 12.68% in RMSE in the training and testing stages, respectively, compared to the LR model developed by the D3 dataset. Apart from the two datasets (D1 and D3), the LR model also performed better in predicting fruit weight using the D2 and D4 datasets. The measured and predicted values of fruit weight in the D1, D2, D3 and D4 datasets for the LR model in the testing stage are illustrated in scatter plots and time series graph (Figure 6). According to Figure 6, it is shown that the measured data fit well with the predicted data for the D1 dataset and the values were very close to the line 1:1 compared to the D2, D3, and D4 datasets. Lee et al. [40] established a linear regression model to predict strawberry volume with coefficients of determination of 0.866 and 0.603 in the training and testing stages, respectively. Omid et al. [34] estimated weight by measured volume using an image processing technique and found the coefficient of determination for lemon, lime, orange, and tangerine, which were 0.962, 0.970, 0.985, and 0.959, respectively, which were almost similar results compared to this study. The mass model of pomegranate developed by volume using a water displacement method was explained as a linear form of mass (M) = 0.96 volume (V) + 4.25 [66]. Similar experiments were also conducted by employing digital image processing techniques to predict the volume and mass of oranges [38], lettuce [35] and tomatoes [39]. These studies established a linear relationship between the volume and weight of fruit; however, in this study, we measured the strawberry fruit weight using a non-destructive method with high accuracy.

3.3. SVR Model Performance for Fruit Weight Prediction

The support vector regression (SVR) can process the non-linear data effectively and fit very well; as a result, the performance is comparatively better than linear models [30]. Since LR is a linear-based regression model, a nonlinear-based regression model, i.e., SVR has been performed in predicting fruit weight and evaluating the performance of the two models. For the non-linear approach, the SVR model is widely used in a number of studies to measure the fruit quality parameters [3,56,67]. Like the LR model, the four datasets, i.e., D1, D2, D3, and D4, were also used to develop the SVR model and to compare the performance among the datasets for predicting fruit weight.
The performance of the SVR model for each dataset in predicting fruit weight is shown in Table 3. Based on the R2 and RMSE values, the findings of this study showed that the SVR with D1 dataset provided better performance compared to D2, D3, and D4. According to the outcomes of SVR, the highest R2 (training = 0.942 and testing = 0.856) and the lowest RMSE (training = 0.891 and testing = 1.239) values were obtained for D1, which indicates that SVR could explain a maximum of 94.2% and 85.6% in the training and testing period, respectively. However, the worst performance was obtained for the D3-based SVR model (Table 3). Moreover, the SVR model developed on the D1 dataset could predict fruit weight with a 1.30% and 3.13% increase in R2 and 9.45% and 11.63% reduction in RMSE in training and testing, respectively, compared to the D3 dataset. Nyalala et al. [68] estimated the tomato weight using the computer vision and radial basis function–SVR (RBF-SVR) and Bayesian artificial neural network (Bayesian-ANN), and the proposed technique obtained an average accuracy of 95% and 96% using RBF-SVR and Bayesian-ANN models, respectively, for weight prediction, which were almost similar results compared to the present study. Moreover, Khojastehnazhand et al. [69], El Hariri et al. [70], and Fellegari and Navid [71] determined the volume and weight of tangerine, tomato, and orange, respectively, using image processing with machine learning algorithms (SVR). According to the findings of these three studies, it can be concluded that the performances of the SVR model developed in our study using D1, D2, D3, and D4 in estimating strawberry fruit weight were within the acceptable levels. In addition, the actual and predicted data on fruit weight obtained from SVR with D1, D2, D3, and D4 were shown in the time series graph and scatter plots (Figure 7). As shown in Figure 7, the predicted data and the observed data fit well; similarly, the values were near the 1:1 line, suggesting that SVR had a high accuracy level in estimating the fruit weight of strawberry.

3.4. Comparison LR and SVR Model’s Performance and Proposed Model

Regarding the results of comparison between the LR and SVR models with four datasets D1, D2, D3, and D4, it was observed that the performances of both models were almost similar for all four datasets. However, among the four datasets, the D1 (Maehyang) performed best for both LR and SVR models in predicting fruit weight. This finding indicated that the shape of the strawberry cultivars may affect the pixel numbers of captured images, which are directly related to fruit weight. Uniform shape of any object gives a good quality image for acquiring precise information [72,73,74]. The Maehyang and Seolhyang strawberry cultivars have a conical and almost uniform shape, whereas Santa is a large size strawberry cultivar and sometimes follows an irregular shape [60]. This may be one of the main reasons for lower performance of the D3 (Santa) dataset compared to D1 (Maehyang), D2 (Seolhyang), and D4 (combined three datasets) in the current study. Similar results were also reported by Agulheiro-Santos et al. [75] and Guo et al. [76].
Moreover, the LR model had a slightly better performance compared to the SVR model in predicting fruit weight of strawberries (Figure 8). Based on the performance metrics (R2 and RMSE), the results of the current study showed that the selected LR model based on D1 could predict fruit weight with a 2.23% and 4.67% higher R2 and with 20.10% and 14.93% lower RMSE in training and testing periods, respectively, compared to the SVR model with the D1 dataset. One of the main reasons might be a certain portion of the relationship between the explanatory (pixel numbers) and response (fruit weight) variables exiting a linear relation, and the linear data could play a significant role in the better performance of the LR model. Some studies also noted that SVR models could not increase the prediction accuracy compared to simple linear models due to the linear nature of the variables [45,77,78]. In addition, the cumulative distribution function that resulted from the LR model achieved a residual value of 78.33% of the observed and predicted data of fruit weight between the ranges from −1.0 to 1.0, whereas it was 63.67% for the SVR model in the same boundary limit (Figure 9). As shown in Figure 9, there was a linear relationship between the actual and predicted data of fruit weight. Moreover, models developed in the current study using machine learning algorithms might be subjected to noise and model inference errors. It is thus highly desirable to represent uncertainty in a trustworthy manner in any machine learning-based process [79,80]. Therefore, it is vital to evaluate the reliability and efficacy of machine learning models by conducting more experiments with a large number of samples; thereafter, they could be applied in real fields.

4. Conclusions

The experiment was conducted to assess the performance of machine learning models, i.e., LR and SVR models for the fruit weight prediction of three strawberry cultivars using image processing techniques. The finding of this study showed that there was a significant variation in length, diameter, and fruit weight between the Maehyang and Santa cultivars. The pixel numbers of fruit images acquired from the image processing techniques were used as input variables for developing fruit weight prediction models, and the performance of the models was measured using R2 and RMSE. The results of the study noted that the prediction accuracy of the LR model was slightly better than SVR model in estimating fruit weight, indicating a linear relationship between the pixel numbers and fruit weight. Among the four datasets (D1, D2, D3, and D4), LR and SVR models achieved the best performance for the D1 (Maehyang) dataset, which may be due to its conical and almost uniform shape. The LR and SVR models with the D1 dataset could explain a maximum of 96.3% and 94.2% in the training stage and 89.6% and 85.6% in the testing stage, respectively, of the variations of the pixel numbers and fruit weight data. In conclusion, strawberry fruit weight was satisfactorily predicted on the basis of the pixel numbers. Hereafter, more samples from various strawberry cultivars might need to be examined to determine whether the LR and SVR models performed better than the other regression-based models, such as Elastic Net, Decision Tree, Random Forest, in predicting fruit weight. This new method is promising non-destructive, time-saving and cost-effective for timely monitoring of fruit weight; hence, the next experiment will give attention to its application in the real agriculture field.

Author Contributions

J.K.B. conceived and designed the experiment, performed the experiment, analyzed and interpreted the data, and wrote the paper. H.T.K.; B.P., N.C.D. and B.G.K.M. supervised and reviewed and edited the article. N.E.K. helped during the experimental setup and data collection period as well as project administration. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been financially supported by the Korea Institute of Planning and Evaluation for Technology in Food, Agriculture, Forestry and Fisheries (IPET) through Agriculture, Food and Rural Affairs Convergence Technologies Program for Educating Creative Global Leader, funded by the Ministry of Agriculture, Food and Rural Affairs (MAFRA) (717001-7) and in part by the Brain Pool program through the National Research Foundation of Korea (2021H1D3A2A02038875).

Data Availability Statement

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Simpson, D. The Economic Importance of Strawberry Crops. In The Genomes of Rosaceous Berries and Their Wild Relatives; Springer: Berlin/Heidelberg, Germany, 2018; pp. 1–7. [Google Scholar]
  2. Giampieri, F.; Forbes-Hernandez, T.Y.; Gasparrini, M.; Alvarez-Suarez, J.M.; Afrin, S.; Bompadre, S.; Quiles, J.L.; Mezzetti, B.; Battino, M. Strawberry as a Health Promoter: An Evidence Based Review. Food Funct. 2015, 6, 1386–1398. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Basak, J.K.; Gamage, B.; Madhavi, K.; Paudel, B.; Kim, N.E.; Kim, H.T. Prediction of Total Soluble Solids and pH of Strawberry Fruits Using RGB, HSV and HSL Colour Spaces and Machine Learning Models. Foods 2022, 11, 2086. [Google Scholar] [CrossRef] [PubMed]
  4. Amoriello, T.; Ciccoritti, R.; Ferrante, P. Prediction of Strawberries’ Quality Parameters Using Artificial Neural Networks. Agronomy 2022, 12, 963. [Google Scholar] [CrossRef]
  5. Zhou, C.; Hu, J.; Xu, Z.; Yue, J.; Ye, H.; Yang, G. A Novel Greenhouse-Based System for the Detection and Plumpness Assessment of Strawberry Using an Improved Deep Learning Technique. Front. Plant Sci. 2020, 11, 559. [Google Scholar] [CrossRef]
  6. de Jesús Ornelas-Paz, J.; Yahia, E.M.; Ramírez-Bustamante, N.; Pérez-Martínez, J.D.; del Pilar Escalante-Minakata, M.; Ibarra-Junquera, V.; Acosta-Muñiz, C.; Guerrero-Prieto, V.; Ochoa-Reyes, E. Physical Attributes and Chemical Composition of Organic Strawberry Fruit (Fragaria x ananassa Duch, Cv. Albion) at Six Stages of Ripening. Food Chem. 2013, 138, 372–381. [Google Scholar] [CrossRef]
  7. Suárez, J.C.; Casanoves, F.; Di Rienzo, J. Non-Destructive Estimation of the Leaf Weight and Leaf Area in Common Bean. Agronomy 2022, 12, 711. [Google Scholar] [CrossRef]
  8. Tagawa, A.; Ehara, M.; Ito, Y.; Araki, T.; Ozaki, Y.; Shishido, Y. Effects of CO2 Enrichment on Yield, Photosynthetic Rate, Translocation and Distribution of Photoassimilates in Strawberry ‘Sagahonoka’. Agronomy 2022, 12, 473. [Google Scholar] [CrossRef]
  9. Zhang, B.; Gu, B.; Tian, G.; Zhou, J.; Huang, J.; Xiong, Y. Challenges and Solutions of Optical-Based Nondestructive Quality Inspection for Robotic Fruit and Vegetable Grading Systems: A Technical Review. Trends Food Sci. Technol. 2018, 81, 213–231. [Google Scholar] [CrossRef]
  10. Kohyama, K.; Masuda, T.; Shimada, H.; Tanaka, T.; Wada, Y. A Simple Mechanical Index of Storage Quality of Strawberry Fruits. Rep. Natl. Food Res. Inst. 2013, 77, 1–11. [Google Scholar]
  11. Ruan, J.; Lee, Y.H.; Hong, S.J.; Yeoung, Y.R. Sugar and Organic Acid Contents of Day-Neutral and Ever-Bearing Strawberry Cultivars in High-Elevation for Summer and Autumn Fruit Production in Korea. Hortic. Environ. Biotechnol. 2013, 54, 214–222. [Google Scholar] [CrossRef]
  12. Diago, M.P.; Tardaguila, J.; Aleixos, N.; Millan, B.; Prats-Montalban, J.M.; Cubero, S.; Blasco, J. Assessment of Cluster Yield Components by Image Analysis. J. Sci. Food Agric. 2015, 95, 1274–1282. [Google Scholar] [CrossRef]
  13. Linares Torres, R.; Baeza Trujillo, P.; Miranda, C.; Lissarrague Garcia-Gutierrez, J.R. Comparison of Different Methods of Grapevine Yield Prediction in the Time Window between Fruitset and Veraison. J. Int. Sci. Vigne Vin 2015, 49, 27–35. [Google Scholar]
  14. Demirsoy, H.; Demirsoy, L. Prediction Model for Estimating Peach Fruit Weight and Volume on the Basis of Fruit Linear Measurements during Growth. J. Fruit Ornam. Plant Res. 2007, 15, 65–69. [Google Scholar]
  15. van Dijk, A.D.J.; Kootstra, G.; Kruijer, W.; de Ridder, D. Machine Learning in Plant Science and Plant Breeding. iScience 2021, 24, 101890. [Google Scholar] [CrossRef]
  16. Lu, X.; Ma, F.; Xu, J.; Zhang, Z. Oil Futures Volatility Predictability: New Evidence Based on Machine Learning Models1. Int. Rev. Financ. Anal. 2022, 83, 102299. [Google Scholar] [CrossRef]
  17. Hesami, M.; Naderi, R.; Tohidfar, M. Introducing a Hybrid Artificial Intelligence Method for High-Throughput Modeling and Optimizing Plant Tissue Culture Processes: The Establishment of a New Embryogenesis Medium for Chrysanthemum, as a Case Study. Appl. Microbiol. Biotechnol. 2020, 104, 10249–10263. [Google Scholar] [CrossRef]
  18. Singh, A.; Ganapathysubramanian, B.; Singh, A.K.; Sarkar, S. Machine Learning for High-Throughput Stress Phenotyping in Plants. Trends Plant Sci. 2016, 21, 110–124. [Google Scholar] [CrossRef] [Green Version]
  19. Hesami, M.; Alizadeh, M.; Jones, A.M.P.; Torkamaneh, D. Machine Learning: Its Challenges and Opportunities in Plant System Biology. Appl. Microbiol. Biotechnol. 2022, 106, 3507–3530. [Google Scholar] [CrossRef]
  20. Grinblat, G.L.; Uzal, L.C.; Larese, M.G.; Granitto, P.M. Deep Learning for Plant Identification Using Vein Morphological Patterns. Comput. Electron. Agric. 2016, 127, 418–424. [Google Scholar] [CrossRef] [Green Version]
  21. Mishra, B.; Kumar, N.; Mukhtar, M.S. Systems Biology and Machine Learning in Plant–Pathogen Interactions. Mol. Plant-Microbe Interact. 2019, 32, 45–55. [Google Scholar] [CrossRef] [Green Version]
  22. Li, B.; Lecourt, J.; Bishop, G. Advances in Non-Destructive Early Assessment of Fruit Ripeness towards Defining Optimal Time of Harvest and Yield Prediction—A Review. Plants 2018, 7, 3. [Google Scholar] [CrossRef] [Green Version]
  23. Lu, W.; Du, R.; Niu, P.; Xing, G.; Luo, H.; Deng, Y.; Shu, L. Soybean Yield Preharvest Prediction Based on Bean Pods and Leaves Image Recognition Using Deep Learning Neural Network Combined With GRNN. Front. Plant Sci. 2022, 12, 791256. [Google Scholar] [CrossRef]
  24. Zhao, S.; Zheng, H.; Chi, M.; Chai, X.; Liu, Y. Rapid Yield Prediction in Paddy Fields Based on 2D Image Modelling of Rice Panicles. Comput. Electron. Agric. 2019, 162, 759–766. [Google Scholar] [CrossRef]
  25. Yang, W.; Nigon, T.; Hao, Z.; Paiao, G.D.; Fernández, F.G.; Mulla, D.; Yang, C. Estimation of Corn Yield Based on Hyperspectral Imagery and Convolutional Neural Network. Comput. Electron. Agric. 2021, 184, 106092. [Google Scholar] [CrossRef]
  26. Zhang, B.; Guo, N.; Huang, J.; Gu, B.; Zhou, J. Computer Vision Estimation of the Volume and Weight of Apples by Using 3D Reconstruction and Noncontact Measuring Methods. J. Sens. 2020, 2020, 5053407. [Google Scholar] [CrossRef]
  27. Teoh, C.C.; Syaifudin, A.R.M. Image Processing and Analysis Techniques for Estimating Weight of Chokanan Mangoes. J. Trop. Agric. Food Sci. 2007, 35, 183–190. [Google Scholar]
  28. ElMasry, G.; Wang, N.; ElSayed, A.; Ngadi, M. Hyperspectral Imaging for Nondestructive Determination of Some Quality Attributes for Strawberry. J. Food Eng. 2007, 81, 98–107. [Google Scholar] [CrossRef]
  29. Peng, Y.; Lu, R. Analysis of Spatially Resolved Hyperspectral Scattering Images for Assessing Apple Fruit Firmness and Soluble Solids Content. Postharvest Biol. Technol. 2008, 48, 52–62. [Google Scholar] [CrossRef]
  30. Indrabayu, I.; Arifin, N.; Areni, I.S. Strawberry Ripeness Classification System Based on Skin Tone Color Using Multi-Class Support Vector Machine. In Proceedings of the 2019 International Conference on Information and Communications Technology (ICOIACT), Yogyakarta, Indonesia, 24–25 July 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 191–195. [Google Scholar]
  31. Pardede, J.; Husada, M.G.; Hermana, A.N.; Rumapea, S.A. Fruit Ripeness Based on RGB, HSV, HSL, L* A* B* Color Feature Using SVM. In Proceedings of the 2019 International Conference of Computer Science and Information Technology (ICoSNIKOM), Medan, Indonesia, 28–29 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–5. [Google Scholar]
  32. Li, X.; Li, J.; Tang, J. A Deep Learning Method for Recognizing Elevated Mature Strawberries. In Proceedings of the 2018 33rd Youth Academic Annual Conference of Chinese Association of Automation (YAC), Nanjing, China, 18–20 May 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1072–1077. [Google Scholar]
  33. Elhariri, E.; El-Bendary, N.; Hussein, A.M.M.; Hassanien, A.E.; Badr, A. Bell Pepper Ripeness Classification Based on Support Vector Machine. In Proceedings of the 2014 International Conference on Engineering and Technology (ICET), Cairo, Egypt, 19–20 April 2014; IEEE: Piscataway, NJ, USA, 2014; pp. 1–6. [Google Scholar]
  34. Omid, M.; Khojastehnazhand, M.; Tabatabaeefar, A. Estimating Volume and Mass of Citrus Fruits by Image Processing Technique. J. Food Eng. 2010, 100, 315–321. [Google Scholar] [CrossRef]
  35. Jung, D.-H.; Park, S.H.; Han, X.Z.; Kim, H.-J. Image Processing Methods for Measurement of Lettuce Fresh Weight. J. Biosyst. Eng. 2015, 40, 89–93. [Google Scholar] [CrossRef] [Green Version]
  36. Bozkurt, Y.; Aktan, S.; Ozkaya, S. Body Weight Prediction Using Digital Image Analysis for Slaughtered Beef Cattle. J. Appl. Anim. Res. 2007, 32, 195–198. [Google Scholar] [CrossRef]
  37. Lee, J.W.; Son, J.E. Nondestructive and Continuous Freshweight Measurements of Bell Peppers Grown in Soilless Culture Systems. Agronomy 2019, 9, 652. [Google Scholar] [CrossRef] [Green Version]
  38. Sayinci, B.; Ercisli, S.; Ozturk, I.; Eryilmaz, Z.; Demir, B. Determination of Size and Shape in the’Moro’blood Orange and’Valencia’sweet Orange Cultivar and Its Mutants Using Image Processing. Not. Bot. Horti Agrobot. Cluj-Napoca 2012, 40, 234–242. [Google Scholar] [CrossRef] [Green Version]
  39. Campillo, C.; Prieto, M.H.; Daza, C.; Monino, M.J.; Garcia, M.I. Using Digital Images to Characterize Canopy Coverage and Light Interception in a Processing Tomato Crop. Hortscience 2008, 43, 1780–1786. [Google Scholar] [CrossRef]
  40. Lee, D.H.; Cho, Y.; Choi, J.M. Strawberry Volume Estimation Using Smartphone Image Processing. Hortic. Sci. Technol. 2017, 35, 707–716. [Google Scholar]
  41. Basak, J.K.; Eun, N.; Shihab, K.; Shahriar, A.; Paudel, B.; Eun, B. Applicability of Statistical and Machine Learning—Based Regression Algorithms in Modeling of Carbon Dioxide Emission in Experimental Pig Barns. Air Qual. Atmos. Health 2022, 15, 1899–1912. [Google Scholar] [CrossRef]
  42. Basak, J.K.; Okyere, F.G.; Arulmozhi, E.; Park, J.; Khan, F.; Kim, H.T. Artificial Neural Networks and Multiple Linear Regression as Potential Methods for Modelling Body Surface Temperature of Pig. J. Appl. Anim. Res. 2020, 48, 207–219. [Google Scholar] [CrossRef]
  43. Basak, J.K.; Qasim, W.; Okyere, F.G.; Khan, F.; Lee, Y.J.; Park, J.; Kim, H.T. Regression Analysis to Estimate Morphology Parameters of Pepper Plant in a Controlled Greenhouse System. J. Biosyst. Eng. 2019, 44, 57–68. [Google Scholar] [CrossRef]
  44. Arulmozhi, E.; Basak, J.K.; Sihalath, T.; Park, J.; Kim, H.T.; Moon, B.E. Machine Learning-Based Microclimate Model for Indoor Air Temperature and Relative Humidity Prediction in a Swine Building. Animals 2021, 11, 222. [Google Scholar] [CrossRef]
  45. Basak, J.K.; Arulmozhi, E.; Moon, B.E.; Bhujel, A.; Kim, H.T. Modelling Methane Emissions from Pig Manure Using Statistical and Machine Learning Methods. Air Qual. Atmos. Health 2022, 15, 575–589. [Google Scholar] [CrossRef]
  46. Madhavi, B.G.K.; Basak, J.K.; Paudel, B.; Kim, N.E.; Choi, G.M.; Kim, H.T. Prediction of Strawberry Leaf Color Using RGB Mean Values Based on Soil Physicochemical Parameters Using Machine Learning Models. Agronomy 2022, 12, 981. [Google Scholar] [CrossRef]
  47. Khan, F.; Kim, N.E.; Bhujel, A.; Jaihuni, M.; Lee, D.H.; Basak, J.K.; Kim, H.T. Assessment of Combined Trichoderma-Enriched Biofertilizer and Nutrients Solutions on the Growth and Yield of Strawberry Plants. J. Biosyst. Eng. 2021, 46, 225–235. [Google Scholar] [CrossRef]
  48. Cubero, S.; Aleixos, N.; Moltó, E.; Gómez-Sanchis, J.; Blasco, J. Advances in Machine Vision Applications for Automatic Inspection and Quality Evaluation of Fruits and Vegetables. Food Bioprocess Technol. 2011, 4, 487–504. [Google Scholar] [CrossRef]
  49. Phan, B.T. Bayesian Deep Learning and Uncertainty in Computer Vision. Master’s Thesis, University of Waterloo, Waterloo, ON, Canada, 2019. [Google Scholar]
  50. Bhujel, A.; Khan, F.; Basak, J.K.; Jaihuni, M.; Sihalath, T.; Moon, B.E.; Park, J.; Kim, H.T. Detection of Gray Mold Disease and Its Severity on Strawberry Using Deep Learning Networks. J. Plant Dis. Prot. 2022, 129, 579–592. [Google Scholar] [CrossRef]
  51. Basak, J.K.; Arulmozhi, E.; Khan, F.; Okyere, F.G.; Park, J.; Kim, H.T. Modeling of Ambient Environment and Thermal Status Relationship of Pig’s Body in a Pig Barn. Indian J. Anim. Res. 2020, 54, 1049–1054. [Google Scholar] [CrossRef]
  52. Bhujel, A.; Kim, N.E.; Arulmozhi, E.; Basak, J.K.; Kim, H.T. A Lightweight Attention-Based Convolutional Neural Networks for Tomato Leaf Disease Classification. Agriculture 2022, 12, 228. [Google Scholar] [CrossRef]
  53. Kumari, K.; Yadav, S. Linear Regression Analysis Study. J. Pract. Cardiovasc. Sci. 2018, 4, 33–36. [Google Scholar] [CrossRef]
  54. Zhao, T.; Xue, H. Regression Analysis and Indoor Air Temperature Model of Greenhouse in Northern Dry and Cold Regions. In Proceedings of the International Conference on Computer and Computing Technologies in Agriculture, Nanchang, China, 22–25 October 2010; Springer: Berlin/Heidelberg, Germany, 2010; pp. 252–258. [Google Scholar]
  55. Thissen, U.; Pepers, M.; Üstün, B.; Melssen, W.J.; Buydens, L.M.C. Comparing Support Vector Machines to PLS for Spectral Regression Applications. Chemom. Intell. Lab. Syst. 2004, 73, 169–179. [Google Scholar] [CrossRef]
  56. Drucker, H.; Burges, C.J.; Kaufman, L.; Smola, A.; Vapnik, V. Support Vector Regression Machines. In Advances in Neural Information Processing Systems 9; MIT Press: Cambridge, MA, USA, 1996. [Google Scholar]
  57. Sarkar, S.; Basak, J.K.; Moon, B.E.; Kim, H.T. A Comparative Study of PLSR and SVM-R with Various Preprocessing Techniques for the Quantitative Determination of Soluble Solids Content of Hardy Kiwi Fruit by a Portable Vis/NIR Spectrometer. Foods 2020, 9, 1078. [Google Scholar] [CrossRef] [PubMed]
  58. Van Der Walt, S.; Colbert, S.C.; Varoquaux, G. The NumPy Array: A Structure for Efficient Numerical Computation. Comput. Sci. Eng. 2011, 13, 22–30. [Google Scholar] [CrossRef] [Green Version]
  59. McKinney, W. Data Structures for Statistical Computing in Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010; Volume 445, pp. 51–56. [Google Scholar]
  60. Jeong, H.J.; Choi, H.G.; Moon, B.Y.; Cheong, J.W.; Kang, N.J. Comparative Analysis of the Fruit Characteristics of Four Strawberry Cultivars Commonly Grown in South Korea. Hortic. Sci. Technol. 2016, 34, 396–404. [Google Scholar] [CrossRef]
  61. Ferreyra, R.M.; Viña, S.Z.; Mugridge, A.; Chaves, A.R. Growth and Ripening Season Effects on Antioxidant Capacity of Strawberry Cultivar Selva. Sci. Hortic. 2007, 112, 27–32. [Google Scholar] [CrossRef]
  62. Figueroa, C.R.; Pimentel, P.; Gaete-Eastman, C.; Moya, M.; Herrera, R.; Caligari, P.D.S.; Moya-León, M.A. Softening Rate of the Chilean Strawberry (Fragaria Chiloensis) Fruit Reflects the Expression of Polygalacturonase and Pectate Lyase Genes. Postharvest Biol. Technol. 2008, 49, 210–220. [Google Scholar] [CrossRef]
  63. Doymaz, I. Convective Drying Kinetics of Strawberry. Chem. Eng. Process. Process Intensif. 2008, 47, 914–919. [Google Scholar] [CrossRef]
  64. Nunes, M.C.N.; Brecht, J.K.; Morais, A.M.M.B.; Sargent, S.A. Physicochemical Changes during Strawberry Development in the Field Compared with Those That Occur in Harvested Fruit during Storage. J. Sci. Food Agric. 2006, 86, 180–190. [Google Scholar] [CrossRef]
  65. Arulmozhi, E.; Bhujel, A.; Moon, B.-E.; Kim, H.-T. The Application of Cameras in Precision Pig Farming: An Overview for Swine-Keeping Professionals. Animals 2021, 11, 2343. [Google Scholar] [CrossRef] [PubMed]
  66. Khoshnam, F.; Tabatabaeefar, A.; Varnamkhasti, M.G.; Borghei, A. Mass Modeling of Pomegranate (Punica granatum L.) Fruit with Some Physical Characteristics. Sci. Hortic. 2007, 114, 21–26. [Google Scholar] [CrossRef]
  67. Mancini, M.; Mazzoni, L.; Gagliardi, F.; Balducci, F.; Duca, D.; Toscano, G.; Mezzetti, B.; Capocasa, F. Application of the Non-Destructive NIR Technique for the Evaluation of Strawberry Fruits Quality Parameters. Foods 2020, 9, 441. [Google Scholar] [CrossRef] [Green Version]
  68. Nyalala, I.; Okinda, C.; Nyalala, L.; Makange, N.; Chao, Q.; Chao, L.; Yousaf, K.; Chen, K. Tomato Volume and Mass Estimation Using Computer Vision and Machine Learning Algorithms: Cherry Tomato Model. J. Food Eng. 2019, 263, 288–298. [Google Scholar] [CrossRef]
  69. Khojastehnazhand, M.; Omid, M.; Tabatabaeefar, A. Determination of Tangerine Volume Using Image Processing Methods. Int. J. Food Prop. 2010, 13, 760–770. [Google Scholar] [CrossRef]
  70. Elhariri, E.; El-Bendary, N.; Fouad, M.M.M.; Platoš, J.; Hassanien, A.E.; Hussein, A.M.M. Multi-Class SVM Based Classification Approach for Tomato Ripeness. In Innovations in Bio-Inspired Computing and Applications; Springer: Berlin/Heidelberg, Germany, 2014; pp. 175–186. [Google Scholar]
  71. Fellegari, R.; Navid, H. Determining the Orange Volume Using Image Processing. In Proceedings of the Interbational Conference on Food Engineering and Biotechnoly (ICFEB 2011), Bangkok, Thailand, 7–9 May 2011; Citeseer: University Park, PA, USA, 2011; pp. 180–184. [Google Scholar]
  72. Raj, R.; Cosgun, A.; Kulić, D. Strawberry Water Content Estimation and Ripeness Classification Using Hyperspectral Sensing. Agronomy 2022, 12, 425. [Google Scholar] [CrossRef]
  73. Weng, S.; Yu, S.; Guo, B.; Tang, P.; Liang, D. Non-Destructive Detection of Strawberry Quality Using Multi-Features of Hyperspectral Imaging and Multivariate Methods. Sensors 2020, 20, 3074. [Google Scholar] [CrossRef]
  74. Hernandez-Munoz, P.; Almenar, E.; Del Valle, V.; Velez, D.; Gavara, R. Effect of Chitosan Coating Combined with Postharvest Calcium Treatment on Strawberry (Fragaria x ananassa) Quality during Refrigerated Storage. Food Chem. 2008, 110, 428–435. [Google Scholar] [CrossRef]
  75. Agulheiro-Santos, A.C.; Ricardo-Rodrigues, S.; Laranjo, M.; Melgão, C.; Velázquez, R. Non-destructive Prediction of Total Soluble Solids in Strawberry Using near Infrared Spectroscopy. J. Sci. Food Agric. 2022, 102, 4866–4872. [Google Scholar] [CrossRef]
  76. Guo, Z.; Huang, W.; Chen, L.; Wang, X.; Peng, Y. Nondestructive Evaluation of Soluble Solid Content in Strawberry by near Infrared Spectroscopy. In Proceedings of the Piageng 2013: Image Processing and Photonics for Agricultural Engineering, Sanya, China, 27 January 2013; SPIE: Bellingham, WA, USA, 2013; Volume 8761, pp. 128–132. [Google Scholar]
  77. Craninx, M.; Fievez, V.; Vlaeminck, B.; De Baets, B. Artificial Neural Network Models of the Rumen Fermentation Pattern in Dairy Cattle. Comput. Electron. Agric. 2008, 60, 226–238. [Google Scholar] [CrossRef]
  78. Özesmi, S.L.; Tan, C.O.; Özesmi, U. Methodological Issues in Building, Training, and Testing Artificial Neural Networks in Ecological Applications. Ecol. Modell. 2006, 195, 83–93. [Google Scholar] [CrossRef] [Green Version]
  79. Abdar, M.; Pourpanah, F.; Hussain, S.; Rezazadegan, D.; Liu, L.; Ghavamzadeh, M.; Fieguth, P.; Cao, X.; Khosravi, A.; Acharya, U.R. A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges. Inf. Fusion 2021, 76, 243–297. [Google Scholar] [CrossRef]
  80. Lee, D.D.; Pham, P.; Largman, Y.; Ng, A. Advances in Neural Information Processing Systems 22; Tech. Rep.; MIT Press: Cambridge, MA, USA, 2009. [Google Scholar]
Figure 1. Experimental greenhouse for strawberry cultivation.
Figure 1. Experimental greenhouse for strawberry cultivation.
Agronomy 12 02487 g001
Figure 2. Schematic diagram of image acquisition system for strawberry fruits.
Figure 2. Schematic diagram of image acquisition system for strawberry fruits.
Agronomy 12 02487 g002
Figure 3. Strawberry fruit images with raw image, background removal image (BR), RGB image, and binary image of the three different cultivars. Pixel numbers of images and their biometrical data; L: length (mm), D: diameter (mm), W: weight (g) are respectively shown in the rightmost two columns.
Figure 3. Strawberry fruit images with raw image, background removal image (BR), RGB image, and binary image of the three different cultivars. Pixel numbers of images and their biometrical data; L: length (mm), D: diameter (mm), W: weight (g) are respectively shown in the rightmost two columns.
Agronomy 12 02487 g003
Figure 4. Diagrams indicating the structure of (a) linear regression (LR) and (b) support vector regression (SVR).
Figure 4. Diagrams indicating the structure of (a) linear regression (LR) and (b) support vector regression (SVR).
Agronomy 12 02487 g004
Figure 5. Flow diagram of prediction of fruit weight of strawberries with pixel values using linear regression (LR) and support vector regression (SVR) algorthims.
Figure 5. Flow diagram of prediction of fruit weight of strawberries with pixel values using linear regression (LR) and support vector regression (SVR) algorthims.
Agronomy 12 02487 g005
Figure 6. Scatter plots with 1:1 line and time series graph for comparing results between measured and predicted values by LR model with D1, D2, D3, and D4 datasets for fruit weight prediction of strawberries in the testing period. D1: measured vs. predicted data for Maehyang cultivar; D2: measured vs. predicted data for Seolhyang cultivar; D3: measured vs. predicted data for Santa cultivar; D4: measured vs. predicted data for all three cultivars.
Figure 6. Scatter plots with 1:1 line and time series graph for comparing results between measured and predicted values by LR model with D1, D2, D3, and D4 datasets for fruit weight prediction of strawberries in the testing period. D1: measured vs. predicted data for Maehyang cultivar; D2: measured vs. predicted data for Seolhyang cultivar; D3: measured vs. predicted data for Santa cultivar; D4: measured vs. predicted data for all three cultivars.
Agronomy 12 02487 g006
Figure 7. Scatter plots with 1:1 line and time series graph for comparing results between measured and predicted values by SVR model with D1, D2, D3, and D4 datasets for fruit weight prediction of strawberries in the testing period. D1: Measured vs. predicted data for Maehyang cultivar; D2: Measured vs. predicted data for Seolhyang cultivar; D3: Measured vs. predicted data for Santa cultivar; D4: Measured vs. predicted data for all three cultivars.
Figure 7. Scatter plots with 1:1 line and time series graph for comparing results between measured and predicted values by SVR model with D1, D2, D3, and D4 datasets for fruit weight prediction of strawberries in the testing period. D1: Measured vs. predicted data for Maehyang cultivar; D2: Measured vs. predicted data for Seolhyang cultivar; D3: Measured vs. predicted data for Santa cultivar; D4: Measured vs. predicted data for all three cultivars.
Agronomy 12 02487 g007
Figure 8. Taylor diagram of testing and training results of LR and SVR models with D1, D2, D3, and D4 datasets for weight prediction of strawberry fruits.
Figure 8. Taylor diagram of testing and training results of LR and SVR models with D1, D2, D3, and D4 datasets for weight prediction of strawberry fruits.
Agronomy 12 02487 g008
Figure 9. Cumulative distribution analysis from measured and predicted data using LR and SVR models.
Figure 9. Cumulative distribution analysis from measured and predicted data using LR and SVR models.
Agronomy 12 02487 g009
Table 1. Cultivar-wise compositions datasets and number of data fruit weight of strawberries during the testing and training period.
Table 1. Cultivar-wise compositions datasets and number of data fruit weight of strawberries during the testing and training period.
Cultivar NameDatasetNumber of Data
MaehyangD1300
SeolhyangD2300
SantaD3300
All three cultivarsD4900
Table 2. Biometrical characteristics and pixel numbers of images in three strawberry cultivars at ripening stages.
Table 2. Biometrical characteristics and pixel numbers of images in three strawberry cultivars at ripening stages.
Cultivar NameLength (mm)Diameter (mm)Weight (g)Pixel Numbers
Maehyang41.78 ± 3.28 a31.93 ± 2.41 a19.00 ± 3.61 a51,460 ± 7425 a
Seolhyang41.25 ± 3.69 a32.51 ± 2.17 ab20.41 ± 3.34 ab57,579 ± 6776 ab
Santa45.27 ± 2.95 b34.45 ± 2.10 b23.28 ± 3.65 b59,966 ± 8502 b
All three cultivars42.77 ± 3.7733.97 ± 2.4820.90 ± 3.9656,335 ± 8396
Values represent the mean ± the standard deviation of 300 distinct strawberries measurement (N = 300) of each strawberry cultivar. Values mentioned by the same letter within the same column are not significantly different (p < 0.05).
Table 3. Performance metrics (RMSE and R2) of both models for predicting the fruit weight of strawberries during the training and testing period. Italicized values indicate the best performance result for each model.
Table 3. Performance metrics (RMSE and R2) of both models for predicting the fruit weight of strawberries during the training and testing period. Italicized values indicate the best performance result for each model.
Model NameDatasetTrainingTesting
R2RMSER2RMSE
LRD10.9630.7120.8961.054
D20.9500.7850.8711.136
D30.9470.8560.8601.207
D40.9540.7580.8801.101
SVRD10.9420.8910.8561.239
D20.9340.9530.8381.362
D30.9300.9840.8301.402
D40.9360.9460.8401.280
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Basak, J.K.; Paudel, B.; Kim, N.E.; Deb, N.C.; Kaushalya Madhavi, B.G.; Kim, H.T. Non-Destructive Estimation of Fruit Weight of Strawberry Using Machine Learning Models. Agronomy 2022, 12, 2487. https://doi.org/10.3390/agronomy12102487

AMA Style

Basak JK, Paudel B, Kim NE, Deb NC, Kaushalya Madhavi BG, Kim HT. Non-Destructive Estimation of Fruit Weight of Strawberry Using Machine Learning Models. Agronomy. 2022; 12(10):2487. https://doi.org/10.3390/agronomy12102487

Chicago/Turabian Style

Basak, Jayanta Kumar, Bhola Paudel, Na Eun Kim, Nibas Chandra Deb, Bolappa Gamage Kaushalya Madhavi, and Hyeon Tae Kim. 2022. "Non-Destructive Estimation of Fruit Weight of Strawberry Using Machine Learning Models" Agronomy 12, no. 10: 2487. https://doi.org/10.3390/agronomy12102487

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop