Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique

Chen, Yingbo; Wang, Siyu; Xue, Zhankui; Hu, Jijie; Chen, Shaojie; Lv, Zunfu

doi:10.3390/plants14081206

Open AccessArticle

Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique

by

Yingbo Chen

^1,†,

Siyu Wang

^1,†,

Zhankui Xue

²,

Jijie Hu

³,

Shaojie Chen

³ and

Zunfu Lv

^1,*

¹

Zhejiang A&F University, Lin’an, Hangzhou 311300, China

²

Jinhua Agricultural Technology Promotion and Seed Management Center, Jinhua 321000, China

³

Ningbo Agricultural Technology Promotion Station, Ningbo 315800, China

^*

Author to whom correspondence should be addressed.

^†

The first two authors contributed equally to this work.

Plants 2025, 14(8), 1206; https://doi.org/10.3390/plants14081206

Submission received: 2 March 2025 / Revised: 31 March 2025 / Accepted: 11 April 2025 / Published: 14 April 2025

(This article belongs to the Special Issue Crop Nutrition Diagnosis and Regulation)

Download

Browse Figures

Versions Notes

Abstract

The integration of crop models and remote sensing data has become a useful method for monitoring crop growth status and crop yield based on data assimilation. The objective of this study was to use leaf area index (LAI) values and plant nitrogen accumulation (PNA) values generated from spectral indices to calibrate the Decision Support System for Agrotechnology Transfer (DSSAT) model using the Monte Carlo Markov Chain (MCMC) technique. The initial management parameters, including sowing date, sowing rate, and nitrogen rate, are recalibrated based on the relationship between the remote sensing state variables and the simulated state variables. This integrated technique was tested on independent datasets acquired from three rice field tests at the experimental site in Deqing, China. The results showed that the data assimilation method achieved the most accurate LAI (R² = 0.939 and RMSE = 0.74) and PNA (R² = 0.926 and RMSE = 7.3 kg/ha) estimations compared with the spectral index method. Average differences (RE, %) between the inverted initialized parameters and the original input parameters for sowing date, seeding rate, and nitrogen amount were 1.33%, 4.75%, and 8.16%, respectively. The estimated yield was in good agreement with the measured yield (R² = 0.79 and RMSE = 661 kg/ha). The average root mean square deviation (RMSD) for the simulated values of yield was 745 kg/ha. Yield uncertainty from data assimilation between crop models and remote sensing was quantified. This study found that data assimilation of crop models and remote sensing data using the MCMC technique could improve the estimation of rice leaf area index (LAI), plant nitrogen accumulation (PNA), and yield. Data assimilation using the MCMC technique improves the prediction of LAI, PNA, and yield by solving the saturation effect of the normalized difference vegetation index (NDVI). This method proposed in this study can provide precise decision-making support for field management and anticipate regional yield fluctuations in advance.

Keywords:

crop environment resource synthesis for rice model; remote sensing data; monte carlo markov chain technique; rice

1. Introduction

The integration of crop models and remote sensing data has become a useful method for monitoring crop growth status and crop yield based on data assimilation over extensive regions. In this process, weather models and weather stations play a foundational supporting role. Weather models provide spatiotemporally continuous predictions of key climatic variables (e.g., precipitation, temperature) at regional scales, while widely distributed weather stations calibrate and validate model outputs through field observations. This integration ensures the accuracy of input data, thereby significantly enhancing the predictive precision of crop models and strengthening their practical applicability [1]. The GreenSeeker™ optical sensor (GS) is a highly effective tool for site-specific nitrogen fertilizer management tailored to specific needs. Vegetation indices, especially the normalized difference vegetation index (NDVI), which is calculated using reflectance in the red and near-infrared bands, are among the most commonly utilized indicators [2]. The NDVI can estimate crop leaf area index (LAI), plant nitrogen accumulation (PNA), nitrogen (N) requirement, and grain yield and improve the N use efficiency. However, the NDVI is prone to saturation at moderate to high LAI values or PNA values [3,4]. The saturation effect of the NDVI was mainly due to the canopy closure—the differences in penetration into the canopy between visible light (R) and near-infrared (NIR). Since the total absorption by a canopy in the red range is already between 90% and 95%, further increases in the green leaf area index (gLAI) do not result in any additional changes in the absorption and reflectance [5] or the normalization effect embedded in the calculation formula of this index [2]. In addition, former studies were principally based on the relationship between vegetation indices and yield to predict yield. However, using remote sensing (RS) data alone is not enough to explain the fundamental principles of crop growth and development processes and connect them to crop yield [6]. Moreover, it also gives poor annual performance in the spatial extension because of environmental, soil, and management changes [7].

Crop growth models are extensively utilized for assessing crop growth status and predicting yields [8]. They capture the interactions among genetic potential, environmental factors, and management practices by simulating the dynamic growth patterns of crops [6,9]. On the field scale, each farmer in China has only a few fields. Different sowing times, sowing rates, and fertilizer amounts were used in different fields, which will affect the application and popularization of crop models. Due to the saturation effect, the crop models could continuously estimate crop LAIs and PNAs and make up for the shortage of spectral monitoring. Therefore, the integration of crop models and remote sensing data has become a useful method for monitoring crop growth status (leaf area index, LAI, and accumulated nitrogen uptake, ANU) and crop yield.

Many studies have been carried out using the data assimilation of crop models and remote sensing data using a calibration method, including the Simplex Search Algorithm [10], Maximum Likelihood Solution [11], Very Fast Annealing Algorithm [12], Ensemble Square Root Filter [13,14], Particle Swarm Optimization Algorithm [6,15], and Ensemble Kalman Filter (EnKF) [16,17]. The calibration method was utilized to reduce the discrepancies between the remote sensing data and the simulated crop model data by employing an optimization algorithm. This process aims to enhance the model’s accuracy in reflecting observed conditions [18]. However, the above methods cannot quantify the uncertainty of data assimilation between crop models and remote sensing. The MCMC method is built on the Bayesian theoretical framework, constructing a balanced distribution for the Markov chain and sampling from it. By continuously updating the sample information, the chain can thoroughly explore the parameter space and ultimately converge to areas of high probability density [19]. Compared with the above methods, the MCMC method not only could find the optimal combination of initial parameters but also quantify the range of initial parameters.

The main objective of this study is (1) to improve the estimation of rice leaf area index (LAI), plant nitrogen accumulation (PNA), and yield by combining crop models and remote sensing data using the MCMC technique; (2) to improve the prediction of LAI and PNA by solving the saturation effect of the NDVI; and (3) to quantify the uncertainty of data assimilation between crop models and remote sensing.

2. Result

2.1. Spectral Index for LAI and VNA Estimation

The data from experiments 1 and 2 were used to build the relationship between the NDVI and the LAI or PNA. The results showed that there was an exponential regression relationship between the NDVI and the LAI or PNA (Figure 1). The NDVI indices could be used to estimate the PNA and LAI in rice.

The GreenSeeker NDVI showed a significant correlation with LAI and PNA (R² = 0.79), but saturation occurred when the LAI exceeded 7, becoming nearly constant at values above this threshold (Figure 2). Consequently, the LAI was often underestimated when it surpassed 7. Similarly, across various growth stages and site years, the GreenSeeker NDVI demonstrated a significant correlation with PNA (R² = 0.71), though it reached saturation when the PNA hit 7.0 and remained almost unchanged for PNA values above 90 (Figure 2).

2.2. The Probability Distribution of Inverted Initial Parameters

The value of the LAIe and PNAe derived from the NDVI exponential regression equation was used as a variable to calibrate the CERES-Rice model using the MCMC method. Table 1 shows the differences between the initialized parameters based on the remote sensing (RS)-CERES-Rice assimilation model and the original input parameters. Average differences (RE, %) between the inverted initialized parameters and the original input parameters for sowing date, seeding rate, and nitrogen amount were 1.33%, 4.75%, and 8.16%, respectively (Table 1). The RMSE values of three initial parameters between the retrieved and actual values were 1.30 d, 4.2 kg/ha, and 15.6 kg/ha based on the MCMC algorithm, respectively, after running the model 5000 times. Figure 3 shows the probability distribution of inverted initial parameters based on the MCMC method and the quantified uncertainty of the inverted initial parameters of the model.

2.3. Data Assimilation for LAI and PNA Estimation

This technique was tested on independent datasets. The data from experiment 3 were used to validate the assimilation model. The LAI and PNA dynamics of rice were simulated by the assimilation model based on the MCMC algorithm and the optimal assimilation parameters (LAI and PNA). We compared the LAI and PNA estimated using the assimilation model with the LAI and PNA estimated using the spectral index method. The result showed that values of LAI and PNA based on the assimilation model agreed better with actual values than values simulated by the spectral index method (Figure 4). The results showed that the R² values between the simulated and measured values of LAI and PNA were 0.939 and 0.926, respectively. The RMSE between the simulated and measured values of LAI and PNA were 0.74 and 7.3 kg/ha, respectively. When the GreenSeeker NDVI was used to estimate the LAI and PNA, the R² values between the simulated and measured values of LAI and PNA were 0.841 and 0.84, respectively. The RMSE between the simulated and measured values of LAI and PNA were 1.32 and 13.8 kg/ha, respectively. The data assimilation model achieved better LAI and PNA estimations than the spectral index method. The result confirmed that integrating spectral indices into the DSSAT-CERES model by the MCMC data assimilation algorithm was an effective tool for PNA estimation.

2.4. Data Assimilation for LAI and PNA Prediction and Uncertainty Analysis

The MCMC method could quantify the uncertainty of data assimilation between crop models and remote sensing. The RMSD for LAI varies from 0.08 to 0.26, while the RMSD for PNA varies from 2.85 to 6.02. The RMSD for LAI in the tilling period is higher than in other periods, while the RMSD for LAI in the jointing period is lower than in other periods. The RMSD in the flowering period is lower than in other periods, while the RMSD for PNA in the booting period is lower than in other periods.

2.5. Data Assimilation for Yield Prediction and Uncertainty Analysis

The relationship between the measured and simulated yields is shown in Figure 5. The result showed that the values of yield based on the assimilation model agreed better with the actual values. R² values between the simulated and measured values of yield were 0.79. The RMSE between the simulated and measured values of yield was 661 kg/ha. The simulated yield was in agreement with the measured yield across all three experiments. The simulated yield of hybrid rice was generally underestimated. The RMSD for yield varies from 678 to 792 across all three experiments. The average RMSD for the simulated values of yield was 745 kg/ha. Yield uncertainty from data assimilation between crop models and remote sensing was quantified.

3. Materials and Methods

3.1. Experimental Design

Three experiments were conducted from 2015 to 2017 at the Zhejiang Agricultural and Forest University Modern Agricultural and Forestry Science and Technology Park in Deqing, Huzhou City, Zhejiang Province, China (120°04′ E, 30°33′ N). The field soil was classified as sandy soil and soil organic matter. The total N, available phosphate, and potassium K (0 to 25 cm soil depth) are shown in Table 2. All experiments were conducted in a randomized complete block design with three replicates for each N dressing method at a plant density of 2.55 × 10⁵ for hybrid rice and 8 × 105 plants ha⁻¹ for conventional rice. Before transplanting, we applied a total of 135 kg ha⁻¹ P₂O₅ (as Ca(H₂PO₄)₂) in all experiments plus 180 kg ha⁻¹ K₂O (as KCl) to the soil in all experiments. The area of each plot was 24 m² (3 m × 8 m) in all experiments.

Experiment 1: Yongyou538 was planted on 28 May 2015 with a seeding rate of 60 kg ha⁻¹. Two N dressing methods were used: total N (as urea) was applied at rates of 0, 70, 140, 210, and 280 kg ha⁻¹, with 50% applied at pre-planting and 50% at the jointing stages.

Experiment 2: Xiushui134 was planted on 28 May 2016 with a seeding rate of 60 kg ha⁻¹. Two N dressing methods were used: total N (as urea) was applied at rates of 0, 70, 140, 210, and 280 kg ha⁻¹, with 50% applied at pre-planting and 50% at the jointing stages.

Experiment 3: Yongyou1540 was planted on 30 May 2017 with a seeding rate of 60 kg ha⁻¹. Two N dressing methods were used: total N (as urea) was applied at rates of 0, 70, 140, 210, and 280 kg ha⁻¹, with 50% applied at pre-planting and 50% at the jointing stages.

3.2. Data Acquisition

3.2.1. Measurement of Canopy Spectral Reflectance

A handheld GreenSeeker^TM (NTech Industries Inc., Ukiah, CA, USA) was used to measure canopy reflectance at the red region (656 nm) and near-infrared (NIR) region (774 nm). The NDVI was determined as NDVI = (NIR − Red)/(NIR + Red), where NIR and Red represent the fraction of emitted NIR and red radiation reflected back from the sensed area, respectively. Measurements were taken with the sensor positioned 1 m above the canopy. Readings were collected every 10 days following the jointing stages, resulting in five measurements per plot, with the average serving as a single observation.

3.2.2. Plant Sampling and Analysis

After each canopy spectral reflectance measurement, five plant samples were randomly collected from each plot. The dry weight of plant organs (leaf, haulm, and grain) and the leaf area index (LAI) were measured separately, and their average values were calculated. The LAI for each plot was calculated using the specific leaf area method, which is the ratio of green leaf area to dry weight. Total nitrogen concentration in tissues was measured using the micro-Kjeldahl method [20]. Nitrogen accumulation in the above-ground parts was calculated by multiplying the above-ground dry matter (kg ha⁻¹) by the above-ground plant nitrogen concentration (g kg⁻¹). Grain yield for each plot was determined at maturity by harvesting 2 m² of plants at a moisture content of 14%.

3.3. CERES-Rice Model

The Decision Support System for Agrotechnology Transfer (DSSAT 4.6) used in this research is a software tool designed for simulating crop growth and management [21]. It integrates various components such as soil, weather, and crop management practices to help agronomists and farmers make informed decisions about agricultural practices. CERES-Rice is a crop simulation model that is part of the DSSAT framework. CERES-Rice serves as a specific module within DSSAT focused on simulating rice growth and yield.

3.4. Integrating the MCMC Technique for Data Assimilation

The MCMC technique was used to combine the CERES-Rice model and remote sensing data for rice growth estimation and yield prediction. The MCMC technique, based on the Bayesian approach, effectively synthesizes information from various sources for analyzing model uncertainties and optimizing model parameters. The Metropolis-Hastings (M-H) algorithm [22,23] is a type of MCMC technique based on Bayes’ theorem for generating samples from the posterior distribution. The principle is to generate a large enough sample from the posterior parameter distribution so that features of this distribution (expected parameter values, parameter variances) can be accurately determined. The M-H-based method for estimating region-specific cultivar parameters in this study consisted of the following steps (Figure 6).

Step 1: The adjusted parameters from this study included the sowing date, plant density, and fertilization amount.

θ_{i}^{k}

represented the above three parameters (i = 1, 2, 3; k = 1, 2, 3, … N). An initial set of parameters was θ⁽⁰⁾, including three initial values sampled randomly within the range of each parameter. Prior to data collection, this distribution was based on the existing knowledge about the parameter values before the measurement of new data. It was impossible to define an uninformative prior distribution if there was no available information. The probability density function of the parameters was unknown, so a uniform distribution q (

θ_{i}^{n e w}

/

θ_{i}^{k - 1}

) was assumed as the prior distribution.

Step 2: Proposing a candidate of

θ_{i}^{n e w}

:

θ_{i}^{n e w} = θ_{i}^{k - 1} + r \times (\max (θ_{i}) - \min (θ_{i})) / D

(1)

where r was a random number uniformly distributed between 0 and 1, and max(θ_i) and min(θ_i) were the highest and lowest values of θ_i, respectively. D, controlling the proposed size, was 5.

Step 3: The dscsm046.exe model was run with two sets of parameters (θ^new and θ^k−1) with the required data, and the simulated LAI and PNA were calculated.

Step 4: The likelihood function π_p(θ) was calculated with the simulated LAI and PNA values and the measured values. The likelihood function was calculated at all observation times as follows:

π_{p} (θ) \propto e x p {- \frac{1}{σ_{1}^{2}} \sum_{t = 1}^{T} {[O_{1} (t) - S_{1} (t)]}^{2} - \frac{1}{σ_{2}^{2}} \sum_{t = 1}^{T} {[O_{2} (t) - S_{2} (t)]}^{2}}

(2)

where σ₁ and σ₂ were the standard deviations of actual measurements of LAI and PNA; S₁(t) and O₁(t) were simulated and observed LAI, respectively; S₂(t) and O₂(t) were simulated and observed PNA, respectively; and t stood for the different phenological stages.

Step 5: The ratio (a_p) of the likelihood function of the above two sets of parameters was calculated as follows:

a_{p} (θ^{k - 1}, θ^{n e w}) = m i n {1, \frac{(π_{p} (θ^{n e w})) q (\frac{θ^{k - 1}}{θ^{n e w}})}{(π_{p} (θ^{k - 1})) q (\frac{θ^{n e w}}{θ^{k - 1}})}}

(3)

Step 6: By comparing a_p with a random number U [0, 1], the better candidate was chosen. If a_p ≥ U, set θ^k = θ^new; otherwise, set θ^k = θ^k−1. This was called the M-H criterion to determine whether to accept the proposed candidate.

Step 7: Steps 2–7 were repeated until k = N. N was 10,000 for a single chain.

Step 8: The samples from each chain were gathered after the burn-in (number of iterations to be discarded) of 2000 and 8000 samples were used to calculate the means and variance of the posterior distribution.

3.5. Statistical Analysis

Relative error (RE, %) and root mean square error (RMSE, %) were used to calculate the fitness between the simulated and measured values and evaluate the reliability of the assimilation technique.

\bar{R M S D}

represented the dispersion among estimated values.

RE = {(O}_{j} - S_{j}) / O_{j}

R M S E = \sqrt{\frac{\sum_{j = 1}^{N} {(O_{j} - S_{j})}^{2}}{N}}

{R M S D}_{j} = \sqrt{\frac{\sum_{j = 1}^{K} {(S_{j} - {\bar{S}}_{j})}^{2}}{K}}

where O_j was the measured value, S_j was the simulated value,

{\bar{S}}_{j}

was the average of the simulated values, and N and K were the total number of the measured values.

Microsoft Excel 2016 was used for data entry, organization, preliminary calculations, and drawing. The data were analyzed by two-way ANOVA and Duncan’s multiple range test (p < 0.05) using SPSS 26.0 to evaluate the dry matter, the N concentration, and the yield under different N treatments.

4. Discussion

4.1. Integration of Crop Model and Remote Sensing

The NDVI indices based on visible and red light tended to become saturated as crop stand density increased [24]. Serrano et al. [25] reported that the relationship between the NDVI and leaf area index multiplied by chlorophyll concentration (similar to N uptake) saturated at values around 1000 mg m⁻². The normalized difference vegetation index (NDVI) became saturated for maize when LAI > 2, AGB > 3 t/ha, or PNU > 80 kg/ha [2]. Wang et al. [26] showed that the NDVI became saturated for rice when the Leaf Nitrogen Content (LNC) reached 3%. Li et al. [27] showed that canopy N accumulation was underestimated at a high LAI level. Takahashi et al. [28] showed that the spectral index displayed an obvious saturation when the plant nitrogen accumulation value of rice reached 80 kg/ha. In our research, the NDVI became saturated when the LAI reached six. The NDVI became saturated when PNA reached 90 kg/ha. The saturation effect of the NDVI was mainly due to the canopy closure, the differences in penetration into the canopy between visible light (R) and NIR, and the normalization effect embedded in the calculation formula of this index [2]. The NDVI becomes insensitive to changes in both red and NIR reflectance [3]. Therefore, the statistical model based on the relationship between remote sensing data and agronomic variables is not accurate because of the saturation of the NDVI. Some research built the relationship between the inversion agronomic variables to estimate the crop yield [29]. Crop models could continuously estimate crop LAI and Plant Nitrogen Uptake (PNU) and make up for the shortage of spectral monitoring. The data assimilation model achieved better LAI and PNA estimations than the spectral index method.

4.2. MCMC Method

The MCMC method is based on Monte Carlo simulations and can be directly applied to a nonlinear model [30]. Moreover, it does not rely on the Gaussian assumption of distributions as the Kalman filter-based algorithms do and is thus adapted to the potentially highly nonlinear plant/crop models [31]. The MCMC-based strategy appears to be a better choice for crop growth models in nonlinear and non-Gaussian systems [32] because it improves accuracy and efficiency and produces correct estimates of prediction uncertainty in nonlinear and non-Gaussian crop-growth model data assimilation [33]. In addition, the MCMC method can evaluate model uncertainty properly and provide a credibility interval compared to simple uncertainty analysis.

There are various specific algorithms, including the Synthetic Kalman Filter (SCE-UA) [34], Very Fast Annealing Algorithm [12], Particle Swarm Optimization Algorithm [6,15], and Ensemble Kalman Filter (EnKF) [16,17], that have been used in previous research. Too many parameters must be determined, and repeated testing is needed before the SCE-UA can be used effectively, which increases the complexity of calculations. Although annealing and particle swarm optimization are highly parallel, stochastic, and adaptive general optimization algorithms, there are some shortcomings, such as premature convergence or slow convergence in practical applications. These methods tend to fall into local optimum rather than global optimization. Moreover, none of these methods can quantify the uncertainty of data assimilation. Many studies have shown that the EnKF method has been developed for data assimilation between crop models and remote sensing data. The Kalman filter applies only to linear models with Gaussian prediction errors. Although the extended Kalman filter was developed for nonlinear dynamic models, Kalman filter-based approaches still rely on the Gaussian assumption of distribution [35]. The assimilation of the ensemble Kalman filter when processing nonlinear observation data has to simply linearize the nonlinear observations, which results in large errors and low accuracy. Chen and Cournède [31] showed that Kalman filter-based approaches suffer from the important nonlinearity of the model.

4.3. Uncertainty Analysis

The MCMC method can achieve a proper evaluation of model uncertainty. The uncertainty in data assimilation not only comes from errors from the remote sensing data and model inputs of soil, weather, and field management information but also from the interaction of parameters. In the process of data assimilation, all errors can be transferred to the initial parameters of the model.

We proposed the MCMC method in a probabilistic framework in order to quantify and reduce the uncertainties in rice simulations using data assimilation techniques. The MCMC method has never been applied to complex dynamic crop models for data assimilation. The MCMC method can correct the uncertainty of data assimilation between crop models and remote sensing and then reduce errors from data assimilation algorithms [18]. This method can help us understand uncertainties in the parameters and how those uncertainties affect predictions. The uncertainty of LAI and PNA estimation is derived from the uncertainty of the initial parameters, as seen in Table 3. Table 4 shows the probability distribution of inverted initial parameters based on the MCMC method and the quantified uncertainty of the initial parameters of the model. The integration of Bayesian probability inversion and the MCMC technique is an effective method for analyzing the uncertainties in data assimilation.

4.4. Yield Prediction

The data assimilation method that incorporates two state variables, typically leaf area index (LAI) and plant nitrogen accumulation (PNA), demonstrated greater accuracy in estimating grain yield compared to methods using only one state variable [6,7]. The primary reason for this improvement is that the LAI plays a critical role in crop growth monitoring and yield prediction, as it reflects the extent of the crop canopy and the potential for light interception and photosynthesis. Meanwhile, the PNA serves as an important indicator of the nitrogen status in rice, which has a direct impact on photosynthetic efficiency, grain yield, and quality [7,36]. The estimated sowing date and sowing rate are slightly different from actual values and increase with rising nitrogen values. This is because the sowing date, sowing rate, and nitrogen rate are related to the LAI and PNA. There are some interactions between management parameters. We hope to improve the accuracy of model prediction and to decrease the uncertainty of model prediction. The proper number of initial parameters is important for data assimilation. Three parameters are enough to explain the variation in the LAI and PNA. Some researchers chose more initial parameters for data assimilation, but more parameters will cause larger prediction uncertainty. Too many parameters assimilated into the model will cause a co-linearity effect among parameters.

The yield of hybrid rice is higher than normal rice. Therefore, the simulated yield for hybrid rice by the CERES-Rice model is lower than the actual measured yield. The simulated yield values were overestimated at low measured yield, whereas some simulated yield values were underestimated at high measured yield. The yield reached its largest value when the nitrogen application amount was 210 kg/ha, but the yield change is not obvious with the increase in the nitrogen application rate. However, the actual yield will increase when the nitrogen application amount is more than 210 kg/ha, especially for hybrid rice. Shi et al. [37] showed that the CERES-Rice model simulated the effect of nitrogen fertilizer under a low nitrogen fertilizer application level, but it does not behave so well when excess nitrogen fertilizer is applied.

When estimating yields over large areas, genetic and cultivar parameters, water and soil characteristics, and meteorological data may bias crop growth models [38]. Farmers can use the GreenSeeker™ optical sensor to obtain canopy spectral data, which can be substituted into the calibrated model to obtain optimized parameters, further predicting the yield more accurately. Through the accurate perception of the growth status at the seedling stage and the dynamic feedback of the model, this realizes the leap from passive response to active regulation. However, this technology requires farmers to be trained, and the operation requirements are high, so the lightweight APP should be further developed to reduce the threshold for farmers and improve their acceptance.

5. Conclusions

This research aimed to calibrate the CERES-Rice model using LAI and PNA values derived from spectral indices, employing the MCMC technique. The initial management parameters, including sowing date, sowing rate, and nitrogen rate, were recalibrated based on the correlations between remote sensing state variables and simulated state variables. The findings indicated that the data assimilation method provided the most accurate estimations for LAI (R² = 0.939 and RMSE = 0.74) and PNA (R² = 0.926 and RMSE = 7.3 kg/ha) when compared to the spectral index method. Furthermore, the estimated yield closely matched the measured yield (R² = 0.79 and RMSE = 876 kg/ha). Data assimilation of crop models and remote sensing data using the MCMC technique could improve the estimation of rice leaf area index (LAI), plant nitrogen accumulation (PNA), and yield by solving the saturation effect of the NDVI. The study also quantified yield uncertainty resulting from data assimilation between crop models and remote sensing. Overall, this research presents a novel approach for estimating rice growth and predicting yield by integrating the CERES-Rice model with remote sensing data through the MCMC technique. This technology can provide farmers with precise decision-making support for field management (optimizing water and fertilizer regulation, disaster early warning systems, etc.), assist governments and relevant departments in anticipating regional yield fluctuations and formulating food security strategies, and hold significant practical value for addressing climate change and stabilizing grain production.

Author Contributions

Y.C.: Conceptualization, Data curation, Investigation, Visualization, Writing—original draft, Writing—review and editing; S.W.: Conceptualization, Formal analysis, Software, Validation, Writing—original draft, Writing—review and editing; Z.X., J.H., S.C.: Investigation, Visualization; Z.L.: Funding acquisition, Investigation, Project administration, Resources, Software, Supervision, Visualization, Writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of China (32272222, 32071897), the Three Rural Areas and Nine Rural Areas of Zhejiang Province (2022SNJF007), the Ningbo Key Projects (2022S092), and the Jinhua Key Projects (2023-2-026).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Fan, L.; Fang, S.; Fan, J.; Wang, Y.; Zhan, L.; He, Y. Rice Yield Estimation Using Machine Learning and Feature Selection in Hilly and Mountainous Chongqing, China. Agriculture 2024, 14, 1615. [Google Scholar] [CrossRef]
Xia, T.; Miao, Y.; Wu, D.; Shao, H.; Khosla, R.; Mi, G. Active optical sensing of spring maize for in-season diagnosis of nitrogen status based on nitrogen nutrition index. Remote Sens. 2016, 8, 605. [Google Scholar] [CrossRef]
Anthony, N.R.; Anatoly, G.; Yi, P.; Viña, A.; Timothy, A.; Donald, R. Green leaf area index estimation in maize and soybean: Combining vegetation indices to achieve maximal sensitivity. Agron. J. 2012, 104, 1336. [Google Scholar]
González-Sanpedro, M.C.; Le Toan, T.; Moreno, J.; Kergoat, L.; Rubio, E. Seasonal variations of leaf area index of agricultural fields retrieved from Landsat data. Remote Sens. Environ. 2008, 112, 810–824. [Google Scholar] [CrossRef]
Gitelson, A.A. Remote sensing estimation of crop biophysical characteristics at various scales. In Hyperspectral Remote Sensing of Vegetation; Thenkabail, P.S., Lyon, J.G., Huete, A., Eds.; CRC Press: Boca Raton, FL, USA, 2011; pp. 1–36. [Google Scholar]
Wang, H.; Zhu, Y.; Li, W.; Cao, W.; Tian, Y. Integrating remotely sensed leaf area index and leaf nitrogen accumulation with ricegrow model based on particle swarm optimization algorithm for rice grain yield assessment. J. Appl. Remote Sens. 2014, 8, 083674. [Google Scholar] [CrossRef]
Li, Z.; Wang, J.; Xu, X.; Zhao, C.; Jin, X.; Yang, G.; Feng, H. Assimilation of two variables derived from hyperspectral data into the DSSAT-CERES model for grain yield and quality estimation. Remote Sens. 2015, 7, 12400–12418. [Google Scholar] [CrossRef]
Akumaga, U.; Gao, F.; Anderson, M.; Dulaney, W.P.; Houborg, R.; Russ, A.; Hively, W.D. Integration of remote sensing and field observations in evaluating DSSAT model for estimating maize and soybean growth and yield in Maryland, USA. Agron. J. 2023, 13, 1540. [Google Scholar] [CrossRef]
Thirumeninathan, S.; Pazhanivelan, S.; Mohan, R.; Pouchepparadjou, A.; Sudarmanian, N.S.; Ragunath, K.; Aruna, L.; Satheesh, S. Integrating S1A microwave remote sensing and DSSAT CROPGRO simulation model for groundnut area and yield estimation. Eur. J. Agron. 2024, 161, 127348. [Google Scholar] [CrossRef]
Ma, G.; Huang, J.; Wu, W.; Fan, J.; Zou, J.; Wu, S. Assimilation of MODIS-LAI into the WOFOST model for forecasting regional winter wheat yield. Math. Comput. Model. 2013, 58, 634–643. [Google Scholar] [CrossRef]
Dente, L.; Satalino, G.; Mattia, F.; Rinaldi, M. Assimilation of leaf area index derived from ASAR and MERIS data into CERES-Wheat model to map wheat yield. Remote Sens. Environ. 2008, 112, 1395–1407. [Google Scholar] [CrossRef]
Dong, Y.; Zhao, C.; Yang, G.; Chen, L.; Wang, J.; Feng, H. Integrating a very fast simulated annealing optimization algorithm for crop leaf area index variational assimilation. Math. Comput. Model. 2013, 58, 877–885. [Google Scholar] [CrossRef]
Huang, J.; Sedano, F.; Huang, Y.; Ma, H.; Li, X.; Liang, S.; Tian, L.; Zhang, X.; Fan, J.; Wu, W. Assimilating a synthetic Kalman filter leaf area index series into the WOFOST model to improve regional winter wheat yield estimation. Agric. For. Meteorol. 2016, 216, 188–202. [Google Scholar] [CrossRef]
Zhao, Y.; Chen, S.; Shen, S. Assimilating remote sensing information with crop model using Ensemble Kalman Filter for improving LAI monitoring and yield estimation. Ecol. Model. 2013, 270, 30–42. [Google Scholar] [CrossRef]
Jin, X.; Li, Z.; Yang, G.; Yang, H.; Feng, H.; Xu, X.; Wang, J.; Li, X.; Luo, J. Winter wheat yield estimation based on multi-source medium resolution optical and radar imaging data and the AquaCrop model using the particle swarm optimization algorithm. ISPRS J. Photogramm. Remote Sens. 2017, 126, 24–37. [Google Scholar] [CrossRef]
De Wit, A.D.; Van Diepen, C. Crop model data assimilation with the Ensemble Kalman filter for improving regional crop yield forecasts. Agric. For. Meteorol. 2007, 146, 38–56. [Google Scholar] [CrossRef]
Nearing, G.S.; Crow, W.T.; Thorp, K.R.; Moran, M.S.; Reichle, R.H.; Gupta, H.V. Assimilating remote sensing observations of leaf area index and soil moisture for wheat yield estimates: An observing system simulation experiment. Water Resour. Res. 2012, 48, W05525. [Google Scholar] [CrossRef]
Jin, X.; Kumar, L.; Li, Z.; Feng, H.; Wang, J. A review of data assimilation of remote sensing and crop models. Eur. J. Agron. 2018, 92, 141–152. [Google Scholar] [CrossRef]
Lv, Z.; Liu, X.; Tang, L.; Liu, L.; Cao, W.; Zhu, Y. Estimation of ecotype-specific cultivar parameters in a wheat phenology model and uncertainty analysis. Agric. For. Meteorol. 2016, 221, 219–229. [Google Scholar] [CrossRef]
Siriwardene, J.A.D.S.; Thomas, A.J.; Evans, R.A.; Axford, R.F. Automated analysis of total nitrogen in solid biological material. J. Sci. Food Agric. 1966, 17, 456–460. [Google Scholar] [CrossRef]
Jones, J.W.; Hoogenboom, G.; Porter, C.H.; Boote, K.J.; Batchelor, W.D.; Hunt, L.A.; Wilkens, P.W.; Singh, U.; Gijsman, A.J.; Ritchie, J.T. The DSSAT cropping system model. Eur. J. Agron. 2003, 18, 235–265. [Google Scholar] [CrossRef]
Hastings, W. Monte Carlo sampling methods using Markov chains and their application. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]
Metropolis, N.; Rosenbluth, A.; Rosenbluth, M.; Teller, A.; Teller, E. Equations of state calculations by fast computing machines. J. Chem. Phys. 1953, 21, 1087–1092. [Google Scholar] [CrossRef]
Erdle, K.; Mistele, B.; Schmidhalter, U. Comparison of active and passive spectral sensors in discriminating biomass parameters and nitrogen status in wheat cultivars. Field Crops Res. 2011, 124, 74–84. [Google Scholar] [CrossRef]
Serrano, L.; Gamon, J.A.; Peñuelas, J. Estimation of canopy photosynthetic and nonphotosynthetic components from spectral transmittance. Ecology 2000, 81, 3149–3162. [Google Scholar] [CrossRef]
Wang, W.; Yao, X.; Yao, X.; Tian, Y.; Liu, X.; Ni, J.; Cao, W.; Zhu, Y. Estimating leaf nitrogen concentration with three-band vegetation indices in rice and wheat. Field Crops Res. 2012, 129, 90–98. [Google Scholar] [CrossRef]
Li, F.; Miao, Y.X.; Chen, X.P.; Zhang, H.; Jia, L.; Bareth, G. Estimating winter wheat biomass and nitrogen status using an active crop sensor. Intell. Autom. Soft Comput. 2010, 16, 1221–1230. [Google Scholar]
Takahashi, W.; Vu, N.C.; Kawaguchi, S.; Minamiyama, M.; Ninomiya, S. Statistical models for prediction of dry weight and nitrogen accumulation based on visible and near-infrared hyperspectral reflectance of rice canopies. Plant Prod. Sci. 2000, 3, 377–386. [Google Scholar] [CrossRef]
Ntakos, G.; Prikaziuk, E.; ten Den, T.; Reidsma, P.; Vilfan, N.; van der Wal, T.; van der Tol, C. Coupled WOFOST and SCOPE model for remote sensing-based crop growth simulations. Comput. Electron. Agric. 2024, 225, 109238. [Google Scholar] [CrossRef]
Naud, C.; Mitchell, K.L.; Muller, J.P.; Clothiaux, E.E.; Albert, P.; Preusker, R.; Fischer, J.; Hogan, R.J. Comparison between ATSR-2 stereo, MOS O2-A band and ground-based cloud top heights. Int. J. Remote Sens. 2007, 28, 1969–1987. [Google Scholar] [CrossRef]
Chen, Y.; Cournède, P.H. Data assimilation to reduce uncertainty of crop model prediction with convolution particle filtering. Ecol. Model. 2014, 290, 165–177. [Google Scholar] [CrossRef]
Huang, J.; Song, J.; Huang, H.; Zhuo, W.; Niu, Q.; Wu, S.; Ma, H.; Liang, S. Progress and perspectives in data assimilation algorithms for remote sensing and crop growth model. Sci. Remote Sens. 2024, 10, 100146. [Google Scholar] [CrossRef]
Jiang, Z.; Chen, Z.; Chen, J.; Ren, J.; Li, Z.; Sun, L. The estimation of regional crop yield using ensemble-based four-dimensional variational data assimilation. Remote Sens. 2014, 6, 2664–2681. [Google Scholar] [CrossRef]
Li, Q.; Gao, M.; Duan, S.; Yang, G.; Li, Z.L. Integrating remote sensing assimilation and SCE-UA to construct a grid-by-grid spatialized crop model can dramatically improve winter wheat yield estimate accuracy. Comput. Electron. Agric. 2024, 227, 1–10. [Google Scholar] [CrossRef]
Ren, S.; Chen, H.; Hou, J.; Zhao, P.; Dong, Q.; Feng, H. Based on historical weather data to predict summer field-scale maize yield: Assimilation of remote sensing data to WOFOST model by ensemble Kalman filter algorithm. Comp. Electron. Agric. 2024, 219, 108822. [Google Scholar] [CrossRef]
Wang, D.; Struik, P.C.; Liang, L.; Yin, X. Developing remote sensing- and crop model-based methods to optimize nitrogen management in rice fields. Comput. Electron. Agric. 2024, 220, 108899. [Google Scholar] [CrossRef]
Shi, C.; Jin, Z.; Gao, L.; Ge, D.; Wei, X. Application and Analysis of Ceres-Rice in Rice Nitrogen Fertilizer Management. Jiangsu Agric. Sci. 2003, 19, 1–4. [Google Scholar]
Hansen, J.W.; Jones, J.W. Short Survey: Scaling-up crop models for climate variability. Agric. Syst. 2000, 65, 43–72. [Google Scholar] [CrossRef]

Figure 1. The statistical model between the NDVI and the LAI and PNA.

Figure 2. The statistical model between the NDVI and the LAI and PNA when the NDVI was higher than 0.74.

Figure 3. The probability distribution of inverted initial parameters using the MCMC method. Red line: N0; Green line: N1; Dark blue line: N2; Light blue line: N3; Pink line: N4.

Figure 4. Comparisons of simulated and measured LAI and PNA values by CERES-Rice before and after assimilation based on data from experiment 3.

Figure 5. Comparisons of simulated yield by data assimilation and measured yield.

Figure 6. Flowchart of data assimilation based on the MCMC technique.

Table 1. Differences (RE, %) between the initialized parameters based on the RS-RiceGrow assimilation model and the original input parameters.

Treatments	Sowing Date (%)	Seeding Rate (%)	Nitrogen Amount (%)
E1N0	−2	−16.2	/
E1N1	−1.33	3.2	7.2
E1N2	0	17.9	−7.3
E1N3	0.67	27.2	−11.9
E1N4	1.33	32.1	1.8
E2N0	−1.6	−12.4	/
E2N1	−1.4	4.3	−5.2
E2N2	−0.7	13.6	8.4
E2N3	1	23.2	−9.3
E2N4	1.4	29.4	7.5
E3N0	−2.2	−15.2	/
E3N1	−1.3	−1	8.53
E3N2	−0.4	12.7	−3.7
E3N3	1.3	24.1	5.8
E3N4	1.2	32.5	11.7

Table 2. Fertilizer design and sampling date of three field trials.

Experiment No	Site	Cultivar	N Rate (Kg ha⁻¹)	Planting Date	Soil Parameters
Experiment 1 2015	Deqing	Yongyou538	N0(0) N1(70) N2(140) N3(210) N4(280)	28-May	Soil organic matter: 22 g kg⁻¹ Total N: 1.37 g kg⁻¹ P₂O₅: 32.59 mg kg⁻¹ K₂O: 98.96 mg kg⁻¹
Experiment 2 2016	Deqing	Xiushui134	N0(0) N1(70) N2(140) N3(210) N4(280)	28-May	Soil organic matter: 21.1 g kg⁻¹ Total N: 1.27 g kg⁻¹ P₂O₅: 38.12 mg kg⁻¹ K₂O: 90.23 mg kg⁻¹
Experiment 3 2017	Deqing	Yongyou1540	N0(0) N1(70) N2(140) N3(210) N4(280)	30-May	Soil organic matter: 22.4 g kg⁻¹ Total N: 1.24 g kg⁻¹ P₂O₅: 40.14 mg kg⁻¹ K₂O: 94.56 mg kg⁻¹

Table 3. The RMSD for the LAI and PNA.

	LAI				PNA
	Tilling	Jointing	Booting	Flowering	Tilling	Jointing	Booting	Flowering
N0	0.17	0.10	0.16	0.15	5.30	3.28	2.92	3.07
N1	0.26	0.11	0.17	0.17	6.02	4.25	3.66	3.93
N2	0.23	0.12	0.16	0.16	5.06	3.98	3.59	3.92
N3	0.22	0.13	0.16	0.13	4.85	4.12	3.64	4.03
N4	0.15	0.13	0.13	0.14	3.84	3.85	3.61	3.78

Table 4. Yield prediction results by assimilated model.

	Yield
	R²	RMSE (kg/ha)	RMSD (kg/ha)
E1	0.86	960	678
E2	0.83	338	764
E3	0.7	685	792
Total	0.79	661	745

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, Y.; Wang, S.; Xue, Z.; Hu, J.; Chen, S.; Lv, Z. Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique. Plants 2025, 14, 1206. https://doi.org/10.3390/plants14081206

AMA Style

Chen Y, Wang S, Xue Z, Hu J, Chen S, Lv Z. Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique. Plants. 2025; 14(8):1206. https://doi.org/10.3390/plants14081206

Chicago/Turabian Style

Chen, Yingbo, Siyu Wang, Zhankui Xue, Jijie Hu, Shaojie Chen, and Zunfu Lv. 2025. "Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique" Plants 14, no. 8: 1206. https://doi.org/10.3390/plants14081206

APA Style

Chen, Y., Wang, S., Xue, Z., Hu, J., Chen, S., & Lv, Z. (2025). Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique. Plants, 14(8), 1206. https://doi.org/10.3390/plants14081206

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Rice Growth Estimation and Yield Prediction by Combining the DSSAT Model and Remote Sensing Data Using the Monte Carlo Markov Chain Technique

Abstract

1. Introduction

2. Result

2.1. Spectral Index for LAI and VNA Estimation

2.2. The Probability Distribution of Inverted Initial Parameters

2.3. Data Assimilation for LAI and PNA Estimation

2.4. Data Assimilation for LAI and PNA Prediction and Uncertainty Analysis

2.5. Data Assimilation for Yield Prediction and Uncertainty Analysis

3. Materials and Methods

3.1. Experimental Design

3.2. Data Acquisition

3.2.1. Measurement of Canopy Spectral Reflectance

3.2.2. Plant Sampling and Analysis

3.3. CERES-Rice Model

3.4. Integrating the MCMC Technique for Data Assimilation

3.5. Statistical Analysis

4. Discussion

4.1. Integration of Crop Model and Remote Sensing

4.2. MCMC Method

4.3. Uncertainty Analysis

4.4. Yield Prediction

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI