1. Introduction
Rice is one of the most important cereal grains and serves as a staple food resource for more than half of the world’s population. For this reason, rice consumption increases dramatically together with population raising, and the demand for rice is predicted to be strong [
1,
2]. International Food Policy Research Institute analyses show that the demand for rice is increasing by about 1.8% per year. It will be difficult to increase rice production to satisfy this demand because land and water resources become more scarce with increasing competition from fast-growing non-farming sectors [
3]. Especially, rice crop is a heat and water-loving crop [
4], then, sometimes the rice badly suffers from cold stress during the seedling stage when it is grown in winter environment [
5]. Therefore, monitoring the global productivity of rice is one of the adequate solutions to meet the demand for rice.
Rice parameters (height, dry biomass, leaf area index, salinity, etc.) are primarily monitored through two approaches, including field measurements by farmers and model retrievals from remote sensing data. While the filed measurement is time-consuming, remote sensing is able to perform rice parameters measurement on a large scale without directly contacting the crop [
6]. Spatial remote sensing provides the opportunity to have information on a regional scale with high spatial and temporal resolution. Data acquired from satellites provides a great tool for tracking temporal changes in soil and crop conditions, mapping their characteristics over large areas. There are two kinds of remote sensing techniques: one uses optical sensors and the other uses synthetic aperture radar data (SAR) sensors. In addition, the fusion of optical and SAR data can be exploited as well [
7,
8]. Optical sensors (e.g., UAV, multispectral, and hyperspectral data) are suited for monitoring agricultural areas [
9,
10,
11]. For example, the joint analysis of time series of vegetation and water indices derived from these sensors, such as Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), and Normalized Difference Water Index (NDWI), can be used to estimate rice parameters [
9,
12]. However, a large cloud coverage can limit the use of optical sensors [
13].
Unlike passive optical sensors, SAR systems are capable of producing high-quality images of the earth even in cloud cover conditions. Most rice fields are always found in cloudy cover conditions, and SAR sensors can work in these conditions, so it is more effective for monitoring rice fields than optical sensors [
14]. Furthermore, for agriculture, SAR data can provide information on activities such as plowing, field preparation, planting, and the state of growth of the crops from germination to maturity stages. Recent studies have shown the high potentialities of using such data for crop monitoring [
15,
16]. Thus, the application of SAR data in agriculture is becoming more popular. In previous studies, many investigations have been carried out on monitoring of rice growing. Various of them are based on rice parameters estimation [
17,
18,
19], the others are focused on the used water within the rice fields [
20], and some of them are on soil penology and organic matters [
21]. Besides several SAR data which were used to map and monitor rice such as RADARSAT-1/2 [
22], ENVISAT ASAR [
23], ERS-1/2 [
24], Sentinel-1 data is one of them. The Sentinel-1 mission is based on a constellation of two satellites (A and B). Sentinel-1 as a C-band SAR imaging satellite constellation ensures the continuity of ERS and ENVISAT missions [
25,
26]. Sentinel-1 data are systematically acquired in terrain observation with progressive scan (TOPS) mode with a 6-day revisit period (We note that outside the European zones, it drops to 12 days.). Dense time series of open access Sentinel-1 data at high spatial resolution (20 m) offers new opportunities for monitoring agriculture [
27]. In addition, Torres et al., [
26] reported that C-band SAR data were particularly well suited for monitoring and mapping rice because of their abilities to acquire information on rice growing areas with frequent cloud cover and a remarkable increase in backscattering coefficients throughout the rice growth cycle. Ferrazzoli et al., [
28] also showed that the sensitivity of C-band sensors to plant biomass depends on the type of crop.
In Camargue, the rice crop plays a crucial role for the hydrological balance [
29]. This region is one of the main rice producers and suppliers in France [
30]. Therefore, understanding and analyzing rice growth of Camargue is significant for agriculture in France. The aim of the paper is to analyze the capability of SAR Sentinel-1 data to assess the rice parameters in Camargue. Rice crop height is an important agronomic feature linked to plant type and yield potential. The rice plant height estimation, particularly, is considered a simple method for determining rice growth because this parameter greatly influences the yield potential. Rice biomass is regarded as an important indicator of ecological and management processes in the vegetation.
In the literature of remote sensing, various machine learning algorithms are available for regression [
31,
32,
33,
34]. In practice, the algorithms are selected based on the trade-off between the performances in terms of estimation of a given biophysical parameter, computation time and interpretability of the results [
35]. Most of remote sensing works are based on classical algorithms, such as Multiple Linear Regression (MLR), Support Vector Regression (SVR) and Random Forest (RF) [
36,
37]. Although they have been introduced since the early 2000s, they still compete with other approaches in many applications. In this paper, classical machine learning methods (Multiple Linear Regression, Support Vector Regression and Random Forest) are applied in order to estimate rice height and dry biomass in the Camargue region.
This paper includes six sections. In
Section 2, characteristics of the study site, weather, cultivation of Camargue area and the information of ground truth measurement are described.
Section 3 presents the principle of three methods used to estimate rice parameters (Multiple Linear Regression, Support Vector Regression and Random Forest). The results are shown in
Section 4. The discussions are presented in
Section 5 and finally, the
Section 6 involves the main conclusions of this paper.
2. Study Area
2.1. Camargue Study Site
Camargue is located in southern France; centered coordination is approximately at
N latitude and
E longitude. Among 110,000 ha of total area, 54,000 ha are used for agriculture and the rest is protected for nature conservation [
30]. Placed between two branches of the Rhône and the Mediterranean Sea, Camargue region is a low-land containing alluvial deposits of the Rhone and the Mediterranean Sea. That is mainly characterized by the absence of important relief in which we distinguish two areas [
38]. Starting from the north to south of Camargue, there are plots behind the Rhône river and from the south along the sea, there are much saltwater regions.
Agriculture is the main economic sector of the Camargue region although the semi-arid Mediterranean climate is disadvantageous for rice cultivation because Mediterranean climate is rainy in winter and dry in summer, then it is not easy for planting paddy rice in summer. Temperature and precipitation are important factors that affect directly crops cultivation, especially rice crops. The annual average temperature of this region is about
C (January) to
C (July). Besides that, precipitation also varies greatly during a year with less than 10 mm in July and 160 mm in September (due to short periods of heavy rain). Based on those features, in Camargue, there is one rice crop activity per year from May to September when the temperature and precipitation are the highest. Flooded rice cultivation allows the water needed for soil desalination and to introduce other crops such as wheat, sunflower, and fodder to be rotated. In the Camargue region, the rice crop has an important impact on the ecological, economic, and social equilibrium [
29,
39].
2.2. Rice Phenomena
The temporal observation of rice growth is important for understanding the radar responses of rice plots at different stages of growth. Camargue has a single cropping season of rice; the agricultural calendar is shown in
Table 1.
The rice crop in Camargue is an intensive type irrigated with direct seeding. Temperature and light are key factors in crop growth:
For levee, a minimum temperature of C;
For flowering, a minimum temperature of C.
There are three major periods of rice cultivation: the sowing period (starting according to the weather conditions from the end of April to middle May), the growth period (up to September) and the harvest period (end of September to the beginning of October).
In Camargue, the sowing date is from 24 April to 16 May, followed by a germination stage from May to June. At the germination stage, water is not evacuated. If the surface is bumpy, two risks are present: (1) either the water height is too high and the seedling dies; (2) the land is bare and the seedling germinates badly. Constant monitoring of water levels, which have a significant role as thermal regulators, should be carried out. Continuous water movement compensates the variations of air temperature. A high water depth protects the seedling in case of cooling. However, it is necessary to rapidly lower the level as soon as the ambient conditions become again favorable.
The growing period begins from the successful germination to the flowering/maturation stage.
Figure 1 shows clearly the development of rice plants in Camargue.
Three essential factors play a role in the success of seedling emergence: water temperature, with an average above C with the nocturnal minimum around C; the constancy of the insulation; the fight against parasites: larvae or algae. After the emergence, the rice plot is dried up to 10 or 12 times. Drying can last a day or sometimes half a day. Temporary drying inhibits the development of parasites and activates the rice rooting. Between 25 July and 25 August comes the flowering period. The height of crops does not increase much at this stage. Flowering period requires a warm climate, without wind, without rain. After flowering comes the ripening period where the farmers begin to evaluate their future harvest at the end of August. The harvest stage occurs from the end of September to the beginning of October, all ripening rice crops are reaped. At the end of the harvest, two options are possible. For the first option, rice farmers normally have a long period enough to prepare their rice plots. In irrigated rice farming, as is the case in the Camargue, special attention is paid to leveling, using a laser-guided blade. The use of this equipment aims to obtain perfectly flat plots, in order to manage water precisely. This method is also used after harvest period between seasons for maintenance of irrigation canals (“waterways”). In Spring, once the last surface preparation is carried out to obtain a suitable seedbed, ditches are dug in the plots to facilitate their filling and emptying during cultivation, and then watering is done; we have a continuous cultural cycle. For the second option, farmers work very quickly to sown as early as autumn, usually with wheat which is the second crop in rice farms in Camargue.
2.3. Ground Data Measurement
Ground truth measurements of rice parameters (rice height and rice biomass) were collected in eleven reference plots (
Figure 2) which were about 2 to 6 hectares. A plot survey was carried out from May 2017 to September 2017 on the 11 rice reference plots selected to cover the variability encountered in both soils and agricultural practices. The reference plots were chosen in such a way that they could be represented the paddy rice fields in Camargue and they should not be adjacent parcels. The ground surveys were conducted based on the Sentinel-1 data acquisitions to measure rice height and biomass through the full growth rice cycle. In that respect, the following measurements were made every 12 days: 2 locations per plot for biomass assessment (1 m
for each location), 30 points per plot for crop height measurements.
Figure 2 shows the position of ground surveys. The rice height has been taken on all the 11 plots. In contrast, rice biomass had been only taken on 1G1, 1G3, 2M1, 2M2, 3M1. The number of plots used to cut the biomass is reduced to five; we could not cut the biomass on all the plots otherwise it would be a waste and it would have had an impact on the production. The biomass was dried to have the dry biomass that is used in our analyses.
In addition, information such as the label, surface (in hectare), the sowing date, rice variety and harvest date are known for each plot and represented in
Table 2.
Regarding the parameters displayed in
Table 2, the sowing date is not the same for all plots and the harvest date also is not at the same time. The variability observed is due to the local conditions at the time of sowing and also is due to the rice variety. The harvest period takes place between 29 September (DoY 272) and 14 October (DoY 287). After harvesting, the rice in Camargue is dried and preserved in the geographical area at a humidity level allowing its (good) conservation in silos whose temperature conditions are controlled. The ground measurements of rice height and rice biomass were collected at the same time with SAR Sentinel-1 images acquisition over reference plots. The
Figure 3 shows one of eleven plots with its pictures that were taken during the ground survey.
In summary, for rice height in-situ data, there are 132 measurements, whereas, for rice biomass, we only have 50 measurements.
2.4. SAR Data
The Sentinel-1 SAR data includes 25 TOPS mode acquisitions from May 2017 to September 2017, with 6 revisit days. This is dual-polarization (VV and VH) data, leading to 50 images in total. First of all, a reference image was selected and all images are coregistered to it by taking into account the TOPS acquisition [
41]. Five-look (in range) intensity radar data are generated and radiometrically calibrated for antenna gain, range spreading loss, normalized reference area and the calibration constant using information from Sentinel-1 SAR header.
Reliable estimates using backscattering coefficients require that the estimated number of looks (ENL) is large. To increase the ENL, speckle filtering can be used with loss of spatial resolution. In this paper, we improve the time series SAR Sentinel-1 dataset by exploiting a temporal adaptive filtering to reduce speckle while keeping as much as possible the fine structures present in radar images [
42].
Finally, all images are orthorectified into map coordinates. This can be done by simulating SAR image from the SRTM DEM 30 m and using it to do coregistration. The image pixel size of the final data is 20 m. The Sentinel-1 SAR data are processed by using the TomoSAR platform (i.e., a platform supports the entire processing from SAR, Interferometry, Polarimetry, to Tomography (so-called TomoSAR)) developed by Dinh Ho Tong Minh and Yen-Nhi Ngo [
43]. In details, the full description of this SAR dataset can be found in [
30].
5. Discussion
In this paper, we show that the Sentinel-1 can be used to estimate rice height and biomass in Camargue with a high accuracy. The dynamics in rice height and biomass can be observed by the backscatter behaviors in each of the two polarizations (VV and VH). Good results can be obtained by using classical approaches (MLR, SVR, and RF). The validated metric indicates good performance, in which the correlation coefficient was greater than 0.8. We show that there is a better performance using non-parametric methods (SVR and RF) over the parametric MLR method. Thus, these demonstrated results confirm the great potential of Sentinel-1 data for rice height and biomass retrievals.
First of all, even with the traditional approaches, we show that good retrieved performance could be yielded with Sentinel-1 SAR data. This is not straightforward due to the speckle noise nature in radar images. Good performance can be explained by the fact that 6 days time series Sentinel-1 SAR data allows not only a good follow-up of the rice growth but also mostly noise-free dataset, thanks to the speckle adaptive filtering. In addition, for rice biophysical parameter estimations, the Day after Sowing information, which can be retrieved directly from Sentinel-1 data within the accuracy of 3 days (our estimation), allows us to mitigate the potential problem on the radar signal saturation. This situation is different from forest applications, where the forest age is challenged to retrieval. For example, for forest biomass estimations, many works have to use prior knowledge (e.g., Landsat-derived tree cover [
48]) or advanced techniques (e.g., SAR tomography [
49,
50]) to mitigate the radar signal saturated problem. Finally, we note that outside the European zones, there is a limitation for Sentinel-1 data since the revisit period will be 12 days. However, it should be emphasized that, nowadays, the Sentinel-1 constellation is the only system which can provide free and global coverage radar data. Therefore, it is a good candidate for operational rice monitoring tasks.
Using in-situ measurements at Camargue, we show that radar backscattering coefficients are sensitive to rice biophysical parameters and have a strong correlation with rice height and biomass. This observation is consistent with reports from the literature [
14,
51] due to the sensitivity of radar signals to rice structures. The backscatter of VH and VV polarizations can be separated into two main parts. In the first part, the VV and VH increase with the rice growth until 50 cm height (See
Figure 5c). At this stage, the main backscattering mechanism is contributed by the interaction between the stem and the underlying water surface (e.g., double bounce). In the second part (where rice grows from 50 cm to 100 cm (maturity)), although the VV goes down, the VH is still slightly increased. Indeed, at this stage, the backscattering mechanisms are contributed not only by the double bounce interaction but also from the volume stem directly. Beyond 50 cm, the double bounce phenomenon can be reduced, and the stem and leaves of the rice are not remained vertical. While they lead to the power decrease of the VV backscatter, the VH signal is increased due to its sensitivity to random volume backscattering. Similar interactions were also reported by Lopez-Sanchez et al. [
52]. The same trend of backscattering coefficients versus rice biomass is observed. When rice biomass reaches around 800 g·m
(See
Figure 5d), this is the period when the rice stem is no longer vertical and the leaves have already deviated from the stem. Interestingly, it is worth pointing out that there is a jump in both VV and VH backscatter at 100 DaS days (See
Figure 5a). This is mainly because there is a rapid transition from always irrigated rice fields at the reproductive stage to no longer irrigated status at the ripening stage.
We can observe that non-parametric methods (SVR and RF) have better performance over the parametric MLR method for estimating rice parameters (See
Table 3 and
Table 5). Between the SVR and RF models, the RF method obtains slightly better results than the SVR. With the RF method, the correlation coefficient
and RMSE were 0.92 and
for rice height, 0.90 and
for rice dry biomass. For rice height, all methods (MLR, SVR, and RF) yield the best performances with the dual-polarization VV and VH model. For rice biomass, we observe that the combination of VH and VV polarizations does not bring to the good result as a single VH polarization in SVR and RF methods. This can be explained by the fact that the VH backscattering is mainly from the depolarizing part represents a small proportion of biomass, but it is highly correlated with the total biomass [
48]. The RMSE was 16% (7.9 cm) for rice height, which is better than the result (13.3 cm) reported in the recent literature [
53]. For rice biomass, the RMSE was 18% (162 g·m
), which is better than the recent results (RMSE = 170.49 g·m
) reported by Jing et al. [
54], who used an Artificial Neural Network (ANN) inversion method on the C-band RADARSAT data. Last but not least, our rice biomass estimation (162 g·m
) is much better than the performance (200 g·m
) from L-band ALOS/PALSAR data [
55].
In parallel with machine learning algorithms, there is also another approach to estimate rice parameters. This is physical model-based inversion (such as Karam Model [
56], Water Cloud Model [
57], etc.). However, physical models are complicated and related to certain rice growth models, which require a wide area for the experiment station to measure model parameters. These limitations of physical model-based inversion methods make sense to work on machine learning methods [
36,
37]. We demonstrated in this paper that the non-parametric methods (SVR and RF) had a good performance in estimating the rice biophysical parameters.
Finally, among agricultural practices in the Camargue region, the rice crop plays a crucial role in the cropping systems. This is mainly because the rice irrigation allows the leaching of salt and consequently leads to the introduction of other species (e.g., wheat, sunflower, etc) into the rotation of crops [
39]. However, the rice areas tend to decrease from 16,000 ha in 2011 ([
29]) to 10,627 ha ([
30]) in 2017. The downward trend of the rice extent can lead to a negative impact on the sustainable agricultural development. Future studies should be considered the remote sensing assimilation, such as Sentinel-1 radar for rice height and biomass as our demonstration, in crop models to better follow farming practices, estimate rice production and provide strategies for the sustainable agricultural development.
6. Conclusions
In this work, we studied the potential of Sentinel-1 remote sensing data for rice height and dry biomass estimations. We proposed to use three classical machine learning approaches to predict rice parameters from Sentinel-1 data, which were applied in the Camargue region.
We demonstrated that good regression performance could be yielded with Sentinel-1 data even with the classical approaches (MLR, SVR, and SVR). We found that non-parametric methods (SVR and RF) had better performance over the parametric MLR method for estimating rice parameters. Between the SVR and RF models, the RF method obtains slightly better results than the SVR. Correlation coefficient and RMSE were 0.90 and (162 g·m) for rice dry biomass, 0.92 and (7.9 cm) for rice height, which indicated a high accuracy of Sentinel-1 SAR retrieval. In other words, we demonstrated that Sentinel-1 remote sensing data could be an alternative and reliable approach to monitor regional rice height and estimate dry biomass, compared with direct field measurements. Future work on the Camargue region could be focused on exploiting Sentinel-1 data to improve crop models to better estimate rice production yields in order tobe able to propose strategies for sustainable agricultural development.
In the context of the Copernicus program, there are two missions: radar Sentinel-1 and optical Sentinel-2. In future research, the data combination from the two satellites will be a necessity as it will be possible to carry out large-scale missions. Radar and optical data can be complementary to each other because they offer different perspectives on the Earth’s surface providing different information content according to their specificity [
7,
8,
58]. Both types of data can also be merged to provide information from multiple sources and to provide improved results for decision making [
59].