1. Introduction
Coffee crop is the second most traded commodity in the world, second only to the oil production chain. Brazil is the main coffee producer in the world. Although the coffee share in the Brazilian’s exports has declined over time due to product diversification, it is still an important generator of foreign currency for the country [
1].
The use of remote sensing data to coffee crop has proved to be very promising, since there is difficulty in obtaining field data on a regional scale, especially for field mapping. However, the process for getting information from satellite data can be complex because it depends on spectral, temporal and spatial resolutions from the sensor used. In a comprehensive study to assess the accuracy of classification methods for coffee mapping in Costa Rica, Cordero-Sancho
et al. [
2] considered the results obtained only moderate. The authors attributed the errors to topographic effects and also to Landsat spatial resolution, which was insufficient to detect the average size of farms in the region. In Brazil, according to [
3], only 68% of the crop fields mapped through Ikonos-II images were also identified on Landsat images.
The spectral crop behavior in Landsat images varies depending on the crop and the dates of the images, but especially for coffee crop, these variations can also be related to several conditions, such as: planting density, crop management, crop age, cultivation, and others [
4].
Crop spectral patterns prevailing in a satellite scene present several characteristics due to different situations, such as: crop phenological stages and vegetative vigor, plant spacing, intercropping system and management practices [
5,
6]. These characteristics make it difficult to map and monitor processes for coffee crop from satellite data. However, the generation of new remote sensing products with significant improvements related to spatial, spectral, radiometric and temporal resolutions brings new perspectives for the development of applied studies.
Landsat images, due to their spatial and spectral resolutions, are more suitable for mapping coffee fields, however, they can be restricted to a few scenes free of clouds and this condition can obstruct an effective monitoring during crop development [
7–
9].
Vegetation indices have frequently been used for crop yield forecasting based on empirical regression models and yield models [
10–
16]. MODIS data, despite not having suitable spatial resolution to correctly identify coffee plantations, it has an appropriate temporal resolution for monitoring agricultural fields. Vegetation indices derived from MODIS data include geolocation accuracy [
17] and atmospheric correction, which enable vegetation monitoring [
18]. With global coverage almost daily, the system has a better chance of providing cloud-free products at regular time intervals. Brunsell
et al. [
19] assessed the feasibility of using MODIS data to monitor coffee productivity in a municipally in southern Minas Gerais, Brazil. It was concluded that the coarse spatial resolution of MODIS data is offset by high temporal coverage, a benefit that favors coffee field monitoring by this sensor.
One of the main characteristics of coffee crop is that it takes two years to complete the entire phenological cycle of fructification. Branches that grow in the first phenological year will produce coffee beans in the second phenological year. In high-production years, a plant works mostly toward grain-filling to the detriment of new branch growing which will be responsible for production the following year. In low-production years the plant works rather to grow new branches which will produce beans the subsequent year. Thus a coffee plot exhibits high and low production in alternated years. Additionally, this relationship between leaf biomass and coffee yield is influenced by the occurrence of diseases, especially coffee rust (
Hemileia vastatrix). In years with high production, rust infestations are more severe resulting in high leaf fall after harvest and, consequently, it causes yield reduction the following year [
20–
24]. Therefore, the occurrence of rust in years of high yield accentuates the effect of coffee biennially [
25].
Coffee yield forecasting in Brazil relies on the assumption that weather is the main factor responsible for bean yield [
26,
27]. Although the biennial bearing effect on coffee yield and its importance in yield modeling are well known [
28], there is no effective tool to assess this pattern and estimate it in spatial domain.
Considering this predominant yield alternation in coffee crops in consecutive years and the relationship between yield and leaf area, it is possible to expect similar patterns in the alternation of vegetation indices. In this case, correlations could be used as an indicator of yield biennially. This relationship has been studied in individual coffee fields scale [
20–
24] or spatially aggregated over a municipally scale [
19] but, up to now it has not been spatially assessed in the whole largest Brazilian coffee-exporting province.
This study aimed to evaluate the potential of using NDVI and EVI indices generated from MODIS product (MOD 13) to detect the biennial coffee yield from 2002 to 2009, in the southern region of Minas Gerais State, Brazil.
2. Methodology
The selected study area covers the southern region of Minas Gerais State (coordinates 20°00′–23°00′S and 43°50′–47°30′W), where the coffee yield represents more than a half of the state total production (
Figure 1). The region was chosen based on its importance in the national coffee yield and also on its diversity of environments and cropping systems. A humid subtropical climate (Cwa) characterizes the region, according to the Koppen classification, with hot and humid summers and mild to cool winters.
We used EVI time series [
29–
31] and NDVI vegetation indices [
32], totaling 23 scenes per year, both indices derived from MODIS sensor, product MOD13. The years from 2002 to 2009 were considered in order to verify the possibility of detection of biennial coffee yield. MODIS images, which were obtained originally in HDF format (Hierarchical Data Format) and in sinusoidal projection, were processed using application MRTool–MODIS Reprojection Tool. The data was initially reprojected to latitude and longitude geographic coordinates, WGS84 datum and then, converted to GeoTIFF format. A coffee field map obtained from Landsat TM/5 satellite by [
33] for the year 2007 also was considered in this study.
2.1. Selection of Pixels with Coffee Fields
Since there are coffee plots in the study region smaller than the minimum area of MODIS pixels (6.25 ha), pixels with spectral mixture among coffee fields and other land cover classes, as well as, mixed patterns with different coffee plots are expected. Thus, we selected only pixels from EVI and NDVI images which represent homogeneous coffee fields. This process was carried out based on coffee field maps obtained from TM/Landsat 5. The procedure used to co-register Landsat images was the same presented by [
34] with Landsat Enhanced Thematic Mapper Plus (ETM+) and MODIS/TERRA multi-temporal data.
Only pixels 100% occupied by coffee areas according to the Landsat derived coffee map [
33] were selected from MODIS images. However, due to several factors as age of coffee crops, plant density, type of substrate,
etc, different spectral responses can be found in the pixel, according to
Figure 2.
Then we calculated the statistics of Landsat pixels (channel 3) confined into MODIS pixels (NDVI and EVI images) in order to minimize the effect of spectral mixture. We have used Landsat channel 3 due to the well-known interaction between this wavelength (red) with vegetation canopies.
Coefficient of variation was used to describe variability. In this case, lower coefficient of variation indicate less variation of Landsat pixels located in MODIS pixels, suggesting the presence of large homogeneous stands.
Figure 3 illustrates the processing performed for the selection of these pixels and an analysis of temporal patterns. This statistical analysis was made using a SPRING GIS tool (Geographical Information System) developed by [
35].
We initially generated a cadastral vector file from MODIS images where each object corresponds to one pixel. This vector was used to calculate the statistics of channel 3—TM/Landsat image. The vector was updated with the Landsat derived coffee map in order to obtain the percentage of coffee field in each object. A query was made to the database to get only objects with 100% of coffee field and coefficient of variation of less than 40 for channel 3 (indicating homogeneous crop). Then the selected pixels are related only to homogeneous coffee plots bigger than 6.25 ha. These pixels were used as the basis for selecting coffee plots in field data collection. The correlations were considered taking into account yield of individual coffee plots and overlapping pixels of EVI/NDVI.
2.2. Filtering the Time Series
The time series corresponding to the selected pixels were filtered using a wavelet based filtering (
Equation (1)), according to [
36–
38]. This procedure was established in order to eliminate possible noise or pixels with presence of clouds. We assumed that the frequency of noise components in the time series of vegetation indices is greater than seasonal changes in these coffee field indices, and a reconstruction of the series by selecting high frequencies allowed the data filtering without changing the pattern of seasonal changes:
where
a is a scale parameter and
b is the change parameter which represents the mother wavelet ψ [
39].
2.3. Metrics Derived from Vegetation Indices
The vegetation indices were converted into five metrics (amplitude, sum, maximum, minimum and average) in order to link biomass reading and yield values throughout the year. For each year of the time series, the amplitude of vegetation indices concerning maximum and minimum values throughout the year for each selected pixel was calculated in order to quantify the magnitude of leaf loss within each crop year (
Figure 4). We assumed that the vegetation indices represented the crop leaf area and the annual variations that occurred in these indices expressed the gain or loss of leaves during the coffee crop development within the agricultural year. Since the plant works to grow branches (and leaves) which will produce beans the next year, these metrics derived from vegetation indices during the year can be linked to yield. Hereafter when we mention vegetation indices we refer to the metrics derived from vegetation indices.
Besides the amplitude values, for each selected pixel we also evaluated the sum of vegetation indices [
40,
41], the maximum, minimum and average values for each year, in order to identify which metric could present better correlation with productivity.
NDVI and EVI data (amplitude, sum, maximum, minimum and average) were weighed in relation to the maximum value observed in the time series as shown in
Equation (2):
where
IVw: Weighed vegetation index;
IVa: Vegetation index in the year;
IVm: Maximum vegetation index in the time series.
2.4. Yield Data
Yield data from 20 to 37 plots of Mundo Novo varieties (60-kg bags of green coffee per hectare), corresponding to the pixels selected from 2002 to 2009 were obtained (
Table 1). This data was collected in interviews with farmers from four locations: Três Pontas, Boa Esperança, São Sebastião do Paraíso e Monte Santo de Minas. Then, yield data and EVI; NDVI metrics (amplitude, sum, maximum, minimum and average) were weighed in relation to the maximum value observed in the time series according to
Equations (2) and
(3):
where
Pn: weighed yield;
Pa: year yield (60-kg-bags of green coffee per hectare),
Pm: maximum yield in the series (60-kg-bags of green coffee per hectare).
2.5. Correlations
Correlations (Pearson correlation coefficient) were calculated for two different situations: (i) correlations between variation in yield and variation in vegetation indices in the same year (
Equations (4) and
(5)), assuming that an increase in yield could result in a reduction of vegetation indices and; (ii) correlations between variation in vegetation indices and variation in yield the following year (
Equations (5) and
(6)), assuming that an increase in vegetation indices could result in an increase of yield the following year. This approach was adopted to highlight the coffee biennial yield and also, to see a possible alternating pattern of vegetation indices during the yield for each two years. Variations or differences in vegetation indices and yield between two years allow us to assess the effect of biennial yield in alternated years.
where Δ
Pi: yield variation for the year (
i);
Pwi: weighed yield for the year (
i);
Pwi−1: weighed yield for the previous year (
i−1); Δ
IVi: vegetation index variation for the year (
i);
IVwi: weighed vegetation index for the year (
i);
IVwi−1: weighed vegetation index for the previous year (
i−1); Δ
Pi+1: yield variation for the following year (
i+1);
Pwi+1: weighed yield for the following year (
i+1);
Pwi: weighed yield for the year (
i).
4. Discussion
Although the correlation results were smaller than those found in studies with annual crops [
11–
15] and errors were high, the trends suggest a leaf area dependence in relation to coffee yield in the same year. Coffee is a perennial crop and takes two years to complete its phenological cycle, unlike most other crops, which complete their reproductive cycle in one year. Thus this crop represents a unique set of problems because it follows a biannual phenological cycle and exhibits high and low production in alternated years. This feature has been reported by several authors [
26,
27] as one important factor to be incorporated on agrometeorological models for estimating coffee yield. An effective tool to assess this pattern and estimate it in spatial domain could improve significantly coffee yield modeling.
Brunsell
et al. [
19] used lagged correlation analysis and deviations from the annual cycle to relate yield to accumulated deviations in fractional vegetation. MODIS vegetation indices were spatially aggregated over the municipally of Monte Santo de Minas, Minas Gerais, Brazil. The authors noted that data from MODIS vegetation indices converted into fractional vegetation indicate trends in coffee yield. Since the correlation between vegetation indices and yield is significant in our study, the alternated pattern in coffee yield is also true for vegetation indices. Thus, it is possible to infer the biennial effect through vegetation indices.
As shown in
Tables 2–
5, these trends can also be seen in the level of pixels
vs. yield of coffee plot, especially for the amplitude and minimum values of VI.
The higher the coffee yield, the higher the defoliation levels, as suggested by positive correlations between VI variation range (ampEVI and ampNDVI) and variation in crop yield in the same year (
Tables 2 and
3). Similar results were obtained by [
44] in a field-based study where the authors found correlations around 0.90 between yield and crop defoliation in southern Minas Gerais region, Brazil. This defoliation could be attributed either to mechanical damage caused by harvest procedure or to a higher incidence of diseases such as coffee rust [
20,
21,
24].
For the correlation analyses between VI and coffee yield the following year (
Tables 4 and
5), the leaf biomass indicates trends in yield as an indirect result of increases in blooming. The water stress in 2007 could have affected blooming development and, consequently, the relationship between coffee yield that year and VI’s in 2006 (
Figure 8). Since it is not possible to use passive remote sensing to quantify the magnitude of the coffee blooming in the study region because of the presence of clouds when coffee trees are flowering, leaf biomass is still a reasonable way to estimate coffee yield.
The better results obtained with the minimum values could point toward the most appropriate period in the study region for image acquisition to be used in further studies. When there are no clouds during this period (August/September) and if there are other images available with better spatial resolution as TM/Landsat, good results are also expected, since the main limitation of MODIS images in this case is their low spatial resolution.
In addition to the meteorological variables, the spectral response of coffee plantation can be influenced by pruning, plant spacing, intercropping system and others cultural practices. These factors can promote several uncertainties when vegetation index is correlated with productivity as shown by the high errors obtained. Moreover, even the best results of this analysis show that the vegetation indices can only explain about 50% of yield variance. In this way correlation analysis showed that vegetation indices did not entirely explain yield variation because there are a lot of factors responsible for final yield, but these indices could be useful as indicators of coffee biennial yield.
In relation to the vegetation indices evaluated, minEVI seems to be slightly better than minNDVI in our study, since in the analyzed period the correlations between minEVI values and yield were significant while the correlation between minNDVI values and yield were not significant in all analyzed years (
Tables 2–
5). A known limitation of NDVI index is the decrease or the saturation in sensitivity when leaf area index values (LAI) increase [
45,
46], such as coffee trees that can reach values of LAI higher than 8 [
47]. Delalieux
et al. [
48] found the NDVI saturation effect when leaf area rates of crops exceed the value 5. On the other hand, EVI has shown to be less prone to saturation effect with higher sensitivity in regions of high biomass [
49,
50] which could make it more appropriate for studies of coffee canopy.
In our study we assessed correlations between metrics derived from MODIS vegetation indices and coffee yield. Only pixels overlapping large coffee plots were related to yield of the respective plots. The results point toward the possibility of using higher spatial resolution imagery to estimate the biennial bearing effect on coffee yield at the level of individual coffee fields. It can also be associated to agrometeorological models for estimating coffee yield in the spatial domain.
5. Conclusions
Among the five metrics derived from MODIS vegetation indices, including amplitude, sum, maximum, minimum and average values over the year, the minimum values were better correlated with coffee yield. Based on significance of correlation for every year and higher Pearson coefficients, correlation analysis between minimum values of vegetation indices (NDVI; EVI) and coffee yield the following year presented better results—Pearson coefficients ranging from 0.29 to 0.74 for minEVI and 0.27 to 0.68 for minNDVI, significant in both cases in five out of the six analyzed years. Even the best results of this analysis showed that the predictions are not high—Standard errors of prediction ranging from 0.42 to 0.61 for minEVI and 0.45 to 0.61 for minNDVI. The lowest correlations, that occurred in 2006 (variation in VI’s 2006 vs. variation in Yield 2007), with Pearson coefficient of 0.29 for minEVI and of 0.27 for minNDVI, have been caused by a long-drawn drought in 2006 which affected flowering development and, consequently, the relationship between coffee yield in 2007 and VI’s in 2006. Therefore, if no extreme weather event happens, minimum values of EVI and NDVI over the year were found to be useful as indicators of coffee biennial yield, since the trends properly reflect the coffee yield. Moreover, the best correlations were observed with vegetation index data obtained in August/September, the period in which there are cloud-free higher spatial resolution images, which can produce better results.
Despite the low spatial resolution of MODIS data and the fact that yield is a complex factor because it depends on several conditions, the indices were able to express the relationships between leaf biomass and coffee yield. Although coffee yield cannot be estimated exclusively from MODIS vegetation indices, these indices can be derived from higher spatial resolution images in order to obtain better results and can be used coupled with agrometeorogical models for estimating coffee yield.