Grasslands are the largest of the Earth’s four major vegetation types and belong to the world’s most productive agricultural lands [1
]. Grass swards are known for their high spatial and temporal variability [2
] and therefore require intensive monitoring to enable a management that is well adjusted to the prevailing environmental conditions. Maps displaying the phenological status of grassland allow creating a spatio-temporal model of grassland production [3
]. Such information is crucial for adjusting the nutrient management in order to mitigate water pollution (through leaching of soil nutrients) and to conserve the flora and fauna of grassland ecosystems. Commonly, grassland agronomists evaluate the phenological status, the quality and the spatial distribution of grasslands in costly and time-consuming field surveys, as described in Cornelissen et al. [4
]. The methods defined in this handbook require destructive sampling and allow only a low spatial and temporal data density. Research developing non-destructive, cost-effective and time-saving methods for grassland mapping and monitoring is urgently needed. These goals can be reached using a combination of minor field sampling efforts and remote sensing [5
Among remote sensing methods, Vegetation Indices (VIs) are frequently used for assessing different grasslands types (e.g., [5
]). Classification of these types based on VIs requires consideration of the timing of data acquisition in relation to growth stage, as well as of type, wavelength and bandwidth of the VI used. Aragón and Oesterheld [5
] have shown that classification accuracy of grasslands depends on the date of the spectral observation. The reason for these changes in classification accuracy is that the optical properties of grassland vegetation are underlying permanent variations [8
These variations in the individual VIs throughout a growing season are caused by changes in the vegetation’s biophysical properties as a result of climatic conditions in interaction with management actions [9
]. VIs composed of near-infrared and visible light strongly correlate with biomass [10
], LAI and chlorophyll content [11
]. Thus, VIs based on these wavelengths are capable of distinguishing vegetation canopies, as long as they differ significantly in these variables. In contrast, spectral reflectance at longer wavelengths (between 1300 and 2500 nm) is highly influenced by water absorption of plants [13
]. This relation between plant water content and absorption of incoming radiation affects the spectral reflectance of plants. Thus, selecting VIs sensitive to the vegetation’s water content may enable a successful classification at times when those VIs appear relatively similar, which are sensitive to other crop variables, such as biomass, leaf area index or chlorophyll content. Further, VIs more sensitive to nitrogen and lignin content, as well as to morphological and physiological properties indicating plant stress may improve classifications at times when grasslands differ in these variables.
Furthermore, bandwidth and spectral resolution have a major impact on the ability of VIs to distinguish grassland communities. Although several efforts have been made to classify grasslands using broadband data [15
], it was found that they are not spectrally distinct in the broadband system throughout all times of a growing season [18
]. Numata et al. [18
] have shown that the utilization of hyperspectral data may significantly improve the quality of maps displaying the distribution of different grassland communities. Novel or planned hyperspectral satellite missions (such as Hyperion, EnMAP and Hyperspectral Infrared Imager—HyspIRI) may open new perspectives for mapping grasslands using narrowband VIs and are of special importance for distinguishing grassland types and management characteristics, as well as for monitoring these plant communities [1
Hyperspectral RS can provide a cost-effective, time-saving and non-destructive method for mapping differences in growth dynamics as induced by fertilizer application, as well as the spatial distribution of grassland communities at high spatial and temporal resolution. However, multitemporal studies investigating the performance of hyperspectral and broadband VIs for mapping Central-European grasslands are still missing. Basic research into this direction necessitates not only a high temporal, but also a high spatial and spectral resolution of remote sensing data across clearly-defined grassland communities. These requirements are not met using satellite data only, because frequent cloud cover and low spatial coverage of hyperspectral satellite systems prevents acquiring dense time series in Central Europe. The solution lies in utilizing field spectrometers, which can acquire hyperspectral data on demand for dates when cloud cover is low. To collect spectra of clearly separable grassland communities, we identified a fertilizer experiment on grassland as the ideal setting.
The above-mentioned deficits in the classification of grassland types make clear that studies are required where the criteria growth stage, bandwidth and type of VI are rigorously tested. Therefore, the aim of this study was to identify single VIs providing the most valuable information at specific points in time that enable grassland scientists to distinguish between differently-managed grassland communities. Furthermore, we introduce a novel approach in grassland research, which uses multiple VIs sensitive to changes in different plant properties for classifying grasslands. Accordingly, our key hypotheses were:
Each plant community on grassland is characterized by a unique VI development throughout the growing season.
The performance of each VI for distinguishing such plant communities varies throughout the growing season.
VIs sensitive to a certain plant property (e.g., chlorophyll content) may allow a successful classification at times when VIs sensitive to other plant properties (e.g., water content, biomass, etc.) fail to classify correctly.
Combining several VIs for grassland classification allows a temporally more stable classification of grassland communities than one single VI because for distinguishing communities at a point in time, a set of optimal VIs is selected.
To test these hypotheses, we collected a series of spectral signatures of five species-rich grassland communities in a long-term fertilization experiment for 38 dates during the growing seasons of 2012–2014. Subsequently, we calculated fifteen broadband and narrowband VIs and tested the performance of each single VI for distinguishing these five communities using the Welch test [19
]. In addition, we tested multiple VIs for classifying the same grassland communities using the random forests algorithm [20
]. Finally, we compared the classification accuracy achieved using single VIs and the Welch test to the performance reached by the random forests algorithm.
The results of this multi-temporal study are presented in the following sections separately for pooled data on Growth 1 and Growth 2. Treating the two growths individually was important because they varied significantly in their development. This implies important consequences for the classification of the grassland communities.
3.1. Seasonal Curves of the VIs
In Figure 2
, the developments of narrowband Red Edge Position (nREP), Leaf Chlorophyll Index (LCI), NDVI, narrowband Water Index (nWI) and narrowband Water Content (nWC) by T∑ within the five plant communities are shown. These VIs were selected for visual presentation because they represent the best performing VIs for each group averaged over both growths of VIs according to Section 2.4
, as well as the best performing VIs of each growth. Information on the performance of the eleven other VIs is attached in Supplement Table S8
. The shape of the curves of all VIs measured in the Ca, CaN and CaNP treatment were more similar in Growth 2 than in Growth 1. This was mostly related to the lower amplitude of changes in VIs that we observed in Growth 2.
The curves of nREP, LCI, NDVI and nWI increased at the beginning of both growths in all plots. Thereby, the slopes of the curves derived for the NP(K)-treated plots were steeper than those in the Ca and CaN plot, but the CaNP treatment did not reach as high VI values as the NPK treatments. The T∑ at which peaks of these four VIs in the NP(K)-treated plots were reached differed between the VIs and between the two growths. The highest values in these four VIs were found at higher T∑s in Growth 1 than in Growth 2. Furthermore, the chlorophyll- and biomass-related VIs (nREP, LCI and NDVI) reached their peaks earlier than the water-related nWI. Towards the end of both growths, values of the four VIs decreased steeply in the NPK treatments. In contrast, VIs in the Ca and CaN plots remained relatively stable. Opposed to the four previously-mentioned VIs, the curve of nWC decreased at the beginning of both growths. Thereby, the highest decline rates were observed in the NP(K) treatments, which reached their minimum later in Growth 1 than in Growth 2. Afterwards, nWC in the NP(K) treatments began to rise. nWC in the Ca and CaN treatments decreased slower than in the NP(K) treatments at the beginning of both growths, but remained relatively stable towards their end.
In Growth 1, curves in the Ca, CaN and CaNP treatments featured differences in their values of nREP, LCI and nWC throughout most phases of the growing season. However, the curves measured in the CaNPKCl and CaNPK2SO4 treatments were similar in these three VIs. At the same time, these treatments differed considerably in nWI, particularly at the end of Growth 1. At the beginning of Growth 1, NDVI values were different between the plots. At later stages, NDVI values in the five plots became relatively similar. In Growth 2, larger differences between the CaNPKCl and the CaNPK2SO4 treatment were identified for nREP, LCI and NDVI than in Growth 1. In contrast, both NPK treatments showed similar courses of their curves for nWI and nWC.
3.2. Distinguishing the Grassland Communities Using the Welch Test
Although time courses of many VIs tested in this study followed similar patterns, uncertainty remained for how the identified differences have influenced the accuracy for classifying the grassland plots using one VI at a time. Table 3
displays the overall accuracies of the single VIs achieved using the Welch test. In the first growth, nWC significantly differed between the five plots in 91% of the cases and provided the highest rates of discrimination, followed by nWI, nLCI, narrowband Structure Intensive Pigment Index (nSIPI) and LCI. The weakest accuracy was achieved by narrowband Normalized Phaeophytization Index (nNPQI), followed by narrowband Photochemical Reflectance Index (nPRI), narrowband Normalized Pigment Chlorophyll Index (nNPCI) and narrowband Green Normalize Difference Vegetation Index (nGNDVI). In the second growth, significant differences in the VIs between the plots were most frequently found using LCI, nWI, nREP, nSIPI and nWC. As in Growth 1, the lowest classification rates in the second growth were reached with nPRI, nNPQI and NPCI.
Regarding the differences in classification accuracy reached by the broadband and narrowband versions of VIs, results between the two growths differed. In Growth 1, broadband NDVI and GNDVI outperformed their narrowband versions, whereas the opposite relation was found in Growth 2.
The probability for nREP, LCI, NDVI, nWC and nWI to distinguish plots at different T∑s correctly is shown in Figure 3
. At the beginning of the first growth, accuracies of LCI, NDVI, nWI and nWC were above 80% and further increased to more than 95%. The accuracy of nREP was about 50% at the onset of growth, but drastically increased during the initial stages of Growth 1 and successfully separated more than 95% of the plots at a T∑ of 450 °C d. Afterwards, the accuracies of nREP, LCI and NDVI dropped to levels between 65% and 70%. Accuracies of nWC and nWI remained more stable, making them the strongest and second strongest VI, respectively. At the end of Growth 1, the accuracy of the chlorophyll-related VIs started to increase earlier than the accuracy of the water-related VIs. Thus, they achieved similar accuracies, like the water-related VIs.
During the onset of the second growth, the highest classification accuracies were reached using nWI (above 90%). Similar to the previous growth, the accuracies of all VIs (except for nREP) increased until T∑s of 400–450 °C d were reached. Afterwards, all VIs’ accuracies dropped, except for REP, which remained constant at this time. As LCI recovered as the first VI, it was the best performing VI between 800 and 1150 °C d. The water-related VIs recovered in their accuracies the latest. At the end of Growth 2, nREP steeply increased and outperformed the other four VIs.
3.3. Random Forests Classification
The random forests classification was performed to investigate how the utilization of several VIs improves the classification accuracies, compared to single VIs. The error rate of the classification was expressed as OOB error, which is shown as a function of T∑ in Figure 4
. It was observed that OOB errors in the second growth (12%) were higher than in the first growth (5%). Furthermore, the classification error in Growth 1 remained relatively stable (between 2.5% and 6%). In contrast, OOB errors in Growth 2 varied significantly with the T∑ and ranged between 2.5% and 17.5%.
To identify the probability of a VI for being selected as a split variable in random forests, the variable importance was calculated and averaged for each growth (Figure 5
). The results of the t
-test (cf. Table A1
) indicate significant differences in the VIs’ importance. In the first growth, nWI reached the highest importance (16% of the decisions made) and was selected significantly more frequently than all other VIs, except for nREP, which ranked second. LCI, nSIPI, nWC and narrowband Normalized Difference Lignin Index (nNDLI) outperformed NDVI, nGNDVI, nNDVI, GNDVI, nPRI, nNPCI and nNPQI significantly. Medium to low importance (between 6% and 4%) was found for nNDNI, nLCI, NDVI, nGNDVI, nNDVI and GNDVI. These VIs have been of significantly higher importance than nNPCI and nNPQI. During the second growth, the differences in importance between the VIs were lower than in the first growth. However, similar VIs influenced the classification the strongest: nREP, nWI, LCI, nSIPI and nWC were of significantly higher importance than nLCI, nNDVI, GNDVI, NDVI, nPRI and nNPQI. Of medium to low importance were nNDNI, nNPCI, nGNDVI, nNDLI, nLCI, nNDVI, GNDVI and NDVI. nNPQI and nPRI were of significantly lower importance (2.9% and 3.5%, respectively) than all other VIs.
shows the variation in importance of nREP, LCI, NDVI, nWI and nWC throughout the growing season. At the beginning of Growth 1, nREP was most important for the classification. However, at T∑s above 400 °C d, the importance of all biomass and chlorophyll-related VIs (nREP, LCI and NDVI) decreased, whereas the importance of the water-related VIs (nWI and nWC) increased. Thus, at T∑s larger than 600 °C d, nWI was of highest importance, followed by nWC or nREP. After a T∑ of 900 °C d was reached, the importance of nWI and nWC decreased. Consequently, the classification was more influenced by chlorophyll-related VIs (particularly LCI and nREP) at T∑s larger than 1150 °C d. During the entire first growth, NDVI was among the VIs with the lowest importance.
At the onset of the second growth, nREP was the most important VI. However, over time, the importance of LCI and nWI increased and exceeded nREP at T∑s larger than 650 °C d. Similar to nWI, nWC increased in importance at the beginning of the second growth, but did not reach the high levels of nREP, LCI and nWI. At T∑s larger than 950 °C d, the importance of nWI decreased. Thus, during the remaining period of Growth 2, LCI or nREP featured the highest importance. Similar to Growth 1, NDVI had a minor impact on the classifications throughout the entire growth.
4.1. Critical Reflection on the Experimental Settings
In this work, the performance of fifteen different VIs for distinguishing five differently-fertilized grassland communities (Ca, CaN, CaNP, CaNPKCl, CaNPK2
) was tested. Constant fertilization lead to a characteristic plant species composition in each plot, which differed between treatments and changed only marginally between years [23
]. Similarly, biomass production varied between the plots; the lowest biomass was produced in the Ca treatment, followed by CaN, CaNP and the NPK treatments [21
] (cf. Supplement Table S7
Because observations were made in three years, it was assured that annual fluctuations in precipitation or radiation were captured in the dataset. This setup, as well as the utilization of T∑ as the temporal variable secured that no climatic or management factors influenced the species composition and the biomass development. Minor disturbances such as lodging of plants (especially at the end of the first growth in the NPK-treated plots) and the naturally-occurring variability of the plant canopies was diminished by acquiring measurements at three different locations within each plot. Stagakis et al. [45
] identified significant effects of the sensor viewing angle and the sensor height on the spectral reflectance measured over plant communities. In this study, the utilization of the crane system allowed systematic measurements from a 2-m height and nadir position, which widely eliminated these effects. Changing cloud cover [26
], as well as changes in solar elevation angle [45
] are often considered as additional confounding factors of the spectral measurements. Diurnal changes in solar elevation were adapted by acquiring spectra between 10:00 a.m. and 4:00 p.m. and averaging this information for an entire day. Visual inspection of VI values recorded on the single days revealed that VI values were relatively stable. Furthermore, acquiring data in a large interval in time covered the daily occurring variance in spectral signal and further stabilized VI values. However, seasonal changes in solar elevation may have altered the irradiation conditions. However, we assume that these effects were minimized by utilizing VIs because their use significantly lowers the impact of illumination conditions and cloud cover (cf. [47
]). Additionally, frequent measurements of the white reference panel ensured that reflectance was adapted for changes in solar irradiance.
4.2. Seasonal Curves of the VIs
Based on multitemporal measurements of five different VIs (nREP, LCI, NDVI, nWI and nWC) sensitive to different biophysical parameters, differences in the VI development over time were investigated. Comparing both growths, similar developments were observed. However, lower maxima of positively-developing VIs (nREP, LCI, NDVI and nWI), as well as higher minima for negatively-developing VIs (nWC) were observed in Growth 1. The reason behind this is that growth rates were lower in the second growth (cf. Supplement Table S7
), resulting in lower intensities of green vegetation reflectance and lower amounts of water stored in plants.
As observed by other scientists [8
], VIs followed strong seasonal dynamics throughout the growing season. In the NPK-fertilized plots, VIs related to biomass, as well as to chlorophyll and water content, such as nREP, LCI, NDVI and nWI, passed through a rapid increase at the beginning of both growths. At later stages, nREP, LCI and NDVI decreased as a result of starting senescence in the NPK-treated plots. This was supported by [13
], who found significant effects of senesced grass components on the near-infrared reflectance, which has been used for the calculation of these VIs. In contrast, nWI remained at a high level for a longer time, indicating that water content stayed at a high level at the onset of senescence. At the time, NP(K) fertilized plots started senescing, all four VIs in the Ca and CaN plots increased because these plots host slow-growing species [23
]. Such species not only increase in their biomass over a longer time-span, but also remain relatively constant in their water and chlorophyll content. nWC developed contrary to the four other VIs due to its negative response to water content.
4.3. Testing the Classification Accuracy of the Fifteen VIs Using the Welch Test
To test the ability of every single VI for distinguishing the five different vegetation communities, a Welch test was performed for each T∑. The average of the Welch tests’ results allowed an estimation of the overall classification accuracy of each VI in the individual growths. The average of the classification accuracies revealed that classification of all VIs (except nNPQI) was lower in Growth 2 than in Growth 1. The explanation for this is that VI curves in the second growth featured lower amplitudes and were less distinct from each other. These relatively low amplitudes were caused by a lower and more similar sward height and biomass production in Growth 2 (Table A2
At the beginning of both growths, classification accuracies were relatively low. We assume that this was caused by small differences in canopy biomass occurring at this stage. This is supported by measurements of Compressed Sward Height (CSH), which are highly correlated to canopy biomass [49
]. CSH featured a standard deviation in compressed sward height between the five communities of 2.93 cm in the initial stage (464 °C d, 2014) and 18.21 cm in the final stage (464 °C d, 2014) of Growth 1 (Table A2
). Similarly, a standard deviation of 1.18 cm was shown at the beginning (210 °C d, 2014) and 4.57 cm at the end (1353 °C d, 2014) of Growth 2 (Table A2
). Furthermore, it is likely that subtle differences in the reflectance of the plant canopies in the early stages of both growths have been concealed by the influence of soil on the reflectance signal (cf. [18
]). At later stages of the two growths, biomass increased steadily, which mitigating the contribution of soil reflectance to the spectral reflectance (cf. [2
]). Combined with increasing differences in biomass occurring between the different fertilizer treatments, an increase in the classification accuracies was achieved. However, as soon as biomass production in the NPK-treated plots languished, classification accuracies dropped, as canopies were optically more similar.
Due to the onset of senescence in the NPK-treated plots, on the one hand, and the constant abundance of green biomass in the Ca and CaN plot, on the other hand, classification rates at the ends of both growths increased. This contradicts with results published by Fava et al. [10
], who stated that the senescence of vegetation lowers the differences between VIs. We assume that the postponed onset of senescence in the Ca and CaN plots led to increased classification accuracies in this study. This is supported by Sanchez-Azofeifa et al. [51
], who demonstrated that the ability to distinguish plant types based on spectral information is strongly dependent on their phenological stage.
During the first growth, the most successful separation in this experiment was achieved using nWC and nWI, which are known to respond to plant water content. The high overall classification rates of nWI and nWC were caused by the relatively stable classification accuracy in the middle of the growth. This stability was caused by differences in water holding capacity of the canopies because canopies featuring low biomass production (Ca and CaN treatments) desiccate earlier after rain events than the canopies of highly productive communities (NPK treatments). VIs coined for detecting changes in biomass, such as NDVI, as well as nNDVI, GNDVI and nGNDVI, achieved lower classification accuracies. The reason for the poor performance of these VIs is the strongly decreasing classification accuracy in the middle of the growths. An explanation for this decrease in classification accuracy are the saturation effects of these VIs, which were observed under high LAIs (cf. [52
]). Interestingly, LCI, which is related to chlorophyll content, was less affected by saturation in the middle of the growth and seems to be more suitable for mapping grasslands featuring high LAIs than the previously-mentioned VIs. In the final stage of Growth 1, the classification accuracies of nREP and LCI increased and reached the accuracy level of nWI and NWC. This behavior is explained by the good performance of these VIs for distinguishing the senescing NPK treatments from the Ca and CaN treatments. nNPQI (related to phaeophytization), nPRI (indicating plant vitality) and nNPCI (related to chlorophyll content) achieved the lowest accuracies.
During the second growth, the highest classification accuracies were achieved by LCI, followed by nWI and nREP, which confirms the strong performances of these VIs found in Growth 1. The relatively good performance of LCI and nREP compared to the water content-related VIs in Growth 2 leads to the assumption that the lower LAIs support the utilization of chlorophyll- or biomass-related VIs. Again, our results indicate that nNPQI, nPRI and nNPCI, as well as the nGNDVI, nNDVI, NDVI and GNDVI are not well-suited for classifying our grassland communities. Finally, nWI and nSIPI, provided high classification accuracies for both growths, which suggests that these VIs are relatively stable for the classification of grasslands affected by different growing conditions.
No clear result was obtained whether broadband or narrowband VIs provide the highest accuracies for classifying the studies grassland communities. In Growth 1, broadband NDVI and GNDVI outperformed the narrowband versions, whereas in Growth 2, the narrowband versions provided higher classification accuracies. These results contradict the findings made by Thenkabail et al. [55
], who found an increased classification accuracy using exclusively narrowbands to separate weeds, shrubs, crops and grasses in Africa compared to a classification using exclusively broadbands. This suggests that the selection of broadband or narrowband VIs for achieving optimal classification is also dependent upon the timing of spectral data acquisition.
4.4. Random Forests Classification
We applied a random forests classification to investigate how much the inclusion of all fifteen VIs improved the classification of the five plots compared to single VIs. Our results show that classification accuracy increased by 4% in Growth 1 and 12% in Growth 2 in comparison to the strongest VI using the Welch test. Furthermore, the classification accuracy achieved by random forests was more stable (remaining always above 82%) than the best performing VIs identified using the Welch test. In Growth 1, classification rates varied marginally over time, whereas OOB errors in Growth 2 fluctuated between 2.5% and 17.5%. These variations in Growth 2 and the resulting lower overall accuracy were most likely caused by larger influences of soil reflectance on the signal (cf. [2
]) at the beginning of Growth 2, as well as by smaller differences in between the plots’ canopy biomasses in the green-up phase (Table A2
). The increasing classification accuracy at the end of Growth 2 was caused by the onset of senescence in the NPK-treated plots, whereas the other plots remained stable in their amount of green leaf area (cf. [56
Variable importance indicates the frequency a VI is selected inside a random forest. In Growth 1, nWI was the most important VI, followed by nREP and LCI. Our results show that other VIs, such as nSIPI, nNDLI, nNDNI and nLCI, carry additional important information for improving the classification. Interestingly, other biomass-related VIs (i.e., NDVI, nGNDVI, nNDVI and GNDVI), as well as water-related VIs (i.e., nWC) were of relatively low importance compared to the Welch test. We assume that nWI as a water-related VI, as well as nREP and LCI as chlorophyll-related VIs carried redundant information and lead to the low importance of these VIs. Of minor importance were nPRI, nNPCI and nNPQI, which indicates that these VIs marginally support the classification. In the second growth, the ranking in importance of the VIs was similar, but the weight of the VIs was more evenly distributed. This even distribution supports the assumption that especially if classification accuracy was low using single VIs, the addition of VIs sensitive to other biophysical variables increased the classification accuracy. Therefore, nREP as the most important VI was closely followed by nWI, LCI, nSIPI and nNDNI. As biomass-related VIs were less influenced by the high LAIs in Growth 2, nGNDVI and nNPCI increased in importance compared to Growth 1. As these two VIs included similar information as nNDVI, GNDVI and NDVI, the latter VIs were of lower importance. As observed in the first growth, nPRI and nNPQI were rarely selected by random forests.
Using nREP, LCI, NDVI, nWI and nWC, the importance of the VIs under changing T∑s was investigated. In Growth 1, the results of variable importance widely confirmed our findings made using the Welch test. It was shown that the importance of water-related VIs was relatively stable at times when biomass- and chlorophyll-related VIs saturated due to high LAIs of the grass canopies (cf. [52
]). NDVI was of low importance throughout the entire first growth, which confirms its low classification accuracy identified using the Welch test. Throughout the entire second growth, the chlorophyll-related VIs nREP and LCI were relatively stable in their importance. However, nWI and nWC added valuable information especially at T∑s between 400 and 1000 °C d, indicating that these VIs are particularly important when LAIs in the canopies are high and obscure differences in biomass or chlorophyll content. Again, NDVI was of minor importance.
This study presents an approach for classifying grassland communities along a gradient from intensive to extensive fertilizer management in a two-cut-system using a set of fifteen remotely-sensed vegetation indices (VIs). It was shown that VIs are useful to identify differences between the grassland communities at different times throughout a growing season. Each VI featured characteristic fluctuations over time in the individual plots. It is well documented for this particular experiment that plots differ significantly in their biophysical properties, such as biomass production. Hence, we conclude that the VIs that we tested were sensitive to these properties.
The different time courses of VI development have far-reaching consequences for the use and interpretation of VIs when classifying grassland. None of the VIs was able to feature constantly high classification accuracies throughout the entire growing season. However, while classification accuracies of VIs sensitive to one biophysical variable decreased, the accuracies of other VIs remained more stable allowing higher classification accuracies. Biomass- or chlorophyll-related VIs, such as nREP, NDVI or LCI, provided good results when plants had further developed in the intensively-managed plots than in the extensively-managed plots. However, at later stages when the extensively-managed plots caught up in development, water content-related VIs, such as nWI or nWC, were the better alternative for a classification because biomass-related VIs were affected by saturation effects. Further, classification in the second growth was more difficult than in the first growth, due to less distinct differences in biophysical characteristics in between the plant communities. Thereby, the classification performed particularly weak when slow-growing canopies had fully developed and fast-growing canopies had not yet entered senescence. At this stage, nWI and nWC did not reach the high classification accuracies they reached in the Growth 1, so that none of the VIs separated the communities reliably.
This problem was solved by applying all fifteen VIs in the random forests algorithm. A time series of random forest models using the best VIs for classifying the plant communities was created.
As spectral information in this procedure was used more effectively, classification accuracies increased in both growths, but most considerably in the second growth. Furthermore, our multitemporal analysis has shown that classification accuracies using this approach remained relatively stable throughout the entire first growth. Although classification rates in the second varied to some degree, significant improvements were made compared to the utilization of single VIs.
These results suggest that the selection of an appropriate VI (depending on the plant development) is essential for classifying grasslands using single VIs. However, an alternative using random forests was promising, because it yielded a more robust grassland classification.
The utilization of this multi-VI approach for grassland mapping at larger spatial extent could improve the separation of designated plant communities with respect to their floristic composition and plant properties. Future research should be guided towards testing random forests classifiers using VIs for grassland mapping with aerial or satellite imagery for other grassland communities featuring higher or lower temporal variability in biomass production and spectral reflectance. Results of such studies may significantly improve existing monitoring techniques (e.g., using single VIs), improve grassland mapping and contribute to a more sustainable management of grassland ecosystems.