Next Article in Journal
Landsat-8 Operational Land Imager Radiometric Calibration and Stability
Previous Article in Journal
Satellite Observations of Wind Farm Impacts on Nocturnal Land Surface Temperature in Iowa
Open AccessArticle

Extraction of Plant Physiological Status from Hyperspectral Signatures Using Machine Learning Methods

Helmholtz Centre for Environmental Research-UFZ, Computational Landscape Ecology, Permoserstraße 15, 04318 Leipzig, Germany
German Research Center for Geosciences, Section Remote Sensing, 14473 Potsdam, Germany
Department of Applied Environmental Science (ITM) and the Bert Bolin Centre for Climate Research, Stockholm University, SE-106 91 Stockholm, Sweden
Department Biogeochemical Integration, Max Planck Institute for Biogeochemistry, Hans-Knöll-Str. 10, 07745 Jena, Germany
Author to whom correspondence should be addressed.
Remote Sens. 2014, 6(12), 12247-12274;
Received: 17 July 2014 / Revised: 25 November 2014 / Accepted: 27 November 2014 / Published: 8 December 2014
The machine learning method, random forest (RF), is applied in order to derive biophysical and structural vegetation parameters from hyperspectral signatures. Hyperspectral data are, among other things, characterized by their high dimensionality and autocorrelation. Common multivariate regression approaches, which usually include only a limited number of spectral indices as predictors, do not make full use of the available information. In contrast, machine learning methods, such as RF, are supposed to be better suited to extract information on vegetation status. First, vegetation parameters are extracted from hyperspectral signatures simulated with the radiative transfer model, PROSAIL. Second, the transferability of these results with respect to laboratory and field measurements is investigated. In situ observations of plant physiological parameters and corresponding spectra are gathered in the laboratory for summer barley (Hordeum vulgare). Field in situ measurements focus on winter crops over several growing seasons. Chlorophyll content, Leaf Area Index and phenological growth stages are derived from simulated and measured spectra. RF performs very robustly and with a very high accuracy on PROSAIL simulated data. Furthermore, it is almost unaffected by introduced noise and bias in the data. When applied to laboratory data, the prediction accuracy is still good (C\(_{ab}\): \(R^2\) = 0.94/ LAI: \(R^2\) = 0.80/BBCH (Growth stages of mono-and dicotyledonous plants) : \(R^2\) = 0.91), but not as high as for simulated spectra. Transferability to field measurements is given with prediction levels as high as for laboratory data (C\(_{ab}\): \(R^2\) = 0.89/LAI: \(R^2\) = 0.89/BBCH: \(R^2\) = \(\sim\)0.8). Wavelengths for deriving plant physiological status based on simulated and measured hyperspectral signatures are mostly selected from appropriate spectral regions (both field and laboratory): 700–800 nm regressing on C\(_{ab}\) and 800–1300 nm regressing on LAI. Results suggest that the prediction accuracy of vegetation parameters using RF is not hampered by the high dimensionality of hyperspectral signatures (given preceding feature reduction). Wavelengths selected as important for prediction might, however, vary between underlying datasets. The introduction of changing environmental factors (soil, illumination conditions) has some detrimental effect, but more important factors seem to stem from measurement uncertainties and plant geometries. View Full-Text
Keywords: hyperspectral data; vegetation status; random forest; PROSAIL; crop hyperspectral data; vegetation status; random forest; PROSAIL; crop
Show Figures

Figure 1

MDPI and ACS Style

Doktor, D.; Lausch, A.; Spengler, D.; Thurner, M. Extraction of Plant Physiological Status from Hyperspectral Signatures Using Machine Learning Methods. Remote Sens. 2014, 6, 12247-12274.

Show more citation formats Show less citations formats

Article Access Map by Country/Region

Only visits after 24 November 2015 are recorded.
Back to TopTop