Estimations of Nitrogen Concentration in Sugarcane Using Hyperspectral Imagery

This study aims to estimate the spatial variation of sugarcane Canopy Nitrogen Concentration (CNC) using spectral data, which were measured from a spaceborne hyperspectral image. Stepwise Multiple Linear Regression (SMLR) and Support Vector Regression (SVR) were applied to calibrate and validate the CNC estimation models. The raw spectral reflectance was transformed into a First-Derivative Spectrum (FDS) and absorption features to remove the spectral noise and finally used as input variables. The results indicate that the estimation models developed by non-linear SVR based Radial Basis Function (RBF) kernel yield the higher correlation coefficient with CNC compared with the models computed by SMLR. The best model shows the coefficient of determination value of 0.78 and Root Mean Square Error (RMSE) value of 0.035% nitrogen. The narrow sensitive spectral wavelengths for quantifying nitrogen content in the combined cultivar environments existed mainly in the electromagnetic spectrum of the visible-red, longer portion of red edge, shortwave infrared regions and far-near infrared. The most important conclusion from this experiment is that spectral signals from the space hyperspectral data contain the meaningful information for quantifying sugarcane CNC across larger geographic areas. The nutrient deficient areas could be corrected by applying suitable farm management.


Introduction
Sugarcane (Saccharum spp. hybrid) is a tall-growing perennial crop widely cultivated in the tropical and subtropical regions of the world [1]. In Thailand, sugarcane is mostly used in the sugar and ethanol energy industries. The precise estimation of sugarcane productivity allows the establishment of deliberate policies to balance the amount of sugarcane within sugar and ethanol energy. Nitrogen is considered a key indicator of the physiological susceptibility of water availability, pests, disease and crop nutrient stress, which could potentially affect crop productivity [2]. In addition, nitrogen is one of the primary regulators of several leaf physiological processes, such as photosynthesis, respiration, and transpiration [3]. Insufficient nitrogen application leads to lower yields, lower sugarcane quality, and an increase of plant vulnerability to certain pests [4]. An increased susceptibility to pests or leaf blades turns a sugarcane leaf from light green to yellow, short, slender stalks and low productivity. To measure the nitrogen concentration using the conventional method, a great number of leaf samples from fields are needed. The process of leaf samples is laborious, lengthy and destructive [4]. Thus, the conventional method is not appropriate for large areas.
The development of hyperspectral remote sensing or imaging spectroscopy has offered possibilities for estimating and mapping the spatial distribution of biophysical and biochemical detection of nutrient deficiencies and precision nutrient management in sugarcane production before harvesting. The main objective of this study is to estimate the spatial variation of sugarcane Canopy Nitrogen Concentration (CNC) from an EO-1 Hyperion image in combined cultivar environments. The capability of SMLR-and SVR-based approaches for estimating CNC were compared. We also validated the performance of a transformed spectral dataset; First-Derivative Spectrum (FDS) and absorption features to correlate with nitrogen content. This study is an extension of a field spectra study, which showed the potential of FDS and SMLR techniques for predicting in situ sugarcane nitrogen at the canopy level [12]. The integration of the proposed methods with hyperspectral variables might be applicable to reliably mapping sugarcane nitrogen content in cropland.

Study Area
The study area is located in Sakaeo Province, in the eastern region of Thailand (102°15′ E, 13°45′ N) as shown in Figure 1. The northern region of the study province is covered with the forested mountains in the Sankamphaeng Range and the Dangrek Mountains. The mean temperature ranges from 27.5 °C to 28.78 °C. The annual rainfall ranges from 1296 to 1539 mm. Sugarcane is the most dominant crop in this area with approximately ten different cultivars. Farmers' sugarcane plots in Wattananakorn, Wang Nam Yen and Klong Hard district, Sakaeo province are usually under the traditional treat conditions. The total study areas are of 40,230 ha. However, there are only four popular sugarcane cultivars; K84-200, LK-92-11, KK-3, and UT-8.

Research Methodology Overview
This research aims to compare the efficiency of two approaches of multivariate based statistical methods with SVR. Two approaches were applied to the estimation of CNC in sugarcane at the canopy level with combined cultivar environments. Spectral reflectance was measured from a hyperspectral satellite. The research methodologies are described below. The northern region of the study province is covered with the forested mountains in the Sankamphaeng Range and the Dangrek Mountains. The mean temperature ranges from 27.5 • C to 28.78 • C. The annual rainfall ranges from 1296 to 1539 mm. Sugarcane is the most dominant crop in this area with approximately ten different cultivars. Farmers' sugarcane plots in Wattananakorn, Wang Nam Yen and Klong Hard district, Sakaeo province are usually under the traditional treat conditions. The total study areas are of 40,230 ha. However, there are only four popular sugarcane cultivars; K84-200, LK-92-11, KK-3, and UT-8.

Research Methodology Overview
This research aims to compare the efficiency of two approaches of multivariate based statistical methods with SVR. Two approaches were applied to the estimation of CNC in sugarcane at the canopy level with combined cultivar environments. Spectral reflectance was measured from a hyperspectral satellite. The research methodologies are described below.

Data Acquisition
The single date EO-1 Hyperion of 128/50 path/row was acquired on 29 January 2012 at the peak of the regional winter season and before the sugarcane harvest. Two hundred and forty-two continuous bands, 7.5 km in width and 100 km in length, covering 400-2500 nm, were recorded. The scene center was at 13.67 • N and 102.26 • E, look angle was 3.59 degrees (almost nadir position) and there was no cloud cover.

Field Data Collection
The collections of sugarcane foliar samples were carried out on the mature sugarcane plots (9-10 months). The canopies were fully developed to avoid the effects of soil brightness. Field data collection was conducted from 4-6 February 2012, within one week of image acquisition. The image pixel size is 30 m ground resolution. Therefore, the sampling site was large enough to ensure the existence of pure pixels in the image. Half of the pixel space was extended in all directions from the considered pixel. Thus, ground measurements with the size of 60 m × 60 m (3600 m 2 ) were randomly selected, concerning the sugarcane cultivars. A total of 70 samples were initially determined. The center coordinates of each plot were recorded. Representative samples from each plot were collected randomly by measuring sugarcane leaves. The second fully expanded leaf from the top of two random selected shoots of one tiller was collected. Sixteen random tillers from 16 subplots (15 m × 15 m) were measured. Thirty-two sampled leaves from one plot were mixed and used for analysis of the nitrogen content in the laboratory. Figure 2 shows the sampling design of field data collections.

Data Acquisition
The single date EO-1 Hyperion of 128/50 path/row was acquired on 29 January 2012 at the peak of the regional winter season and before the sugarcane harvest. Two hundred and forty-two continuous bands, 7.5 km in width and 100 km in length, covering 400-2500 nm, were recorded. The scene center was at 13.67° N and 102.26° E, look angle was 3.59 degrees (almost nadir position) and there was no cloud cover.

Field Data Collection
The collections of sugarcane foliar samples were carried out on the mature sugarcane plots (9-10 months). The canopies were fully developed to avoid the effects of soil brightness. Field data collection was conducted from 4-6 February 2012, within one week of image acquisition. The image pixel size is 30 m ground resolution. Therefore, the sampling site was large enough to ensure the existence of pure pixels in the image. Half of the pixel space was extended in all directions from the considered pixel. Thus, ground measurements with the size of 60 m × 60 m (3600 m 2 ) were randomly selected, concerning the sugarcane cultivars. A total of 70 samples were initially determined. The center coordinates of each plot were recorded. Representative samples from each plot were collected randomly by measuring sugarcane leaves. The second fully expanded leaf from the top of two random selected shoots of one tiller was collected. Sixteen random tillers from 16 subplots (15 m ×15 m) were measured. Thirty-two sampled leaves from one plot were mixed and used for analysis of the nitrogen content in the laboratory. Figure 2 shows the sampling design of field data collections.

Chemical Analysis of Foliar Samples
The second fully expanded leaves from the top of two random shoots of one tiller were collected. Thirty-two fresh leaves from one sampling site were oven-dried at 75 °C for 24 h and then ground up and oven-dried again at 75 °C for 24 h, resulting in leaf powder. The total nitrogen concentration of sugarcane foliar was measured using the Kjeldahl method. The leaf powder was digested by 98% sulfuric acid (w/w) at 380 °C until the solution was transparent. Nitrogen was measured with a Nitrogen Distillation Apparatus using KjeltecTM 2200 Auto distillation and would show the amount of nitrogen concentration in both milligrams per gram (mg/g) and a percentage of nitrogen.

Chemical Analysis of Foliar Samples
The second fully expanded leaves from the top of two random shoots of one tiller were collected. Thirty-two fresh leaves from one sampling site were oven-dried at 75 • C for 24 h and then ground up and oven-dried again at 75 • C for 24 h, resulting in leaf powder. The total nitrogen concentration of sugarcane foliar was measured using the Kjeldahl method. The leaf powder was digested by 98% sulfuric acid (w/w) at 380 • C until the solution was transparent. Nitrogen was measured with a Nitrogen Distillation Apparatus using KjeltecTM 2200 Auto distillation and would show the amount of nitrogen concentration in both milligrams per gram (mg/g) and a percentage of nitrogen.

Geometric Correction
Geometric corrections without the use of ground control points were performed in image level 1GST by USGS. Geo-TIFF is a raster image format that enables referencing by geodetic model or map projection. However, to achieve higher accuracy, the image must be re-rectified by using Ground Control Point (GCP) collected from the field. At least ten clearly visible GCPs were utilized to rectify the image using the first-order polynomial transformation. The RMSE for image rectification is less than 10 m overall.

Radiance Transformation
The digital number (DN) value was rescaled to get the radiance values using the Hyperion calibration coefficients. There are two different rescaling factors used for the VNIR and SWIR region of electromagnetic spectrum [35]. Radiance (L) in the unit of W/m 2 /µm/sr was computed as L = DN/40 and L = DN/80 for visible-near infrared (VNIR) and shortwave infrared (SWIR) bands, respectively.

Atmospheric Correction
Due to the effects of aerosol, water content, gases, digital number in remotely sensed data was considered as the reflectance at Top of Atmospheric (TOA). There are several packages for atmospheric correction widely used in hyperspectral imagery to depict the surface reflectance without atmospheric effects. Subsequently, the Radiance image was atmospherically corrected using the Fast Line-of-Sight Atmospheric Analysis of Spectral Hypercubes (FLAASH) algorithm built in ENVI. FLAASH uses the MODTRAN4 radiation transfer code to convert radiance at the top of the atmosphere to reflectance at the surface, pixel by pixel.

Band Noise Reduction
The Hyperion sensor consists of the continuous 242 bands covering 70 bands in the VNIR and 172 bands in the SWIR region of the electromagnetic spectrum. However, the analysis of Hyperion raw data illustrated that there were 44 bands that have a zero value. There are bands 1 to 7 (355.58-406.47 nm), bands 58 to 76 (902.36-935.58 nm) and bands 225 to 242 (2405.6-2577.07 nm). Thus, out of a total of 242 bands, 198 bands were selected to be calibrated and for further analysis. In addition, it was found that bands 77 and 78 in the SWIR region both have a higher level of spectral noise [36]. Therefore, there are only 196 bands that remain to be processed. Subsequently, atmospheric water vapor bands that absorbed almost all incidents and reflected solar radiation were easily identified by visual inspection of the image data and by atmospheric modeling [37]. Accepting this as a good criterion for band elimination for land surface applications produce the subsets of 176 bands listed in Table 1.

First-Derivative Spectrum: FDS
In this study, FDS was applied and used as a variable for implementing the estimation models. FDS is widely used to enhance absorption features that might be masked by interfering canopy background absorptions [37]. Derivative spectra can be a good noise filter since changes in the baseline have a negligible effect on derivatives. This technique is of benefit for reducing the effects of radiation scattering due to geometry and surface roughness [38]. FDS was calculated using Equation (1): where FDS is the first-derivative transformation at a specific wavelength i midpoint between wavebands j and j + 1. R λ(j) is the reflectance at the j waveband, R λ(j+1) is the reflectance at the j + 1 waveband and ∆λ is the difference in wavelength between j and j + 1.

Absorption Features
The features in the visible, near-infrared and shortwave-infrared regions are widely utilized to elaborate the amount of nitrogen concentration [21]. The continuum-removed reflectance, or R (λi) , is derived by dividing the reflectance value R (λi) of each waveband i in the absorption features by the reflectance level of the continuum line (convex hull) R c(λi) at the corresponding wavelength i, explained in Equation (2): The first and last values of the continuum-removed spectrum are equal to 1. The FDS output ranges between 0 and 1, where the absorption pits are enhanced. In this study, four specific regions of the wavelength were focused, including R 426 -R 528 , R 538 -R 752 , R 1104 -R 1285 , and R 1477 -R 1790 , which are determined as the pigment and water content absorption zones. Two variables, examined in References [22,24] were utilized as the input variables in this study.
(i) Continuum-Removed Derivative Reflectance (CRDR) was derived using a first-derivative transformation to the continuum-removed reflectance spectrum R as described in Equation (3).
(ii) Band depth (BD) was derived by omitting the continuum-removed reflectance at specific wavelength i from 1 shown as in Equation (4)

Feature Dimensionality Reduction
Kernel Principal Component Analysis (KPCA) is an extension of principal component analysis (PCA) based on kernel methods. KPCA allows users to compute the principal components in a high dimensional feature space. This method is non-linearly related to the input variables.
Given a set of centered input vectors x t = (t = 1,..l), and ∑ l t=1 x t = 0 each of which is of m dimension x t = (x t (1), x t (2), . . . x t (m)) T (usually m < 1), the key idea of KPCA is to map the original input vector x t into a higher dimensional feature space φ(x t ) and to calculate the linear PCA in φ(x t ). Dimension is assumed to be larger than l when mapping x t into φ(x t ). The eigenvalue problem was solved by KPCA and illustrated in Equation (5). where , λ i is the non-zero eigenvalue, and µ i is the corresponding eigenvector. Then, KPCA was applied to the training dataset. The extracted non-linear components were projected to the datasets to produce the training and testing data for SVR. There are two well-known types of kernels-Polynomial and Gaussian. The Gaussian equation is shown in Equation (6).
KPCA was applied to reduce the 176 reflectance wavelengths of the sugarcane canopy. The KPCA algorithm was run using Matlab environments developed in Reference [39]. The Gaussian function is selected as the kernel function of KPCA because the Gaussian kernel tends to give a good performance under general smoothness assumptions [40].

Mapping of Sugarcane CNC Using Multivariate Based Statistical Methods
Stepwise Multiple Linear Regression (SMLR) was applied to estimate the CNC in sugarcane from the pixel reflectance spectra and its transformations. The statistical package software SPSS was used for creating the regression model. Estimation models were calibrated using three different datasets-FDS, CRDR and BD. SMLR started running the regression equation with no variable. At each step, the most statistically significant wavelength with the highest or lowest p-value was added to the equation [41]. Wavelength variables were sorted. The variable was retained when the variables were added. However, all other wavelengths in the proposed model were re-tested to check if they were still contributing to the capability of the model.
In the case of a wavelength that no longer contributed to the model significantly, the selected variables were removed. In this study, p-values to enter and remove wavelengths were set at 0.01 and 0.02, respectively. The maximum number of selected wavelengths was limited to five to avoid an over fitting problem. The multicolinearity of variables was validated using a variance inflation factor (VIF) that must be lower than three [17].

Mapping of Sugarcane CNC Using SVR Based Machine Learning Approach
In the process of SVR training, the users must select an appropriate kernel and a set of parameter values. The kernel function transforms the non-linear data boundaries in an original data space into linear ones in the higher dimensional feature space. In this study, linear and Radial Basis Function (RBF) were used as the tested kernel function. The optimal values of free SVR parameters were determined based on the Leave One Out (LOO) cross validation, including a penalty factor (C), the epsilon value of Vapnik's insensitive loss function (ε) and the RBF kernel parameter (γ).
In addition, the best value for the kernel parameter was chosen based on LOO cross validation. The number of used PCs was increased one by one, related to the eigenvalue sorted. The raw data in terms of x, y and z values were transformed into the higher dimensional dot product feature space by the function of Φ. This transformation is done by using a kernel function. In this experiment, a linear kernel (K(x, x i ) = (x x x i )) and an RBF kernel (K(x, x i ) = exp(−γ ||x-x i ||2), γ > 0) were used. This step was run using Matlab software (MathWorks, Natick, MA, United States).

Model Validation
The accuracy of the estimated models was summarized and shown in terms of the coefficient of determination (R 2 ) and the Root Mean Square Error of the Estimate (RMSE) as illustrated in Equation (9) [42]. The Leave One Out (LOO) method was adopted to validate the proposed estimation models (n = 70).
whereŷ i and y i are the estimated and measured crop variables, respectively, and n is the number of samples.  Table 2 shows the variation of sugarcane foliar nitrogen concentrations, which were collected from the field plots. The Kolmogorov-Smirnov method was applied to test the distribution normality of sugarcane nitrogen concentration at the 95% confidence level (p = 0.05). The null hypothesis tested was that the data follow a normal distribution (Ho: p > 0.05) versus the alternative hypothesis (Ha: p < 0.05) that the data did not follow a normal distribution. The results showed that the data used in this experiment were normally distributed (p > 0.05). However, the ranges of nitrogen content in the leaf samples are quite small. This is mainly because sugarcane plots in the study areas are in the mature stage (9-10 months). Most of the sugarcane plots of less than 3 ha in size were cultivated in the rainfed zone. In addition, the local farmers usually supply a low amount of N-fertilizer to their plots due to the limitation of costs. Thus, mean sugarcane productivity in Thailand is quite low when compared to the productivity in Brazil or South Africa.

Estimation of Sugarcane CNC from Hyperion Satellite Image using a Multiple Linear Regression Approach
The number of spectral wavelengths was selected to estimate the CNC using a SMLR ranging from two to four. The results demonstrated that the best model yielded the correlation coefficient with nitrogen content values of 0.74 and RMSE values of 0.038% nitrogen. The sensitive wavelengths are listed in Table 3. The proposed models were validated using the LOO method. The regression equation is Y = 10.75x 1 − 17.54x 2 − 11.75x 3 + 7.366x 4 + 0.69. The scatter plots in Figure 3 show the relationships between the measured and estimated CNC in each sugarcane cultivar. Sugarcane cultivars influence the measured CNC. The percentage of measured canopy nitrogen of KK-3 is higher than that of LK92-11, UT-8 and K84-200, respectively. The spatial distribution of the sugarcane CNC, which was estimated by the SMLR data, are depicted in Figure 4. The CRDR is the best estimator developed by SMLR for explaining the variations of sugarcane CNC. The regression equation is Y = 10.75x 1 − 17.54x 2 − 11.75x 3 + 7.366x 4 + 0.69.

Estimation of Sugarcane CNC from Hyperion Satellite Image Using Support Vector Regression
The high dimensionality of the orbital Hyperion image was reduced using the KPCA method. The Gaussian kernel with δ value of 1 was applied to the FDS, CRDR and BD data sets, respectively. With the spectral data of 176 bands, 76.84 percentage of the variability of FDS was explained by the first 20 Principal Components (PCs). In the case of BD, the first 10 PCs contributed more than 90 percent of the 68 absorption features. It can be concluded that an eigenvalue of the first PCs computed from the BD data set is rather higher than those computed from the CRDR and FDS datasets, respectively. Table 4 summarizes the best estimation models developed from the FDS, CRDR and BD datasets with their optimal number of PCs and SVR parameters. The result shows that an estimation model generated from BD yields a higher accuracy than those computed from CRDR and FDS. Eleven components of BD-KPCA could produce the robust estimation model with the R 2 values of 0.78 and RMSE values of 0.035%, validated with the LOO method. Eleven components of BD-PCA contained 92.12 percent of the spectral information.  Figure 5 depicts the scatter plots of measured CNC versus estimated CNC in each sugarcane cultivar developed by the SVR method for model calibration and KPCA for reducing the feature dimensions. The optimal estimation model based on the SVR-RBF method was applied to map the spatial distribution of CNC in the cropland illustrated in Figure 6.

Estimation of Sugarcane CNC from Hyperion Satellite Image Using Support Vector Regression
The high dimensionality of the orbital Hyperion image was reduced using the KPCA method. The Gaussian kernel with δ value of 1 was applied to the FDS, CRDR and BD data sets, respectively. With the spectral data of 176 bands, 76.84 percentage of the variability of FDS was explained by the first 20 Principal Components (PCs). In the case of BD, the first 10 PCs contributed more than 90 percent of the 68 absorption features. It can be concluded that an eigenvalue of the first PCs computed from the BD data set is rather higher than those computed from the CRDR and FDS datasets, respectively. Table 4 summarizes the best estimation models developed from the FDS, CRDR and BD datasets with their optimal number of PCs and SVR parameters. The result shows that an estimation model generated from BD yields a higher accuracy than those computed from CRDR and FDS. Eleven components of BD-KPCA could produce the robust estimation model with the R 2 values of 0.78 and RMSE values of 0.035%, validated with the LOO method. Eleven components of BD-PCA contained 92.12 percent of the spectral information. C, ε is the common parameter for SVR and γ is the parameter for RBF kernel. Figure 5 depicts the scatter plots of measured CNC versus estimated CNC in each sugarcane cultivar developed by the SVR method for model calibration and KPCA for reducing the feature dimensions. The optimal estimation model based on the SVR-RBF method was applied to map the spatial distribution of CNC in the cropland illustrated in Figure 6.

Discussion
In this experiment, three predictive methods were applied to develop estimation models for sugarcane CNC in the mature stage (11-12 months). The transformed spectral information derived from spaceborne hyperspectral data was employed. The used technical methods for estimating CN consist of SMLR and SVR. The optimal model generated by the SVR-RBF kernel method using the BD dataset as the independent variable yields the highest coefficient of determination value of 0.78 and the lowest RMSE values of 0.035% nitrogen. Our study proves that spectral signals derived from hyperspectral data can possibly be applied to map the sugarcane nitrogen concentration in cropland with the combined cultivar environment. With the proposed SVR predictive model, it is practical to monitor the nutrition status of sugarcane over large areas with the spectral signature and a nondestructive method.

Discussion
In this experiment, three predictive methods were applied to develop estimation models for sugarcane CNC in the mature stage (11-12 months). The transformed spectral information derived from spaceborne hyperspectral data was employed. The used technical methods for estimating CN consist of SMLR and SVR. The optimal model generated by the SVR-RBF kernel method using the BD dataset as the independent variable yields the highest coefficient of determination value of 0.78 and the lowest RMSE values of 0.035% nitrogen. Our study proves that spectral signals derived from hyperspectral data can possibly be applied to map the sugarcane nitrogen concentration in cropland with the combined cultivar environment. With the proposed SVR predictive model, it is practical to monitor the nutrition status of sugarcane over large areas with the spectral signature and a nondestructive method.

Discussion
In this experiment, three predictive methods were applied to develop estimation models for sugarcane CNC in the mature stage (11-12 months). The transformed spectral information derived from spaceborne hyperspectral data was employed. The used technical methods for estimating CN consist of SMLR and SVR. The optimal model generated by the SVR-RBF kernel method using the BD dataset as the independent variable yields the highest coefficient of determination value of 0.78 and the lowest RMSE values of 0.035% nitrogen. Our study proves that spectral signals derived from hyperspectral data can possibly be applied to map the sugarcane nitrogen concentration in cropland with the combined cultivar environment. With the proposed SVR predictive model, it is practical to monitor the nutrition status of sugarcane over large areas with the spectral signature and a nondestructive method.
The results indicate that the estimation models developed by SVR based on a non-linear kernel (RBF) yields a higher correlation coefficient with sugarcane CNC compared to the other models. The predictive model developed by the SVR method possibly improves by approximately 4% of the CNC predictive accuracy when compared with the SMLR model. This is because the information about the relationship between dependent and independent variables is not linear. In addition, the noise redundancy of the spectral data, which were observed from the space was removed before calibrating the model using the KPCA. Subsequently, the raw reflectance was transformed into FDS, CRDR and BD. Such techniques were performed to enhance the absorption regions, which were usually affected by canopy backgrounds [37]. It could be concluded from the study that BD data is the best estimator for the estimation model generated by SVR. BD was calculated from continuum-removed spectra centered in the regions of visible, red-edge and shortwave infrared. Such spectral regions are widely known as the pigment and water content absorption zones. BD is meaningful data, particularly in the case of limited reflectance value due to the saturation problem, especially in the mature stages of sugarcane. This conclusion is the same as those of previous studies [12,43,44].
The results of the study indicate that there is variability of the sugarcane nitrogen content between the eastern and western sites of the study area. This is mainly due to the structure of the sugarcane canopy and water stress. In fact, soil type, topography, planting date and fertilizer practices in the study area are not different. Sugarcane planting cultivars in the western site are mostly KK3 and LK92-11 and in the eastern site are mostly K84-200 and UT-8. At the edge of the eastern region is a rainfed agriculture that relies on natural rainfall. Structures of sugarcane canopy are explained by leaf orientation, which has a significant bearing on its spectral signature. The canopy structure of existing cultivars in the study areas could possibly be classified into planophile (KK-3 and LK92-11) and erectophile structure (K84-200 and UT-8) [10]. In review, sugarcane leaves with a planophile structure contains a higher nitrogen concentration than leaves with an erectophile structure [12]. This is consistent with the findings from the different crop types, such as grassland, winter wheat, cotton, paddy rice [43]. The planophile structure (30 degrees) contributes to significantly greater reflectance in the near infrared and to a greater absorption in the red due to the larger leaf areas facing towards the sun light compared with the erectophile structure (65 degree) [10,45]. The small portions of the leaf area and water stress, due to the variation in rainfall, results in the low rate of photosynthesis, chlorophyll and nitrogen content.
The narrow sensitive spectral wavelengths extracted from hyperspectral image were utilized to estimate sugarcane CNC. The selected wavelengths are centered in the visible-red, longer portion of red edge, far-near infrared and shortwave infrared regions of the electromagnetic spectrum. The red-edge region, which centered at 680-780 nm, has been indicated to be more sensitive to water content, chlorophyll and nitrogen [46]. This is mainly due to the low reflectance in the red region because of chlorophyll absorption and high reflectance in the near-infrared region.
The two most frequently occurring hyperspectral bands in calibrating the estimation model were centered at 660 nm and 721 nm. The narrow band centered at 660 nm is known to be more sensitive to changes in chlorophyll content [5]. The changes in biomass, genetic cultivar, canopy structure, moisture, and nitrogen concentration affect the absorption in the red visible region [47]. The maximum change in the slope of the reflectance spectra on the red-edge region usually occurs at 720 nm [48]. It could be concluded that the variation in crop growth, vegetation stress, leaf area index, chlorophyll and nitrogen status in plants could be detectable by wavelength 720 nm [49]. The wavelengths, centered in the far-near infrared and shortwave infrared are sensitive to the plant canopy moisture. The centered wavelength at 1205 nm and 1265 nm, which related to the leaf nitrogen concentration in sugarcane at the field level [12], yields a high correlation with the nitrogen content.
Most of the previous research attempts to explain the spatial variation of nitrogen content in paddy rice, wheat, cotton and natural grass using the spectral signal both at the foliage and the canopy [28], Sustainability 2018, 10, 1266 13 of 16 but a little in sugarcane. There are a few experiments that were conducted, both at a leaf level [4,12] and at a space level [50] by applying the Random Forest (RF) algorithm used for feature selection and regression in hyperspectral data necessary for predicting sugarcane leaf nitrogen concentration. The results show that sugarcane leaf N concentration can be predicted using non-linear RF regression (R 2 = 0.67) with root mean square error of validation 0.15% derived from the first-order derivative of reflectance.
The results of the study indicate the high potential of SVR for estimating sugarcane nitrogen concentration from hyperspectral data in the combined cultivar environment. The efficiency of the proposed estimation models is more stable and reliable than those in previous studies. With the success in mapping nitrogen content across larger geographic areas from spaceborne hyperspectral data, it allows cane growers to monitor the nutritional status of the sugarcane earlier so that the nutrient deficient areas could be corrected by applying suitable management. However, the proposed method for nitrogen estimation needs to be validated at other sites. This study was conducted at one site representing the sugarcane plots. It remains to be seen whether sugarcane in different physical and climatic environmental conditions affects the spectral response and nitrogen content. To combine hyperspectral data for a precise agriculture concept requires the availability of spatially and temporally continuous imaging spectroscopy. In Precision agriculture studies from space images, date differences between image acquisition and field surveys results in some estimated errors for estimating sugarcane nitrogen.

Conclusions
The main objective of this research was to estimate the sugarcane Canopy Nitrogen Concentration (CNC) from the spaceborne hyperspectral data. The capabilities of two well-known methods were evaluated and applied to describe the spatial variations of CNC. The methods used to estimate nitrogen concentration consist of Stepwise Multiple Linear Regression (SMLR) and Support Vector Regression (SVR). Derivative spectra and transformed absorption features were calculated and used as an independent variable in the estimation models. The most important conclusion from this experiment is that spectral signals from the spaceborne hyperspectral data contain meaningful information for quantifying sugarcane CNC across larger geographic areas. The nutrient deficient areas could be corrected in time by applying suitable farm management. This is a benefit that can reduce intensive field visiting.
The variability of the sugarcane canopy nitrogen content in the study area is mainly due to canopy structure and water availability. Our results showed that sugarcane CNC can be accurately estimated by the proposed method. The best estimation model for sugarcane CNC was developed by non-linear transformed Radial Basis Function (RBF) kernel based SVR using Principal Components (PCs) of Band Depth data (BD). Such a model could explain 78% of CNC variation in cropland with a Root Mean Square Error (RMSE) value of 0.035% nitrogen. The SVR method has a higher performance compared to the SMLR method for estimating the sugarcane CNC at the mature stage. One of the research targets is to compare the performance of BD, First Derivative Spectrum (FDS) and Continuum-Removed Derivative Reflectance (CRDR) to enhance the absorption regions of the raw spectral data. The results of the study indicate that BD is the best independent variable for calibrating and validating the estimation model compared to FDS and CRDR. The high potential of hyperspectral wavelengths that correlate with the concentration of nitrogen at the sugarcane canopies are centered in the visible-red, longer portion of red edge, far-near infrared and shortwave infrared regions of the electromagnetic spectrum. The proposed estimation model was developed under the combined sugarcane cultivars and measured at the mature stage. The narrow wavelengths selected for this experiment are closely related to the changes in chlorophyll, nitrogen, biomass, and canopy moisture. The upcoming developments in hyperspectral sensors and of estimation methods allow us to monitor nutrient deficient areas, water stress and to predict crop production.