Evaluating the Applications of the Near-Infrared Region in Mapping Foliar N in the Miombo Woodlands

: Remote sensing has been widely used to estimate the distribution of foliar nitrogen (N) in a cost-effective manner. Although hyperspectral remote sensing targeting the red edge and shortwave infrared regions has proved successful at estimating foliar N, research has recently shifted to include exploring the beneﬁts of using the near-infrared (NIR) region, especially when using broadband sensing. Bootstrapped random forest regression analysis was applied on Sentinel 2 data to test the signiﬁcance of using the NIR in foliar N estimation in miombo woodlands. The results revealed a low ranking for individual NIR bands, but the ranking improved when spectral indices were used. In addition, the results indicated a marginal increase in the normalised root mean square error of prediction (nRMSE) from 11.35% N when all bands were used to 11.69% N when the NIR bands were excluded from the model. Bootstrapping results show higher accuracy and better consistency in the prediction of foliar N using combined spectral indices and individual bands. This study therefore underscores the signiﬁcance of spectral indices to increase the NIR region’s importance in estimating the distribution of foliar N as a key indicator of ecosystem health at the landscape scale in miombo systems.


Introduction
In natural ecosystems, such as the miombo woodlands, nitrogen (N) regulates several key processes, such as plant productivity [1,2], rates of litter decomposition, and dynamics of the terrestrial carbon cycle [3]. The remote estimation of foliar N, a key component of plant biochemistry, is therefore invaluable in the mapping and monitoring of ecosystem health. The major advantage of using remote sensing technology in estimating foliar N is cost-effectiveness, owing to its capability to give synoptic, repetitive, and objective observations of the earth's surface [4].
The remote estimation of foliar biochemicals is based on Beer's Law [5], which predicts how the amount of absorbed or reflected electromagnetic radiation is directly related to foliar chemical concentrations. However, unlike chemical extracts used in laboratory setups, pigments from field samples are integrally bound with leaf structure; therefore, reflectance is not a function of the chemicals alone [6]. Despite this challenge, some regions of the electromagnetic spectrum have emerged as important in the remote estimation of foliar biochemical constituents. For example, the red edge region has been shown to be highly correlated to foliar N at the field level [7][8][9][10][11][12][13]. These studies succeeded in Remote Sens. 2018, 10, 505 2 of 17 estimating foliar N by exploiting the strong relationship which exists between red edge reflectance and chlorophyll concentration, which holds a significant amount of leaf N. Despite the strong relationship between red edge reflectance and leaf N, the spatial extent of remote sensing methods of estimating N were limited by the low spatial coverage of the hyperspectral imagery [14]. Furthermore, the water absorption features in the shortwave infrared (SWIR) region were also found to mask the N absorption regions [15,16]. At the landscape scale, an increased absorption of radiation in the SWIR due to multi-layering of tree canopies further compounds the challenges of mapping foliar N. Thus, there is still a need to develop methods that can be used to estimate foliar N at the landscape level.
The application of broadband imagery has emerged as a promising method of estimating N at a landscape level [14,17]. Previous studies have successfully applied multispectral imagery in estimating foliar biochemicals using the near-infrared (NIR) region [18][19][20], including N [3,14,21]. These studies were mainly conducted in agro-ecosystems and temperate forests and had R 2 values averaging around 0.80. The remote estimation of foliar N using the NIR region is based on the spectral properties of total lignin, cellulose, and N [22]. Reflectance in the NIR region is also a function of biomass, leaf area index, and leaf thickness; properties which are all affected by leaf N [14]. The establishment of a functional relationship between NIR and foliar N is considered the missing link to the generalised estimation of NIR using broadband satellites [17,23]. A recent study has hinted at the existence of a relationship between vegetation albedo and canopy N, which creates new opportunities for the remote estimation of N using broadband imagery [20]. The authors in [24] proposed a generalised partial least squares (PLS) model that relates NIR reflectance to foliar N across different ecosystems. This model rides on the observed consistent performance of the NIR region in estimating foliar N across temperate, humid tropical, and boreal forests. However, the low spatial coverage of imaging spectrometers has limited the subsequent development and testing of such universal models [14]. Despite the recorded progress, some scepticism, for example [25], still exists in using broadband imagery for N estimation. They argued that the relationship between foliar N and NIR is spurious, and is a function of canopy configuration rather than N concentration. However, when fully developed, NIR methods of estimating N using broadband satellites have the potential of landscape level estimation of leaf N.
The miombo woodlands cover an estimated 2.7 million km 2 in central and southern Africa [26,27], and are one of the most extensive seasonally humid forest vegetation types in Africa. They are dominated by three genera of Julbernardia, Brachystegia, and Isoberlinia, which are seldom found outside the woodlands [26]. The miombo ecoregion, which is often confused with the woodlands, is made up of the miombo woodlands and other associated species, such as Vachellia spp., Colophospermum mopane, Baikiaea spp., and Burkea-Terminalia spp., which often form their own extensive stands [28]. The drier miombo woodlands (<1000 mm annual rainfall) cover most of Zimbabwe, Mozambique, and the southern parts of Malawi. The wetter miombo occurs in Angola, Tanzania, Zambia, and central Malawi, in areas receiving above 1000 mm of rainfall [26,28]. The miombo woodlands support a variety of large mammals in Africa, including 50% of the threatened species population of the African elephant (Loxodonta africana) and the rhinoceros (Rhinocerotidae spp.) in Africa [26]. The miombo woodlands have been receiving increasing biodiversity conservation attention [29] owing to their vastness, high endemicity levels, and unique habitat status [26,27,30]. To date, no study has tested the utility of the new generation multispectral sensors for estimating the distribution of foliar N at landscape scales in the miombo systems.
In this study, we test the utility of using the NIR region in mapping foliar N in the miombo woodlands of southern Africa. Using random forest regression, we used the recently launched Sentinel-2 satellite data to investigate the significance of the NIR region (a) as individual bands and (b) contained within spectral indices in the estimation of foliar N.

Study Area
The Kyle Game Reserve covers an area of approximately 44 km 2 . It was established as a recreational park in 1960 to conserve biological diversity. The game reserve is located between latitudes 20 • 04 and 20 • 14 South, and longitudes 31 • 07 and 30 • 50 East in southern Zimbabwe (Figure 1). It is situated about 56 km southeast of Masvingo city. The relief is dominated by the Beza mountain range and a plain that is dotted by a number of isolated hills, and stretches from the foot of the mountain range to the lake shore [31]. The altitude varies between 700 and 1485 m above sea level.
Remote Sens. 2018, 10, x FOR PEER REVIEW 3 of 17 The Kyle Game Reserve covers an area of approximately 44 km 2 . It was established as a recreational park in 1960 to conserve biological diversity. The game reserve is located between latitudes 20°04′ and 20°14′ South, and longitudes 31°07′ and 30°50′ East in southern Zimbabwe (Figure 1). It is situated about 56 km southeast of Masvingo city. The relief is dominated by the Beza mountain range and a plain that is dotted by a number of isolated hills, and stretches from the foot of the mountain range to the lake shore [31]. The altitude varies between 700 and 1485 m above sea level. The climate is subtropical savanna, with an annual average rainfall of 622 mm (Masvingo airport rainfall data: 1906-2015). Most of the rain falls during a single wet (growing) season that stretches from mid-November to end of March. The mean daily maximum temperature ranges from 21 °C in June to 29 °C in October. The mean daily minimum temperatures range from 5 °C in July to 17 °C in January [32]. The soils comprise a mixture of chromic-leptic lixisols and chromic luvisols that are mainly derived from granitic and gneiss parent materials [33]. The vegetation on the plain ranges from miombo woodlands, dominated by msasa (Brachystegia spiciformis) and mnondo (Julbernardia globiflora), to bushland thickets dominated by African weeping wattle (Peltophorum africanum), Silver terminalia (Terminalia sericea), Combretum spp., and sweet thorn (Vachellia karoo), and to extensive grasslands dominated by thatch grass (Hyparrhenia filipendula), red grass (Themeda triandra), and yellow thatching grass (Hyperthelia dissoluta) [32,34].

Field Nitrogen Data
A field campaign was conducted from 4 April 2017 to 16 April 2017. This time coincided with the end of the growing season in the miombo woodlands. A landcover map of the game reserve, generated from a Sentinel-2 image of May 2016 with a kappa statistic of 0.83, was used to locate relatively homogenous miombo woodland stands that were targeted for sampling. Using the Hawths tools in ARCGIS 10.1 [35], a total of 103 sampling points distributed only in the miombo woodlands were generated (see Figure 1). A minimum distance of at least 150 m was set between sampling plots to The climate is subtropical savanna, with an annual average rainfall of 622 mm (Masvingo airport rainfall data: 1906-2015). Most of the rain falls during a single wet (growing) season that stretches from mid-November to end of March. The mean daily maximum temperature ranges from 21 • C in June to 29 • C in October. The mean daily minimum temperatures range from 5 • C in July to 17 • C in January [32]. The soils comprise a mixture of chromic-leptic lixisols and chromic luvisols that are mainly derived from granitic and gneiss parent materials [33]. The vegetation on the plain ranges from miombo woodlands, dominated by msasa (Brachystegia spiciformis) and mnondo (Julbernardia globiflora), to bushland thickets dominated by African weeping wattle (Peltophorum africanum), Silver terminalia (Terminalia sericea), Combretum spp., and sweet thorn (Vachellia karoo), and to extensive grasslands dominated by thatch grass (Hyparrhenia filipendula), red grass (Themeda triandra), and yellow thatching grass (Hyperthelia dissoluta) [32,34]. relatively homogenous miombo woodland stands that were targeted for sampling. Using the Hawths tools in ARCGIS 10.1 [35], a total of 103 sampling points distributed only in the miombo woodlands were generated (see Figure 1). A minimum distance of at least 150 m was set between sampling plots to eliminate the possibilities of overlapping pixels in the satellite image data. The coordinates (xy) of the sampling points were uploaded into a hand-held global positioning system (GPS) receiver.

Field Nitrogen Data
In the field, a Garmin Etrex GPS receiver with a positional accuracy of ≤3 m was used to locate the random sampling points. Each sampling point was taken as the centre of a quadrat plot measuring 20 m by 20 m, which was north-oriented using a compass. A plot size of 400 m 2 was chosen to coincide with the coarser spatial resolution of the Sentinel-2 data. The centroid of each sampling plot was located such that for a minimum distance of 40 m around the plot, the entire plot fell within a miombo woodland cover. This measure was taken to minimise the potential contaminating effect of other landcover types on the spectral value of the sampling plot.
In each sampling plot, all miombo trees whose canopies contributed most to the percentage canopy cover were identified and assigned a unique number. Those individual trees with the largest crowns (diameter >5 m) were targeted for sampling in each plot. The bigger and more widespread the crown of an individual tree, the greater its influence on the canopy cover of the sampling plot, and consequently the more influential it is on the reflectance value. Then, about 2-4 individual trees were randomly chosen. From each selected tree, 6 to 10 fresh, fully expanded green leaves [36] were clipped from two layers within the canopy; namely, the sunlit leaves from the top of the canopy and the partially shaded leaves from within the canopy. Our sampling plots were largely homogeneous, being dominated (>80%) by a single species of either Julbernardia globiflora or Brachystegia spiciformis. Consequently, the foliar samples were from the dominant tree species. At each sampling plot, leaves from the same canopy level but different trees were mixed to make one composite sample and placed in an airtight container, which was stored in a cooler box full of ice. At the end of each day, the collected samples were oven dried at 80 • C for 48 h. All leaf samples were later sent to the Department of Biological Sciences at the University of Zimbabwe in Harare for foliar N determination.
In the laboratory, the standard colorimetric method was used to determine foliar N concentration [37,38]. In brief, the plant material was first digested using selenium powder, lithium sulphate, hydrogen peroxide, and sulphuric acid [39,40]. Then, the absorbance of each sample was read using a Cole and Parmer 1100 spectrophotometer set at 655 nm. N concentration was expressed as a percentage of the dry matter. The colorimetric method has produced reliable results elsewhere hence its use [4,38,41]. The results for the top and middle canopy were analysed and presented separately. In this study, however, an average of the two values was related to remotely sensed data.

Remote Sensing Data
A Sentinel-2 level 1C image was downloaded from the European Space Agency (ESA) website [42]. The image was acquired on 12 April 2017, which coincided with the period of the field campaign. Sentinel-2 data consists of 13 bands, from the visible to the shortwave infrared band. Two of the bands contain information on cirrus clouds (band 10) and aerosols (band 1); therefore, they were not used in this study. An atmospheric correction procedure was performed on the Sentinel-2 data using the SACP plugin [43] in QGIS software, version 2.18. All of the bands were later re-sampled using the nearest neighbour analysis to the same 10 m spatial resolution in the SNAP software [44] to facilitate layer stacking during analysis. Table 1 presents the wavelength as well as spatial resolution of the Sentinel-2 bands used in this study.

Spectral Indices
In this study, eight spectral indices that were previously related to field N were calculated from Sentinel-2 data and used to estimate foliar N. The indices used, and the mathematical expressions applied to derive them, are given in Table 2. Five of the indices included the NIR band, while only three indices did not include the NIR band.  B4, B8 [50] # the subscripts on the red edge band are used to distinguish the central wavelengths within the red edge band that were used for calculating the indices (see Table 1). Sentinel-2 has three red edge bands, which are identified in this table by subscripts 1-3.

Random Forest Regression
We used the Random Forest (RF) regression model in relating (a) individual Sentinel-2 bands, (b) spectral indices, and (c) combined spectral indices and individual bands to field-measured foliar N. The RF regression model was used because of its ability to handle several input variables, such as remote sensing bands, while circumventing overfitting and multi-collinearity problems [4,51]. The RF regression analysis was performed in the R environment software developed by [52]. Specifically, the "randomForest" package developed by [53] was used to execute random forest regression. The RF regression algorithm is a bagging method that is based on the classification and regression trees (CART) method [4,51]. Three parameters were optimized: node size, mtry, and ntree, where the node size is the minimal size of the terminal nodes of the trees, mtry is the number of predictor variables performing the data partitioning at each node, and ntree is the total number of trees to be grown in the model [51]. The best ntree values for predicting foliar N in the miombo woodlands were determined by optimising Remote Sens. 2018, 10, 505 6 of 17 ntree based on the root mean square error of calibration (RMSEC). The number of trees was tested from 100 to 1000 at intervals of 100 [54]. The default node size and mtry were accepted throughout the analysis. A more detailed explanation of random forest is outlined elsewhere [4,55,56]. The RF regression model was also applied to assess the importance of the NIR in the prediction of foliar N. Prior to running the model, the data set was randomly partitioned into modelling and validation data based on a 70:30 ratio (72 and 31 sampling points, respectively). The importance of each predictor variable was measured by calculating the percent increase in mean squared error (%IncMSE) when data not used in the prediction (out of the bag (OOB) data) for each variable are permuted, while all others are unchanged [54]. The mean variable importance was calculated from 500 bootstraps, and the results were shown. Bootstrapping is a resampling procedure that randomly selects samples with replacement, allowing researchers to draw conclusions from the existing sample rather than by making assumptions about the estimator [57]. In this study, each bootstrap iteration drew 72 plots with replacement from the 72 available samples. The remainder of the samples that were not drawn for modelling were used as holdout samples for an independent validation [58]. The mean percent increase in mean square error was used to rank the importance of (a) individual sentinel-2 bands and (b) each spectral index in predicting foliar N in the miombo woodlands. The variable with the highest ranking was the most important variable for predicting foliar N. After ranking the individual bands and spectral indices, we generated two more random forest models, which excluded (a) the NIR band (from the individual band list) and (b) those spectral indices which have information from the NIR band.

Model Evaluation
In order to evaluate the performance of the five random forest models, we made use of three statistical parameters. These are the normalised root mean square error (nRMSE), a square of the strength of the correlation (r 2 ) between the observed and predicted values, and the bias. Firstly, the RF models were calibrated using the individual bands and spectral indices. A bootstrapping procedure with 500 iterations was then applied on the random forest regression models to predict the value of N from an independent test dataset of 31 observations. The mean nRMSE, mean r 2 , and mean bias from the 500 bootstraps were calculated for each of the five RF models and were used as measures of accuracy of the RF models. Based on the nRMSE, the best RF model was used to produce a map of the mean N and standard deviation of N based on the 500 bootstraps.

Results
The field-measured foliar N was variable. It ranged from a minimum of 0.33% to a maximum of 5.33%. The mean N concentration in the study area was 2.43%, with a standard deviation of 1.07%.

Individual Bands
Results of this study show that the NIR region of the electromagnetic spectrum is ranked lowly in the prediction of foliar N in the miombo woodlands. Figure 2 shows that the NIR band (band 8), the best-ranked NIR band in predicting foliar N, is ranked eighth out of the ten bands that were considered. The other NIR band (band 8a) is ranked ninth, which is second from last. The results in Figure 2 indicate that the four most important bands are found within the red edge section of the electromagnetic spectrum (band 5 and band 7) as well as the shortwave infrared (SWIR) portions of the electromagnetic spectrum (band 11 and band 12). The fifth-most important band is the blue band (band 2). The other red edge band (band 6) is ranked lowly in the prediction of foliar N in the miombo woodlands.  Figure 3 illustrates that three of the five most important spectral indices for estimating foliar N (that is, simple ratio index (RVI), enhanced vegetation index (EVI), and normalised difference vegetation index (NDVI)) contain the NIR band. Specifically, NDVI, which is derived from the NIR and red bands, is the most important NIR-based spectral index for predicting variations in foliar N concentration in the miombo woodlands. The two spectral indices ranked higher than NDVI are the green chlorophyll index (CIG) and the green index (GI), both of which contain the green band. Only the CIG contains information from the red edge band. The spectral index containing only the red edge (NDVIre) was ranked lower than several spectral indices which contained the NIR and visible region bands. When combined in a spectral index, the NIR band improved in their ranking in foliar N prediction. However, combining the individual bands and spectral indices in the same model results in a low ranking of the NIR-based indices (Figure 4). This shows the low ranking of information from the NIR region in predicting N in the miombo woodlands.  Figure 3 illustrates that three of the five most important spectral indices for estimating foliar N (that is, simple ratio index (RVI), enhanced vegetation index (EVI), and normalised difference vegetation index (NDVI)) contain the NIR band. Specifically, NDVI, which is derived from the NIR and red bands, is the most important NIR-based spectral index for predicting variations in foliar N concentration in the miombo woodlands. The two spectral indices ranked higher than NDVI are the green chlorophyll index (CIG) and the green index (GI), both of which contain the green band. Only the CIG contains information from the red edge band. The spectral index containing only the red edge (NDVIre) was ranked lower than several spectral indices which contained the NIR and visible region bands. When combined in a spectral index, the NIR band improved in their ranking in foliar N prediction. However, combining the individual bands and spectral indices in the same model results in a low ranking of the NIR-based indices (Figure 4). This shows the low ranking of information from the NIR region in predicting N in the miombo woodlands.      Table 3 shows that individually, the NIR band makes a small contribution in the prediction of foliar N in the miombo woodlands. A comparison of the two models utilising individual Sentinel-2 bands revealed that an exclusion of the NIR bands in the random forest regression model increased the normalised root mean square error (nRMSE) from 11.35% to 11.69%. The coefficient of determination of the model marginally dropped from 0.94 to 0.93. On the other hand, it can be noted that excluding NIR-based spectral indices resulted in a marginal decline in the value of r 2 , while the normalised root mean square error increases from 12.90% to 13.45%. The marginal decrease in the r 2 when NIR-based indices were excluded from the model points towards the significance of using spectral indices in the prediction of foliar N in the miombo woodlands. Combining both individual bands and spectral indices in the regression model resulted in the lowest nRMSE recorded.

Predictions of Foliar N
Scatterplots of the relationship between predicted N and field-measured N are shown in Figure 5. The scatterplots show a generally consistent deviation from the 1:1 line by all the models. Figure 5 shows that there is an overestimation of foliar N at lower concentrations. At higher concentrations of N, there is an underestimation of foliar N. The goodness of fit statistics for the relationship between predicted N and field-measured N are given in Table 3. Figure 6 shows the distribution of the accuracy statistics of the five models after running a bootstrap with 500 iterations. Figure 6 shows that for the three accuracy statistics, the bias was the least varied, while the r 2 and the nRMSE were highly variable (more widely spread). While the nRMSE for the two models based on spectral indices was consistently high, the r 2 for the same models was low. The higher nRMSE indicates a lower prediction accuracy, while the wider range of values indicate increased variation in the performance of the models in predicting foliar N. Models utilising individual bands on the other hand have a narrower range of values, indicating a more consistent prediction of foliar N in the miombo woodlands. Figure 7 shows the spatial variation of mean %N in the miombo woodlands estimated using the RF model developed from all the individual bands and spectral indices. This model was chosen because it had the lowest nRMSE ( Table 3).
The map of the standard deviation in Figure 6 shows very low variation (0.001 to 0.031) in the prediction of foliar N. The low values of standard deviation are consistent with the results from Figure 6, where the combined model shows a narrower range of values of nRMSE. Figure 8 shows areas of high and low standard deviation of foliar N prediction using the RF model with all the spectral indices and all individual bands.
The areas that have a lower standard deviation (black oval in Figure 8) have more mature trees and a higher percentage cover, while those areas that have a higher standard deviation (black triangle in Figure 8) have lower percent canopy cover. The higher variations in predicted figures might be explained by the inclusion and exclusion of such sample plots in the calibration of the model, thereby altering the accuracy of the model. The soil background could also have affected the prediction accuracy, causing higher uncertainty in sample plots of medium canopy cover. The absence of the soil background in sample plots of high canopy cover could have led to small variations in the prediction of foliar N.   Figure 7 shows the spatial variation of mean %N in the miombo woodlands estimated using the RF model developed from all the individual bands and spectral indices. This model was chosen because it had the lowest nRMSE ( Table 3).
The map of the standard deviation in Figure 6 shows very low variation (0.001 to 0.031) in the prediction of foliar N. The low values of standard deviation are consistent with the results from Figure  6, where the combined model shows a narrower range of values of nRMSE. Figure 8 shows areas of high and low standard deviation of foliar N prediction using the RF model with all the spectral indices and all individual bands.
The areas that have a lower standard deviation (black oval in Figure 8) have more mature trees and a higher percentage cover, while those areas that have a higher standard deviation (black triangle in Figure 8) have lower percent canopy cover. The higher variations in predicted figures might be explained by the inclusion and exclusion of such sample plots in the calibration of the model, thereby altering the accuracy of the model. The soil background could also have affected the prediction accuracy, causing higher uncertainty in sample plots of medium canopy cover. The absence of the soil background in sample plots of high canopy cover could have led to small variations in the prediction of foliar N.

Discussion
The results demonstrate that, when single bands are used, information from the NIR region ranked very lowly in the prediction of foliar N within the miombo woodlands. Our results contradict several

Discussion
The results demonstrate that, when single bands are used, information from the NIR region ranked very lowly in the prediction of foliar N within the miombo woodlands. Our results contradict several studies that have ranked the NIR region highly in predicting variations in foliar N [14,22,59,60]. For example, [61] concluded that the NIR band significantly (r 2 = 0.73) predicted variation in foliar N in the Bavarian Forest National Park, a mixed temperate forest ecosystem. In addition, Lepine et al. [14] also reported an r 2 of 0.86 when relating foliar N to the NIR bands. However, in this study, the red edge and the SWIR regions are important in the prediction of N when we consider individual bands. However, the spectral indices containing the NIR and red regions were found to be important in the prediction of foliar N in the miombo woodlands.
Another important feature of the findings is that, when spectral indices are used, the indices which contained the NIR band, such as RVI and EVI2, ranked highly in the prediction of N in the miombo woodlands. This result provides additional empirical support to the hypothesis that spectral indices increase the content of the information in remotely sensed data and the NIR gains in importance when combined with other bands. The three best predicting spectral indices that contain the NIR are made up of the NIR and the red band, showing the prominence of the visible region in the estimation of N. The superior performance of the spectral indices derived from the NIR can be attributed to the fact that spectral indices generate new information by aggregating the relationships between foliar N and different portions of the electromagnetic spectrum [62]. The NIR region contains information on several leaf characteristics, including biomass, leaf area, and thickness [14], while the red edge is related to chlorophyll concentration; leaf properties which have both been related to foliar N concentration [11,17]. Spectral indices have evolved over time with the aim of enhancing the vegetation signal while decreasing the "noise effect" from several sources, including solar irradiance and soil background [63]. However, other critical factors, such as the leaf area index [64] and saturation problems, have to be accounted for if the spectral indices are to be used for mapping foliar biochemicals such as N.
Results of this miombo study share similarities with some studies that were carried out in grassland ecosystems of southern Africa at the landscape scale. For example, [65] used simulated Sentinel-2 data and came to the conclusion that the red edge and SWIR bands were significant in remotely estimating foliar N in a savanna grassland. In addition, the visible SWIR bands (583 nm, 679 nm, 710 nm, 2091 nm, 2128 nm, and 2146 nm) and red edge position were also important in predicting foliar N in Colophospermum mopane woodlands [66,67]. Our study is different in that the previous studies used hyperspectral remote sensing as well as simulated Sentinel-2 bands to relate to foliar N, where here broadband Sentinel-2 images were used in estimating foliar N. The high ranking of the red edge in this study is not surprising. This is because the red edge has been widely used in mapping leaf biochemical composition across different ecosystems [8,11,68,69]. However, the consistency in the importance of the SWIR region underlines its significance for foliar N estimation in seasonally humid landscapes. The SWIR region contains N absorption features that have been used in the remote estimation of N [15,16,21,70]. The development of SWIR-based methods, however, has been overshadowed by those from the red edge region mainly because the N absorption features in the SWIR are masked by water absorption.
The application of spectral indices containing NIR as well as the SWIR in foliar N mapping is a promising method of using broadband remote sensing in estimating N at the landscape level. On one hand, Ref. [14] noted that the %N-NIR relationship is driven by several canopy structural properties that influence reflectance. On the other hand, Pellissier et al. [71] concluded that the NIR and SWIR peaks that explained %N variations were also significant in explaining variations in canopy height, biomass, and water content. Taking these two conclusions together, other factors, such as water availability in an ecosystem, can potentially influence the relationship between foliar N and NIR. This is important given that the SWIR region contains both water and N absorption features [72,73]. However, information from the NIR region has ordinarily been used to explain leaf characteristics such as leaf area index, density, and canopy cover [14]. While our study was conducted within the miombo woodlands, within a single season, the results provide an insight into the remote estimation of foliar N in other resource-constrained environments where hyperspectral remote sensing is considered too expensive and broadband remote sensing is an alternative.

Conclusions
This study assessed the utility of the NIR region in estimating foliar N in the miombo woodlands. This study concluded that when used as single bands, the NIR region is ranked low when related to foliar N in the miombo woodlands. However, when combined with other bands in spectral indices, the importance of the NIR region improved markedly. This means that spectral indices which contain the NIR band are important in the estimation of foliar N in the miombo woodlands. This finding suggests that combining bands generates new information which is important in the prediction of foliar N in the miombo woodlands. Spectral indices improve the relationship between foliar N and remotely sensed imagery by minimising the "noise" in spectral values while amplifying the relationship with factors affecting leaf morphology, such as biochemicals. Information from a single broadband channel is often too "contaminated" to be meaningfully related to variations in foliar biochemical concentrations, including N.
Sentinel-2 broadband data has been proved to be very useful in foliar N mapping. The spatial resolution of 20 m potentially offers no challenges in less-frequently disturbed ecosystems. This is largely because few individual trees dominate the reflectance, making field sampling cheaper and correlation with field data easier. However, in frequently disturbed ecosystems made up of several smaller individual trees, relating reflectance to foliar N may be costly and compounded by the presence of several species. This method is significant in the mapping and monitoring of foliar N as a measure of ecosystem health in the miombo woodlands.