Wavelength Selection Method Based on Partial Least Square from Hyperspectral Unmanned Aerial Vehicle Orthomosaic of Irrigated Olive Orchards

: Identifying and mapping irrigated areas is essential for a variety of applications such as agricultural planning and water resource management. Irrigated plots are mainly identified using supervised classification of multispectral images from satellite or manned aerial platforms. Recently, hyperspectral sensors on-board Unmanned Aerial Vehicles (UAV) have proven to be useful analytical tools in agriculture due to their high spectral resolution. However, few efforts have been made to identify which wavelengths could be applied to provide relevant information in specific scenarios. In this study, hyperspectral reflectance data from UAV were used to compare the performance of several wavelength selection methods based on Partial Least Square (PLS) regression with the purpose of discriminating two systems of irrigation commonly used in olive orchards. The tested PLS methods include filter methods (Loading Weights, Regression Coefficient and Variable Importance in Projection); Wrapper methods (Genetic Algorithm-PLS, Uninformative Variable Elimination-PLS, Backward Variable Elimination-PLS, Sub-window Permutation Analysis-PLS, Iterative Predictive Weighting-PLS, Regularized Elimination Procedure-PLS, Backward Interval-PLS, Forward Interval-PLS and Competitive Adaptive Reweighted Sampling-PLS); and an Embedded method (Sparse-PLS). In addition, two non-PLS based methods, Lasso and Boruta, were also used. Linear Discriminant Analysis and nonlinear K-Nearest Neighbors techniques were established for identification and assessment. The results indicate that wavelength selection methods, commonly used in other disciplines, provide utility in remote sensing for agronomical purposes, the identification of irrigation techniques being one such example. In addition to the aforementioned, these PLS and non-PLS based methods can play an important role in multivariate analysis, which can be used for subsequent model analysis. Of all the methods evaluated, Genetic Algorithm-PLS and Boruta eliminated nearly 90% of the original spectral wavelengths acquired from a hyperspectral sensor onboard a UAV while increasing the identification accuracy of the classification.


Introduction
The intensification of agricultural practices, including better seeds, extensive fertilizer use and irrigation techniques, has altered the dynamics between humans and environmental systems across the world [1].Although these agricultural practices have allowed for increased food production, they have also caused significant environmental impact on many regions.Consequently, accurate and precise information is in high demand from Earth System Science and global change research [1].Today, irrigated agriculture is one of the most significant contributors of water consumption [2], necessitating modeling water exchange between land surface and atmosphere [3], managing water resources [4] and analyzing the variability of irrigation water requirements and supply [5].As a result, estimations of water demand do not consider the spatial variability in irrigation practices and do not reflect the characteristics of irrigation techniques [6].
Irrigation is currently used in most new olive orchards, which suggests that the percentage of irrigated olive trees is very high throughout the world and increasing [7].Since water is the largest agricultural input in many cultivation systems in Mediterranean areas, with water availability being one of the main limiting factors for crop yield, the use of more efficient irrigation techniques in olive orchards has become essential [8].Water Use Efficiency (WUE) is a term that was coined more than 100 years ago [9] and it functions as an indicator of the balance between productivity and water availability.WUE is an essential parameter nowadays due to the great pressure of increasingly intense and frequent droughts associated with climate change effects on agricultural water availability and crop yields worldwide [10].As WUE is a measurement of yield or biomass produced per unit of water [11], it is therefore particularly useful when trying to compare the efficiency of different irrigation systems.In this context, many research projects demonstrate that productivity could be increased with no change in the rate of water use resulting in greater WUE [12][13][14].For the olive grove, Martínez and Reca (2014) [8] compared Subsurface-Drip Irrigation (SDI) and surface Drip Irrigation (DI).Both yield and WUE for SDI outperformed DI, with a water savings of up to 20% for the former.As such, evaluating crop productivity and means of irrigation in relationship to WUE would allow a better management of water resources [10].
Remote sensing has been demonstrated to be an effective tool for locating, mapping and monitoring irrigation techniques by providing data in several regions of the electromagnetic spectrum and with a variety of spatial and temporal resolutions to assess crop growth, maturity and yield [15][16][17][18].Images are obtained remotely via a broad range of sensors on-board three main types of platforms: satellite, manned aerial and unmanned [19].Each of these approaches have pros and cons that involve economic, operational, and technological factors [19].Satellite imagery covers extensive areas, and some Earth observation programs provide free low spatial-resolution datasets, e.g., Moderate Resolution Imaging Spectroradiometer (MODIS), or medium resolution datasets, e.g., Sentinel-1 and -2 from the Copernicus program [20,21].The effectiveness of satellite imagery applied to arable crops, forests and extensive plantations has been demonstrated in many studies [1,21].However, satellite imagery may suffer from cloud cover and constrain image timing for specific phenological characteristics due to the limits of temporal resolution [22].Remote sensing becomes more challenging when considering crops with discontinuous layouts, such as olive trees, vineyards or orchards [23].The presence of inter-row paths may deeply affect the overall computation of spectral indices, leading to an inadequate assessment of crop status [23].On the other hand, manned aircraft surveys offer more operational flexibility, providing spatial resolutions in the range of centimeters, but comes with high operational and logistic costs [24,25], making it difficult to perform frequent flights in phenological studies [26].UAV platforms offer greater flexibility still [27], allowing for the possibility to differentiate pure vegetation pixels in images over woody crops.However, UAV platforms are limited both in regard to payload and flight time [28].Despite these limitations, UAV platforms have been shown to be very useful tools in the mapping of irrigated areas when working in concordance with traditional platforms [29].In the context of this research, UAV platforms have been used in post-classification correction of traditional platforms to identify anomalies in the mapped irrigated plots and improve classification accuracy [29].
The UAV payload included a variety of sensors, including RGB, multispectral, hyperspectral, thermal and LiDAR.Multi-and hyper-spectral sensors have been successfully used in many applications which require accurate spectral information [30,31].The main difference between both type of sensors is based on the number of spectral bands.While multispectral images generally range from 4 to 12 spectral bands that are represented in each pixel, hyperspectral images consist of hundreds of spectral bands arranged in a very narrow bandwidth [32].The high spectral resolution in hyperspectral images allow the detection of spectral details that can be imperceptible in multispectral images due to their discrete spectral nature [32].UAV-based hyperspectral imaging in agriculture has been successfully used in chlorophyll [33], biomass [34], nitrogen [35] or water [36] content estimation; the detection of diseases [37]; weed classification [38]; the evaluation and classification of crop water status [39]; etc.Therefore, hyperspectral remote sensing technologies have improved our capability for understanding the processes of biophysical and biochemical properties of vegetation [40].
As a result of the high number of spectral bands, many of them are highly correlated and therefore a dimension reduction or wavelength selection method is essential to apply in preprocessing of the hyperspectral image to improve its usability [41].These selection methods can be grouped in three categories: (i) wave band features [42,43]; (ii) spectral position features [44,45]; and (iii) vegetation indices [45,46].These methods are performed through a variety of techniques such as Principal Components Analysis (PCA) [47,48], Minimum Noise Fraction [49,50], Singular Value Decomposition [51] or Partial Least Square (PLS) [52,53] among others.As a result, these techniques reduce the data size by selecting those wavelengths sensitive to the object of interest [54].Specifically, PLS regression is a nonparametric and supervised technique particularly useful to achieve the "large p -small n" problem in high-dimensional datasets [55].This technique combines features from PCA and multiple regression to predict dependent variables from a set of orthogonal factors (latent variables) extracted from predictors with the best predictive power [56].To achieve this, a simultaneous decomposition of predictors and dependent variables are performed with the constraint that these components explain the covariance as much as possible.Afterwards, a regression step is performed where the decomposition of predictors is used to predict dependent variables [56].
Previous studies have identified wavelengths that are sensitive to crop properties such as chlorophyll content, nitrogen status and water content or estimation of biomass [57][58][59][60].PLS methods have proven to be very versatile for multivariate data analysis in applications related to bioinformatics [61,62] or chemometrics [63,64], as well as remote sensing [65,66].Initially, in these studies, PLS was not implemented to select variables, since the objective was to find a relevant linear subspace of the explanatory variables, but, eventually, several PLS selection methods for variable selection were finally proposed [67].These methods can be categorized in three types: filter, wrapper and embedded methods.Filter techniques evaluate the relevance of the characteristics by only looking at the intrinsic properties of the data.In most cases, a feature relevance score is calculated, and low scoring features are removed.Subsequently, this subset of characteristics is presented as input to the classification algorithm [67].These methods require some type of filter measure (loading weights, regression coefficients or importance of the variable in projection) that represents the response relationship with the respective variable, and, for this, a threshold is required to classify the variables as selected or not [55].For Loading Weights Method (LW-PLS), the peaks or valleys with the maximum absolute load weights from the first major factor to the optimal principal factor are selected as sensible wavelengths [68].For the Regression Coefficient Method (RC-PLS), the sensitive wavelengths are generally selected according to the regression coefficient of the optimal PLS model [67].In general, the peaks or bands are selected as the sensible wavelength or waveband when the absolute value of the regression coefficient is greater than the threshold [69].On the other hand, the basis of the Variable Importance in Projection Method (VIP-PLS) is to accumulate the importance of each variable, this being reflected by the load weights from each component [70].While filter techniques address the problem of finding a good subset of features regardless of the model selection step, wrapper methods incorporate the model hypothesis search into the feature subset search [67].The methods are mainly distinguished by the choice of the underlying filter method and how the wrapper is implemented, and they are primarily based on procedures that iterate between model fitting and variable selection [55].These procedures, for example, a Genetic Algorithm integrated with the PLS regression Method (GA-PLS), combine the advantage of GA and PLS and return a vector of variable numbers that corresponds to the model that has the lowest prediction error [71].For Uninformative Variable Elimination PLS (UVE-PLS), artificial noise variables are added to the predictor set before the PLS model is fitted and all original variables that are less important than the artificial noise variables are removed before the procedure is repeated until a stop criterion is reached [72].In the general procedure for Backward Variable Elimination PLS (BVE-PLS), variables are ordered first with respect to some measure of importance, and one of the filter measures described above is generally used [73].Secondly, a threshold is used to eliminate a subset of the least informative variables, and then a model is fitted again to the remaining variables and performance is measured.The procedure is repeated until the maximum performance of the model is achieved.Other wrapper methods are Subwindow Permutation Analysis coupled with PLS (SwPA-PLS), which provides the influence of each variable without considering the influence of the rest of the variables [74]; Iterative Predictive Weighting PLS (IPW-PLS), which is an iterative elimination procedure where a measure of predictor importance is computed after fitting a PLS model [75]; Regularized Elimination Procedure in PLS (REP-PLS), where a stability-based variable selection procedure is adopted [67]; Backward [76] and Forward [77] Interval PLS (BiPLS and FiPLS, respectively), where the dataset is divided into a given number of intervals, and the PLS models are then calculated with each interval left in a sequence, giving the first omitted interval the worst performance model with respect to the mean squared error of cross-validation (MSECV); and Competitive Adaptive Reweighted Sampling (CARS-PLS), which is a function variable selection method that combines Monte Carlo sampling with the PLS regression coefficient [78].For embedded methods, the search for an optimal subset of features is built into the classifier construction and can be seen as a search in the combined space of feature subsets and hypotheses [79].Similar to wrapper approaches, embedded approaches are thus specific to a given learning algorithm.One of these methods is Sparse-PLS (S-PLS), which is a version of PLS that aims to combine selection and modeling in a one-step procedure [80].
On the other hand, machine learning algorithms, including Linear Discriminant Analysis (LDA) and K-Nearest Neighbors (KNN), are powerful tools for analyzing hyperspectral information since they can process a large number of variables efficiently [81,82].LDA is a subspace technique that seeks to find the maximum Fisher's ratio [83] while KNN is a non-parametric learning algorithm since there is no assumption for the underlying data distribution.KNN also utilizes lazy learning, meaning that it does not need any training data point for the generation of the model [84].Specifically, these machine learning algorithms have been widely used in the remote sensing field for agricultural applications [37,85].For instance, Suarez et al. [85] estimated phenoxy herbicide dosage in cotton crops through the analysis of hyperspectral data with LDA.In addition, Bohnenkamp et al. [37] utilized KNN to detect yellow rust in wheat with the application of hyperspectral imaging technology.In general, these models have shown to be effectives for investigating agricultural features using hyperspectral imagery [86].
Many authors have compared selection PLS methods to analyze which offers better performance [69,87].Likely, there is no best variable selection method due to the interaction between method and properties of the analyzed data [55].As per the above discussion, this article focuses on the use of wavelength selection methods in UAV hyperspectral images to compare two irrigation systems commonly used in olive orchards, SDI and DI.For this purpose, 16 methods (13 based on PLS) were evaluated, and the quality of the results were assessed by two linear and nonlinear classification techniques.This paper is organized as follows.Section 2 describe the materials and methods used.Section 3 shows the results.Section 4 includes the final concluding remarks.

Materials and Methods
The workflow for classifying olive tree crowns according to the irrigation technique used is summarized in Figure 1.Two UAV flights were performed using RGB and hyperspectral sensors, respectively.The RGB UAV flight was used to geometrically define each of the olive trees while the hyperspectral UAV flight was used to characterize them radiometrically.Then, different PLS-and non-PLS-based methods were used to select the most significant wavelengths to classify the two irrigation techniques methods of the study area.Finally, with the selected wavelengths, two classifications were performed using LDA and KNN, evaluating the quality and efficiency of the results.

Study Area and UAV Flights
The study site was located in Marchena (Seville province, Spain) (37°24′52.7″N,5°27′44.9″W,WGS84) (Figure 2a) in a 7-ha commercial orchard planted in 2015 with the olive cultivar Arbequina at a spacing of 3 m × 1.5 m (Figure 2b).The hedgerow orchard was adapted in equal parts to DI or SDI with carry self-compensating dripper pipes at 2.2 L/h spaced 0.5 m apart along the irrigation line and placed 0.5 m from the trunks and, in the case of SDI, buried 0.45 m deep.The trees were irrigated weekly with the seasonal water amount equivalent to 100% of ETc.Soil properties and agricultural practices were similar in both areas where the irrigation systems were implemented.The climate is Mediterranean with an average annual temperature and precipitation (concentrated mostly in late fall and winter) of 18.8 °C and 544 mm, respectively.A representative area including 413 olive trees (200 olives trees irrigated with DI and 213 with SDI) was used for data analyses.

UAV Flights and Processing
Two UAV flights, RGB and hyperspectral, were performed on 22 May 2020 at solar noon to take advantage of the position of the sun and minimize shadows in the images acquired.The RGB UAV flight was used to precisely delineate each of the olive trees in the study area while the hyperspectral UAV flight characterized each of them spectrally.A DJI Mavic Pro 2 (SZ DJI Technology Co., Shenzhen, China) was used to perform the RGB flight.As the payload, a Hasselblad L1D-20c (Hasselblad Group, Göteborg, Sweden) was used, which provides an image of 13.2 mm × 8.8 mm, a focal length of 10.3 mm and an image size of 5472 × 3648 pixels.This UAV was flown at an altitude of 65 m above ground level (AGL) and forward and side-lap were 80% and 70%, respectively.Ground Sample Distance (GSD) was 1.6 cm taking into account the characteristics of the sensor.For the hyperspectral flight, a DJI Matrice 600 Pro (SZ DJI Technology Co., Shenzhen, China) was used.This UAV platform was also equipped with a Nano Hyperspec sensor (Headwall Photonics Inc., Boston, MA, USA) with 270 spectral bands (6-nm FWHM) both in the VNIR spectral range (400-1000 nm) with a sampling of 2.2 nm.Being a push-broom sensor, imagery is collected, line by line, at 12-bit radiometric resolution along the flight path where each line of pixels comprises 640 spatial pixels.The hyperspectral sensor was mounted on a Ronin-MX gimbal system (SZ DJI Technology Co., Shenzhen, China) to minimize external disturbances such as roll, pitch and yaw oscillations.This UAV platform was flown at an altitude of 100 m AGL, GSD being equal to 6 cm taking into account the characteristic of the sensor.
Prior to the UAV flights, five artificial targets were placed in the study area as Ground Control Points (GCPs), one in each corner and the other in the center.Each GCP was measured with the stopand-go technique through relative positioning by means of the NTRIP protocol (The Radio Technical Commission for Maritime Services, RTCM, for Networked Transfer via Internet Protocol) using two GNSS (Global Navigation Satellite System) receivers.One of the receivers was a reference station for the GNSS Red Andaluza de Posicionamiento (RAP) network from the Institute of Statistics and Cartography of Andalusia, Spain, and the other, a Leica GS15 GNSS (Leica Geosystems AG, Heerbrugg, Switzerland), functioned as the rover receiver.In addition, a known reflectance value calibration tarp was placed in the center of the plot to subsequently correct the spectral cubes radiometrically.Before the hyperspectral UAV flight, the sensor capture mode was configured to set up the number of frames per second in accordance with the flight speed and AGL flight height.Likewise, the exposure level was established through the determination of the reflectance curve of a 90% target in order to avoid saturating the sensor in the recording of the spectral cubes.In addition, black body spectral information was taken by closing the aperture to later convert the digital levels to radiance values.
To produce an RGB orthomosaic from the RGB UAV flight, photogrammetric processing was divided into four stages: aerial triangulation, Digital Surface Model (DSM) generation, rectification of individual images and orthomosaicking.Aerial triangulation determined the individual external orientation of each image of the photogrammetric block.Afterwards, a dense point cloud was generated using Structure from Motion (SfM) techniques [88] to create a DSM.Finally, individual images were rectified and mosaicked to generate an RGB UAV orthomosaic of the study area.This methodology has been validated in previous research projects [19,89,90] and was performed using Pix4Dmapper (Pix4D S.A., Prilly, Switzerland).The hyperspectral data processing was divided into three stages [91].Firstly, each hyperspectral cube was converted from digital number to radiance values, using dark reference captured prior to the UAV flight.Secondly, radiance values were converted to reflectance values using the values from the calibration tarp.Finally, all the reflectance hyperspectral cubes were orthorectified using data from the GNSS receivers, the Inertial Measurement Unit (IMU) and the Shuttle Radar Topography Mission (SRTM) Digital Elevation Model [92].The accuracy and spatial resolution of the SRTM model was adequate due to the smooth relief of the study area.This processing was performed using SpectralView software (Headwall Photonics Inc., Boston, MA, USA).
Due to the difficulty in segmenting individual trees, an RGB orthomosaic was used to manually digitize polygons of all the olive trees, using the QGIS desktop Geographic Information System.As an example, a partial view of the hedgerow in the RGB and hyperspectral orthomosaic is shown in Figure 3.Then, for each olive tree, the mean spectral reflectance value for each of the 270 spectral bands was calculated, generating a spectral curve at the canopy level.This step was automated using a script developed in the Python programming language [93].

Wavelength Selection Methods Used and Evaluation
Although there are other methods, this project focused on applying those methods that use the output of a PLS algorithm to identify a subset of important variables.As such, the variables analyzed in this project were the wavelengths registered by the hyperspectral sensor.The PLS selection methods used in this study are grouped by category (Table 1).More information on the PLS methods used in this study can be found in the work by Mehmood et al. [67].Other wavelength selection methods, such as the Least Absolute Shrinkage and Selection Operator (Lasso) and Boruta algorithms, were also included in this study [94,95].Lasso can handle ill-posed problems (i.e., a large number of correlated variables compared to sample size) and works by penalizing the magnitude of the characteristic coefficients along with minimizing the error between the predicted and actual observations [96].On the other hand, the Boruta algorithm was developed to identify all relevant variables within a classification framework [97].This method searches for relevant characteristics by comparing the importance of the original attributes with the importance that can be obtained at random, estimated using their permuted copies, and progressively eliminating irrelevant characteristics to stabilize the test.First, the dataset is duplicated, and the values are shuffled in each column; these are called shadow characteristics.Then, a classifier is trained on the dataset to extract the importance of each characteristic.Random Forest is one of the most widely used classifiers for this purpose [97].As a final step, a combined analysis was performed considering the individual results from all the PLS, Lasso and Boruta methods used, which has been termed the All-together method.

Evaluation of Wavelength Selection Methods
LDA and KNN were applied to classify irrigation techniques used in each olive tree.Of the total number of olive trees, 75% were used in the calibration phase (explained below) and 25% for prediction (Table 2).Before applying LDA or KNN classification, the dataset was divided into two smaller datasets that were used for calibration and prediction purposes.The calibration subset of data was used to estimate the parameters of the classifier model and the prediction subset of data was used to check the results of the model.Calibration and prediction were performed following an iterative process where subsets of data changed per iteration.Overall accuracy (OA) of LDA and KNN results were calculated by summing the number of correctly classified olive trees and dividing by the total number of trees.Moreover, the accuracy of each irrigation technique was evaluated [98].In addition, efficiency of methods applied were calculated in the prediction stage.According to Xia et al. [87], the efficiency of a wavelength selection method is based on the prediction rate and the number of variables, being calculated as follows: where E is the efficiency of the wavelength selection method evaluated, Ds is the OA obtained by a PLS method, Df is the OA obtained using all wavelengths registered by the UAV hyperspectral sensor, Nf is the total number of wavelengths and Ns is the number of wavelengths selected by the method used.If E is higher than or equal to 0.5, the selection method is highly efficient.On the other hand, if E ranges from −0.5 to 0.5 the method is shown to be efficient, except if E is equal to 0 and  −   is greater than or equal to 0.8, indicating the method to be highly efficient.Finally, if E is less than or equal to −0.5 the method is of low efficiency.

Spectral Reflectance Data
The mean spectral reflectance curves of olive trees, being irrigated with SDI or DI systems, were similar to the results obtained in related studies [107] (Figure 4).No differences were observed in the 400-550 and 880-947 nm ranges.However, different magnitudes of spectral reflectance were found in the 550-880 nm range.Therefore, within that range, there are 165 possible wavelengths to differentiate between both irrigation techniques.Given this high number, different wavelength selection methods, detailed above, were applied to identify those that best classify both types of irrigation.

Wavelength Selection Results
The results of the application of the methods used, including the number of the selected wavelengths as well as their wavelengths, are shown in Table 4. Depending on the method employed, the number of wavelengths varied considerably.The method that selected the least number of wavelengths was CARS, followed by LW-PLS, neither exceeding five wavelengths, less than 1% of the original wavelengths registered.A set of methods (RC-PLS, VIP-PLS and IPW-PLS) selected a range of 10 wavelengths, equal to 3.6% of the original wavelength number, and others selected 17-77 (6.2-28.1%)wavelengths (GA-PLS, BVE-PLS, SwPA-PLS, REP_PLS, FiPLS, Lasso and Boruta).Finally, the S-PLS and BiPLS methods were the ones that selected the largest number of wavelengths, 192 (70%) and 265 (96.7%), respectively.As such, the reduction in the number of wavelengths of interest will be higher or lower depending on the method used.The total number of times a wavelength was selected by the total number of methods simultaneously employed is shown in Figure 5.Although there was a difference in the spectrum of SDI and DI in the wavelengths of around 600-650 nm (Figure 4), the frequency of selection by the selection methods employed in this range was low (Figure 5).Of all the methods applied, 18 wavelengths were selected by almost half the methods (All-together method; Table 4).All within the infrared region and for a total of nine times, the wavelength 920 nm was selected the most to predict the irrigation method used in this study.The utility of the All-together method was demonstrated for investigating agricultural features using hyperspectral imagery.

Quality and Efficiency of Classification with Each Selection Method
The accuracy and efficiency results of the irrigation technique rating using LDA and KNN methods and their efficiency are shown in Table 5, while Figure 6 shows their efficiency.In general terms, LDA classifications showed better overall accuracy than KNN, in both calibration and prediction.Considering LDA, overall accuracy prediction (OAP) using all wavelengths was equal to 65%.Using only those selected wavelengths from a selection method, OAP improved, except in LW-PLS, the mean OPA being equal to 75%.The highest percentage of improvement was offered by GA-PLS, at 20%, while the lowest was offered by CARS, at 1%.In addition, ten of the methods evaluated improved the predictive quality of the irrigation technique by more than 10% (RC-PLS, GA-PLS, BVE-PLS, SwPA-PLS, IPW-PLS, BiPLS, S-PLS, Lasso, Boruta and All-together).On the other hand, the OAP applying KNN using all wavelengths was equal to 68.1% while just using the wavelengths selected by a PLS method generally resulted in a slightly lower OAP, with a mean value equal to 66.4%.Boruta was the method that showed the highest percentage of improvement, with an OAP of 74%.In total, seven of the methods offered an improvement in irrigation system prediction (GA-PLS, IPW-PLS, BiPLS, S-PLS, Lasso, Boruta and All-together).In addition, while LDA showed similar accuracy classifying both irrigation techniques, KNN offered worse results with the SDI technique.While LDA offered better results than KNN, all the methods used were highly efficient except for LW-PLS (Figure 6).GA-PLS and Lasso scored the highest with an efficiency equal to 17.7 and 17.8, respectively (Figure 6a).In contrast, only six of the sixteen methods were highly efficient using KNN, Boruta being the most efficient with a value equal to 5.3 (Figure 6b).In addition, the irrigation classification maps of some of the selection methods assessed are shown in Figure 7. From a visual analysis, no spatial correlation was detected in the errors obtained as well as the presence of a higher concentration of errors in the perimeter of each type of irrigation.the Boruta method has not been widely used for the selection of variables in remote sensing or other disciplines, results obtained in this study show that it could be considered as a promising method for the selection of hyperspectral wavelengths.On the other hand, LW-PLS showed low efficiency and lower OAP.This can be explained because this method greatly reduces wavelengths and therefore eliminated useful information as indicated in other works [87].The efficiency of RC-PLS, VIP-PLS, UVE-PLS, BVE-PLS, SwPA-PLS, REP-PLS, BiPLS, FiPLS, CARS-PLS and S-PLS varied depending on whether LDA or KNN was used.In general, LDA showed better classification results than KNN, which can be explained due to the fact that KNN needs to have large training data in order to achieve acceptable classification results, as indicated by Starzacher and Rinner (2008) [109].In general, these preliminary results show the need to analyze the relationships between wavelengths registered by a hyperspectral sensor and the object of study to optimize the number of wavelengths utilized.As a future line of research, the analysis of which combinations of wavelengths may be of interest to characterize irrigated areas in more detail is proposed.In addition, other features such as soil properties or olive cultivars along with other classifiers should be considered for improving the characterization of irrigation areas.

Conclusions
This study explored the use of UAV hyperspectral reflectance measurements of olive trees as a means for differentiating irrigation systems.Because of the high dimension and multicollinearity of the data, selection methods were found appropriate due to their capacity to extract useful wavelengths for analyzing categorial data, even when using individualized wavelengths.The results showed how the spectral response of olive trees is sensitive to the irrigation technique used, allowing improved information for the mapping of irrigated areas.Overall accuracy in the classification of irrigation systems in olives trees using LDA and KNN ranged from 54.8% and 85.2%.These variations showed the need to select the appropriate wavelength selection method.In addition, LDA offered more accurate results than KNN.In our study, GA-PLS, RC-PLS, Lasso, FiPLS, Boruta and Alltogether showed an overall accuracy of 75% or higher.They were all highly efficient methods and resulted in an improved classification.
The study has shown how the use of hyperspectral UAV data allows the irrigated areas of olive groves to be characterized in greater detail.This will generate information of interest for decisionmaking processes in the context of water use policies, enabling better understanding of irrigated olive groves and improving the management of water resources.

Figure 1 .
Figure 1.Flowchart used for the classification of irrigation techniques.

Figure 3 .
Figure 3. Partial view of (a) RGB and (b) hyperspectral orthomosaic of the study area.

Figure 4 .
Figure 4. Average raw spectra reflectance curves of olive canopies irrigated with SDI or DI.

Figure 5 .
Figure 5. Number of times a wavelength has been selected when performing the All-together method.

Figure 6 .
Figure 6.Efficiency of PLS method using (a) Linear Discriminant Analysis and (b) K-Nearest Neighbor.

Table 1 .
Evaluated wavelength selection PLS methods.

Table 2 .
Number of samples per irrigation technique for calibration and prediction set.

Table 3 .
R packages used for wavelength selection methods.

Table 4 .
Wavelengths selected by different methods.

Table 5 .
Overall accuracy (OA), Accuracy of Sub-Surface drip irrigation (A SDI), Accuracy of Surface drip irrigation (A DI) and Efficiency (E) results of Linear Discriminant Analysis (LDA) and K-Nearest Neighbors (KNN) using different selection methods.