Next Article in Journal
A Multipurpose Wearable Sensor-Based System for Weight Training
Next Article in Special Issue
Reinforcement Learning for Collaborative Robots Pick-and-Place Applications: A Case Study
Previous Article in Journal
Optimal Control Systems Using Evolutionary Algorithm-Control Input Range Estimation
Previous Article in Special Issue
Design and Implementation of a Robotic Arm Assistant with Voice Interaction Using Machine Vision
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predictive Performance of Mobile Vis–NIR Spectroscopy for Mapping Key Fertility Attributes in Tropical Soils through Local Models Using PLS and ANN

by
Mateus Tonini Eitelwein
1,
Tiago Rodrigues Tavares
2,
José Paulo Molin
3,*,
Rodrigo Gonçalves Trevisan
1,
Rafael Vieira de Sousa
4 and
José Alexandre Melo Demattê
5
1
Smart Agri Technological Solutions, Av. Limeira, 1131, 3rd Office, Piracicaba 13414-018, Brazil
2
Laboratory of Nuclear Instrumentation (LIN), Center for Nuclear Energy in Agriculture (CENA), University of São Paulo (USP), Piracicaba 13416-000, Brazil
3
Laboratory of Precision Agriculture (LAP), Department of Biosystems Engineering, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba 13418-900, Brazil
4
Robotics and Automation Group, Department of Biosystems Engineering, Faculty of Animal Science and Food Engineering, University of São Paulo (USP), Pirassununga 13635-900, Brazil
5
Geotechnologies in Soil Science Group (GeoCis), Soil Science Department, Luiz de Queiroz College of Agriculture (ESALQ), University of São Paulo (USP), Piracicaba 13418-900, Brazil
*
Author to whom correspondence should be addressed.
Automation 2022, 3(1), 116-131; https://doi.org/10.3390/automation3010006
Submission received: 29 December 2021 / Revised: 24 January 2022 / Accepted: 27 January 2022 / Published: 3 February 2022
(This article belongs to the Collection Smart Robotics for Automation)

Abstract

:
Mapping soil fertility attributes at fine spatial resolution is crucial for site-specific management in precision agriculture. This paper evaluated the performance of mobile measurements using visible and near-infrared spectroscopy (vis–NIR) to predict and map key fertility attributes in tropical soils through local data modeling with partial least squares regression (PLS) and artificial neural network (ANN). Models were calibrated and tested in a calibration area (18-ha) with high spatial variability of soil attributes and then extrapolated in the entire field (138-ha). The models calibrated with ANN obtained superior performance for all attributes. Although ANN models obtained satisfactory predictions in the calibration area (ratio of performance to interquartile range (RPIQ) ≥ 1.7) for clay, organic matter (OM), cation exchange capacity (CEC), base saturation (V), and exchangeable (ex-) Ca, it was not repeated for some of them when extrapolated into the entire field. In conclusion, robust mappings (RPIQ = 2.49) were obtained for clay and OM, indicating that these attributes can be successfully mapped on tropical soils using mobile vis–NIR spectroscopy and local calibrations using ANN. This study highlights the need to implement an independent test to assess reliability and extrapolability of previously calibrated models, even when extrapolating the models to neighboring areas.

1. Introduction

Site-specific management of soil fertility using precision agriculture (PA) approaches allows for the optimization of farm inputs, which increases profitability and sustainability of agricultural systems [1]. Soil fertility management based on mapping of its attributes is a common PA practice used all over the world [2,3]. This approach is especially important in Brazilian tropical soils that are naturally of low fertility and require the application of large amounts of fertilizer annually [4], which makes the country the fourth largest consumer of this input in the world [5]. About 15.3% of Brazilian soybean and corn farmers use this approach, totaling approximately 9 million hectares of mapped soils using the traditional technique that is based on a low spatial resolution sampling grid (e.g., 0.2−0.5 sample ha−1), traditional laboratory analyses, and geostatistics for data interpolation [6]. The low spatial resolution used in this traditional approach leads to unreliable mapping; it is a consensus that sampling grids larger than 100 × 100 m (<1 sample ha−1) are inefficient in characterizing most soil fertility attributes [7,8].
For an accurate mapping of soil fertility attributes, a high spatial density of soil data acquisition is required (e.g., >1 sample ha−1) [4,7,8]. However, this is not feasible using the traditional mapping technique due to costs related to laboratory analyses and time required for sample collection [9,10]. Alternatively, mobile platforms instrumented with proximal soil sensors acquire digital data about soil properties at fine scale through on-line measurements, i.e., with data acquisition performed on-the-go and at high frequency (e.g., one reading per second) [11,12]. This approach massively increases the spatial density of data acquisition (e.g., >50 data points ha−1) [4]. Thereby, it allows one to describe the spatial continuity of soil attributes through geostatistical approaches improving the accuracy of mapping [13], especially for fertility attributes that present abrupt spatial changes (e.g., <100 m) [4,7]. In addition, this approach opens the possibility of performing automatic data collection, as well as diagnoses and interventions in real time [14].
In this context, visible and near-infrared spectroscopy (vis–NIR) is a technique that has attracted the attention of soil and biosystems engineering scientists for instrumentation of mobile platforms since it is compatible with on-line measurements [11]. Vis–NIR spectra provide information related primarily to soil mineral constituents (e.g., clay minerals), organic compounds (e.g., organic matter (OM) content), and water content [15], which are designated as primary soil properties [16]. In the spectrum, this information is represented by its intensity, shape, and absorptions (spectral features) at specific wavelengths [17]. The vis–NIR technique also allows estimation of soil properties with indirect relationships with the spectrum, such as pH, cation exchange capacity (CEC), base saturation (V), and exchangeable (ex-) nutrients; these attributes are designated as secondary soil properties [16]. This is possible when there is covariation of primary with secondary soil attributes [18]. Although there is vast literature showing the potential of on-line measurements using vis–NIR sensor for soil mapping in regions of temperate climate [14,16,19,20,21,22,23], tropical soils present different features, and such knowledge may not always be extrapolated [24]. The high temperature and moisture of the tropics promotes marked changes in mineralogical and biological characteristics of tropical soils when compared with soils of temperate regions, e.g., temperate soils usually feature more complex mineralogy than tropical soils and, in turn, biological activity increases notably in tropical soils compared to temperate ones [25]. These differences make it necessary to conduct studies to adapt and develop protocols for the use of mobile vis–NIR spectroscopy in the tropics. In addition, to the best of our knowledge, only one paper has explored this technique in Brazilian tropical soils [26].
The selection of an optimal data modeling method for predicting soil fertility attributes is a fundamental step toward accurate mapping of agricultural fields using mobile vis–NIR spectroscopy. This selection should take into account the predictive performance of the model, avoid procedures with excessive complexity, and ensure extrapolation at different spatial scales and environments [27,28]. Partial least-squares regression (PLS) model is a widely used method for its simplicity and robustness to deal with strongly collinear and noisy X-variables, a common characteristic of vis–NIR spectra [19,29,30]. PLS is a useful approach to model linear relationships between spectral variables and soil attributes [28]. However, modeling more complex and non-linear relationships may perform poorly using PLS [31]. Models involving artificial intelligence (AI) techniques, such as artificial neural network (ANN) models, may present superior performances to PLS for modeling soil attributes that are not directly related to vis–NIR spectra [28,29].
Numerous studies have shown the performance of different predictive models for digital soil mapping using mobile vis–NIR spectroscopy in soils of temperate regions [14,16,28,31,32,33]. Conversely, an accurate and reliable method for the estimation of soil fertility in Brazilian tropical soils has not been implemented to date and has been little explored in the literature [26]. This research aims to fill this gap, advancing the use of mobile vis–NIR spectroscopy for assessing key fertility attributes in tropical soils. The objectives of this study were (i) to investigate the feasibility of using mobile vis–NIR spectroscopy to predict and map key soil fertility attributes in a Brazilian agricultural field, and (ii) to compare prediction accuracy of PLS and ANN models for this purpose.

2. Materials and Methods

2.1. Study Area and Soil Sampling

The study area consists of a 138 ha field in the municipality of Campo Novo do Parecis, Mato Grosso State, Brazil (central coordinates: 14.102° S and 57.764° W, using WGS-84 datum) (Figure 1A,B), a region of tropical climate with wet summers and dry winters [34]. Before conducting the present study, this agricultural field was under a yearly crop rotation system with cotton and some grain crop (corn or soybean). The soil type is a Ferralsol [35], with texture varying between sandy loam and sand clay loam. The topography is rather flat with an elevation that oscillates between 607 and 630 m.
By using a SWIR-2 image from the satellite Sentinel 2-A (Band 12 which is centered at 2190 nm and has 20 m spatial resolution), an 18 ha area with high soil spatial variability was selected (Figure 1B) and designated as the calibration area. Prior to collecting the soil samples, the calibration area and the entire 138 ha field were scanned with the vis–NIR sensor in two separate on-line data acquisitions, respectively designated as spectral data acquisition 1 (Figure 1D) and 2 (Figure 1E). Spectral data were georeferenced using a global navigation satellite system (GNSS) receiver (StarFire, NavCom/John Deere, Torrance, CA, USA) with SF1 correction signal (NavCom/John Deere, Torrance, CA, USA), which provides an accuracy of about 0.5 m on the coordinates. After spectral data acquisition 1, 347 soil samples were collected in the calibration area, and after spectral data acquisition 2, 10 samples were collected in the entire field (designated in this study as verification points). Soil samples were collected in the furrows left by the platform shank (Figure 1C), using as reference the spectral data coordinates. In the calibration area, the average distance between each collected soil sample was 22 m in the row and 20 m between rows. The samples collected in the 138 ha field were positioned to cover most of the soil spatial variability observed in the spectral data. Both spectral data acquisitions were performed at the end of March (when the dry season starts) on consecutive days, without the occurrence of precipitation between them, such that soil moisture was low and with similar contents (about 5% g−1).
In summary, first, the spectral data acquisition 1 was performed on the 18 ha area (designated as calibration area) followed by the collection of 347 soil in this same area (Figure 1D). This first dataset (spectral data acquisition 1 + soil analysis of the 347 soil samples) was used for the calibration and testing of the predictive models (as detailed in Section 2.4). Afterward, the spectral data acquisition 2 was performed on the entire 138 ha field (including a resampling of the calibration area), followed by the collection of the 10 soil samples (Figure 1E). This second dataset was used for independent testing of the previously calibrated models (as detailed in Section 2.5), the data from the first dataset were kept out of this new testing.

2.2. Mobile Platform and On-Line Vis–NIR Data Acquisition

On-line data acquisition was performed using the commercial mobile platform Veris® VisNIR (Veris Technologies, Salina, EUA) (Figure 2). It consists of a metal box with the vis–NIR system aligned with a sapphire window in its lower portion, attached to a subsoiler shank by a parallelogram mechanism to provide horizontality to the window (Figure 2B). The shank penetrates the soil, making a trench, where the optical unit acquire soil spectra from the smooth bottom of this trench. Fiber optic cables transmit the energy reflected from the soil to two spectrometers: an Ocean optics USB4000 (Ocean Optics Inc., Largo, FL, USA) and a Hamamatsu TG-cooled NIR-II (Hamamatsu Photonics K. K., Hamamatsu, Japan). The first one makes measurements from 373 to 1011 nm and the second one from 1170 and 2222 nm, both with spectral resolution around 5 nm. Spectral data are georeferenced using the GNSS system. The entire system was mounted onto the three-point linkage of a tractor.
Before data acquisition, the intensity of the light source was checked using four references materials with known spectral behavior. The system also self-calibrates periodically at each 10 min interval using a shutter system present inside the shank, which operates automatically. This periodic calibration is performed collecting a dark reference and a known internal reference material. The frequency of data acquisition was 1 Hz and each recorded spectrum corresponds to the average of readings collected during this interval. On-line measurements were carried out in transects relatively parallel, separated with about 20 m, at an average speed of 5 m s−1, resulting in a spatial density of around 100 spectra ha−1.
The acquired spectra presented noise in the intervals between 343–426, 989–1070 and 2153–2222 nm, which were removed for the subsequent analysis. The spectral behavior was descriptively analyzed to characterize its intensity, shape, and absorption features [36].

2.3. Laboratory Reference Analyses

Soil testing conducted in a commercial laboratory determined the contents of clay, OM, CEC, pH, V, ex-Ca, ex-Mg, ex-K, and ex-P, following the methodologies described by Van Raij [37]. Clay was quantified by the Bouyoucos hydrometer method. OM was determined via oxidation with potassium dichromate solution, and pH was determined via calcium chloride solution. Exchangeable nutrients were determined via ion exchange resin extraction. CEC was calculated by sum of the soil potential acidity and the bases ex-Ca, ex-Mg, and ex-K; for this, soil potential acidity was quantified via the buffer solution method (SMP). The results of these analyses were used as Y-variables for calibration and test of the models described in the following section.

2.4. Predictive Modeling Using Spectral Data Acquisition 1

The relationship between spectra (obtained from spectral data acquisition 1) and soil attributes measured in the 347 soil samples collected from the calibration area were established using partial least squares regression (PLS) and artificial neural network (ANN). The spectral data acquisition 2 and the 10 soil samples collected in the entire field were kept out of this analysis. The raw spectra, after removing intervals with noise, were used as X-variables.
The calibration and test of models were performed after subdividing dataset into two subsets of 85% (calibration/training set, n = 295) and 15% (test set, n = 52) using the Kennard–Stone algorithm performed on the measured fertility attributes. The Kennard–Stone algorithm is based on a random sample selection in which spectra from the original dataset are randomly assigned for training and testing. The algorithm uses Euclidean distance calculations to select the sample with maximum distance to all other samples, such that samples that are as far away from the selected samples as possible are selected, until the selected number of samples is reached. In this way, it is possible to avoid the presence of bias introduced manually or by a completely random data splitting [38]. For PLS models, the number of latent variables chosen was defined based on the maximum coefficient of determination (R2) and lowest root-mean-square error (RMSE) obtained with leave-one-out cross-validation. The PLS method combines the useful spectral data into the first several latent variables which are then used for predictive modeling [39]; in other words, latent variables originate from a linear combination of the original variables. PLS models and dataset division were implemented using the Unscrambler software, version 10.5.1 (Camo AS, Oslo, Norway).
The ANN-based models were established with the Neural Network Toolbox in MATLAB R2016b (Mathworks Inc., Portola Valley, CA, USA) according to the fitting methodology. The ANN-based models have a Perceptron feedforward and multi-layered architecture, with a sigmoid transfer function in the hidden layer and a linear transfer function in the output layer. Different ANN architectures were run to fine-tune the hyperparameters (the number of layers, number of neurons, learning rate and momentum) with the supervised learning approach [33]. After fine tuning, performed by changing the hyperparameters in the sequence in which they were presented, the following settings were obtained: number of layers = 1 (simulation range: 1 to 5, step 1), number of neurons = 30 (simulation range 5 to 50, step 5), learning rate = 0.5 (simulation range: 0.1 to 0.7, step 0.1) and moment = 0.1 (simulation range: 0.1 at 0.7, step 0.1).
In order to evaluate the models prediction efficiency, R2, RMSE, relative error (RMSE %), and the ratio of performance to interquartile range (RPIQ) [40] were evaluated. The following groups were used for RPIQ interpretation, as proposed by Nawar and Mouazen [32]: excellent models (RPIQ ≥ 2.5), very good models (2.5 > RPIQ ≥ 2.0), good models (2.0 > RPIQ ≥ 1.7), reasonable models (1.7 > RPIQ ≥ 1.4), and very poor models (RPIQ < 1.4). The best performing models were extrapolated to the data obtained from the on-line survey conducted on the entire area (spectral data acquisition 2) in order to map the 138 ha field. Their performances were again verified in terms of RMSE and RPIQ, using the 10 soil samples collected in the total area (as detailed below in the Section 2.5).

2.5. Model Test Using the Spectral Data Acquisition 2

In addition to the abovementioned test using 15% of the samples collected in the calibration area, an independent test using the spectral data acquired in the on-line survey conducted in the entire field was performed. The purpose of this independent test was to verify the reliability and the extrapolability of the calibrated models when applied in spectral data that were not used for model calibration. Only the best performing models were applied in the spectral data, and the 10 soil samples (verification points) were used to attest their predictive accuracy through their performance of RMSE and RPIQ.

3. Results

3.1. Laboratory Measured Soil Properties

Figure 3 shows the box plot and coefficient of variation (CV) of soil attributes for the calibration and test datasets. All attributes have a comparable range and CV for the sets used to calibrate and validate the models, which were guaranteed by splitting the data using the Kennard–Stone algorithm. It is a desired feature to ensure that the observed prediction quality is related to sensor performance and not over- or underestimated due to diverging characteristics in the dataset [12,41].
The correlation matrix of the different soil attributes is important to interpret possible indirect determinations, i.e., predictions of attributes that are not directly related to vis–NIR spectra (e.g., CEC, V, pH, and exchangeable nutrients) [18]. In this sense, significant correlations between CEC, V, ex-Ca, ex-Mg, and ex-P with OM, as well as, between CEC, ex-Mg, ex-K, and ex-P with clay can be highlighted (Table 1).
The spatial statistics presented in Table 1 show that all fertility attributes presented spatial dependence ranging from moderate to strong (38.90 ≥ SDD ≥ 8.40), and therefore amenable to being modeled and mapped [13]. The ex-P is the only exception, since it showed a pure nugget effect. It was also observed that the range of spatial dependence of clay, OM, and CEC reached higher values (≥100 m), while pH, V, ex-Ca, ex-Mg, and ex-K showed more abrupt variations (range ≤ 56 m).

3.2. Descriptive Analysis of Vis–NIR Spectra

Changes in soil physical, chemical, and mineralogical properties can influence the behavior of vis–NIR spectra, with reflections on (i) their overall intensity (albedo), (ii) the intensity and amplitude of absorption features, and (iii) the shape of the spectrum [36]. The main spectral features observed in this study (Figure 4), are: alterations of albedo; presence of iron oxide/hydroxide (Fe]-OH) features at 425, 650 and 903 nm; and hydroxyl group (O-H) features at 1400 and 1900 nm, which are related to the structure of water + 1:1 and water + 2:1 minerals, respectively [18].
In Figure 4, it is also possible to observe the similarity of the mean and standard deviation of the spectra obtained in both the calibration area and the entire field. This behavior emphasizes the representativeness of the calibration area to cover the spectral variation of the study area as a whole.

3.3. Predictive Performance of Mobile Vis–NIR Spectroscopy in the Calibration Area

The predictive models for clay, OM, CEC, V, and ex-Ca showed satisfactory performance in their test using both ANN and PLS models, with RPIQ oscillating between 1.4 and 2.4 for PLS, and varying between 1.7 and 2.7 with ANN (Table 2). The ANN prediction accuracy in the test set was higher than the PLS for all attributes predicted with satisfactory performance (clay, OM, CEC, V, and ex-Ca) (Table 2). The RMSE values reduced 8.1%, 36.4%, 2.4%, 19.1% and 1.2% for the prediction of clay, OM, CEC, V, and ex-Ca, respectively, when using ANN instead of PLS models (Table 2). In addition, when using ANN models, predictive performances with RPIQ ≥ 2.0 (qualified as very good and excellent) were obtained for clay, OM, V, and ex-Ca. In turn, when using PLS, predictive performances with RPIQ ≥ 2.0 were obtained just for clay and V. The prediction of pH, ex-Mg, ex-K, and ex-P showed poor performance (RPIQ ≤ 1.2) in the test of all evaluated models.

3.4. Prediction Performance of the Independent Test (Using the Spectral Data Acquisition 2)

The predictive models for clay, OM, CEC, V, and ex-Ca that performed satisfactorily in the calibration area (Table 2) were transferred to the spectra of the second on-line data acquisition, and their performances are shown in Figure 5. The results showed that only the clay and the OM prediction models were reliable (with RPIQ = 2.49 for both attributes) when extrapolated to the entire field. Predictions of CEC, V, and ex-Ca showed very poor performance, with RPIQ ≤ 1.08, and a scatter plot of measured versus predicted values with points completely outside the 1:1 reference line (Figure 5E,G,I).
The range and dispersion of clay, OM, CEC, V, and ex-Ca attributes of the calibration and independent test datasets are shown in Figure A1. Although clay has a higher upper limit of the independent test dataset than that of the calibration set (which is not recommended in predictive analyses), the relation between spectra and clay did not change in this extrapolated concentration range (i.e., between 352 and 482 mg dm−3) such that the calibrated model maintained its good performance. Conversely, although the CEC and ex-Ca attributes presented ranges compatible with those observed in the calibration set, their predictive models did not perform well, indicating that the spectra changed the relationship with these attributes when considering the entire field.

4. Discussion

In geostatistics, the range of an attribute configures the distance in which samples present spatial autocorrelation. It is a consensus that for a reliable mapping of soil attributes it is necessary that the sampling grid should have a distance between samples of at least half the range of the target attribute [13]. In other words, sampling in grids with a distance superior to the range of the target attribute is not sufficient for its spatial characterization, producing unreliable maps. Thus, considering the range of fertility attributes presented in Table 1 (35 ≤ range ≤ 176), the spatial resolution performed by the mobile vis–NIR platform (20 × 5 m, resulting in 100 data ha−1) was sufficient for mapping of all the attributes of interest. The only exception was the ex-P, which was not possible to obtain its range, since in the geostatistical analysis using 20 samples ha−1 it showed pure nugget effect. Although ex-P mappings have been successfully performed on soils in temperate regions with on-line surveys using vis–NIR platforms (R2 = 0.60) [19], tropical soils show strong adsorption of this nutrient by Fe oxides [25], which confers abrupt spatial variations to this nutrient due to its low mobility [4].
In the calibration area, predictive models with satisfactory performance (RPIQ ≥ 1.7) were obtained for clay (R2 = 0.77), OM (R2 = 0.57), CEC (R2 = 0.55), V (R2 = 0.65), and ex-Ca (R2 = 0.69). Accurate predictions (0.73 ≤ R2 ≤ 0.90) of clay and OM using mobile vis–NIR spectroscopy are commonly reported in the literature for soils from temperate regions [21,28,33]. Predictions of textural contents via vis–NIR spectra are justified by two factors: (i) the relationship between the albedo and the sample grain size, and (ii) the presence of clay mineral features, such as those observed at 1400 and 1900 nm [15,43]. In turn, OM is related to smoothing of features and modifications to the shape of the spectrum, making it more straight and concave mainly in the visible region [36,44]. Predictions of secondary attributes through vis–NIR data are less common [18], but have also been reported in several other studies in temperate soils, such as by Munnaf et al. [16] for pH, ex-Na, ex-Ca, ex-Mg, ex-K (0.58 ≤ R2 ≤ 0.81), by Mouazen and Kuang [19] for ex-P (R2 = 0.60), and by Ulusoy et al. [45] for CEC (0.70 ≤ R2 ≤ 0.75). Although these are good results, in tropical Brazilian soils, the only study found that used on-line vis–NIR spectroscopy obtained reasonable and poor performances for the prediction of key soil fertility attributes (e.g., CEC, V, pH, soil potential acidity, ex-Ca, and ex-Mg), with R2 oscillating between −0.01 and 0.40, with the pH model as the one that presented the best performance (R2 of 0.40 and RPIQ of 2.30) [26]. In general, predictions of secondary attributes, mainly macronutrients, are still a challenge worldwide [4]. The automation of soil mapping using mobile platforms or robots with multi-sensor systems (e.g., combining vis–NIR and X-ray fluorescence sensors) is an alternative to increase the number of soil fertility attributes predicted satisfactorily, since the fusion of data from these sensors has shown promising results for the prediction of key soil fertility attributes [46,47].
Our results showed a better prediction performance for models calibrated with ANN compared to PLS. For example, in the calibration area, predictive models for clay and OM showed, respectively, RMSE values (of the test set) 8.7 and 57.3% higher when using PLS instead of ANN models. Better predictive performances for ANN models compared to PLS have been reported by Kuang et al. [33], which showed RMSE reduction ranging from 2.6 to 5.2% for clay, organic carbon (OC), and pH prediction. Slightly higher predictive performances for ANN models compared to PLS have also been reported by Fariftech et al. [29] for modeling soil spectral data. Alterations in the vis–NIR spectrum caused by changes in soil properties are often subtle and do not always present a linear pattern, such that the predictive performance of predictive models changes depending on the ability of the algorithm to extract useful information [48,49]. Vis–NIR spectra obtained directly in the field suffer interference from external factors, such as variations in soil moisture and structure, these factors contribute to making the relationship between spectra and soil attributes more complex, which may justify the superior performance of non-linear ANN models. Non-linear models are being proposed in the literature for the development of predictive models using spectral data, assuming that they are solid alternatives to exploring hidden and non-linear spectral information [22,50]. In this study, we observed that although it is possible to obtain satisfactory predictions for some attributes using PLS, the optimal approach was modeling via ANN.
The inference about secondary attributes has an indirect nature [10]; hence, some care should be taken to not overstate the extrapolability of predictive models [51]. In our study, prediction models for CEC, V, and ex-Ca showed satisfactory performance in the calibration area but performed poorly when used in the entire field, indicating that the spectra changed the relationship with these attributes when considering the entire field. Conversely, the clay and OM models repeated the good performances observed in the calibration area when evaluated in the entire field. This behavior emphasizes the potential of using mobile vis–NIR spectroscopy for clay and OM mapping at a fine scale in Brazilian tropical soils. Conversely, secondary attribute predictions, although possible, need care for the extrapolation of their models. It also highlights the need to implement verification points to assess reliability and extrapolability of previously calibrated models, even for application in areas neighboring the site where the models were calibrated. Kuang et al. [33] observed a significant reduction in the performance of predictive models for clay, OC, and pH (with R2 values reducing from 0.71–0.86 to 0.37–0.55) when applying an independent testing (i.e., without using local spectra in the model calibration) using vis–NIR data collected on-line in two Danish agricultural fields. Despite this, the authors still obtained satisfactory performance (with residual prediction deviation (RPD) ≥ 1.4) for clay prediction in both fields and for OC prediction in one of the fields. The predictive performances observed in the present study for the independent testing of clay and OM models were superior to those reported by Kuang et al. [33], which may be explained by the fact that our calibration was performed in a neighboring area, whereas the other authors used data from a spectral library for calibration.
Simplifying the calibration process of predictive modeling using mobile vis–NIR sensors is key to automating and expanding the mapping of soil attributes [52]. Spectra acquired directly in the field may be affected by external factors, such as moisture and granulometric variations, making it necessary in some cases to apply methods to mitigate these factors in order to obtain accurate predictive models [53,54]. However, the use of these methods makes the calibration process more complex. In our study, excellent predictive models were obtained for clay and OM (in the independent test, Figure 5) only with local calibrations using ANN and without the application of methods for mitigating external factors. It is possible that this performance was obtained due to low soil moisture content during data collection (about 5% g−1), which is a common feature in Brazilian tropical soils during the dry season from April to September. This hypothesis needs to be verified in future studies and, if proven, the use of methods to mitigate external factors may be neglected in on-line spectral acquisitions performed in relatively dry soils. Finally, it is also suggested that further studies explore new generalist calibration methods (e.g., using regional and national spectral libraries and spiking techniques [32]) that allow calibration of predictive models compatible with larger scale applications (e.g., agricultural areas with spectrally compatible soils).

5. Conclusions

Mobile visible and near-infrared spectroscopy (vis–NIR) allowed for the acquisition of data with adequate spatial resolution (i.e., compatible with the range of spatial variation) for mapping eight out of the nine key soil fertility attributes assessed, these being clay, organic matter (OM), cation exchange capacity (CEC), pH, base saturation (V), and the exchangeable (ex-) nutrients ex-Ca, ex-Mg, and ex-K. However, predictive models with satisfactory performance (ratio of performance to interquartile range (RPIQ) ≥ 1.4) were obtained only for clay, OM, CEC, V, and ex-Ca, in the calibration area (18 ha area), and only for clay and OM (RPIQ = 2.49, for both) when performing an independent test, i.e., extrapolating these models to the entire field (138 ha field).
Artificial neural network (ANN) models showed superior performance versus partial least squares regression (PLS) models for all evaluated attributes. Local models calibrated with ANN combined with on-line survey conducted during the Brazilian dry season showed robust predictions (reliable and replicable) for clay and OM, even without using methods to mitigate external factors (e.g., soil moisture).
This study shows the potential of using mobile vis–NIR spectroscopy for mapping clay and OM in Brazilian agricultural fields, indicating the need to use an independent test to assess the performance of local models previously calibrated. Further research is needed to explore different datasets acquired from a larger number of fields with different soil types, textural characteristics, and agricultural practices. Nevertheless, it confirms that mobile vis–NIR systems operating directly in the field can provide successful mappings considering the element in analysis.

Author Contributions

Conceptualization, M.T.E. and J.P.M.; Methodology, M.T.E., T.R.T., R.V.d.S. and J.P.M.; Validation, T.R.T., R.G.T., J.A.M.D. and J.P.M.; Formal analysis, T.R.T., M.T.E. and R.G.T.; Investigation, T.R.T., M.T.E. and J.P.M.; Resources, M.T.E. and J.P.M.; Data curation, T.R.T., R.V.d.S. and R.G.T.; Writing—original draft preparation, T.R.T.; Writing—review and editing, M.T.E., R.V.d.S., R.G.T., J.A.M.D. and J.P.M.; Visualization, T.R.T., M.T.E. and J.P.M.; Supervision, J.A.M.D. and J.P.M.; Project administration, M.T.E. and J.P.M.; Funding acquisition, M.T.E. and J.P.M. All authors have read and agreed to the published version of the manuscript.

Funding

T.R.T. and M.T.E. were funded by the São Paulo Research Foundation (FAPESP), grant number 2020/16670-9 and 2014/10737-3, respectively. Soil fertility tests and field activities were funded by CNPq—“Edital de Chamada Universal”, grant number 458180/2014-9. The Funding Authority for Studies and Projects (FINEP) for the support to PROSENSAP project that allowed the acquisition of the mobile vis–NIR platform.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We thank the company Terra Santa Agro S.A. for ceding the study area and providing full support in the field data survey.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Prediction results of the independent test set obtained from artificial neural network (ANN) models calibrated using mobile visible and near-infrared spectroscopy (vis–NIR).
Table A1. Prediction results of the independent test set obtained from artificial neural network (ANN) models calibrated using mobile visible and near-infrared spectroscopy (vis–NIR).
ClayOM 1CEC 2V 3ex-Ca 4
R20.830.680.110.000.00
RMSE41.043.2115.9422.5212.83
RMSE %14.7013.9732.1434.3358.87
RPIQ 52.492.491.080.390.88
1 Organic matter; 2 cation exchange capacity; 3 base saturation; 4 exchangeable Ca; 5 ratio of performance to interquartile range. The root-mean-square error (RMSE) was given in g dm−3 for clay and OM; in mmolc dm−3 for CEC and ex-Ca; and in % for V.
Figure A1. Box plots presenting the variation for the contents of clay, organic matter (OM), cation exchange capacity (CEC), base saturation (V), and extractable (ex-) Ca for the calibration (Cal), test, and independent test (IT) datasets. The coefficient of variation (CV) is also presented above each box plot.
Figure A1. Box plots presenting the variation for the contents of clay, organic matter (OM), cation exchange capacity (CEC), base saturation (V), and extractable (ex-) Ca for the calibration (Cal), test, and independent test (IT) datasets. The coefficient of variation (CV) is also presented above each box plot.
Automation 03 00006 g0a1

References

  1. Gebbers, R.; Adamchuk, V.I. Precision agriculture and food security. Science 2010, 327, 828–831. [Google Scholar] [CrossRef] [PubMed]
  2. Nawar, S.; Corstanje, R.; Halcro, G.; Mulla, D.; Mouazen, A.M. Delineation of Soil Management Zones for Variable-Rate Fertilization. Adv. Agron. 2017, 143, 175–245. [Google Scholar] [CrossRef]
  3. Viscarra Rossel, R.A.; Bouma, J. Soil sensing: A new paradigm for agriculture. Agric. Syst. 2016, 148, 71–74. [Google Scholar] [CrossRef]
  4. Molin, J.P.; Tavares, T.R. Sensor sytems for mapping soil fertility attributes: Challenges, advances, and perspectives in Brazilian tropical soils. Engenharia Agrícola 2019, 39, 126–147. [Google Scholar] [CrossRef] [Green Version]
  5. FAO. World Fertilizer Trends and Outlook to 2020; Food and Agriculture Organization of the United Nations (FAO): Rome, Italy, 2017. [Google Scholar]
  6. Molin, J.P. Agricultura de Precisão: Números do Mercado Brasileiro. Available online: https://www.agriculturadeprecisao.org.br/boletim-tecnico-03-agricultura-de-precisao-numeros-do-mercado-brasileiro/ (accessed on 22 December 2021).
  7. Nanni, M.R.; Povh, F.P.; Demattê, J.A.M.; de Oliveira, R.B.; Chicati, M.L.; Cezar, E. Optimum size in grid soil sampling for variable rate application in site-specific management. Sci. Agric. 2011, 68, 386–392. [Google Scholar] [CrossRef] [Green Version]
  8. Cherubin, M.R.; Santi, A.L.; Eitelwein, M.T.; Amado, T.J.C.; Simon, D.H.; Damian, J.M. Dimensão da malha amostral para caracterização da variabilidade espacial de fósforo e potássio em Latossolo Vermelho. Pesquisa Agropecuária Brasileira 2015, 50, 168–177. [Google Scholar] [CrossRef] [Green Version]
  9. Demattê, J.A.M.; Alves, M.R.; Gallo, B.C.; Fongaro, C.T.; E Souza, A.B.; Romero, D.J.; Sato, M.V. Hyperspectral remote sensing as an alternative to estimate soil attributes. Revista Ciência Agronômica 2015, 46, 223–232. [Google Scholar] [CrossRef]
  10. Viscarra Rossel, R.A.; Adamchuk, V.I.; Sudduth, K.A.; McKenzie, N.J.; Lobsey, C. Proximal Soil Sensing: An Effective Approach for Soil Measurements in Space and Time. Adv. Agron. 2011, 113, 243–291. [Google Scholar] [CrossRef]
  11. Kuang, B.; Mahmood, H.S.; Quraishi, M.Z.; Hoogmoed, W.B.; Mouazen, A.M.; van Henten, E.J. Sensing Soil Properties in the Laboratory, In Situ, and On-Line. Adv. Agron. 2012, 114, 155–223. [Google Scholar] [CrossRef]
  12. Adamchuk, V.; Hummel, J.; Morgan, M.; Upadhyaya, S. On-the-go soil sensors for precision agriculture. Comput. Electron. Agric. 2004, 44, 71–91. [Google Scholar] [CrossRef] [Green Version]
  13. Cambardella, C.A.; Moorman, T.B.; Novak, J.M.; Parkin, T.B.; Karlen, D.L.; Turco, R.F.; Konopka, A.E. Field-Scale Variability of Soil Properties in Central Iowa Soils. Soil Sci. Soc. Am. J. 1994, 58, 1501–1511. [Google Scholar] [CrossRef]
  14. Mouazen, A.M.; Maleki, M.R.; De Baerdemaeker, J.; Ramon, H. On-line measurement of some selected soil properties using a VIS–NIR sensor. Soil Tillage Res. 2007, 93, 13–27. [Google Scholar] [CrossRef]
  15. Ben-Dor, E. Quantitative remote sensing of soil properties. Adv. Agron. 2002, 75, 173–243. [Google Scholar] [CrossRef]
  16. Abdul Munnaf, M.; Nawar, S.; Mouazen, A.M. Estimation of Secondary Soil Properties by Fusion of Laboratory and On-Line Measured Vis–NIR Spectra. Remote Sens. 2019, 11, 2819. [Google Scholar] [CrossRef] [Green Version]
  17. Demattê, J.A.M. Characterization and discrimination of soils by their reflected electromagnetic energy. Pesquisa Agropecuária Brasileira 2002, 37, 1445–1458. [Google Scholar] [CrossRef] [Green Version]
  18. Stenberg, B.; Viscarra Rossel, R.A.; Mouazen, A.M.; Wetterlind, J. Visible and Near Infrared Spectroscopy in Soil Science. Adv. Agron. 2010, 107, 163–215. [Google Scholar] [CrossRef] [Green Version]
  19. Mouazen, A.M.; Kuang, B. On-line visible and near infrared spectroscopy for in-field phosphorous management. Soil Tillage Res. 2016, 155, 471–477. [Google Scholar] [CrossRef]
  20. Mouazen, A.M.; Maleki, M.R.; Cockx, L.; Van Meirvenne, M.; Van Holm, L.H.J.; Merckx, R.; De Baerdemaeker, J.; Ramon, H. Optimum three-point linkage set up for improving the quality of soil spectra and the accuracy of soil phosphorus measured using an on-line visible and near infrared sensor. Soil Tillage Res. 2009, 103, 144–152. [Google Scholar] [CrossRef] [Green Version]
  21. Kodaira, M.; Shibusawa, S. Using a mobile real-time soil visible-near infrared sensor for high resolution soil property mapping. Geoderma 2013, 199, 64–79. [Google Scholar] [CrossRef]
  22. Nawar, S.; Mouazen, A.M. On-line vis-NIR spectroscopy prediction of soil organic carbon using machine learning. Soil Tillage Res. 2019, 190, 120–127. [Google Scholar] [CrossRef]
  23. Yang, M.; Xu, D.; Chen, S.; Li, H.; Shi, Z. Evaluation of machine learning approaches to predict soil organic matter and pH using Vis-NIR spectra. Sensors 2019, 19, 263. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Silva, S.H.G.; Ribeiro, B.T.; Guerra, M.B.B.; de Carvalho, H.W.P.; Lopes, G.; Carvalho, G.S.; Guilherme, L.R.G.; Resende, M.; Mancini, M.; Curi, N.; et al. pXRF in tropical soils: Methodology, applications, achievements and challenges. Adv. Agron. 2021, 167, 1–62. [Google Scholar] [CrossRef]
  25. Hartemink, A.E. Soil Science in Tropical and Temperate Regions—Some Differences and Similarities. Adv. Agron. 2002, 77, 269–292. [Google Scholar] [CrossRef]
  26. Franceschini, M.H.D.; Demattê, J.A.M.; Kooistra, L.; Bartholomeus, H.; Rizzo, R.; Fongaro, C.T.; Molin, J.P. Effects of external factors on soil reflectance measured on-the-go and assessment of potential spectral correction through orthogonalisation and standardisation procedures. Soil Tillage Res. 2018, 177, 19–36. [Google Scholar] [CrossRef]
  27. Reeves, J.B. Near- versus mid-infrared diffuse reflectance spectroscopy for soil analysis emphasizing carbon and laboratory versus on-site analysis: Where are we and what needs to be done? Geoderma 2010, 158, 3–14. [Google Scholar] [CrossRef]
  28. Nawar, S.; Buddenbaum, H.; Hill, J.; Kozak, J.; Mouazen, A.M. Estimating the soil clay content and organic matter by means of different calibration methods of vis-NIR diffuse reflectance spectroscopy. Soil Tillage Res. 2016, 155, 510–522. [Google Scholar] [CrossRef] [Green Version]
  29. Farifteh, J.; Van der Meer, F.; Atzberger, C.; Carranza, E.J.M. Quantitative analysis of salt-affected soil reflectance spectra: A comparison of two adaptive methods (PLSR and ANN). Remote Sens. Environ. 2007, 110, 59–78. [Google Scholar] [CrossRef]
  30. Vasques, G.M.; Grunwald, S.; Sickman, J.O. Comparison of multivariate methods for inferential modeling of soil carbon using visible/near-infrared spectra. Geoderma 2008, 146, 14–25. [Google Scholar] [CrossRef]
  31. Vohland, M.; Besold, J.; Hill, J.; Fründ, H.-C. Comparing different multivariate calibration methods for the determination of soil organic carbon pools with visible to near infrared spectroscopy. Geoderma 2011, 166, 198–205. [Google Scholar] [CrossRef]
  32. Nawar, S.; Mouazen, A.M. Predictive performance of mobile vis-near infrared spectroscopy for key soil properties at different geographical scales by using spiking and data mining techniques. CATENA 2017, 151, 118–129. [Google Scholar] [CrossRef] [Green Version]
  33. Kuang, B.; Tekin, Y.; Mouazen, A.M. Comparison between artificial neural network and partial least squares for on-line visible and near infrared spectroscopy measurement of soil organic carbon, pH and clay content. Soil Tillage Res. 2015, 146, 243–252. [Google Scholar] [CrossRef]
  34. Alvares, C.A.; Stape, J.L.; Sentelhas, P.C.; de Moraes Gonçalves, J.L.; Sparovek, G. Köppen’s climate classification map for Brazil. Meteorol. Z. 2013, 22, 711–728. [Google Scholar] [CrossRef]
  35. IUSS Working Group WRB. World Reference Base for Soil Resources 2014, Update 2015: International Soil Classification System for Naming Soils and Creating Legends for Soil Maps; FAO: Rome, Italy, 2015; ISBN 978-92-5-108369-7. [Google Scholar]
  36. Demattê, J.A.; Campos, R.C.; Alves, M.C.; Fiorio, P.R.; Nanni, M.R. Visible–NIR reflectance: A new approach on soil evaluation. Geoderma 2004, 121, 95–112. [Google Scholar] [CrossRef]
  37. Van Raij, B.; Andrade, J.C.; Cantarela, H.; Quaggio, J.A. Análise Química Para Avaliação de Solos Tropicais; IAC: Campinas, Brazil, 2001. [Google Scholar]
  38. Kennard, R.W.; Stone, L.A. Computer Aided Design of Experiments. Technometrics 1969, 11, 137–148. [Google Scholar] [CrossRef]
  39. Kawamura, K.; Tsujimoto, Y.; Rabenarivo, M.; Asai, H.; Andriamananjara, A.; Rakotoson, T. Vis-NIR Spectroscopy and PLS Regression with Waveband Selection for Estimating the Total C and N of Paddy Soils in Madagascar. Remote Sens. 2017, 9, 1081. [Google Scholar] [CrossRef] [Green Version]
  40. Bellon-Maurel, V.; Fernandez-Ahumada, E.; Palagos, B.; Roger, J.-M.; McBratney, A. Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy. TrAC Trends Anal. Chem. 2010, 29, 1073–1081. [Google Scholar] [CrossRef]
  41. Nawar, S.; Mouazen, A.M. Optimal sample selection for measurement of soil organic carbon using on-line vis-NIR spectroscopy. Comput. Electron. Agric. 2018, 151, 469–477. [Google Scholar] [CrossRef]
  42. Minasny, B.; McBratney, A.B.; Whelan, B.M. 2005. VESPER, version 1.62; Australian Centre for Precision Agriculture, McMillan Building A05; The University of Sydney: Camperdown, NSW, Australia, 2006. [Google Scholar]
  43. Lacerda, M.; Demattê, J.; Sato, M.; Fongaro, C.; Gallo, B.; Souza, A. Tropical Texture Determination by Proximal Sensing Using a Regional Spectral Library and Its Relationship with Soil Classification. Remote Sens. 2016, 8, 701. [Google Scholar] [CrossRef] [Green Version]
  44. Bellinaso, H.; Demattê, J.A.M.; Romeiro, S.A. Soil spectral library and its use in soil classification. Revista Brasileira de Ciência do Solo 2010, 34, 861–870. [Google Scholar] [CrossRef] [Green Version]
  45. Ulusoy, Y.; Tekin, Y.; Tümsavaş, Z.; Mouazen, A.M. Prediction of soil cation exchange capacity using visible and near infrared spectroscopy. Biosyst. Eng. 2016, 152, 79–93. [Google Scholar] [CrossRef]
  46. Tavares, T.R.; Molin, J.P.; Javadi, S.H.; de Carvalho, H.W.P.; Mouazen, A.M. Combined use of vis-nir and xrf sensors for tropical soil fertility analysis: Assessing different data fusion approaches. Sensors 2021, 21, 148. [Google Scholar] [CrossRef] [PubMed]
  47. Tavares, T.R.; Molin, J.P.; Nunes, L.C.; Wei, M.C.F.; Krug, F.J.; de Carvalho, H.W.P.; Mouazen, A.M. Multi-Sensor Approach for Tropical Soil Fertility Analysis: Comparison of Individual and Combined Performance of VNIR, XRF, and LIBS Spectroscopies. Agronomy 2021, 11, 1028. [Google Scholar] [CrossRef]
  48. Mitchell, J.B.O. Machine learning methods in chemoinformatics. WIREs Comput. Mol. Sci. 2014, 4, 468–481. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  49. Viscarra Rossel, R.A.; Behrens, T. Using data mining to model and interpret soil diffuse reflectance spectra. Geoderma 2010, 158, 46–54. [Google Scholar] [CrossRef]
  50. Panchuk, V.; Yaroshenko, I.; Legin, A.; Semenov, V.; Kirsanov, D. Application of chemometric methods to XRF-data—A tutorial review. Anal. Chim. Acta 2018, 1040, 19–32. [Google Scholar] [CrossRef]
  51. McBride, M.B. Estimating soil chemical properties by diffuse reflectance spectroscopy: Promise versus reality. Eur. J. Soil Sci. 2021, 73, e13192. [Google Scholar] [CrossRef]
  52. Kerry, R.; Escolà, A. Conclusions: Future Directions in Sensing for Precision Agriculture. In Sensing Approaches for Precision Agriculture; Kerry, R., Escolà, A., Eds.; Springer: Cham, Switzerland, 2021; pp. 399–407. ISBN 978-3-030-78431-7. [Google Scholar]
  53. Roger, J.-M.; Chauchard, F.; Bellon-Maurel, V. EPO–PLS external parameter orthogonalisation of PLS application to temperature-independent measurement of sugar content of intact fruits. Chemom. Intell. Lab. Syst. 2003, 66, 191–204. [Google Scholar] [CrossRef] [Green Version]
  54. Wang, Z.; Dean, T.; Kowalski, B.R. Additive Background Correction in Multivariate Instrument Standardization. Anal. Chem. 1995, 67, 2379–2385. [Google Scholar] [CrossRef]
Figure 1. Schematic figure showing the location of the studied field (A); soil spatial variability observed in the Sentinel image (B); on-line soil survey conducted with the mobile vis–NIR platform and furrow left by the platform shank (C); soil sampling location and trajectory of on-line survey performed in the calibration area (D) and the entire field (E).
Figure 1. Schematic figure showing the location of the studied field (A); soil spatial variability observed in the Sentinel image (B); on-line soil survey conducted with the mobile vis–NIR platform and furrow left by the platform shank (C); soil sampling location and trajectory of on-line survey performed in the calibration area (D) and the entire field (E).
Automation 03 00006 g001
Figure 2. Mobile vis–NIR platform (A) and its shank in detail (B).
Figure 2. Mobile vis–NIR platform (A) and its shank in detail (B).
Automation 03 00006 g002
Figure 3. Box plots presenting the variation for the contents of clay, organic matter (OM), cation exchange capacity (CEC), pH, base saturation (V), and extractable (ex-) P, ex-K, ex-Ca, and ex-Mg for each calibration (Cal) and test datasets. The coefficient of variation (CV) is also presented above each box plot.
Figure 3. Box plots presenting the variation for the contents of clay, organic matter (OM), cation exchange capacity (CEC), pH, base saturation (V), and extractable (ex-) P, ex-K, ex-Ca, and ex-Mg for each calibration (Cal) and test datasets. The coefficient of variation (CV) is also presented above each box plot.
Automation 03 00006 g003
Figure 4. Mean and standard deviation (SD) of visible and near-infrared (vis–NIR) spectra acquired through the on-line survey performed in both the calibration area (spectral data acquisition 1) and the entire field (spectral data acquisition 2).
Figure 4. Mean and standard deviation (SD) of visible and near-infrared (vis–NIR) spectra acquired through the on-line survey performed in both the calibration area (spectral data acquisition 1) and the entire field (spectral data acquisition 2).
Automation 03 00006 g004
Figure 5. Scatter plots (for the 10 verification points) of measured versus predicted fertility attributes that presented satisfactory performance (in the calibration area) while using artificial neural network (ANN) models (A,C,E,G,I); the prediction results of this independent testing are detailed in Table A1 (Appendix A). Fertility attribute maps obtained after interpolation of the predictions using ANN models (B,D,F,H,J). The data were interpolated by kriging using the Vesper 1.63 software [42] and a 10 × 10 m grid, i.e., with a resolution compatible with that of the data acquisition. On the maps, the boundary of the calibration area is represented in black dashed lines and the position of the soil samples used in the independent testing in black circles.
Figure 5. Scatter plots (for the 10 verification points) of measured versus predicted fertility attributes that presented satisfactory performance (in the calibration area) while using artificial neural network (ANN) models (A,C,E,G,I); the prediction results of this independent testing are detailed in Table A1 (Appendix A). Fertility attribute maps obtained after interpolation of the predictions using ANN models (B,D,F,H,J). The data were interpolated by kriging using the Vesper 1.63 software [42] and a 10 × 10 m grid, i.e., with a resolution compatible with that of the data acquisition. On the maps, the boundary of the calibration area is represented in black dashed lines and the position of the soil samples used in the independent testing in black circles.
Automation 03 00006 g005
Table 1. Spatial statistics and correlation matrix for the 347 samples collected in the calibration area. Soil sampling was carried out at a spatial density of about 20 samples ha−1.
Table 1. Spatial statistics and correlation matrix for the 347 samples collected in the calibration area. Soil sampling was carried out at a spatial density of about 20 samples ha−1.
ClayOM 1CEC 2pH V 3ex-Ca 4ex-Mg 4ex-K 4ex-P 4
Spatial Statistics:
Nugget effect173.106.6533.000.0130.0020.002.800.10PNE 7
Sill2050.2017.0989.500.07296.90127.308.100.30PNE 7
Range 5 1761131003555564742PNE 7
SDD 6 *8.4038.9036.9014.3010.1015.7034.8032.30-
Correlation Matrix:
Clay1.00
OM0.271.00
CEC0.350.301.00
pH−0.010.100.181.00
V0.080.170.560.501.00
ex-Ca0.010.130.170.14−0.081.00
ex-Mg0.200.140.19−0.08−0.060.371.00
ex-K0.16−0.080.800.310.900.01−0.061.00
ex-P0.170.220.430.690.520.040.110.081.00
1 Organic matter; 2 cation exchange capacity; 3 base saturation; 4 exchangeable (ex-) nutrients; 5 range values expressed in meters; 6 spatial dependence degree; 7 pure nugget effect. Bold values indicate a significant correlation (p value < 0.05); * SDD values can be interpreted as strong spatial dependence when less than 25%, moderate spatial dependence when between 25 and 75%, and weak spatial dependence when greater than 75% [13].
Table 2. Prediction results of the calibration and test set obtained from partial least squares regression (PLS) and artificial neural network (ANN) models calibrated using mobile visible and near-infrared spectroscopy (vis–NIR) in the calibration area.
Table 2. Prediction results of the calibration and test set obtained from partial least squares regression (PLS) and artificial neural network (ANN) models calibrated using mobile visible and near-infrared spectroscopy (vis–NIR) in the calibration area.
Clay OM 1CEC 2pHV 3ex-Ca 4ex-Mg 4ex-K 4ex-P 4
ANN Calibration (n = 295):
R20.890.660.690.320.820.760.480.500.29
RMSE13.151.784.150.237.115.512.070.3914.46
RMSE %6.4811.128.3612.229.6111.249.4111.408.71
RPIQ3.92.82.91.33.92.91.41.30.6
ANN Test (n = 52):
R20.770.570.550.100.650.690.230.140.12
RMSE19.892.327.240.2410.277.142.920.5518.11
RMSE %9.8014.5314.6012.8413.8814.5713.2516.2110.91
RPIQ2.62.21.71.22.72.21.00.90.4
PLS Calibration (n = 295):
R20.760.480.410.010.590.690.060.040.01
RMSE20.062.536.980.2710.376.112.790.5120.47
RMSE %9.8812.6714.0714.2614.0112.4712.6912.7312.33
RPIQ2.52.01.71.12.62.11.11.10.4
n VL98911114311
PLS Test (n = 52):
R20.750.290.520.000.490.670.080.010.03
RMSE21.643.657.420.2812.717.232.600.5515.84
RMSE %10.6618.2414.9614.8417.1814.7611.8313.879.54
RPIQ2.41.41.61.12.11.81.21.00.6
n VL 598911114311
1 Organic matter; 2 cation exchange capacity; 3 base saturation; 4 exchangeable (ex-) nutrients; 5 number of latent variables The coefficient of determination (R2) and ratio of performance to interquartile range (RPIQ) values are presented on grayscale, highlighting the highest values. The root-mean-square error (RMSE) was given in g dm−3 for clay and OM; in mmolc dm−3 for CEC, ex-K, ex-Ca, and ex-Mg; in % for V; and in mg dm−3 for ex-P.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Eitelwein, M.T.; Tavares, T.R.; Molin, J.P.; Trevisan, R.G.; de Sousa, R.V.; Demattê, J.A.M. Predictive Performance of Mobile Vis–NIR Spectroscopy for Mapping Key Fertility Attributes in Tropical Soils through Local Models Using PLS and ANN. Automation 2022, 3, 116-131. https://doi.org/10.3390/automation3010006

AMA Style

Eitelwein MT, Tavares TR, Molin JP, Trevisan RG, de Sousa RV, Demattê JAM. Predictive Performance of Mobile Vis–NIR Spectroscopy for Mapping Key Fertility Attributes in Tropical Soils through Local Models Using PLS and ANN. Automation. 2022; 3(1):116-131. https://doi.org/10.3390/automation3010006

Chicago/Turabian Style

Eitelwein, Mateus Tonini, Tiago Rodrigues Tavares, José Paulo Molin, Rodrigo Gonçalves Trevisan, Rafael Vieira de Sousa, and José Alexandre Melo Demattê. 2022. "Predictive Performance of Mobile Vis–NIR Spectroscopy for Mapping Key Fertility Attributes in Tropical Soils through Local Models Using PLS and ANN" Automation 3, no. 1: 116-131. https://doi.org/10.3390/automation3010006

Article Metrics

Back to TopTop