1. Introduction
The olive tree is one of the most representative crops in the Mediterranean basin closely linked to the economy and culture of the region. Worldwide, the Mediterranean basin is the producer of 99% of olive oil and the consumer of 87% [
1].
Traditional olive groves have been and still are a very important component of the Mediterranean landscape. To a considerable extent we could classify them as forests that produce an important range of ecosystem services: healthy food, biodiversity, living soils, carbon sequestration, culture, employment, life in villages, etc. Particularly with regard to olive grove landscapes, the EU has shown great interest in their conservation through strategies such as the European Landscape Convention [
2], developing a specific tool for the protection and management of olive groves.
The lack of economic sustainability is causing traditional olive groves to be at serious risk of survival, disappearing on many occasions. More recently, this crop has been changing from traditional rainfed olive groves with a low density of trees (less than 100 trees per hectare) to olive groves of medium and high density, mostly associated with the introduction of irrigation, which is also promoting the substitution of various crops (wheat, barley, sunflower, cotton) by high-density olive groves [
3,
4]. All this is causing major changes in the management of the olive groves, especially in their intensification, as well as in the economic, social and environmental impacts.
These changes have been strongly noted in one of the main olive-growing regions in the world, Andalusia.
Figure 1 and
Table 1 show the difference in olive grove areas between 2015 and 2018. In general, there has been an increase in high-density plantations to the detriment of low ones [
5].
Studies such as [
3] indicate that this is causing a high environmental impact, among which the problems derived from water needs stand out. However, there is also evidence that the intensification of deficit irrigation has improved carbon sequestration, as investigated by [
6], through the modeling of the implications of climate variability and agricultural management on the productivity and environmental performance of olive crops in the Mediterranean. In addition, an increase in such intensification would increase irrigation needs. For this reason, it is becoming more and more necessary to be able to systematize the monitoring of olive tree density with detailed and permanently updated information in large areas. Studies such as [
7,
8] show the need to provide more detailed information on olive farming practices and to make and quantify proposals to increase specific sustainable practices at the farm level [
9].
In this regard, there are numerous publications related to data-capture mechanisms in the field and the use of platforms for their management and visualization [
10,
11,
12,
13,
14,
15]. In a complementary way, important efforts are being made to create common data spaces in the agricultural field [
16,
17,
18,
19,
20,
21,
22], which try to overcome the existing barriers regarding the global management of data. The main obstacles found are the following: the complexity of data management [
23], the lack of interoperability [
16,
17,
22], the insufficiency of storage units and processing platforms [
24,
25], as well as the scarcity of reference architectures [
23,
26,
27,
28,
29,
30]. Overcoming these limitations would make it possible to take full advantage of the potential of data analysis and management, strengthening the capabilities of decision-making support systems.
Further, the characterization and monitoring of large areas of crops is becoming a key factor to improving and supporting decision-making. The combination of remote data with ground measurements, obtained from interpretations of high spatial resolution aerial photogrammetry through image analysis, significantly improves the ability to study land processes. In this line, different studies are being promoted, as is the case of [
31] where a methodology for landscape sampling, mapping and characterization of a complex agroforestry system in sub-Saharan Africa is provided.
Therefore, the use of remote sensing has an increasingly important role in the continuous, effective, precise and complete monitoring of large areas, being key in the decision-making processes of agroforestry management [
32,
33]. It allows crop mapping to be carried out at a low cost and with high frequency, which makes it possible to extend these studies [
6,
7,
8] to large areas.
For this reason, an aspect of great importance is the possibility of using the synergies between automated procedures to identify and characterize the different ecological units in olive groves from very high resolution images, such as orthophotos with a spatial resolution of 0.5 m or superior. The analyses carried out with satellite images of lower spatial resolution could substantially improve their usefulness and accuracy if models based on spectral mixtures were developed using previous segmentations carried out with image analysis techniques. These studies would make it possible to complement the temporal resolutions of the systems for obtaining digital aerial orthophotography, such as the National Plan for Aerial Orthography (PNOA) [
34], whose update period is every three years, which makes it impossible to update inventories periodically.
Regarding the treatment and processing of satellite images, there are platforms such as Google Earth Engine (GEE), which provide easy access to a wide catalogue of images, including those captured by the Sentinel 2-MSI (MultiSpectral Instrument) satellites, and allow for the extraction of relevant vegetation indices in a simple way [
35]. Vegetation indices are capable of monitoring crop growth with high-resolution satellite images [
36]. Among these indices, the NDVI (Normalized Difference Vegetation Index) has been defined as a good tool to indicate significant changes in land use and cover [
37,
38]. NDVI shows better results than other indices such as the adjusted vegetation index to the ground (SAVI) [
38] and is one of the most widespread, due to its simplicity and availability [
39,
40]. In addition, according to [
41], the analysis of the NDVI for the estimation of the surface of the crops and the qualitative evaluation of these with hydric stress, can lead to an optimization in irrigation management systems.
Thus, it is very important to be able to automate the inclusion of new data, coming from: (i) high-resolution aerial photogrammetry image analysis, such as the crown area, Fraction Canopy Cover (FCC), tree density or the identification of different typologies; (ii) analysis of the NDVI at the pixel and sub-pixel level; (iii) existing open data sources such as the Geographic Information System of Agricultural Parcels SIGPAC [
42], the Andalusian Phytosanitary Information and Alert Network RAIF [
43] and the Integrated Treatments in Andalusia in Agriculture TRIANA [
44]. Furthermore, it is necessary to enable the creation of common data spaces that make it possible to value the tools for the conservation of the olive grove and the detection of changes in its management.
In addition, the processing of measurements from high-resolution aerial frames is very useful for improving the interpretation of satellite images; they provide field data from which it is possible to calibrate the models. Recent studies have focused on the development of tools to generate automated agroforestry inventories for the analysis of large areas [
31]. These allow us to calibrate and optimize the analyses carried out with lower resolution satellite images, which are needed in order to complement the studies with spectral mixture analysis techniques of automatic pixel analysis, to improve feature extraction at the olive tree level. Nevertheless, further work is still necessary to delve further into in order to optimize results. In this study, the FCC-NDVI relationship has been evaluated.
We can summarize that it is necessary to obtain detailed information on large olive grove areas, having the plot as the sampling unit and, where appropriate, scaling it to larger territories, which will make it possible to carry out studies at the farm level and characterize the ecosystem services of the olive grove crop providing tools to help decision making.
In this regard, the general objective of the work was to develop and validate a methodology to carry out olive grove inventories based on automatic analysis techniques of photogrammetric images of PNOA, satellite images of the Sentinel constellation and open data sources.
4. Discussion
The objective of this work was to develop a system that allows for creating inventories of olive groves at different scales from the integration of open data sources and calculated automatically through image analysis. As a result, the characterization of 1,519,438 ha of olive groves (92% of the olive grove area of Andalusia) was obtained. This study is in line with [
8], where a systematic analysis of the effects on the typology of the olive grove in the countryside of Córdoba and with the strategies of the European Landscape Convention [
2] was carried out. In addition, it provides specific information at the polygon and plot level, which serves to be able to evaluate the specific practices at the farm level, a need detected in the studies [
7,
9].
Our proposal has achieved unified and operational access to the different data sources, allowing their publication and consumption through intuitive interfaces, facing the problem of lack of interoperability indicated in [
16,
17,
22]. To achieve this objective, configurable algorithms have been developed that extract key agronomic information for different attributes, including: (i) crop and phytosanitary information; (ii) access to PNOA high-resolution aerial photogrammetry; (iii) access to images for remote sensing; (iv) time series of the main vegetation indices. All these developments have a great potential to be used for other purposes and crops.
Furthermore, the analysis and integration of the different data sources has allowed their evaluation and comparison. With this, it has been possible to identify some non-coherent data between the different sources studied (see
Table 4). Additionally, as can be seen in
Table 5, in some cases there are quite a few differences between the data provided by the RAIF and the estimated data, detecting relative errors that reach up to 37%. A detailed analysis of these discrepancies has made it possible to identify that these deviations usually occur in super-intensive olive groves, where the estimation of the area with the methods used loses precision. For this reason, the use of different methodologies for calculating areas based on the plantation framework is proposed for future work. Other discrepancies could be partially explained considering that the data provided by the RAIF could to some extent be the result of rounding, concluding that at least the most important discrepancies with the estimated data would merit a comparison with real data. In the same sense, the automatic inclusion of this type of measurements would improve the confidence and precision of those collected in the RAIF.
Another point to highlight from the work is the interpretation of data models through image analysis and the use of remote sensing, which is key for the effective and continuous monitoring in large areas [
32,
33]. The results of our study indicate that the NDVI calculated from the Sentinel-MSI images, particularly in the summer season, has a high relationship with the FCC in all provinces. During this period, the NDVI signal is not influenced by vegetation cover between trees [
57,
58]. For the same reason, the prediction errors were greater in the remaining seasons (winter, spring and autumn), since they are influenced by the existing vegetation in the streets, which would also allow for characterizing this herbaceous stratum by subtraction.
Our proposal has achieved precise approximations in the different provinces (R-squared between 0.43 and 0.815), similar to those presented in other investigations for other crops [
59]. Regarding the results at the plot level, an R-squared of 0.79 was reached. The usefulness of the models at the plot level, in addition to the estimation of the FCC, allows for the identification of the plantation framework, as well as a more precise approximation of the crown area, allowing for the inclusion of more detailed cartographic information in data sources, including existing data, such as RAIF, SIGPAC or TRIANA.
The studies carried out at the pixel level, where R-squared results of 0.655 were achieved, have allowed us to delve into the calculation of spectral mixtures within pixels. Despite the fact that the results were worse than those achieved at the plot level, it is still an attractive line of work to interpret the satellite images at the pixel level and their distribution in the territory.
Regarding the processing of high-resolution image analysis, the tool developed in the study [
48] was used, parameterizing it for olive cultivation and the different geographical areas of Andalusia, which has shown the ability to extrapolate this tool to other ecosystems and study areas. In this sense, the shapefiles generated with detailed geographic and agronomic information are a valuable contribution to the inventory of olive groves that allow delving into studies such as sub-pixel classification and the estimation of mixtures, with the aim of classifying and accurately identifying the elements of the olive groves. This raises an interest in multispectral images provided by remote sensing, which is proposed for future studies.
5. Conclusions
The tools and protocols developed make it possible to automate the capture of images of different characteristics and origins, as well as from different open data sources, and integrate them and metadata them so that they can later be used for the development and validation of algorithms that can improve the characterization of the surfaces of olive grove at the plot and cadastral polygon scales.
The proposed system allows for identifying, locating, counting and measuring the fraction of canopy cover (FCC) of olive trees in different locations, plantation frameworks, varieties and tree cultivation techniques. It is robust and useful for carrying out automated inventories of olive groves and incorporating them into decision support strategies.
An inventory of the Andalusian olive grove has been automatically carried out at the level of cadastral polygons and provinces, which has accounted for a total of 1,519,438 hectares and 171,980,593 olive trees, data that have been contrasted with various official statistical sources allowing us to ensure the reliability of this study and even identify some inconsistencies or errors of some sources.
Obtaining singular information at the tree level opens up a great opportunity to systematize the measurement of the impact of various farming practices, the measurement of ecosystem services, the control of compliance with regulations and the granting of public aid.
The ability of Sentinel 2 satellite images to estimate the FCC at the cadastral polygon, plot and 10 × 10 m pixel levels, as well as to perform inventories with temporal resolutions of approximately up to 5 days, has been demonstrated and quantified.
The combination of object-oriented automatic image recognition techniques, with automatic pixel analysis techniques, have allowed us to explore the opportunity of mixture analysis to improve the estimation of olive trees and their characteristics, although it is still necessary to delve further in order to optimize results.