Sentinel-2 Recognition of Uncovered and Plastic Covered Agricultural Soil

: Medium resolution satellite data, such as Sentinel-2 of the Copernicus programme, offer great new opportunities for the agricultural sector, and provide insights on soil surface characteristics and their management. Soil monitoring requires a high-quality dataset of uncovered and plastic covered agricultural soil. We developed a methodology to identify uncovered soil pixels in agricultural parcels during seedbed preparation and considered the impacts of clouds and shadows, vegetation cover, and artiﬁcial covers, such as those of greenhouses and plastic mulch ﬁlms. We preserved the spatial and temporal integrity of parcels in the process and analysed spectral anomalies and their sources. The approach is based on freely available tools, namely Google Earth Engine and R Programming packages. We tested the methodology on the northern region of Belgium, which is characterised by small, fragmented parcels. We selected a period between mid-April to end-May, when active agricultural management practices leave the soil bare in preparation for the main cropping season. The spectral angle mapper was used to identify soil covered by non-plastic greenhouses or temporary soil covers, such as plastic mulch ﬁlms. The effect of underlying soil on temporary covers was considered. The retrogressive plastic greenhouse index was used for detecting plastic greenhouses. The result was a high quality dataset of potential bare uncovered agricultural soil that allows further soil surface characterisation. This offered an improved understanding of the use of artiﬁcial covers, their spatial distribution, and their corresponding crops during the considered period. Artiﬁcial covers occurred most frequently in maize parcels. The approach resulted in precision values exceeding 0.9 for the detection of temporary covers and non-plastic greenhouses and a sensitivity value exceeding 0.95 for non-plastic and plastic greenhouses.


Introduction
Agriculture is constantly in need of timely information that is key to various land management decisions. Soil mapping of temporal and spatial variability is essential for sustainable management of cropland and for the assessment and remediation of impacts of climate change [1,2]. In many parts of the world, it can be challenging to manage agricultural parcels without appropriate updates of digital agricultural databases [3,4].
Remote sensing-based information at the parcel-level is of interest to land managers [5,6]. Including spaceborne remote sensing data in the farm management scheme can lead to an improvement of farm profit [7] as they provide suitable monitoring that contributes to the sustainable development of the agricultural sector [8,9]. Their ability to support agricultural soil mapping and understanding soil attributes has been well illustrated using various sensors and platforms [10,11], where the exclusion of pixels contaminated by various factors has been highlighted. A combination of indices has been successfully used to detect bare agricultural soil including normalised difference vegetation index (NDVI) combined with "Normalized Burn Ratio" index (NBR2) [12], the bare soil index (BSI) [13], and the modified bare soil index (MBI) [14]. With the support of these bare soil detection approaches, researchers have also created composites of bare soil data acquired at different periods of time to enable a continuous representation of bare soil in various study areas [11,15].
Artificial covers, such as greenhouses and temporary soil covers, need to be addressed in the detection of bare soil. Plastic greenhouses have been successfully observed in medium resolution imagery, such as those of Landsat and Sentinel-2 [16][17][18][19][20][21][22]. Spectra of plastic greenhouses are of relatively high reflectance, holding information of the crops underneath the plastic cover [18]. The identification of non-plastic greenhouses has mainly focused on object-based detection and often using high spatial resolution data [21,23]. In addition to greenhouses, European farmers use temporary covers such as low plastic tunnels, mulching films, along with plastic films such as those used for asparagus. Those covers are diverse in type and material. The spectra of underlying crops or soil add to the complexity of their detection in most multispectral data while specific absorption features in the shortwave-infrared are not easily captured when compared to hyperspectral data [21,24,25]. Mulch films are mainly single-use films of low density polyethylene (LDPE). Unless biodegradable, these temporary covers are usually collected after harvest (although sometimes incompletely). Such mulch films are often placed in strips and can leave uncovered bare soil, such as in ridge-furrow tillage approaches [26]. Therefore, uncovered soil is expected to contribute to mixed pixel reflectance, increasing the impact that underlying soil has on the reflectance spectra. Approaches for monitoring mulch films in areas of intensive usage include the development of indices [27,28] along with fusion of Sentinel-1 and Sentinel-2 data [29]. All these artificial covers are a challenge for the detection of bare soil within agricultural parcels, while there is a lack of understanding of their spatiotemporal use.
The objective of this work was to detect potential bare soil in Sentinel-2 data in a complex fragmented landscape of agricultural parcels and to exclude any artificial soil covers. The approach aimed to emphasise the preservation of a parcel's spatial and temporal integrity through the selection of the most suitable acquisition date per parcel, allowing for insights on the types of crops utilising artificial covers. We tested the methodology in northern Belgium with a specific temporal window between mid April to end of May, capturing the period of land preparation activities that involve clearing of vegetation cover. This work contributes to detailed parcel-level topsoil information, as well as an improved understanding of the spatial, temporal, and crop-diversity of artificial covers of agricultural soil.

Study Area and the Agricultural Parcel Dataset
Since 1990, agriculture has accounted for an unchanged 45% of the land use in northern Belgium (Flanders). With a population density of 475 inhabitants per km 2 , the pressure on land is high; as a consequence agriculture is intensive and industrialised [6]. Agricultural land comprises 6,230 km 2 , out of which 70% is arable. Meadows, pastures, and fodder crops account for 56% of the total area in Flanders, and are a dominant land feature of the rural landscape. The largest portion of revenue is taken by intensive sectors such as pig breeding, poultry and dairy farming, horticulture, and ornamental plant culture.
The agricultural fields of Flanders are highly fragmented and relatively small, reaching in various cases less than 2 ha surface area with an average farm area of 15 ha [30]. A database of agricultural parcels compiled in 2020 was used, as parcels are annually documented in a regional database containing parcel surface area along with the planned major crop for the considered year. The database follows the procedures set by the Integrated Administration and Control System (IACS) of the European Common Agricultural Policy that aims to manage and control payments made by the member states to farmers. The extensive 2020 parcel dataset was filtered to exclude crops that were not expected to have a bare soil window between April and May, such as those with fruit and nut trees, winter grains, and grassland. The remaining selected crops were very diverse and dominated by maize, potatoes, and sugar beet ( Figure 1 Leek -Fresh Seed Grasses Carrot (non-early) (consumption) industry

Crop
Number of Parcels [-] (b) Greenhouse horticulture is an important sub-sector in Flemish agriculture although it only represents an estimated 0.3% of the agricultural area [31]. The greenhouses (GHs) used for commercial activities can be made of plastic or glass with frames of metal, wood, etc. Various crops that heavily depend on greenhouse cultivation in northern Belgium include strawberries, small fruits, vegetables and leafy greens, along with floriculture where 35% of flower farming consists of greenhouses [31]. Farmers in Flanders also make use of diverse temporary covers for various purposes including weed suppression and soil warming. Soil care is central to agriculture, and a soil monitoring system is currently lacking. A simplified soil map ( Figure 2) according to the World Reference Base for Soil Resources shows six reference soil groups, namely Cambisol (22%), Retisol (14%), Technosol (14%), Luvisol (13%), Podzol (12%), and Anthrosol (11%) [32], where Technosols are mostly associated with urban areas.

Sentinel-2 Imagery
The Copernicus Sentinel-2 two-satellite constellation (S2A and S2B) provides data acquired by the multispectral instruments (MSIs). The acquired imagery are freely available and reach a 5-day geometric revisit time. The MSI sensors provide images with thirteen bands where spatial resolution is band (B)-dependent and reaches 10m, 20m, or 60m [33] ( Table 1). This new generation of freely available spaceborne acquisitions provides great new opportunities to monitor changes in soil cover at the parcel level in regions with small-patched units such as the parcels in the study area. The data pre-processing aims at achieving high quality surface reflectance per pixel of a parcel with potential bare soil, and, thus, a combination of freely accessible tools was used for this purpose as shown in Figure 3. A time-series of Sentinel-2 imagery between mid-April and end-May was selected and pre-processed for a year of interest using Google Earth Engine (GEE), a cloud-based computing environment [34]. Further spatial data handling at the parcel-and pixel-level was carried out with R programming using various packages including sf [35], sp [36], rgdal [37], rgeos [38], Raster [39], SpatialEco [40], geojsonsf [41], and the tidyverse collection [42]. Spectral plots were prepared using the HyperSpec package [43].  Figure 3. Overview of the major preprocessing steps to achieve high quality reflectance per pixel of parcels with potential bare soil.

Filtering Parcel Geometry
Sentinel-2 data have performed well in the context of monitoring small agricultural parcels [44] where a minimum surface area of 0.5 ha is recommended. Agricultural parcels in Flanders consist of units with various shapes and orientations. To reduce the number of mixed soil pixels at the boundaries, an inner buffer of 20 m was applied for each parcel, thus reducing further the parcel surface area. Different strategies were used to exclude parcels that could reduce data quality. Prior to buffering, parcels of complex and mainly elongated geometry that could result in lower accuracy [44] were identified and excluded. A shape index (SI) guided the identification (Equation (1)), where SI = 1 resulted in a circular shape and increased for more elongated and complex geometrical shapes [45]. A threshold of SI of 2.5 was applied to exclude complex parcel geometries of higher values. After setting the parcel inner buffers, separate geometrical shapes could evolve from a one polygon parcel ( Figure 4). These resulting polygons were segregated into individual geometries while they held the same parcel metadata. All resulting polygons with areas below 6, 000m 2 were excluded. The first essential task prior to analysing the Sentinel-2 imagery was the detection of clouds and their shadows, and subsequently excluding affected pixels. The direct use of cloud and shadow masks and quality bands available for Sentinel-2 data through Sen2Cor [46] were challenging as various clouds and shadows could be falsely omitted [47]. Using GEE, a collection of level (L)1C top of atmosphere (TOA) reflectance data covering Flanders were filtered to exclude those with cloud coverage exceeding 50% and were spatially mosaicked per acquisition date. A pixel-based approach was used where clouds and shadows were detected using the supervised classification of the quality bands (QA60) [48,49]. The high quality Sen2Cor-produced cirrus cloud masks [50] were also utilised.
To preserve a parcel's spatial integrity, each parcel containing masked-out pixels was fully excluded from the corresponding acquisition date. Furthermore, to further detect any omitted clouds affecting the parcels, a multi-temporal approach comparing the acquisition dates to a clear reference date was utilised and considered sudden variations in surface reflectance in the red band (B4) and the blue band (B2) [51]. On the other hand, to detect omitted cloud shadows, their "darker" characteristics in the near-and shortwave infrared parts of the spectrum were considered [52] utilising the red band (B4) and the NIR band of 10m resolution (B8).

Acquisition-Date Selection per Parcel
Within the period from mid-April to end-May, a shadow-and cloud-free image of each parcel was required under potential bare soil conditions. NDVI (Equation (2)) with a limiting threshold of 0.3 was used to detect potentially bare soil where this threshold aligns with various studies using thresholds of 0.2 [13,53], 0.25 [11], 0.30 [54], or 0.35 [12].
Trade-offs existed between the best image for each parcel in the time series while limiting the number of images with different acquisition dates to reduce the influence of acquisition conditions, such as sun illumination, soil moisture content, and other factors. Hereto a ranking of the available imagery was carried out based on the number of complete parcels available per date after clouds and shadow removal. Accordingly, each parcel's bare soil pixels were selected from the corresponding highest-ranked acquisition date.

Anomaly Detection
For a functional dataset of bare soil, NDVI was not enough to extract uncovered agricultural parcels. The detection of anomalies was built upon the knowledge of bare soil characteristics in medium resolution spaceborne data. Supporting NDVI, green vegetation indices (GVI) (Equation (3)) were used, where values exceeding 0 have been attributed to bare soil pixels [11,12]. The soil-line (Red-NIR line) concept using near infrared and the red bands [14,55] was also used to identify major anomalies in the data, including various types of artificial covers. The Red-NIR line allowed a visual screening of the data, knowing that bare soil pixels were expected to follow a linear trend-line and were of relatively low reflectances [55]. High differences in inter-band reflectance in the visible part of the spectrum could also indicate the presence of anomalies. Sentinel-2 data includes three bands in the visible part of the spectrum at 10m resolution; the blue (BCC), green (GCC), and red (RCC) chromatic coordinates allowed the identification of such anomalies using Equation (4):

Large Non-Plastic Greenhouses and Temporary Soil Covers
As no inventory of covers in Flanders is available, nor is there a library of suitable reference spectra, the inspection of the detected anomalies provided diverse reference parcels and this opportunity to develop a machine-learning approach that identified other similarly covered sites. The spectra of non-plastic GHs and those of diverse soil covers had distinctive features from the underlying soil or crop. Although the spectra of various temporary soil covers, such as mulch films were highly impacted by the underlying soil, they were distinct from spectra of plastic GHs or uncovered bare soil ( Figure 5). The spectral angle mapper (SAM) was used to identify such artificial covers using the hsdar package in R [56]. SAM measured the similarity among spectra to reference spectra and proved a powerful spectral distance approach used in various studies related to soil characterisation [57,58]. A spectrum is a vector in an n-dimensional space with n representing the number of considered bands. SAM resulted in an angle (in radians) as expressed in Equation (5). Since the target was to measure the angle between a reference vector and another, the vector length was not considered with SAM, making it a featurebased approach that does not consider overall brightness. This short-coming is not relevant to non-plastic GH detection and temporary soil covering films.
where i ∈ [1, n], S is a reflectance spectrum to be analysed while R is a reference spectrum. Reference spectra of large permanent greenhouse structures and soil covers were identified through anomaly detection using the RED-NIR line, GVI1, and GVI2, and through visual inspection of satellite data. Examples of plastic and non-plastic greenhouse pixels, temporarily covered soil, and uncovered soil pixels, all satisfying NDVI < 0.3. The spectral plots show the mean and standard deviation per band considering 250 spectra for each group. The spectra were extracted from Sentinel-2 data acquired on 20 April 2020.
The lower the angle, the similar the spectra are to each other, making it essential to assess a threshold's suitability. With the presence of various types of greenhouses and cover films and with diverse spectra within each group, several reference spectra were needed, for which the angle was calculated separately. To create representative groups of reference spectra and extract end-members that provided computational efficiency, the vector length (||r||) of a reference spectrum was introduced (Equation (6)) as a quantitative metric to subset variable classes of representative end-members [59]. To define group end-members, spectra were partitioned into a number of equal intervals and a representative spectrum was achieved by calculating the mean spectrum of the subset. Various subset scenarios were tested for the choice of the end-members. The possible impacts of underlying soil groups on the detection of temporary covers was also considered. Reference spectra were compared for covered reference soil groups to corresponding uncovered pixels of all reference soil groups. Accordingly, decisions to split the dataset were made to detect the temporary cover per subset.

The Plastic Greenhouse Index
As plastic GHs are highly linked to the spectra of underlying crops or soils, their spectral features were expected to be similar yet with an overall higher reflectance. Thus, SAM, as a feature targeting approach, was not useful for the detection of such covers. The retrogressive plastic greenhouse index (RPGI) based on medium resolution multispectral imagery [18] was used for the detection of plastic GHs (Equation (7)). RPGI proved a robust measure for the binary selection of plastic GH pixels where NDVI < 0.73. However, this index has limitations, such as its dependency on the underlying vegetation and its low capability of detecting greenhouses in Summer when sheets are painted white [19], neither of which applied to our test area or temporal window. From the original development scenarios of the index on plastic greenhouses, a threshold of the index to derive a distinction between greenhouse pixels and others was tested and found to be a maximum of 1.6. We tested different threshold intervals, and a threshold based on graphical interpretation of the RPGI's overall accuracy was accordingly selected.

Accuracy Assessment Using High Resolution Data
An area of approximately 30,000 ha was covered by high resolution imagery provided by Google Earth acquired on 11 April 2020 and was used for accuracy assessment of the results. The presence of artificial covers was inspected visually for each considered parcel. precision (P r ) and sensitivity (S p ) values were calculated: P r = T/(T + Fp), where T is the number of true positives and Fp the number of false positives, and S p = T/(T + Fn), where Fn is the number of false negatives. Furthermore, the knowledge of the expected crops to utilise greenhouses allowed a better assessment of results and the ease of detection of commission errors.

Extracting Reflectance Information of Sentinel-2 for Agricultural Parcels
The clean-up of parcels based on crop types, geometries, and surface areas resulted in the retention of 66,665 parcels distributed among 173 planned crops for 2020. After excluding clouds and cloud shadows, the available number of valid pixels in the parcels of selected crops are shown in Figure 6a. The number of valid pixels varied in the temporal series, and a ranking of each date was carried out with respect to the number of valid parcels. Rank 1 was set for the data acquired on 20 April 2020, corresponding to the highest number of pixels (exceeding 17 million pixels). The corresponding number of parcels covered by each image is shown in Figure 6b, along with the remaining parcels after an NDVI threshold of 0.3. Rank 1 data provided 65,616 parcels of which 44,142 (67%) had a mean NDVI value < 0.3 for each parcel. The first three ranks were chosen and represented 99.1% of the parcels with NDVI < 0.3 (Figure 7).  Considering Rank 1 data, greenhouses that resulted in major anomalies in the parcel dataset are well visible in the RED-NIR scatter of Rank 1 pixels subjected to the NDVI threshold of 0.3 (Figure 8). The major segment of the data followed the expected line for bare soil and was mainly clustered around the lower reflectance. Very high reflectances, even exceeding 1, were observed for both NIR and RED, indicating the presence of high specular reflection. Values exceeding 0.75 in B4 were of 345 pixels of non-plastic greenhouses that were planned for planting tomatoes, strawberries, strawberry seedlings, and rucola, and had distinct reflectance spectra when compared to bare soil ( Figure 9). All parcels containing an identified anomaly were excluded, resulting in the loss of four parcels and 508 pixels. Such a relatively small number of pixels highly affected the RED-NIR line and emphasised the need to address such anomalies in soil data. GVI1 and GVI2 values below 0 corresponded to pixels of diverse artificial soil covers from greenhouses to thin temporary films ( Figure 10). This was confirmed by high resolution imagery provided by Google Earth including an acquisition on 11 April 2020, i.e., 9 days prior to the Rank 1 acquisition. These identified parcels were utilised as reference spectra for the detection of other covered parcels.

Endive -fresh Blackberries
Planting and propagating material of leguminous vegetables

Parsley industry
Crop

Number of Pixels [-]
(a) (b) Figure 10. Illustrating covered pixels detected with either GVI1 or GVI2 < 0. (a) Planned crops of pixels with GVI1 or GVI2 < 0 corresponding to diverse artificial soil covers. (b) An example of a parcel with GVI1 < 0 planned for maize (as per the IACS database). The parcel is covered with plastic mulch film stripes as viewed by Sentinel-2 (left) and high resolution imagery (right); the red polygon indicates the buffered border.

Temporary Covers on Various Soil Types
As underlying soil impacts the reflectance of the temporary covers, especially mulch films, reference spectra of covered sections were compared to non-covered sections. ||r|| was calculated for each spectrum to simplify dimensionality for broccoli, cabbage, carrots, and maize ( Figure 11). These plots illustrated the distinctive reflectance of covered soil, except for the case of Cambisols (Figure 11b) where covered pixels were overlapping with ||r|| values of other soil types. Cambisols were separated, and the temporary cover detection was carried out on each subset.

Detection of Covered Parcels by SAM and RPGI
A collection of 3575 pixels of 14 parcels and 10 crops were the reference to detect temporary covers. Thresholds of 0.05 and 0.1 were tested where the overall accuracy was reported for each different scenario (Figure 12a). Accordingly, 5 end-members (Figure 12b,c) were chosen for each subset of soil groups where they showed relatively high detection capability. Selected thresholds were 0.012 and 0.04 for Cambisol and others, respectively, where these thresholds set a boundary for a major increase in commissioned crop types and crops (Figure 12d,e).

Wavelength [nm]
Surface Reflectance [-] (c) For non-plastic greenhouses, 1,396 pixels corresponding to 7 parcels and 6 crop were used as reference. SAM was able to identify at least one pixel in each parcel and crop-type using a threshold of 0.1 or higher starting from 2 end-members ( Figure 13). To identify all pixels (achieving 100% overall accuracy) a threshold exceeding 0.2 would be needed. With the increase in the number of end-members, the overall accuracy of parcels increased at lower threshold values, but the overall accuracy of pixels did not significantly improve. This confirmed the challenge with such greenhouse detection as there was a large variability in the spectra within the same parcel. Four end-members were selected with a threshold of 0.06 that resulted in the omission of one reference parcel (Figure 13g).

Wavelength [nm]
Surface Reflectance [-] (d)  Figure 14 shows the number of pixels, parcels, and crops attributed to plastic greenhouses for a range of RPGI thresholds. RPGI = 0.4 represented a major breaking point in the attribution of pixels to plastic greenhouses, and thus RPGI > 0.4 was used for the detection on such greenhouses.

Detailing the Crops That Utilise Artificial Covers
Once a pixel was flagged as covered, all pixels in the corresponding parcel were also labelled as such and excluded from the potential bare soil database. An overview of identified covered pixels and corresponding crops are shown in Figure 15, where 20,626 pixels were attributed to temporary covers, 4,198 pixels to non-plastic greenhouses, and 13,401 to plastic greenhouses. Carrot (non-early) (consumption) industry Carrot (non-early) (consumption) -fresh Peas (other than dry harvested) -industry

Accuracy Assessment of Artificial Cover Detection Using High Resolution Data
The high resolution data around the centre of Flanders included approximately 46,450 agricultural parcels of Rank 1 imagery. The detected artificial covers are shown in Figure 16. The area included only 1 parcel with plastic greenhouses that was successfully detected, yet another parcel with a visual anomaly on the borders was commissioned, resulting in a relatively lower P r value. Furthermore, a number of partially covered parcels with mulch films were not captured, resulting in the relatively lower S p value for the temporary covers. There was a very limited number of false positives, resulting in an overall high S p of the approach (Figure 16).

Excluding Miscellaneous Anomalies
After the exclusion of all greenhouse pixels, the chromatic coordinates (CC) were used to detect any additional anomalies in the visible part of the spectrum, identifying non-detected artificial covers. Figure 17 shows a boxplot with all pixels and outliers that were considered as anomalies, where CCs were retained, such that 0.15 < BCC < 0.325, 0.25 < GCC < 0.375, 0.325 < RCC < 0.55. The excluded values referred to non-detected mulch covers, anomalies in the field that could most likely be attributed to farm vehicles, and a number of parcel-border pixels that were retained after buffering.

Detection of Artificial Covers in the Time-Series
The methodology to detect artificial soil covers illustrated for Rank 1 data was carried out for two subsequent ranks to cover the majority of agricultural parcels. Figure 18 shows all the detected artificial covers and an overview of their proportions. Figure 19 shows an update of the RED-NIR scatter for pixels excluding these anomalies for Rank 1, a composite of Rank 1 with Rank 2, and a composite of all the first three ranks. Figure 20 shows an overview of the crops with detected soil covers for the first three ranks of acquisition dates.

Garden and field beans (Vicia faba) -industry Brussels sprouts -industry
Leek industry

Discussion
Powerful approaches to detect bare soil have been already developed for Landsat, Sentinel-2, and a combination of sensors [11,54,60], and have successfully mapped large areas to analyse soil characteristics. These approaches focus on merging data within the time series at pixel level, aiming to reduce the occurrence of extreme reflectances and to obtain stable and representative values. The methodology developed in this paper preserves an agricultural parcel's spatial and temporal integrity and highlights the interpretation of anomalies and extreme values. These aspects allow the yearly mapping of parcels and provide insights on soil surface characteristics to the land-managers at the parcel level . Furthermore, they set the foundation for the development of suitable monitoring strategies for agricultural soil and provide the possibility to plan in-situ field campaigns and surveys.
The NDVI, GVI1, GVI2 thresholds to identify potentially bare soil were in tune with other research seeking bare soil spectra [10][11][12]61]. As the focus was only on agricultural parcels, GVI1 and GVI2 were important indicators for the presence of artificial covers. Agricultural parcels with artificial covers were relatively low in area and in number, yet they highly affected the data and, therefore, compromised the understanding of soil surface characteristics and patterns based on spectral signatures. The RED-NIR line allowed for the detection of those anomalies due to their known patterns in bare soil spectra [62]. The resulting cleaned data provided an improved dataset for subsequent cover analysis and monitoring. For example, using the RED-NIR line concept, higher values could represent straw and post-harvest residues [55]. Detected anomalies provided a valuable dataset of reference spectra of various soil covers. Furthermore, the time-series revealed land preparation activities where the fraction of parcels with a low NDVI was highest around the middle of the considered period when comparing the first three ranked dates of image acquisitions, i.e., 17 April, 15 May, and 30 May.
SAM was used for the detection of temporary covers and non-plastic greenhouses. The choice of end-members along with the selected constraints highly impacted the results in such a spectral matching approach. The constraints used by SAM were assessed by calculating the fraction of correct detection of artificial covers on pixels, parcels, and crops using various scenarios of end-members. With the increase in the angle, the detection of the covered pixels was increasing [63]. A trade-off existed between precision and sensitivity as with the increase in threshold to attain 100% accuracy, commission errors could increase. This is mainly due to the presence of diverse spectra in a covered parcel, especially for non-plastic greenhouses. A parcel could be labelled as covered once a pixel in the parcel was labelled, allowing higher constraints on the SAM angle and reducing commission errors.
Temporary covers in the considered period were limited to scattered parcels spread across the region, requiring high precision and sensitivity of the detection method. This setting has rarely been addressed in literature as typical study areas are extensively covered [27,64,65]. Research has emphasised the impact of diverse aspects, such as underlying soil or vegetation, the presence of dew on the cover, and the type of plastic material [24,28]. In our study, the underlying soil effects on the spectra of covered pixels was considered, and the spectra of covered areas were distinctly different from bare soil. Cambisols were best analysed separately, and accordingly, the spectral angle mapper resulted in high precision of the detection. The crops where these covers were identified included maize, root crops, and brassicas ( Figure 15). The detection of artificial covers is crucial to agricultural waste management and a starting point to prioritise areas of micro-plastic residues in agricultural soil.
Non-plastic greenhouses showed major anomalies in the data including very high surface reflectances, indicating the presence of high specular reflection [66]. Both plastic and non-plastic greenhouses are challenging to detect using pixel-based approaches due to the heterogeneity of the spectra within the same greenhouse. There was the impact of the underlying structure and crops in the case of the large non-plastic greenhouses, while for the plastic greenhouses, there was the impact of borders around each greenhouse. Due to those reasons, researchers have suggested the use of object-based approaches [67]. Since the integrity of each parcel was preserved, the pixel-based approach was also powerful where once pixels were labelled as greenhouse pixels, the corresponding parcels were labelled accordingly. SAM was used for identifying non-plastic greenhouses, while revealing their scattered locations and their planned crops for 2020 with a dominant presence of strawberry cultivation. Plastic greenhouses resulted in spectra with high similarities to baresoil spectra, thus the RPGI index was used for their detection. A suitable RPGI threshold was 0.4 emphasising the high mixed content of the pixels where the plastic greenhouses were estimated to represent 50% of a pixel's content while the rest was attributed to the surroundings [18]. Plastic greenhouses were limited in number during the considered temporal window and represented only 18% of all detected covers with major usage for carrots.
The top three ranked acquisition days provided 99.1% of the parcels with NDVI < 0.3 in the considered window, emphasising the reliability of the approach at different acquisition conditions. Our approach sets a foundation for analysing time-series of multiple years and monitoring of the use of artificial covers and their trends [22]. Compositing the data acquired from the three dates while preserving each parcel's spatial and temporal integrity resulted in a database of bare soil parcels ready for further analysis and in-line with the expected RED-NIR line representation ( Figure 19). Various anomalies were also excluded from the data and were mainly attributed to parcel borders. The insights on temporary covers from three acquisition dates allowed mapping all artificial soil covers detected in the considered window ( Figure 18) and revealed that artificial covers occurred most frequently in maize parcels ( Figure 20) .

Conclusions
We developed a methodology to identify potential bare soil pixels in uncovered agricultural parcels during the onset of cultivation when light conditions are optimal for further spectral analysis. We tested the methodology on Flanders, the northern region of Belgium, for the period mid April to end May when highly active agricultural management practices can leave the soil bare in preparation for main cropping seasons. The methodology provided a workflow to extract potentially bare soil by excluding the impacts of clouds, shadow, vegetated areas, and artificial soil covers in the dataset while preserving a parcel's spatial and temporal integrity.
The spectral angle mapper was used along with a plastic greenhouse index to identify various types of artificial covers, namely temporary soil covers, non-plastic greenhouses, and plastic greenhouses. The approach resulted in the detection of temporary covers and non-plastic greenhouses with P r exceeding 0.9, showing the low commission errors of the approach, while resulting in low P r for plastic greenhouses due to their low numbers in the study area and in the considered period. The approach achieved higher S p values for non-plastic and plastic greenhouses indicating a good detection, while S p for temporary covers was around 0.7.
The approach presented in this paper provides a foundation to serve the dual purpose of characterising bare soil and identifying soil cover types of artificially covered agricultural parcels. The presented approach and resulting database provide a tool for setting up detailed soil analysis and monitoring programmes. The identification of soil covers, including plastic mulch, provides new insights on agricultural management practices, contributes to agricultural waste management, and can support prioritisation of areas with potential mulch residues and micro-plastics in the soil. GEE's cloud computing platform along with Sentinel-2's freely available data have been supports to a great transformation in digital agriculture, and this work contributes to this transformation.