Knowledge-Based Classiﬁcation of Grassland Ecosystem Based on Multi-Temporal WorldView-2 Data and FAO-LCCS Taxonomy

: Grassland ecosystems can provide a variety of services for humans, such as carbon storage, food production, crop pollination and pest regulation. However, grasslands are today one of the most endangered ecosystems due to land use change, agricultural intensiﬁcation, land abandonment as well as climate change. The present study explores the performance of a knowledge-driven GEOgraphic-Object—based Image Analysis (GEOBIA) learning scheme to classify Very High Resolution (VHR) images for natural grassland ecosystem mapping. The classiﬁcation was applied to a Natura 2000 protected area in Southern Italy. The Food and Agricultural Organization Land Cover Classiﬁcation System (FAO-LCCS) hierarchical scheme was instantiated in the learning phase of the algorithm. Four multi-temporal WorldView-2 (WV-2) images were classiﬁed by combining plant phenology and agricultural practices rules with prior-image spectral knowledge. Drawing on this knowledge, spectral bands and entropy features from one single date (Post Peak of Biomass) were ﬁrstly used for multiple-scale image segmentation into Small Objects (SO) and Large Objects (LO). Thereafter, SO were labelled by considering spectral and context-sensitive features from the whole multi-seasonal data set available together with ancillary data. Lastly, the labelled SO were overlaid to LO segments and, in turn, the latter were labelled by adopting FAO-LCCS criteria about the SOs presence dominance in each LO. Ground reference samples were used only for validating the SO and LO output maps. The knowledge driven GEOBIA classiﬁer for SO classiﬁcation obtained an OA value of 97.35% with an error of 0.04. For LO classiﬁcation the value was 75.09% with an error of 0.70. At SO scale, grasslands ecosystem was classiﬁed with 92.6%, 99.9% and 96.1% of User’s, Producer’s Accuracy and F1-score, respectively. The ﬁndings reported indicate that the knowledge-driven approach not only can be applied for (semi)natural grasslands ecosystem mapping in vast and not accessible areas but can also reduce the costs of ground truth data acquisition. The approach used may provide di ﬀ erent level of details (small and large objects in the scene) but also indicates how to design and validate local conservation policies. M.A.; Project administration, P.B.; M.A., V.T., C.T., Writing—original &


Introduction
In 2011 the European Commission implemented the European (EU) Biodiversity Strategy in order to prevent the decline of biodiversity and ecosystem services in the Member States by 2020. The strategy compelled EU Member States to undertake steps in order to safeguard endangered species and ecosystems within the international Convention on Biological Diversity (CBD) in 2010. To this purpose, the European Commission established six measurable targets to be reached by 2020. Action 5, target 2 of the Biodiversity Strategy commits EU Member States to map and assess the extension and conservation status of ecosystems and their services [1,2].
Natural and semi-natural grasslands are among the most important ecosystem. Their conservation can strongly contribute to carbon dynamics, biodiversity and other ecosystem services, such as night cooling, soil erosion control and flood mitigation [3,4]. Besides, grasslands ecosystems can be directly affected by the increasing competition of land use for food and energy production [5], with a consequent dramatic decline of biodiversity [6][7][8]. In view of the decline suffered by grasslands, area-wide inventories and monitoring capacities have been requested for conservation and restoration purposes [9,10].
Earth Observation (EO) data and automatic classification techniques can support the mapping of grassland ecosystems. However, the spectral signature of such ecosystem can be rather complex due to the heterogeneous nature of the habitats composing them [3,11]. Recently, successful attempts in the EO based mapping of different grassland types have been reported in the literature [7,8,[12][13][14][15]. In these studies, the main challenges discussed concern: (a) the small spatial extent of grasslands habitats; (b) their spectral similarity and (c) the high spatial, structural and temporal diversity of the vegetation composition [8,12,[16][17][18][19]. Such peculiarities may hamper automatic grasslands classification from satellite data.
Very High Resolution (VHR) sensors (e.g., Worldview-2/3) may provide reliable fine scale measurements [3]. Nevertheless, grassland encroachment phenomena, related to land abandonment, livestock grazing reduction and wood harvesting practices, could be quantified through VHR data. To the best of our knowledge, only a few studies have used VHR images for automatic grassland monitoring. Such studies have mainly focused on pixel-based approach.
In particular, a Support Vector Machine (SVM) classifier was used to analyze, both separately and in combination, a series of four optical (5 m to 30 m) and five SAR images (12 m), for discriminating natural grasslands from croplands in areas affected by frequent clouds [29]. SVM was also used to distinguish seven grasslands habitats by integrating inter-annual time series of RapidEye and TerraSAR-X images [11]. This paper reported an OA value larger than 90%. The data driven Random Forest (RF) classifier was used for mapping different types of grassland communities by analyzing Landsat and Worldview-2 data [30]. RBF was applied to RapidEye imagery in [31]. The findings reported by the aforementioned studies are encouraging. However, on the one hand, the results obtained through the pixel-based approach have been considered inadequate for VHR image classification due to the lack of semantic meaning in the real world [32]; on the other hand, the data driven approach can be time consuming and costly for the collection of reference training dataset.
The above considerations have induced a shift from a pixel-based to an object-based approach for the classification of VHR imagery [32]. The object-based approach has proven advantageous for Land Cover (LC) and habitat mapping in agricultural and urban studies [33,34]. As a result, the number of papers referencing the technique called GEOgraphic-Object-Based Image Analysis (GEOBIA) has been increasing rapidly [35]. Actually, GEOBIA has proven to be more accurate than pixel-based approaches [33]. This technique can handle objects features, such as mean spectral response per The area is characterized by a karstic plateau with an altitude range of 300-686 m a.s.l. and is characterized mainly by Cretaceous calcareous bedrocks [49]. The climate is sub-Mediterranean, with annual temperatures ranging from 7 to 25 °C and rainfall of 570 to 700 mm per year, mostly in the autumn-winter period, with occasional snowfall above 500 m a.s.l. The phytoclimate is semi-continental, with meso-Mediterranean thermo-type and dry to sub-humid ombro-type [50].
The site is characterized by a typical Mediterranean agro-pastoral landscape, where semi-natural rocky dry grasslands, traditionally used as extensive pastures, cover almost 24% of the total area, while forest vegetation consists only of residual patches of downy oak (Quercus pubescens s.l.) woodlands and Aleppo pine (Pinus halepensis) plantations. Grasslands are primarily characterized by the endemic feathergrass Stipa austroitalica (Annex II Habitat Directive) and host numerous endemic and rare species. [51,52]. Therefore, Alta Murgia represents one of the most important areas for grassland ecosystems conservation in Italy.
Substantial losses of this ecosystem type have been observed since the beginning of the 20th century; to date, grasslands cover about 29,800 ha and represent what remains of the 80,000 ha existing at the beginning of the 20th century [53,54]. The main pressures (e.g., habitat fragmentation and degradation, woody encroachment) on local biodiversity are due to both agricultural intensification and land abandonment [55][56][57].
As with most grasslands of the western Palaearctic region, these semi-natural communities are among the most species-rich habitats in the world. The grasslands developed through a mix of anthropogenic and natural processes of grazing by domestic stock, cutting and deliberate light burning regimes, and they are still maintained by human activities, mainly through grazing [55,58- The area is characterized by a karstic plateau with an altitude range of 300-686 m a.s.l. and is characterized mainly by Cretaceous calcareous bedrocks [49]. The climate is sub-Mediterranean, with annual temperatures ranging from 7 to 25 • C and rainfall of 570 to 700 mm per year, mostly in the autumn-winter period, with occasional snowfall above 500 m a.s.l. The phytoclimate is semi-continental, with meso-Mediterranean thermo-type and dry to sub-humid ombro-type [50].
The site is characterized by a typical Mediterranean agro-pastoral landscape, where semi-natural rocky dry grasslands, traditionally used as extensive pastures, cover almost 24% of the total area, while forest vegetation consists only of residual patches of downy oak (Quercus pubescens s.l.) woodlands and Aleppo pine (Pinus halepensis) plantations. Grasslands are primarily characterized by the endemic feathergrass Stipa austroitalica (Annex II Habitat Directive) and host numerous endemic and rare species. [51,52]. Therefore, Alta Murgia represents one of the most important areas for grassland ecosystems conservation in Italy.
Substantial losses of this ecosystem type have been observed since the beginning of the 20th century; to date, grasslands cover about 29,800 ha and represent what remains of the 80,000 ha existing at the beginning of the 20th century [53,54]. The main pressures (e.g., habitat fragmentation and degradation, woody encroachment) on local biodiversity are due to both agricultural intensification and land abandonment [55][56][57].
As with most grasslands of the western Palaearctic region, these semi-natural communities are among the most species-rich habitats in the world. The grasslands developed through a mix of anthropogenic and natural processes of grazing by domestic stock, cutting and deliberate Remote Sens. 2020, 12, 1447 5 of 31 light burning regimes, and they are still maintained by human activities, mainly through grazing [55,[58][59][60]. The occurrence of plant communities included within the European Directive 92/43/EEC habitat categories (62A0-"Eastern sub-mediterranean dry grasslands (Scorzoneretalia villosae); 6220*-Pseudo-steppe with grasses and annuals of the Thero-Brachypodietea), together with other threatened plant and animal species, confirm the urgent need of actions aimed at implementing, monitoring and preserving endemic species and biodiversity in Murgia Alta grasslands.

In Situ Data Collection for Validation
A "stratified" sampling was adopted to ensure adequate samples for each class under consideration. This approach guarantees a selection of a minimum number of samples from each class (i.e., stratum) [61]. The polygons were collected through both in-field campaigns and visual interpretation of multi-temporal images. Such polygons corresponded to a total of 732,773 pixels that were considered separately only in the pixel-based validation process.
For the in-field campaigns, an open source mobile application (App) was used for ground data collection. This mobile App was developed within the framework of the Horizon 2020 ECOPOTENTIAL Project. The App can gather ground truth according to LCCS2 land cover class taxonomy [62][63][64]. For validating classification results, a total of 175 records were collected including grasslands, pastures, woodlands, scrublands, olive groves, vineyards, orchards, arable lands [64]. Table 1 presents the data set of Worldview-2 (WV-2) images used to produce the LC map of Murgia Alta site. The input images were pre-processed for the conversion of the Digital Number values in Top-of-Atmosphere reflectance and then geometrically co-registered. As discussed in [65], Top-of-Atmosphere reflectance values can guarantee a reduction of inter-scene variability across time and space when clear scenes are available. In addition, atmospheric correction requires ancillary data to be collected both at several locations within the satellite image frames and at the time of satellite acquisitions. However, such data are not generally available, thus atmospheric correction becomes an ill-or poorly posed issue [65]. Due to such considerations, no atmospheric correction of images was carried out.

The Imagery Data Set
VHR data can yield much information on the relationship between adjacent pixels (e.g., texture, shape). Such information can favor the identification of individual objects in the scene at multiple scale. The Digital Terrain Model (DTM) provided by Apulia Region was used as ancillary data characterized by a spatial resolution of 8 m.

LC Taxonomy
Tomaselli et al., 2013 [46], argue that the LCCS can be very useful for classification in ecological studies [62]. In LCCS, a land cover class can be defined by the combination of a set of independent diagnostic criteria, termed "classifiers", hierarchically arranged. Since the set of criteria can be indefinitely enlarged, LCCS results in an open (expandable) classification system with a virtually infinite number of mutually exclusive classes. The availability of a very flexible taxonomy would allow to fit the heterogeneity of grasslands.
The Modular-Hierarchical phase uses a combination of a predefined set of classifiers to provide more detailed land cover classes (Level 4). In each set, the classifiers are divided into three groups: (a) pure land cover classifiers; (b) environmental attributes; (c) specific technical attributes. Among LCCS classifiers, Table 2 reports the set of classifiers instantiated in the present work. Based on the FAO-LCCS taxonomy, grasslands in Murgia Alta can be coded as A12/A2.A6, where the dichotomous code A12 represents a natural and semi-natural terrestrial vegetated class, A2 stands for herbaceous life form and A6 is for graminoids leaf type, respectively. When available, leaf phenology information can be added using the codes E6 for perennial or E7 for annual graminoids.
Thus, unlike other LC taxonomies mostly relying on a set of pre-defined classes, LCCS allows the definition of many land cover classes through the combination of classifiers, which follow independent diagnostic criteria [62].

Expert Knowledge Elicitation
In this work, LCCS classifiers have been implemented in the classification algorithm developed for satellite image analysis along with expert knowledge rules from ecologists related to class description. Such description includes information about: (1) Core and context class components. In VHR images, classes can appear to be composed of individually detectable core elements and other surrounding elements constituting the context. As an example, an olive grove field is composed by olive trees, as class core elements, surrounded by bare soil, grassland or emerging rocks as context background [66,67]. (2) Class phenology. Vegetated class discrimination depends on different period of peak of biomass and plant development. Agricultural class discrimination is based on periodicity of agricultural practices (i.e., ploughing, harvesting/mowing). Aquatic class discrimination requires water seasonality information [46].
Expert knowledge exploitation depends on the implementation of an if-then rules set. Phenology and agricultural practices information can contribute to the selection of the most suitable multi-seasonal EO images, whereas core and context class description can support the selection of specific spectral, morphological, contextual and topological features and indices useful for target classes discrimination [68][69][70].
The list of expected LC classes in the scene are reported in Table 3. For Murgia Alta site, the information provided by ecologists to describe the classes (Table 3) is represented in Figures 2 and 3. The former summarizes the phenology of vegetated classes, the latter represents the agricultural practices of cultivated vegetation. Such a scheme can be modified and adapted to any site. Remote Sens. 2020, 12, x FOR PEER REVIEW 8 of 32

The Algorithm
The classification procedure is composed by (a) the segmentation module and (b) the classification module, which can yield LC classification up to LCCS Level 4. Figure 4 shows the classification procedure flowchart.

The Algorithm
The classification procedure is composed by (a) the segmentation module and (b) the classification module, which can yield LC classification up to LCCS Level 4. Figure 4 shows the classification procedure flowchart.

The Algorithm
The classification procedure is composed by (a) the segmentation module and (b) the classification module, which can yield LC classification up to LCCS Level 4. Figure 4 shows the classification procedure flowchart.
Among the eight Worldview-2 bands available, only the Red, Green, Blue and NIR1 ones were intentionally considered as input to the system, for each date. The four bands were selected to offer a data processing methodology that may be applied also to other existing VHR data, such as the ones collected either on board of UAV and airplane platforms or from previous missions (e.g., Quickbird, Geoeye, IKONOS). The methodology could be applied to past and future data for change detection.
The PoB image (Table 1) resulted very useful for the discrimination of vegetated areas. The PrePoB image was mainly used for the disambiguation of evergreen and winter-deciduous woody classes ( Figure 1a). The PostPoB image was mainly used for the identification of herbaceous graminoids cultivated classes (Figure 1b). The DS image was used for the disambiguation between herbaceous and woody classes. The latter was the only exhibiting a photosynthetic activity during the summer season ( Figure 1c). Among the eight Worldview-2 bands available, only the Red, Green, Blue and NIR1 ones were intentionally considered as input to the system, for each date. The four bands were selected to offer a data processing methodology that may be applied also to other existing VHR data, such as the ones collected either on board of UAV and airplane platforms or from previous missions (e.g., Quickbird, Geoeye, IKONOS). The methodology could be applied to past and future data for change detection.
The PoB image (Table 1) resulted very useful for the discrimination of vegetated areas. The PrePoB image was mainly used for the disambiguation of evergreen and winter-deciduous woody classes ( Figure 1a). The PostPoB image was mainly used for the identification of herbaceous graminoids cultivated classes ( Figure 1b). The DS image was used for the disambiguation between herbaceous and woody classes. The latter was the only exhibiting a photosynthetic activity during the summer season ( Figure 1c).

SO and LO Image Segmentation
In the present study, two different scales, i.e., the small object (SO) and the Large Object (LO) ones, were selected to segment VHR input imagery into landscape elements [71]. The two segmentation scales were carried out independently within an eCognition 8.0 framework [72].
To detect both SO and LO, only the spectral features of a single-date image were used, i.e., the October PostPoB image (R-G-B-Nir1 bands). This choice was based on the analysis of class separability values, with focus on the target class, i.e., grasslands. The Jeffreys-Matusita distance (JM) was used as separability indicator. As discussed by Hao et al., 2015 [73], previous research had shown that JM distance can provide a more accurate separability indicator than other distance measures, such as Euclidean distance or divergence. In particular, the multiclass extension of Jeffreys-Matusita distance, computed according to [74] for measuring the separability of grasslands with respect to other classes was considered to evaluate the best input images for segmentation. The analysis was carried out on both each single date and the whole multi-temporal dataset. The values of JM obtained were 2.2 for the PostPoB image, 2.0 for DS image, 1.6 for PrePoB image and 1.4 for

SO and LO Image Segmentation
In the present study, two different scales, i.e., the small object (SO) and the Large Object (LO) ones, were selected to segment VHR input imagery into landscape elements [71]. The two segmentation scales were carried out independently within an eCognition 8.0 framework [72].
To detect both SO and LO, only the spectral features of a single-date image were used, i.e., the October PostPoB image (R-G-B-Nir1 bands). This choice was based on the analysis of class separability values, with focus on the target class, i.e., grasslands. The Jeffreys-Matusita distance (JM) was used as separability indicator. As discussed by Hao et al., 2015 [73], previous research had shown that JM distance can provide a more accurate separability indicator than other distance measures, such as Euclidean distance or divergence. In particular, the multiclass extension of Jeffreys-Matusita distance, computed according to [74] for measuring the separability of grasslands with respect to other classes was considered to evaluate the best input images for segmentation. The analysis was carried out on both each single date and the whole multi-temporal dataset. The values of JM obtained were 2.2 for the PostPoB image, 2.0 for DS image, 1.6 for PrePoB image and 1.4 for PoB image. The value obtained for the whole multi-temporal data set was quite comparable (i.e., 2.3) with the one from PostPoB image (i.e., 2.2). Thus, we chose to use the PostPoB image only in the segmentation step for feature reduction purposes. This choice reduced the computational time of VHR image processing.
In addition, the PostPoB image appeared to better evidence not only the spectral differences between natural and agricultural herbaceous classes but also the contrast between tree crowns and their herbaceous context. Spectral band values were used for both SO and LO segmentation. For LO segmentation only, the 1st order entropy textural feature was added to discriminate high vegetation from low vegetation [75].
The SO scale was aimed at detecting all small elements in the scene (e.g., tree crowns) ( Figure 5a) characterized by spectral homogeneity. For SOs identification, the eCognition spectral differences algorithm was applied. This algorithm merges neighbor pixels that fall within a user-defined maximum spectral difference. The spectral difference between adjacent pixels is calculated by considering their spectral values in the R-G-B-Nir1 bands of the PostPoB image [76]. In our study, the maximum spectral difference was set to 0.015 by trial and error. The main criteria used for such trial-error process was to recognize some of the finest elements (e.g., tree crowns) of the landscape, without over-segmenting the scene. Thus, visual interpretation of the different output segmented scenes addressed the parameter selection.
The LO scale was aimed at identifying landscape macro-structure classes (e.g., agricultural fields, olive groves) (Figure 5b). For LO identification, the multi-resolution image segmentation algorithm was used, since it can handle both spectral and context-sensitive features. This algorithm requires the definition of the following parameters: scale, shape or color heterogeneity, compactness or smoothness and weight of each spectral band [72]. Shape is complementary to color heterogeneity, whereas compactness is complementary to smoothness heterogeneity [77]. Thus, the eCognition algorithm used requires the setting of four independent parameters, namely scale factor, shape, compactness and weight of each spectral band. The first parameter controls the size of image objects [35]; the second quantifies the weight assigned to the object shape with respect to its spectral property, i.e., color; the third controls the balance between the object shape and its edge length [35].
The SO scale was aimed at detecting all small elements in the scene (e.g., tree crowns) ( Figure  5a) characterized by spectral homogeneity. For SOs identification, the eCognition spectral differences algorithm was applied. This algorithm merges neighbor pixels that fall within a user-defined maximum spectral difference. The spectral difference between adjacent pixels is calculated by considering their spectral values in the R-G-B-Nir1 bands of the PostPoB image [76]. In our study, the maximum spectral difference was set to 0.015 by trial and error. The main criteria used for such trial-error process was to recognize some of the finest elements (e.g., tree crowns) of the landscape, without over-segmenting the scene. Thus, visual interpretation of the different output segmented scenes addressed the parameter selection.
The LO scale was aimed at identifying landscape macro-structure classes (e.g., agricultural fields, olive groves) (Figure 5b). For LO identification, the multi-resolution image segmentation algorithm was used, since it can handle both spectral and context-sensitive features. This algorithm requires the definition of the following parameters: scale, shape or color heterogeneity, compactness or smoothness and weight of each spectral band [72]. Shape is complementary to color heterogeneity, whereas compactness is complementary to smoothness heterogeneity [77]. Thus, the eCognition algorithm used requires the setting of four independent parameters, namely scale factor, shape, compactness and weight of each spectral band. The first parameter controls the size of image objects [35]; the second quantifies the weight assigned to the object shape with respect to its spectral property, i.e., color; the third controls the balance between the object shape and its edge length [35].
In our study, the optimal segmentation parameters were selected through a supervised iterative process by tuning these parameters [35,36]. The optimal values found were: 5 for scale, 0.1 for shape and 0.1 for compactness, respectively. In addition, all the bands used in segmentation were equally weighted.
(a) (b) Figure 5. Different segmentations are superimposed to a detail of the input images: a finer "small objects" one (a) and a coarser "large objects" one (b).

SO Classification
On the previous stage, each image segment identified can be considered a unit of analysis for which a number of attributes, including spectral response, texture, shape and location, can be In our study, the optimal segmentation parameters were selected through a supervised iterative process by tuning these parameters [35,36]. The optimal values found were: 5 for scale, 0.1 for shape and 0.1 for compactness, respectively. In addition, all the bands used in segmentation were equally weighted.

SO Classification
On the previous stage, each image segment identified can be considered a unit of analysis for which a number of attributes, including spectral response, texture, shape and location, can be extracted and used for classification. Specifically, every SO found in the image was labeled through LCCS classifiers related to Life Form, Leaf phenology and Leaf Type (Section 3.3.2).
To identify Level 1 and Level 2 LCCS classes, all the objects found in the SO map were classified by exploiting spectral indices only. Many indices were tested [44], the ones selected are summarized in Table 4. In the Table 4, ρ represents the TOA reflectance calculated as mean value of the image pixels inside each SO. The indices were extracted from the full set of multi-seasonal WV-2 images available.
Specifically, the Green/Red Ratio (GRR) [78] was used as a proxy for describing the photosynthetic status of vegetated classes [79]. Blue/NIR Ratio (BNR) performs well in discriminating water covered pixels [80]. The Brightness index was proven to be effective in identifying non-vegetated areas [44]. The indices reported were combined through a set of if-then rules grounded in remote sensing expert knowledge [65,81].
To obtain LCCS Level 1 classes, first the rules summarized in Table 5 were applied. All image SOs resulted labelled into four temporary classes, i.e., Urban, Barren Land, Photosynthetic Vegetation and Shadowed Vegetation. At the end of the process, these classes were grouped into Vegetated (LCCS label A) and Non-Vegetated (LCCS label B).

LCCS Level 1 Final Classes
Thereafter, the BNR index (Table 6) was exploited to label the same SOs as intermediate two classes, i.e., Terrestrial (1) and Aquatic (2).

Spectral Rules Logical Operators Level 2 Intermediate Classes
After that, the final LCCS Level 2 label was assigned by considering all possible combinations of Level 1 final class labels (Table 5) and Level 2 intermediate labels ( Table 6) To discriminate Level 4 vegetated classes, the procedure identified a set of intermediate classes that were matched into the expected classes reported in Table 3. For this purpose, context-sensitive features were combined with DTM data and multi-temporal spectral indices.
Regarding spectral indices, logical operators were used to combine NDVI, GRR, Brightness to take into account the specific phenology of the vegetation classes and the agricultural practices of the study area. Specifically, the Brightness from the DS image was used to distinguish between perennial natural and cultivated herbaceous classes. This choice was based on the fact that in July graminoids cultivated fields appear bright due to mowing.
Concerning context-sensitive features, the texture first order entropy (occurrence measure) [82], from the green band of the PostPoB image, namely Eg(PostPoB), was used to discriminate herbaceous and woody vegetation. Texture was also found related to the sparseness of vegetation. The window size used to compute entropy was selected in order to match the scale of heterogeneity related to variations in vegetation height, distribution and structure.
The rule sets implemented for LCCS Level 4 mapping are reported in the decision table shown in Figure 5. Such table summarizes the different decisions to be taken according to different possible conditions. On the left upper part of the table, the possible input conditions are listed, one per each row. On the right upper part of the table, each column provides a possible combination of input conditions. The lower left part of Figure 5, indicates the various actions that can be taken, one per row. In the right part, each column reports whether that decision is taken for the specific combination of conditions [83]. In this study, the decision represents the label to be assigned to each input SO according to the combination of conditions reported in the upper part of the same Figure. As a result, for each input vegetated (A1) SO, an LCCS Level 4 class label was identified. The label was then assigned to each pixels of the SO object considered in the output map.
The rules from the ecologists are reported in the first column of Figure 6 (upper part), whereas the rules from the remote sensing experts reported in the second column of the same Figure (upper part) involve spectral rules and ancillary data, such as the DTM. The latter (line 9 of Figure 6) was introduced to support the discrimination of woody cultivated from other woody classes. In fact, cultivated woody trees (e.g., olive trees and orchards) are located at low elevation in Murgia Alta.
Complementary topological rules related to adjacency of barren land and herbaceous SOs to cultivated trees SOs were introduced to strengthen the final class decision but are not reported in the table. Contrary to the expert rule reported in Figure 3, some olive groves in April do not appear ploughed due to either new olive grove grassing techniques or to land abandonment.

LO Classification
In the LO classification process, each segment was labeled according to LCCS rules based on the dominance criteria of SO classes within that LO. The dominance criteria were built upon a set of thresholds on the proportion of each SO class coverage included in the LO considered. The FAO-LCCS rules employed concerned only vegetated classes and involved Spatial Distribution and Spatial Aspect classifiers (Section 3.3.3), for natural vegetation and cultivated vegetation, respectively ( Table 2).
According to LCCS definition, for natural and semi-natural vegetation classification, i.e., A12 classes, Spatial Distribution involves two classifiers: C1 for continuous and C2 for fragmented vegetation. These classifiers are based on the percentage of LO covered by the vegetation type considered, compared to other classes present in that LO. For Cultivated and Managed Terrestrial Areas classification (A11), LCCS Spatial Aspect classifiers (B5 for continuous, B6 for fragmented) definition draws on rules based on the percentage of LO covered by one cultivated class type compared to other land cover classes present in the same LO. When two or three different LC classes are found in a LO without any specific class dominance, the latter is labeled as Mixed Unit.  The rules from the ecologists are reported in the first column of Figure 5 (upper part), whereas the rules from the remote sensing experts reported in the second column of the same Figure (upper part) involve spectral rules and ancillary data, such as the DTM. The latter (line 9 of Figure 5) was introduced to support the discrimination of woody cultivated from other woody classes. In fact, cultivated woody trees (e.g., olive trees and orchards) are located at low elevation in Murgia Alta.
Complementary topological rules related to adjacency of barren land and herbaceous SOs to cultivated trees SOs were introduced to strengthen the final class decision but are not reported in the table. Contrary to the expert rule reported in Figure 3, some olive groves in April do not appear ploughed due to either new olive grove grassing techniques or to land abandonment.

LO Classification
In the LO classification process, each segment was labeled according to LCCS rules based on the dominance criteria of SO classes within that LO. The dominance criteria were built upon a set of thresholds on the proportion of each SO class coverage included in the LO considered. The FAO-LCCS rules employed concerned only vegetated classes and involved Spatial Distribution and Spatial Aspect classifiers (Section 3.3.3), for natural vegetation and cultivated vegetation, respectively ( Table 2). In our study, the following rules were implemented to automatically classify LOs:

•
When inside an LO, there was a class dominance, i.e., more than 80 percent of the pixels were labeled as a specific A12 class, the procedure assigned to the same label as the dominant class together with the C1 code (i.e., continuous spatial distribution) to the LO investigated ( Figure 7a).

•
When inside the LO, there was a class dominance, i.e., more than 50 percent but less than 80 of pixels labeled as a specific A12 class, and if neither of the other SO elements reached a cover exceeding 20 percent, the procedure assigned to the analyzed LO the same label as that of the dominant class together with the C2 code (Figure 7b). • When more than 50 percent of cultivated SOs were found inside the LO analyzed, LO is to have A11 label with Continuous Spatial Aspect code (B5) (Figure 7c).

•
When LO pixels belonging to a SO class X covered less than 80% but more than 50% of the LO extension, and a second SO class Y covered more than 20% but less than 50%, the LO was labelled as X/Y. This labelling indicates that both classes X and Y from SOs are present in the LO (co-presence), with class X covering the majority of the LO area. Specifically, when class X belonged to A12, the C2 code was added. When Y class belonged to class A11, code B6 (scattered clustered) was added to the Y class label [58] (Figure 7d) • When the LO pixels met none of the above-mentioned rules, the LO was labeled as Mixed Unit.

•
Due to the lack of specific rules for non-vegetated classes, in this study, we adopted the label "Mixed Units with artificial" to identify classes including both vegetated and artificial components. Instead when LOs included only SO of either B15 or B16 classes, the LO was labeled as single B15 or B16 unit.

Accuracy Assessment
To identify sources of classification errors, in the automatic LCCS hierarchical classification scheme, accuracy assessment was carried out for each hierarchical classification step. Thus, the study provided the final SO output Confusion Matrix (CM) and four additional CMs, generated by Level 3 classes, LifeForm, Leaf type, Leaf phenology classifiers. Moreover, for LO validation, two additional CMs were produced. The first concerns the output of the Spatial Distribution or Spatial Aspect classifiers; the second concerns the Level 3 label of the co-dominant object (case (c) of Section 3.3.3).
EMA formulation is based on the reference marginal vector entropy conditional to the classified marginal vector H(Ref|Class). Conditional entropy measures the information of the reference marginal vector, which may have been missed by the classified one. EMA is invariant to permutations in the CM columns and independent from the CM's diagonal. This characteristic allows to evaluate classification accuracy without explicit matching between the reference and classified class labels, as in cases of non-square CMs.
In the present study, EMA was calculated as follows: Such formulation was introduced for normalizing EMA measurements to the percentage of correctly classified observations. LO classification validation was performed by randomly selecting LO segments found on the output map. Such segments were labelled either by expert visual interpretation of the multi-temporal VHR data set or through in-field campaigns. Specifically, a total of 1586 objects were considered for LO reference data set in the validation process. This validation approach is commonly used for the accuracy assessment of LC maps obtained through GEOBIA [85,86].

Results
The investigation was carried out to assess the effectiveness of an object-oriented knowledge driven classifier of multi-temporal VHR images instantiated according to the FAO-LCCS hierarchical classification scheme into SO and LO output maps. This application has yielded encouraging results for LC mapping in Murgia Alta National Park and subsequent extraction of the relative natural grassland ecosystem layer.

SO Classification
Some classes in VHR imagery correspond to core SO components surrounded by context SOs. For instance, an olive grove can include not only olive trees (core elements, A11/(A1orA2).A7.A9), which at the SO scale, can be individually observed, but also context components, such as herbaceous or barren land objects. The context components were labelled as bare or vegetated context in the list of SO output classes (Table 3). Figure 8 reports the CM related to SO output classes. The OA value from the CM resulted quite high, i.e., 97.35%, with an error of 0.04%. The UA and PA percentage values and the F1-score obtained for each class of the SO map are shown in Table 7.    The results reported indicate that: (a) The rule sets implemented perform well to discriminate natural vegetation classes. In particular, with exception to A12/A1.D1.E1 (natural/woody.broadleaved.evergreen), the F1-score resulted greater than or equal to 90%. (b) The rules appear to be less effective for cultivated woody classes discrimination.
The F1-score values of A11/A1orA2.A7.A9 (cultivated/trees or shrubs.broadleaved.evergreen) and A11/A1orA2.A7.A10 (cultivated/trees or shrubs.broadleaved.deciduous) resulted to be 37.9 and 66.5, respectively. (c) The CM indicates that cultivated and natural woody classes were effectively discriminated, based on DTM measurements, whereas core and context components of the cultivated classes were less distinguishable. This might be due to both ortho-rectification issues as well as to different tree crown cover. (d) The F1-score of A12/A2.A6 (natural/herbaceous.graminoids) resulted quite high, i.e., 96.1%.
However, the CM evidenced some misclassifications between such class and A11/A3 (cultivated/ herbaceous). On the contrary, A12/A2.A6 (natural herbaceous.graminoids) resulted to be well discriminated from all natural woody classes. Figure 9 shows close-ups of Murgia Alta. The Figure 9a,b show that natural and semi-natural grasslands have complex and highly fragmented patterns. Such fragmentation is due to both the interference of cultivated herbaceous vegetation within grassland patches and the encroachment by woody broadleaved deciduous vegetation.
Remote Sens. 2020, 12, x FOR PEER REVIEW 18 of 32 (a) The rule sets implemented perform well to discriminate natural vegetation classes. In particular, with exception to A12/A1.D1.E1 (natural/woody.broadleaved.evergreen), the F1-score resulted greater than or equal to 90%. (b) The rules appear to be less effective for cultivated woody classes discrimination. The F1-score values of A11/A1orA2.A7.A9 (cultivated/trees or shrubs.broadleaved.evergreen) and A11/A1orA2.A7.A10 (cultivated/trees or shrubs.broadleaved.deciduous) resulted to be 37.9 and 66.5, respectively. (c) The CM indicates that cultivated and natural woody classes were effectively discriminated, based on DTM measurements, whereas core and context components of the cultivated classes were less distinguishable. This might be due to both ortho-rectification issues as well as to different tree crown cover. (d) The F1-score of A12/A2.A6 (natural/herbaceous.graminoids) resulted quite high, i.e., 96.1%.
However, the CM evidenced some misclassifications between such class and A11/A3 (cultivated/ herbaceous). On the contrary, A12/A2.A6 (natural herbaceous.graminoids) resulted to be well discriminated from all natural woody classes. Figure 8 shows close-ups of Murgia Alta. The Figures 9a,b show that natural and semi-natural grasslands have complex and highly fragmented patterns. Such fragmentation is due to both the interference of cultivated herbaceous vegetation within grassland patches and the encroachment by woody broadleaved deciduous vegetation.  Hierarchical Accuracy Assessment Table 8 reports the classification accuracies obtained from CMs for the different LCCS classifiers (levels) together with the final SO map overall accuracy. The percentage of the reference data (P) used for validation purposes is reported in the last column of Table 8. The full set of reference class data set, i.e., p = 100%, was used only to obtain the OA and EMA values of the final output map, whereas for intermediate hierarchical classification layers (i.e., Level 3,Level 4), p values were lower due to the fact that the classifiers used (e.g., Life Form) involve only some of the reference data.
LCCS Life Form classifier involves the discrimination of woody/herbaceous classes while Leaf Type concerns the discrimination between broadleaved/needleleaved classes of only woody vegetation objects. The choice to consider only the latter classes was motivated by the fact that in Murgia Alta, herbaceous vegetation mainly consists of graminoids classes. In view of this, no discrimination rule was adopted to distinguish between graminoids and forbs Leaf Type. Leaf Phenology allows to discriminate between evergreen/deciduous. In the area investigated, herbaceous vegetation includes annual and mixed phenology, thus Leaf Phenology was not considered for herbaceous vegetation.
The OA values reported in Table 8, evidence that the algorithm yielded a lower accuracy in identifying different Leaf Types compared to other classifiers considered (i.e, Life Form and Leaf Phenology). The lower accuracy reported may have been due to the fact that only NIR reflectance data were exploited to discriminate different Leaf Types.
The good performance obtained at the LifeForm classifier level (98.99%) confirms that the entropy measure is a suitable texture feature for the discrimination of herbaceous and woody vegetation ( Supplementary Materials for further results).
The same Table reports also the EMA values. As can be observed, EMA yielded slightly lower values than the ones obtained through the classic accuracy approach.
With the exception of the Leaf Type classifier, the EMA values obtained from each layer considered resulted higher than the value obtained at the end of the SO classification procedure (89.75%). The latter finding appears to confirm the weakness of the rules applied at Leaf Type level.
It can be noted that the values reported in Table 8 show the EMA values have similar fluctuations as the ones in OA values.  Table 9 reports the list of LO output classes, whereas Figure 10 shows the LO classified map. It should be noted that the labels in Table 9, bearing an asterisk (*) represent vegetation classes that, although not found by the proposed knowledge-based classifier, were identified by visual image interpretation of the input images. Classes bearing a plus symbol (+) are instead those LO classes that, although recognized by the classification algorithm, were not identified by visual interpretation. In fact, LO labelled A12/A1D1E2C2 (natural/woody broadleaved evergreen) by visual interpretation were never detected by the automatic classifier. The latter was able to identify, instead, objects labelled as A12/A1D1E1C2 (natural/woody broadleaved deciduous). This finding might be due to co-registration issues since young trees' crown may not cover more than 1-2 pixels. It should be noted that the labels in Table 9, bearing an asterisk (*) represent vegetation classes that, although not found by the proposed knowledge-based classifier, were identified by visual image interpretation of the input images. Classes bearing a plus symbol (+) are instead those LO classes that, although recognized by the classification algorithm, were not identified by visual interpretation. In fact, LO labelled A12/A1D1E2C2 (natural/woody broadleaved evergreen) by visual interpretation were never detected by the automatic classifier. The latter was able to identify, instead, objects labelled as A12/A1D1E1C2 (natural/woody broadleaved deciduous). This finding might be due to co-registration issues since young trees' crown may not cover more than 1-2 pixels.  Due to the inhomogeneity between classes obtained through VHR classification and those representing ground truth data, the CM results are rectangular. Therefore, in order to estimate the LO map accuracy, the CM was transformed in a square matrix [87]. Where a systematic mismatch occurred between automatic (i.e., classified) and visual (i.e., reference) evaluation of dominance criteria, the EMA measurement was more effective for evaluating LO classification accuracy. Table 10  The data presented in Table 10 indicate a rather good performance obtained for Life Forms, Leaf Type, Leaf Phenology and spatial distribution of the dominant class. The OA and EMA values from both single layers and the final map appeared to be lower than those reported in the validation of SO (Table 8). The following should be noted:

LO Classification
The classifier parameter selection by a trial and error procedure can generate an LO segmentation inadequate to represent the structure of landscape patches. As a result, non-existing LO classes can occur, such as the ones indicated with ( + ) in Table 9.

2.
LO map accuracy can be influenced by the SO map accuracy as well as by the lack of specific FAO-LCCS rules for non-vegetated classes.

Discussion
The results reported in the present study seem to encourage the use of the knowledge-driven object-based classification system validated through the use of both accuracy and EMA values for the mapping of natural grassland ecosystems. The main novelty of the proposed methodology is rooted in the implementation of the hierarchical classification scheme of the Food and Agricultural Organization (FAO) Land Cover Classification System (LCCS) taxonomy.
To the best of our knowledge, FAO-LCCS taxonomy has been used extensively to label ground reference class samples collected by in-field campaigns. Except for a recent paper [44], the hierarchical set of rules of such taxonomy has not been used in GEOBIA classifiers. In particular, the novelty of this study includes the way SO were used to label LO, according to specific FAO-LCCS spatial distribution and spatial aspect properties of natural and cultivated classes adopted. As a specific advancement of previous work in [44], such properties were adopted to evaluate the dominance of SOs within each LO.
In addition, the proposed approach guarantees consistency between the rules adopted by the GEOBIA classifier and the ones used to collect reference-validation samples during in-field campaigns [61]. Thus, this coherence facilitates the validation of the output maps.
The knowledge driven approach proposed is closer to the way end-users conceptualize their domain expertise; hence, it contributes to deepen the gap between image processing scientists and end-users. As evidenced in [40], working with symbolic knowledge enhances knowledge management. It also facilitates collaborative and interdisciplinary research by connecting concepts from different scientific domains (in ecology, agriculture, geo-health etc.) This paper offers a contribution in such direction.

SO Results Discussion
The proposed method seems to perform well in the classification of natural vegetation. In particular, the results obtained for A12/A2.A6 (natural/herbaceous.graminoids) classification, i.e., OA values 97.35% and 92.6%, 99.9% and 96.1% of UA, PA and F1-score, respectively (Table 7), encourage the application of SO classification method for the monitoring of grassland and grassland encroachment at VHR.
The less satisfactory performance obtained for A12/A1.D1.E1 (natural/woody.broadleaved.evergreen) classification can be explained in consideration of the fact that, in the study site, this class results highly fragmented due to their interspersion with broadleaved deciduous. The Hierarchical accuracy assessment shows that, at the level of LifeForms, the classes were well discriminated. This is due to the use of 1st order entropy to discriminate low and high vegetation. Such result is in agreement with the findings of previous studies [68,75].
As evident in the CM, problems cropped up in discriminating B15 and B16. The classification of these classes relies on the exploitation of the Brightness index. Since the Murgia Alta region is characterized by bright rocky outcrops as well as by bright soil containing grained rocks, B15 (artificial areas) may have been misclassified as B16 (bare soil).
The values obtained for the SO map resulted in OA equal to 97.35%, with error 0.04%, and in UA, PA and F1-score values for grasslands equal to 92.6%, 99.9% and 96.1%, respectively. Even though our results regard only one site, these values can be considered comparable or better than the values from previous pixel-based and object-based supervised approaches. For instance, Raab et al., 2018 [31], discriminated several grassland communities from RapidEye (5 m pixel size) imagery through a pixel-based RF supervised classifier and reported a mean OA of 80%. In their study, these authors used a dense data set including pre-spring, first spring, full-spring, early summer, midsummer late summer, early autumn, full autumn and late autumn images. Even though in their paper the authors used more detailed phenological information, the comparison between their results and ours suggests that, when analyzing VHR imagery, an object-based approach can provide better performance than a pixel-based approach.
Schuster et al., 2015 [11], assessed the suitability of a RapidEye time series (21 images) to identify several grassland species. The authors adopted a pixel-based, supervised data driven SVM classifier and obtained and OA of 90%, which cannot be directly compared to our results. In fact, their findings deal with the discrimination of more than one species of grasslands from VHR time series.
Melville et al., (2018) [30], used an object-based supervised RF classifier to analyze a single date Worldview-2 image for the classification of three grassland classes and other land cover types. In their study, they report an OA of 80%. This may have been due to the use of only one date; thus, the phenology of different vegetated classes cannot be adequately represented. For this reason, in our work, we have used multi-seasonal imagery, selected on the basis of prior phenology knowledge rules of the study classes.
Onojeghuo and Onojeghuo, (2017) [37] used an object-based SVM classifier to analyze satellite multispectral, airborne hyperspectral and LiDAR DTM data set for habitat mapping, including grasslands. In this study, the results showed a significant increase in classification accuracy when the DTM was combined with spectral information with a final OA of 89.8%. These findings confirm the utility of DTM in combination with spectral information for VHR data classification, as done also in our study to discriminate cultivated from natural woody vegetation. Clerici et al., (2017) [36], integrated radar and visible-near infrared data set for object-based supervised classification. The data driven SVM algorithm produced the most accurate LCLU map, including grasslands class with an OA of 88.75%. The grasslands UA and PA obtained were 81.25% and 18-75%, which are significantly lower than the values obtained with our knowledge-driven classifier. Xu et al., (2018) [27], focused on the classification of different grassland types from multi-temporal Landsat images, in combination with a 30-m DTM. The authors used both SVM and RF object-based classifier. The OBJA classifiers were able to differentiate different grassland types from multi-source data. The overall classification accuracy exceeded 90% using both classifiers, and the classification accuracy of the grassland types considered ranged from 61.64% to 98.71%. In agreement with our findings, Xu et al. evidenced that multi-temporal images and auxiliary data can improve grasslands classification.
A comparison between the results reported in the above-mentioned studies and the ones obtained in our study suggests that the hierarchical knowledge-driven object-based classifier can yield high OA values comparable with the ones obtained by both pixel-and object-based data-driven supervised classifiers. Unfortunately, it is not always easy to collect a large number of quality training samples due to limited time, access or interpretability constraints. A further complication is that if data quality is a concern, such as mislabeled data samples, then it may be necessary to select an algorithm that is less sensitive for such issues [35].
As recently evidenced in [40,41,88,89], knowledge-driven approaches based on enhanced computing capacities are gaining importance to analyze big Earth Observation data. The remote sensing community considers the development of knowledge-driven approaches to be one of the most important directions of their research. However, so far, little effort has been dedicated to formalizing, aggregating and sharing expert knowledge in remote sensing applications. Such knowledge could be used to develop ontologies.
Ontologies have the capacity to represent both symbolic and numeric knowledge, to reason based on cognitive semantics and to share knowledge on the interpretation of remote sensing images. This approach will generate new knowledge and services by integrating multiple source/domains information [90]. Gu et al., (2017) [42], used a hybrid approach to analyze a single date VHR ZY-3 image. In their work, first, a data driven LCCS hierarchical classification phase was carried out. Then, an ontology was used to relabel the objects. The ontology was based on the combination land cover classes obtained in the previous phase with various features selected by prior knowledge as well as with existing ontologies. In term of OA values, similar results were obtained with and without the ontology, i.e., 85.95% and 84.32%, respectively, whereas the specific grasslands UA improved from 78.26 to 85.37, after ontology exploitation. Such UA increase seems due to a better generalization capability. Compared to such results, our knowledge-driven approach provided both quite high values of OA (97.35%) and grassland PA (92.6%) values. This is mainly due not only to the use of multi-seasonal VHR images, but also to elicitation of phenology and agricultural practices rules.
It seems also worth noting that despite the complexity of the knowledge elicitation process required by our approach, the analysis of VHR images through the hierarchical classifier can represent better the heterogeneity of the grassland ecosystem and its dynamic.

LO Results Discussion
Following the SO classification, a LO classification was applied to represent the landscape heterogeneity through a multi-scale approach, which characterizes the GEOBIA paradigm [35]. When using different segmentation and labelling scales a crucial step is the merging of small-scale objects within the large-scale ones.
Watmough et al., (2017) [33], used segmentation scales ranging from the coarser to the finest and applied different features to each scale examined in order to provide both different classification layers and different output classes. Momeni et al., (2016) [34], applied three different segmentation scales without adopting any merging criteria. Zhang et al., (2014) [71], proposed a multi-scale segmentation with equispaced scales. The latter authors merge different scale objects when finding a complete overlap between the different layers. The findings discussed by them cannot be adequately compared with the ones obtained in the present study. In fact, in our study, we used the LCCS taxonomy dominance criteria (Section 3.3.3) to merge the vegetated objects found at different scales. As far as we know, such approach was not exploited in previous studies.
However, the results of the dominance criteria application may differ when automatically applied to image object analysis or when evaluated through image visual interpretation, or in-field campaigns. Our study tried to address this challenging validation issue in search for a possible explanation. By analyzing the results obtained, the following factors seem to influence our validation process. These factors include SO labelling and its impact on the SO classification accuracy, LO segmentation accuracy and the effectiveness of the LCCS dominance rules for the classification of LOs.
Specifically, the LO classification process can be hampered by errors in the labelling of vegetated SO objects corresponding to cultivated orchard and olive groves (Figure 8). These errors may be mainly caused by the presence of small tree crowns, which include only a small number of pixels. Due to geo-referencing error (order of one pixel), the leaf phenology classifier might fail in classifying SOs. This might be due to pixels spectral features that could correspond to different objects on the ground (e.g., tree crown, soil, shadow). As a consequence, SO dominance in the LO may be mistaken. In addition, classification of LOs including SO of both cultivated orchard and olive grove were labeled as mixed units, due to the lack of LCCs criteria for evaluating the dominance of cultivated classes.
Another source of error may be the segmentation scale adopted. The latter may have aggregated landscape elements that should have been labeled as distinct objects into single LOs.
Moreover, for woody natural vegetation classes, confusion between either C1 (continuous) and C2 (fragmented) labels or C2 with or without co-dominant component may occur. Furthermore, confusion in the dominance of grassland or woody components in the LO considered when both classes are co-present may occur.
Confusion in validation process may depend on the perception that the expert may have had of the class dominance when providing reference samples to be used for LO classification. Figure 11 can exemplify this event.
validation process. These factors include SO labelling and its impact on the SO classification accuracy, LO segmentation accuracy and the effectiveness of the LCCS dominance rules for the classification of LOs.
Specifically, the LO classification process can be hampered by errors in the labelling of vegetated SO objects corresponding to cultivated orchard and olive groves (Figure 8). These errors may be mainly caused by the presence of small tree crowns, which include only a small number of pixels. Due to geo-referencing error (order of one pixel), the leaf phenology classifier might fail in classifying SOs. This might be due to pixels spectral features that could correspond to different objects on the ground (e.g., tree crown, soil, shadow). As a consequence, SO dominance in the LO may be mistaken. In addition, classification of LOs including SO of both cultivated orchard and olive grove were labeled as mixed units, due to the lack of LCCs criteria for evaluating the dominance of cultivated classes.
Another source of error may be the segmentation scale adopted. The latter may have aggregated landscape elements that should have been labeled as distinct objects into single LOs.
Moreover, for woody natural vegetation classes, confusion between either C1 (continuous) and C2 (fragmented) labels or C2 with or without co-dominant component may occur. Furthermore, confusion in the dominance of grassland or woody components in the LO considered when both classes are co-present may occur.
Confusion in validation process may depend on the perception that the expert may have had of the class dominance when providing reference samples to be used for LO classification. Figure 11 can exemplify this event. The LO evidenced by the red polygon in Figure 11 was labelled as A12/A1D2E1C2/A12/A2A6 (a coniferous forest fragmented by grassland) by the expert, but it was labelled as A12/A2A6C2/A12A1D2E1 (grassland with scattered trees) by the hierarchical classifier. The latter applied the dominance criteria to SO pixels found within the specific LO. All adjacent SOs belonging The LO evidenced by the red polygon in Figure 11 was labelled as A12/A1D2E1C2/A12/A2A6 (a coniferous forest fragmented by grassland) by the expert, but it was labelled as A12/A2A6C2/A12A1D2E1 (grassland with scattered trees) by the hierarchical classifier. The latter applied the dominance criteria to SO pixels found within the specific LO. All adjacent SOs belonging to other LOs perceived by the expert were not analyzed by the algorithm. This perception of the context may have guided the expert during the visual interpretation and labeling of the object.
An additional example is provided in Figure 12, where the LO (in red line) was labelled as A12/A1D2E1C1 by the expert and as A12/A1D2E1C2 by the hierarchical classifier. The incongruence may have been due to the presence of broadleaved SOs in the LO mixed with coniferous SO components detected by the classifier, but it is difficult to detect by expert visual image interpretation.
Remote Sens. 2020, 12, x FOR PEER REVIEW 26 of 32 to other LOs perceived by the expert were not analyzed by the algorithm. This perception of the context may have guided the expert during the visual interpretation and labeling of the object. An additional example is provided in Figure 12, where the LO (in red line) was labelled as A12/A1D2E1C1 by the expert and as A12/A1D2E1C2 by the hierarchical classifier. The incongruence may have been due to the presence of broadleaved SOs in the LO mixed with coniferous SO components detected by the classifier, but it is difficult to detect by expert visual image interpretation. The collection of reference data can be time-consuming and sometimes impossible to perform. The methodology proposed in the present study offers the advantage of the knowledge-driven approach that can avoid the need of training data.
Even though instantiation of the expert rules may require a significant amount of time to elaborate the decision tree, the expert knowledge requires less extensive in field campaigns. Moreover, such campaigns are not required to be repeated through time.

Conclusions
In the present paper we applied a knowledge-based GEOBIA classifier for mapping grasslands ecosystems. As a novelty, the classification algorithm instantiates the hierarchical FAO-LCCS scheme by exploiting: (1) A multi-temporal approach to identify different phenological vegetation attributes, combined with expert-knowledge from ecologists. This approach yielded satisfactory results in the discrimination of perennial and annual vegetation. (2) VHR imagery to reveal the suitability of features (i.e., context-sensitive) obtained through this approach. Without requiring additional data sources (i.e., CHM), entropy proved reliable in discriminating between high (trees and shrubs) and low vegetation (herbaceous). (3) R, G, B and NIR bands and derived indices were sufficient to recognize different ground targets. The choice to use only these bands was in light of the fact that VHR satellite data are very costly, whereas drone-flights with VIS-NIR bands become more and more frequently used to collect VHR images of local areas. As confirmed by Jabbour et al. 2020 [91], due to the cost of The collection of reference data can be time-consuming and sometimes impossible to perform. The methodology proposed in the present study offers the advantage of the knowledge-driven approach that can avoid the need of training data.
Even though instantiation of the expert rules may require a significant amount of time to elaborate the decision tree, the expert knowledge requires less extensive in field campaigns. Moreover, such campaigns are not required to be repeated through time.

Conclusions
In the present paper we applied a knowledge-based GEOBIA classifier for mapping grasslands ecosystems. As a novelty, the classification algorithm instantiates the hierarchical FAO-LCCS scheme by exploiting: (1) A multi-temporal approach to identify different phenological vegetation attributes, combined with expert-knowledge from ecologists. This approach yielded satisfactory results in the discrimination of perennial and annual vegetation. (2) VHR imagery to reveal the suitability of features (i.e., context-sensitive) obtained through this approach. Without requiring additional data sources (i.e., CHM), entropy proved reliable in discriminating between high (trees and shrubs) and low vegetation (herbaceous). (3) R, G, B and NIR bands and derived indices were sufficient to recognize different ground targets.
The choice to use only these bands was in light of the fact that VHR satellite data are very costly, whereas drone-flights with VIS-NIR bands become more and more frequently used to collect VHR images of local areas. As confirmed by Jabbour et al. 2020 [91], due to the cost of satellite images with spatial resolution lower than 5 m, end-users seem to prefer drone based acquisition of such data for decision making at local level. Thus, the proposed methodology can be applied to VHR data from drones.
These considerations suggest that that VHR resolution multi-temporal imagery in a GEOBIA classification framework complemented with expert knowledge can be reliable for grasslands ecosystem mapping in the Mediterranean area. This area is characterized by small fields of interconnected and fragmented habitats and ecosystems difficult to monitor with high resolution sensors such as Landsat and Sentinel-2. Thus, GEOBIA data driven classifiers seem adequate for detecting the landscape matrix at VHR. Without training samples, the proposed approach can be applied to other grasslands areas once local agricultural practices and phenology rules are elicited.
Concerning phenology, a hybrid approach could be used to automatically extract phenology parameters (e.g., yearly mean, yearly standard deviation, min-max day of the bio-mass peak) from intra-annual time series of Sentinel-2 free data in any site. Subsequently, VHR image could be tasked. The usefulness of phenological metrics obtained through the exploitation Time Series of high-resolution data within a GEOBIA classification scheme is going to be the object of a coming investigation.
Even though the use of LCCS criteria for LO classification would need further investigation, and in spite of the limited area in our study, the results obtained appear encouraging compared to supervised classifiers in the literature. So far, little effort has been dedicated to formalizing, aggregating and sharing expert knowledge in remote sensing applications [32]. This paper provides a methodology that can contribute to support the transition from a data-centric to a knowledge-centric approach [92]. This transition is strongly encouraged by the Group of Earth Observation (GEO) in support to decision makers.
Supplementary Materials: The following are available online at http://www.mdpi.com/2072-4292/12/9/1447/s1, Figure S1: Overview of the Land Cover Classification System, its two phases and the classifiers, Figure S2 Funding: This work was supported by the European Union's Horizon 2020 Research and Innovation Programme, within both ECOPOTENTIAL project: Improving Future Ecosystem Benefits through Earth Observations, grant agreement 641762 (www.ecopotential-project.eu) and myEcosystem showcase of the e-shape project (https: //e-shape.eu/; https://e-shape.eu/index.php/showcases), grant agreement 820852.