Comparison of Hyperspectral Versus Traditional Field Measurements of Fractional Ground Cover in the Australian Arid Zone

: The collection of high-quality ﬁeld measurements of ground cover is critical for calibration and validation of fractional ground cover maps derived from satellite imagery. Field-based hyperspectral ground cover sampling is a potential alternative to traditional in situ techniques. This study aimed to develop an e ﬀ ective sampling design for spectral ground cover surveys in order to estimate fractional ground cover in the Australian arid zone. To meet this aim, we addressed two key objectives: (1) Determining how spectral surveys and traditional step-point sampling compare when conducted at the same spatial scale and (2) comparing these two methods to current Australian satellite-derived fractional cover products. Across seven arid, sparsely vegetated survey sites, six 500-m transects were established. Ground cover reﬂectance was recorded taking continuous hyperspectral readings along each transect while step-point surveys were conducted along the same transects. Both measures of ground cover were converted into proportions of photosynthetic vegetation, non-photosynthetic vegetation, and bare soil for each site. Comparisons were made of the proportions of photosynthetic vegetation, non-photosynthetic vegetation, and bare soil derived from both in situ methods as well as MODIS and Landsat fractional cover products. We found strong correlations between fractional cover derived from hyperspectral and step-point sampling conducted at the same spatial scale at our survey sites. Comparison of the in situ measurements and image-derived fractional cover products showed that overall, the Landsat product was strongly related to both in situ methods for non-photosynthetic vegetation and bare soil whereas the MODIS product was strongly correlated with both in situ methods for photosynthetic vegetation. This study demonstrates the potential of the spectral transect method, both in its ability to produce results comparable to the traditional transect measures, but also in its improved objectivity and relative logistic ease. Future e ﬀ orts should be made to include spectral ground cover sampling as part of Australia’s plan to produce calibration and validation datasets for remotely sensed products.


Introduction
Satellite image-derived fractional ground cover mapping has proven to be an essential source of information for applications, including analysis of spatial and temporal vegetation dynamics [1], monitoring urban greenness [2], mapping bushfire burn severity levels [3], forest cover change [4], and deforestation [5]. Algorithms, including spectral mixture analysis [6][7][8], multiple endmember spectral mixture analysis [9], and relative spectral mixture analysis [10], are used to produce fractional cover (FC) maps. These algorithms can be applied to multispectral and hyperspectral imagery, decomposing each image pixel into a measure of similarity to two or more spectrally distinct land cover types. These showed low correlations. In our paper, hyperspectral ground surveys refer to the collection of spectral measurements over an area for the purpose of estimating ground cover fractions [33], quite a different application to the use of in situ spectroscopic measurements for radiometric calibration of imagery.
Another challenge for calibration and validation is to match the scale of field data with that of broad-scale FC products. When validating products developed from coarse resolution imagery (e.g., 500 m), it is common to up-scale field data recorded at a finer scale (e.g., 100 m) in order to determine the accuracy of the coarse resolution products. This is usually conducted under the assumption that the area around the sample site is homogenous and similar to that surveyed. For example, Meyer and Okin [35] conducted spectral sampling over 500-m transects to correspond to a MODIS pixel and compared the results to line-point intercept sampling that was conducted over smaller 100-m transects. A potential reason for the low correlation between the sampling methods is that the 100-m transects were insufficient to gain an adequate estimate of the ground cover for a 500-m pixel. Meyer and Okin [35] were following the Muir et al. [30] field sampling layout, where transects were placed in a radiating star pattern that was designed to relate field measurements to Landsat imagery. The Muir et al. [30] design samples three 100-m transects, which covers approximately 3 × 3 Landsat pixels, making it ideal for validating Landsat-based products but not necessarily adequate for coarser resolution products (MODIS).
The layout of transects also has the potential to affect how an area is sampled. For instance, the star-transect method includes a sampling bias that over-represents cover towards the centre of the plot. Three transects are placed in a star pattern with the result being that observation points are concentrated in the centre of the star and increasingly dispersed as the distance from the centre increases. Therefore, a sampling pattern that provides a more even distribution across a site may provide a better representation of the ground cover. Other studies have placed parallel transects evenly across sample sites [26,36] or placed transects in a grid pattern in order to more evenly sample the area [37].
Field-based hyperspectral ground cover sampling is a potential alternative to traditional techniques that may assist with the calibration and validation of remotely sensed products. The motivation for this research is to expand upon the work of Meyer and Okin [35] and trial hyperspectral ground cover sampling in Australia, with the ultimate aim of incorporating spectral sampling as part of Australia's national effort to collect validation and calibration data to meet our remote sensing needs. Our aim was to develop an effective sampling design for spectral ground cover surveys in order to estimate fractional ground cover. Our objectives were (1) to determine how spectral surveys and traditional step-point sampling compare when conducted at the same spatial scale, and (2) determine how these in situ methods compare to current Australian satellite-derived FC products.

Materials and Methods
Currently, in situ validation data collected using the Muir et al. [30] method is used to assess the accuracy of both the MODIS and Landsat products. Meyer and Okin [35] found that the in situ spectral measurements they collected could be used to validate fractional ground cover mapping developed from MODIS imagery over Botswana, but this has not been tested across other environments. To meet our objectives, we therefore completed ground cover surveys before inspecting how our in situ measurements would compare with the MODIS and Landsat products. Figure 1 provides an overview of the methods used in this study.

Study Area
The study was conducted in New South Wales (NSW), Australia within the arid zone ( Figure 2). Sites FG1-FG4 (Figure 3a,b) were situated at Fowlers Gap Arid Research Station, 110 km north of Broken Hill, NSW, while sites BH1-BH3 (Figure 3c,d) were located surrounding the City of Broken Hill. The climate for both regions is hot and persistently dry [38]. Fowlers Gap has a mean annual rainfall of 240 mm, a mean annual minimum temperature of 13 ᵒC, and a mean annual maximum temperature of 26.9 ᵒC [39]. The vegetation at Fowlers Gap comprises low open chenopodiaceous shrublands, some low open Acacia and Casuarina woodland as well as grasslands on the plains Mabbutt et al. [40]. Broken Hill has a mean annual rainfall of 250 mm with a mean annual minimum temperature of 11.8 ᵒC and a mean annual maximum temperature of 24.7 ᵒC [41]. The vegetation around Broken Hill is also composed of chenopod shrublands that includes saltbush and bluebush communities as well as Mulga (Acacia aneura) [42].

Study Area
The study was conducted in New South Wales (NSW), Australia within the arid zone ( Figure 2). Sites FG1-FG4 (Figure 3a,b) were situated at Fowlers Gap Arid Research Station, 110 km north of Broken Hill, NSW, while sites BH1-BH3 (Figure 3c,d) were located surrounding the City of Broken Hill. The climate for both regions is hot and persistently dry [38]. Fowlers Gap has a mean annual rainfall of 240 mm, a mean annual minimum temperature of 13 • C, and a mean annual maximum temperature of 26.9 • C [39]. The vegetation at Fowlers Gap comprises low open chenopodiaceous shrublands, some low open Acacia and Casuarina woodland as well as grasslands on the plains Mabbutt et al. [40]. Broken Hill has a mean annual rainfall of 250 mm with a mean annual minimum temperature of 11.8 • C and a mean annual maximum temperature of 24.7 • C [41]. The vegetation around Broken Hill is also composed of chenopod shrublands that includes saltbush and bluebush communities as well as Mulga (Acacia aneura) [42].

Ground Cover Surveys
To survey ground cover, six 500-m transects oriented north-south and spaced 100 m apart were established at each site. Across the six transects, two survey methods were used. Firstly, ground cover reflectance was recorded using an Analytical Spectral Devices Inc. FieldSpec 3 spectroradiometer (ASD) that measures the visible to shortwave infrared (350-2500 nm) parts of the electromagnetic spectrum. The sensor has 2150 bands with a spectral resolution of 3 nm from 350-1000 nm and 10 nm

Ground Cover Surveys
To survey ground cover, six 500-m transects oriented north-south and spaced 100 m apart were established at each site. Across the six transects, two survey methods were used. Firstly, ground cover reflectance was recorded using an Analytical Spectral Devices Inc. FieldSpec 3 spectroradiometer (ASD) that measures the visible to shortwave infrared (350-2500 nm) parts of the electromagnetic spectrum. The sensor has 2150 bands with a spectral resolution of 3 nm from 350-1000 nm and 10 nm

Ground Cover Surveys
To survey ground cover, six 500-m transects oriented north-south and spaced 100 m apart were established at each site. Across the six transects, two survey methods were used. Firstly, ground cover reflectance was recorded using an Analytical Spectral Devices Inc. FieldSpec 3 spectroradiometer (ASD) that measures the visible to shortwave infrared (350-2500 nm) parts of the electromagnetic spectrum. The sensor has 2150 bands with a spectral resolution of 3 nm from 350-1000 nm and 10 nm from 1000-2500 nm. An 8-degree field of view fore-optic was held 1 m above the ground, creating a 0.14-m diameter ground field of view. At the start of each transect, and as required, the device was optimized and white reference measurements taken following the recommended protocols [43]. The operator of the ASD walked along each transect at a consistent pace taking continuous readings of ground cover reflectance. The continuous readings were averaged by the ASD and 10 averaged spectra were recorded for each 25-m segment of the transect, totalling 200 spectra per transect (1200 measurements per site).
The second method used was step-point sampling, where an observer collected point-based observations of ground cover along the six transects. The observer marked a point on a boot tip and at 5-m intervals recorded the cover that intersected the point. Cover was categorised into a set number of cover types, including crust, rock, litter, green leaf, and dry leaf, as outlined in the Muir et al. [30] protocol. The cover for each of these categories was calculated as the proportion of the total number of point observations at the site (n = 600). These categories were grouped into three broad classes, PV, NPV, and BS, to give their FC percentage within the site.

Endmember Extraction and Spectral Unmixing
The hyperspectral reflectance measurements for each transect were converted into single raster files enabling them to be processed in ENVI 5.3.1 (Exelis Visual Information Solutions, Boulder, Colorado). The Sequential Maximum Angle Convex Cone (SMACC) tool was used to extract endmembers from the transect rasters and to perform linear spectral unmixing [44]. The SMACC tool automatically defines the most extreme point (i.e., the brightest pixel in multi-dimensional space) as the first endmember in the raster using a convex cone model. The next endmember is identified based on the angle it makes with the existing cone (i.e., the pixel that is most different from the brightest), which is then added to the cone to derive the next endmember. This process continues until a specific tolerance is reached or until a specific number of endmembers are identified.
For each of the transects, PV, NPV, and BS endmembers ( Figure 4) were extracted and abundance images of PV, NPV, BS, and shadow were produced, with each image displaying the proportion a specific endmember contributes to each pixel. These images were produced using a fully constrained linear spectral unmixing algorithm: where DN b is the apparent surface reflectance of a pixel in band b of an image; F i is the fraction of endmember i; DN i,b is the relative reflectance of endmember i in band b; n is the number of endmembers; and E b is the error for band b of the fit of n spectral endmembers, that of [7,8,45].

Comparison to Image-Based Fractional Cover Products
The field-based FC estimates were compared to two Australian image-derived FC products based on MODIS [46] and Landsat imagery [47]. The MODIS FC product was initially developed for monitoring the tropical savanna region of the Northern Territory, Australia [16] and was later applied across the continent by the Commonwealth Scientific and Industrial Research Organisation (CSIRO) [19]. This product uses MODIS imagery and describes the proportion of PV, NPV, and BS Australiawide. The Landsat product was developed by the Joint Remote Sensing Research Program (JRSRP) also as a national FC product utilising the Landsat archive. Initially developed for rangeland monitoring in Queensland, Australia, the product is now being implemented nationally by Geoscience Australia [21,24]. Key differences between the two products include their spatial and temporal resolutions. The MODIS product has a moderate resolution of 500 m while the Landsat product is at a finer scale of 25 m. The latest version of the MODIS product uses MODIS MC43A4 version 6 imagery, which is a 16-day composition of daily captures from 2000 to 2019 (on-going) [46]. The Landsat product utilises data from the Landsat archive from 1986 to the present. Recent versions of the Landsat and MODIS FC products use a similar unmixing process [23] that incorporates endmembers of PV, NPV, and BS derived from field spectra and the imagery itself. For this study, the Guerschman and Hill [46] version 3.1.0 MODIS product and the Landsat FC25 version 1.5 were used [47]. The MODIS and Landsat FC products were acquired for dates that corresponded with the collection of in situ ground cover measurements. The Landsat FC image is based on a single date (17 August 2018) while the MODIS FC product is developed from a 16-day composite of imagery collected from the 13 to 28 August 2018. The PV, NPV, and BS values for each of the seven sites were extracted from a single pixel for the MODIS product while an average of 400 Landsat pixels across the same 500 × 500 m area were calculated. These extracted values were then compared to both the step-point and the spectral PV, NPV, and BS fractions.

Statistical Analysis
To determine the relationship between the in situ FC estimates and the image-based estimates, two metrics were used: Spearman's rank-order correlation (rs) to measure the relationship between The proportions of PV, NPV, and BS across each site were calculated as the averages of the unmixed fractions derived from each transect spectrum.

Comparison to Image-Based Fractional Cover Products
The field-based FC estimates were compared to two Australian image-derived FC products based on MODIS [46] and Landsat imagery [47]. The MODIS FC product was initially developed for monitoring the tropical savanna region of the Northern Territory, Australia [16] and was later applied across the continent by the Commonwealth Scientific and Industrial Research Organisation (CSIRO) [19]. This product uses MODIS imagery and describes the proportion of PV, NPV, and BS Australia-wide. The Landsat product was developed by the Joint Remote Sensing Research Program (JRSRP) also as a national FC product utilising the Landsat archive. Initially developed for rangeland monitoring in Queensland, Australia, the product is now being implemented nationally by Geoscience Australia [21,24]. Key differences between the two products include their spatial and temporal resolutions. The MODIS product has a moderate resolution of 500 m while the Landsat product is at a finer scale of 25 m. The latest version of the MODIS product uses MODIS MC43A4 version 6 imagery, which is a 16-day composition of daily captures from 2000 to 2019 (on-going) [46]. The Landsat product utilises data from the Landsat archive from 1986 to the present. Recent versions of the Landsat and MODIS FC products use a similar unmixing process [23] that incorporates endmembers of PV, NPV, and BS derived from field spectra and the imagery itself. For this study, the Guerschman and Hill [46] version 3.1.0 MODIS product and the Landsat FC25 version 1.5 were used [47]. The MODIS and Landsat FC products were acquired for dates that corresponded with the collection of in situ ground cover measurements. The Landsat FC image is based on a single date (17 August 2018) while the MODIS FC product is developed from a 16-day composite of imagery collected from the 13 to 28 August 2018. The PV, NPV, and BS values for each of the seven sites were extracted from a single pixel for the MODIS product while an average of 400 Landsat pixels across the same 500 × 500 m area were calculated. These extracted values were then compared to both the step-point and the spectral PV, NPV, and BS fractions.

Statistical Analysis
To determine the relationship between the in situ FC estimates and the image-based estimates, two metrics were used: Spearman's rank-order correlation (r s ) to measure the relationship between methods and the mean absolute error (MAE) to measure the average error. MAE was calculated as follows: where f 1 and f 2 represent the two FC measures being tested and n is the number of measurements. MAE is an average of the absolute difference between FC measure 1 and FC measure 2 (i.e., the absolute error). MAE is calculated in the same units as the variables and is a negatively oriented score, with lower values indicating lower errors.

Results
The in situ methods showed strong positive relationships across all three ground cover types (Table 1). While r s was high, the MAE for NPV (r s = 0.61, MAE = 19.82) and BS (r s = 0.82, MAE = 19.26) was also relatively high. PV (r s = 0.87, MAE = 1.37) showed a high correlation with low errors. Overall, low errors were observed for all comparisons made for PV. When the in situ methods were compared to the image-based models (MODIS and Landsat), spectral transect sampling showed a strong relationship to the MODIS image for PV and was the strongest relationship observed (r s = 0.91, MAE = 4.21). For BS, the correlation between the MODIS imagery and the in situ methods was moderate and moderate to low for NPV. In comparison, the Landsat imagery showed a strong to moderate relationship with both in situ methods for BS, NPV, and PV. Table 1. Summary of correlations and errors for each ground cover type based on comparisons between in situ and image-based fractional cover methods (step-point, spectral, MODIS, and Landsat).

Bare Soil
Step-point Spectral

Non-photosynthetic Vegetation
Step-point Spectral The percent cover of PV, NPV, and BS calculated for the in situ and image-based method at each field site ( Figure 5) shows the in situ methods varied significantly. For Fowlers Gap sites 2-4, the step-point and spectral PV, NPV, and BS were very similar, whereas at the Fowlers Gap site 1 and the Broken Hill sites, PV followed a similar pattern but NPV and BS varied significantly from one another. Overall, PV was low across all sites and especially low for the Fowlers Gap sites, with approximately 10% less PV than the Broken Hill sites. As shown in Figure 3a,b, Fowlers Gap vegetation was extremely sparse with vast areas of exposed soil, and while the Broken Hill sites were also sparely vegetated, they still had considerably more vegetation than the Fowlers Gap sites.

Discussion
The motivation for this study was to test field-based hyperspectral ground cover sampling as a method of calibrating and validating image-based fractional ground cover products in Australia. By developing an alternative survey design for spectral transect sampling and comparing this method to step-point sampling at the same spatial scale, we developed an insight into the relationship between our two field methods and how they compare to current Australian image-derived fractional ground cover products.
Overall, the in situ methods were positively correlated with each other. Though neither method is truly 'ground truth', this strong positive linear relationship between the in situ methods suggests they provided relatively accurate estimates of ground cover at each field site. In contrast, Meyer and Okin [35] found little to no correlation between their two field methods. This is likely due to a scale mismatch in the Meyer and Okin study (the line point transects were 100 m, and the spectral transects were 500 m), whereas we avoided this mismatch by conducting both surveys over the same 500-m transects. Additionally, by avoiding a star-transect layout, our grid sample design more evenly distributed sample points across each site, ensuring that that we were not over sampling a specific area and collecting data evenly across each site.
Overall, there was relatively good correlation between both in situ methods and the image-based products. Previous validation of the MODIS and Landsat methods using in situ measurements similar to our step-point sampling reported good correlation between in situ measurements and the image-based products [24,46], but very few of these sites were located in areas with a very low percentage of vegetation. MAE was consistently low between the in situ measurements and imagebased values for PV compared to BS and NPV, which showed considerably higher errors (Table 1). This pattern of errors is also consistent with past studies, where PV has been successfully unmixed due to being spectrally unique, whereas BS and NPV are typically harder to distinguish due to their spectral similarity [15,35].

Discussion
The motivation for this study was to test field-based hyperspectral ground cover sampling as a method of calibrating and validating image-based fractional ground cover products in Australia. By developing an alternative survey design for spectral transect sampling and comparing this method to step-point sampling at the same spatial scale, we developed an insight into the relationship between our two field methods and how they compare to current Australian image-derived fractional ground cover products.
Overall, the in situ methods were positively correlated with each other. Though neither method is truly 'ground truth', this strong positive linear relationship between the in situ methods suggests they provided relatively accurate estimates of ground cover at each field site. In contrast, Meyer and Okin [35] found little to no correlation between their two field methods. This is likely due to a scale mismatch in the Meyer and Okin study (the line point transects were 100 m, and the spectral transects were 500 m), whereas we avoided this mismatch by conducting both surveys over the same 500-m transects. Additionally, by avoiding a star-transect layout, our grid sample design more evenly distributed sample points across each site, ensuring that that we were not over sampling a specific area and collecting data evenly across each site.
Overall, there was relatively good correlation between both in situ methods and the image-based products. Previous validation of the MODIS and Landsat methods using in situ measurements similar to our step-point sampling reported good correlation between in situ measurements and the image-based products [24,46], but very few of these sites were located in areas with a very low percentage of vegetation. MAE was consistently low between the in situ measurements and image-based values for PV compared to BS and NPV, which showed considerably higher errors (Table 1). This pattern of errors is also consistent with past studies, where PV has been successfully unmixed due to being spectrally unique, whereas BS and NPV are typically harder to distinguish due to their spectral similarity [15,35].
The observer and the spectral field data recorded less than 1.3% PV at the Fowlers Gap site, MODIS PV values ranged from 0% to 3%, and Landsat ranged from 0% to 9.72% PV. Considering the finer resolution of the Landsat product, we would have expected PV to be better correlated with the Landsat values rather than the MODIS values. A reason for this could be related to the image products. The Landsat FC product is based on a single image captured on one day, whereas the MODIS MCD43A4 product calculates the weighted estimate of albedo over a 16-day period. The Landsat image was captured during this 16-day composite period.
This comparison was conducted with a small number of samples located in the arid zone where we know these products tend to fail [19,24]. More extensive surveys are needed to determine if this pattern is more widespread in arid areas. It is also important to remember that we compared single MODIS pixels with an average of 400 Landsat pixels. Sampling a cluster of pixels is preferable for accuracy assessment to remove errors associated with positional accuracy [48]. This is feasible for image products with resolutions of 5, 10, or 30 m but becomes logistically taxing for clusters of MODIS pixels at 500 m. This is why the upscaling of field data is regularly used. Currently, the MODIS product is validated using upscaled in situ data initially collected over 100-m transects. Sampling the area of a single pixel in the field has limitations. We argue that overall, surveying the area of a single MODIS pixel is preferable to comparing upscaled field data to a MODIS pixel.
Arid shrublands and desert zone cover 48% of the Australian continent [49]. Having reliable long-term fractional cover data at varying scales is crucial for those managing or studying these regions, especially for areas that are inaccessible or unsafe to travel. The in situ methods used have both benefits and shortcomings.
Step-point sampling has been developed over time as a simple and easily repeatable method of collecting fractional ground cover estimates. Limitations of this technique include the time-consuming collection of field observations and the potential for human subjectivity and bias to be introduced, especially when classifying PV and NPV [32]. Utilizing standardized definitions and methods [30] may reduce subjective error but cannot remove it entirely. In order to further reduce human bias, spectral transect sampling provides a solution. This method allows for continuous, quantitative hyperspectral measurements to be taken over an area, providing an objective record of ground cover without the need for observers to make categorical decisions in the field. This hyperspectral record of ground cover may also have the potential to calibrate and validate a range of other remotely sensed products and this is an area of future research. With the continued demand for high-quality ground cover products, it is critical to ensure that we are collecting high-quality calibration and validation data for the assessment of these sought-after products.

Conclusions
Field-based estimation of fractional ground cover is critical for ensuring the accuracy and consistency of remotely sensed ground cover maps. Currently, Australia's national standard for the collection of field estimates of ground cover uses traditional field sampling techniques, but hyperspectral reflectance sampling of ground cover has considerable potential to improve field measurements collected for calibration and validation purposes. This study trailed the use of hyperspectral reflectance sampling in the sparsely vegetated NSW arid zone. Comparison of step-point and spectral transect sampling across the same transects, at the same spatial scale, demonstrated the significant potential of the spectral transect method, both in is ability to produce results comparable to the traditional transect measures and also in the improved objectivity and relative logistic ease of the method.
Overall, we found the in situ step-point and spectral sampling techniques to be positively correlated across the three ground cover classes. Comparing the in situ data and current Australian image-derived fractional cover products showed that overall, the Landsat product was strongly related to both in situ methods for non-photosynthetic vegetation and bare soil whereas the MODIS product was strongly correlated with both in situ methods for photosynthetic vegetation. These results are specific to our survey sites and further work is required to test their wider applicability.
While a limitation of spectral sampling is the availability and cost of the spectroradiometer itself, overall, the benefits outweigh the limitations. Spectral sampling is especially beneficial for repeat surveys or multi-temporal studies. Future efforts should be made to include spectral ground cover sampling as part of Australia's efforts to produce calibration and validation datasets for remotely sensed products and should further test this method to develop a national or global standard.