A Colourimetric Approach to Ecological Remote Sensing: Case Study for the Rainforests of South-Eastern Australia

Abstract: To facilitate the simplification, visualisation and communicability of satellite imagery classifications, this study applied visual analytics to validate a colourimetric approach via the direct and scalable measurement of hue angle from enhanced false colour band ratio RGB composites. A holistic visual analysis of the landscape was formalised by creating and applying an ontological image interpretation key from an ecological-colourimetric deduction for rainforests within the variegated landscapes of south-eastern Australia. A workflow based on simple one-class, one-index density slicing was developed to implement this deductive approach to mapping using freely available Sentinel-2 imagery and the supercomputing power of Google Earth Engine for general public use. A comprehensive accuracy assessment based on existing field observations showed that the hue from a new false colour blend combining two band ratio RGBs provided the best overall results, producing a 10 m classification with an overall average accuracy of 79%. Additionally, a new index based on a band ratio subtraction performed better than any existing vegetation index typically used for tropical evergreen forests, with comparable results to the false colour blend. The results emphasise the importance of the SWIR1 band in discriminating rainforests from other vegetation types. While traditional vegetation indices focus on productivity, colourimetric measurement offers versatile multivariate indicators that can encapsulate properties such as greenness, wetness and brightness as physiognomic indicators. The results confirmed the potential for the large-scale, high-resolution mapping of broadly defined vegetation types.


Introduction
False colour images have typically offered a starting point for satellite imagery processing and analysis, allowing the interpreter to visualise and recognise distinguishable patterns in landscapes with data from bands in and beyond the visible spectrum, into the near-infrared and shortwave infrared ranges. Surprisingly, however, the direct measurement of colour metrics (or colourimetry) from false colour imagery has received little attention, particularly for vegetation classification. Recent machine learning approaches are still seen by many researchers as black boxes that are difficult to interpret, which is impractical when end users want to visualise, understand and communicate how mapping solutions work, how reliable mapping decisions are and how they were arrived at [1][2][3][4][5]. Indeed, the visualisation of what machine learning approaches are actually learning, for the explanation of models and enhancement of trust, is still an open area of research [3,6][7][8][9]. The focus of this study was to revisit colourimetric approaches, such as those from Wester et al. [10] to Pekel et al. [11], and develop them further, given the recent availability of temporal data at a high enough resolution to do it well. This study aimed to apply ecological knowledge, visual analytics and evidential reasoning [11,12] to simplify large-scale, high-resolution remote sensing classifications, keeping them feasible, scalable and more transparent and communicable for a wider public and non-expert audience through the direct and quantitative measurement of colour from false colour satellite imagery. The study shows, via an extensive validation across several ecoregions, that colourimetry can provide high mapping accuracies. There should always be scope to choose the most appropriate method for an outcome, and the proposed method simply adds an option to the toolkit, with comparative strengths and weaknesses that differ from those of the other methodological options.
Satellite Imagery Interpretation (SII) can be considered as the quantitative (and, thus, less subjective) multispectral analogue to the commonly known, accepted and continually evolving technique of API (Aerial Photographic Interpretation) [13][14][15][16]. API has been described as a visual problem-solving activity whereby the analyst uses knowledge and experience to derive insights from the patterns of visual cues in an image [17]. A good example of the implementation of SII is the augmented visual interpretation applied in the Food and Agriculture Organization of the United Nations' Collect Earth software for land use and land cover assessment [18]. SII can also be seen as a remote sensing application of Visual Analytics, which has been described as "combin(ing) automated analysis techniques with interactive visualisations for an effective understanding, reasoning and decision making on the basis of very large and complex datasets" [12]. SII with a holistic visual analysis of the landscape [19,20] was formalised with the creation and application of an image interpretation key from an ecological colourimetric deduction. Image interpretation keys are intended to summarise complex information and to train inexperienced personnel in the interpretation of complex or unfamiliar topics or provide a reference for experienced interpreters to rapidly identify examples pertaining to specific features [21]. The interpretation key presented in the study is a type of domain ontology [22] that can be agreed upon by experts and validated by an accuracy assessment to resolve cognitive biases. A deductive colourimetric approach is thus proposed to map landscape associations that can be visually interpreted from an image by an analyst and communicated to and applied by non-experts.
A mapping case study was carried out for the rainforests of south-eastern Australia, a diverse vegetation formation within a variegated landscape, to demonstrate the utility of a deductive colourimetric approach. We show how the classification of a land cover class can be simplified by identifying the match between the ecological and false colour characteristics that make it unique within its environment during a particular season. Vegetation indices have been shown to be useful for estimating variables related to photosynthesis at the canopy or ecosystem scale, including phenology, primary productivity, net carbon fixation, gross primary productivity and processes related to plant transpiration [23]. However, rainforests in Australia vary from dry to very moist, with a wide range of structural variability [24]. Further, different unrelated vegetation types juxtaposed with rainforests can share similar structural or productivity-related values, and false colour RGB composites have the potential to distinguish land cover features uniquely, serving as useful indicators of physiognomic traits for vegetation. Band ratio RGBs were created and tested to increase the hue angle separability of classes by using band ratios for each RGB channel. The use of band ratio arithmetic in RGB composites is not new (as an often-cited example, Sultan [25] used the multiplication of band ratios in just one channel to emphasise the distinction of minerals in a lithological mapping study). However, band ratio RGBs have never been used in vegetation studies before, and direct measurements from them have never been used to classify features.

Colourimetry and Interpretation
Image processing is performed to decrease the correlation between bands and enhance the contrast between colours to highlight the information of greatest interest to the analyst [10]. Commonly applied RGB combinations include 'Natural colour' (RGB: Red, Green, Blue), 'Colour infrared' (RGB: NIR, Red, Green), 'Land/water' (RGB: NIR, SWIR1, Red) and 'Agricultural' (RGB: SWIR2, NIR, Green) [26]. Further spectral distinction can be provided by the more spectrally representative multivariate visualisations provided by band ratio combinations [27]. Colour can be considered as a fundamental biophysical variable [28] and has been identified as the most commonly referenced perceptual cue that can be interpreted by inference and deduction [16,17,29][30][31]. Wester et al. [10] recommended hue, because it is easy to interpret colours that consistently represent particular features. Colour, particularly hue, has thus been considered the most perceptibly intuitive way for humans to differentiate and communicate their observations about features. The HSV colour space simulates the way humans perceive colours, where hue represents the dominant spectral component [32]. Decorrelating hue increases the separability of features by reducing the variations in chromatic modulation (Saturation) and brightness (Value) [33].
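To make the separation of hue from saturation and value concrete, the following minimal sketch converts one false colour pixel to HSV and reads off the hue angle. It uses Python's standard `colorsys` module; the reflectance values and the helper name `hue_angle` are illustrative assumptions, not values or code from this study.

```python
import colorsys


def hue_angle(r, g, b):
    """Return the HSV hue angle in degrees, in [0, 360), for one RGB pixel."""
    h, _saturation, _value = colorsys.rgb_to_hsv(r, g, b)
    return h * 360.0


# Hypothetical reflectances for one pixel shown as 'Land/water' (R=NIR, G=SWIR1, B=Red).
nir, swir1, red = 0.40, 0.20, 0.05

hue_lit = hue_angle(nir, swir1, red)
# The same pixel under half the illumination: Value changes, hue does not.
hue_shaded = hue_angle(nir * 0.5, swir1 * 0.5, red * 0.5)

print(round(hue_lit, 1), round(hue_shaded, 1))
```

With these assumed inputs, both calls return a hue of roughly 25.7 degrees, which lies in the orange range. The point of the sketch is only that a uniform change in brightness leaves the hue angle unchanged.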
In terms of direct or colourimetric measurements, Pekel et al. [34] applied the visually intuitive metric of hue angle and showed its temporal relationship with the vegetation index NDVI in a study to monitor the vegetation extent. Other terrestrial environment studies that applied a hue angle include: [35,36]. There have been many more aquatic detection and classification studies based on the hue angle, typically for RGB combinations of the Shortwave Infrared, Near-infrared and Red bands, including: [37][38][39][40][41][42]. The technique has been proven effective at a global level with the high-resolution mapping of global surface water [11].

Scalability
A major problem of purely statistical methods for high thematic resolution mapping is the scaling issue that arises when features modelled from field samples at a particular scale cannot be transferred directly to different regions or scales [1,43][44][45]. Principal components analysis (PCA) has typically been used in the past on band ratio RGBs to reduce the dimensionality of multivariate data and to classify landscape features based on the size and significance of the component loadings [27]. However, principal components are influenced by the sample size and assume that variables change linearly along underlying environmental gradients, and distorted ordinations can result from nonlinearity and the inclusion of collinear variables [46]. In contrast, the hue angle is scalable, since it is not parameterised, and visually determined thresholds from SII can mitigate the need for the intensive sampling that is typically required for statistically modelled approaches.

Reducing Sampling Cost with Satellite Imagery Interpretation
Sufficient field training data are rarely available, as they may be expensive and logistically difficult to collect for the requirements of conventional statistical approaches to classification, as well as their accuracy assessments across large and remote areas [47]. Other issues, such as access to private lands or places that are physically difficult to reach, can sometimes make these approaches impossible. However, analysts can learn to identify many physiognomic and floristic classes in remotely sensed imagery, given a field-based knowledge of general distributions and the free and comprehensive reference data available from high-resolution and multispectral image archives [48] that can be accessed and processed online, as is now possible with Google Earth Engine (GEE) [49]. This study was thus primarily knowledge-based and measurement-driven in an attempt to promote simplicity and reduce the effort and cost of sampling, following recommendations for the abandonment of traditional statistical methods, which have been found to be sensitive to scaling effects, and placing more emphasis on the visualisation and interpretation of geospatial data over a range of scales [50]. This involved using ecological and field knowledge to deduce landscape patterns in false colour images and comparing and associating freely available Very High-Resolution (VHR; at one metre or less [33]) imagery with medium-resolution satellite imagery, such as the 10 m Sentinel-2 imagery provided by GEE, to visually define classes with density sliced vegetation indices or false colour hue angles.
Most vegetation data from the past have been sampled preferentially (or purposively) and are thus considered not to fulfil the statistical assumption of independence of observations necessary for valid statistical testing and inference, and while systematic, simple random and stratified random sampling better meet some of the statistical assumptions, purposive sampling typically yields data sets that cover a broader range of vegetation variability [51]. Random sampling is not efficient at assisting pattern recognition but is, rather, only a measure of probability. Following a Holistic [20] and Gestalt approach [52], it is acknowledged in this study that the interpretation of part of an image depends on the interpretation of the rest of the image and, indeed, the surrounding landscape. The interpretation was largely contextual and used the aforementioned perceptual cues to associate what hues can uniquely and consistently define rainforest by visual ecological deduction, grouping observations by qualities, including proximity, similarity and continuity. SII is proposed to emulate the process of purposive sampling by allowing the interpreter to interactively take a holistic assessment of the landscape, iteratively panning around and zooming in and out from features that appear to form a pattern in multispectral imagery, and confirming assumptions colourimetrically in combination with interpretations from VHR imagery. This study will present the identification of colourimetric benchmarks that identify breaks and ranges of recognisable ecosystem features in the multispectral colour spaces of satellite imagery, which will be explained further by example in the Methods. The cognitive deduction involved can be seen as a visually interactive process of association and elimination analogous to the presence and background learning described by Deng et al. [53]. 
In the same way that we can generalise urban environments being bright and having angular geometries (and oceanic water appearing dark compared to land, for example), we can deduce that tree floristic classes and other tree communities can be distinguished in reference imagery from nearby associations by characteristic canopy appearance in high-resolution imagery or distinct phenologic observations from either high- or medium-resolution reference imagery [18,48].

Colourimetric Ontologies and Multidimensional Colour Blending
Ontology-driven classifications provide the identification of meaningful and communicable features [54]. In this study, false colour image classifications are driven by ontology. The RGB combinations selected for testing in this study were based on a priori knowledge and experimentation. Red and Near-Infrared bands in combination (either through a simple ratio or a normalised difference like NDVI) indicate vegetative vigour or photosynthetic activity, and the first Shortwave Infrared band (SWIR1) in either Landsat or Sentinel-2 sensors has a strong relationship with moisture, with the highest coefficient for wetness in the Tasselled Cap transformation [55][56][57]. We can thus assume that combining these three bands in an RGB composite could provide a corresponding multivariate relationship in the HSV colour space, indicating the photosynthetic capacity and moisture. While simple RGBs like the 'Land/Water' RGB (which is composed of these three bands) provide good contrasting visualisations for interpretation, they tend to display narrow data distributions for hue angle. This could be resolved by applying image stretches like histogram equalisations; however, stretching was avoided in order to keep the solutions absolute and scalable. The band ratio RGBs were tested instead, as they usually provide wider data distributions and conveniently reduce hill shading compared with simple RGBs.
Various statistical methods previously proposed for the optimal selection of false colour RGB band combinations [58][59][60][61][62] are not scalable due to their dependence on available field samples and their extent. To avoid scaling issues, the band ratios in this study were created deductively by assigning and dividing the SWIR1 band in each RGB channel with the bands from the 'Natural colour' and 'Infrared colour' combinations to create band ratio RGBs with the expectation that they would separate features as much as possible with the wetness, structure and greenness related data provided by these combinations.
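The construction of a band ratio RGB can be sketched as follows. This is a minimal illustration under stated assumptions: the ratio direction (SWIR1 divided by each display band), the reflectance values and the names `band_ratio_rgb` and `hue_angle` are all hypothetical, and the exact ratios used in the study are given in its Methods rather than here.

```python
import colorsys


def band_ratio_rgb(swir1, band_r, band_g, band_b, eps=1e-6):
    """Form a band ratio RGB triplet by dividing SWIR1 by each display band.

    The ratio direction is an assumption made for this sketch; eps guards
    against division by zero over dark pixels.
    """
    return tuple(swir1 / max(band, eps) for band in (band_r, band_g, band_b))


def hue_angle(rgb):
    """HSV hue angle in degrees, in [0, 360), for an RGB triplet."""
    h, _s, _v = colorsys.rgb_to_hsv(*rgb)
    return h * 360.0


# Hypothetical surface reflectances for a single vegetated pixel.
blue, green, red, nir, swir1 = 0.03, 0.06, 0.04, 0.40, 0.18

# SWIR1 against the 'Natural colour' bands (Red, Green, Blue).
natural_ratio = band_ratio_rgb(swir1, red, green, blue)
# SWIR1 against the 'Colour infrared' bands (NIR, Red, Green).
infrared_ratio = band_ratio_rgb(swir1, nir, red, green)

print(hue_angle(natural_ratio), hue_angle(infrared_ratio))
```

Because hue depends only on the relative ordering and spacing of the three channels, the ratio values do not need to be rescaled to the 0 to 1 range before the hue angle is computed.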
Miller et al. [63] presented multidimensional blending (or transparency weighted image combination) as a strategy for distilling and communicating multiple sources of information simultaneously in a controlled manner for communication to the human analyst. This study compared mapping results from (1) traditional multispectral indices with (2) the hues from false colour band ratio RGBs and (3) a false colour band ratio RGB blend, all of which were intended for quantitative measurement rather than just communication. The RGB combinations were selected from a matrix of the possible arithmetic combinations of multiplications and divisions between two band ratio RGBs, to be further explained and justified in Methods.
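As a rough illustration of what combining two band ratio RGBs by multiplication or division might look like, the sketch below applies each operation channel by channel and reports the resulting hue. The channel-wise pairing, the input triplets and the function names are assumptions for illustration only; the actual combination matrix is described in the study's Methods.

```python
import colorsys


def blend(rgb_a, rgb_b, op, eps=1e-6):
    """Combine two RGB triplets channel by channel with 'multiply' or 'divide'."""
    if op == "multiply":
        return tuple(a * b for a, b in zip(rgb_a, rgb_b))
    if op == "divide":
        return tuple(a / max(b, eps) for a, b in zip(rgb_a, rgb_b))
    raise ValueError(f"unsupported operation: {op}")


def hue_angle(rgb):
    """HSV hue angle in degrees, in [0, 360), for an RGB triplet."""
    h, _s, _v = colorsys.rgb_to_hsv(*rgb)
    return h * 360.0


# Two hypothetical band ratio RGB triplets for the same pixel.
ratio_a = (4.5, 3.0, 6.0)
ratio_b = (0.45, 4.5, 3.0)

# Candidate blends to compare by the hue each one produces.
candidates = {
    "A*B": blend(ratio_a, ratio_b, "multiply"),
    "A/B": blend(ratio_a, ratio_b, "divide"),
    "B/A": blend(ratio_b, ratio_a, "divide"),
}
hues = {name: hue_angle(rgb) for name, rgb in candidates.items()}
print(hues)
```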

Simplicity and Transparency through One-Class, One-Index Density Slicing
For the sake of simplicity and transparency for general public use and to compare the precision of the results at the pixel level rather than a generalised and arbitrary segment level, a one-class, one-index thresholding approach was applied to extract one visibly distinguishable and ontologically communicable feature at a time directly from false colour imagery. A direct image analysis was preferred to hybrid approaches that combine image classification and environmental modelling, because it maintains the highest precision possible by not corrupting the raw data from satellite imagery with other spatial variables. The latter are generally only available at resolutions coarser than the imagery, making it impossible to accurately map fine features like riparian gullies, rivers, roads or disturbed or illegally deforested areas.
Multiple class classifications can be inefficient and suboptimal in terms of accuracy when the main goal is to classify only one or a few classes [64]. One-class classifications focus on the classes of interest, redirecting the resources and effort from other separable or potentially unrelated classes, providing opportunities to derive highly accurate classifications from small training sets. In this study, visual density slicing was applied to categorise the data in the candidate indices by dividing the range of values into quantile intervals, then assigning each interval a category colourimetrically. Examples of the simple, yet robust use of density slicing to define thresholds for mapping in the past include [65][66][67][68], and owing to its ease of use, it is applied operationally, for example by land managers classifying and mapping wildfire severity [69].
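The density slicing step described above can be sketched in a few lines. This is a minimal, assumption-laden example: the hue values, the number of slices and the choice of which slice represents the target class are hypothetical, and in the study the slice assignment was made visually by the interpreter rather than in code.

```python
from bisect import bisect_right
from statistics import quantiles


def density_slice(values, n_slices):
    """Assign each value to one of n_slices quantile intervals (0 .. n_slices - 1)."""
    breaks = quantiles(values, n=n_slices)  # n_slices - 1 cut points
    slices = [bisect_right(breaks, v) for v in values]
    return slices, breaks


def one_class_mask(slices, target_slices):
    """True where a pixel falls in a slice the interpreter assigned to the class."""
    return [s in target_slices for s in slices]


# Hypothetical hue angles (degrees) for ten pixels.
hues = [12.0, 18.5, 24.0, 27.5, 31.0, 45.0, 80.0, 120.0, 150.0, 200.0]

slices, breaks = density_slice(hues, n_slices=5)
# Suppose visual inspection associates only slice 1 with the target class.
mask = one_class_mask(slices, target_slices={1})
print(breaks, slices, mask)
```

A single index and a single class keep the decision rule to one pair of thresholds, which is what makes the result easy to inspect and communicate.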

Case Study: Rainforests of South-Eastern Australia
Worldwide, rainforests provide important and irreplaceable ecosystem services, including sustaining high levels of biodiversity, storing carbon, moderating flood and drought cycles, limiting soil erosion, reducing downstream flooding, influencing rainfall patterns, providing habitat to wildlife and indigenous people and many useful products upon which local communities depend [48,70]. They thus have high conservation as well as cultural values that have been affected by anthropogenic disturbances at multiple scales [71]. The rainforests of New South Wales (NSW) in south-eastern Australia were chosen to develop and test the colourimetric approach due to their diverse characteristics and extensive range over a large environmentally heterogeneous region of more than 18 million hectares, approximately 75% of the size of the United Kingdom.

Existing Approaches to Rainforest Mapping and the Need for Detail and Automation
A review of the published literature indicates that, while there are some more recent large-scale maps of generic forest cover or biomass of up to 30 m in resolution, there have not been any automated large-scale forest type maps distinguishing rainforest extents since 2013 at less than 250 m resolution [72].
The existing maps of rainforest for large extents in Australia held within the National Vegetation Information System (NVIS) [73] and the National Forest Inventory (NFI) [74] are quite coarse at 100 m resolution. For New South Wales, mapping largely coincides with the rainforest formations described by Keith [75] and mapped by Keith and Simpson [76]. The mapping is known to have varying degrees of uncertainty, having been compiled from a variety of sources at different spatial and thematic resolutions and laboriously digitised by hand with API. This digitisation has typically been left to the varying discretion and skill of different operators across different mapping extents, with photography from different years and different seasons, and the maps have unfortunately never had comprehensive accuracy assessments conducted or documented [47]. A great deal of feature variety can exist within a 100 m square pixel, and many features that are of interest to conservation efforts, including rivers and riparian rainforest features such as gullies, can be missed completely. Multispectral satellite imagery has been identified as the most accessible remotely sensed data for monitoring rainforests [48], since it is now free and can be acquired with regularity at relatively high resolutions (at 10 m with Sentinel-2 and 15 m with pan-sharpened Landsat imagery), making updating and monitoring much more feasible than costly one-off API digitisations.

Study Area and Knowledge Base
This study was limited to the known extent of rainforests in NSW, Australia, between latitudes 28°9′ S and 37°30′ S and longitudes 149°5′ E and 153°38′ E. The rainforests of NSW are the most varied of any in Australia and are one of the most important repositories of the State's biological diversity [75]. They are characterised by closed and continuous canopies dominated by non-eucalypt species, with high tree densities, sometimes in multiple vertical layers, with the foliage cover exceeding 70%. They vary from lush subtropical forests to dry vine thickets within broad climatic limits and are associated with the patchy occurrence of suitable soils and the influence of fire and humans [77,78]. Their patch sizes vary from less than one hectare in sheltered gullies to extensive tracts and mosaics within large eucalypt tall open forests [15,79]. They are generally composed of relatively soft to somewhat leathery, horizontally oriented leaves with high specific leaf areas, and their forms and ecology contrast greatly with those of the native sclerophyll taxa, which are generally absent from rainforests [75]. The full range of mainland rainforest types as defined by Keith [75] was considered, including: (1) Subtropical Rainforests, (2) Warm Temperate Rainforests, (3) Cool Temperate Rainforests, (4) Dry Rainforests, (5) Littoral (coastal) Rainforests and (6) Western Vine Thickets. These types demonstrate substantial variations in structures, species compositions, climates, soils and biogeography, with further considerable intertype variations.

Ecological Colourimetric Deduction and Interpretation Ontology for Rainforests
An ecological colourimetric deduction was derived from SII applied on the 'Land/Water' false colour RGB combination. The appearance of rainforests was generalised to be greener and glossier/brighter relative to surrounding sclerophyll dominated forests, even in the drier rainforest formations with more coriaceous leaf textures. Figure 1 shows examples from an API perspective of VHR imagery from Google Earth, demonstrating the textural, brightness and greenness contrasts between the rainforest types and native sclerophylls and how they can be confirmed in the street view. The same can be done from Google Maps. It was postulated that false colour image hue angles should perform better than conventional productivity-related vegetation indices like NDVI, because rainforests on the east coast of Australia can vary in moisture from dry to very moist but share the common physiognomic traits of canopy leaf glossiness and greenness and site wetness at a stand level relative to the drier native sclerophylls in the same areas, the range of which most linear vegetation indices would not be able to take into account by themselves at high resolutions.
The SII inspection and association to the existing 100 m NVIS and NFI reference classifications confirmed that there is a noticeable pattern in the greenness and brightness of rainforests compared to sclerophyll forests in the 'Natural colour' RGB combination (Red, Green and Blue). This appears much more separable in the 'Land/Water' RGB combination (NIR, SWIR1 and Red), which differentiates rainforests by a range of orange hues uniquely from the other hues associated with sclerophyll vegetation. Table 1 presents a SII key that was created and used to interpret the range of expressions that rainforests can exhibit across different landscapes with the perceptual cues of hue, brightness, texture, shape/size and association.
If viewed with the same linear stretch, the orange hues are consistent for rainforests in all the ecoregions, except for the vine thickets in the Northwest slopes, which, on close inspection, were clearly over-mapped in the past due to the low (100 m) resolution, and require histogram stretches at the current extent to be visualised more clearly as orange.

Figure 1. Examples comparing rainforest types to native sclerophylls from Google Earth and Google Earth street views for the locations marked with a red circle. Panel column labels: Rainforest types; Sclerophylls.

Two Stage Classification Design
A two-stage classification was undertaken to assess the scalability and regional transferability of each candidate index. The first stage focussed on four sample areas that represented the full range of ecological and climate variation across the whole study area. These sample areas were intended to reduce the effort in the first stage of image interpretation and were based on Landsat path/row tiles (Figure 2), which were selected for the consistency and convenience of their size. The northeast tile sampled primarily subtropical rainforest, while the southern tile sampled temperate, predominantly gully located rainforest. The north-western tile sampled the dry vine thickets west of the Northwest slopes, and the central tile sampled a variety of these rainforest types. The colourimetry analysis described in the following sections was applied to the four sample tiles and then the four ecoregions using a selection of candidate indices that produced alternative image classifications. The ecoregional strata for the second stage were rationalised from visual observation of colour differences across the landscape. Figure 2 shows that the sclerophyll forests of the south coast are generally darker than those of the north coast if viewed in the 'Land/Water' RGB. The Tablelands and Ranges of the Great Dividing Range are known to display different physiognomic traits to the coast and to the Northwest slopes, and the index thresholds from the first stage confirmed a corresponding colourimetric difference. The strata boundaries were constructed via manual selection of existing subregions from the Interim Biogeographic Regionalisation for Australia [80].

Stages for Sampling and Ecoregionalisation
The extent and performance ranking of candidate indices from the accuracy assessment in the first stage were used for the calibrations to guide the threshold definitions in the second stage. This method of inference avoided the use of any sample data. Instead, the available ground observations were used entirely for the accuracy assessments. Figure 3 illustrates the technical workflow that was applied between GEE and a GIS.
Currently, in a participatory mapping process, for example, end users would only need to apply the GIS side of the workflow, while a technician with coding expertise would be responsible for the GEE side of the workflow (unless a GEE application were customised for the GIS functionality).
The two-stage approach is also intended to reduce the amount of imagery the analyst needs to download from GEE to process and analyse locally, since the image data is quite large. Once thresholds have been assessed from the first stage with the graphic user interfaces for density slicing available in a GIS (which are currently not available in GEE), then the thresholds for the second stage can be refined more easily for the whole extent of each of the ecoregions from GEE with some additional numeric adjustments through trial and error.

Phenologic Imagery Selection and Processing
In previous studies of seasonal phenology [81,82], the analysts' and the botanists' local ecological knowledge and visual inspection of interannual, median-based image composites by season created in GEE for the coast of NSW suggested that rainforests are more distinguishable from other vegetation types during the drier season of summer. Temporal aggregation from median-based composites has been shown to significantly reduce the data volumes on a per-band, per-pixel basis, reducing anomalies, clouds, shadows and abnormal pixelation, resulting in faster and easier analyses suitable for vegetation modelling with accuracy as high as that of the time series data [83]. Phan et al. [84] purposively selected summer images to classify vegetation in an environment heavily affected by snow and cloud cover in the winter. Similarly, a median-based summer composite was applied in this study because of the phenological difference between rainforests and the seasonally varying native sclerophylls that are noticeably less green and, hence, more distinguishable from rainforests in the summer. The summer composites are also conveniently the least affected by clouds and hill shade. In this study, interannual, median-based composites were created for a range of years in order to account for the variability of a relatively steady state from all the available Sentinel 2 TOA (Top-Of-Atmosphere reflectance) multispectral imagery for the selected months between November and February, depending on the latitude of the bioregions, and between the years 2015-2018 in order to avoid effects of the 'black summer' bush fires between 2019 and 2020, as shown in Table 2. The interannual, median-based composites, together with the cloud-masking function (maskS2clouds) [49] provided by the GEE public example, provided a seamless, well-colour-balanced imagery mosaic.
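The median-based compositing described above can be illustrated outside GEE with a minimal sketch. It assumes each acquisition is a small 2D grid of reflectance values for one band, with None marking cloud-masked pixels; the function name median_composite and the toy values are hypothetical and are not part of the study's workflow.

```python
from statistics import median


def median_composite(stack):
    """Per-pixel median over a list of equally sized 2D grids.

    None marks a masked (e.g. cloudy) pixel and is ignored; a pixel that is
    masked in every acquisition stays None in the composite.
    """
    rows, cols = len(stack[0]), len(stack[0][0])
    composite = []
    for r in range(rows):
        out_row = []
        for c in range(cols):
            valid = [img[r][c] for img in stack if img[r][c] is not None]
            out_row.append(median(valid) if valid else None)
        composite.append(out_row)
    return composite


# Three toy summer acquisitions of one band (2 x 2 pixels).
acquisitions = [
    [[0.125, 0.875], [0.25, None]],
    [[0.25, 0.25], [None, None]],
    [[0.5, 0.375], [0.75, None]],
]
summer_median = median_composite(acquisitions)
```

The bright outlier in the top-right pixel (0.875, for instance an unmasked cloud edge) does not survive the median, which is the property that makes such composites look seamless.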
In order to reduce the noise or 'salt-and-pepper effect' on the resultant indices that is typical of pixel-based approaches, a low pass filter was applied to the imagery in GEE prior to classification with the convolve() function, which appeared to produce equivalent results to the focal_mean() or reduceNeighborhood() functions.
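As a rough illustration of what a low pass filter does to a salt-and-pepper pixel, the following sketch applies a uniform moving-average window to a small grid. It is not the GEE convolve() implementation; the function name mean_filter, the window radius and the edge handling (averaging only the neighbours that fall inside the grid) are assumptions made for the example.

```python
def mean_filter(grid, radius=1):
    """Uniform (box) low pass filter over a 2D grid of numbers.

    Each output pixel is the mean of the input pixels within `radius`
    rows/columns of it; windows are clipped at the grid edges.
    """
    rows, cols = len(grid), len(grid[0])
    smoothed = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            window = [
                grid[rr][cc]
                for rr in range(max(0, r - radius), min(rows, r + radius + 1))
                for cc in range(max(0, c - radius), min(cols, c + radius + 1))
            ]
            smoothed[r][c] = sum(window) / len(window)
    return smoothed


# A single isolated high value surrounded by zeros ("salt" noise).
noisy = [
    [0, 0, 0],
    [0, 9, 0],
    [0, 0, 0],
]
smooth = mean_filter(noisy, radius=1)
```

The isolated value is spread over its neighbourhood, so a single anomalous pixel is less likely to cross a density slicing threshold on its own.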
A visual inspection indicated that, for all the candidate indices, some agricultural and water features were misclassified as rainforest. Therefore, it was necessary to subtract these areas with an existing forest mask [85] before calculating the areas in the Results.

Selection of Candidate Indices and False Colour Band Ratio RGB Combinations
For the sake of sensor compatibility, the selection of indices and RGB combinations for comparison was limited to those involving satellite bands available at the higher resolutions for Sentinel 2, as well as Landsat, satellites. Thus, only the Blue, Green, Red, NIR, SWIR1 and SWIR2 bands were considered, because the Red-edge bands are only available for Sentinel 2, and the Thermal bands are only available for Landsat and are of a much lower resolution. Table 3 lists the vegetation indices and false colour RGB combinations that were tested and the reasons for their selection. Hue was preferred due to its aforementioned communicability and because it is less affected by shadows than saturation and value (or intensity) are in the typically rugged terrain of rainforests on the south-eastern coast of Australia. Hue was calculated in GEE with the Image.rgbToHsv().select('hue') function.
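The relationship between an RGB composite and its hue angle can be sketched with Python's standard colorsys module. This is an illustration only, not the GEE implementation: the helper names stretch and hue_angle and the stretch bounds are assumptions, and the inputs are presumed to be channel values already scaled to the 0 to 1 range.

```python
import colorsys


def stretch(value, low, high):
    """Linear stretch of a band (or band ratio) value to the 0-1 range, clamped."""
    scaled = (value - low) / (high - low)
    return min(1.0, max(0.0, scaled))


def hue_angle(red, green, blue):
    """Hue angle in degrees (0-360) of an RGB triplet with channels in 0-1."""
    hue_fraction, _saturation, _value = colorsys.rgb_to_hsv(red, green, blue)
    return hue_fraction * 360.0


# An orange pixel in full light and the same pixel at half brightness (shadow).
lit = hue_angle(0.8, 0.4, 0.2)
shaded = hue_angle(0.4, 0.2, 0.1)
```

In this toy case the shaded pixel has a lower HSV value but the same hue as the lit pixel, which is the behaviour that motivates measuring hue in rugged terrain.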
Table 3 (recoverable entries and their reasons for selection):
EVI 2. Two-band functional equivalent to the EVI, commonly used in tropical forest studies [89]. EVI 2 has been shown to be less sensitive to background reflectance, including bright soils and non-photosynthetically active vegetation [90,91].
NGRDI. Replacing the NIR band (in NDVI) with the Green band appears to reduce the high value saturation produced by NDVI.
SWIR1/Infrared Colour Band Ratio RGB hue (hue angle from band ratio R, G, B: SWIR1/NIR, SWIR1/Red, SWIR1/Green). A distinct hue from a band ratio RGB differentiating the SWIR1 band with the bands from the 'Infrared Colour' RGB combination to include structural, greenness and wetness data.
The band ratio RGB blend was selected from a matrix of possible arithmetic combinations of multiplications and divisions between two band ratio RGBs: the NIR/Red, SWIR1/Green, Red/Blue RGB (which is a combination of the 'Land/Water' RGB and the 'Natural colour' RGB) and the SWIR1/Infrared Colour Band Ratio RGB described in Table 3. The former band ratio RGB was chosen because it retains the appearance of the 'Land/Water' RGB, which was used for the SII key for rainforests in Table 1 and as the Reference RGB visualisation in Table 3, with related HSV S and V values from the visual interpretation. Multiplications and divisions were chosen based on the assumption that they would separate the relations present in the two selected RGBs as much as possible, more than additions and subtractions, which have more chance of producing negative or oversaturated values. The possible blends that were assessed are presented in Table A1 in Appendix A. After testing the thresholds for the hue angles of each of these combinations, the combination *, /, / was selected to produce the Aravena Rainforest Band Ratio RGB Blend.
This combination was preferred for the visual and numeric separability it provided for rainforests and because it shifted the rainforest hue angle values to the beginning of the colour wheel for values from 0 to between 6.825 and 7.95 degrees (on a range of 0 to 360), depending on the ecoregion, which makes it convenient for the assessment of just one threshold value rather than two.
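The convenience of a single threshold can be sketched as a one-class density slice on a grid of hue angles. The function name density_slice and the toy hue values are hypothetical; the two threshold values are the bounds of the range reported above, and which ecoregion each applies to is not assumed here.

```python
def density_slice(hue_grid, max_hue_deg):
    """One-class, one-threshold slice: 1 where 0 <= hue < max_hue_deg, else 0."""
    return [
        [1 if 0.0 <= hue < max_hue_deg else 0 for hue in row]
        for row in hue_grid
    ]


# Toy hue angles in degrees (0-360).
hues = [
    [2.0, 7.5],
    [355.0, 120.0],
]
strict = density_slice(hues, max_hue_deg=6.825)
relaxed = density_slice(hues, max_hue_deg=7.95)
```

Because the class of interest starts at 0 degrees, only the upper bound has to be tuned. In this sketch a hue of 355 degrees, which sits just on the other side of the colour wheel's discontinuity, is deliberately left outside the class.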

Density Slicing with Colourimetric Benchmarks
A benchmark is a standard by which the value of an indicator can be compared and judged; it can be representative of central tendencies or boundary (or gradient) conditions and can be a composite of measurement indicators [92]. Taking into consideration that density slicing of individual image bands can lead to the misclassification of features with similar brightness to the feature of interest [67] and that the HSV colour space transformation decorrelates value and saturation from hue, colourimetric benchmarks, based on the transitions of hue between rainforest and non-rainforest, were used to determine the density slicing thresholds of the candidate indices.
A pseudo-colour visualisation was applied to the greyscale of each of the candidate indices with a quantile interval stretch in a GIS and overlaid on the reference imagery (Table 1), which defined the ecological colourimetric deduction, where the orange hues in the 'Land/Water' RGB combination were assumed to distinguish rainforest. The intervals from the candidate index pseudo-colour stretch were visually associated with the reference image and classified as rainforest where they were orange on the reference image. Visually comparing each candidate index or hue angle to the same reference image in this way ensured that the decision making was applied consistently across the mapped area. The selection of colourimetric benchmarks was an interactive process of augmented visual interpretation, similar to that applied by Bey et al. [18], which involved an iterative process of elimination, in which the intervals of the candidate indices were progressively attributed for presence or absence of rainforest with visual confirmation from the VHR imagery available in GEE on multiple locations across the current ecoregion. In this process, non-rainforest sclerophylls were found to follow a moisture gradient from dry to moist that can be associated with a colourimetric gradient on the 'Land/Water' RGB combination from cyan to red (a complementary relationship on the colour wheel, rather than a circular sequence). These colourimetric extremes thus provided ideal benchmarks for thresholding.
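A quantile interval stretch of the kind offered by GIS symbology tools can be approximated as follows. This is a sketch under stated assumptions: the function names quantile_breaks and interval_of are hypothetical, the number of intervals is arbitrary, and the quantile method used by a particular GIS may differ from that of Python's statistics.quantiles.

```python
from bisect import bisect_right
from statistics import quantiles


def quantile_breaks(values, n_intervals):
    """Break values that split `values` into intervals of roughly equal counts."""
    return quantiles(values, n=n_intervals)


def interval_of(value, breaks):
    """Index (0-based) of the quantile interval that `value` falls into."""
    return bisect_right(breaks, value)


# Toy index values 1..100 split into four pseudo-colour intervals.
index_values = list(range(1, 101))
breaks = quantile_breaks(index_values, n_intervals=4)
```

Each interval would then be given its own pseudo-colour and attributed as rainforest or non-rainforest by visual comparison with the reference image, as described above.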

Accuracy Assessment Design
The accuracy of map outputs from each algorithm was assessed using an independent field data set of floristic observations from the New South Wales vegetation database, BioNet [93]. The data set consisted of almost 35,000 in situ vegetation plots (georeferenced points) that have been assigned to plant community types, which were reduced to a binary label indicating rainforest or non-rainforest. Each of the points was intersected with the candidate rainforest classifications and considered correct if it fell within 30 m of the classified rainforest extent, i.e., with a 1-pixel tolerance of the 15 m resolution classification. A repeated Monte Carlo cross-validation approach was taken to estimate accuracy and confidence intervals, and a spatial blocking procedure was included in the resampling to avoid the bias that can arise from clustered sample points [94]. This involved taking many random subsamples of the validation data, stratified by spatial blocks, and then expressing the accuracy as the summary of the sampling distribution of the accuracy metrics calculated for each subsample. One hundred and thirteen blocks (~2000 km² per block) were used across the study area, and the overall accuracy was calculated via a standard percentage agreement error matrix. Since rainforest samples were relatively rare in the overall dataset, the subsample was first limited to contain an equal number of rainforest and non-rainforest samples, and then 5 points were sampled from each spatial block. One thousand iterations of the resampling were performed, and the median of the distribution was used as the reported overall accuracy value, with the 2.5th and 97.5th percentiles as the 95% confidence interval [94]. Each of the mapping regions (North Coast, South Coast, Tablelands and Ranges and the Northwest slopes) was assessed separately.
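The resampling procedure described above can be sketched as follows. This is one possible reading of the text, not the authors' code: the function name blocked_accuracy, the tuple layout of the validation points, the handling of blocks with fewer than five points and the simple rank-based percentiles are all assumptions.

```python
import random
from statistics import median


def blocked_accuracy(points, per_block=5, iterations=1000, seed=0):
    """Median overall accuracy and 95% interval from blocked resampling.

    points: list of (block_id, observed, predicted) with labels 1 (rainforest)
    or 0 (non-rainforest).
    """
    rng = random.Random(seed)
    positives = [p for p in points if p[1] == 1]
    negatives = [p for p in points if p[1] == 0]
    n = min(len(positives), len(negatives))
    if n == 0:
        raise ValueError("both classes must be present in the validation data")
    scores = []
    for _ in range(iterations):
        # Step 1: balance the two classes by random subsampling.
        balanced = rng.sample(positives, n) + rng.sample(negatives, n)
        # Step 2: draw up to `per_block` points from each spatial block.
        by_block = {}
        for point in balanced:
            by_block.setdefault(point[0], []).append(point)
        chosen = []
        for block_points in by_block.values():
            k = min(per_block, len(block_points))
            chosen.extend(rng.sample(block_points, k))
        # Step 3: overall accuracy as percentage agreement.
        correct = sum(1 for _, obs, pred in chosen if obs == pred)
        scores.append(correct / len(chosen))
    scores.sort()
    lower = scores[int(0.025 * (len(scores) - 1))]
    upper = scores[int(0.975 * (len(scores) - 1))]
    return median(scores), (lower, upper)


# Toy validation set: 10 blocks of 20 points, every prediction correct.
perfect = [(b, i % 2, i % 2) for b in range(10) for i in range(20)]
accuracy, interval = blocked_accuracy(perfect, iterations=50)
```

Sampling a fixed number of points per block limits the influence of densely surveyed areas, which is the purpose of the spatial blocking described in the text.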

Results
The colourimetric hue-based indices and the new Aravena ratio subtraction index consistently produced more accurate maps of rainforest than the conventional indices in all the ecoregions, with the Aravena band ratio RGB blend consistently ranking highest (Table 4). The only exception was in the Tablelands and Ranges, where NGRDI ranked higher than in the other ecoregions and the Aravena Index ranked fractionally higher than the Aravena band ratio RGB blend. All indices performed worse in the Northwest slopes. Table A2 in Appendix A shows examples of the mapping results for the two highest ranking indices that performed comparably to each other in different landscape types, with the Aravena Rainforest band ratio RGB hue generally mapping slightly more rainforest than the Aravena Rainforest Index. Table A3 in Appendix A provides a comprehensive accuracy assessment for all the indices by ecoregion, with errors of commission and omission.
The area of rainforest mapped by the best-performing index in each ecoregion differed from the combined extent mapped in the existing 100 m resolution of the NVIS and NFI classifications (Table 5). The visual inspection of Table A2 in Appendix A indicates that finer features such as riparian gullies and vine thickets were picked up more consistently and with more detail in the new 15 m resolution classification. Lastly, Table 6 shows a ranking of the non-forest agricultural areas that were misclassified as rainforest by the indices. The visual observation suggested that the indices will generally overspill onto agriculture with similar false colour hues. The rank order of such errors across the indices was the same for all ecoregions, with the band ratios having greater errors than the other indices, except that NGRDI had higher error rates than NDVI and EVI on the South Coast and Tablelands. The SWIR1/Infrared Colour Band Ratio RGB hue (and, to a much lesser degree, the NGRDI) also misclassified water as rainforest. Both the new Aravena Rainforest Index and band ratio RGB blend hue displayed higher misclassification percentages than conventional indices, with the index consistently lower than the hue but in varying degrees across the ecoregions.

Discussion
While false colour interpretation has been around for a long time, the direct measurement of false colour metrics for vegetation mapping has received little attention. In the past, most studies were limited to single images or small sets of images to maximise the phenological differences among forest types [57]. Detailed temporal and phenologic considerations are now more feasible with the increased accessibility of free imagery from NASA or the ESA and with the supercomputing power of platforms like GEE. The results of this study showed that high classification accuracies can be obtained from single interannual seasonal aggregate image composites, especially if band ratios are used instead of single bands and analyses are concentrated on bands from Red and beyond the visible spectrum (in the Near and Shortwave Infrared ranges), which are hardly affected by atmospheric effects and variations in illumination.
This mapping effort has produced the highest resolution (15 m) systematic classification of rainforest extent for NSW, Australia to date, with an overall average accuracy of 79%. This region is ecologically indicative of the whole east coast of Australia. SII and the direct colourimetry of false colour band ratio RGB blends can provide feasible and effective classifications once a consistent ecological-colourimetric deduction has been defined. This is because the band ratios per channel can accentuate and distinguish features arithmetically, keeping the solutions consistent and scalable without the need for parametrisation, leaving the larger budget of field samples available to accuracy assessments and refinements.
Both the Aravena Index and the Aravena band ratio RGB blend hue yielded accurate results that were comparable to each other. Decorrelating hue from RGB composites with the HSV transformation, after accentuating the separability of features with band ratio RGBs and their blends, can produce superior results to conventional vegetation indices. The physiognomic traits of leaf greenness and glossiness and the overall stand moisture can characterise rainforests more effectively than indicators of photosynthetic capacity or productivity alone, since the same hues can generalise the wide range of rainforest types from dry to very moist, which linear productivity indices cannot. The results also emphasise the importance of the SWIR1 band, which was used in all the highest-ranking candidates. While this increase in performance comes at the price of misclassifying unrelated agriculture, such errors can be easily corrected by the application of a land use or forest mask. The NIR band seems to reduce rainforest discrimination, since even the NGRDI displayed a better performance than the more-often applied NDVI and EVI indices. The results also demonstrated the importance of ecoregionalisation, in that if the whole study area had been attempted with the single threshold for the ecoregion with the most productive sclerophylls, then drier or less productive regions would have been under-mapped.
A lack of consensus remains in Australia as to how rainforests should be defined, and a particular point of debate seems to be the classification of communities with closed canopies of rainforest species below tall eucalypts. These ecotonal or mixed forests have been considered as either non-rainforest, seral stages of rainforest or as distinct vegetation types [95]. Perhaps some of the inaccuracy from this mapping effort is due to rainforest with non-rainforest emergents or sclerophyll-dominated forests with rainforest understoreys. Further field work would be required to validate this.
Rainforests can be considered visually distinguishable optical types [96] that can be directly measured and easily communicated with a false colour colourimetric ontology. This is due to the knowledge-based physiognomic and phenologic relations that can be visually deduced from ecological knowledge and validated with comprehensive geospatial accuracy assessments. False colour hues can therefore be considered scalable landscape metrics, and "Red in the Aravena Rainforest Band Ratio RGB Blend, with hue angle < x°, during Summer" has proven to be a simple, scalable, reusable and communicable colourimetric ontology for the rainforests of the east coast of Australia.
While some false colour combinations appear to be more perceptually intuitive and separable, other combinations differentiate hue statistically more easily. A combination may distinguish a particular feature more effectively by simplifying rather than diversifying the range of colours represented. Although the 'Land/Water' RGB may initially appear to show the widest range of colours and, hence, the best separability of classes, the band ratio RGB blend that performed best in this study only appears to show a spread of predominantly red to blue hues. The use of band ratio RGBs has the additional benefit of reducing atmospheric and hill shade effects. From this study and experimentation with other classes such as water and urban areas, a set of three heuristics can be used to decide which RGB band ratio combination will best distinguish a particular feature of interest consistently throughout a region:

1. Ideal combinations separate features of interest as much as possible on the colour wheel. Analogous (neighbouring) colours like red and orange can be difficult to distinguish; however, complementary or triadic colours like red and blue (as those in the Aravena Rainforest Band Ratio RGB Blend) are much easier to separate.
2. The feature should ideally be distinctly represented from other features by either red or magenta in order to only require one threshold from an extreme to be determined and to avoid any saturation or loss of data across the colour wheel's discontinuity. This is because red is the first colour in the colour wheel from 0 degrees, while magenta is the last colour in the colour wheel before 360 degrees.
3. RGBs with overly dark or bright tones are usually not preferred, as they have the potential to contain multiple hues that are difficult to discern visually.
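The first heuristic can be made concrete by measuring how far apart two hues sit on the colour wheel. The sketch below is illustrative only; the function name hue_separation is hypothetical, and the nominal hue angles used for red, orange and blue (0, 30 and 240 degrees) are the conventional HSV positions rather than values measured in this study.

```python
def hue_separation(hue_a, hue_b):
    """Shortest angular distance in degrees (0-180) between two hue angles."""
    difference = abs(hue_a - hue_b) % 360.0
    return min(difference, 360.0 - difference)


red_vs_orange = hue_separation(0.0, 30.0)    # analogous colours
red_vs_blue = hue_separation(0.0, 240.0)     # triadic colours
across_wrap = hue_separation(350.0, 10.0)    # hues either side of 0/360
```

Under these assumptions red and blue are four times further apart than red and orange, and two reddish hues either side of the discontinuity are only 20 degrees apart even though their raw values differ by 340.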
Interpreting patterns directly from multispectral satellite imagery produces the highest fidelity end product of ecological observations by avoiding unnecessary loss of resolution or corruption of the raw image data with coarser or pre-modelled data (such as soil or precipitation data). A one-class, one-index approach can thus be robust, analytically simple, scalable, communicable, fast to process and, thus, more cost-effective. An additional advantage is that the products can be adjusted easily, quickly and transparently during stakeholder negotiations and with TEK and LEK (Traditional and Local Ecological Knowledge) holders [97,98]. An understanding of how mapping solutions work, how reliable decisions are and how they were arrived at is particularly valuable to gain the confidence of users during negotiations and stakeholder consultations, such as those of conservation and participatory projects. The approach should further democratise remote sensing by facilitating the transparency and communicability of ecological observations for environmental and land resource consultation, negotiations and management for decision-makers, as well as traditional/indigenous and local stakeholder knowledge exchange and participation, while promoting simplicity and feasibility through rapid processing. While the colourimetric thresholds and evidential reasoning could be considered subjective by some, or requiring expertise, the SII interpretation key provided in this study with its associations and the quantitative colourimetric ontology should provide the formality and confidence necessary to guide and inform different users quite explicitly and consistently, especially if some training is provided to people with local ecological knowledge, as has been shown with various participatory mapping projects [99].
The performance ranking of the candidate indices in this study was intended to show that better inputs could be provided to modern statistical approaches through colourimetry. How colourimetric indices can be further refined, automated or integrated into modern statistical techniques is a matter for further investigation. Whether the most effective blend for feature separability is best determined by exploring factorial combinations, as in this study, by the blending operations typically applied by image manipulation/editing software or by statistical methods from extensive large-scale field sampling is also worth further investigation. Some clustering and machine learning algorithms for this could include K-means++, Gaussian mixture, Hidden Markov models and Hierarchical or Tree models [100].
The results confirm the potential for large-scale, high-resolution mapping of broadly defined vegetation types and suggest that testing for other applications, such as temporal monitoring for disturbance typing, severity mapping and recovery assessment, is well warranted. It is hoped that more efficient purposive sampling strategies will be developed in the future with the benefits that colourimetric deductions and visual analytics can contribute and their requirements in mind.

Cautions for Implementation
For the approach to be truly scalable and consistent in its communicability, it is important that the imagery be viewed in, discussed and measured from the same stretch. This is not possible by default from a GIS where a subset will be stretched to the extent of the subset's histogram unless a fixed linear stretch is saved and applied to all the subsets. It is therefore preferable to attempt the colourimetric deduction from GEE where the whole world can be visualised consistently with the same stretch. Histogram stretching is, however, recommended for the visualisation and classification of the sample areas once the deduction is understood. As with this study, a forest mask or some post editing will be necessary to remove incorrectly classified agricultural features. Caution must be taken when using hue. It is a circular measure with a discontinuity at 360° [100], requiring circular statistics and/or appropriate scaling for further correlation and multivariate analyses [101]. Additionally, while hue is traditionally represented by degrees between 0 and 360, some software works with hue angles between 0 and 240 or 0 and 255 for processing efficiency. Caution should also be taken when applying the inter-annual image composite technique in contexts where changes are known to occur. For example, for Australia, it should be understood that some rainforest will have been burnt during the black summer fires (2019 to 2020), and in countries where illegal logging persists, these changes will not be represented clearly. An uncertainty mask of the changes is recommended to account for this.
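Both cautions about hue can be illustrated with a short sketch: rescaling hue from a software-specific range to degrees, and averaging hue angles with circular rather than arithmetic statistics. The function names to_degrees and circular_mean_deg are hypothetical, and whether a given package maps its upper bound to exactly 360 degrees is an assumption that should be checked against that package's documentation.

```python
import math


def to_degrees(hue, scale_max):
    """Rescale a hue stored on a 0..scale_max range to degrees (0-360)."""
    return hue * 360.0 / scale_max


def circular_mean_deg(hues_deg):
    """Circular mean of hue angles in degrees, returned in the 0-360 range."""
    sin_sum = sum(math.sin(math.radians(h)) for h in hues_deg)
    cos_sum = sum(math.cos(math.radians(h)) for h in hues_deg)
    return math.degrees(math.atan2(sin_sum, cos_sum)) % 360.0


# Two reddish hues either side of the discontinuity.
reddish = [350.0, 10.0]
arithmetic = sum(reddish) / len(reddish)   # lands on cyan, which is misleading
circular = circular_mean_deg(reddish)      # lands on red (0 or 360 degrees)
```

The arithmetic mean of 350 and 10 degrees is 180 degrees (cyan), whereas the circular mean stays at red, which is why circular statistics or rescaling are needed before correlation or multivariate analyses of hue.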

Conclusions
A continuing endeavour of remote sensing research is to maximise the performance of mapping workflows, as well as their simplicity and transparency. Colourimetric approaches have offered potential advances in this area. Accurate results were achieved ecoregionally at a minimal cost by processing a huge amount of data quickly with the freely available data and supercomputing power provided by GEE. While traditional vegetation indices focus on productivity, this study showed that separability-enhanced false colour band ratio RGB hue values can be considered scalable multivariate landscape indicators that can encapsulate multiple physiognomic properties such as greenness, wetness and brightness.
A formalised colourimetric approach to intuitive SII with a visual interpretation key and the use of one-class density slicing demonstrates the effective simplification of remote sensing classifications for the purposes of accessibility, communicability, scalability, feasibility and repeatability for visually distinguishable features. A wider audience can benefit, since no programming or advanced statistical knowledge is required for the user from a GIS once the deduction has been validated and the phenologic image composites have been constructed.
Due to the known representativeness of rainforest types in NSW, we conclude that either the Aravena Rainforest Band Ratio RGB Blend hue angle or the Aravena Rainforest Index could potentially be used to reliably map and monitor rainforests for the whole of Australia.