Land Cover Mapping in Northern High Latitude Permafrost Regions with Satellite Data : Achievements and Remaining Challenges

Most applications of land cover maps that have been derived from satellite data over the Arctic require higher thematic detail than available in current global maps. A range of application studies has been reviewed, including up-scaling of carbon fluxes and pools, permafrost feature mapping and transition monitoring. Early land cover mapping studies were driven by the demand to characterize wildlife habitats. Later, in the 1990s, up-scaling of in situ measurements became central to the discipline of land cover mapping on local to regional scales at several sites across the Arctic. This includes the Kuparuk basin in Alaska, the Usa basin and the Lena Delta in Russia. All of these multi-purpose land cover maps have been derived from Landsat data. High resolution maps (from optical satellite data) serve frequently as input for the characterization of periglacial features and also flux tower footprints in recent studies. The most used map to address circumpolar issues is the CAVM (Circum Arctic Vegetation Map) based on AVHRR (1 km) and has been manually derived. It provides the required thematic detail for many applications, but is confined to areas north of the treeline, and it is limited in spatial detail. A higher spatial resolution circumpolar land cover map with sufficient thematic content would be beneficial for a range of applications. Such a land cover classification should be compatible with existing global maps and applicable for multiple purposes. The thematic content of existing global maps has been assessed by comparison to the CAVM and regional maps. None of the maps provides the required thematic detail. Spatial resolution has been compared to used classes for local to regional applications. The required thematic detail increases with spatial resolution since coarser datasets are usually applied over larger areas covering more relevant landscape units. This is especially of concern when the entire Arctic is addressed. A spatial resolution around 30 m has been shown to be suitable for a range of applications. This implies that the current Landsat-8, as well as Sentinel-2 missions would be adequate as input data. Recent studies have exemplified the value of Synthetic Aperture Radar (SAR) in tundra regions. SAR missions may be therefore of added value for large-scale high latitude land cover mapping.


Introduction
Land cover information is of high value for applications in the Arctic.More than 65% of land area north of 60 degrees latitude is underlain by permanently frozen ground.Information on surface properties serves as an indicator or modeling input for the extraction of permafrost properties [1].This also applies to assessments of carbon stores and fluxes.Permafrost is expected to change with climate change in the upcoming decades [2].It has been pointed out by a range of studies (e.g., Westermann et al. [1] and Ottle et al. [3]) that there is a need for accurate land cover description in the northern high latitudes.A suitable circumpolar map does not yet exist to date, although a range of studies demonstrated the suitability of satellite data for this purpose on local and regional scales.
A first review on traditional Arctic land cover maps as a baseline for the development of a circumpolar map was already published in 1995 by Walker et al. [4].The Circum-Arctic Vegetation Map (CAVM) has in the following been developed using satellite data (Advanced Very High Resolution Radiometer (AVHRR)) and published [5].This map represents vegetation communities and is to date the only consistent map across the Arctic with the necessary detail for many applications.It is limited to the area north of the treeline and has a 1-km spatial resolution.This is insufficient for a range of applications, as this cannot capture the heterogeneous nature of Arctic land cover (e.g., [6,7]).Recent globally-available land cover maps offer higher spatial detail (300-500 m), but low thematic content.
A new land cover classification should be compatible with global maps and applicable for varying purposes.Local, regional and globally-developed datasets need to be reviewed and discussed in this context.
This paper presents the techniques used for local to regional, as well as national-scale land cover mapping and discusses their varying applications in the Arctic.This includes the use of classifications for soil carbon and carbon flux upscaling, permafrost features, as well as transition monitoring.To illustrate the need for consistent and higher thematic detail in land cover products, we have compared the classes of currently available global land cover maps to the CAVM, a widely-accepted map for coarse resolution land cover assessment [8].The purpose of this paper is to identify the required thematic detail in high latitude permafrost regions and suitable data sources, as well as to establish related problems in current global maps and potential methods to solve them.

Development of Techniques for Local-to Regional-Scale Mapping
The development of retrieval methods and applications has evolved with the increasing availability of satellite data.The vast majority of land cover mapping studies utilize optical satellite data.Of high importance was the launch of the Landsat series.A number of studies on habitat mapping in Alaska and the Canadian Arctic were undertaken in the 1970s and 1980s [9].This was extended in the early 1990s with SPOT MSS (Satellite Pour l'Observation de la Terre Multispectral Scanner System) [10] and SPOT HRV (High Resolution Visible) [9,11].These early studies aimed specifically at habitat assessments, including water fowls, muskox and the impact of reindeer grazing.Synthetic Aperture Radar (SAR) data became available with the launch of the European ERS-1 (C-band, 5.6 cm) in 1991 and the Japanese JERS-1 (L-band, 23.5 cm) in 1992.The Russian Almaz SAR mission (S-band, 9.6 cm) operated for 17 months from 1991-1992.Shortly afterwards, studies emerged analyzing the relative information content of the data from classification algorithms applied to Almaz, ERS-1, JERS-1 and Landsat-TM (Thematic Mapper) [12] to discriminate wet tundra habitats.SAR properties alone have also been evaluated for tundra habitat classifications [13].The efficiency of SAR data in discriminating traditional classes in geobotanical maps was shown to be lower compared to optical [12].In 1995, Belchansky et al. [13] demonstrated that multi-look processing of the SAR data was favorable for land cover studies.Such pre-processing steps are to date standard for SAR-based land cover applications.The additional bands available with Landsat TM compared to MSS were shown to be of high value for classifications in the context of wild live studies [14].This spectral resolution is available from most optical high to medium resolution spaceborne sensors to date [15].
Many of these early studies focused on the Alaskan North Slope ( [10,[12][13][14], [1994][1995][1996].SPOT related studies were published for the Canadian High Arctic [9] and Scandinavia [11].Beyond that, Mosbech and Hansen [16] compared satellite imagery and infrared aerial photography for vegetation mapping methods in East Greenland using both SPOT and Landsat TM data.They highlighted the inadequacy of satellite data to capture the fine-scaled tundra Arctic vegetation mosaic.This problem has been partially overcome to date with high spatial resolution satellites, but remains a challenge when mapping larger areas.The heterogeneity in these landscapes has been also addressed for a study site on Svalbard by Brossard and Joly [17].Landscape types have been retrieved from Landsat.Vegetation probabilities have been assigned to these classes rather than specific class names.This approach provides a better representation, but is until now not commonly applied. Although AVHRR images have been available since 1978, they were not studied until the late 1990s, when the data were first analyzed for Arctic land cover assessments.Due to its coarse spatial resolution (1 km), the sensor had not been of interest, as most research at that time had a local to regional focus only.A method for mapping land cover in Canada was however developed by Cihlar et al. [18] and published in 1996.An AVHRR-based land cover map for the Alaskan North Slope was published in 1997 [19].This was soon followed by a prototype for a circumpolar map [20].The thematic application areas of satellite data widened at the same time.The retrieval of biogeophysical parameters, such as Net and Gross Primary Production (NPP/GPP) [21,22], came into focus.Vegetation productivity can be directly linked to the reflectance in the visible and near-infrared part of the electromagnetic spectrum.NPP and GPP are determined using multiple sources, not only from satellite-derived products alone.Liu et al. [21] demonstrated the application potential for such studies using the AVHRR land cover map of Cihlar et al. [18] for a study site in Canada in 1999.
Takeuchi et al. [23] investigated in 2003 the potential of AVHRR for up-scaling of methane emissions in the West Siberian Lowlands, which are mostly located in the boreal biome, but of interest to high latitude studies, as the boreal biome is partially underlain by permafrost.SPOT HRV data from a case study area were used to develop an approach for the retrieval of land cover fractions within the AVHRR pixels, including the classes birch forest, conifer forest, forested bog and open bog.AVHRR land cover fraction maps were subsequently used to upscale methane emissions over the entire West Siberian Lowland area.
Landsat-based classifications and their applications are well established, and methods that combine them with high spatial resolution satellite data have been frequently applied (e.g., [7,[24][25][26]).Specific methods are required to deal with the information content in the high spatial resolution images.Moody et al. [27] presented a technical solution for unsupervised classification of land cover in multispectral satellite imagery from WorldView-2, using so-called sparse representations in learned dictionaries for Barrow on the Alaskan North Slope.This approach also considers texture information.The resulting classes seem to detect real variability, but re-grouping based on expert knowledge is suggested (e.g., vegetative versus hydrologic studies).An approach for a hybrid map (pixel-, as well as object-based classification) for up-scaling soil organic carbon using Landsat-5 TM and WorldView-2 has recently been suggested by Broderick et al. [26].Fraser et al. [28] demonstrate that predictive ecosystem mapping can be improved with a supervised classification of SPOT HRV that uses a 28-class ecosystem map based on air-photos as training data.A number of studies have recently been published that make use of SAR data only, but utilize the different polarizations, which are acquired by modern SAR systems.Atwood et al. [29] investigated the impact of radiometric correction on the accuracy of polarimetric land cover classification with ALOS PALSAR data in 2012.This led to significant qualitative and quantitative improvements.Banks et al. [30] assessed the potential of dual polarimetric TerraSAR-X and quad-pol Radarsat-2 imagery for land cover mapping using backscatter coefficients and Kennaugh matrix elements.A combination of both sensors is suggested for the best results.The impact of the incidence angle on the scattering behavior of quad-pol Radarsat-2 data has in addition been evaluated and assessed for its potential for unsupervised classification using three polarimetric SAR classifiers [31].Shallow incidence angle images provided better results than steep angle images.A combination of polarimetric SAR data with optical data also led to better results than using SAR data alone [32].Duguay et al. [33] confirm the potential of polarimetric SAR for shrub mapping using the support vector machine method.
The land cover class 'water' has received comparably great attention at high latitudes [34] as it supports the identification of thaw lakes, which are associated with permafrost features.Both, optical and SAR data are commonly used for this purpose.Spatial resolution has been identified as crucial for their mapping [6,35], but their extent is highly variable in time.Mapping Arctic water bodies therefore demands SAR methods, as the sensor's cloud independence ensures better sampling [36,37].This issue is discussed in more detail in Section 4.

Super-Sites
A number of sites across the Arctic developed into super-sites for satellite-derived land cover-based applications.Maps have been developed that served as input for applications, such as upscaling of soil properties and fluxes.This includes the Kuparuk basin on the Alaskan North Slope, the Usa basin in northeastern European Russia and the Lena Delta in northeastern Russia.All of these multi-purpose maps rely on Landsat MSS and/or TM data.Their location is shown in Figure 1, and their properties are summarized in Table 1.Initial developments have been part of a set of studies with a focus on the Kuparuk River Basin.The applicability of satellite data products (mostly Landsat based) for carbon cycle studies [22,41,42] and subsurface thermal properties ( [43], Active Layer Thickness (ALT) over permafrost) was investigated.A multi-purpose land cover map has eventually been developed using Landsat MSS data in 1997 [39,44].It was based on an unsupervised (ISODATA) classification method using the green, red and near-infrared bands.Twenty classes have been eventually distinguished, which have been interpreted for methane flux up-scaling (see Table 1).Tundra is largely distinguished by periglacial features and terrain units within them.The twenty classes are usually grouped into six units [45].The dominating group is 'moist graminoid, prostrate shrub Tundra' [46].SPOT and AVHRR NDVI (Normalized Difference Vegetation Index) values have also been investigated with respect to a potential application estimating carbon dioxide fluxes [41].The approach used to generate the maps was subsequently transferred to the entirety of Northern Alaska [46] and compared to the AVHRR product by Walker [20], but only a 55.4% agreement was determined.Most affected are wetland/moist tundra classes and shrublands.Probable reasons are the difference in spatial resolution and the respective acquisition time.The spatial detail of wetlands and moist tundra in wetland complexes cannot be sufficiently captured by AVHRR.The acquisition time of the satellite datasets also differed with respect to the phenological stage, which has important implications for the sensor's capabilities to identify shrublands.
The majority of methane flux up-scaling investigations in the high latitudes started only several years later, although Reeburgh et al. [41] demonstrated the capability of land cover data in tundra areas already in 1998.Schneider et al. [40] used Landsat 7 ETM+ for up-scaling of methane emissions over the Lena Delta in 2009 (see Table 1).Classification was carried out in two steps using the unsupervised ISODATA method and supervised maximum-likelihood methods.Only the land cover class wet sedge-and moss-dominated tundra was eventually used due to its abundance in the area and importance regarding emissions.To estimate soil organic carbon and nitrogen stocks in the Lena Delta, Zubrzycki et al. [47] grouped the land cover classes to differentiate between the Holocene river terrace and the active floodplains.A Landsat-based map was also recently developed for the West Siberian Lowlands, suggesting its potential use for up-scaling of methane emissions [48].
A multi-purpose land cover classification based on Landsat TM5 has also been derived for the Usa basin in northeastern European Russia [38] using a semi-supervised approach and a similar classification scheme as for the Kuparuk basin in Alaska.The Usa basin does however also extend into the boreal biome.In total, 21 classes were therefore considered, including nine non-water classes outside forested areas (see Table 1).Applications included carbon flux up-scaling [49], as well as up-scaling of soil carbon estimates [50].Heikkinen et al. [49] grouped the original set of classes into seven new classes for one of the sub-basins: wet peaty tundra, dry peaty tundra, peat plateau, tundra heath, willow dominated stand, lake and pond margins and thermokarst lakes and profundal.Landsat TM classes were eventually combined with a high spatial resolution QuickBird image for a subset of the area [25].The same thematic vegetation classes have been used in a nested approach.The original classes by Virtanen et al. [38] were grouped into peat plateau, fen, tundra heath, shrub tundra, willow/meadow and forest in order to use them as a proxy for soil properties.The application of the same QuickBird map was also demonstrated for up-scaling of carbon dioxide fluxes [51].An interesting aspect of that study, beyond land cover mapping, is the description of the close relationship between the leaf area index and carbon dioxide fluxes.Finally, the Landsat land cover map of the northern part of the basin has also been assessed for the application of the up-scaling method to the circumpolar domain [52], where the spatial resolution was artificially reduced to simulate the use of vegetation products from the MODIS sensors and existing global land cover products.The land cover mapping accuracy was significantly lower due to the characteristic small patch size of land cover in these areas.Peat plateaus form however relatively homogeneous areas and contain the majority of soil carbon.It is suggested that the forest land cover classes, as well as willows and meadows are merged, as this procedure would have no considerable effect on the overall estimates.It was thus concluded that coarse resolution peatland maps can be nevertheless of value for soil carbon up-scaling.The issue of patch size has also been addressed by Virtanen and Ek [7] who compared different land cover classifications from QuickBird, ASTER and Landsat TM5.It was concluded that specifically water bodies and fens are problematic in medium resolution classifications, as they occur in the landscape in small or elongated patches and, thus, cannot be realistically classified from larger pixel sized images.

Further Studies
Other regions where satellite data classifications have been used for upscaling purposes are Zackenberg on Greenland, Svalbard and Kytalyk in Russia.Jorgensen et al. [53] used the classes dry tundra, moist tundra, fen and wet grassland derived from Landsat TM normalized difference water index composites to estimate fluxes in the Zackenberg valley, Greenland, in 2015.A more detailed, earlier classification by Elberling et al. [54], which was made for analyses of soil properties from airborne data, was used as the training dataset.This map also served as input for an estimate of soil organic carbon [55].A similar upscaling method, but using GeoEye satellite images, has been applied over Kytalyk using a maximum-likelihood approach for the retrieval of land cover classes [56].A vegetation map from Landsat TM/ETM+ has also been developed for Svalbard [57].It was used for the selection of regions suitable for phytomass retrieval from NDVI over the Nordenskjöld peninsula [58].A further subarctic study in northern Finland utilized high spatial resolution IKONOS data [59].The classes mire, deciduous forest, evergreen forest, tundra and bare rock and lakes were considered to characterize an eddy-flux tower footprint.The up-scaling is limited to the extent of the satellite image in that case.

Permafrost Subsurface and Land Surface Features
The application of land cover classifications has also proven significant for the retrieval of permafrost-related features.Specifically, thermokarst-related phenomena are studied, but periglacial processes in mountainous terrain have also been of interest.Many of these applications combine remotely-sensed terrain data with land cover information, e.g., [60][61][62].Vegetation coverage provides information on process activity (erosion).Changes in hydrology due to permafrost degradation can be obtained from water body classifications [63].Related features are in general rather small; the availability of high spatial resolution satellite data has therefore been the main driver for recent studies.Long-term studies are still limited.General process domains can however be captured using medium resolution datasets [62].
Morgenstern et al. [60] performed morphometric analyses of lakes based on Landsat-7 ETM+ in the Lena Delta.Landsat-7 ETM+ has also been used for the quantification of thermokarst-affected terrain types [64].Ulrich et al. [65] used a combination of Landsat ETM+ (15 classes) and CHRIS-Proba (six classes) for spectral characterization of periglacial surfaces and retrieval of geomorphological process areas.Landsat 5 TM was applied by Turner et al. [66] in 2014 for identifying relations between catchment land cover characteristics and lake hydrology, concluding that remote sensing data alone cannot answer the related questions.Aeolian soil erosion, which can also be responsible for erosional landforms, or deflation patches have been studied in Kangerlussuaq, West Greenland [67], as well as the Yamal peninsula in Russia [68].In both cases, high resolution satellite data were analyzed (WorldView-2 and GeoEye-1, respectively).Beck et al. [69] concluded that only high resolution satellite data can be used to map changes of so-called lithalsas.These are low, circular or oval features around 10-30 m wide and up to 150 m long that result from frost heave and are therefore permafrost-related surface features.High spatial detail is also required for ice wedge mapping.Liljedahl et al. [63] investigated high resolution satellite data from IKONOS, Corona, Gambit KH-7, QuickBird, WorldView-1/-2, SPOT 6/7 and aerial photographs to study ice wedge degradation across the Arctic, where changes over long time periods could be quantified.
The discrimination of lakes has been identified as a major challenge.Optical data can be easily used to identify lakes that are larger than the sensor's spatial resolution.Muster et al. [35] used CHRIS-Proba and Landsat-7 ETM+ to investigate subpixel-scale heterogeneity with special emphasis on Evapotranspiration (ET) estimation.Environments with inundated low centered polygons have been shown to be only detectable with high spatial resolution data.The majority of lakes are however smaller than, e.g., Landsat or MODIS, which are used for global maps [6,35], and frequent cloud cover limits the mapping of these highly dynamic features.SAR data have been suggested to overcome the latter problem.Large areas could be theoretically covered, but wind action, which is common in the Arctic, roughens the lake surface and reduces the detectability [36].In the case of low wind conditions, higher spatial resolution X-band SAR data (2 m) have been shown to be applicable to capture small lake objects [70].They provide more realistic representation of lakes than with Landsat as used for the regional map of the entire Lena Delta (see Table 1 [40]).
Land cover is also used to indirectly map sub-ground permafrost properties.ALT estimates have been made using different approaches, both by evaluation of land cover classifications, as well as by analysis of reflectance values themselves.In this context, there is also potential for X-band SAR applications to infer ALT from the radar backscatter, since the backscatter signal is strongly related to land cover [71].Ou et al. [72,73] used Landsat-5 TM, as well as Radarsat-2 (C-band SAR) for permafrost modeling.A land cover map was produced, and values for peat thickness and vegetation height were associated with the resulting classes.Ground temperatures have been up-scaled using an ecotype land cover map (from among other sources, Landsat, stratified NDVI and unsupervised classification of Jorgenson et al. [74]) on a regional scale [75].The available classes were grouped into four clusters representing certain ground temperature ranges in a permafrost transition zone in Alaska.Ecotypes with thick moss and organic layers dominate the group with lowest temperatures, whereas thinner organic layers can be found at the warmer sites.Sites with mean annual ground temperature above zero degrees are sites with alder, spruce and also burned areas.A potential information source on organic layer properties is also C-band SAR, as measurements under frozen conditions reflect soil organic carbon content north of the treeline [76].Extent mapping of peatland with C-band SAR (unfrozen period) has been shown in numerous studies (e.g., [77,78]).It is assumed that the near surface is comparably wet.An alternative approach is to take advantage of the backscatter signal arising from frozen surface conditions, which can provide general information on wetland distribution with up to four classes [79].
Land cover serves also as input for permafrost modeling, including semi-empirical equilibrium models [1].It has strong implications for the small-scale distribution of the snow cover.Westermann et al. [1] reclassified a MODIS land cover product into high vegetation/forest ("International Geosphere-Biosphere Programme" (IGBP) Classes 1-9, 11) and bare ground/low vegetation (IGBP Classes 10, 15, 16).Areas with bare ground or low vegetation are locations where strong redistributions of snow due to wind drift can occur.Snow cover is more uniform in areas with trees and high vegetation.In some areas the model failed to reproduce the permafrost patterns.Especially the patchiness of shrublands cannot be resolved.Westermann et al. [1] suggest that subsurface properties should also be considered to better estimate the active layer damping factor.Selkowitz [80] has defined the class 'shrub' as vegetation that is less than 0.5 m tall.This specification is supposed to reflect certain properties, such as trapping by snow, where the threshold represents the approximate height boundary between shrubs that extend significantly above tussocks and shrubs that grow primarily between tussocks [80].To better understand changes in permafrost temperatures, information on snow cover extent and depth, including redistribution potential and nival traps, is imperative.Not unexpectedly this demands clear delineation of assorted vegetation types through appropriate thematic detail.

Changing Land Cover and Transition Zones
Land cover change in high latitudes is mostly associated with shrub expansion, forest fires, thaw lake variations, grazing or settlement development (e.g., [81,82]).Change can be identified using, e.g., post-classification approaches or assessing trends of the measured reflectance.The latter yields especially large potential, since it avoids the problem of classification accuracy.The review of Stow et al. [83] in 2004 focused on land cover change in Arctic Tundra ecosystems.A number of selected studies has been summarized demonstrating various approaches with one example for each type of data: repeat terrestrial photography, aerial photography, high, medium and coarse resolution satellite image analyses.Land cover change from satellites is quantified based on indexes rather than classifications in all examples.
AVHRR NDVI is commonly used for pan-Arctic assessment [84,85], but Landsat on the regional scale.Especially the latter points to varying patterns across the Arctic.Greening is associated with lowland areas in Canada [86], but the opposite case is in Siberia where upland areas have been linked to positive NDVI trends [87].This upland greening in the Urals, Gydan and Taymyr regions seems to be strongly connected to cryogenic disturbance processes in areas of patterned ground.According to Frost et al. [87], this is not captured by AVHRR analyses due to the limited size of single disturbance sites.The applicability of various further indexes has been demonstrated by Fraser et al. [88], who used pixel-based trend analysis of long-term Landsat TM/ETM+ image stacks to map gradual, long-term changes to vegetation (TC brightness, greenness and wetness indexes and fractional land cover trained from IKONOS high resolution images).Nitze and Grosse [89], Raynolds and Walker [90] confirmed the suitability of such an approach in the Lena Delta and on the Alaskan North Slope, respectively.These regions coincide with two of the areas with multi-purpose land cover mapping (see Figure 1).The advantage of Landsat is its potential for regional to global mapping and records longer than 30 years, which are especially exploited for water area changes (e.g., [81,91]).However, not only the spatial resolution, also the interpretation of water surface changes is challenging due to the inconsistent data coverage.NDVI trends are also interpreted for wetting and drying [89,92].A general constraint for all Landsat-based approaches is the lack of data for 1989-1998 for much of the Arctic [87].To capture and interpret subtle changes, high resolution satellite images are required [85,87].
A further challenging task is the detection of the tundra-taiga transition area.This is of specific interest to long-term studies on land cover change.Earlier studies, such as by Silapaswan et al. [93] in 2001, concluded that a combination of index analyses and unsupervised classification performs best.A hybrid unsupervised-supervised classification has been suggested by Rees and Williams [94] in 1997 to study the effect of air pollution on the Kola peninsula.Tundra and forest areas were analyzed by unsupervised classification separately.Their separation however is based on supervised classification.This approach results in classes that can be well distinguished rather than targeting predefined purpose-driven classes.Separate parametrization is however required for each scene.A similar approach was applied for the same purpose by Tommervik et al. [95].
Ranson et al. [96] found that methods using multi-spectral Landsat ETM+ summer images, multi-angle MISR red band reflectance images, RADARSAT images with larger incidence angle or multi-temporal and multi-spectral MODIS data provide similar classification results (supervised) over central Siberia for a certain date.Hufkens et al. [97] validated a sigmoid wave curve fitting algorithm to detect and quantify a forest-tundra ecotone based on Landsat ETM+ data in the Northwest Territories, Canada.The evaluation of a global forest cover map based on Landsat showed that forests closely matched canopies of at least 2 m in height in the tundra-taiga transition zone [98].Apart from distinguishing forest from non-forested areas, shrubs were studied in the transition zone.Beck et al. [99] applied a random forest approach to map small and tall shrubs (>1 m) as their distribution reflects climatic conditions.Very high resolution datasets reveal shrub encroachment on disturbance sites [87].
The tundra-taiga transition area is also of interest to reindeer herding.Pastures, especially the abundance of lichens, have therefore been investigated with satellite data.For herding purposes, winter and summer pastures (different vegetation communities as a proxy for certain lichen types, on trees or within tundra heath) have been distinguished by Bartsch et al. [100] applying supervised (maximum-likelihood) classification.Hybrid classification has been used by Rees et al. [101] and Tommervik et al. [102] for grazing impact assessment.Grazing impact has also been addressed by the analysis of the fractional coverage of fruticose lichens with supervised (maximum-likelihood) classification [103].

National-and Regional-Scale Land Cover Maps
The main sources of regional-and/or national-scale land cover images are Landsat data.A special characteristic of such maps is that they are usually a combination of data from several years in order to achieve complete spatial coverage; e.g., Cihlar et al. [104] compiled a Canada-wide map fuzzy K-means clustering with 30-m resolution from three years of data.To some extent, information from coarser datasets, which are also used on the global scale, is applied.
Unsupervised classification with a preceding mosaicking step is commonly used for regional-scale mapping.Such an approach has been, e.g., also used for the first map of Northern Alaska based on Landsat MSS [46].Landsat was also combined with e.g., SPOT VGT for normalization [8,105] in order to create a land cover map for the Canadian Northern Territories.IKONOS was utilized to develop an approach that provides land cover fractions within the Landsat resolution cells [24].The national land cover map of the U.S. (from Landsat TM/ETM+) has been assessed regionally by Selkowitz and Stehman [106] over Alaska.An agreement of 59.4% for Level II (highest thematic detail) was found.This result underlines the need to address high latitude environments separately.
A different scheme has been adopted by Gould et al. [107,108].NOAA-AVHRR data (1 km, including multi-temporal NDVI 10-day composites) are processed for Canada [107,108].Polygons were hand-drawn based on visual interpretation of variations in the AVHRR base map and interpretation of ancillary data.This strategy corresponds to the one chosen for the production of the Circum-Arctic Vegetation Map (CAVM; Figure 2) from NOAA-AVHRR [5,109], which was the first attempt to produce a consistent map with the necessary thematic content for vegetation studies.Only regional maps existed before [4].Many studies have shown that the CAVM is the best regional-scale land cover product in the Arctic; e.g., Rees and Danks [110] have demonstrated its prevailing performance over global land cover maps for reindeer pasture mapping; however, it still lacks the necessary detail regarding separation of lichen-and shrub-dominated land cover types.A further constraint is its limitation to non-forested areas.A range of approaches and sensors has been tested for Northern Eurasia, including usage of seasonal mosaics of SPOT4-VEGETATION (1 km, [111]), MODIS (500 m, [112]) and even radar data [113] from JERS-1 (L-band SAR, 100 m).This sensor is specifically suited to detect wetlands, as demonstrated by Whitcomb et al. [114].Bartalev et al. [111] applied unsupervised ISODATA classification to SPOT data, Sulla-Menashe et al. [112] and De Grandi et al. [113] a maximum likelihood supervised scheme to JERS-1 and MODIS, respectively.One major issue of SPOT-4 products for applications in the Arctic is their spatial limitation to areas south of 70 degrees latitude.The random forest approach, which also utilizes training data, has been applied by Whitcomb et al. [114] to L-band SAR data.This approach proved to be especially applicable to wetland mapping.
SAR-based land cover maps have until recently only been based on summer (unfrozen condition) acquisitions.Widhalm et al. [79] demonstrated that winter C-band backscatter reflects certain land cover characteristics, especially wetlands.The 1-km SAR data better represent wetland spatial distribution than global maps, which have been developed from MODIS or MERIS (500 m and 300 m, respectively).The SAR data provide mostly surface roughness and only to a limited amount of volume scattering in vegetation in tundra regions.Traditional global maps include only classes for flooded shrublands.There is, however, a much higher diversity in wetland types across the Arctic.A common feature is the low surface roughness with respect to C-band (5.6-cm wavelength).One cannot obtain different types of wetlands with this approach, but their presence can be detected (at least four-times more than in global maps).This method can also potentially be applied to higher spatial resolution C-band SAR data as, e.g., available from Sentinel-1.
An alternative, a rather coarse (74 km × 43 km footprint) passive microwave dataset was produced by Melentyev and Matelenok [115] for latitudinal discrimination of Siberian landscapes using AMSR-E brightness temperature.A consistent threshold classification on the basis of seven informative features is suggested.Eight landscape types, which also include the boreal biome, could be distinguished: wetland complexes, tundra, transition regions, tundra-forest, forest-steppes, steppes, deciduous and mixed forest and taiga.

The Arctic in Global Land Cover Maps
The Arctic plays an important role for the global climate.Sustained increases in air temperatures are warming the Arctic more than twice as fast compared to low/mid-latitudes [116].Rapid changes in the Arctic system feed into the global climate.The Arctic therefore needs to be addressed in its entirety rather than simply regionally.The impact of climate warming on permafrost and associated carbon fluxes is one of the issues addressed in current climate (land surface) modeling studies [3].Land cover is required for benchmarking and initializing Earth system models.Such datasets are also required for large-scale numerical permafrost modeling [1].A further potential application at this scale is up-scaling of carbon pools in permafrost areas.It has been shown in a range of studies that high resolution data can be used at the regional scale [25].Land cover classes need to reflect certain land cover types that are associated with a certain Soil Organic Carbon Content (SOCC) range.The thematic detail for this application is missing in global maps to date.An alternative approach that does not require land cover classification has been developed by Bartsch et al. [76].SOCC has been directly derived from C-band SAR backscatter.C-band SAR may thus provide complementary information to optical data on soil properties.Organic carbon content has a significant impact on the thermal properties of the soil, information required for permafrost modeling.
Global land cover maps are based on AVHRR ( [117,118] IGBP DISCover and UMD maps), MODIS [119,120], SPOT 4 VEGETATION ([121] GLC2000), ENVISAT MERIS ( [122,123] GlobCover and subsequent CCI maps) and recently also Landsat (e.g., Globland30 by the National Geomatics Center of China [124]).Existing land cover and seasonal datasets (from AVHRR) have been used to produce the ECOCLIMAP [125,126], a dataset developed for climate modeling purposes.The GlobCover and CCI maps are based on bi-monthly, seasonal and annual mosaics [122,123]; ECOCLIMAP on eight years of NDVI time series (see also Table 2).
Classes representing Plant Functional Types (PFTs) are required by Earth system models to define the vegetation characteristics in terms of, e.g., photosynthesis capacity, phenology and roughness [3].The spatial resolution has been shown to be a clear advantage of GlobCover compared to GLC2000 by Ottle et al. [3].GlobCover tends to have better distributions of tree cover, but may lead to overestimation of forest PFTs.Ottle et al. [3] suggested the use of additional information to address this issue.The opposite seems to be the case for the other maps.An assessment of the IGBP DisCover and MODIS land cover map over the West Siberian Lowlands demonstrated that at high latitudes, both underestimate deciduous needle-leaf forest in favor of woody savannas or open shrublands, respectively [127].The transformation to PFTs by Ottle et al. [3] using GlobCover led to an increase of bare soil in vegetated tundra areas in Siberia (except for Taymir) compared to a previous definition for the model ORCHIDEE.
The spatial resolution poses a major constraint for the applicability of coarse resolution data, such as AVHRR, MODIS and MERIS, in the Arctic.This is apparent and has been demonstrated for the water class [6,127].Insufficient thematic detail for reindeer pasture mapping has also been demonstrated for the GLC2000 by Rees and Danks [110].An option is the retrieval of the water fraction, which can be obtained by the fusion of datasets of different temporal and spatial resolution (e.g., [128] 5 km, AMSR-E and MODIS).Such approaches do, however, not provide information on lake objects as required for, e.g., thermokarst lake studies.
Global land cover maps are mostly produced by variants of unsupervised classification approaches.Table 2 provides details on the input data, method and validation of ECOCLIMAP-, GLC2000-and the MERIS-based land cover maps.They are shown in Figure 2 over the extent of the CAVM.Figures 3-5 provide a direct comparison of their classes with the CAVM.ECOCLIMAP contains significantly more global classes than the other maps, which use the FAO system.Their number is however similar in the Arctic domain.The actual thematic detail is lower for ECOCLIMAP, as it provides separate classes for North America and Asia for the same land cover type, but it is the only map that includes a tundra class.This class covers however only a very small proportion within the CAVM domain.It has been demonstrated that global land cover datasets disagree especially in the transition zone from boreal to tundra [129,130].Furthermore, the classes bare and shrub covered have been differently assigned [129].This has been demonstrated when compared to selected Landsat-based land cover maps.It has also been pointed out by Westermann et al. [1] that shrublands cannot be resolved in coarse resolution global maps.
Already a visual comparison of the CAVM with global maps demonstrates the limited thematic content of global maps in the Arctic (Figure 2).The classes available from the CAVM (which represent vegetation communities) show largely different spatial patterns than the classes in the global maps, as they differ significantly by their definition (Table 3).The CCI land cover product has received increasing attention in recent studies.It includes a high latitude dedicated 'mosses and lichens' class.The full name is 'closed to open (>15%) herbaceous vegetation (grassland, savannas or lichens/mosses)'.It is expected by users that mosses and lichens have a certain thickness and are cooling the ground [131].This class corresponds however largely to mountain complexes with cryptogam crusts over the CAVM domain in the CCI map.These are very thin and often show a large active layer thickness [68,71].A large proportion of this class also corresponds to dwarf shrub tundra, mostly of the graminoid type (see Figure 5).Lichens and mosses can be found within most land cover types in the high latitudes [100].The species composition and amount differ between communities.A certain lichen coverage is associated with land cover types that can be identified with Landsat.Nordberg and Allard [103] demonstrated that lichen coverage increases reflectance across all Landsat channels.The higher the reflectance, the higher the lichen coverage.Grazing reduces the lichen height and, thus, also reflectance [95].Large areas are affected by reindeer herding, as well as wild herds.
Thematic detail with respect to shrubs is highest for the ECOCLIMAP map.It distinguishes between closed and open shrublands.The closed shrublands correspond mostly to dwarf and low shrub tundra.The majority of shrub categories of the CAVM are however classified as open shrubland, which can be found in almost all CAVM classes (Figure 4) and is also the most common one with 37%.The GLC2000 also separates herbaceous from shrub cover over the Arctic.The most dominant class is 'sparse herbaceous or sparse shrub cover' (about 40%; Figure 3).Similar classes are included in the CCI land cover legend, but not mapped over the Arctic.This class named 'closed to open (>15%) (broadleaved or needle-leaved, evergreen or deciduous) shrubland (<5 m)' can be found in steppe zones and covers large areas of the Tibetan plateau, which is also a permafrost-affected region.Moreover, Western Greenland is inconsistent with the Arctic class pattern in the CCI product.It is the only region in the Arctic mapped as grassland.The majority (approximately 33%) is mapped as sparse vegetation (with less than 15% vegetation coverage).Although dominant classes are similar in quantity across the different maps, they overlap only partially.Water surfaces in all global maps are better resolved than in the CAVM.
Table 3 lists classes of the global maps, as well as CAVM within the extent of the multi-purpose maps described in Section 3.This exemplifies the large difference of the type of classes available in these maps.The dominating class (more than 20% coverage) in the CAVM differs among the three locations.The Usa basin is dominated by low shrubs, Kuparuk by tussocks, sedges and shrubs and the Lena Delta by graminoids, shrubs and forbs and wetlands with sedges.The CAVM agrees well over Kuparuk, as this has been the initial study site of the CAVM team.The dominating land cover also matches for the other two sites.The differences between the sites are not well represented in the global maps.The CCI land cover provides only one type of dominating land cover across the sites.ECOCLIMAP comes closest to the CAVM, but addresses the classes rather by vegetation structure, than with detail on actual vegetation communities.The GLC2000 reflects the transitional nature of the Usa basin.None of the global map represents the thematic content of the Landsat-derived regional maps (Table 1).

Challenges and Promising Approaches
There is a range of comparison studies of different methods for land cover classifications in high latitudes.These are usually limited to small areas.Land cover change is mostly addressed on the local scale.Classifications of land cover types other than water surfaces have only rarely been used for change detection in Arctic environments, except for identifying disturbances, such as air pollution or grazing effects.Indexes are commonly used to address land cover change issues.Classifications are however of benefit for identifying the tundra-taiga transition zone, which is of special interest to climate change impact studies.
Radar (SAR)-based large-scale maps often use supervised approaches, optical mostly unsupervised.Unsupervised classifications are also the basis for national to global mapping.Single purpose-driven maps are prepared with a priori information applying supervised classification approaches, such as data on the impact of air pollution and reindeer pasture inventories, or the maps are made manually, like the CAVM by Walker et al. [5].The latter is widely used and based on AVHRR data.Object-based Image Analysis (OBIA) is a frequently-used approach classifying high resolution satellite data in general, but has little importance for applications in the high latitudes.That might be due to the fractional nature of the landscape and limited occurrence of artificial objects.Settlements are only rarely studied with land cover maps.This might however become more important in the future due to an increasing need for climate change impact assessments on local communities.
Different require to some extent varying spatial and thematic detail (Figure 6).Habitat studies have mostly been made with respect to mammals.Medium spatial resolution maps, which can be supplied by Landsat, are widely used.Mostly six to eight classes are derived.The required thematic detail increases with decreasing spatial resolution in the case of soil carbon, carbon flux, permafrost surface features and water cycle applications.Medium resolution data studies usually cover larger areas and therefore include a higher diversity of relevant landscape units.This implies that a higher thematic content is required from global maps than currently available.For the application of a resolution of 20-30 m (as available from, e.g., Landsat and also Sentinel-2) on average seven classes are common for regional applications.The CAVM also provides six to nine none-water classes for specific regions (see Table 3).However, twelve distinct classes are already required to represent the three featured multi-purpose maps together across the Arctic.Past studies on indirect assessment of permafrost subsurface features partially use a rather low number of classes, which are grouped from higher level detail maps.Even in the case of ten (surficial material) classes [73], a lower number is eventually relevant for permafrost-related classes, as they represented also non-permafrost types.This demonstrates that the potential of satellite data to conclude on sub-ground thermal properties is limited.Only general classes can be derived, and applicability has been proven in permafrost transition zones only.The relevant types are peatlands (with differentiation by vegetation cover) [73] and ecotypes that distinguish between low and tall shrubs, as well as their characteristic moss and organic layers [75].This demand for details on shrubs coincides with statements in studies for numerical permafrost modeling [1,80].They are important for the modeling of snow distribution (and subsequent impact on ground thermal conditions).A height threshold of 0.5 m is suggested for permafrost modeling purposes [106].A range of recent publications on the use of polarimetric SAR data [31][32][33] has shown the potential of such data for shrub mapping, but the applicability remains limited since SAR data are only locally acquired in fully-polarimetric mode.Current acquisition strategies of SAR missions only provide equal and dual polarized data.Shrub mapping in general needs to consider phenological stages [20].
There is no generally agreed upon classification scheme for the northern high latitudes.Walker et al. [45] have also pointed this out as an issue for in situ mapping across the Arctic recently.Classes of regional maps for multiple purposes, such as up-scaling of carbon pools and fluxes, represent shrub physiognomy (dwarf species or higher) and wetness patterns.Shrubs are included as a class in global land cover maps, but are either not present in their Arctic parts or patterns differ among them.Wetlands or wetness descriptions are in general under-represented in global maps.They can have differing vegetation coverage.The same applies to lichens.They are omnipresent.Vegetation classes need to be reviewed and described for their associated lichens.The class 'lichens and mosses' in the CCI land cover can be interpreted as either cryptogam crust or graminoid dwarf shrub tundra.When lichen species are dominant and not covered by the canopy of higher vegetation, they are part of cryptogam crusts or develop within vegetation communities with graminoids.This should be labeled accordingly to avoid confusion with fruticose lichen heaths in boreal environments.Alternatively, lichens can be addressed by a vegetation community approach or by quantification of their bias on reflectance.A good example for thematic content is the scheme developed by Gould et al. [108] for classification of Landsat data for Canada.Several graminoid and growth from dependent shrub classes are included.They specifically also address PFTs.In total, 17 PFTs are suggested for north of the treeline.The suitability of the suggested scheme for high latitudes, as well as the classification for Northern Alaska by Muller et al. [46] has been recently pointed out in a review by Wullschleger et al. [137].Classes for high Arctic tundra are also suggested by Langford et al. [138] (graminoid types, lichens, mosses and forb).They are however derived using high spatial resolution (WorldView) data.The heterogeneous distribution of the PFTs can be captured at the local scale only.Time series representing phenology have been applied in all three cases [46,108,138].
Strategies for classifications on global and regional level differ regarding the type of input data.Acquisitions from several points in time are used also regionally, but seasonality is in general not considered.Global classification schemes make use of mosaics of different time intervals and comply with international classification schemes.These schemes do not reflect the diversity of landscape types in high latitudes, which are required in permafrost regions for common applications.Since global mapping efforts need to comply with internationally-agreed upon standards, only a substitution over the high latitudes, which is to some extent compatible with the global maps, can be made.Existing classes need to be re-named and subdivided.The same sub-class may have to be assigned to different parent classes eventually.This is, e.g., the case for wetlands.Alternatively, they could be provided as a separate information layer for the purpose of up-scaling of fluxes.
Global maps are in general based on optical data.Only SAR-based lake mapping has been so far considered in one case (CCI Landcover, [139]).SAR can, however, deliver similar thematic information content (e.g., [32,73]), especially when combinations of different polarizations are used.It provides additional information on surface roughness [76], moisture [78] and vegetation structure [140,141].These parameters reflect information on wetlands [79], shrubs and soil properties which is required for high latitude applications.The thickness of mosses represents to some extent soil organic carbon content, and it relates to wetlands.C-band SAR has been shown to provide information on both in tundra areas.L-band SAR is also suitable for wetland mapping.A number of L-band maps already exist with a forest or wetland mapping focus for Northern Eurasia [113] and America [114].They may need to be reviewed for consistency and harmonized, as well as gap filled across the Arctic.Such data could complement the traditional optical data-based retrieval methods.So far, mostly unfrozen period SAR data have been used.Recent studies demonstrated the potential of winter acquisitions [76,79,140,141] for land surface characterization.The advantage is the exclusion of pixel to pixel variation, the difficult to model backscatter contributions due to differing dielectric properties when unfrozen [76].Passive microwave data, such as AMSR-E, provide coarse resolution information on the water fraction [128], but also different land cover classes [115], including wetland complexes.Their resolution is similar to climate models, and wetland information is of need for land surface modeling in high latitudes [36].Such maps would be therefore useful, but not applicable beyond.Medium resolution satellite data may, however, provide sufficient spatial resolution for soil properties up-scaling over peatlands [52].On the local scale, X-band SAR, as well as UAVs become more and more important (e.g., [142][143][144]).Such investigations can complement regional mapping efforts in a nested approach in order to reflect the fragmented nature of permafrost regions and to identify the subtle changes at the land surface.
The majority of publications reviewed in this paper deal with retrieval techniques.Only about 40 of them make use of the classification results for, e.g., habitat mapping, soil carbon and flux upscaling or periglacial features detection (Figure 6).Most local and regional maps are not shared on online repositories.Exceptions are the CAVM [5], the Lena Delta map [40,145] and also the Kuparuk basin maps.This fosters their usage, but usually one of their co-authors is directly involved in new applications in case of regional maps.The CAVM has been used beyond the original development team due to its circumpolar coverage.

Conclusions
Global maps need to be conform with international classification schemes, which are of limited applicability in the Arctic.They might be to some extent suitable for treeline monitoring, but not for up-scaling or modeling to the north.New classifications that serve the needs at high latitudes need to be developed separately, but circumpolar and compatible with global maps.Existing methodologies for SAR data could be exploited to widen the thematic detail available from such maps.Winter, as well as summer acquisitions from C-band missions (e.g., Sentinel-1 and Radarsat) might be suitable to complement traditional classifications with C-band based approaches.A spatial resolution around 30 m has been shown suitable for a range of applications.This implies that the current Landsat-8, as well as Sentinel-2 missions would be adequate as input data.
Of special interest are shrubs and wetlands, as well as the occurrence of mosses and lichens.The identification of shrubs and their height is of interest to permafrost modeling, as this can serve as a proxy for snow distribution.The 'shrub' class is available in global land cover products, but it is not mapped at all, or not consistently, at high latitudes.Wetness, as well as moss and lichen coverage are in general sub-features across all traditional land cover classes and, therefore, mostly not used as separate classes in local to regional maps.The predominance of shrubs, tussocks and sedges provides the background information for purpose-driven classification schemes.This should be also considered for circumpolar mapping.

Figure 1 .
Figure 1.Location of the discussed sites with multi-purpose land cover maps from Landsat data.More details are provided in Table1.

Figure 2 .
Figure 2. CAVM [5] and global land cover maps for the CAVM domain.For a description, see Table2.
Figure 2. CAVM [5] and global land cover maps for the CAVM domain.For a description, see Table2.

Table 1 .
Non-water classes of selected multi-purpose land cover classifications based on Landsat.The dominating land cover class is underlined.For the location, see Figure1.

Table 2 .
Details of selected global land cover maps and CAVM.The number of classes refers to none-water/ice classes >1% within CAVM (total).

Table 3 .
Non-water classes with more than 1% coverage of global land cover maps and CAVM within regions of multi-purpose land cover classifications.Dominating classes (more than 20%) are underlined.For the location, see Figure1and specifications Table1.