Next Article in Journal
Uniform and Competency-Based 3D Keypoint Detection for Coarse Registration of Point Clouds with Homogeneous Structure
Next Article in Special Issue
IoT Enabled Deep Learning Based Framework for Multiple Object Detection in Remote Sensing Images
Previous Article in Journal
Dual-View Stereovision-Guided Automatic Inspection System for Overhead Transmission Line Corridor
Previous Article in Special Issue
PBL Height Retrievals at a Coastal Site Using Multi-Instrument Profiling Methods
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Mapping Two Decades of New York State Forest Aboveground Biomass Change Using Remote Sensing

1
Department of Environmental Resources Engineering, State University of New York College of Environmental Science and Forestry (ESF), Syracuse, NY 13210, USA
2
CORE and Department of Electrical and Computer Engineering, Memorial University of Newfoundland, St. John’s, NL A1B 3X5, Canada
3
Sustainable Resources Management, State University of New York College of Environmental Science and Forestry (SUNY-ESF), Syracuse, NY 13210, USA
4
Graduate Program in Environmental Science, State University of New York College of Environmental Science and Forestry (SUNY-ESF), Syracuse, NY 13210, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(16), 4097; https://doi.org/10.3390/rs14164097
Submission received: 29 June 2022 / Revised: 24 July 2022 / Accepted: 19 August 2022 / Published: 21 August 2022
(This article belongs to the Special Issue New Developments in Remote Sensing for the Environment)

Abstract

:
Forest aboveground biomass (AGB) provides valuable information about the carbon cycle, carbon sink monitoring, and understanding of climate change factors. Remote sensing data coupled with machine learning models have been increasingly used for forest AGB estimation over local and regional extents. Landsat series provide a 50-year data archive, which is a valuable source for historical mapping over large areas. As such, this paper proposed a machine learning-based workflow for historical AGB estimation and its change analysis from 2001 to 2019 for the New York State’s forests using Landsat historical imagery, airborne LiDAR, and forest plot data. As the object-based image analysis (OBIA) is able to incorporate spectral, contextual, and textural features into the regression model, the proposed method utilizes an OBIA approach and a random forest (RF) regression model implemented on the Google Earth Engine (GEE) cloud computing platform. Results demonstrated that there is a considerable decrease of 983.79 × 106 Mg/ha in the AGB of deciduous forests from 2001 to 2006, followed by an increase of 618.28 × 106 Mg/ha from 2006 to 2011, continued with an increase of 229.12 × 106 Mg/ha of deciduous forests from 2011–2016. Finally, the results demonstrated a slight change in AGB from 2016 to 2019. The transferability of the proposed framework provides a practical solution for monitoring forests in other states or even on a national scale.

Graphical Abstract

1. Introduction

Forest covers about 31 percent of the world’s total land area, the largest natural land-based ecosystem on Earth [1]. Since 1990, the world has lost an estimated 178 million hectares of forest [1]. The rate of net forest loss decreased from 1990 to 2020 due to the reduction of deforestation in some countries and the increase in forest area in others on account of afforestation and the natural expansion of forests [1]. In spite of this, forest loss continues to be of concern, particularly given increasing rates of carbon emissions across the globe and the ability of forest ecosystems to absorb carbon emissions. Forests absorb carbon dioxide (CO2) from the atmosphere through the photosynthesis process. This absorbed carbon is integrated into various biophysical aspects of trees in the form of biomass, both dead and alive, including branches, trunks, and leaves [2]. Thus, providing accurate aboveground biomass (AGB) estimations is of paramount importance for sustainable forest management, carbon accounting, and climate change monitoring [3]. In particular, sustainable forest management contributes to the ecological, economical, and sociocultural aspects of the environment and requires accurate, consistent, and timely forest monitoring [4].
Large-scale AGB change mapping faces some limitations in both time and space. From the time perspective, there has been a vital demand for AGB mapping and carbon stock monitoring throughout the years [5,6]. On the one hand, traditional methods are destructive, labor-intensive, costly, and time-consuming. On the other hand, covering a large region of interest for national/state-wide AGB mapping using remote sensing techniques requires the processing of big geospatial data. Data collection over time and space raises challenges in big data processing, such as volume, variety, and velocity, which are the characteristics of big data [7,8,9]. A huge volume of big data comprises issues regarding storage and analysis. Variety is referred as various types and formats of big data. The velocity deals with the unprecedented speed of big geospatial data coming from different sources [9]. Earth observation (EO) data has been extensively used for quantifying forest AGB. While the spectral reflectance of optical and backscatter intensity of synthetic aperture radar (SAR) is used to “indirectly” measure forest vertical structure, lidar, interferometric SAR (InSAR), and photogrammetric 3D products give “direct” measurements of forest vertical structure. Combining direct and/or indirect EO-based forest structure measurements with allometric equations provide information for AGB change monitoring over time and in large areas [10]. Thanks to the freely available remote sensing data of the Landsat mission and cloud computing platforms, such as the Google Earth Engine (GEE) [10], large-scale AGB mapping can be conducted to monitor forest AGB and carbon stock in a timely manner. Recently, cloud computing platforms (e.g., GEE, Amazon Web Services, and Microsoft Azure) paved the road for processing and implementing remote sensing data [9].
Due to the capabilities of GEE for handling geo big data, such as freely available satellite imagery and built-in machine learning models [9,11], the historical AGB mapping over large areas has been facilitated. The AWS and Microsoft Azure platforms also contain machine learning, artificial intelligence services, and several types of satellite imagery. However, GEE provides an access to a comprehensive archive of Landsat imagery since 1972, while AWS and Azure host Landsat 8 OLI datasets [9]. In particular, free aerial imagery, optical, SAR, and ready-to-use data, which are highly important for environmental change monitoring, can be accessed and processed by the GEE cloud platform. Traditional demand to download and store terabytes of satellite imagery in big data analysis can be overcome by GEE infrastructure. The availability of machine learning models in GEE, such as classification and regression tree (CART), random forest (RF), and the support vector machine (SVM), provides great tools to solve different classification and regression problems. Specifically, GEE works based on the parallelizing processes that takes place in the CPUs of Google’s data centers, which remarkably reduces remote sensing big data computational time [9]. It should be noted that complex tasks in GEE are required to be paid using the online or batch option. While some might be interested in upgrading their local computers to implement complicated tasks, using a cloud computing platform such as GEE could be a safer choice for the batch processing and storing of a huge volume of data.
In recent years, light detection and ranging (LiDAR), optical, and SAR datasets have been extensively used for forest AGB estimation [12,13,14,15,16,17,18]. Landsat data with nearly five decades of continuous imagery, started in 1972, is one of the most valuable sources for historical environmental monitoring [10]. Despite the increasing number of EO satellites with high-spatial and temporal resolutions, Landsat plays a key role in the historical environmental mapping [10]. Landsat provides 30 m spatial and 16 days of temporal resolution, which is of paramount importance for change detection applications, mainly historical AGB mapping [12]. As such, forest AGB mapping using Landsat archive enables researchers to obtain enough information for carbon stock changes and monitor climate change patterns over years. A problem with the indirect measurements of the forest structure and biomass estimation using optical or SAR data is the saturation issue in high-biomass areas. Studies show that backscatter or spectral reflectance (or its derived indices such as NDVI) increases with increasing AGB for low-to-medium AGB, but gradually loses its sensitivity for higher AGB and asymptotes to a saturation level ([6,17]). LiDAR data, on the other hand, directly measures the forest vertical structure and canopy height and thus are not affected by saturation problems in forest AGB estimation.
AGB mapping using remote sensing can be implemented at two different processing units, pixel-based and object-based image analysis [19]. Object-based image analysis (OBIA) has shown unprecedentedly improved results, especially for AGB mapping [20,21]. This is because extracting textural and morphological (shape, extend, size) features add contextual information to spectral features extracted using pixel-based approaches [19]. Moreover, the mixed pixel issue is solved by applying OBIA through clustering objects based on spectral similarities [19].
The primary objective of this study is to develop an OBIA framework for state-wide historical AGB mapping. So far, some studies have been focused on object-based AGB mapping on a local extent [20,21,22,23,24]. Hirata et al. (2018) [20] used OBIA for AGB estimation in seasonal tropical forests in Cambodia using multiple linear regression and LiDAR data as training. Another object-based study has been done over the mountain tropical Brazilian forest using RF as a machine learning algorithm, terrain, and environmental data as input variables [21]. Chen et al. (2022) [23] have used a combination of spaceborne LiDAR data, optical, and SAR data for estimating the AGB of a protected temperate forest in China using an object-based approach. Since these studies have been conducted over tropical and temperate forests and on a local scale using single-date satellite imagery, this research can contribute to AGB and carbon change monitoring over large areas and more than 20 years. Second, this study aims to use the developed OBIA for the AGB mapping of the entire New York State temperate forests from 2001 to 2019. To achieve this goal, Landsat 5 TM, Landsat 7 ETM, and Landsat 8 OLI images, along with environmental and topographical data, have been processed using the RF regression model in the GEE cloud platform. The developed framework can be transferred to other northeastern states for AGB estimation.

2. Study Area and Datasets

2.1. Study Area

The study area is the entire New York State, United States, which covers an approximate area of 141,297 km2 (Figure 1). About 61% of New York State is covered by forest, which is about 7.5 million ha of land area across the state [25]. Publicly owned forest land covers at least 1.5 million ha, while privately-owned forest land covers 5.8 million ha, approximately 76% of forest land. More specifically, maple, beech, and birch are the most common forest types, covering 53% of forestland area. New York forestland includes a broad spectrum of species resource in the region, since it contains 94 tree species and 55 forest types [26]. As such, New York’s forests play a crucial role in economic resources, wildlife habitat, and recreational opportunities, to name a few.

2.2. Forest Inventory and Analysis (FIA) Plots

The United States Forest Service (USFS) forest inventory and analysis (FIA) program is the nation’s forest census [26]. The FIA program has created a database that can be used to generate a set of reference data consisting of forest inventory information attached to georeferenced plots [27]. These plot data are relatively current, collected in a standardized fashion, and distributed relatively uniformly across the continental United States, Hawaii, US territories, and parts of Alaska [27]. FIA program reports information on the species, size, trees’ health, total tree growth, and forest land ownership [26]. Annual FIA inventory plots in NYS are available since 2000, providing a valuable database for training and the precise assessment of AGB maps, especially for large-scale historical monitoring. Thus, this study used these plots to produce and evaluate AGB maps at a state-wide scale. The long history of FIA data provides trend information to resource managers, policymakers, investors, and the public through a system of annual resource inventory that covers both public and private forest lands across the United States [28]. Species-specific allometric equations developed by Jenkins et al. (2003) [29] were used to calculate the AGB of FIA plots.
FIA plots consist of a 0.067 ha (0.17 ac) plot cluster distributed over approximately 0.405 ha (1 ac) [27]. They are collected at a standard sampling intensity of 1 plot per 2428 ha (6000 ac), as in NY and 5198 plots in NY after a full cycle [27]. In each full cycle, which is a 5-year cycle, one fifth of the plots are measured each year, resulting in a full cycle of plots to be completed every 5 years.
The standard FIA plots consist of four circular 24-foot radius (approximately 0.017 ha) subplots and four circular 6.8-foot radius microplots (approximately 0.0013 ha) [30]. Trees 5 inches and greater in diameter were measured in subplots, while trees smaller than 5 inches were measured in microplots. Table 1 lists the statistical characteristics of calculated AGB values of FIA plots from 2002 to 2019.

2.3. Remote Sensing Data

2.3.1. Airborne LiDAR

Figure 1 demonstrates four pilot areas with available airborne LiDAR collected by New York State GIS program office [27] (NYSGPO). The primary purpose of this initial LiDAR data collection was to produce a 1 m cell size high-accuracy 3D digital elevation model (DEM) for conservation planning, design, research, floodplain mapping, dam safety assessments, and elevation modeling [31]. In this study, three counties (i.e., Ulster, Dutchess, and Orange), Warren Washington Essex, Allegany Steuben, and Cayuga Oswego LiDAR datasets were utilized for training/testing purposes, which were collected in 2014, 2015, 2016, and 2018, respectively (Figure 1).
Airborne LiDAR data were collected over four pilot areas in NYS by the New York State GIS program office (NYSGPO) [31]. First, LiDAR data was collected in three counties of Ulster, Dutchess, and Orange for an extent of 7371.11 km2 in 2014 [32] using the Leica ALS70-HP LiDAR system. The nominal pulse spacing for this project was no greater than 0.7 m. The second pilot area, 6278.13 km2, spans five counties, including Warren, Washington, Essex, Hamilton, and Franklin [33], using the ALS70 SN#7123 at a max flying height of 3500 m AGL. This dataset was acquired in 2015, in ground conditions of water at normal levels, no annual inundation, no snow, and leaf off season with nominal pulse spacing of 0.5557 m. LiDAR data for Allegany and Steuben counties, covering approximately 3408.42 km2, was collected in 2016 at a nominal pulse spacing of 0.7 m while no snow was on the ground and rivers were at or below normal levels [34]. Cayuga and Oswego County LiDAR data was collected in 2018 at a nominal pulse spacing of 0.7 m in 2018 [35]. The ground condition of this dataset was at no snow and rivers were at or below normal levels. The data was formatted according to the USNG Tile naming convention, with each tile covering an area of 1500 m by 1500 m.

2.3.2. Landsat Imagery

Since 1972, the Landsat mission, a joint program of the United States Geological Survey (USGS) and the National Aeronautics and Space Administration (NASA), has provided uninterrupted imagery of the Earth’s surface [10]. These continuous images are available at a 30 m spatial resolution and 16 days of revisit time [36]. Landsat datasets are produced by USGS in three categories, including Tier1, Tier2, and real-time (RT) [37]. Tier 1 data that meets geometric and radiometric quality requirements were used in this study. As mentioned earlier, this study has been conducted in GEE, where the Landsat satellite images are freely available [9]. First, Landsat images, with less than 5% cloud coverage, were collected for July–August each year. Then a median function was applied to each pixel of the multitemporal set to create the composite image (single image) of the entire NYS for each year. Landsat 5 TM surface reflectance (SR), Landsat 7 ETM+ SR, and Landsat 8 OLI/TIRS SR imagery was used for AGB mapping over the time span of 2000–2011, 2012, and 2013–2020, respectively. To solve the Landsat 7 striping issue, the SLC Gap-Filled Products Phase One Methodology was used. Generally, this technique uses a local linear histogram matching method to fill the scan gap with previously acquired Landsat 7 imagery [38].
First, a Landsat-based detection of trends in disturbance and recovery (LandTrendr) function known as “buildSRcollection” has been used to produce a cloud-masked medoid composite of the Landsat SR TM-equivalent bands [39] for July–August of each year. The output of this step is six spectral bands, including blue, green, red, near-infrared (NIR), shortwave infrared-1 (SWIR1), and shortwave infrared-2 (SWIR2) bands. Then, 13 vegetation indices: normalized difference vegetation index (NDVI), enhanced vegetation index (EVI), ratio vegetation index (RVI), difference vegetation index (DVI), soil adjusted vegetation index (SAVI), normalized green–red difference index (NGRDI), wide dynamic range vegetation index (WDRVI), excess green index (ExG), Chlorophyll Index–green (CI green), visible atmospherically resistant index (vari), green leaf index (GLI), normalized burn ration (NBR), normalized difference moisture index (NDMI), and four tasseled cap coefficients: brightness, wetness, greenness, and angle were extracted using spectral bands. LandTrendr-GEE code [40] was used to extract LandTrendr metrics, such as fit-to-vertex (FTV) data (i.e., LandTrendr fitted brightness, LandTrendr fitted greenness, LandTrendr fitted wetness, and LandTrendr fitted normalized burn ratio) corresponding to LiDAR data year, delta matrices (difference between each year’s FTV and the previous year), magnitude, and disturbance matrices since last disturbance.

2.4. Climate and Topographic Data

Climate and topographic datasets were also included as predictors in the RF model for historical AGB mapping. Climate data were downloaded from the parameter-elevation regressions on independent slopes model (PRISNM) climate group at Oregon State University [41] at 30-year normals. The PRISM data are models while the term “Normals” refers to them being 30 year averages. These datasets cover the period 1981–2020, which contains the average annual conditions over the most recent three full decades [42]. For this study, mean annual precipitation, minimum annual temperature, and maximum annual temperature were used in addition to other input variables. Moreover, a 30 m topographic wetness index (TWI) surface was also added to predictors. The topographic data originated from a 30 m DEM downloaded using the terrainr [43] package, which provides ease-of-use functions for accessing DEMs from the United States Geological Survey National Map [44]. Slope and upslope were also extracted from DEM. Both slope and upslope area were computed from the original 30 m DEM. TWI was then calculated as follows:
TWI = log ( upslope   area tan ( slope + 0 . 0001 )   )                            
Note that 0.0001 was added to the slope to prevent a 0 in the denominator.
DEM, aspect, and slope were included in input predictors as topographic data. These datasets were downloaded from Shuttle Radar Topography Mission (SRTM) products at 30 m resolution.

3. Methods

Our state-wide AGB mapping approach comprises three main steps, as depicted in Figure 2. First, airborne LiDAR datasets (Section 2.1) were used to produce LiDAR AGB rasters for each pilot area for 2014, 2015, 2016, and 2018. Second, Landsat imagery were preprocessed (e.g., cloud masking) in GEE to extract spectral bands and vegetation indices. In addition, Landsat-based detection of trends in disturbance and recovery (LandTrendr) algorithm [45,46] has been applied to compute LandTrendr metrics. Thus, spectral bands, vegetation indices, LandTrendr, climate, and topographical data were stacked as input predictors for further analysis. In the next step, the segmentation has been applied using the simple noniterative clustering (SNIC) technique in GEE to extract objects for OBIA. Then, stratified random sampling of generated LiDAR AGB rasters was used to split the reference data for training/testing purposes. Since pilot areas belong to different years, Landsat data for each year have been used separately. For each pilot area, 6000 objects were used as reference samples. Then, all samples for all pilot areas were merged to train the RF model for the whole NYS. Finally, a stacked layer of input predictors for each year (2001–2019) was used to provide the historical AGB maps using the OBIA approach. For historical state-wide AGB mapping, a LiDAR-based training method, which was developed by Hudak et al. (2020) [12], has been employed to produce the historical AGB maps of the entire NYS from 2001 to 2019.

3.1. Airborne LiDAR AGB Raster

One of the challenges in applying machine learning models is the need for enough training samples. Having access to private field measurements, such as FIA plots, is very limited due to the legal concerns. In addition, even with access to FIA plots, their sampling density is relatively sparse both in space and time (repetition in each year) [47]. From the spacing perspective, there is only one plot for each 25 square kilometers. From the time perspective, not all plots are measured each year. In fact, the legislative mandates require measurements of 10% to 20% of plots in each State, each year [47]. To leverage training samples, several studies tried to produce AGB rasters using airborne LiDAR predictors and use these airborne LiDAR-derived AGB rasters as a reference dataset [12,20]. We adapted this approach as described in Hudack et al. (2020). We first generated airborne LiDAR AGB for 4 pilot areas (Figure 1) using an RF model. We then used a stratified random sampling method to create a training/testing split. For stratified random sampling, samples were sorted from 0 to maximum AGB with 2 Mg/habins and 200 objects were randomly extracted from each bin. For OBIA, objects’ boundaries were overlaid on airborne LiDAR-produced AGB maps to calculate the mean AGB within each object. When a bin had less than 200 objects, one-half of the objects were randomly selected [12].
The procedure of generating airborne LiDAR AGB rasters started with row-wise observation filtering, deleting observations that indicate poor alignment between FIA and airborne LiDAR data at a plot. For instance, at this stage, observations with missing values, plots with 0 Mg/ha AGB and a maximum LiDAR height above 10 m, and plots with 500 Mg/ha AGB and a maximum LiDAR height below 10 m were deleted. Second, a set of predictors containing airborne LiDAR height, intensity, environmental, topographical, and tax parcel codes (cadastral information related to land-use and management) was created to train the model. Then, the final set of predictors was split into a training set (containing 70% of the data) and a holdout test set (containing the other 30% of the data). It is worth mentioning that the training set is used for all model tuning and validation stages, while the test set is used to assess the performance of the final model. Finally, the ranger package in R was used to implement an RF regression model to produce airborne LiDAR AGB rasters for each pilot area. A grid search approach was used to determine values for model hyperparameters. For each combination of values, model performance was assessed using the training set and 5-fold cross validation. Parameter combinations are then ranked by mean RMSE. The best performing ranges of each parameter are then used to build another grid, this time using a more constrained set of values. This process is repeated until a relatively steady set of parameters are identified; while there is no strict stopping rule used, the best models at the final tuning step are typically separated by less than 1% RMSE. It is certainly a subjective process. However, we tuned until there was no point in refining the hyper parameter values any further. Here is an example with the “mtry” parameter: First, tune: values 1–5, winner is 5. Second, tune: values 1–10, winner is 10. Third, tune: values 8–15, winner is 13. Stop there. We have tuned be smallest increment (1, it does not make sense to tune mtry by fractions) and found the winner. This implementation provides the following hyperparameters to tune:
  • num.trees: The number of trees to aggregate. Values evaluated span 50–5000.
  • mtry: Number of variables to split at in each node. The initial grid evaluates using 5–40% of variables at each split (for our current dataset, 3–40); our end model typically uses 40–50% of variables at each split.
  • min.node.size: Minimum node size (number of observations in the terminal nodes of each tree).
    • Higher values result in less complex trees (which can reduce overfitting with noisy predictors). Our initial grid evaluates values from 1 to 10, while our end model typically uses a value around 6 (with the default for regression being 5).
  • replace: Boolean: Take samples with replacement? Tends to be TRUE.
  • sample.fraction: What fraction of observations to sample. Our initial grid evaluates values from
    • 0.33 to 0.8; this parameter tends towards either 0.2 or 0.8.
  • splitrule: Whether to use maximally selected rank statistics (maxstat) or estimated response variances (variance) as a variable selection splitting rule.
It should be noted that in the process of generating airborne LiDAR data, inevitably, errors are included in the final predicted model. Thus, although airborne LiDAR AGB rasters provide sufficient training samples, the potential propagation of errors from these reference datasets into our model predictions should be considered as well.

3.2. Object-Based Feature Extraction

An object-based approach has shown promising results in image analysis and image classification applications [19]. However, its potential has not been explored completely in forest AGB estimation. Thus, in this study, the OBIA technique, along with the RF regression model, was used to investigate the ability of OBIA for historical forest AGB mapping. OBIA approach initially segments pixels into objects based on their spectral similarity [48]. OBIA mitigates the “mixed pixels” issue existing in a pixel-based approach by categorizing similar pixels (canopy covers) into objects [19]. Among various segmentation techniques, the simple noniterative clustering (SNIC) algorithm in GEE [48] was used to cluster pixels of Landsat imagery in each year. The SNIC algorithm works by initializing centroids (superpixels) with pixels chosen in the image [49]. It selects the pixels one-by-one from the queue and assigns them to a superpixel until no more pixels remain unassigned. The next pixel to be assigned to a superpixel is chosen by selecting the pixel with the lowest metric value to any superpixel [50]. There are some parameters such as size, compactness, connectivity, neighborhoodSize, and seeds that need to be determined. In this research, a trial-and-error approach was taken to select the best value for each parameter. It is worth mentioning that segments vary through time because biomass changes by the time. First, SNIC segmentation was applied to Landsat imagery for each year separately to create the objects. The values were selected based on the shape of the objects and the RMSE of the RF model in an iterative process. In each iteration, training/testing samples from LiDAR AGB rasters were calculated based on produced objects using different parameter values. Then, the RF model was run to evaluate the performance of the model using the RMSE. The best RMSE and objects’ shape were considered to choose the wining values for SNIC segmentation. Finalized object boundaries were used to generate training/testing samples from LiDAR AGB rasters. Finally, mean, variance, angular second moment (ASM), contrast, entropy, and homogeneity, known as gray level co-occurrence matrix (GLCM) features, within each object have been calculated to create the input set of predictors.
Figure 3 shows a true color composite of Landsat 5 imagery and the result of segmentation by applying the SNIC algorithm using different values for seeds. According to Figure 3b,c, changing the seeds value has changed the object boundaries and segments in (b) are closer to the boundaries of objects on the ground.

3.3. Map Accuracy Assessment Using FIA Plots

Because of the limitations in type, magnitude, frequency, and the location of errors that exist in geospatial data assessments, and due to the uncertainties regarding the spatial support between modeled and reference datasets, another approach is suggested to be utilized here. The adopted map accuracy assessment protocol was developed by [27]. It has the potential to overcome the mentioned limitations. Additionally, the uncertainties about spatial support between predicted and measured datasets were considered through accounting for the mismatches in spatial support between the two datasets. First, their approach overcomes the limitations in type, magnitude, frequency, and the location of errors. Second, they consider mismatches in spatial support between modeled and reference datasets, and account for their uncertainties. Although this approach is fundamentally an individual pixel-based comparison between AGB predictions and FIA plot estimates, it can be used for validating object-based predictions. It should be noted that the smallest sample unit of OBIA is inherently a pixel. In an object-based approach, we are dealing with pixels grouped as an object by their reflectance similarity. Thus, it is highly recommended to use sensors with high spatial resolutions for generating object boundaries.
This protocol consists of four types of assessments that incorporate both graphical and geographical visualizations and both qualitative and quantitative measures of agreement.
Assessment 1—Examining data distribution (at several scales):
An empirical cumulative distribution function (ecdf) is a valuable tool for comparing the distributions of modeled and reference datasets qualitatively. It can identify the differences in data distribution such as 0′s, max values, and missing range of values. The Kolmogorov–Smirnov (KS) statistic quantifies the agreement between two dataset’s distribution in terms of the maximum difference in their empirical distribution [27]. The KS statistic is defined as:
D KS = max   | F ( x ) G ( x ) |
where F ( x ) and G ( x ) are the cumulative distribution functions of reference data and the estimated values.
Assessment 2—Examining overall agreement of area estimates:
In this scenario, a comparison of modeled estimates to measurements from reference datasets is conducted using several types of relative errors. A scatter plot of modeled vs. plotted biomass is used to find systematic or bias and unsystematic or random differences.
The geometric mean functional relationship (GMFR) regression line is a symmetric regression model that can be used to describe the relationship between modeled and reference datasets.
We used several agreement statistics representing systematic and random differences between the two datasets (i.e., model and FIA plots). Most of these statistics are explained in Riemann et al. (2010). The statistics include R2 value, root mean square error (RMSE), mean bias error (MBE), and mean absolute error (MAE). In addition, the agreement coefficient (AC), developed by [51], which is a systematic, symmetric, bounded, and a nondimensional measure of agreement and is defined as follows:
AC = 1 S S D S P O D
SSD is:
SSD = i = 1 n ( X i Y i ) 2
SPOD is:
SPOD = i = 1 n ( | X _ Y _ | + | X i X _ | ) ( | X _ Y _ | + | Y i Y _ | )
where X _ and Y _ are the mean values of datasets X and Y , respectively.
The AC value ranges 0 1 , where AC = 1 means the agreement between X and Y is perfect, while values less than or equal to zero is a no agreement indicator.
Ji and Gallo (2006) described an unsystematic sum of product difference (SPDu), which is defined as:
SPD u   =   i = 1 n ( | X i X ^ i | ) ( | Y i Y ^ i | )
where X ^ and Y ^ are from the GMFR regression model and SPDs can be obtained by SPDs = SSD − SPDu. Thus, systematic and unsystematic agreement coefficients can be calculated by:
AC sys   = 1 S P D s S P O D
AC uns   = 1   S P D u S P O D
where ACsys = 1 if the GMFR line is perfectly in line with the 1:1 line and ACuns = 1 if all points fall directly on the GMFR line.
Assessment 3—Examining spatial and distribution pattern of local differences:
Due to the spatial variations of field measurements in ground condition, quality, and the local applicability of regression model techniques, the predicted values may have different levels of error across the area of interest. Accuracy assessment throughout the region of interest will provide important information regarding the magnitude and direction of the relative error. To conduct this assessment, choosing the optimal size of the scale is of paramount importance. The scale needs to be to be large enough to contain sufficient reference data for associated confidence intervals to be of an acceptable size. In parallel, it has to be small enough to provide a reasonable spatial illustration of regional variation of study area.
Assessment 4—Examining local variability:
Optimizing the accuracy for local or global processes, a lack of input information at the desired scale, etc., can lead to loss of local variability. A choropleth map of local variance or standard deviation presents a qualitative interpretation of the magnitude and spatial patterns of differences between datasets in the study area. Local variability provides information on local spatial variability in the modeled dataset and its relationship to the reference dataset.
The following lines describe the FIA plot inclusion criteria:
All subplots must be sampled.
Annual assessment vs. pooled assessment:
For an annual assessment: only FIA plots sampled in the designated year were considered and compared to the modeled AGB surface for said year.
For a pooled assessment: FIA plots inventoried in all years (2001–2019) were aggregated within each hex. Modeled values were extracted from each annual surface to match the inventory year for a particular FIA plot but aggregated within each hex just the same.

4. Results and Discussion

The following subsections describe the qualitative and quantitative evaluation of the produced AGB maps.

4.1. Historical AGB Mapping

The national land cover database (NCLD) only provided the land cover maps for 2001, 2006, 2011, 2016, and 2019, respectively; hence, in order to analyze the change over the last 19 years, these available maps were used. Figure 4 shows the AGB maps of the entire NYS for 2001, 2006, 2011, 2016, and 2019. As the figure shows, the AGB has increased in the central upstate part of the NYS between 2001 and 2006, followed by regrowth in the northern regions in 2011. According to the 2016 AGB map, the AGB has increased in comparison to 2011. However, there is no remarkable AGB change between 2016 and 2019.
Loss, gain, and unchanged areas are demonstrated in Figure 5 for the 5-year increment. Areas with gain are shown in green, loss in red, and unchanged regions in blue. For the 2001–2006 time period, green areas are more than red areas, which indicates an increase in the amount of AGB in the year 2006. In the 2006–2011 change map, the loss seems to be the dominant phenomenon (Figure 5). For the 2011–2016 AGB change map, both loss and gain are evident. Within the time span of 2016–2019, it is hard to see significant loss or gain. In order to analyze further the loss/gain of AGB, specific regions across NYS were selected (Figure 6 and Figure 7) and investigated according to the US National Maps Attributing Forest Change: 1986–2010 [45].
Schleeweis et al. (2020) [45] utilized the Landsat archive and an ensemble of disturbance algorithms to produce the US national maps attributing forest change from 1986 to 2010. The disturbance map includes the following classes: stable (i.e., no change), removals, fire, stress, wind, conversion, and other. The most dominant class of forest change in NYS is removal. For all training datasets, land cover and land use observations were collected. Plots labeled as mechanical forest change were labeled as removal or conversion depending on the evidence of vegetation recovery and/or land use change. The removal class has 82.3% user’s and 72.2% producer’s accuracy, respectively. Figure 6 shows the removal disturbance maps of a region in north part of NYS from 2000 to 2010. The AGB map of each year clearly shows the removal occurrence according to that year, which indicates the reliability of our RF historical AGB modeling.
In addition, Figure 7 shows two zoomed areas of forest disturbance, with one demonstrating the removal occurred in 2004 and 2005, while the other indicates the forest fire in 2008. First, the rectangles on the left show the AGM map of the years from 2003 to 2006. The removal disturbance has been clearly demonstrated in the AGB maps of 2004 and 2005, respectively. Second, according to Figure 7, the 2007 AGB map is before the fire and the fired area is green. Then, in 2008, the year of forest fire, the map clearly shows the burned region with red area indicating close to zero AGB values. Importantly, the AGB maps of 2009 and 2010 show the recovery/regrowth of forest in this region.
Changes in the forest and three different forest types have been calculated based on the NLCD maps classification. Changes in the forest and three different forest types have been calculated based on the NLCD maps classification. The NLCD map consists of a 30 m resolution land cover classes with a 16-class legend based on a modified Anderson Level II classification system [52]. First, the raster of each forest class type, including deciduous, evergreen, and mixed forests, was converted to a shapefile. Second, the shapefiles were used to clip the AGB maps to calculate the amount of the AGB for each year. Then, the AGB changes were calculated using the difference of AGB values between years. Figure 8 shows the total forest AGB of 2001, 2006, 2011, 2016, and 2019. As shown, there is a decrease in the amount of AGB from 2001 to 2006. Then, there was a considerable increase in 2011, followed by a slight increase in 2016. The AGB’s loss from 2016 to 2019 can be seen in Figure 8. Figure 9 demonstrates the AGB changes in 106 Mg/ha for 2001, 2006, 2011, 2016, and 2019 in deciduous, evergreen, and mixed forests using NLCD maps. First, there is a decrease, 983.8 × 106 Mg/ha, in deciduous forests, about 132.7 × 106 Mg/ha for evergreen forests, and 173.2 × 106 Mg/ha for mixed forests from 2001 to 2006. From 2006 to 2011, deciduous, evergreen, and mixed forests areas have increased 618.3 × 106 Mg/ha, 82.5 × 106 Mg/ha, and 137.4 × 106 Mg/ha, respectively. Again in 2016, a regrowth can be seen in deciduous, evergreen, and mixed forests in comparison to 2011 for about 229.1 × 106 Mg/ha, 23.5 × 106 Mg/ha, and 27.8 × 106 Mg/ha, respectively. Deciduous, evergreen, and mixed forests have decreased 67.2 × 106 Mg/ha, 41.9 × 106 Mg/ha, and 26.4 × 106 Mg/ha, respectively, in 2019.

4.2. Accuracy Assessment Using FIA Plots

Results of Assessment 1: The KS distance measures the maximum difference between the ecdf of the model and that of the FIA plot data. In Table 2, Table 3, Table 4 and Table 5, the KS distance is the largest value at the plot:pixel scale for all years because a much larger number of plots with very small biomass values are included. For the years 2019 and 2016, the KS distance at 78,100 ha and 216,500 ha scales is decreased close to zero because the two distributions became more similar (Table 4 and Table 5, and Figure 10).
Although increasing the scale may be associated with the decrease in the KS distance, this decrease may be due to a difference in spatial support and/or dataset uncertainty than the actual difference between the modeled and reference dataset.
Results of Assessment 2: Table 2 also presents the R2 and RMSE values for each dataset and scale. Plots per hexagons (PPH) is computed as the average number of plots in each hexagon for a given aggregation size. Our results support those found in Ji and Gallo (2006) that the R2 value does not reflect the level of systematic difference present in the relationship and simply increases with increasing scale. The RMSE values reflect the effects systematic and unsystematic differences and provide valuable information regarding differences between the datasets in data units. Based on RMSEs only, it is difficult to establish what portion of the difference is due to systematic versus unsystematic disagreement. Increasing the scale also indicates improvements in the RMSE and R2 patterns, which might be due to considering different reference samples at larger scales.
Figure 10 shows the ecdf graph comparisons over the years (i.e., 2002, 2006, 2011, 2016, and 2019) with different scales. Initially, there is no flat section at the beginning of the pixel ecdf for all graphs, which indicates that the generated AGB maps are not able to predict as many as zero AGB values as the FIA data. This means that the NYS AGB maps include less mapped zeros for developed and water areas, expected to have a zero and near-zero AGB value in comparison to FIA plots. This issue can be related to issues of optical imagery, such as Landsat, which struggle to predict zero biomass values for green reflected areas without biomass. Moreover, it could be caused by FIA reference data labeled as “structural zeros”, where they do not record biomass due to forest/nonforest definitions while trees exist there. All ecdf graphs at different scales for 2002 reach their maximum predicted values around 170 Mg/ha. For 2006 graph, at plot:pixel and 8660 ha scales, the ecdf reaches 150 Mg/ha, while at the 78,100 and 216,500 scales, it reaches 125 Mg/ha. For 2011, the maximum predicted value comes about 170 Mg/ha for both the plot:pixel and 8660 ha scale. Then, the maximum predicted value reaches 150 Mg/ha and 140 Mg/ha at 78,100 and 216,500 ha scales, respectively. At the plot:pixel and 8660 ha scales for 2016 and 2019, the maximum predicted value reaches 170 Mg/ha, whereas for the 78,100 and 216,500 ha scales, it is about150 Mg/ha. For the pooled datasets, the maximum predicted value is about 150 Mg/ha for the plot:pixel and 8660 ha scales and at 120 Mg/ha for the 78,100 and 216,500 ha scales. Reaching to the maximum predicted values for ecdf is a sign of underestimation in the RF model. Therefore, the AGB maps saturate sooner than the reference data. This could be a problem relating to both optical imagery and using airborne LiDAR rasters as training/testing sample. First, saturation is a common issue in optical imagery at regions with high biomass [17,53]. Second, in the process of generating airborne LiDAR rasters, underestimation of high biomass areas exist which seem to be inevitable.
Figure 11 represents the scatterplot comparison of modeled estimates versus estimates derived from the FIA plots at all four scales. The 1:1 line and the GMFR regression lines for each model–plot relationship are added to aid visual interpretation. The scatter about the GMFR line decreased with scale as expected, resulting in increasing ACu values with the scale as well (Table 2, Table 3, Table 4 and Table 5). The ACs remain approximately the same at the plot:pixel and 8660 ha scales (Table 2 and Table 3), while it decreases as the scale increases (Table 4 and Table 5). It can be interpreted as the difference between the modeled AGB and the reference data. When the scale increases, samples with more variety are available and the systematic error decreases. Remaining steady and decreasing off the 1:1 line (Figure 11) indicates real error presented over and above-expected differences due to differences in spatial support and uncertainty in the reference datasets. This interpretation is more robust when the assessment includes scales with sufficient ground plots to be reasonably confident of reference data estimates. The distance and direction of the GMFR line from the 1:1 line provides an approximate indication of the direction and magnitude of the difference between the datasets at that scale. Moreover, if the GMFR line crosses the 1:1 line, it indicates that the direction of relative over- and underestimation changes at a threshold mean biomass/ha value [27]. These differences also may be due to the spatial pattern of biomass values in NY, in which high and low values are more spatially intermixed throughout the state.
Results of Assessment 3: Figure 12 demonstrates the mapped differences in the modeled area estimates at the 216,500 ha scale with respect to plot-based confidence intervals for 2002, 2006, 2011, 2016, 2019, and the pooled datasets. In the 2002 map, about 60% of the time, the mean biomass estimates fall within the 95% confidence interval. About 66% of the time, the mean biomass estimates fall within the 95% confidence interval for 2006. As the 2011 distribution map indicates, approximately 72% of the time, the mean biomass estimates fall within the 95% confidence interval. About 77% of the time, the mean biomass estimates fall within the 95% confidence interval for 2016. The 2019 mean biomass values fall within the 95% CI associated with the plot means of 86% of the time. For the pooled dataset, 28% of the time, the mean biomass estimates fall within the 95% confidence interval. In Figure 12, the largest errors occur outside the airborne LiDAR AGB pilot areas. It should be noted that training/testing datasets are within the four pilot areas; however, the proposed model predicts the AGB for the entire NYS.
Results of Assessment 4: Figure 13 shows the mapped results of local variability of plot AGB values in the 216,500 ha scale for 2002, 2006, 2011, 2016, 2019, and the pooled data in terms of the standard deviation. The local variability map of the FIA plot and the produced maps at the 216,500 ha scale for each year are shown separately. Generally, comparing the FIA and target maps shows that among all the maps, the local variability of the 2002 target map is more similar to the FIA plots than the others.
The assessment provided in this section presents useful information about the magnitude and direction of the error across the range of distribution of AGB values, such as Figure 11. Figure 12 shows the magnitude of the error in different regions of NYS. This assessment helps us to identify the type of the error. For instance, Figure 10 shows the frequency distribution of values and Figure 13 indicates the local spatial variability of the modeled geospatial dataset. Moreover, it can be analyzed what portion of the relative error could be attributed to systematic versus unsystematic differences in Table 2, Table 3, Table 4 and Table 5.
Implementing assessments at different scales provides informative information which is related to the application. Finding the best scale is a matter of having sufficient reference data, which depends on the spatial intensity of the field measurements representing the application of the study. The ecdf graph assessment at different scale presents some information on which scale improves the model performance. However, at the same time, they need to be interpreted in association with the results from broader and finer scales. For example, when an error present at the 216,500 ha scale also occurs at finer scales, it is more likely to indicate that the difference observed at the finer scale contains a large component of real bias over and above uncertainty in the plot-based estimate.

5. Conclusions

This study proposed a framework for the historical AGB mapping of the entire NYS from 2001 to 2019 using the RF model in GEE. To achieve this goal, Landsat5 TM, Landsat 7 ETM, and Landsat 8 OLI imagery, along with environmental and topographical data, were utilized for AGB modeling using an object-based image analysis (OBIA) approach. For model training and validation, LiDAR-based AGB rasters from 2014, 2015, 2016, and 2018 were used. Then, the trained RF model was applied to generate the AGB maps for the remaining years. Results showed a decrease of 1289.70 × 106 Mg/ha in the AGB between 2001 and 2006, while the biomass increased about 838.2 × 106 Mg/ha from 2006 to 2011. An approximate AGB of 280.4 Mg/ha increased between 2011 and 2016, while the AGB difference between 2016 and 2019 was a decrease of 135.5 Mg/ha. In addition, map accuracy assessment of local variability, and spatial and distribution patterns of local differences provided information about the magnitude and direction of local bias. The results of the mapped differences in the modeled area estimates at the 216,500 ha scale showed that about 60%, 66%, 72%, 77%, 86%, and 28% of the time, the mean biomass estimates fall within the 95% confidence interval for 2002, 2006, 2011, 2016, 2019, and pooled datasets, respectively. The novelty of this study is associated with the use of the OBIA approach for accurate temporal state-wide AGB mapping. One of the limitations of this study is that, as these maps are generated based on the LiDAR raster, the uncertainty in the reference data can be prorogated throughout the workflow. However, the historical AGB maps can contribute to carbon stock monitoring over large areas, resulting in practical solutions for climate change issues.

Author Contributions

Conceptualization, B.S., M.M., C.M.B., H.T. and L.J.; supervision: B.S., C.M.B. and M.M.; formal analysis: H.T. and L.J.; data collection, H.T. and L.J.; visualization, H.T.; writing-original draft preparation, H.T.; writing-review and editing, B.S., M.M., C.M.B., H.T. and L.J. All the authors have read and agreed to the published version of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This project was financially supported by a McIntire-Stennis grant, funded through the USDA-NIFA (United States Department of Agriculture- National Institute of Food and Agriculture) and The Climate and Applied Forest Research Institute (CAFRI) at SUNY ESF funded through the NYS Department of Environmental Conservation.

Data Availability Statement

Not applicable.

Acknowledgments

This project was financially supported by a McIntire-Stennis grant, funded through the USDA-NIFA (United States Department of Agriculture- National Institute of Food and Agriculture) and The Climate and Applied Forest Research Institute (CAFRI) at SUNY ESF funded through the NYS Department of Environmental Conservation. We acknowledge both agencies for supporting this project.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Food and Agriculture Organization of the United Nations. Global Forest Resources Assessment 2020—Key Findings; Food and Agriculture Organization of the United Nations: Rome, Italy, 2020. [Google Scholar]
  2. Chavan, B.; Rasal, G. Total sequestered carbon stock of Mangifera indica. J. Environ. Earth Sci. 2012, 2, 37–49. [Google Scholar]
  3. Zhu, Y.; Feng, Z.; Lu, J.; Liu, J. Estimation of Forest Biomass in Beijing (China) Using Multisource Remote Sensing and Forest Inventory Data. Forests 2020, 11, 163. [Google Scholar] [CrossRef] [Green Version]
  4. Siry, J.P.; Cubbage, F.W.; Ahmed, M.R. Sustainable forest management: Global trends and opportunities. For. Policy Econ. 2005, 7, 551–561. [Google Scholar] [CrossRef]
  5. Rodríguez-Veiga, P.; Wheeler, J.; Louis, V.; Tansey, K.; Balzter, H. Quantifying Forest Biomass Carbon Stocks from Space. Curr. For. Rep. 2017, 3, 1–18. [Google Scholar] [CrossRef] [Green Version]
  6. Saatchi, S. SAR methods for mapping and monitoring forest biomass. In SAR Handbook: Comprehensive Methodologies for Forest Monitoring and Biomass Estimation; Flores, A., Herndon, K., Thapa, R., Cherrington, E., Eds.; SERVIR Global Science: Huntsville AL, USA, 2019; pp. 207–246. [Google Scholar]
  7. Laney, D. 3D data management: Controlling data volume, velocity and variety. META Group Res. Note 2001, 6, 1. [Google Scholar]
  8. Amani, M.; Ghorbanian, A.; Ahmadi, S.A.; Kakooei, M.; Moghimi, A.; Mirmazloumi, S.M.; Moghaddam, S.H.A.; Mahdavi, S.; Ghahremanloo, M.; Parsian, S.; et al. Google Earth Engine Cloud Computing Platform for Remote Sensing Big Data Applications: A Comprehensive Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5326–5350. [Google Scholar] [CrossRef]
  9. Tamiminia, H.; Salehi, B.; Mahdianpari, M.; Quackenbush, L.; Adeli, S.; Brisco, B. Google Earth Engine for geo-big data applications: A meta-analysis and systematic review. ISPRS J. Photogramm. Remote Sens. 2020, 164, 152–170. [Google Scholar] [CrossRef]
  10. Hemati, M.; Hasanlou, M.; Mahdianpari, M.; Mohammadimanesh, F. A Systematic Review of Landsat Data for Change Detection Applications: 50 Years of Monitoring the Earth. Remote Sens. 2021, 13, 2869. [Google Scholar] [CrossRef]
  11. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  12. Hudak, A.T.; Fekety, P.A.; Kane, V.R.; Kennedy, R.E.; Filippelli, S.K.; Falkowski, M.J.; Tinkham, W.T.; Smith, A.M.S.; Crookston, N.L.; Domke, G.M.; et al. A carbon monitoring system for mapping regional, annual aboveground biomass across the northwestern USA. Environ. Res. Lett. 2020, 15, 095003. [Google Scholar] [CrossRef]
  13. De Jong, S.M.; Pebesma, E.J.; Lacaze, B. Above-ground biomass assessment of Mediterranean forests us-ing airborne imaging spectrometry: The DAIS Peyne experiment. Int. J. Remote Sens. 2003, 24, 1505–1520. [Google Scholar] [CrossRef]
  14. Kelsey, K.C.; Neff, J.C. Estimates of Aboveground Biomass from Texture Analysis of Landsat Imagery. Remote Sens. 2014, 6, 6407–6422. [Google Scholar] [CrossRef] [Green Version]
  15. Luo, S.; Wang, C.; Xi, X.; Nie, S.; Fan, X.; Chen, H.; Ma, D.; Liu, J.; Zou, J.; Lin, Y.; et al. Estimating forest aboveground biomass using small-footprint full-waveform airborne LiDAR data. Int. J. Appl. Earth Obs. Geoinf. ITC J. 2019, 83, 101922. [Google Scholar] [CrossRef]
  16. Hussin, Y.A.; Gilani, H.; van Leeuwen, L.; Murthy, M.S.R.; Shah, R.; Baral, S.; Tsendbazar, N.-E.; Shrestha, S.; Shah, S.K.; Qamer, F.M. Evaluation of object-based image analysis techniques on very high-resolution satellite image for biomass estimation in a watershed of hilly forest of Nepal. Appl. Geomat. 2014, 6, 59–68. [Google Scholar] [CrossRef]
  17. Urbazaev, M.; Thiel, C.; Cremer, F.; Dubayah, R.; Migliavacca, M.; Reichstein, M.; Schmullius, C. Estimation of forest aboveground biomass and uncertainties by integration of field measurements, airborne LiDAR, and SAR and optical satellite data in Mexico. Carbon Balance Manag. 2018, 13, 5. [Google Scholar] [CrossRef] [Green Version]
  18. Mutanga, O.; Adam, E.; Cho, M.A. High density biomass estimation for wetland vegetation using WorldView-2 imagery and random forest regression algorithm. Int. J. Appl. Earth Obs. Geoinf. 2012, 18, 399–406. [Google Scholar] [CrossRef]
  19. Salehi, B.; Daneshfar, B.; Davidson, A.M. Accurate crop-type classification using multi-temporal optical and multi-polarization SAR data in an object-based image analysis framework. Int. J. Remote Sens. 2017, 38, 4130–4155. [Google Scholar] [CrossRef]
  20. Hirata, Y.; Furuya, N.; Saito, H.; Pak, C.; Leng, C.; Sokh, H.; Ma, V.; Kajisa, T.; Ota, T.; Mizoue, N. Object-Based Mapping of Aboveground Biomass in Tropical Forests Using LiDAR and Very-High-Spatial-Resolution Satellite Data. Remote Sens. 2018, 10, 438. [Google Scholar] [CrossRef] [Green Version]
  21. Silveira, E.M.O.; Silva, S.H.G.; Acerbi-Junior, F.W.; Carvalho, M.C.; Carvalho, L.M.T.; Scolforo, J.R.S.; Wulder, M.A. Object-based random forest modelling of aboveground forest biomass outperforms a pixel-based approach in a heterogeneous and mountain tropical environment. Int. J. Appl. Earth Obs. Geoinf. 2019, 78, 175–188. [Google Scholar] [CrossRef]
  22. Zhang, C.; Denka, S.; Cooper, H.; Mishra, D. Quantification of sawgrass marsh aboveground biomass in the coastal Everglades using object-based ensemble analysis and Landsat data. Remote Sens. Environ. 2018, 204, 366–379. [Google Scholar] [CrossRef]
  23. Chen, L.; Ren, C.; Bao, G.; Zhang, B.; Wang, Z.; Liu, M.; Man, W.; Liu, J. Improved Object-Based Estimation of Forest Aboveground Biomass by Integrating LiDAR Data from GEDI and ICESat-2 with Multi-Sensor Images in a Heterogeneous Mountainous Region. Remote Sens. 2022, 14, 2743. [Google Scholar] [CrossRef]
  24. Pham, L.T.; Brabyn, L. Monitoring mangrove biomass change in Vietnam using SPOT images and an object-based approach combined with machine learning algorithms. ISPRS J. Photogramm. Remote Sens. 2017, 128, 86–97. [Google Scholar] [CrossRef]
  25. Forests—NYS Dept. of Environmental Conservation. Available online: https://www.dec.ny.gov/lands/309.html (accessed on 13 December 2021).
  26. Albright, T.A. Forests of New York, 2017. In Resource Update FS-170; US Department of Agriculture, Forest Service, Northern Research Station: Newtown Square, PA, USA, 2018; pp. 1–4. [Google Scholar]
  27. Riemann, R.; Wilson, B.T.; Lister, A.; Parks, S. An effective assessment protocol for continuous geospatial datasets of forest characteristics using USFS Forest Inventory and Analysis (FIA) data. Remote Sens. Environ. 2010, 114, 2337–2352. [Google Scholar] [CrossRef]
  28. Forest Inventory and Analysis National Program. Available online: https://www.fia.fs.fed.us/ (accessed on 13 December 2021).
  29. Jenkins, J.C.; Chojnacky, D.C.; Heath, L.S.; Birdsey, R.A. National-scale biomass estimators for United States tree species. For. Sci. 2003, 49, 12–35. [Google Scholar]
  30. Burrill, E.A.; Wilson, A.M.; Turner, J.A.; Pugh, S.A.; Menlove, J.; Christiansen, G.; Conkling, B.L.; David, W. The Forest Inventory and Analysis Database: Database Description and User Guide Version 8.0 for Phase 2; US Department of Agriculture, Forest Service: Jackson, WY, USA, 2018; p. 946.
  31. NYS-LIDAR-Coverage. Available online: https://gis.ny.gov/elevation/lidar-coverage.htm (accessed on 13 December 2021).
  32. LAS. Available online: https://gis.ny.gov/elevation/metadata/Ulster-Dutchess-Orange-Counties-NY-Classified-LAS.xml (accessed on 13 December 2021).
  33. NY_WarrenWashingtonEssex_Spring2015. Available online: https://gis.ny.gov/elevation/metadata/Warren-Washington-Essex-2014-15.xml (accessed on 13 December 2021).
  34. Allegany and Steuben Counties, New York Lidar; Overall Project Metadata. Available online: https://gis.ny.gov/elevation/metadata/2016NY-Allegany-Steuben-Classified-Point-Cloud-USGSv1.2.xml (accessed on 13 December 2021).
  35. LIDAR Collection (QL2) for Cayuga County and Most of Oswego County, New York Lidar; Classified Point Cloud. Available online: https://gis.ny.gov/elevation/metadata/2018NY-Cayuga-Oswego-Classified-Point-Cloud.xml (accessed on 13 December 2021).
  36. Young, N.E.; Anderson, R.S.; Chignell, S.M.; Vorster, A.G.; Lawrence, R.; Evangelista, P.H. A survival guide to Landsat preprocessing. Ecology 2017, 98, 920–932. [Google Scholar] [CrossRef] [Green Version]
  37. Bai, B.X.; Tan, Y.M.; Wu, P. The spatial and temporal availability differences of cloud-free landsat images over three gorges reservoir area. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-3/W9, 1–8. [Google Scholar] [CrossRef] [Green Version]
  38. Scaramuzza, P.; Micijevic, E.; Chander, G. SLC gap-filled products phase one methodology. Landsat Tech. Notes 2004, 5. [Google Scholar]
  39. API|LT-GEE Guide. Available online: https://emapr.github.io/LT-GEE/api.html#buildsrcollection (accessed on 13 December 2021).
  40. Kennedy, R.E.; Ohmann, J.; Gregory, M.; Roberts, H.; Yang, Z.; Bell, D.M.; Kane, V.; Hughes, M.J.; Cohen, W.B.; Powell, S.; et al. An empirical, integrated forest biomass monitoring system. Environ. Res. Lett. 2018, 13, 025004. [Google Scholar] [CrossRef]
  41. Daly, C.; Halbleib, M.; Smith, J.I.; Gibson, W.P.; Doggett, M.K.; Taylor, G.H.; Curtis, J.; Pasteris, P.P. Physiographically sensitive mapping of climatological temperature and precipitation across the conterminous United States. Int. J. Clim. 2008, 28, 2031–2064. [Google Scholar] [CrossRef]
  42. PRISM Climate Group at Oregon State University. Available online: https://prism.oregonstate.edu/normals/ (accessed on 13 December 2021).
  43. GitHub. terrainr: Retrieve Data from the USGS National Map and Transform it for 3D Landscape Visualizations, Issue #416 Ropensci/Software-Review. Available online: https://github.com/ropensci/software-review/issues/416 (accessed on 29 November 2021).
  44. Services. Available online: https://apps.nationalmap.gov/services/ (accessed on 29 November 2021).
  45. Kennedy, R.E.; Yang, Z.; Cohen, W.B. Detecting trends in forest disturbance and recovery using yearly Landsat time series: 1. LandTrendr—Temporal segmentation algorithms. Remote Sens. Environ. 2010, 114, 2897–2910. [Google Scholar] [CrossRef]
  46. Kennedy, R.E.; Yang, Z.; Gorelick, N.; Braaten, J.; Cavalcante, L.; Cohen, W.B.; Healey, S. Implementation of the LandTrendr Algorithm on Google Earth Engine. Remote Sens. 2018, 10, 691. [Google Scholar] [CrossRef] [Green Version]
  47. Burkman, B. Forest inventory and analysis—Sampling and plot design. FIA Fact Sheet Ser. USDA For. Serv. 2005. [Google Scholar]
  48. Addink, E.; van Coillie, F. Object-based image analysis. GIM Int. 2010, 24, 12–15. [Google Scholar]
  49. Achanta, R.; Süsstrunk, S. Superpixels and Polygons Using Simple Non-iterative Clustering. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 4895–4904, ISBN 1063-6919. [Google Scholar]
  50. West, P.W. Tree and Forest Measurement; Springer: Berlin/Heidelberg, Germany, 2015. [Google Scholar] [CrossRef]
  51. Ji, L.; Gallo, K. An Agreement Coefficient for Image Comparison. Photogramm. Eng. Remote Sens. 2006, 72, 823–833. [Google Scholar] [CrossRef]
  52. Homer, C.; Dewitz, J.; Jin, S.; Xian, G.; Costello, C.; Danielson, P.; Gass, L.; Funk, M.; Wickham, J.; Stehman, S.; et al. Conterminous United States land cover change patterns 2001–2016 from the 2016 National Land Cover Database. ISPRS J. Photogramm. Remote Sens. 2020, 162, 184–199. [Google Scholar] [CrossRef]
  53. Wang, L.; Zhou, X.; Zhu, X.; Dong, Z.; Guo, W. Estimation of biomass in wheat using random forest regression algorithm and remote sensing data. Crop J. 2016, 4, 212–219. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Location of the study area (NYS), boundaries of 4 pilot areas of LiDAR data (red: three counties—2014; dark green: Warren Washington Essex—2015; light green: Allegany and Steuben—2016; orange: Cayuga Oswego—2018), training (purple), and validation (pink) samples for each LiDAR raster.
Figure 1. Location of the study area (NYS), boundaries of 4 pilot areas of LiDAR data (red: three counties—2014; dark green: Warren Washington Essex—2015; light green: Allegany and Steuben—2016; orange: Cayuga Oswego—2018), training (purple), and validation (pink) samples for each LiDAR raster.
Remotesensing 14 04097 g001
Figure 2. The proposed object-based AGB estimation workflow using random forest regression model and Landsat imagery.
Figure 2. The proposed object-based AGB estimation workflow using random forest regression model and Landsat imagery.
Remotesensing 14 04097 g002
Figure 3. An example of SNIC segmentation in GEE using different seeds and same size, compactness, connectivity, and neighborhoodSize. (a) Landsat 5 image collection for July/01/2011 to August/31/2011. (b) SNIC segmentation using 60 seeds. (c) SNIC segmentation using 20 seeds.
Figure 3. An example of SNIC segmentation in GEE using different seeds and same size, compactness, connectivity, and neighborhoodSize. (a) Landsat 5 image collection for July/01/2011 to August/31/2011. (b) SNIC segmentation using 60 seeds. (c) SNIC segmentation using 20 seeds.
Remotesensing 14 04097 g003aRemotesensing 14 04097 g003b
Figure 4. NYS object-based historical AGB mapping using random forest regression model in GEE and Landsat imagery.
Figure 4. NYS object-based historical AGB mapping using random forest regression model in GEE and Landsat imagery.
Remotesensing 14 04097 g004aRemotesensing 14 04097 g004b
Figure 5. The 5-year increment change maps to analyze loss (red), gain (green), and unchanged (blue) areas in NYS for time spans: (a): 2001–2006, (b): 2006–2011, (c): 2011–2016, and (d): 2016–2019.
Figure 5. The 5-year increment change maps to analyze loss (red), gain (green), and unchanged (blue) areas in NYS for time spans: (a): 2001–2006, (b): 2006–2011, (c): 2011–2016, and (d): 2016–2019.
Remotesensing 14 04097 g005
Figure 6. Forest change (removal) from 2000 to 2010 in an area in northern NYS and the related AGB maps.
Figure 6. Forest change (removal) from 2000 to 2010 in an area in northern NYS and the related AGB maps.
Remotesensing 14 04097 g006
Figure 7. Two zoomed areas for demonstrating the forest disturbance (left: forest removal in 2004 and 2005; right: forest fire in 2008).
Figure 7. Two zoomed areas for demonstrating the forest disturbance (left: forest removal in 2004 and 2005; right: forest fire in 2008).
Remotesensing 14 04097 g007
Figure 8. Total forest AGB changes over time in NYS (2001, 2006, 2011, 2016, and 2019).
Figure 8. Total forest AGB changes over time in NYS (2001, 2006, 2011, 2016, and 2019).
Remotesensing 14 04097 g008
Figure 9. Forest type changes over time in NYS using classes of NLCD maps (2001, 2006, 2011, 2016, and 2019).
Figure 9. Forest type changes over time in NYS using classes of NLCD maps (2001, 2006, 2011, 2016, and 2019).
Remotesensing 14 04097 g009
Figure 10. ECDF comparisons across scales—FIA plots vs. mapped (SI-corrected).
Figure 10. ECDF comparisons across scales—FIA plots vs. mapped (SI-corrected).
Remotesensing 14 04097 g010
Figure 11. The 1:1 and GMFR lines across scales—FIA plots vs. mapped (SI corrected).
Figure 11. The 1:1 and GMFR lines across scales—FIA plots vs. mapped (SI corrected).
Remotesensing 14 04097 g011
Figure 12. Mapped differences in modeled area estimates at the 216,500 ha scale with respect to plot-based confidence intervals (2002, 2006, 2011, 2016, 2019, and pooled).
Figure 12. Mapped differences in modeled area estimates at the 216,500 ha scale with respect to plot-based confidence intervals (2002, 2006, 2011, 2016, 2019, and pooled).
Remotesensing 14 04097 g012
Figure 13. Mapped results of local variability of plot values in each 216,500 ha area (2002, 2006, 2011, 2016, 2019, and pooled).
Figure 13. Mapped results of local variability of plot values in each 216,500 ha area (2002, 2006, 2011, 2016, 2019, and pooled).
Remotesensing 14 04097 g013
Table 1. The statistical characteristics of AGB values of FIA plots.
Table 1. The statistical characteristics of AGB values of FIA plots.
YearMin_AGB (Mg/ha)Max_AGB (Mg/ha)Mean_AGB (Mg/ha)Median_AGB (Mg/ha)
20020317.1871.2652.73
20030378.7371.8151.22
20040298.5167.6137.55
20050308.0474.1961.67
20060369.9470.8043.69
20070318.0671.1254.44
20080324.5973.1356.10
20090403.7973.7353.59
20100320.6973.0349.37
20110392.3175.1350.61
20120330.8673.3460.18
20130327.1676.8761.34
20140422.6278.0758.50
20150336.4774.1049.20
20160360.5679.4562.64
20170424.9980.1958.43
20180349.0176.9965.12
20190322.5679.2457.48
Table 2. Agreement statistics (SI-corrected)—pixel comparison.
Table 2. Agreement statistics (SI-corrected)—pixel comparison.
GroupnPPHMean FIAMBE
(Mg/ha)
RMSE
(Mg/ha)
MAE
(Mg/ha)
R2KSACACsACu
target_20021017NA71.3318.5352.7141.830.560.350.490.840.65
target_2006880NA70.889.4758.6646.040.470.360.150.710.43
target_2011940NA75.2112.0356.6645.140.550.340.320.750.57
target_2016640NA79.458.0353.1341.250.590.340.390.810.57
target_2019606NA79.465.7654.5342.670.600.380.300.750.56
pooled 14,333NA74.2414.0956.2244.550.530.350.340.770.57
Table 3. Agreement statistics (SI-corrected)—8660 Ha Hex.
Table 3. Agreement statistics (SI-corrected)—8660 Ha Hex.
GroupnPPHMean FIAMBE
(Mg/ha)
RMSE
(Mg/ha)
MAE
(Mg/ha)
R2KSACACsACu
target_20029211.1071.7518.8451.9041.030.560.320.490.840.65
target_20068211.0772.208.9157.6945.300.470.340.130.700.43
target_20118551.1076.6711.6256.0644.620.550.320.300.730.57
target_20165761.1178.498.6652.040.070.600.340.410.820.59
target_20195261.1580.305.1753.6440.890.600.350.280.730.54
pooled 15289.3773.5814.3836.6729.840.660.230.540.780.76
Table 4. Agreement statistics (SI-corrected)—78,100 Ha Hex.
Table 4. Agreement statistics (SI-corrected)—78,100 Ha Hex.
GroupnPPHMean FIAMBE
(Mg/ha)
RMSE
(Mg/ha)
MAE
(Mg/ha)
R2KSACACsACu
target_20021935.2565.4720.2532.3727.310.710.250.670.820.84
target_20061914.6069.5910.1036.1529.770.600.260.370.700.67
target_20111904.9474.3112.0032.7627.020.730.230.570.770.80
target_20161843.4878.366.6438.1728.000.680.200.460.780.68
target_20191793.3779.995.2937.2028.440.650.170.410.780.63
pooled 20170.1669.9615.7625.8121.120.810.270.660.760.90
Table 5. Agreement statistics (SI-corrected)—216,500 Ha Hex.
Table 5. Agreement statistics (SI-corrected)—216,500 Ha Hex.
GroupnPPHMean FIAMBE
(Mg/ha)
RMSE
(Mg/ha)
MAE
(Mg/ha)
R2KSACACsACu
target_20028112.4462.6222.4430.9525.710.760.320.660.770.90
target_20067910.9669.5610.8933.1126.470.470.330.240.650.59
target_20117911.7173.0714.0931.9022.840.550.240.520.800.72
target_2016778.3175.938.6926.7921.040.780.180.680.860.82
target_2019748.0979.475.3830.7924.710.640.270.400.770.63
pooled 82169.7968.7517.8125.8721.200.810.330.620.710.92
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tamiminia, H.; Salehi, B.; Mahdianpari, M.; Beier, C.M.; Johnson, L. Mapping Two Decades of New York State Forest Aboveground Biomass Change Using Remote Sensing. Remote Sens. 2022, 14, 4097. https://doi.org/10.3390/rs14164097

AMA Style

Tamiminia H, Salehi B, Mahdianpari M, Beier CM, Johnson L. Mapping Two Decades of New York State Forest Aboveground Biomass Change Using Remote Sensing. Remote Sensing. 2022; 14(16):4097. https://doi.org/10.3390/rs14164097

Chicago/Turabian Style

Tamiminia, Haifa, Bahram Salehi, Masoud Mahdianpari, Colin M. Beier, and Lucas Johnson. 2022. "Mapping Two Decades of New York State Forest Aboveground Biomass Change Using Remote Sensing" Remote Sensing 14, no. 16: 4097. https://doi.org/10.3390/rs14164097

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop