An Integrated GIS and Remote Sensing Approach for [0.95]Monitoring Harvested Areas from Very High-Resolution, Low-Cost Satellite Images

Advanced monitoring and mapping of forest areas using the latest technological advances in satellite imagery is an alternative solution for sustainable forest management compared to conventional ground measurements. Remote sensing products have been a key source of information and cost-effective options for monitoring changes in harvested areas. Despite recent advances in satellite technology with a broad variety of spectral and temporal resolutions, monitoring the areal extent of harvested forest areas in managed forests is still a challenge, primarily due to the highly dynamic spatiotemporal patterns of logging activities. Our goal was to introduce a plot-based method for monitoring harvested forest areas from very high-resolution (VHR), low-cost satellite images. Our method encompassed two data categories, which included vegetation indices (VIs) and texture analysis (TA). Each group of data was used to model the amount of harvested volume both independently and in combination. Our results indicated that the composition of all spectral bands can improve the accuracy of all models of average volume by 23.52 RMSE reduction and total volume by 33.57 RMSE reduction. This method demonstrated that monitoring and extrapolation of the calculated relation and results from smaller forested areas could be applied as an automatic remote-based supervised monitoring method over larger forest areas.


Introduction
Knowing the extent of forest cover disturbances related to anthropogenic interventions, particularly timber harvesting, over large managed areas is important to assess: (1) the construction or update of forest inventories [1,2]; (2) forest health and productivity [3,4]; (3) risks of climate change [5]; (4) carbon flux rates [6]; and (5) support required for forest management plans (FMPs) and decision making [7]. Despite the economic importance of the timber industry in many regions, logging activities on a global scale have resulted in significant ecological damage due to excessive timber volume extraction, which in some cases is considered to be irreversible [8].
It is evident that logging has multiple effects on forest composition and structure [9]. Logging not only decreases the tree density [10] and basal area [11], but also regulates canopy gaps dynamics with altered light conditions [12]. Even-aged forest stand management commonly leads to the final harvest, by removing the mature stand, either by applying several shelterwood cutting phases or in one-cut (clear-cut). On the contrary, selective logging, and uneven-aged stand management never decrease the growing stock to zero. Selective logging is a source of anthropogenic disturbance and therefore is regarded as forest degradation. Approximately, 50%-90% of forest cover is maintained after selective logging activities [13,14], crown openness can cause significant environmental consequences, like for instance semantic timber stock reduction [15].
More traditional methods of ground truth data collection to support forest resource management, such as field samples, are usually expensive and time-consuming. Also, there are challenges related to the size of study areas, accessibility, and the capacity to conduct repeated measurements over time (regular time-series datasets). However, the collection of ground truth data can help to calibrate other sources of data, such as remote sensing data using geographic information systems (GIS) and also support the interpretation and analysis of satellite imagery and aerial photography, among other methods of remote data collection.
To alleviate these challenges, satellite remote sensing (SRM) is an economical and practical solution to consistently monitor forest cover changes over large geographical areas [16]. The detection and monitoring of deforested areas using satellite images are essential for forest managers because they allow for the creation of regular time-series datasets with different spatial resolutions. These datasets can be used for multiple purposes, including forest cover and change detection [17,18] and estimation of structural (e.g., diameter at breast height, height, stem volume) and textural metrics, with acceptable accuracy [19][20][21][22][23]. In addition to volume estimation, satellite images can be used to estimate the area of forest classes, for which volume can be extrapolated using field-based measurements [24].
Some of the most significant challenges to mapping and monitoring forested areas are related to: (1) highly dynamic spatiotemporal patterns of logging activities, where SRM is able to detect changes in patterns for only short period of time, because of the quick canopy closure after silviculture activities [8]; (2) high levels of complexity inherent in forest ecosystems [25]; and (3) timber harvesting techniques of differing intensities and patterns [26]. Indeed, logging impacts can vary significantly depending on the intensity and the frequency of logging activities, since they can vary from very low, to more intensive and repeated logging activities.
Consistent or long-term remote-based monitoring is the key to dynamically discriminating and monitoring degraded forest areas either caused by biotic (e.g., pest infestation) or abiotic factors (e.g., detection of illegal logging activities, or as a control method for the evaluation of silvicultural treatment in larger regions. It is also suitable as support for terrestrial data collection for management plan elaboration, forest inventory and monitoring of carbon sequestration. Even though a single-date image [27] can identify the type of degradation, however, a time-series satellite is needed in order to better detect and assess forest cover dynamics [28]. To address the dynamics of monitoring vegetation by optical means, the best compromise is using sensors with a large field of view (FOV), such as MODIS or NOAA/AVHRR (these sensors combine moderate spatial resolution with high temporal resolution/revisit at a rate of approximately one per day); sensors with finer spatial resolution (e.g., Landsat~30 m) or very high spatial resolution (e.g., WorldView-4~0.3 m, Pleiades' (1A/1B)~0.5 m, and Quick bird~1m) tend to have revisit periods of several weeks. For example, the monitoring roles and mapping of timber harvested areas in Brazilian Amazon has largely relied on Landsat images [25]. The availability of large volumes of free images, sufficient computational capacity, and advances in image processing techniques have stimulated widespread use of Landsat imagery for a broad range of uses, including assessment of vegetation phenology [29] and the classification and assessment of forest changes [30]. In another study, Toutanova et al. [31], used Landsat-based archive to assess long-term regional forest dynamics in eastern Europe [31]. Their results clearly showed that long-term monitoring from remote sensing-based products that employ biophysical criteria of forest cover (defined according to tree canopy cover thresholds) is better suitable for monitoring of timber harvesting than forest inventory data. On the other hand, the uses of finer spatial resolution satellite images can significantly enhance the detection of harvested areas, but there may be computational constraints in many cases and they are typically costly [8]. In some cases, there are also geographical territories with limited or no access to satellite data for a particular time period.
Spectral readings are based on the interaction between objects and incoming radiation; this interaction allows for the detection and discrimination of target objects because of the characteristic signatures of reflected light, as all objects return different amounts of energy in different bands of the electromagnetic spectrum. These signatures are essentially functioning of vegetation light absorption properties at given wavelengths that can be used to estimate spectral indices [32].
Numerous vegetation indices (VIs) utilizing spectral information as a proxy for vegetation productivity, monitoring, and mapping have been developed; the normalized difference vegetation index (NDVI) is the most common index for high-throughput screening [33]. Using the visible and near-infrared (NIR) parts of the electromagnetic spectrum, NDVI estimates the fraction of the photosynthetically active radiation (FPAR) absorbed by the forest canopy [34]. The enhanced vegetation index (EVI) is an optimized index developed to reduce the saturation of the reflectance signal at increasing levels of green biomass, and diminish the noise produced by atmospheric aerosols and soil background [34]. Huene [35] intended to minimize the effects of soil background on the vegetation signal by incorporating a constant soil adjustment factor (L) into the denominator of the NDVI equation. The customized NDVI equation was named soil-adjusted vegetation index (SAVI). The adjustment factor (L) value varies based upon the soil reflectance characteristics, such as brightness and color, among others, and the appropriate value depends exclusively on vegetation density. For example, in the case of high densities, L = 0.25; for intermediate densities, L = 0.5; and for low vegetation densities, L = 1.0 [36]. However, Eastman [37] proposed that the best value for L is the point at which the difference between SAVI for light and dark soil is minimum. Therefore, for L = 1, SAVI approaches perpendicular vegetation index (PVI), which uses the perpendicular distance from each pixel coordinate to the soil line, while for L = 0, SAVI is equal to NDVI [37]. Many studies have identified the usefulness of long series of vegetation indices (mainly NDVI) derived from satellite data for monitoring purposes of forest degradation, but most have been based on low-to medium-resolution satellite images (e.g., [38][39][40]).
In this context, our main goal and novelty of this paper relies on integrating GIS and remote sensing, to utilize all potential information from very high-resolution (VHR) satellite images to help examine (i) if multi-spectral VHR, low-cost satellite images, can be used to successfully detect and estimate the amount of harvested trees vs living trees based on satellite image time series (SITS) from two different periods before and after the logging activities, (ii) the possibility to create a continuous map of harvested areas (iii) which composition of independent data can increase the model accuracy.

Characterization of the Study Area
Our study area was located in the city of Doksy on the shores of Lake Mácha in northern Bohemia in the Czech Republic; it is largely surrounded by dense forest area, covering approximately 300 km 2 . The entire study area is five hectares in a flat terrain, extending geographically from 50 • 33'26.48"N; 14 • 43'33.72"E to 50 • 33'19.88"N; 14 • 43'13.95" E at 310 m a.s.l. We selected 48 circular permanent plots with a radius of 12.6 m representing different harvest intensities in mature forest stands (mean = 113 years; Figure 1). All plots were dominated by Scots pine (Pinus sylvestris L.) with admixtures of Norway spruce (Picea abies (L.) Karst.). The experimental plots were originally established for monitoring of natural regeneration success of Scots pine in different stand densities. For this purpose, stripes of shelterwood cutting of different intensity using harvester technology were established. Harvests were conducted in March 2017 and a total wood volume of 680.5 m 3 was harvested from the plots.
Parent rock in the area was formed by late Cretaceous sandstone, with dominant soil types being Arenic Cambisol and Arenic Podzol [41]. The vegetative period is typically rather warm and dry. Mean annual air temperature is 7.3 • C and average maximum temperature is 31.

Terrestrial Data Acquisition and Processing
To determine the center of each circular plot, we used a Leica TCRP 1201+ robotic total station using the polar coordinate method and the GNSS GPS Leica system 1200 (Leica Geosystems AG: Heerbrugg, Switzerland) with centimeter accuracy. Within each circular plot, we measured in two perpendicular directions the diameter at breast height (1.3 m; DBH1.3) of standing trees and the stump diameter of harvested trees with a computer caliper DP II (accuracy 1 mm; Haglöf Inc., Sweden). Forty trees of each species were measured separately to create a regression equation for estimating the DBH1.3 of harvested trees. Similarly, 25 trees of each species were also sampled to evaluate tree heights using the Vertex IV hypsometer (accuracy 0.1 m; Haglöf Inc., Sweden) ( Table 1).

Terrestrial Data Acquisition and Processing
To determine the center of each circular plot, we used a Leica TCRP 1201+ robotic total station using the polar coordinate method and the GNSS GPS Leica system 1200 (Leica Geosystems AG: Heerbrugg, Switzerland) with centimeter accuracy. Within each circular plot, we measured in two perpendicular directions the diameter at breast height (1.3 m; DBH 1.3 ) of standing trees and the stump diameter of harvested trees with a computer caliper DP II (accuracy 1 mm; Haglöf Inc., Sweden). Forty trees of each species were measured separately to create a regression equation for estimating the DBH 1.3 of harvested trees. Similarly, 25 trees of each species were also sampled to evaluate tree heights using the Vertex IV hypsometer (accuracy 0.1 m; Haglöf Inc., Sweden) ( Table 1). The stem volume was calculated using DBH 1.3 and tree height as predictors based on volumetric equations (Equations (1) and (2); Table 2) [43] by Petráš and Pajtík (Figures 2-4). The field plot data were used as reference data to evaluate the extrapolation process of harvested volumes in the form of a continuous map using extracted spectral data for all plots.
All the processing and statistical analyses of the terrestrial data were conducted in Matlab R2017b professional edition (MathWorks©, Inc.; Massachusetts, U.S.A.).

Acquisition and Pre-Processing of Satellite Images
We used two satellite images by Pléiades-HR 1A and Pléiades-HR 1B, which included an image from 2016 (pre-harvest) and one from 2019 (post-harvest); the images were taken on 27 March 2016 and 6 April 2019, respectively. The image consisted of one panchromatic (PAN) and four multi-spectral (red = R, green = G, blue = B, and NIR) bands (Table 3). To achieve accurate georeferencing of the satellite images, we used 20 ground control points (GCPs) randomly distributed around the study area; these points were measured using the Leica real-time kinematic (RTK) system, model RX1250XC (Leica Geosystems AG; Heerbrugg, Switzerland) with centimeter accuracy. The RTK correction was conducted using a base/rover set, which sends and receives fast-rate over-the-air RTK data corrections, using the PDLGFU15 radio module. The entire phase of satellite image pre-processing was conducted in ArcGIS desktop V.10.6.1 (ESRI Inc.; Redlands, CA, U.S.A.) and PCI geomatica 2016 (PCI Geomatics; Markham, ON, Canada).

Satellite Image Processing
To develop a deeper understanding of the spatial and temporal changes in patterns caused by timber harvesting, we applied several different VIs to evaluate the amount of vegetation cover change for the period between 2016 and 2019. We categorized the VIs into two major groups, denoted as the slope-based (Figures 5 and 6) and distance-based (Figures 7 and 8) groups (Table 4). These indices were calculated by simply dividing the reflectance values of the red band by those of the NIR. Since soil effect is found to have a considerable impact in sparsely-vegetated areas [44], we measured the soil line slope (Figure 9) to create the SAVI index using algebraic equations of the NIR and R bands ( Table 4). The SAVI index was used to suppress the soil effect of the sandstone nature which was dominant in the study area.
We conducted one-way ANOVA analyses (p-value < 0.01) to test the differences in VI values in 2016 and 2019 using Matlab R2017b professional edition (Figures 6 and 8; Table 5).
The PCI geomatica software (PCI Geomatics; Markham, ON, Canada) was used to create the texture analysis (TA) for each band separately. We used 13 TA variables, including mean, variance, entropy, contrast, heterogeneity, homogeneity, angular second moment, correlation, gray-level difference vector (GLDV), angular second moment, GLDV entropy, GLDV mean, GLDV contrast, and inverse difference with kernel window size 12 × 12 for RGB and NIR. For each year we included 52 TA data layers ( Figure 10).

Modeling and Extrapolation of Harvested Parameters
We initially used a feature selection algorithm to identify the most important layers in the modeling process for each dependent variable separately (Supplementary Tables S1-S3). The most significant layers are presented in Table 6 which includes both VI and TA layers.
Using regression analysis as part of random forest (RF) algorithm in STATISTICA software V.12 (Dell Inc.; Round Rock, TX, U.S.A.), we modeled the variety of dependent variables (Table 7). For the application of the RF algorithm, we considered the following options, for random test data: 30%, subsample proportion: 50%, minimum n of cases: 5, minimum n in child node: 5, maximum n of levels: 20, maximum n of nodes: 100, and number of trees: 1000.
For evaluating the model's accuracy, root mean square error (RMSE) was calculated as below where X obs is observed values and X model is modeled values at time/place i. where X obs is mean of observed values. After the evaluation of the modeled parameters, we created a continuous map of harvested parameters ( Figure 11). To show the differences between observed and estimated parameters we used bias and bias% as below where n is the number of samples, x i is the observed value and x j is mean of estimated values.

Vegetation Indices
We created several VI maps to contribute in modeling the differences between the forest parameters pre-and post-harvest. To measure VIs, including the SAVI, MSAVI 1, and MSAVI 2, the slope of soil line, known as the L factor, was required ( Table 4). As evident in Figure 8, the slope of the soil line differed for 2016 and 2019, thus we calculated the L factor for each year separately in the form of a raster layer.
The VI values calculated for pre-and post-harvest images were significantly different between years (Figures 6 and 8; Table 5). Among all VIs, both the slope-based and distance-based, RVI and NRVI indices were the only indices that significantly increased post-harvest. Figure 10 presents the results obtained from the processing of multi-spectral bands with 13 different descriptive statistic algorithms using a kernel size of 12 × 12 pixels. Since irrelevant or partially relevant features can negatively impact model performance, a feature selection algorithm was used to ensure that all independent variables were significantly related to the dependent variables (Supplementary Tables S1-S3). Some of the most significant TA and VI layers are presented in Table 6. Due to feature selection algorithm results, all the TA layers had a significant impact on modeling the input variables. As a result, non of the created layers were excluded from the modeling process.

Modeling and Extrapolation
After the preparation of all VIs and TA layers, the RF algorithm was used to model changes in several forest parameters (Table 7).
Using each group of independent values independently and in combination, we were able to model the relation between the living trees and spectral information for the years 2016 and 2019 separately. In most cases, the combination of VIs and TA data increased the accuracy of the models, and it increased by as much as 20% when modeling the changes due to harvesting. For example, the fusion of VIs and TA data increased the accuracy of the modeling of the total harvested volumes (m 3 ) per plot by almost 20% (Table 7).
Based on the equations estimated using RF, we constructed maps of continuous changes of different forest parameters. Based on our plot-based study design, the output cell size was equal to the plot size (25 × 25 m) (Figure 11).
where is the number of samples, is the observed value and ̅ is mean of estimated values.

Vegetation Indices
We created several VI maps to contribute in modeling the differences between the forest parameters pre-and post-harvest. To measure VIs, including the SAVI, MSAVI 1, and MSAVI 2, the slope of soil line, known as the L factor, was required ( Table 4). As evident in Figure 8, the slope of the soil line differed for 2016 and 2019, thus we calculated the L factor for each year separately in the form of a raster layer.  The VI values calculated for pre-and post-harvest images were significantly different between years (Figures 6,8; Table 5). Among all VIs, both the slope-based and distance-based, RVI and NRVI indices were the only indices that significantly increased post-harvest.   The VI values calculated for pre-and post-harvest images were significantly different between years (Figures 6,8; Table 5). Among all VIs, both the slope-based and distance-based, RVI and NRVI indices were the only indices that significantly increased post-harvest.    . Bi-spectral plot of NIR reflectance versus red reflectance of all pixels between 2016 and 2019, as described in subsection 2.4. The straight red and black lines corresponds to the bare soil for this particular dataset. Both datasets were plotted together using the same x-axis and y-axis values so that the differences are clearly evident.      . Bi-spectral plot of NIR reflectance versus red reflectance of all pixels between 2016 and 2019, as described in Section 2.4. The straight red and black lines corresponds to the bare soil for this particular dataset. Both datasets were plotted together using the same x-axis and y-axis values so that the differences are clearly evident.

Modeling and Extrapolation
After the preparation of all VIs and TA layers, the RF algorithm was used to model changes in several forest parameters (Table 7).

Discussion
We demonstrated the potential of low-cost satellite imagery to detect and model structural changes in forested areas [13,51]. Based on our analyses, all VIs, with the exceptions of RVI and NRVI, exhibited decreased values after the harvest operations [51]. Forest harvesting contributed to lower absorption capabilities of red light (less trees, less chlorophyll), thus resulting in higher values in the red band, and lower reflectance values in NIR (fewer trees, fewer leaves), thus resulting in lower values in the NIR band [52]. Consequently, VI values significantly decreased; our results clearly

Discussion
We demonstrated the potential of low-cost satellite imagery to detect and model structural changes in forested areas [13,51]. Based on our analyses, all VIs, with the exceptions of RVI and NRVI, exhibited decreased values after the harvest operations [51]. Forest harvesting contributed to lower absorption capabilities of red light (less trees, less chlorophyll), thus resulting in higher values in the red band, and lower reflectance values in NIR (fewer trees, fewer leaves), thus resulting in lower values in the NIR band [52]. Consequently, VI values significantly decreased; our results clearly identified the contrasts between the red and NIR bands for vegetated pixels, with high VI values produced by the combinations of low red (due to the low absorption rates by chlorophyll) and high NIR (as a result of the decreased leaf density) reflectance. Because RVI is the inverse of the standard ratio VI, it was no surprise that RVI values increased post-harvest.
The results of the feature selection algorithm demonstrated that TA layers were significantly related to the dependent values ( Figure 10, Supplementary Tables S1-S3), as evident in Table 6 Although the fusion of VIs and TA layers exhibited the highest model accuracy in most cases, using solely TA layers provided reliable accuracy in modeling the forest parameters. This proves that the harvesting operation caused significant changes in the crown canopy texture, which led to significant differences in TA values pre-and post-harvesting. The results from TA are in accordance with the result of our previous study [23], where TA had a significant performance on spatial detection of dominant tree species (i.e., Fagus orientalis L.), that had the highest influence on canopy structure. That verifies the capability of TA in modeling variables with significant influence on canopy structure.
The RF algorithm modeled the total volume and average volume (m3) per plot for the years 2016 and 2019 with high accuracy ( Table 7). The RMSE reduction for the average volume total for 2016 and 2019 was 16.04% and 16.97%, respectively (Table 7). From all estimated parameters, the total harvested volume was the only one with the lowest accuracy (RMSE reduction 33.57%) comparing to the rest forest parameters (Table 7).
In addition, using a combination of VIs and TA values, RF could model changes in stem volume and crown density related to harvesting activities. Models were able to detect the harvested areas within plots (i.e., those that experienced selective logging) as well as outside the plots. Although it is beyond the scope of this study, it is worthwhile to note that the RF algorithm also successfully detected the clear-cut area [53] located south of the area of interest ( Figure 11). The detection ability of our method allows for the successful detection of clear-cut areas, and, more importantly, it demonstrates an ability to detect areas that have been selectively logged. Our findings are consistent with those of two other studies [52,54], where the authors tried to compare IKONOS 1-m and 4-m resolution and Sentinel-2 MSI images with Landsat 7 ETM+ images to identify the significance of higher resolution sensors compared to lower or medium spatial resolution sensors to adequately identify areas of selective logging. Our study revealed the potential of our method to monitor larger forested areas with high reliability because it allows for the extrapolation of the results (local observations) to larger areas.

Conclusions
Our main objective was to determine if multi-spectral, VHR, time-series satellite images could be used to successfully detect and estimate the amount of harvested and living trees, and also create a continuous map of the harvested areas. In addition, we aimed to define which composition of independent data could be used to increase mapping accuracy. Our results demonstrated how low-cost satellite images can be successfully used for the detection of forest structural changes (i.e., volume, canopy density, stand density, etc.). Consequently, this method can be used to monitor volume dynamics caused by anthropogenic forest disturbances. Our findings also showed that estimated variables can be applied as an input for remote sensing-based supervised extrapolation over large areas, without the need for terrestrial interventions. For example, non-accessible areas such as national parks or private forest landowners could benefit by using this method for monitoring purposes. Because the proposed method is also based on the spectral and structural changes that mainly occur in forest canopies, it can also be used for the assessment of abiotic stress factors, such as windthrow and post-fire monitoring. Thus, when it comes to cost-effective monitoring of forest ecosystems over larger areas, the proposed method is considered to be a practical, data sufficient, and low-cost solution for the structural detection and estimation of the amount of harvested areas.
Author Contributions: A.A. conceived the idea for this manuscript; D.P. and A.A. designed and wrote the manuscript; A.A. processed the remote sensing and terrestrial data. D.P. and A.A. designed and performed the statistical analyses; L.B. provided all the terrestrial data and helped in finalizing the manuscript.
Funding: We acknowledge that this work was financially supported by the following projects: (1) "Advanced research supporting the forestry and wood-processing sector's adaptation to global change and the 4th industrial