Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam

Cat Tuong, Truong Thi; Tani, Hiroshi; Wang, Xiufeng; Quang Thang, Nguyen

doi:10.3390/f10080673

Open AccessArticle

Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam

by

Truong Thi Cat Tuong

^1,2

,

Hiroshi Tani

^3,*

,

Xiufeng Wang

³ and

Nguyen Quang Thang

⁴

¹

Mientrung Institute for Scientific Research, Thua Thien Hue Province 530000, Vietnam

²

Graduate School of Agriculture, Hokkaido University, Sapporo 060-8589, Japan

³

Research Faculty of Agriculture, Hokkaido University, Sapporo 060-8589, Japan

⁴

Central Sub Forest Inventory and Planning Institute, Thua Thien Hue Province 530000, Vietnam

^*

Author to whom correspondence should be addressed.

Forests 2019, 10(8), 673; https://doi.org/10.3390/f10080673

Submission received: 3 July 2019 / Revised: 7 August 2019 / Accepted: 7 August 2019 / Published: 9 August 2019

(This article belongs to the Special Issue Impact of Land Use Change on Forest Biodiversity)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Research Highlights: In this study, we classified natural forest into four forest types using time-series multi-source remotely sensed data through a proposed semi-supervised model developed and validated for mapping forest types and assessing forest transition in Vietnam. Background and Objectives: Data on current forest state and changes detection are always essential for forest management and planning. There is, therefore, a need for improved tools to classify and evaluate forest dynamics more accurately and effectively. Our objective is to develop such tools using a semi-supervised model and landscape metrics to classify and map changes in natural forest types by using multi-source remotely sensed data. Materials and Methods: A combination of Landsat data with PALSAR and PALSAR-2 was used for forest classification through the proposed semi-supervised model. This model turned a kernel least square into a self-learning algorithm, trained by a small number of samples with given labels, and then used this classifier to assign labels to the unlabeled data. The overall accuracy, kappa, user’s accuracy, and producer’s accuracy were used to evaluate the classification accuracy by comparing the classified image with the results of ground truth interpretation. Based on the classified images, forest transition was evaluated using certain landscape metrics at the class and landscape levels. Results: The multi-source data approach achieved improved discrimination of forest types compared to only using single data (optical or radar data). Good classification accuracies were obtained, with kappas of 0.81, 0.76, and 0.74 for the years 2007, 2010, and 2016, respectively. The analysis of landscape metrics indicated that there were different behaviors in the four forest types, as well as provided much information about the trends in spatial pattern changes. Conclusions: This study highlights the utilization of a semi-supervised model in forest classification, and the analysis of forest transition using landscape metrics. However, future research should include a comparison of different models to estimate the improvement of the proposed model. Another important study that should be conducted is to test the proposed method on larger areas.

Keywords:

forest types classification; forest transition; semi-supervised model; landscape metrics; Landsat data; synthetic aperture radar

1. Introduction

Since the early 1990s, the tropical forest in several countries has been undergoing a transition period from degradation to reforestation [1,2,3]. Forest transition is considered from the perspective of forest area changes and the conversion from other land use/land cover types to forest. With the rapid development of remote sensing technology and the wide application of landscape ecology, they supply effective tools to analyze spatial-temporal changes and related ecological processes. Improved understanding of forest transition provides many benefits, such as global carbon balance or land use and forest policy implementation [4,5]. Therefore, there is a need to further develop new methods for forest type classification and forest transition assessment.

Recently, remote sensing combined with the conventional method to supply validation data has been extensively used in forest inventory. The advantages of the remote sensing technique are cost- and labor-saving as well as swift observation of large scale forest changes over the long term. However, the classification accuracy associated with using remote sensing is affected by many factors, such as the classification techniques, training samples, and the signal reflected from objects.

A natural forest [6] is a naturally regenerated forest comprising native species, where there are no clear or clearly visible indications of human activities and the ecological processes are not significantly disturbed. In this study, we classified natural forests based on the timber reserve of standing trees into four main types: rich, medium, poor, and restoration forest. Although these four types differ in species composition and timber reserves, we found that with only a single source of data (optical or radar data) it is often difficult to discriminate between different kinds of natural forest types because of the very similar information on canopy and forest structure captured by remotely sensed data [7]. This highlights the need for multi-source remote sensing data to extract more information of interest regarding the objects for classification. By using multi-source data, the classification accuracy is improved compared to single data source. This has been shown, for example, with a combination of optical data and synthetic aperture radar (SAR, Congo Basin and Malawi city, Mzimba) [8,9]. The fusion of different frequencies (L– and P–band) of SAR products has also received much attention in recent years [10,11,12].

Another challenge is that sampling is restricted because of the complexity of ecosystems and inaccessible regions [13]. In this study, we used semi-supervised classification to overcome the paucity of ground truth samples. Semi-supervised classification focuses on enhancing supervised classification by minimizing errors in the labeled examples, but it must also be compatible with the input distribution of unlabeled instances [14]. While supervision often provides higher classification accuracy, it requires a good dataset to ensure both the quantity and quality of training samples collected from the field survey. The constraint of field data collection is that it is not always achievable, owing to limitations in finance, terrain, or availability of the data source. To avoid this issue, semi-supervised classification aims at solving the limited number of labeled samples and taking advantage of the abundant unlabeled samples. Many semi-supervised classification algorithms such as expectation-maximization, co-training, and self-training have been developed. The graph-based method has also attracted an increasing amount of interest [15,16,17,18]. This method works by summarizing base model outputs in a group-object bipartite graph and maximizing the consensus by promoting smoothness of label assignment over the graph and consistency with the initial labeling. Recently, machine learning has received much attention and has been applied to the semi-supervised learning problem. This technology has been successfully developed for binary classification, such as in [19], where a Laplacian Twin Support Vector Machine was used for semi-supervised classification that can exploit the geometry information of the marginal distribution embedded in unlabeled data to construct a more reasonable classifier-semi-supervised classification with graph convolutional networks [20] which scales linearly in the number of graph edges and learns hidden layer representations that encode both the local graph structure and the features of nodes.

For land use/land cover, semi-supervised classification has been successfully adopted in the literature. For instance, in [21], semi-supervised logistic regression was applied. This is a specific instance of the generalized maximum entropy that finds a probability distribution that minimizes a divergence based on the entropy of the weights of classifiers. In [22], a semi-supervised clustering was presented that is simultaneously optimized using a modern multi-objective optimization technique based on the concepts of simulated annealing. In [23], the weight support vector machine was used to keep the training effort low with a manually-collected set of pixels of the class of interest and a random sample of pixels. In [24], extended label propagation and rolling guidance filtering that uses superpixel propagation were applied to assign the same labels to all pixels within the superpixels that are generated by the image segmentation method.

In this paper, we present a self-learning approach for forest classification that can propagate labels from labeled samples to unlabeled data to build a large volume of training data. This model does not make any specific assumptions for the input data, but it does accept that its own predictions tend to be correct [14]. Self-learning, also known as Yarowsky’s algorithm, is a rule-based semi-supervised classification. The term “self-learning” is used because the algorithm uses its own prediction to teach itself. Self-learning is very popular, with an initial classifier trained by a small number of training data with given labels, before using this classifier to assign labels to the unlabeled sample. For each unlabeled sample, confidence values are extracted from the probabilistic of learning models [14,25]. The samples that have been labeled with the most confident prediction are then selected to combine with the training data and create a new training set. The classifier is then retrained on that new training set and the procedure repeated. Self-learning has been applied in several text processing tasks in the last few years. Recently, it has been applied with some developed supervisor classifiers to image classification [23,26]. This study developed self-learning with a kernel least square classifier for forest types classification. Least squares is a standard approach of statistical analysis and has been well-known for a long time. It was developed by applying kernel functions in high dimensional feature space to solve the problem of a large number of parameters [27]. Kernel functions are an algorithm with the advantage of being able to flexibly transform an originally non-linear vector into a linear version in feature space. Therefore, they are widely applied in solving classification problems involving multiple features [28,29,30].

In this study, we also used time-series remotely sensed data for the evaluation of forest changes by landscape ecology. Landscape ecology can be generally defined as the science and art of studying and improving the relationship between spatial patterns and ecological processes on a multitude of scales and organizational levels [31]. One fundamental aspect has been its explicit attention to the spatial dimension of ecological processes [32]. Landscape metrics are one of the classical landscape ecological tools for measurement, analysis, and interpretation of spatial patterns [33]. The contribution of remote sensing to landscape planning and conservation is mainly in the inventory and determination of objects of interest and in monitoring changes by time-series satellite data [34]. A basic concern in forest management is spatial processes over time, such as deforestation, degradation, or restoration. The analysis of landscape structure is a classic approach for the understanding of spatial processes using various landscape metrics [32,35,36,37]. Several studies provide evidence of the value of remote sensing and landscape metrics for forest management [38,39,40,41,42].

In summary, there are two main objectives in this study. The first objective is to assess the potential of a semi-supervised model to classify natural forest types by using multi-source remote sensing data. The second objective is to assess the process of forest transition from the perspective of landscape ecology by using multi-temporal data.

2. Study Area

In Vietnam, the forest plays an important role in the socio-economic system in the mountainous province, where local people have a low income and agroforestry-based livelihoods. Although centralization of forest resource management began in Vietnam very early in the 1950s [4], the natural forest experienced a rapid decrease over the long term [43], causing negative impacts to the environment, such as loss of carbon stock, biodiversity degradation, and habitat fragmentation [44]. Since 2005, however, Vietnam has been experiencing a positive period in the application of forestry policies [45], which is contributing to development of the forested area. This dramatic forest transition has resulted in changes to the biophysical, ecological process, as well as to the spatial landscape. However, there is a lack of up-to-date information on forest changes in Vietnam in the period from 2005 to the present, particularly in central Vietnam where the socio-economic dynamics have recently been increasing. To create a reliable forest management strategy, an improved understanding of forest changes is essential. This can be achieved by spatial analysis through multi-temporal remote sensing images processing, combined with landscape metrics assessment.

Thua Thien Hue province, located in central Vietnam (Figure 1d), has a surface area of 5054 km² and the natural forest area accounts for approximately 40% of the total area. According to the General Statistics Office (GSO) in Vietnam, the natural forest in this study area slightly decreased from 203,800 ha in 2008 to 202,700 ha in 2010, with the principal causes of deforestation comprising the conversion from forest to other land uses (e.g., hydropower, roads, cultivation) and illegal exploitation of forest products. Conversely, from 2010, there was a significant extension of natural forest with the area reaching 212,200 ha in 2016. These fluctuations have not only caused changes in the area, but also in the forest landscape structure.

We classified the natural forest into four types based on the specific condition of the study site as well as circular number 34/2009/TT-BNNPTNT of June 10, 2009 [46] published by Vietnam Ministry of Agriculture and Rural Development, on the criteria for forest identification and classification in Vietnam:

Rich forests are forests with a timber reserve of standing trees of between 201 and 300 m³/hectare;
Average forests (or medium forests) are forests with a timber reserve of standing trees of between 101 and 200 m³/hectare;
Poor forests are forests with a reserve of standing trees of between 10 and 100 m³/hectare;
Forests with no reserve (“Restoration forest” in the case of our study site) are forests with a timber tree average diameter of less than 8 cm and a timber reserve of standing trees of less than 10 m³/hectare.

3. Data and Methods

3.1. Data

We used time-series SAR data and Landsat data acquired in 2007, 2010, and 2016 (Figure 1a,b,c). Two scenes of SAR data were collected per year, which were then used to create a mosaic covering 77% of the study area. The SAR data differed in term of acquisition mode, which led to a difference in the incidence angle and the size of the range and azimuth. Therefore, preprocessing was necessary to synchronize these data. Two polarization HH (horizontal transmitting, horizontal receiving) and HV (horizontal transmitting, vertical receiving) were used to process the data in this study. Landsat data were also selected to combine with SAR data for forest type classification. Landsat data were provided by the United States Geological Survey (USGS) with moderate resolution and wide spectral coverage. The swath width of Landsat is 185 km; therefore, it could cover the full study area. The characteristics of these data are described in Table 1.

A ground sample was also collected to support training data and accuracy assessment. These data were provided by the Central Sub Forest Inventory and Planning Institute, Thua Thien Hue province, Vietnam (Sub-FIPI). The data collection was evenly distributed over the entire study area at three time periods—In 2007, 2010, and 2016. The samples were then divided into 80% training data and 20% validation data. In 2007, 13 measured plots were covered by the PALSAR scene, with each plot measuring 1 km² (1000 × 1000 m), while in 2010, there were 10 such plots. In each plot, 40 subplots of 25 × 20 m were set to measure forest parameters and describe characteristics. However, not all 40 subplots were measured and selected for classification; only some met the conditions of being natural forests with reserves, not separated by other obstacles such as rivers, streams and roads, and terrain. In 2007, 170 subplots were selected for this study, while in 2010, 115 subplots were selected. In 2016, 106 plots were covered by PALSAR-2 data. Each rectangular plot measured 30 × 33 m with the longer aspect running in an east-west direction and the shorter aspect running north-south. The distribution of samples for the four forest types is described in Table 2.

Apart from these samples, a larger amount of unlabeled data was supplied for forest types classification. A total of 200 unlabeled samples was randomly created over the study area. The proportion of unlabeled samples accounted for approximately 40–60% of the total samples to ensure the accuracy of the classification results. In particular, the number of unlabeled samples was equivalent to 55% for 2007, 64% for 2010, and 65% for 2016.

3.2. Methods

A flowchart of the methodology employed in this study is presented in Figure 2.

3.2.1. Preprocessing

Landsat digital numbers (DNs) were converted to reflectance and atmospheric correction using the fast line-of-sight atmospheric analysis of hypercubes (FLAASH) tool. The enhanced vegetation index (EVI) was then calculated using band near-infrared (0.7–1.1μm), red (0.6–0.7 μm), and blue (0.45–0.52 μm) in accordance with the work of Liu and Huete (1997) [47]:

E V I = G \times \frac{ρ_{n i r} - ρ_{r e d}}{ρ_{n i r} + (C_{1} \times ρ_{r e d} - C_{2} \times ρ_{b l u e}) + L}

(1)

where L is a soil adjustment factor, and C₁ and C₂ are coefficients used to correct aerosol scattering in the red band by using the blue band. In general, G = 2.5, C₁ = 6.0, C₂ = 7.5, and L = 1.

In this study, when observing the relationship between reflectance value and the cosine of the solar incidence angle, there was a low correlation coefficient with the value of 0.0075 and 0.0197 for TM and OLI data, respectively. This means that the terrain does not significantly affect this test site. Therefore, topography correction is unnecessary in this case.

For radar data, dual-polarized images (HH, HV polarizations) were created in the single-look complex (SLC) format. The preprocessing data were operated to convert the digital number value into sigma naught (σ^o) values using the following equation:

σ^o = 10.log₁₀(I² + Q²) + CF − A

(2)

where I and Q are the real and imaginary parts of the SLC product. A is a conversion factor equal to 32.0. The calibration factor CF is -83.

A refined Lee filter was used with a window size of 7 × 7 to reduce the speckle noise. The topography effect was eliminated using range—Doppler terrain correction with digital elevation model (DEM) from the Shuttle Radar Topography Mission, and all of the product images were resampled to reach 15 m in pixel spacing.

The preprocessed SAR data were next transformed into covariance matrix elements, and then eigenvalue and eigenvector polarimetric parameters. The cross-pol ratio of HH and HV was also calculated and used as a variable for the classification model. In addition, SAR data and Landsat data were fused and resampled to 15 m. The parameters set for polarimetric SAR (PolSAR) and Landsat data comprise the input features for classification, as detailed in the next section.

To illustrate the polarimetric data, we adopted eigen decomposition of the 2 × 2 covariance matrix for dual polarization data as defined by [48]:

[\begin{matrix} C_{H H, H H} & C_{H H, H V} \\ C_{H V, H H} & C_{H V, H V} \end{matrix}]

H/A/Alpha decomposition was used to decompose the backscatter value into three components: entropy, anisotropy, and alpha (H/A/α). The H/A/α is a polarimetric parameters decomposition based on eigenvalue and eigenvector that was introduced by Cloude and Pottier [49]. In this technique, backscattering is decomposed into entropy (H), anisotropy (A), and alpha angle (α). Entropy is a parameter describing randomness in target scattering, which is defined as:

H = - ({\bar{λ}}_{1} l n {\bar{λ}}_{1} + {\bar{λ}}_{2} l n {\bar{λ}}_{2}) / l n 2 with {\bar{λ}}_{i} = λ_{i} / (λ_{1} + λ_{2})

(3)

where H_T is target entropy and λ_i (i = 1 to 2) are eigenvalues.

Entropy values vary from 0 for a single scattering mechanism to 1 for pure noise and random targets.

Mean alpha angle is defined as:

α = {\bar{λ}}_{1} α_{1} + {\bar{λ}}_{2} α_{2}

(4)

The alpha angle varies between 0° for trihedral scattering from a planar surface to 90° for dihedral scattering from a metallic surface. Another element is anisotropy (A), which is a parameter complementary to entropy, which can be employed as a source of discrimination only when H >0.7 owing to the high effect of noise [50].

3.2.2. Masking Undesirable Areas

In this study, we created a mask to remove undesirable areas before classifying natural forest types. The classification method of the random forest algorithm was applied based on entropy, alpha, and anisotropy parameters extracted from dual polarization data for images in 2010 and 2016. For the image in 2007, the polarization data of HH, HV, and EVI from Landsat data were used for classification. For other land use/land cover types, samples such as rivers, urban areas, and agricultural land were collected through visual interpretation based on discrimination in color, geometric shapes, and brightness. For the natural forest, 170 samples were collected for 2007 with a plot area of 25 × 40 m, 115 samples for 2010, and 106 samples for 2016 with the same area of 30 × 33 m. Polarimetric data were derived from the image for each sample with a window size of 2 × 2 pixels, with a pixel size of 15 m. The classification results create natural forest maps for the study area.

Furthermore, in this study area, because the natural forest is mainly distributed on topography at an elevation above 200 m, a digital elevation map (DEM) was applied to mask out low-altitude forest areas while retaining forests with elevations above 200 m. This DEM map was downloaded from NASA Shuttle Radar Topography Mission data. The masked forest images were then used for the forest types classification.

3.2.3. Self-Learning with the Kernel Least Squares (SL-KLS) Classifier for Forest Types Classification

Kernel Least Squares (KLS)

In this study, the presence of a large number of parameters in the classification problem created computational difficulties due to a high number of dimensions. To solve this problem, we used the KLS technique in the R environment with RSSL package version 0.7. Here, KLS is described as a method using least squares regression as a classification technique with numeric encoding of classes as targets. A detailed description of KLS can be found in various studies [27,51], with the optimal parameter vector identified by ϴ =

{[b α_{1} α_{2} \dots α_{n}]}^{T}

. The minimized vector has the form

L (θ) = ‖ Y - P θ ‖^{2}

,

with Y = {[y_{1} y_{2} \dots y_{n}]}^{T} and P = [\begin{matrix} 1 & k (x_{1}, x_{2}) \dots & k (x_{1}, x_{n}) \\ ⋮ & ⋮ ⋮ & ⋮ \\ 1 & k (x_{n}, x_{1}) \dots & k (x_{n}, x_{n}) \end{matrix}]

A radial basis function was used with the form below:

k (x_{i}, x_{j}) = \exp (- \frac{‖ x_{i} - x_{j} ‖^{2}}{σ^{2}})

(5)

where x_i: are training data, x_j is a feature vector, and σ is a free parameter. Kernel k has a value in the range of 0 to 1. With α_i as real numbers, the prediction function f(x) can be written as follows:

f (x) = \sum_{i = 1}^{n} α_{i} k (x_{i}, x) + b

(6)

Self-learning with the Kernel Least Squares (SL-KLS) Classifier

In this study, a self-learning algorithm was used to turn the KLS classifier into a semi-supervised model to solve the problem of the small amount of labeled data. Based on the training data, KLS was applied to assign labels to unlabeled objects, which were then added to the set of labeled objects for classification. There is a given set of labeled data (L) and a set of unlabeled data (U) (Figure 3). By applying a KLS classifier, k number of labels are assigned to unlabeled data. The result of predicted data U then joins with L to create a new training set for classifying the entire segmented images. In this study, we classified the forest into four classes: rich forest, medium forest, poor forest, and restoration forest. The features of the four classes were extracted from Landsat bands reflectance, EVI, HH, HV signals, covariance elements, and H/A/Alpha decomposition.

The indicator of overall accuracy (OA), kappa, user’s accuracy, and producer’s accuracy were used to evaluate classification accuracy by comparing the classified image with the results of ground truth interpretation. The overall accuracy comprises the ratio of the sum of accuracy in an individual class and the number of observed samples, with 100% as the perfect classification. Kappa, user’s, and producer’s accuracy were proposed by Congalton and have been used widely to date. The function of these indicators is clearly described in [52].

3.2.4. Forest Pattern Analysis Using Landscape Metrics

Extraction of Landscape Metrics

After the classification step, the forest was divided into four forest types: rich, medium, poor, and restoration forest in the years 2007, 2010, and 2016. For each year, the classified images were then clipped into 14 non-overlapping sub-landscapes of 2000 × 2000 m. This size was selected to ensure the representativeness of the sample and to reduce computation time. To conduct spatial analysis of the forest pattern, landscape metrics were computed at two levels, class and landscape, for all samples in each year. We calculated 56 metrics at the class level and 63 metrics (Appendix A) at the landscape level for each sub-landscape image using Fragstat version 4.2.1. With a large number of landscape metrics, we then selected the appropriate metrics for analysis of the natural forest process for the study area longitudinally.

Selection of a Set of Landscape Metrics

Principal components analysis was used to identify components and cluster them into various groups. In these groups, the three indices of universality, consistency, and strength were then calculated to select the group of metrics. This operation was conducted using PROC FACTOR in SAS.

Based on the assessment of metrics through the three indices of universality, consistency, and strength, we created a list of selected metrics at the class and landscape levels. At the class level, 11 clusters were created from 56 metrics. Through cluster analysis, two clusters (approximately 16 metrics) were selected with a high level of these three indices at a total percentage >90%, variation explained >7%, and the average in-group correlation >0.8 (Appendix B). Similarly, two clusters (approximately 20 metrics) were selected through analysis of the 10 clusters created from 63 metrics for the landscape level (Table 3). The other two metrics—total area (CA in hectares_ha) and percentage of landscape (PLAND_%)—were also added for change analysis of the area in general.

Analysis of Forest Pattern Change

From the set of representative metrics in the study area, we selected various metrics that support the analysis of spatial processes over time, containing aggregation, compactness, and fragmentation. To evaluate the spatial structure change of forest types in the period 2007–2016, we selected 11 metrics for class level and five metrics for landscape level.

The aggregation is expressed by increasing the size of patches from the combination of small fragments. Therefore, this indicator relates to the recovery of forests from the previously deforested area. The metrics are related to aggregation including the aggregation index (AI), proportion of like adjacencies (PLADJ), and clumpiness index (CLUMPY) for the class level, and Interspersion/juxtaposition index (IJI) for the landscape level. Another term that is strongly involved in the aggregation is forest connectivity, which evidently increases the patch cohesion index (COHESION) that is related to the physical connectedness of the corresponding patch type.

Forest fragmentation is an opposite process to aggregation and occurs when a large contiguous forest is broken down into many small fragments, leading to loss of biodiversity and animal habitat and degradation of forest health and its economic and environmental functions. This process is closely related to the shrinkage ratio of area-weighted mean patch size (AREA_AM) and effective mesh size (MESH) of the landscape over time.

Compaction involves the formation of rounded patches in a circular shape that makes them more compact [53]. The more closely a patch shape is to a circle, the more it exhibits compaction. While a natural forest has a complex and irregular shape, basic geometry patch shapes show unnatural objects. Therefore, analysis of forest compaction enables us to assess disturbance in the forest using various shape metrics such as the shape index (SHAPE_MN, _AM) and circumscribing circle (CIRCLE_MN, _AM) at the class level. At the landscape level, area-weighted radius of gyration (GYRATE, _AM) is used to analyze compaction. Furthermore, GYRATE_AM also provides the overall characterization of the level of connectivity or subdivision of the landscape [53].

4. Results

4.1. Forest Type Classification

For the result of masking undesirable areas, we compared the predicted products with the reference data and evaluated them based on the index of overall accuracy (OA) for each year. The results obtained high accuracies for the images in 2007, 2020 and 2016 with an OA of over 0.87. The 2010 predicted image was the best with an OA of 0.99, followed by 2016 with 0.92 and 2007 with 0.87.

The behavior survey of only Landsat or only PolSAR data on forest objects does not show observable discrimination (Appendix C). For radar images, polarimetric parameters are not used to classify forest objects due to the saturation of entropy throughout the study area. Alpha and anisotropy display slight fluctuations on different forest objects, but they do not create good results in the discrimination. Nor does relying on the polarization of HH and HV signals provide better results. Therefore, with efforts to improve accuracy in forest classification, we have used combined data from optical and SAR data to extract information for forest types classification.

Another difficulty encountered in the classification process was the limited number of samples collected from the field, particularly in 2016 when only 106 samples were collected for the four forest types over the entire study area. The small number of samples was inadequate to develop a reliable classification algorithm based on the supervised method. To solve this problem, we used the semi-supervised classification with the addition of information from an unlabeled data source. However, to ensure the accuracy of the classification results, it was necessary to select an appropriate ratio between the number of labelled and unlabeled samples. The higher the percentage of unlabeled samples, the lower the accuracy [16]. To balance the number of unlabeled samples required and the classification accuracy, 200 random samples were created in the study area to ensure the ratio was approximately 60% for each year.

Overall, the classification accuracies were relatively high for 2007 with a kappa of 0.81 and OA of 0.86, respectively (Figure 4), while they were adequate for 2010 and 2016 with kappas of 0.76 and 0.74, respectively. The accuracies are generally the best for the rich forest over the entire time, with a user’s accuracy of 100% in the years 2007 and 2010, and of 85.71% in 2016. This is followed by medium forest with over 75% in both user’s and producer’s accuracies, although sometimes it was misclassified as rich or poor forest. On the other hand, the classification accuracies were the lowest in 2010 for poor forest and in 2016 for restoration forest. The confusion matrix in 2010 and 2016 reveals a significant confusion between the poor and restoration forests, and they therefore cause the values of OA and kappa to be reduced at these times (Appendix D).

4.2. Forest Pattern Analysis at the Class Level

Based on the metrics of class area (CA) and percentage of landscape (PLAND_%), the natural forest of the study displayed a significant fluctuation within the nine years from 2007 to 2016 (Figure 5).

In the period 2007–2010, CA decreased quickly with an average loss of 1713 ha per year. However, in the period 2010–2016, signs of recovery in CA appeared with an average gain of 144 ha each year. From 2007 to 2016, rich, medium, and restoration forests mainly demonstrated an increase, as shown by the gain of PLAND 1–8%, while PLAND showed reductions for poor forest of up to –15%. Furthermore, to assess the spatial variation of forest patterns, a set of parameters, comprising 11 metrics at the class level and 15 metrics at the landscape level, was selected based on evaluation of the indicators for universality, strength, and consistency. The selection method was based on factor analysis, clustering, and evaluation for the four different forest types at three different time points. Therefore, this set of metrics ensures the appropriateness and representativeness of forest structure analysis over time for this test site. The changes in each forest type, based on analyzing landscape metrics from 2007 to 2016, are shown in Table 4.

In general, from 2007 to 2016, the forest types exhibited a relatively stable pattern with no significant changes in the group metrics of aggregation (AI, CLUMPY, PLADJ) but their pattern did show significant changes in patch shape structure (SHAPE, CIRCLE, CONTIG). In particular, rich, medium, and restoration forests had a low level of aggregation with the change percentage of AI and PLADJ ranging from just +1 to +4%. Conversely, the poor forest demonstrated an increased dispersion (AI −7%). However, this period was expressed by the moderate changes in shape with more compactness (SHAPE −4% to −12%) and contiguity (CONTIG_AM up to 4% excluding the poor forest). The poor forest demonstrated the largest variation and had a trend of disaggregation (nLSI 60%) due to decreasing total area and percentage of the landscape. In summary, when evaluating the subperiods between 2007–2010 and 2010–2016, the forest types reflect an extreme fluctuation and totally different behavior.

4.2.1. Period 2007–2010

This period expressed a growth in the percentage of landscape occupied by medium and restoration forests, as well as a decline in rich and poor forests. Therefore, they exhibit completely different processes in spatial fluctuations (Table 4).

Rich forest displayed a moderate decrease (PLAND −4%) and strong disaggregation in this period. The disaggregation is reflected in a decrease in AI −15%, CLUMPY −12%, and PLADJ −15%, and an increase in nLSI (+67%). The patterns show more compactness and less physical connectivity, as a result of reducing complexity in geometric shape (SHAPE_MN −20% and CIRCLE_AM −14%), decreasing contiguity, and continuity (CONTIG_MN –51% and COHESION −12%). The related circumscribing circle coefficient of variation (CIRCLE_CV) with a high value indicates the various changes in patch shapes for rich forest.

Similar to rich forest, poor forest exhibited slightly increased dispersion corresponding to a decrease in clumpiness and aggregation (–7% for both the change of CLUMPY and AI). This is due to shrinkage in the percentage of landscape (PLAND −19%) and the disappearance of like adjacencies in the same patch type (PLADJ −6%). It also coincides with the tendency to increase compactness (SHAPE_MN −9%).

The medium and restoration forests had growth in terms of area (PLAND 3% and 20%, respectively) and demonstrated a different process than rich and poor forest. The patterns display a moderate aggregation, higher connectivity, and compactness. In addition, the growth in area, together with the drop in contiguity index (CONTIG_MN -3% and −24% for the medium and restoration, respectively), reflect the process of creating larger patches from the clumpiness of small adjacencies and the distribution scattered in the landscape.

4.2.2. Period 2010–2016

In this period, rich forest performed more aggregation than other forest types. The appearance of new patches (PLADJ +16%) resulted in an increase in spatial connectedness (CONTIG_MN +58%) and improved the continuity of this class in the landscape (COHESION +12%). This also meant a gain in the aggregation process (AI +16%, CLUMPY +11%, and nLSI –38%). The growth in PLAND coincided with a higher area-weighted mean contiguity of each patch (CONTIG_AM +29%), indicating the appearance of larger patch sizes.

Medium and poor forest demonstrated less area variation than in the previous period with a slight increase. However, there was a negligible decrease in tendency of aggregation (AI –1 to –2%), continuity (COHESION –6 to –8%), and connectedness (CONTIG_AM –2% to –5%) for both types. In poor forest, there was a different tendency of the mean index and area-weighted mean index in CIRCLE and CONTIG due to measuring the patch-centric and landscape-centric perspectives. The increase in the related circumscribing circle shows a trend of elongation based on evaluating entire patches (CIRCLE _MN +18%), but displays the opposite trend based on evaluating an arbitrary patch selected randomly from the landscape (CIRCLE_AM –6%).

Similarly, CONTIG_MN demonstrated a significant increase (+24%) and performed a higher level of spatial connectedness in poor forest. However, the drop in CONTIG_AM (–2%) together with the expansion of area partly revealed the presence of new small patches.

An extreme decline in the restoration area was recorded during this period, resulting in increasing dispersion (nLSI +33%), a higher level of complexity in shape structure (SHAPE_MN +3% and CIRCLE_MN +9%), and less contiguity (CONTIG_AM –7%).

4.3. Forest Pattern Analysis in Landscape Level

This period was marked by a rapid decrease in the total landscape area of natural forest, from 202,300 ha in 2007 to 197,200 ha in 2010, followed by a slight increase to 198,100 ha in 2016. This caused a reduction in the percentage of the landscape and a sharp decline in patch size distribution (AREA_AM –60%) (Table 5). In addition, there was a decrease in symmetry in the patch distribution in the landscape (IJI –8%). The moderate decline in SHAPE (–20%) and GYRATE (–26%) demonstrated more compactness and less complexity in spatial patterns. The continuity and connectedness of the forest pattern also tended to decrease (CONTIG_MN –21%)). In general, the natural forest experiences increased fragmentation over the entire landscape, which involved an increase in landscape area with shrinkage of patch size and disproportionate distribution of patches.

5. Discussion

To assess the trend of natural forest changes in the study area, we compared the results with those in global and tropical regions, as well as in Vietnam overall, in the same period. Keenan et al. [3] reviewed the dynamics of global forest area between 1990 and 2015 based on statistics from the FAO global forest resources assessment 2015. Worldwide, the natural forest area declined by 2% between 2005 and 2015, with the vast majority of the losses occurring in the tropics where the rate of loss fell by 7.2 million ha.y⁻¹. Compared to the trend of forest transition worldwide and in Vietnam overall, the status of forest loss in the study area is similar in the period from 2007–2010. This status is confirmed by the findings of Quy Van Khuc et al. [54] that degradation mainly occurred in natural forest at the rate of 3–4%. Cochard’s [55] review of studies also demonstrated a slow increase in natural forest in the period 2000–2013 in Thua Thien Hue province. From 2010–2016, the natural forest in the study area demonstrated the opposite trend. While there was a significant decrease in the natural forest worldwide and in the tropics generally, growth occurred in Vietnam and in the study area. Due to the shortage of previous studies, it was only possible to compare the general trend of natural forests. It is difficult to compare fluctuations in forest types of rich, medium, poor, and restoration types because there are few documented records for the study area in particular, and Vietnam in general, particularly in the period 2010–2016. Therefore, the findings of this study contribute to the understanding of the transition of natural forest types in recent years, particularly in the ecological processes in terms of spatial patterns that have still not received adequate attention.

Analysis of the reflectance behavior on some bands on Landsat and backscatter on SAR images (Figure A1) demonstrates on histograms the overlap of all four forest types. In image data from 2007 and 2010, rich forest exhibited better distinctions than other forest types. Histogram analysis of forest types in 2016 shows little separation, so its accuracy was lower than that of 2007 and 2010. This low separation is due to the characteristic of natural forest, with its combination of various canopy stories and species diversity. Sparser wood trees have more vines, which cover the whole canopy. Therefore, it is difficult to detect the difference between forest types based on optical images. Despite having a long wavelength that can supposedly penetrate the canopy and reach the trunk, L-band signals still demonstrate a low difference between polarization signals. In this study, to enhance the differences between classes, a multivariate model was essential to observe objects under multidimensional space and provide more information and attributes for objects.

The change of certain forest types between any two periods comprises the net effect [3] of conversion from any one forest type to another or non-forested area and natural regeneration or restoration. A conversion matrix was used to clearly illustrate transition areas between forest types in this study area in the period 2007–2016 (Table 6). In this table, the cross cells demonstrated no change values in terms of percentage of forest types’ area. The rows demonstrate an increase in the proportion converted from other types. The columns demonstrate a decrease in the proportion converted into other types. The net area change is the total effect of increase and decrease in the area of specific forest types.

From the conversion matrix of forest types between 2007 and 2016 in this study area, we considered three main findings. First, the net trend of natural forest comprised a small loss of area, but this was due to two opposite trends of an area increase in one place and a decrease elsewhere. Second, the levels of forest restoration and deforestation were nearly equal (total of 145,873 ha and 150,143 ha, respectively) and occurred simultaneously in the study area during this period. Third, there was a strong internal transition between forest types and an external transition between them and other land use/land cover types. Medium forest had the highest gain area, followed by restoration forest, at 14,795 ha and 11,852 ha, respectively. Poor forest showed a sharp loss, while rich forest had an adequate increase. When considering the percentage of conversion area, the most dramatic transformation was in rich forest, which changed to medium forest at a rate of 36%, but was compensated by medium (16%) and poor forests (13%). However, when considering the changing area, medium and poor forest had areas of both high increase and decrease. Changes from natural forest to other types were the strongest in restoration forest at 31% of its area. Restoration forest is the most vulnerable forest type because it is often distributed in areas that are easily accessible and affected by human and agricultural activities.

The spatial changes in natural forest types presented in Figure 6 showed two change directions: increase and decrease. The area loss of forest types occurred throughout the study area, but it was mainly distributed near water bodies such as rivers and streams. The local population distribution is often concentrated in the downstream of rivers where conditions for agriculture are developing. Therefore, natural forests near rivers are easily deforested and degraded due to human activity. In the other direction, the expansion of rich forest created larger fragments and scattered distribution in the study area, resulting in increasing compactness, less connectivity, and higher isolation. The expansion in other types occurred more evenly and therefore with greater connectivity.

During this period, there were many factors affecting forest dynamics. The policies of prohibiting logging in natural forests and enhancing forest protection and restoration are considered to be the correct policies in terms of reducing natural forest degradation, which was implemented by the Vietnamese government since the early 1990s [56]. However, illegal logging still occurred [57] due to the increasing demand for wood from population pressure, which is the main reason for the continued decline of natural forests in the period 2005–2010. There were also many other causes, such as poverty, forest resources, population density, agricultural production, and province-level governance [54]. In parallel with the logging ban policy in natural forest, Vietnam has successfully socialized forestry organization, calling for public participation in afforestation and forest protection, and resulting in reduced deforestation and degradation and improved long-term income for people in rural mountainous areas. The speed of loss of natural forests has also decreased slightly and there have been signs of increase from 2010 to the present day. In 2016, Vietnam began to introduce bans on natural forest wood exploitation into the law on forest protection and development, which is the most powerful law in forestry. Simultaneously, it maximized the closure of natural forests, did not convert natural forests to other purposes, and did not convert poor natural forests to industrial crops. This is the driving force behind reductions in degradation and prevention of illegal logging, and allows us to predict recovery and increase in the quality of natural forests in the future.

Generally, this study provides information on the dynamics and spatial processes of natural forest change in a given study site between 2007 and 2016. The result obtained demonstrates the general trend of forest types conversion and provides useful information for sustainable forest planning.

6. Conclusions

There is an essential requirement for forest management and protection to classify natural forests and assess their fluctuations over time. However, classifying the natural forest types in tropical areas using remote sensing images is challenging because of the very similar information captured by remotely sensed data as well as the constraint of samples data. Furthermore, there is a lack of research assessing forest transition in the natural forest from the perspective of landscape ecology, which can be used for forest structure management, and to quantitatively characterize the spatial patterns of forest landscapes. In this study, we addressed these issues by applying semi-supervised classification for data integration of optical and SAR data.

The combination of Landsat and PolSAR data resulted in improved discrimination of forest types. The using of multi-source remotely sensed data can provide more information about the object, as well mitigate the disadvantages of Landsat images (cloud, lower spatial resolution), and limited information regarding objects in PALSAR/PALSAR-2 image (only two polarization HH and HV).

In this study, we assessed the potential of a proposed semi-supervised model developed and validated for mapping forest types and assessed the process of forest transition in a tropical natural forest in Vietnam. The model produced high accuracies in the classified images in 2007, 2010, and 2016 with over 0.74 for kappa, and over 0.8 for OA. Additionally, landscape metrics were used to evaluate the forest changes based on the spatial processes, such as aggregation, fragmentation, and compaction. At the class level, the poor forest demonstrated the largest variation with more dispersed growth patterns, while other types had a low level of aggregation. At the landscape level, the natural forest experiences increased fragmentation, which involved an increase in landscape area with shrinkage of patch size and disproportionate distribution of patches.

We recommend that future research include comparison of different models to estimate the improvement resulting from the proposed model. Another important study that should be conducted is testing of the proposed methods on larger areas.

Author Contributions

Conceptualization, T.T.C.T.; methodology, T.T.C.T.; validation, T.T.C.T., H.T.; formal analysis, T.T.C.T.; investigation, T.T.C.T.; resources, H.T., N.Q.T.; data curation, T.T.C.T.; writing—original draft preparation, T.T.C.T.; writing—review and editing, T.T.C.T., H.T.; visualization, T.T.C.T.; supervision, H.T., X.W.; project administration, T.T.C.T.; funding acquisition, H.T.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A.

Table A1. List of 56 metrics in class level (C) and 63 metrics in landscape level (L).

Number	Variable		Description
1	CA	C	Total class area
2	CLUMPY	C	Clumpiness index
3	CPLAND	C	Core area percentage of landscape
4	NLSI	C	Normalized landscape shape index
5	AI	C, L	Aggregation index
6	AREA_AM	C, L	Area-weighted mean patch size
7	AREA_CV	C, L	Patch size coefficient of variation
8	AREA_MN	C, L	Mean patch size
9	CAI_AM	C, L	Area-weighted mean core area index
10	CAI_CV	C, L	Core area coefficient of variation
11	CAI_MN	C, L	Mean core area index
12	CIRCLE_AM	C, L	Area-weighted mean circumscribing circle
13	CIRCLE_CV	C, L	Circumscribing circle coefficient of variation
14	CIRCLE_MN	C, L	Mean coefficient of variation
15	COHESION	C, L	Patch cohesion
16	CONNECT	C, L	Connectance index
17	CONTIG_AM	C, L	Area-weighted contiguity index
18	CONTIG_CV	C, L	Contiguity index coefficient of variation
19	CONTIG_MN	C, L	Mean coefficient of variation
20	CORE_AM	C, L	Area-weighted mean core area
21	CORE_CV	C, L	Core area coefficient of variation
22	CORE_MN	C, L	Mean core area
23	DCAD	C, L	Disjunct core area density
24	DCORE_AM	C, L	Area-weighted mean disjunct core area
25	DCORE_CV	C, L	Disjunct core area coefficient of variation
26	DCORE_MN	C, L	Mean disjunct core area
27	DIVISION	C, L	Division index
28	ED	C, L	Edge density
29	ENN_AM	C, L	Area-weighted mean nearest neighbor distance
30	ENN_CV	C, L	Nearest neighbor distance coefficient of variation
31	ENN_MN	C, L	Mean nearest neighbor distance
32	FRAC_AM	C, L	Area-weighted mean fractal dimension
33	FRAC_CV	C, L	Fractal dimension coefficient of variation
34	FRAC_MN	C, L	Mean fractal dimension
35	GYRATE_AM	C, L	Mean radius of gyration
36	GYRATE_CV	C, L	Radius of gyration coefficient of variation
37	GYRATE_MN	C, L	Mean radius of gyration
38	IJI	C, L	Interspersion/juxtaposition index
39	LPI	C, L	Largest patch index
40	LSI	C, L	Landscape shape index
41	MESH	C, L	Mesh index
42	NDCA	C, L	Number of disjunct core areas
43	NP	C, L	Number of patches
44	PAFRAC	C, L	Perimeter–area fractal dimension
45	PARA_AM	C, L	Area-weighted mean perimeter–area ratio
46	PARA_CV	C, L	Perimeter–area ratio coefficient of variation
47	PARA_MN	C, L	Mean perimeter-area ratio
48	PD	C, L	Patch density
49	PLADJ	C, L	Proportion of like adjacencies
50	PLAND	C, L	Proportion of landscape
51	SHAPE_AM	C, L	Area-weighted mean shape index
52	SHAPE_CV	C, L	Shape index coefficient of variation
53	SHAPE_MN	C, L	Mean shape index
54	SPLIT	C, L	Splitting index
55	TCA	C, L	Total core area
56	TE	C, L	Total edge
57	CONTAG	L	Contagion
58	MSIDI	L	Modified Simpson’s diversity index
59	MSIEI	L	Modified Simpson’s evenness index
60	PR	L	Patch richness
61	PRD	L	Patch richness density
62	RPR	L	Relative patch richness
63	SHDI	L	Shannon’s diversity index
64	SHEI	L	Shannon’s evenness index
65	SIDI	L	Simpson’s patch density
66	SIEI	L	Simpson’s patch evenness
67	TA	L	Total landscape area

Appendix B.

Table A2. Universality, strength, and consistency at class level.

Cluster	Members	% Total	Eigenvalue	% Variation Explained	Average in Group Correlation
1	9	96	4.01	7.29	0.81
2	4	83	2.13	3.87	0.97
3	6	93	2.35	4.27	0.71
4	8	100	4.05	7.36	0.92
5	4	75	1.88	3.43	0.85
6	3	92	1.00	1.83	0.61
7	2	83	0.71	1.29	0.65
8	7	89	2.62	4.77	0.68
9	3	100	1.22	2.22	0.74
10	4	83	1.69	3.07	0.76
11	5	85	2.15	3.91	0.78

Table A3. Universality, strength, and consistency at the landscape level.

Cluster	Members	% Total	Eigenvalue	% Variation Explained	Average in Group Correlation
1	10	100	5.27	8.50	0.85
2	7	100	4.17	6.72	0.96
3	10	100	4.85	7.83	0.78
4	8	100	3.55	5.73	0.72
5	8	100	4.03	6.49	0.81
6	5	100	2.13	3.43	0.69
7	2	100	1.22	1.98	0.99
8	2	100	1.24	2.00	1.00
9	8	100	3.64	5.87	0.73
10	2	100	1.18	1.91	0.95

Appendix C.

Figure A1. Distribution and density of some parameters (HH and HV signals in decibels for SAR data, and red, near-infrared, and shortwave infrared 1 in reflectance for Landsat data) in four forest types in three years 2007 (a,b), 2010 (c,d), and 2016 (e,f).

Appendix D.

Table A4. Confusion matrix of classification in 2007.

Prediction	Medium	Rich	Poor	Restoration	User’s
Medium	15	1	4	0	75.00
Rich	0	5	0	0	100.00
Poor	0	0	14	0	100.00
Restoration	0	1	1	9	81.82
Producer’s	100.00	71.43	73.68	100.00
Overall accuracy	0.86
Kappa	0.81

Table A5. Confusion matrix of classification in 2010.

Prediction	Medium	Rich	Poor	Restoration	User’s
Medium	7	0	0	0	100.00
Rich	0	4	0	0	100.00
Poor	2	0	4	1	57.14
Restoration	0	0	1	4	80.00
Producer’s	77.78	100.00	80.00	80.00
Overall accuracy	0.82
Kappa	0.76

Table A6. Confusion matrix of classification in 2016.

Prediction	Medium	Rich	Poor	Restoration	User’s
Medium	7	0	2	1	70.00
Rich	0	12	2	0	85.71
Poor	0	0	12	1	92.31
Restoration	1	2	0	5	62.50
Producer’s	87.50	85.71	75.00	71.43
Overall accuracy	0.81
Kappa	0.74

References

Mather, A.S.; Needle, C.L. The forest transition: A theoretical basis. Area 1998, 30, 117–124. [Google Scholar] [CrossRef]
Meyfroidt, P.; Lambin, E.F. Forest transition in Vietnam and displacement of deforestation abroad. PNAS 2009, 106, 16139–16144. [Google Scholar] [CrossRef] [PubMed]
Keenan, R.J.; Reams, G.A.; Achard, F.; de Freitas, J.V.; Grainger, A.; Lindquist, E. Dynamics of global forest area: Results from the FAO Global Forest Resources Assessment 2015. For. Ecol. Manag. 2015, 352, 9–20. [Google Scholar] [CrossRef]
Webb, E.L.; Honda, K. Biophysical and policy drivers of landscape change in a central Vietnamese district. Environ. Conserv. 2007, 34, 164–172. [Google Scholar] [CrossRef]
MacDicken, K.G. Global Forest Resources Assessment 2015: What, why and how? For. Ecol. Manag. 2015, 352, 3–8. [Google Scholar] [CrossRef]
FAO. Forest Resources Assessment 2015: Forest Resources Assessment Working Paper: Terms and Definitions; FAO: Quebec City, QC, Canada, 2015. [Google Scholar]
Li, G.; Lu, D.; Moran, E.; Dutra, L.; Batistella, M. A comparative analysis of ALOS PALSAR L-band and RADARSAT-2 C-band data for land-cover classification in a tropical moist region. ISPRS J. Photogramm. Remote Sens. 2012, 70, 26–38. [Google Scholar] [CrossRef]
Bwangoy, J.R.B.; Hansen, M.C.; Roy, D.P.; Grandi, G.D.; Justice, C.O. Wetland mapping in the Congo Basin using optical and radar remotely sensed data and derived topographical indices. Remote Sens. Environ. 2010, 114, 73–86. [Google Scholar] [CrossRef]
Hirschmugl, M.; Sobe, C.; Deutscher, J.; Schardt, M. Combined Use of Optical and Synthetic Aperture Radar Data for REDD + Applications in Malawi. Land 2018, 7, 116. [Google Scholar] [CrossRef]
Hong, S.; Moon, W.M.; Paik, H.Y. Gi-Hyuk Choi Data fusion of multiple polarimetric SAR images using discrete wavelet transform (DWT). IEEE Int. Geosci. Remote Sens. Symp. 2003, 6, 3323–3325. [Google Scholar] [CrossRef]
Yong-an, Z.; Wen-ming, Z.; Rui-hua, W. False Color Fusion for Multi-band SAR Images Based on Contourlet Transform. Acta Autom. 2007, 33, 337–341. [Google Scholar] [CrossRef]
Shimoni, M.; Borghys, D.; Heremans, R.; Perneel, C.; Acheroy, M. Fusion of PolSAR and PolInSAR data for land cover classification. Int. J. Appl. Earth Obs. Geoinf. 2009, 11, 169–180. [Google Scholar] [CrossRef]
Chambers, J.Q.; Asner, G.P.; Morton, D.C.; Anderson, L.O.; Saatchi, S.S.; Espírito-Santo, F.D.B.; Palace, M.; Souza, C. Regional ecosystem structure and function: Ecological insights from remote sensing of tropical forests. Trends Ecol. Evol. 2007, 22, 414–423. [Google Scholar] [CrossRef] [PubMed]
Triguero, I.; García, S.; Herrera, F. Self-labeled techniques for semi-supervised learning: Taxonomy, software and empirical study. Knowl. Inf. Syst. 2015, 42, 245–284. [Google Scholar] [CrossRef]
Gao, J.; Liang, F.; Fan, W.; Sun, Y.; Han, J. Graph-based Consensus Maximization among Multiple Supervised and Unsupervised Models. Adv. Neural Inf. Process. Syst. 2009, 22, 585–593. [Google Scholar]
Gao, J.; Liang, F.; Fan, W.; Sun, Y.; Han, J. A graph-based consensus maximization approach for combining multiple supervised and unsupervised models. IEEE Trans. Knowl. Data Eng. 2013, 25, 15–28. [Google Scholar] [CrossRef]
Ma, L.; Ma, A.; Ju, C.; Li, X. Graph-based semi-supervised learning for spectral-spatial hyperspectral image classification. Pattern Recognit. Lett. 2016, 83, 133–142. [Google Scholar] [CrossRef]
Sawant, S.S.; Prabukumar, M. A review on graph-based semi-supervised learning methods for hyperspectral image classification. Egypt. J. Remote Sens. Space Sci. 2018, 1–6. [Google Scholar] [CrossRef]
Qi, Z.; Tian, Y.; Shi, Y. Laplacian twin support vector machine for semi-supervised classification. Neural Netw. 2012, 35, 46–53. [Google Scholar] [CrossRef] [PubMed]
Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2016, arXiv:1609.02907. [Google Scholar]
Erkan, A.N.; Camps-Valls, G.; Altun, Y. Semi-supervised remote sensing image classification via maximum entropy. In Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing, Kittilä, Finland, 29 August–1 September 2010; pp. 313–318. [Google Scholar] [CrossRef]
Alok, A.K.; Saha, S.; Ekbal, A. Pixel Classification of Remote Sensing Satellite Image using Semi-supervised Clustering. In Proceedings of the 2014 9th International Conference on Industrial and Information Systems (ICIIS), Gwalior, India, 15–17 December 2015. [Google Scholar]
Silva, J.; Bacao, F.; Caetano, M. Specific land cover class mapping by semi-supervised weighted support vector machines. Remote Sens. 2017, 9, 181. [Google Scholar] [CrossRef]
Cui, B.; Xie, X.; Hao, S.; Cui, J.; Lu, Y. Semi-Supervised Classification of Hyperspectral Images Based on Extended Label Propagation and Rolling Guidance Filtering. Remote Sens. 2018, 10, 515. [Google Scholar] [CrossRef]
Yarowsky, D. Unsupervised word sense disambiguation rivaling supervised methods. In Proceedings of the 33rd Annual Meeting on Association for Computational Linguistics, Cambridge, MA, USA, 26–30 June 1995; pp. 189–196. [Google Scholar]
Dópido, I.; Member, S.; Li, J.; Marpu, P.R.; Plaza, A.; Member, S.; Dias, J.M.B.; Benediktsson, J.A. Semisupervised Self-Learning for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4032–4044. [Google Scholar] [CrossRef]
Saunders, C.; Gammerman, A.; Vovk, V. Ridge Regression Learning Algorithm in Dual Variables. In Proceedings of the Fifteenth International Conference on Machine Learning, Madison, WI, USA, 24–27 July 1998; Volume 37, pp. 515–521. [Google Scholar] [CrossRef]
Du, H.Q.; Ge, H.L.; Liu, E.B.; Xu, W.B.; Jin, W. A new Classifier for Remote Sensing Data Classification: Partial Least Squares. In International Workshop on Earth Observation and Remote Sensing Applications; IEEE: Beijing, China, 2008; p. 302. [Google Scholar]
Laparra, V.; Gómez-Chova, L.; Malo-López, J.; Camps-Valls, G.; Muñoz-Marí, J. A Review of Kernel Methods in Remote Sensing Data Analysis. In Optical Remote Sensing; Springer: Berlin, Germany, 2011; pp. 171–206. ISBN 3496354402. [Google Scholar]
Antropov, O.; Rauste, Y.; Häme, T.; Praks, J. Polarimetric ALOS PALSAR time series in mapping biomass of boreal forests. Remote Sens. 2017, 9, 999. [Google Scholar] [CrossRef]
Wu, J. Landscape ecology. In Encyclopedia of Ecology; Jorgensen, S.E., Ed.; Elsevier: Oxford, UK, 2008; pp. 2103–2108. [Google Scholar] [CrossRef]
Botequilha Leitão, A.; Ahern, J. Applying landscape ecological concepts and metrics in sustainable landscape planning. Landsc. Urban Plan. 2002, 59, 65–93. [Google Scholar] [CrossRef]
Antrop, M.; Mander, Ü.; Marja, R.; Roosaare, J.; Uuemaa, E. Landscape Metrics and Indices: An Overview of Their Use in Landscape Research. Living Rev. Landsc. Res. 2009, 3, 1–28. [Google Scholar]
Tlapakova, L.; StejskalovA, D.; Karasek, P.; Podhrazka, J. Landscape metrics as a tool for evaluation landscape structure-case study hustopece. Eur. Ctry. 2013, 5, 52–70. [Google Scholar] [CrossRef]
Lausch, A.; Herzog, F. Applicability of landscape metrics for the monitoring of landscape change: Issues of scale, resolution and interpretability. Ecol. Indic. 2002, 2, 3–15. [Google Scholar] [CrossRef]
Neel, M.C.; McGarigal, K.; Cushman, S.A. Behavior of class-level landscape metrics across gradients of class aggregation and area. Landsc. Ecol. 2004, 19, 435–455. [Google Scholar] [CrossRef]
Gyenizse, P.; Bognár, Z.; Czigány, S. Landscape shape index, as a potential indicator of urban development in Hungary. Landsc. Environ. 2014, 8, 78–88. [Google Scholar]
Sano, M.; Miyamoto, A.; Furuya, N.; Kogi, K. Using landscape metrics and topographic analysis to examine forest management in a mixed forest, Hokkaido, Japan: Guidelines for management interventions and evaluation of cover changes. For. Ecol. Manag. 2009, 257, 1208–1218. [Google Scholar] [CrossRef]
Yang, Y.; Zhou, Q.; Gong, J.; Wang, Y. Gradient analysis of landscape spatial and temporal pattern changes in Beijing metropolitan area. Sci. China Technol. Sci. 2010, 53, 91–98. [Google Scholar] [CrossRef]
Geri, F.; Rocchini, D.; Chiarucci, A. Landscape metrics and topographical determinants of large-scale forest dynamics in a Mediterranean landscape. Landsc. Urban Plan. 2010, 95, 46–53. [Google Scholar] [CrossRef]
Smiraglia, D.; Ceccarelli, T.; Bajocco, S.; Perini, L.; Salvati, L. Unraveling Landscape Complexity: Land Use/Land Cover Changes and Landscape Pattern Dynamics (1954–2008) in Contrasting Peri-Urban and Agro-Forest Regions of Northern Italy. Environ. Manag. 2015, 56, 916–932. [Google Scholar] [CrossRef] [PubMed]
Martinez del Castillo, E.; García-Martin, A.; Longares Aladrén, L.A.; de Luis, M. Evaluation of forest cover change using remote sensing techniques and landscape metrics in Moncayo Natural Park (Spain). Appl. Geogr. 2015, 62, 247–255. [Google Scholar] [CrossRef]
FAO FAOSTAT. Available online: http://www.fao.org/faostat/en/#compare (accessed on 25 April 2019).
Meyfroidt, P.; Lambin, E.F. Forest transition in Vietnam and its environmental impacts. Glob. Chang. Biol. 2008, 14, 1319–1336. [Google Scholar] [CrossRef]
Cong Thang, T. Review of Vietnam Forestry Policies; Food and Fertilizer Technology Center for the Asian and Pacific Region: Vietnam, 2015. [Google Scholar]
Ministry of Agriculture and Rural development (MARD) in Vietnam. Circular No. 34/2009/TT-BNNPTNT of June 10, 2009, on criteria for forest identification and classification.
Huete, A.R.; Liu, H.Q.; Batchily, K.V.; van Leeuwen, W. A comparsion of vegetation indices over a Global set of TM images for EO-MODIS. Remote Sens. Environ. 1997, 59, 440–451. [Google Scholar] [CrossRef]
Ainsworth, T.L.; Preiss, M.; Stacy, N.; Nord, M.; Lee, J. Analysis of Compact Polarimetric SAR Imaging Modes Compact Polarimetry-Enhancing Dual-Pol Imagery. In Proceedings of the POLinSAR Workshop 2007, Frascati, Italy, 22–26 January 2007. [Google Scholar]
Van Zyl, J.; Kim, Y. Synthetic Aperture Radar Polarimetry; Google eBook; California Institute of Technology: Pasadena, CA, USA, 2011; Volume 2, p. 313. [Google Scholar]
ESA Tutorial on SAR Polarimetry. Available online: https://earth.esa.int/documents/653194/656796/Polarimetric_Decompositions.pdf (accessed on 6 October 2017).
Sun, P. Sparse Kernel Least Squares Classifier. In Proceedings of the Fourth IEEE International Conference on Data Mining, Houston, TX, USA, 27–30 November 2005; pp. 539–542. [Google Scholar] [CrossRef]
Congalton, R.G. A Review of Assessing the Accuracy of Classifications of Remotely Sensed Data. Remote Sens. Environ. 1991, 37, 35–46. [Google Scholar] [CrossRef]
Aguilera, F.; Valenzuela, L.M.; Botequilha-Leitão, A. Landscape metrics in the analysis of urban land use patterns: A case study in a Spanish metropolitan area. Landsc. Urban Plan. 2011, 99, 226–238. [Google Scholar] [CrossRef]
Van Khuc, Q.; Tran, B.Q.; Meyfroidt, P.; Paschke, M.W. Drivers of deforestation and forest degradation in Vietnam: An exploratory analysis at the national level. For. Policy Econ. 2018, 90, 128–141. [Google Scholar] [CrossRef]
Cochard, R.; Tri Ngo, D.; Waeber ETH Zurich, P.; Waeber, P.O.; Kull, C.A.; Cochard, R.; Ngo, D.; Waeber, P.; Kull, C. Extent and causes of forest cover changes in Vietnam’s provinces 1993–2013: A review and analysis of official data Alaotra Resilience Landscape (AlaReLa) Madagascar View project Global Forest Watch in Madagascar View project REVIEW Extent and causes of forest cover changes in Vietnam’s provinces 1993–2013: A review and analysis of official data. Environ. Rev. 2017, 25, 199–217. [Google Scholar] [CrossRef]
Vu, H.T.; Pham, X.P. Impacts and effectiveness of logging bans in natural forests in Vietnam. In Asia-Pacific Forestry Commission Forests out of Bounds: Impacts and Effectiveness of Logging Bans in Natural Forests in Asia-Pacific; Durst, P.B., Waggener, T.R., Enters, T., Cheng, T.L., Eds.; RAP Publication: Bangkok, Thailand, 2001; pp. 185–207. [Google Scholar]
Phuc, X.; Junior, T.S. Illegal timber logging in Vietnam: Who profits from forest privatization connected with a logging ban? In Survival of the Commons: Mounting Challenges and New Realities. In Proceedings of the Eleventh Conference of the International Association for the Study of Common Proverty, Bali, Indonesia, 19–23 June 2006. [Google Scholar]

Figure 1. Cover of synthetic aperture radar (SAR) images and in-situ data in (a) 2007, (b) 2010, (c) 2016, and (d) location map of the study area in Landsat data with pseudo colors (R: SWIR 2, G: near-infrared, B: green).

Figure 2. Flowchart of the methodology employed in this study.

Figure 3. Flowchart of classification using the combination of self-learning with kernel least squares classifier in this study.

Figure 4. Forest types classification accuracies in user, producer (%), overall accuracy (OA), and kappa in the years 2007, 2010, and 2016.

Figure 5. Variation of four forest types in the total class area of natural forest (CA_ha) and percentage of landscape (PLAND_%) for each forest type from 2007 to 2016.

Figure 6. Changes in (a) rich forest, (b) medium forest, (c) poor forest, and (d) restoration forest between 2007 and 2016.

Table 1. Characteristics of satellite image data used in this study.

Date	Types	Level	Incidence Angle at Scene Center	Resolution (m)	Polarization/Band
2016/05/29	PALSAR2	1.1	38.99	3.12 × 4.55	HH + HV + VH + VV
2016/09/04	PALSAR2	1.1	40.5	3.4 × 6.6	HH + HV
2010/07/10	PALSAR	1.1	38.7	3.2 × 15	HH + HV
2010/07/27	PALSAR	1.1	38.7	3.2 × 15	HH + HV
2007/07/02	PALSAR	1.5	38.7	12.5	HH + HV
2007/07/19	PALSAR	1.5	38.7	12.5	HH + HV
2007/04/24	Landsat TM	1	-	30	5
2010/02/11	Landsat TM	1	-	30	5
2016/04/16	Landsat OLI	1	-	15, 30	11

Table 2. Ground data for the four forest types in the study area in 2007, 2010, and 2016.

Types	Number of Samples
Types	2007	2010	2016
Rich forest	17	20	29
Medium forest	68	34	23
Poor forest	48	34	37
Restoration forest	37	27	17
Total	170	115	106

Table 3. Set of high representative metrics for analyzing multi-temporal forest types structure at class and landscape level in the study area.

No		Metric Name	Level	Description
1	Aggregation/ Fragmentation	AI	C	Aggregation index
2		CLUMPY	C	Clumpiness index
3		COHESION	C	Patch cohesion
4		NLSI	C	Normalized landscape shape index
5		PLADJ	C	Proportion of like adjacencies
6		IJI	L	Interspersion/ juxtaposition index
7		MESH	L	Effective mesh size
8	Area and edge metrics	AREA_AM	L	Area-weighted mean patch size
9		AREA_CV	L	Patch size coefficient of variation
10		GYRATE_AM	L	Area-weighted radius of gyration
11	Core area metrics	CAI_CV	C	Core area coefficient of variation
12		CORE_AM	L	Area-weighted mean core area
13		CORE_CV	L	Core area coefficient of variation
14		DCORE_AM	L	Area-weighted mean disjunct core area
15		DCORE_CV	L	Disjunct core area coefficient of variation
16	Shape metrics	CIRCLE_AM	C	Area-weighted related circumscribing circle
17		CIRCLE_CV	C, L	Circumscribing circle coefficient of variation
18		CIRCLE_MN	C, L	Mean related circumscribing circle
19		CONTIG_AM	C	Area-weighted contiguity index
20		CONTIG_MN	C, L	Mean contiguity index
21		CONTIG_CV	L	Contiguity index coefficient of variation
22		SHAPE_MN	C, L	Mean shape index
23		SHAPE_AM	L	Area-weighted mean shape index
24		SHAPE_CV	L	Shape index coefficient of variation
25		FRAC_AM	L	Area-weighted mean fractal dimension
26		FRAC_MN	C, L	Mean fractal dimension
27		FRAC_CV	C, L	Fractal dimension coefficient of variation
28		PARA_MN	C, L	Mean perimeter–area ratio
29		PARA_AM	C	Area-weighted mean perimeter–area index

Table 4. Pattern metrics changes in the four forest types in class level metrics.

Types	Metrics	% Change
Types	Metrics	2007–2010	2010–2016	2007–2016
Rich forest	SHAPE_MN	−20	18	−5
	CIRCLE_MN	−28	27	−9
	CIRCLE_AM	−14	19	2
	CIRCLE_CV	43	−17	18
	CONTIG_MN	−51	58	−22
	CONTIG_AM	−21	29	2
	CLUMPY	−12	11	−3
	PLADJ	−15	16	1
	COHESION	−12	12	1
	AI	−15	16	1
	nLSI	67	−38	4
Medium forest	SHAPE_MN	−16	5	−12
	CIRCLE_MN	−3	−3	−6
	CIRCLE_AM	6	−9	−4
	CIRCLE_CV	−6	6	−1
	CONTIG_MN	−11	−1	−11
	CONTIG_AM	9	−5	4
	CLUMPY	7	1	9
	PLADJ	7	−4	3
	COHESION	5	−8	−3
	AI	6	−2	4
	nLSI	−29	16	−17
Poor forest	SHAPE_MN	−9	6	−4
	CIRCLE_MN	−19	18	−4
	CIRCLE_AM	−6	−6	−11
	CIRCLE_CV	29	−24	−1
	CONTIG_MN	−32	29	−12
	CONTIG_AM	−8	−2	−10
	CLUMPY	−7	6	−1
	PLADJ	−6	−2	−8
	COHESION	−2	−6	−8
	AI	−7	−1	−7
	nLSI	53	4	60
Restoration forest	SHAPE_MN	−8	3	−6
	CIRCLE_MN	−14	9	−6
	CIRCLE_AM	9	−7	2
	CIRCLE_CV	25	−15	6
	CONTIG_MN	−24	24	−6
	CONTIG_AM	8	−7	0
	CLUMPY	−3	−2	−4
	PLADJ	6	−5	1
	COHESION	5	−6	−1
	AI	5	−4	1
	nLSI	−25	33	0

Table 5. Pattern metrics changes in landscape level metrics.

Metrics	% Change
Metrics	2007–2010	2010–2016	2007–2016
AREA_AM	−49	−22	−60
GYRATE_AM	−7	−20	−26
SHAPE_AM	7	−25	−20
CONTIG_MN	−30	14	−21
IJI	−5	−2	−8

Table 6. Conversion matrix of forest types between 2007 and 2016 in percentage (%) and area (ha).

	Rich	Medium	Poor	Restoration	Others	Area increase (ha)
2007	Rich	Medium	Poor	Restoration	Others	Area increase (ha)
Rich	19	16	13	1	1	24,569
Medium	36	29	30	13	4	52,749
Poor	21	23	26	29	3	34,970
Restoration	14	18	15	26	2	33,585
Others	11	14	15	31	90
Area decrease (ha)	−24,046	−37,955	−66,409	−21,732
Net area change (ha)	522	14794	−29,437	11,853

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cat Tuong, T.T.; Tani, H.; Wang, X.; Quang Thang, N. Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam. Forests 2019, 10, 673. https://doi.org/10.3390/f10080673

AMA Style

Cat Tuong TT, Tani H, Wang X, Quang Thang N. Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam. Forests. 2019; 10(8):673. https://doi.org/10.3390/f10080673

Chicago/Turabian Style

Cat Tuong, Truong Thi, Hiroshi Tani, Xiufeng Wang, and Nguyen Quang Thang. 2019. "Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam" Forests 10, no. 8: 673. https://doi.org/10.3390/f10080673

APA Style

Cat Tuong, T. T., Tani, H., Wang, X., & Quang Thang, N. (2019). Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam. Forests, 10(8), 673. https://doi.org/10.3390/f10080673

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Semi-Supervised Classification and Landscape Metrics for Mapping and Spatial Pattern Change Analysis of Tropical Forest Types in Thua Thien Hue Province, Vietnam

Abstract

1. Introduction

2. Study Area

3. Data and Methods

3.1. Data

3.2. Methods

3.2.1. Preprocessing

3.2.2. Masking Undesirable Areas

3.2.3. Self-Learning with the Kernel Least Squares (SL-KLS) Classifier for Forest Types Classification

Kernel Least Squares (KLS)

Self-learning with the Kernel Least Squares (SL-KLS) Classifier

3.2.4. Forest Pattern Analysis Using Landscape Metrics

Extraction of Landscape Metrics

Selection of a Set of Landscape Metrics

Analysis of Forest Pattern Change

4. Results

4.1. Forest Type Classification

4.2. Forest Pattern Analysis at the Class Level

4.2.1. Period 2007–2010

4.2.2. Period 2010–2016

4.3. Forest Pattern Analysis in Landscape Level

5. Discussion

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A.

Appendix B.

Appendix C.

Appendix D.

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI