Stem Taper Approximation by Artificial Neural Network and a Regression Set Models

Socha, Jaroslaw; Netzel, Pawel; Cywicka, Dominika

doi:10.3390/f11010079

Open AccessArticle

Stem Taper Approximation by Artificial Neural Network and a Regression Set Models

by

Jaroslaw Socha

¹

,

Pawel Netzel

^1,2 and

Dominika Cywicka

^1,*

¹

Department of Forest Resources Management, Faculty of Forestry, University of Agriculture in Krakow, Al. 29 Listopada, 31-425 Krakow, Poland

²

Space Informatics Lab, University of Cincinnati, Cincinnati, OH 45221, USA

^*

Author to whom correspondence should be addressed.

Forests 2020, 11(1), 79; https://doi.org/10.3390/f11010079

Submission received: 30 October 2019 / Revised: 30 December 2019 / Accepted: 31 December 2019 / Published: 9 January 2020

(This article belongs to the Section Forest Ecology and Management)

Download

Browse Figures

Versions Notes

Abstract

Variation in tree stem form depends on species, age, site conditions, etc. Stem taper models that estimate stem diameter at any height and volume should comply with this complexity. In the paper, we propose new methods taking into account both unbiased estimates and stem variability: (i) an expert model based on an artificial neural network (ANN) and (ii) a statistical model built using a regression tree (REG). We used the variable-exponent taper equation (STE) as a reference for these two models. Input data contain information about 2856 trees representing eight dominant forest-forming tree species in Poland (birch, beech, oak, fir, larch, alder, pine, and spruce). The trees were selected across stands varied in terms of age and site conditions. Based on the data, we built ANN and REG models and calculated both stem taper and tree volumes. The results show that ANN is a universal approach that offers the most precise estimation of stem diameter at a particular stem height for different tree species. The results for alder are an exception. In this case, the REG model performs slightly better than ANN. In terms of volume prediction, the ANN model provides the most accurate predictions for coniferous and beech. In general, flexibility and predictive performance of the ANN are better than REG and reference the STE equation.

Keywords:

stem form; stem profile; tree volume; stem taper modeling, stem diameter at any height

1. Introduction

Models for estimating a stem taper enable one to estimate stem volume, thus being useful in both the assessment of the economic value of timber production and forest conservation management [1]. According to the IPCC methodology, merchantable timber volume is used to convert the growing stock of forest stands into the amounts of biomass and carbon accumulated in trees [2], with the help of either biomass conversion and expansion factor (BCEF) or biomass expansion factor (BEF). Such conversion requires accurate and unbiased systematic errors as well as methods for timber volume determination. Taper models, which allow the estimation of tree shape and wood assortment volume are also one of the most important types of practical information used in the forest management and timber industry [3]. Therefore, the accurate determination of the shape of a tree stem and the tree volume is crucial for forest research and practice.

Generally, stem taper can be described by either a set of linear models, describing diameters at different relative tree heights or by nonlinear models describing the whole stem profile [4]. The linear models are less biased in tree diameter estimation, however these models suffer from a serious disadvantage: they do not enable one to estimate a stem diameter at any height [5,6]. Nonlinear models, such as segmented taper equations [7,8,9,10,11], or variable-exponent taper equations [12,13,14,15,16,17,18] overcome this problem—but not without a cost [19].

A comparison of linear and nonlinear models should take into account their robustness to assumptions, in particular the one of data homogeneity. Nonlinear models are more sensitive to such phenomena as outlier values, stochastic noise in the variables, anomalies in sample randomness, and measurement errors, which all can increase systematic errors. Among the nonlinear heuristic models used for stem profile modeling, variable-exponent taper equations are considered the best [13,19].

The shape of a tree stem is genetically determined, but it also depends on various factors, such as the site conditions, climate, the height of the crown base, and age. Therefore, it also varies among trees of the same species [20]. Socha [21] showed that a pine stand’s density affects the shape of the upper part of the stems. A good fit of a model largely results from the independent variables used. For example, Murhaiwe [22] showed that including a crown ratio variable in a variable-exponent taper equation [13] helped improve the model’s fit for shore pine but not for common aspen. Thus, many taper equations are specific for a given species, stands of a particular age, or a particular site conditions. What is more, trees of irregular shapes often are excluded during data preprocessing. While among most coniferous species, trees of irregular shape are rare, among deciduous species and other species with high plasticity in morphology they can be quite frequent. Removing them might limit the sample to regularly—shaped trees, and so the usefulness of such models would only be limited to such trees—such models, thus, should not be used for irregular trees [23]. If such irregulars constitute a significant part of the population—a likely scenario in the case of deciduous trees—then such models are not useful. Assessment of stem taper models for mentioned cases is usually based on results of unbiased estimates [4]. Nonetheless, a model’s precision of prediction does not always correspond to its universality, an important aspect in practice.

One of universal approach that includes both—unbiased estimates and precise estimation of a stem diameter at a particular stem height, can be artificial neural networks (ANNs). The literature have shown examples indicating that in this context ANNs give better results than do mathematical and statistical methods [24,25]. The most popular among multilayer perceptron (MLP), ANNs with three layers can theoretically be considered universal approximators [26].

Since ANNs can learn, they do not require one to have the full a priori knowledge of a system studied: Thus, they enable one to build models without the prior formulation of statistical hypotheses [27]. They can also work with data that are noisy and low quality [28,29,30]; the latter aspect—paradoxically—can even improve the network’s learning capacity and the generalization of its results. ANNs can model complex, multivariate nonlinear relationships, often difficult to represent with known mathematical functions. For example, [31] applied ANNs to analyze a tree diameter distribution, and sigmoid activation functions they implemented in the ANN led to a better fit to bimodal distributions than that of the Weibull function.

ANNs have also been widely used in forest management, including the estimation of tree height, diameter, and volume [27,32].Various types of networks have been used to meet these aims, including radial basis functions (RBF) [33] and, the most frequent ones, one- or two-layer perceptron MLP [34,35,36,37]. Conceptually, the models used in these works differed not only in architecture, but also in data input and model evaluation. According to Kozak and Smith [38], the evaluation of taper models should include evaluating the precision of both diameter and volume prediction as well as the universality of predicting stem shape for various species. From the point of view of forestry practice, such a model should also work with various independent variables and be simple to implement.

Most ANN models for stem taper approximation focus on only one species [36,37]. Models focused on many species incorporate species information as a parameter [35]. Often, the information about species is a part of an equation or a direct input value [34].

A regression approach to model stem taper was used by Kilkki, Varmola [39]. The authors considered three models: single-equation, simultaneous-equation, and multi-equation. The first two lead to a single equation while the last one to a system of regression equations. In such a system, equations are related to each other. Socha and Kulej [20] introduced a parallel-equation model, in which all regression equations are independent of each other. They examined different selections of independent variables and only used a set of 20 equations (with only ten cross-sections) to describe a stem taper shape. Moreover, this solution works with just one species.

In the paper, we propose two new solutions for modeling tree taper: (i) an expert model based on an artificial neural network model and (ii) a statistical model built using a regression tree. The solutions aim to provide tools for calculating stem profile and tree volume with high efficiency and low bias. The results of tree taper modelling using models (i) and (ii) were compared with a well-known and frequently used in forest research variable-exponent taper model developed by Kozak [14,19]. An outcome of all these models give a stem profile, which is then integrated along tree height, giving tree volume. For the models, we will use training data for eight tree species, with high vertical resolution of 0.01 of normalized tree height. The trees in the data set vary in terms of age and site conditions in which they grow.

We compare three solutions for modeling tree taper (two new models and a reference solution): (i) an machine learning model based on an artificial neural network model, (ii) a statistical model built using a regression tree, and (iii) variable exponent taper equation by Kozak (2004) recommended in [19,40].

The specific aims of this study were:

(1): to compare the modelling techniques with respect to their performance to estimate stem profile and tree volume;
(2): to rank the modelling techniques according to predictive performance for various tree species;
(3): to find a modelling technique that combines estimating of stem taper shape for many tree species into one model.

2. Data and data Preprocessing

Cross-sectional measurements of the outside-bark diameter were collected in 357 stands, distributed throughout Poland and representing the whole range of site conditions and age of the tree species analyzed (Figure 1). In each stand, a sample plot with at least 100 trees was established. After their diameters had been measured, the trees were divided into eight size classes of equal sizes (i.e., consisting of the same number of trees). In the next step, from each class one tree with average diameter and height was selected and felled for cross-sectional measurements; thus, eight trees representing were measured in each stand. Altogether, a total of 2856 trees representing eight major forest-forming tree species in Poland were collected, including 504 Scots pines (Pinus sylvestris L.), 458 Norway spruces (Picea abies (L.) H. Karst), 262 European larches (Larix decidua Mill.), 219 silver firs (Abies alba Mill.), 479 common oaks (Quercus robur L.), 430 common beeches (Fagus sylvatica L.), 270 black alders (Alnus glutinosa Gaertn.), and 234 silver birches (Betula pendula Roth.).

The diameter measurements were taken directly with a caliper at the following heights: 0.0, 0.5, 1.3, and 2.0 m, and then every 1 m to the top. The diameter at breast height (dbh) ranged from 0.30 to 79.20 cm and their height (h) from 1.35 to 42.05 m (Table 1). Additionally, total tree height and height up to 7 cm of stem diameter were measured. For final analysis, we selected data from trees with a diameter at breast height larger than 7 cm.

The dataset was preprocessed to provide input for a model’s calibration. The height of each tree was normalized to a range from 0 to 1, and stem diameter was interpolated every 0.01 of normalized height. Interpolations were necessary in the case of regression model, in which particular equations describe tree diameters at given relative heights. We used piecewise cubic Hermite polynomials [41] to obtain interpolated values.

3. Methods

3.1. Models

Our model works as an expert system. It consists of three modules: for controlling data-flow, for summarizing output, and for calculating tree volume; each species has its own dedicated model (see Figure 2A,C).

STE and ANN models have similar structures (Figure 2A,C). The input data contain information about a tree species, based on which the species selector sends parameters to the internal modeling module, calibrated for this species. Next, the output of the internal model is integrated by the volume calculator to obtain a stem taper volume. In the REG model’s case (Figure 2B), input data are sent to a set of regression equations, calibrated for the specific species. Based on the results of these regressions, a stem taper shape is built. Finally, like in STE and ANN models, the output is integrated to obtain a stem volume.

3.2. Methods Used in Models

We compared following methods for approximating a stem shape: a taper equation, a regression set model, and a feed-forward neural network. As a reference, we calibrated a variable exponent taper equation introduced by Kozak [14].

Kozak introduced a variable exponent equation in the following form

d_{i} = a_{0} * D^{a_{1}} * H^{a_{2}} * X_{i}^{(a_{3} * z_{i}^{4} + \frac{a_{4} * 1}{e^{(\frac{D}{H})}} + a_{5} * X_{i}^{0.1} + \frac{a_{6} * 1}{D} + a_{7} * H^{Q_{i}} + a_{8} * X_{i})}

(1)

where:

X_{i} = \frac{Q_{i}}{(1 - p^{\frac{1}{3}})}

,

Q_{i} = 1 - z_{i}^{\frac{1}{3}}

,

z_{i} = \frac{h_{i}}{H}

,

p = \frac{1.3}{H}

.

In this notation, dbh means diameter at breast height in cm, H is total tree height, and

d_{i}

is a stem diameter in cm at

z_{i}

relative height.

To calibrate this model, we transformed it with logarithmic transformation and calculated the coefficients with the least squares method [14].

A regression set model consists of two components: a decision rule and a set of regressions. The decision rule takes into account the normalized height. The normalized height is ranging from 0 to 1. Next, the rule selects a proper regression model for this normalized height. Since stem taper is approximated with the resolution of 0.01, our set of regressions contains 100 regression equations.

Each equation has the following form:

d = a_{0} + a_{1} * D + a_{2} * H

(2)

where: d—stem diameter in cm, D—diameter at breast height in cm, and H—total tree height in meters.

As a result, we obtained a set of 100 vectors in the form [

a_{0}

,

a_{1}

,

a_{2}

] that describes a stem taper shape.

The third method—an artificial neural network model (ANN)—was implemented as a multilayer perceptron network [42]. It contains four layers: an input layer of 4 neurons, two hidden layers of 5 and 17 neurons, and an output layer of 101 neurons. Each neuron was activated with a sigmoid function. To each hidden layer, we added a bias neuron. Each network layer was connected to the next layer only. The network was trained with a simple backpropagation algorithm [42]. The training set was presented to the network 5000 times (5000 epochs). The number of epochs were set to 5000 to avoid network’s overfitting. The number of hidden layers was selected to speed up learning process.

As a network input, the following parameters were used: diameter at breast height in cm multiplied by 0.005 for scaling purposes, total tree height in meters multiplied by 0.01 (H), H’/H (where H’ is a height up to 7 cm of stem diameter), and p (p = 1.3/H). The network’s output provides expected diameters at 101 (from 0 to 1 with a 0.01 step) normalized heights. The output stem diameter values were multiplied by 0.005 in the training set.

Unlike Nunes and Görgens [34], we did not force ANN to recognize a tree species, but we assumed that the models should be calibrated for each tree species separately. Thus, we built an ANN model for each species, an approach that leads to a more straightforward and better-trained network.

To calibrate/train each model, we divided using random selection of cases the data set into two sets: a training set (60% of the data) and a testing set (40%). Table 2 presents the distribution of trees between the training and testing sets for all the species studied.

The training sets were used to calibrate all three models while the testing sets for the assessment of the stem taper models, based on the following model statistics:

root mean squared error, $RMSE = \sqrt{\frac{\sum {(y_{i} - {y^{'}}_{i})}^{2}}{n}}$ ,
mean error/bias, $ME = \frac{\sum (y_{i} - {y^{'}}_{i})}{n}$
model efficiency, $EF = 1 - \frac{\sum {(y_{i} - {y^{'}}_{i})}^{2}}{\sum {({y^{'}}_{i} - {\bar{y_{i}}}^{'})}^{2}}$ ,

where

y_{i}

—interpolated measurements at height i,

y_{i}^{'}

—modeled value at height i,

{\bar{y_{i}}}^{'}

—average modeled value at height i.

For the model evaluation criteria a ranking was made on a relative scale to compare the modeling techniques without separation into species. Moreover, we calculated stem volumes V, integrating the stem taper shapes along the tree heights. The values of RMSE, EF and bias in diameter and volume estimation for all three models were assumed in blocks. Inside each block, they were ranked by assigned numbers from 1 to 3 (1 for the best and 3 for the worst result). Each block contained information about RMSE, EF and bias for single species so the ranks have been added to give general information for all eight species. The sum of ranks for each model was used to assess the quality of the model.

4. Results

4.1. Diameter Estimates

A mean bias in diameter of all the three models for all the species ranged from −0.0178 to −0.0838. For most species, the model mean errors did not differ from zero (p < 0.05), with the only exceptions of beech and alder: For the former, all the three models for beech were significantly biased, and for the latter, the ANN model was.

The most efficient technique for diameter estimation was ANN (its mean EF was 0.9777). This model was most efficient for larch and pine. For larch, EF for ANN was 0.9887, for Kozak’s model 0.9846, and for REG 0.9878. For pine, EF for ANN was 0.9865, for Kozak’s model 0.9859, and for REG 0.9825. ANN was the least efficient for beech, with EF of 0.9643; the other models were even less efficient, with EF 0.9640 for REG and 0.9590 for Kozak’s model.

ANN was also the most stable taper equation in terms of RMSE, except for alder (Table 3). For this species, REG’s RMSE was 1.4277 while ANN’s and Kozak’s were higher (1.5848 and 1.5522, respectively).

To check the models, we plotted their residuals of diameter predictions versus diameter (d) (Figure 3, Figure 4 and Figure 5). The plots showed that for all the models, the errors symmetrically distributed around zero. Kozak’s taper equation model, however, showed the tendency for higher errors in the bottom and middle parts of the stems. For most species (beech, spruce, alder, larch, fir, and pine), the REG model showed errors in the lower part of the stem.

We ranked the validation results (Table 4), and the most precise technique for diameter estimation was ANN, followed by REG and Kozak’s model taper equation.

4.2. Stem Volume Estimates

Model validation for the stem volume prediction showed similar results to those for the diameter. ANN had the lowest mean RMSE over the species (0.1457), but its RMSE for beech was the highest among the three models (0.2817).

ANN was the best model for birch, fir, larch, and spruce. For oak, ANN was better than the REG model but worse than Kozak’s. Kozak’s model was the most accurate and efficient for pine and oak. For alder, however, REG had the lowest RMSE (0.0796), the lowest RSME among all the models and all the species.

Mean error ranged from −0.0551 (for the ANN model for beech) to 0.0051 (for the REG model for alder).

In summary, for stem volume estimation, the best model was ANN, but—unlike for diameter estimation—Kozak’s model was better than the REG one (Table 3).

4.3. The Models Ranking

To compare models as a complex expert systems without division to the particular tree species we applied ranking method. The sum of ranks for all evaluation criteria was on a [8–24] scale, which means that the best model obtained the lowest values and the worst model obtained the highest values (Table 4). Ranks have been calculated for the estimation of d and V separately.

For prediction d, the range of variability is very high. It covers almost all possible variability range and takes from 9 to 24. For RMSE, the ANN model obtained almost the lowest possible rank value—9, which indicates that it obtained the lowest error of all three models for almost all tree species—both coniferous and deciduous. Similarly, the ANN model obtained a value close to the minimum for EF—10. The worst ranks for prediction d were assigned to Kozak (2004) model for RMSE and EF. RMSE and EF rank values indicates that, the ANN model was significantly better than Kozak (2004) and REG models.

For prediction V, the range of rank’s variability is smaller than for prediction d. It covers variability from 15 to 18. For RMSE and EF, the ANN model obtained the best rank value—15. The worst ranks for prediction V were assigned to REG model for RMSE and EF. For ME, Kozak (2004) and REG turned out to be better than the ANN model. The ranks for prediction V assigned for Kozak (2004) and REG model was 15 in this case. The ANN model was slightly worse and obtained the rank—18.

5. Discussion

Choosing the best model describing the shape of a tree stem is a difficult task, especially when the choice is based on criteria related to a model’s prediction quality and utility. A model’s usefulness can be defined in various ways, but it is usually closely related to the purpose of the modeling [5]. For the models presented, the main aim was to select a model with the highest quality of predictions diameters and total volume calculated based on them as well as universality in terms of obtaining forecasts for different tree species and stands of various ages. In current studies on this subject that include the use of ANN, this last condition has seldom been kept. For example, Soares, Flores, Cabacinha, Carrijo, Veiga [36,37], using neural networks, applied the recursively series prediction method, which helped them limit the number of input variables to three. They developed this model, however, for one species, for trees of the same age obtained from clonal genetic material. In turn, Reis’ et al. [35] model was not verified in terms of the accuracy of tree volume’s estimation, and network input data based on plenty of previously prepared variables (e.g., competition index, forest class, etc.). In our opinion, such an approach limited the usefulness of the model. The model used by Nunes and Görgens [34] included as many as 72 deciduous tree species, classified, however, into three types and used as a qualitative variable. Due to the number of the species and the diversity of stem shapes in different sites, the information on the origin of a tree was crucial for the model. In studies by Castaño-Santamaría, Crecente-Campo, Fernández-Martínez, Barrio-Anta, Obeso [43], who predicted tree height based on their diameter in tree stands of various ages, neural networks gave worse results than did other methods, likely resulting from the instability in the learning process.

In terms of RMSE, our analyses indicate that artificial neural networks allowed for the most precise determination of a stem shape for all the species studied except for alder, for which the REG model was better. In terms of estimating tree volume, ANNs were the best for coniferous species and among deciduous ones only for birch (Betula pendula). The latter result may have to do with irregular morphological forms of deciduous species [23], whose shape is difficult to describe using a single function. However, the results obtained using neural networks were close enough to those obtained using the regression and Kozak (2004) models to suppose that they can improve, for example, after increasing the number of learning epochs or the size of the training set. Slightly higher errors were observed for regression models (REG) than for ANNs (except for alder). For both methods (ANN and REG), no systematic errors were found in the determination of diameters at different heights of a stem.

In terms of total volume estimation, Kozak’s model proved to be the best for oak and pine. Attempts to describe stem shape using the Kozak taper-equation model resulted in obtaining systematic errors for some sections of the stem, especially in its lower and middle parts. Similar results were obtained by Rojo and Perales [19]: Using Kozak’s taper-equation model to describe stem shape, they showed overestimation for bottom parts of the stem, which, according to the authors, may result from a lack of data for larger and/or older trees, whose stems have more neiloidal shapes in their lower parts. Other authors reported similar effects related to the occurrence of systematic errors when a single function was used to describestem curves: For example, in Li and Weiskittel [4], the Kozak model, being compared to other models based on one function, gave errors in the upper and middle parts of stems for red spruce. In the aforementioned studies, the Kozak model predicted better than to other models did for red spruce and white pine. In the study by Rojo and Prales [19], the Akaike criteria also indicated the Kozak model as the best choice for maritime pine in Galicia. We obtained similar results in our research in terms of the RMSE error only in terms of the determination of total volume for pine (Pinus sylvestris L.). At first sight the overall performance of all models seems to be similar. Although, the summarizing ranks (Table 4) in terms of RMSE and EF show that the ANN model was the best for prediction both diameter and volume. The other two models (Kozak 2004 and REG) had lowest bias. High rank for bias of ANN model was caused by result for Alder.

6. Conclusions

It can be concluded that by creating stem shape models according to the recommendations by MacFarlane and Weiskittel [23], that is, avoiding exclusion criteria for irregularly shaped trees and for various species and different age stands, the neural networks model gave the most precise results in terms of diameter prediction.This model’s predominance was also visible in the prediction of tree volume for all coniferous species and birch. The REG model was estimated with smaller systematic errors, but its disadvantage is that it does not guarantee to well represent a monotonic convergent shape of a tree, because the individual regression equations are independent of each other. Like the Kozak model (2004), the REG model cannot be improved, which means that the class of the function describing a stem diameter for a given height does not change. Despite these inconveniences and its simplicity, the REG model works well, especially for describing stem volume.

The neural model allows one to estimate a stem diameter at any height with high precision—just like the linear model—but its additional advantage is the possibility to improve it, for example, by increasing the training set, better matching the structure of the model to the shape topology of the species, changing the non-linear transition function, or changing the learning parameters. The analyses thus indicate that ANNs are a universal tool for constructing models of a stem shape and volume. They allow the construction of models with a very good fit to empirical data and without systematic errors at any part of the stem, at the same time allowing the determination of a diameter at any height of the tree. ANNs can therefore be used to build local models of a stem shape and volume, used in forest practice for forest inventory. In our opinion, further research should focus on optimizing the performance of ANN models.

Author Contributions

Conceptualization, J.S. and D.C.; Formal analysis, P.N.; Investigation, J.S. and D.C.; Methodology, P.N.; Software, P.N.; Writing—original draft, J.S., P.N. and D.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by grant number BIOSTRATEG1/267755/4/NCBR/2015.

Acknowledgments

This work was supported by the project REMBIOFOR “Remotesensing based assessment of woody biomass and carbon storage in forests”, financed by The National Centre for Research and Development in Poland under BIOSTRATEG program, agreement no. BIOSTRATEG1/267755/4/NCBR/2015.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Cushman, K.C.; Muller-Landau, H.C.; Condit, R.S.; Hubbell, S.P. Improving Estimates of Biomass Change in Buttressed Trees Using Tree Taper Models. Methods Ecol. Evol. 2014, 5, 573–582. [Google Scholar] [CrossRef]
Eggleston, S.; Buedia, L.; Miwa, K.; Ngara, T.; Tanabe, K. IPCC Guidelines for National Greenhouse Gas Inventories; Institute for Global Environmental Strategie: Hayama, Japan, 2006. [Google Scholar]
Bronisz, K.; Zasada, M. Comparison of Fixed- and Mixed-Effects Approaches to Taper Modeling for Scots Pine in West Poland. Forests 2019, 10, 975. [Google Scholar] [CrossRef]
Li, R.; Weiskittel, A.R. Comparison of Model Forms for Estimating Stem Taper and Volume in the Primary Conifer Species of the North American Acadian Region. Ann. For. Sci. 2010, 67, 302. [Google Scholar] [CrossRef]
Kozak, A.; Munro, D.D.; Smith, J.H.G. Taper Functions and Their Application in Forest Inventory. For. Chron. 1969, 45, 278–283. [Google Scholar] [CrossRef]
Sharma, M.; Oderwald, R.G. Dimensionally Compatible Volume and Taper Equations. Can. J. For. Res. 2001, 31, 797–803. [Google Scholar] [CrossRef]
Brink, C.; Gadow, K.V. On the use of growth and decay functions for modeling stem profiles. EDV Med. Biol. 1986, 17, 20–27. [Google Scholar]
Brook, J.R.; Jiang, L.C.; Ozcelik, R. Compatible stem volume and taper equations for Brutian pine, Cedar of Lebanon, and Cilicica fir in Turkey. For. Ecol. Manag. 2008, 256, 147–151. [Google Scholar] [CrossRef]
Burkhart, H.E. Segmented Polynomial Regression Applied to Taper Equations. For. Sci. 1976, 22, 283–289. [Google Scholar]
Cao, Q.V.; Wang, J. Calibrating fixed- and mixed-effects taper equations. For. Ecol. Manag. 2011, 262, 671–673. [Google Scholar] [CrossRef]
Clark, A.C.; Souter, R.A.; Schlaegel, B.E. Stem Profile Equations for Southern Tree Species; US Forest Service, Southeastern Forest Experiment Station: Asheville, NC, USA, 1991. [Google Scholar]
Bi, H. Trigonometric variable-form taper equations for Australian eucalypts. For. Sci. 2000, 46, 397–409. [Google Scholar]
Kozak, A. A variable-exponent taper equation. Can. J. For. Res. 1998, 18, 1363–1368. [Google Scholar] [CrossRef]
Kozak, A. My last words on taper equations. For. Chron. 2004, 80, 507–515. [Google Scholar] [CrossRef]
Newberry, J.D.; Burkhart, H.E. Variable-form stem profile models for loblolly pine. Can. J. For. Res. 1986, 16, 109–114. [Google Scholar] [CrossRef]
Newnham, R.M. Variable-form taper functions for four Alberta tree species. Can. J. For. Res. 1992, 22, 210–223. [Google Scholar] [CrossRef]
Riemer, T.; Gadow, K.V.; Sloboda, B. Ein model zurbeschreibung von Baumschäften. Allg. Und Jagdztg. 1995, 166, 144–147. [Google Scholar]
Sharma, M.; Zhang, S.Y. Variable-exponent taper equations for jack pine, black spruce, and balsam fir in eastern Canada. Can. J. For. Res. 2004, 198, 39–53. [Google Scholar] [CrossRef]
Rojo, A.; Perales, X.; Sánchez-Rodríguez, F.; Álvarez-González, J.G.; von Gadow, K. Stem Taper Functions for Maritime Pine (Pinus Pinaster Ait.) in Galicia (Northwestern Spain). Eur. J. For. Res. 2005, 124, 177–186. [Google Scholar] [CrossRef]
Socha, J.; Kulej, M. Provenance-dependent variability of Abies grandis stem form under mountain conditions of Beskid Sadecki (southern Poland). Can. J. For. Res. 2005, 35, 1–14. [Google Scholar] [CrossRef]
Socha, J. Estimation of the effect of stand density on scots pine stem form, Acta Scientiarum Polonorum Silv. Colendar. Rat. Ind. Lignar 2007, 6, 59–70. [Google Scholar]
Muhairwe, C.K. Tree form and taper variation over time for interior lodge- pole pine. Can. J. For. Res. 1994, 24, 1904–1913. [Google Scholar] [CrossRef]
MacFarlane, D.W.; Weiskittel, A.R. A New Method for Capturing Stem Taper Variation for Trees of Diverse Morphological Types. Can. J. For. Res. 2016, 46, 804–815. [Google Scholar] [CrossRef]
Özesmi, S.L.; Tan, C.O.; Özesmi, U. Methodological Issues in Building, Training, and Testing Artificial Neural Networks in Ecological Applications. Ecol. Model. 2006, 195, 83–93. [Google Scholar] [CrossRef]
Paruelo, J.M.; Tomasel, F. Prediction of Functional Characteristics of Ecosystems A Comparison of Artificial Neural Networks and Regression Models. Ecol. Model. 1997, 98, 173–186. [Google Scholar] [CrossRef]
Hornik, K.; Stinchcombe, M.; White, H. Multilayer Feedforward Networks Are Universal Approximators. Neural Netw. 1989, 2, 359–366. [Google Scholar] [CrossRef]
Peng, C.; Wen, X. Recent Applications of Artificial Neural Networks in Forest Resource Management an Overview Applications in Forest Resource Management; AAAI Technical Report WS-99-07; Association for the Advancement of Artificial Intelligence: Palo Alto, CA, USA, 1999. [Google Scholar]
Friedman, J.H. On bias, variance, 0/1-loss and the curse-of-dimensionality. Data Min. Knowl. Discov. 1997, 1, 55–77. [Google Scholar] [CrossRef]
Geman, S.; Bienenstock, E.; Doursat, R. Neural networks and the bias/variance dilemma. Neural Comput. 1992, 4, 1–58. [Google Scholar] [CrossRef]
Kohavi, R. A study of cross-validation and bootstrap for estimation and model selection. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, Montreal, QC, Canada, 20–25 August 1995; pp. 1137–1143. [Google Scholar]
Diamantopoulou, M.J.; Özçelik, R.; Crecente-Campo, F.; Eler, Ü. Estimation of Weibull Function Parameters for Modelling Tree Diameter Distribution Using Least Squares and Artificial Neural Networks Methods. Biosyst. Eng. 2015, 133, 33–45. [Google Scholar] [CrossRef]
Imada, A. A Literature Review: Forest Management A Literature Review: Forest Management with Neural Network and Artificial Intelligence. Commun. Comput. Inf. Sci. 2014, 440, 9–21. [Google Scholar]
Monteiro da Silva, E., Jr.; Maia, R.D.; Cabacinha, C.D. Bee-Inspired RBF Network for Volume Estimation of Individual Trees. Comput. Electron. Agric. 2018, 152, 401–408. [Google Scholar] [CrossRef]
Nunes, M.H.; Görgens, E.B. Artificial Intelligence Procedures for Tree Taper Estimation within a Complex Vegetation Mosaic in Brazil. PLoS ONE 2016, 11, e0154738. [Google Scholar] [CrossRef]
Reis, L.P.; de Souza, A.L.; Mazzei, L.; Marques dos Reis, P.C.; Leite, H.G.; Soares, C.P.B.; Torres, C.M.M.E.; da Silva, L.F.; Ruschel, A.R. Prognosis on the Diameter of Individual Trees on the Eastern Region of the Amazon Using Artificial Neural Networks. For. Ecol. Manag. 2016, 382, 161–167. [Google Scholar] [CrossRef]
Soares, F.A.A.M.N.; Flores, E.L.; Cabacinha, C.D.; Carrijo, G.A.; Veiga, A.C.P. Recursive Diameter Prediction and Volume Calculation of Eucalyptus Trees Using Multilayer Perceptron Networks. Comput. Electron. Agric. 2011, 78, 19–27. [Google Scholar] [CrossRef]
Soares, F.A.A.M.N.; Flores, E.L.; Cabacinha, C.D.; Carrijo, G.A.; Veiga, A.C.P. Recursive Diameter Prediction for Calculating Merchantable Volume of Eucalyptus Clones Using Multilayer Perceptron. Neural Comput. Appl. 2013, 22, 1407–1418. [Google Scholar] [CrossRef]
Kozak, A.; Smith, J. Standards for evaluating taper estimating systems. For. Chron. 1993, 69, 438–444. [Google Scholar] [CrossRef]
Kilkki, P.; Varmola, M. A nonlinear simultaneous equation model to determine taper curve. Silva Fenn. 1979, 4, 293–303. [Google Scholar] [CrossRef][Green Version]
Poudel, P.P.; Temesgen, H.; Gray, N.A. Estimating upper stem diameters and volume of Douglas-fir and Western hemlock trees in the Pacific northwest. For. Ecosyst. 2018, 5, 16. [Google Scholar] [CrossRef]
Kosma, Z. Metody numeryczne dla zastosowan inzynierskich. In Numerical Methods for Engineering Use; Politechnika Radomska: Radom, Poland, 1999. (In Polish) [Google Scholar]
Silipo, R. Neural Networks. In Intelligent Data Analysis An Introduction; Berthold, M., Hand, D.J., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; pp. 269–320. [Google Scholar]
Castaño-Santamaría, J.; Crecente-Campo, F.; Fernández-Martínez, J.L.; Barrio-Anta, M.; Obeso, J.R. Forest Ecology and Management Tree Height Prediction Approaches for Uneven-Aged Beech Forests in Northwestern Spain. For. Ecol. Manag. 2013, 307, 63–73. [Google Scholar]

Figure 1. Map of the locations of sample plots. On each sample plot 8 trees have been cut for sectional measurement.

Figure 2. From left to right: (A) the stem taper equation (STE) model, (B) the regression tree (REG) model, and (C) the artificial neural network (ANN) model.

Figure 3. Residuals versus fitted values for Kozak’s model for pine.

Figure 4. Residuals versus fitted values for the REG model for pine.

Figure 5. Residuals versus fitted values for the ANN model for pine.

Table 1. The number of trees in the subsets after data filtering.

Species	Total	In a Training Set	In a Testing Set
Silver birch	234	139	95
Common beech	430	257	173
Common oak	479	286	193
Silver fir	219	130	89
European larch	262	156	106
Black alder	270	161	109
Scots pine	504	301	203
Norway spruce	458	274	184
Total	2856

Table 2. Summary statistics of diameter at breast height (dbh) and total tree height (H) for different tree species.

Species	dbh (cm)				H (m)
	min	max	avg	std	min	max	avg	std
Silver birch	2.10	51.00	20.81	10.50	5.13	38.30	20.90	6.69
Common beech	0.50	66.25	26.36	16.10	2.68	42.05	23.31	10.33
Common oak	16.00	79.20	28.77	14.68	2.75	36.40	22.21	7.38
Silver fir	0.70	66.85	26.08	15.26	1.58	37.50	20.96	9.30
European larch	0.50	68.60	31.40	13.44	2.73	40.70	25.85	7.99
Black alder	3.15	46.00	20.75	9.64	6.66	31.27	20.59	6.36
Scots pine	0.30	65.15	25.96	11.75	1.35	35.21	22.43	7.42
Norway spruce	0.80	76.40	26.45	13.85	1.75	39.54	22.55	9.03

Table 3. Summary statistics of stem diameters (d) and volume (V) estimation using three methods for different tree species.

	d			V
	RMSE	ME	EF	RMSE	ME	EF
Artificial neural network model (ANN)
Birch	1.5819	0.0115	0.9696	0.1062	0.0083	0.9517
Beech	2.6475	−0.2433	0.9643	0.2817	0.0551	0.9498
Oak	1.8229	0.1431	0.9812	0.1245	−0.0150	0.9861
Fir	1.7000	0.3868	0.9819	0.1687	−0.0355	0.9761
Larch	1.3573	0.1446	0.9887	0.1137	−0.0054	0.9904
Alder	1.5848	−0.6322	0.9707	0.1111	0.0481	0.9488
Pine	1.1928	0.2328	0.9865	0.0970	−0.0224	0.9799
Spruce	1.5930	0.1525	0.9787	0.1631	−0.0195	0.9649
Average	1.6850	0.0245	0.9777	0.1457	0.0017	0.9685
Regression set model (REG)
Birch	1.6394	0.0253	0.9674	0.1084	0.0127	0.9497
Beech	2.6596	0.1650	0.9640	0.2638	0.0431	0.9560
Oak	1.8651	−0.2855	0.9803	0.1279	−0.0274	0.9853
Fir	1.7405	−0.1368	0.9810	0.1696	−0.0072	0.9759
Larch	1.4105	−0.1825	0.9878	0.1395	−0.0159	0.9856
Alder	1.4277	0.0025	0.9762	0.0796	0.0051	0.9737
Pine	1.2170	−0.1370	0.9859	0.0957	−0.0104	0.9804
Spruce	1.8815	−0.1215	0.9703	0.2121	−0.0162	0.9407
Average	1.7302	−0.0838	0.9766	0.1496	−0.0020	0.9684
Variable exponent taper equation model Kozak (2004)
Birch	2.9160	0.0773	0.9646	0.1097	0.0157	0.9485
Beech	8.0456	0.3021	0.9590	0.2706	0.0424	0.9537
Oak	3.9225	−0.1914	0.9778	0.1221	−0.0224	0.9866
Fir	3.3972	−0.1200	0.9787	0.1784	−0.0135	0.9733
Larch	2.5084	−0.1402	0.9846	0.1271	−0.0087	0.9880
Alder	2.4097	0.0847	0.9719	0.0814	0.0073	0.9725
Pine	1.8411	−0.1064	0.9825	0.0925	−0.0114	0.9817
Spruce	3.6950	−0.0484	0.9690	0.2045	−0.0137	0.9449
Average	3.5919	−0.0178	0.9735	0.1483	−0.0005	0.9686

Table 4. Ranks of estimation errors for the diameters and volume for the three models on testing sets.

Model	RMSE	ME	EF
	Diameters at relative heights
ANN	9	18	10
Kozak (2004)	24	14	23
REG	15	16	15
	Tree volume
ANN	15	18	15
Kozak (2004)	16	15	16
REG	17	15	17

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Socha, J.; Netzel, P.; Cywicka, D. Stem Taper Approximation by Artificial Neural Network and a Regression Set Models. Forests 2020, 11, 79. https://doi.org/10.3390/f11010079

AMA Style

Socha J, Netzel P, Cywicka D. Stem Taper Approximation by Artificial Neural Network and a Regression Set Models. Forests. 2020; 11(1):79. https://doi.org/10.3390/f11010079

Chicago/Turabian Style

Socha, Jaroslaw, Pawel Netzel, and Dominika Cywicka. 2020. "Stem Taper Approximation by Artificial Neural Network and a Regression Set Models" Forests 11, no. 1: 79. https://doi.org/10.3390/f11010079

APA Style

Socha, J., Netzel, P., & Cywicka, D. (2020). Stem Taper Approximation by Artificial Neural Network and a Regression Set Models. Forests, 11(1), 79. https://doi.org/10.3390/f11010079

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Stem Taper Approximation by Artificial Neural Network and a Regression Set Models

Abstract

1. Introduction

2. Data and data Preprocessing

3. Methods

3.1. Models

3.2. Methods Used in Models

4. Results

4.1. Diameter Estimates

4.2. Stem Volume Estimates

4.3. The Models Ranking

5. Discussion

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI