Next Article in Journal
Upregulation of Placental Vitamin D Receptor Expression in Gestational Diabetes Is Not Directly Related to Vitamin D Concentration
Previous Article in Journal
Characterization of the Complete Mitogenome of Polypedates braueri (Anura, Rhacophoridae, Polypedates) and Insights into the Phylogenetic Relationships of Rhacophoridae
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Provenance-Specific Height–Diameter Modeling for Chinese Fir: A Clustered Mixed-Effects Approach

1
College of Forestry, Fujian Agriculture and Forestry University, Fuzhou 350002, China
2
Chinese Fir Engineering Research Center of National Forestry and Grassland Administration, Fuzhou 350002, China
*
Author to whom correspondence should be addressed.
Biology 2025, 14(9), 1301; https://doi.org/10.3390/biology14091301
Submission received: 24 July 2025 / Revised: 5 September 2025 / Accepted: 18 September 2025 / Published: 20 September 2025
(This article belongs to the Section Ecology)

Simple Summary

Chinese fir (Cunninghamia lanceolata), extensively planted across southern China, exhibits marked growth pattern divergence among provenances from different regions, complicating stand-level growth predictions. To address this variation, we analyzed four repeated measurements from successive ages in a provenance trial at Zhangping Wuyi Forest Farm, Fujian Province. Using cluster analysis, provenances were grouped by height, diameter at breast height (DBH), or their combination; these clusters were then incorporated as random effects into height–diameter (H-D) models, with tree age added as a fixed predictor. Height-based clustering yielded the most significant improvements in model fit and predictive accuracy. Further enhancements were achieved when age parameters preceded the integration of height-clustering random effects. The optimized model provides a robust tool for forecasting growth in mixed-provenance Chinese fir stands, enabling informed silvicultural decisions to support sustainable forest management.

Abstract

Chinese fir is the predominant afforestation species in southern China, exhibiting distinct provenances due to long-term climatic adaptation. This study utilized data from four surveys conducted at different ages in a provenance trial forest at Zhangping Wuyi Forest Farm, Fujian Province, to classify Chinese fir provenances using cluster analysis based on growth metrics. The resulting clusters were integrated as random effects into height–diameter models. Model performance was enhanced by incorporating age parameters and validated through five-fold cross-validation. The findings reveal that: (1) the Logistic model best captured the fundamental height–diameter relationship of Chinese fir; (2) the inclusion of provenance-clustering random effects improved model fit and predictive accuracy, with height-based clustering outperforming other methods; (3) the addition of age parameters further refined the base models beyond the clustering effects, and the combination of both approaches achieved the highest precision. Among clustering techniques, height-based clustering surpassed combined height–diameter at breast height (DBH) clustering, while DBH-based clustering was the least effective. The developed models facilitate precise growth predictions for multi-provenance Chinese fir across extensive geographic ranges, offering a theoretical basis for provenance-specific management.

1. Introduction

Populations of the same tree species often develop distinct ecogeographic types due to prolonged habitat isolation, reproductive barriers, and natural selection, resulting in significant variation in growth traits and environmental adaptability [1]. For instance, Alexandru et al. [2] identified divergent climatic adaptations among Norway spruce (Picea abies) provenances, while Jožica et al. [3] demonstrated substantial variation in annual ring width of European beech (Fagus sylvatica) provenances. Chinese fir, a vital fast-growing timber species in southern China, is extensively cultivated in plantations for its rapid growth and high-quality wood [4]. Extensive provenance trials have demonstrated that genetic background and habitat heterogeneity collectively contribute to significant genetic variation in physiological traits and growth metrics, such as height, DBH, and volume, across Chinese fir provenances [5,6,7].
Tree height and DBH are fundamental mensurational variables for assessing stand structure and are critical for estimating volume and biomass [8,9,10]. Height–diameter models are indispensable tools in forest resource inventory and management, facilitating predictions of growth dynamics, biomass and carbon estimation, and informed stand optimization [11]. Traditional models, such as Logistic [12], Richards [13], and Weibull [14], describe biological height–diameter relationships through global data fitting. However, these relationships are influenced not only by species-specific traits but also by site conditions [15], competition [16], silvicultural practices [17], and climate change [18], rendering single-growth curves insufficient. Consequently, recent studies have refined these models by incorporating site indices [19,20], competition indices [21], climatic factors [22,23], and silvicultural practices [24]. Mixed-effects models further enhance fit and predictive accuracy by accounting for both population trends and individual variations, achieving greater precision while maintaining parsimony, and have thus gained widespread adoption [25,26].
Current height–diameter models primarily operate at the species level, capturing growth variations across stands and environments but often overlooking provenance-specific influences on growth trajectories at broader scales [27,28,29]. Different provenances may exhibit distinct allometric height–diameter patterns under identical conditions, and ignoring these differences can undermine prediction accuracy. For example, Buford et al. [30] adjusted site index models to account for provenance effects, while Sharma [31] documented significant diameter growth variation in red pine (Pinus resinosa) across Canadian regions, highlighting notable provenance differences. Therefore, developing multi-provenance growth models necessitates explicit consideration of provenance divergence and a scientific approach to identifying growth heterogeneity.
Cluster analysis, an unsupervised machine learning technique, provides an objective method to identify distinct growth patterns among provenances based on key traits such as height and DBH, thereby minimizing subjective bias. For instance, previous work has successfully grouped Chinese fir provenances by age class using K-medoids clustering [32] and by similar temperature–humidity responses [33]. These studies underscore the value of cluster analysis in grouping provenances with comparable growth patterns. However, current clustering applications primarily inform provenance selection for afforestation, with limited integration into mechanistic growth models. A deeper investigation into provenance-specific growth patterns holds promise for developing more robust predictive methodologies. Therefore, this study hypothesizes that objectively grouping provenances with similar growth characteristics using cluster analysis can effectively capture this variation. By incorporating these data-driven provenance groups as a random effect in mixed-effects models, we aim to significantly improve the accuracy of height–diameter predictions.
Our primary objective is to evaluate how integrating provenance clustering impacts H-D model predictive performance. Using long-term data from the provenance trial, we developed and tested a clustered mixed-effects model, in which provenance groups identified via cluster analysis were treated as a random effect to account for inherent growth variations. The ultimate goal of this approach is to develop a more accurate, provenance-aware modeling framework, providing theoretical support for precise management and selection of Chinese fir provenances.

2. Materials and Methods

2.1. Study Area

The study was conducted at Zhangping Wuyi Forest Farm in Fujian Province, China (117°19′–117°38′ E, 25°06′–25°17′ N). The site is located in a subtropical monsoon climate zone characterized by distinct seasons, warm temperatures, and high humidity, with a mean annual temperature ranging from 18 to 20 °C. The mean elevation of the area is 425 m, and soils are predominantly classified as Acrisols (yellow-red soil variant). The forest farm has a standing stock volume of 2.722 million m3 and a forest coverage of 91.7%, with plantations of Chinese fir and Masson pine (Pinus massoniana) being the dominant forest types.

2.2. Data

Data were obtained from a Chinese fir provenance trial established in 1985 at the Zhangping Wuyi Forest Farm. Figure 1 shows the location of the experimental site and the origin of the provenances. The detailed geographic coordinates and climatic factors for each provenance origin are provided in Appendix A Table A1. Field inventories were conducted in 1989, 1990, 2011, and 2023. For each provenance, ten sample trees of Chinese fir in good condition (i.e., without significant stem curvature) were selected. Following the removal of outliers, a total of 5088 raw datasets encompassing tree height and DBH were acquired (Figure 2).

2.3. Cluster Analysis

Growth variations among Chinese fir provenances, resulting from long-term environmental adaptation, exhibit dynamic patterns across developmental stages. This study employed K-means clustering to group provenances, aiming to identify assemblages of provenances exhibiting similar growth phenotypic responses (e.g., DBH, height). Critically, this grouping reflects phenotypic convergence arising from long-term environmental adaptation interacting with genetic backgrounds, rather than constituting a pure genetic provenance classification. For instance, geographically distant provenances may demonstrate comparable growth patterns due to adaptation to analogous environments, consequently being grouped together. Three growth characteristics, namely DBH, height, and their combined metric, were employed to group provenances in this study.

2.4. Height–Diameter Model Development

2.4.1. Base Model

Nonlinear models are generally more effective at capturing the biological relationship between tree height and diameter than linear models. Therefore, six biologically meaningful nonlinear models were selected as candidates: Richards, Logistic, Gompertz [34], Curtis [35], Meyer, and Wykoff [36] (Table 1). These six base models have demonstrated superior performance in other foundational model selection studies [37,38,39,40] and represent a range of curve shapes, including sigmoidal and asymptotic, allowing for a comprehensive evaluation of the underlying H-D relationship. The best-performing base model was selected based on a comprehensive evaluation of their fit statistics.

2.4.2. Mixed-Effects Model

A nonlinear mixed-effects modeling approach was employed to account for the hierarchical structure of the data (i.e., multiple trees nested within each provenance cluster). This approach allows us to model both the overall population average trend (fixed effects) and the specific deviations of each provenance cluster from that trend (random effects), thereby accommodating the inherent correlation among measurements within the same cluster and improving model accuracy [25]. The model structure is expressed as
H i j = f β , μ i , D i j + ε i j
μ i ~ N ( 0 , G ) ε i j ~ N ( 0 , R i j )
where H i j and D i j are the height and DBH, respectively; the j-th sample tree is the i-th provenance cluster; f β , μ i , D i j is the nonlinear function defined by the selected base model; β is the vector of fixed-effects parameters; μ i is the vector of random effects for the i-th provenance cluster; G is the variance–covariance matrix for the random effects; ε i j is the within-group random error term; R i j is the variance–covariance matrix for the within-group error. The variance–covariance matrix is used to describe the variances and pairwise covariances of multiple random variables. It quantifies both the magnitude of differences introduced by different provenance clusters (variance) and whether these differences are correlated across various model parameters (covariance).
To determine the optimal mixed-effects model, random effects were systematically incorporated into every parameter of the selected base model, both individually and in combination. The final model structure was chosen by comparing the goodness-of-fit statistics of these different parameter combinations.

2.5. Model Evaluation and Validation

Model performance was assessed using two-fold cross-validation. The Akaike Information Criterion (AIC), Bayesian Information Criterion (BIC), and −2 × log-likelihood (−2LogL) were used to evaluate model fit, with lower values indicating a better fit.
The predictive accuracy of the models was evaluated using the fit index (R2), mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE). Higher R2 values and lower values for MAE, MAPE, and RMSE indicate superior model precision and predictive performance. The formulas are as follows:
A I C = 2 p 2 l n ( L )
B I C = 2 ln L + p l n ( n )
2 L o g L = 2 ln L
M A E = i = 1 n H ^ i H i n
M A P E = 100 % i = 1 n H ^ i H i H i n
R M S E = i = 1 n H ^ i H i n
R 2 = 1 i = 1 n ( H i H ^ i ) 2 i = 1 n ( H i H ¯ i ) 2
where p is the number of model parameters, L is the maximum value of the likelihood function, n is the total number of sample trees, and H ^ i , H i , and H ¯ are the observed, predicted, and mean observed heights for the i-th tree, respectively.

2.6. Methodology Flowchart

A methodological flowchart was created to enhance understanding of the study design (Figure 3).

2.7. Statistical Software

All statistical analyses were conducted using R software (version 4.3.2). The K-means clustering was performed using the cluster package (version 2.1.4). The nonlinear mixed-effects models were fitted using the nlme package (version 3.1.162). Figures were generated using the ggplot2 package (version 3.4.4).

3. Results

3.1. Cluster Analysis Outcomes

K-means clustering was performed on the provenance data using tree height, DBH, and a combined height–DBH metric as the classification variables. For all three approaches, the elbow method consistently identified an optimal cluster number of 3. Consequently, the provenances were classified into three distinct groups for each clustering scheme.

3.2. Base Model Selection

Six nonlinear H-D models (Logistic, Richards, Gompertz, Curtis, Meyer, and Wykoff) were evaluated as potential base models. All estimated parameters were statistically significant (p < 0.001). The Logistic model yielded the lowest values for all fit statistics (AIC = 1616.889, BIC = 1633.642, and −2LL = 1608.889), as shown in Table 2. Therefore, it was selected as the base model for subsequent mixed-effects modeling of Chinese fir.

3.3. Mixed-Effects Models with Different Clustering Approaches

Random effects derived from the three clustering approaches (DBH-based, height-based, and combined) were sequentially incorporated into the Logistic base model. When a single random parameter was added, the greatest model improvement (i.e., lowest fit metrics) was observed with random effects on parameter c for height-based and combined clustering, and on parameter b for DBH-based clustering. When two random parameters were added, all fit metrics improved further. The optimal two-parameter combination was a and b for height-based and combined clustering, and b and c for DBH-based clustering. Models with three random parameters failed to converge.
As shown in Table 3, all mixed-effects models outperformed the base model (M1). Among the clustering methods, height-based clustering (M3) yielded the most substantial reduction in fit statistics, while DBH-based clustering (M2) resulted in the least improvement.

3.4. Mixed-Effects Models Incorporating Age and Clustering

The growth pattern of Chinese fir changes with age, exhibiting distinct growth patterns at different ages, and its classification category may shift accordingly. To develop a dynamic H-D model that accounts for these age-dependent relationships while also incorporating provenance-level variation, an age parameter was first incorporated into the base model to create a fixed-effects model (M5) accounting for age-dependent height–diameter relationships. This addition reduced all fit metrics by an average of 18.6% compared to the base model (M1). Subsequently, mixed effects from the three clustering approaches were introduced into M5 by adding random effects (Table 4).
The inclusion of clustering-based random effects further improved model performance, maintaining a consistent hierarchy: height-based clustering provided the greatest enhancement, followed by combined clustering, with DBH-based clustering yielding the smallest improvement. Specifically, the model incorporating both age and height-based clustering (M7) achieved the most substantial improvement, with an average reduction of 60.6% in fit metrics relative to the original base model (M1).

3.5. Model Validation

The validation metrics for all developed height–diameter models are presented in Table 5. Model M7, which integrates both age and height-based mixed effects, was identified as the optimal model, ranking first across all validation indices. Models that included the age parameter (M5–M8) consistently outperformed those without it. Furthermore, all mixed-effects models demonstrated superior predictive accuracy compared to their fixed-effects counterparts, with height-based clustering models (M3 and M7) showing the highest accuracy. Table 6 lists the final parameter estimates for the optimal mixed-effects model (M7) fitted to the full dataset.

3.6. Model Performance

Figure 4 illustrates the prediction residuals by each age for four key models: the base model (M1), the height-clustering mixed model (M3), the base model incorporating age (M5), and the final model incorporating age and height-clustering (M7). Compared to the M1 (Figure 4a), all other models exhibited residual medians closer to zero. The inclusion of height-based clustering effects (Figure 4b) significantly reduced the dispersion of residuals. Both models incorporating age (Figure 4c,d) displayed stable residual medians near zero across all ages, indicating improved symmetry and reliability. While M7 demonstrated the best overall performance, its prediction errors tended to increase with tree age. A closer comparison reveals that at younger ages (5–6 years), M5 had a smaller residual range than M7; conversely, at later growth stages (27–39 years), M7 showed tighter residual distributions, indicating superior stability for mature trees.
The height growth curves predicted by the optimal model (M7) for each of the three provenance clusters are displayed in Figure 5. The distinct trajectories for each cluster underscore the importance of provenance classification in accurately modeling the height–diameter relationship in Chinese fir.

4. Discussion

This study successfully demonstrates that integrating provenance clustering into mixed-effects models significantly enhances the predictive accuracy of height–diameter relationships in Chinese fir. Our approach addresses a critical challenge in forestry, where genetic divergence among provenances can undermine the reliability of generalized growth models [27,28]. By objectively grouping provenances based on phenotypic growth traits, we developed a model that is more sensitive to these inherent variations. This improved accuracy translates directly into a practical tool for forest managers, enabling them to tailor silvicultural prescriptions (e.g., thinning schedules, rotation age) to specific provenance groups and thereby enhance overall stand productivity and value.

4.1. Rationale for Base Model Selection

The six candidate models selected have demonstrated robust performance in extensive previous studies. In this study, the Logistic model exhibited the best results, a finding consistent with some research, such as that by Lumbres et al. [38]. However, the choice of the optimal base model is often species- and site-specific; for instance, Ng’andwe et al. [39] found the Weibull model to be superior, while Patrício et al. [40] reported the Richards model as the top performer. The Logistic model’s success in our context is likely due to its biologically interpretable parameters. Its ability to define distinct growth phases—slow initial increase, rapid acceleration, and asymptotic stabilization—aligns well with the life history of Chinese fir. This biological interpretability facilitates clear comparisons of growth strategies across provenances and is a key contributor to its superior performance.

4.2. Superiority of Height-Based Clustering

A key finding of this study was the superior performance of height-based clustering. This underscores the inconsistencies in height–diameter relationships among provenances and highlights the differential efficacy of clustering approaches [41]. The superiority of height-based clustering is likely because maximum height is strongly constrained by site conditions and, in turn, heavily influences H-D allometry [42,43]. As a proxy for site quality, height helps capture provenance-specific adaptations to local edaphic conditions.
Furthermore, height growth is a more direct indicator of a provenance’s adaptation to its native climate. It is primarily governed by apical meristem activity—a process driven by genetic potential and macro-climatic factors such as temperature and photoperiod [23]. In contrast, diameter growth, which is governed by cambial activity, is more susceptible to local stand conditions, including density and resource availability. Consequently, height growth more effectively captures the fundamental climatic adaptations of a provenance, justifying the superior performance of this clustering approach.

4.3. Importance of Age for Dynamic Modeling

The inclusion of age as a covariate was crucial for transforming a static H-D relationship into a dynamic model that predicts growth trajectories over time. Our results revealed that the models exhibited lower accuracy at younger ages but superior performance as the stand matured. This discrepancy can be attributed to the higher phenotypic plasticity characteristic of early developmental stages [44]. As the stand ages, genetic control over growth traits strengthen, and heritability increases and stabilizes, as demonstrated by Wu et al. [32] for Chinese fir. This increasing stability of genetically determined growth patterns with age explains why our final model, which accounts for both age and provenance, achieved its highest accuracy in mature trees.

4.4. Specification of Random Effects in Mixed Models

Consistent with prior studies [25,26], our clustered mixed-effects models significantly outperformed the base model. While strategies for specifying random effects vary, our analysis revealed that models with two random parameters on the asymptote (a) and a shape parameter (b) performed optimally. This finding is biologically significant: it suggests that the primary differences among the provenance clusters are their maximum growth potential (asymptote) and the rate at which they achieve it (growth curve shape). This aligns with the work of Özçelik et al. [20] and confirms that the optimal allocation of random effects is not merely a statistical choice but can reveal underlying biological differentiation.

4.5. Limitations and Future Directions

While our approach is robust, its application is currently limited by the use of data from a single provenance trial. Because provenances exhibit different growth responses when planted in different environments [45], future research should incorporate data from a network of trials across diverse geographic regions to enhance the model’s broader applicability.
Beyond expanding the dataset, future model enhancements could involve incorporating additional explanatory variables. Integrating data on the climate and geography of the provenances’ origins could provide a more comprehensive understanding of the drivers of growth differentiation. Furthermore, including explicit measures of stand competition and dominant height as fixed effects represents a promising avenue for improving predictive accuracy [46,47]. Methodologically, machine learning algorithms offer powerful alternatives for both the initial provenance classification and the final regression modeling [48,49]. Future research could therefore focus on developing hybrid models that integrate machine learning classifiers with a broader set of ecological indicators to advance the next generation of predictive H-D models.

5. Conclusions

To address the significant growth divergence among Chinese fir provenances, this study incorporated cluster analysis of key growth traits as a random effect within mixed-effects models. This approach substantially enhanced both model fit and predictive accuracy for height–diameter relationships across these provenances. Notably, height-based clustering yielded optimal improvements. While introducing tree age improved performance beyond the initial cluster-based models, the highest precision was achieved by combining age with cluster-derived random effects. Here again, height-based clustering proved superior. This approach establishes an effective methodology for precise growth modeling in multi-provenance Chinese fir stands and provides a theoretical foundation for efficient, provenance-specific forest management.

Author Contributions

Conceptualization, X.W. and S.S.; data curation, Y.W., Y.L. and W.C.; methodology and software, X.W. and Y.W.; visualization, Y.L. and W.C.; writing—original draft preparation, X.W.; writing—review and editing, M.L. and S.S.; funding acquisition, S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China project (grant number 32201558).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Geographic coordinates and environmental variables for the provenance origins of Chinese fir.
Table A1. Geographic coordinates and environmental variables for the provenance origins of Chinese fir.
ProvinceProvenance OriginNumber of
Provenances
Longitude (°N)Latitude (°E)Mean Annual
Temperature (°C)
Mean Annual
Precipitation (mm)
AnhuiQimen429.85117.7215.61781
Qianshan230.63116.5816.31336
Xiuning829.79118.1917.01750
FujianSanming726.27117.6319.21700
Pucheng427.92118.5413.7512
Yangkou326.80117.9118.51880
GuangdongLiannan424.73112.2919.51670
Xinyi322.35110.9522.42012
GuangxiLiuzhou624.33109.4320.51500
Liuwanshan122.53109.8621.51900
Naping222.93108.1118.01350
Zhenglong223.03109.2521.51477
GuizhouJinping426.68109.2016.71280
Rongjiang425.93108.5218.11190
HubeiEnshi430.3109.4816.31500
Tongshan329.61114.4817.01450
Yichang230.69111.2917.01175
HunanHuaihua826.69109.4616.81400
JiangxiGanzhou225.51114.7018.91500
Quannan224.82114.4517.5701
Ruijin125.66115.9819.01652
Yichun828.21114.9717.61642
SichuanHongya229.92103.3716.61370
Jianwei429.21103.9517.51146
Guxian430.97103.9316.11200
ZhejiangLinan730.23119.7217.91630
Longquan828.07119.1417.81650

References

  1. Rehfeldt, G.E.; Leites, L.P.; Clair, J.B.S.; Jaquish, B.C.; Sáenz-Romero, C.; López-Upton, J.; Joyce, D.G. Comparative Genetic Responses to Climate in the Varieties of Pinus ponderosa and Pseudotsuga menziesii: Clines in Growth Potential. For. Ecol. Manag. 2014, 324, 138–146. [Google Scholar] [CrossRef]
  2. Alexandru, A.M.; Mihai, G.; Stoica, E.; Curtu, A.L. Tree Resilience Indices of Norway Spruce Provenances Tested in Long-Term Common Garden Experiments in the Romanian Carpathians. Plants 2024, 13, 2172. [Google Scholar] [CrossRef]
  3. Gričar, J.; Arnič, D.; Krajnc, L.; Peter, P.; Božič, G.; Westergren, M.; Mátyás, C.; Kraigher, H. Different Patterns of Inter-Annual Variability in Mean Vessel Area and Tree-Ring Widths of Beech from Provenance Trials in Slovenia and Hungary. Trees 2024, 38, 179–195. [Google Scholar] [CrossRef]
  4. Yu, X. Cunninghamia lanceolata Cultivation; Fujian Science and Technology Publishing House: Fuzhou, China, 1997. (In Chinese) [Google Scholar]
  5. Gong, X.; Wan, Z.; Jin, P.; Jin, S.; Li, X. Drought-Driven Divergence in Photosynthetic Performance Between Two Cunninghamia anceolata Provenances: Insights from Gas Exchange and Chlorophyll Fluorescence Dynamics. Plants 2025, 14, 1487. [Google Scholar] [CrossRef]
  6. Wu, P.; Tigabu, M.; Ma, X.; Odén, P.C.; He, Y.; Yu, X.; He, Z. Variations in Biomass, Nutrient Contents and Nutrient use Efficiency Among Chinese Fir Provenances. Silvae Genet. 2011, 60, 95–105. [Google Scholar] [CrossRef]
  7. Xu, T.; Niu, X.; Wang, B. Provenance Differences and Factors Influencing Transpiration of Cunninghamia lanceolata in a Common Garden Experiment. Front. Plant Sci. 2025, 16, 1515534. [Google Scholar] [CrossRef] [PubMed]
  8. Dorado, F.C.; Diéguez-Aranda, U.; Anta, M.B.; RodríguezM, S.; Gadow, K. A Generalized Height-Diameter Model Including Random Components for Radiata pine Plantations in Northwestern Spain. For. Ecol. Manag. 2006, 229, 202–213. [Google Scholar] [CrossRef]
  9. Gómez-García, E.; Diéguez-Aranda, U.; Castedo-Dorado, F.; Crecente-Campo, F. A Comparison of Model Forms for the Development of Height-Diameter Relationships in Even-Aged Stands. For. Sci. 2014, 60, 560–568. [Google Scholar] [CrossRef]
  10. Rupšys, P. Height-Diameter Models with Stochastic Differential Equations and Mixed-Effects Parameters. J. For. Res. 2015, 20, 9–17. [Google Scholar] [CrossRef]
  11. Baia, A.L.P.; Nascimento, H.E.M.; Guedes, M.; Hilário, R.; Toledo, J.J. Tree Height-Diameter Allometry and Implications for Biomass Estimates in Northeastern Amazonian Forests. PeerJ 2025, 13, 18974. [Google Scholar] [CrossRef]
  12. Pearl, R.; Reed, L.J. On the Rate of Growth of the Population of the United States since 1790 and its Mathematical Representation. Proc. Natl. Acad. Sci. USA 1920, 6, 275–288. [Google Scholar] [CrossRef]
  13. Richards, F.J. A Flexible Growth Function for Empirical Use. J. Exp. Bot. 1959, 10, 290–300. [Google Scholar] [CrossRef]
  14. Weibull, W. A Statistical Distribution Function of Wide Applicability. J. Appl. Mech. 1951, 18, 293–297. [Google Scholar] [CrossRef]
  15. Kuehne, C.; Maleki, K.; Merlin, M.; Granhus, A. Interactive Effects of Species Composition, Site Quality, and Drought on Growth Dynamics of Norway Spruce and Scots pine Stands in Norway. For. Ecol. Manag. 2025, 590, 122804. [Google Scholar] [CrossRef]
  16. Wang, J.; Wang, Y.; Zhang, Z.; Wang, W.; Jiang, L. Enhanced Awareness of Height-Diameter Allometry in Response to Climate, Soil, and Competition in Secondary Forests. For. Ecol. Manag. 2023, 548, 121386. [Google Scholar] [CrossRef]
  17. Pukkala, T.; Lähde, E.; Laiho, O. Optimizing the Structure and Management of Uneven-Sized Stands of Finland. For. Int. J. For. Res. 2010, 83, 129–142. [Google Scholar] [CrossRef]
  18. Wang, X.; Fang, J.; Tang, Z.; Zhu, B. Climatic Control of Primary Forest Structure and DBH-Height Allometry in Northeast China. For. Ecol. Manag. 2006, 234, 264–274. [Google Scholar] [CrossRef]
  19. Ismail, M.J.; Poudel, T.R.; Ali, A.; Dong, L. Incorporating Stand Parameters in Nonlinear Height-Diameter Mixed-Effects Model for Uneven-Aged Larix gmelinii Forests. Front. For. Glob. Change 2025, 7, 1491648. [Google Scholar] [CrossRef]
  20. Özçelika, R.; Cao, Q.V.; Trincadoc, G.; Göçerd, N. Predicting Tree Height from Tree Diameter and Dominant Height using Mixed-Effects and Quantile Regression Models for Two Species in Turkey. For. Ecol. Manag. 2018, 419–420, 240–248. [Google Scholar] [CrossRef]
  21. Temesgen, H.; Gadow, K. Generalized Height-Diameter Models-an Application for Major Tree Species in Complex Stands of Interior British Columbia. Eur. J. For. Res. 2004, 123, 45–51. [Google Scholar] [CrossRef]
  22. Tian, D.; Jiang, L.; Shahzad, M.K.; He, P.; Wang, J.; Yan, Y. Climate-Sensitive Tree Height-Diameter Models for Mixed Forests in Northeastern China. Agric. For. Meteorol. 2022, 326, 109182. [Google Scholar] [CrossRef]
  23. Zhang, X.; Chhin, S.; Fu, L.; Lu, L.; Duan, A.; Zhang, J. Climate-Sensitive Tree Height-Diameter Allometry for Chinese Fir in Southern China. For. Int. J. For. Res. 2019, 92, 167–176. [Google Scholar] [CrossRef]
  24. Hao, X.; Mu, C.; Cui, Y.; Ji, W.; Xu, W.; Zhao, H. Prediction of Liberation Cutting Intensity Effect on the Growth of Korean pine in Secondary Forest Based on Double Dummy Variable Model. J. Appl. Ecol. 2024, 35, 1463–1473. [Google Scholar]
  25. Ciceu, A.; Leca, S.; Badea, O.; Mehtätalo, L. Nonlinear Multilevel Seemingly Unrelated Height-Diameter and Crown Length Mixed-Effects Models for the Southern Transylvanian Forests, Romania. For. Ecosyst. 2025, 13, 100322. [Google Scholar] [CrossRef]
  26. Sağlam, F.; Sakici, O.E. Ecoregional Height-Diameter Models for Scots pine in Turkiye. J. For. Res. 2024, 35, 103. [Google Scholar] [CrossRef]
  27. Huang, Y.; Deng, X.; Zhao, Z.; Xiang, W.; Yan, W.; Ouyang, S.; Lei, P. Monthly Radial Growth Model of Chinese Fir (Cunninghamia lanceolata (Lamb.) Hook.), and the Relationships between Radial Increment and Climate Factors. Forests 2019, 10, 757. [Google Scholar] [CrossRef]
  28. Lu, L.; Chhin, S.; Zhang, J.; Zhang, X. Modelling Tree Height-Diameter Allometry of Chinese Fir in Relation to Stand and Climate Variables through Bayesian Model Averaging Approach. Silva Fenn. 2021, 55, 10415. [Google Scholar] [CrossRef]
  29. Zhang, B.; Sajjad, S.; Chen, K.; Zhou, L.; Zhang, Y.; Yong, K.K.; Sun, Y. Predicting Tree Height-Diameter Relationship from Relative Competition Levels Using Quantile Regression Models for Chinese Fir (Cunninghamia lanceolata) in Fujian Province, China. Forests 2020, 11, 183. [Google Scholar] [CrossRef]
  30. Buford, M.A.; Burkhart, H.E. Genetic Improvement Effects on Growth and Yield of Loblolly Pine Plantations. For. Sci. 1987, 3, 707–724. [Google Scholar] [CrossRef]
  31. Sharma, M. Modelling Climate Effects on Diameter Growth of Red pine Trees in Boreal Ontario, Canada. Trees For. People 2021, 4, 100064. [Google Scholar] [CrossRef]
  32. Wu, H.; Lei, J.; Li, X.; Wang, H.; Duan, A.; Zhang, J. Aggregation Distributions Across Stand Age in Provenances of Cunninghamia lanceolata (Lamb.) Hook. For. Ecol. Manag. 2021, 494, 119317. [Google Scholar] [CrossRef]
  33. Wang, H.; Zhu, A.; Duan, A.; Wu, H.; Zhang, J. Responses to Subtropical Climate in Radial Growth and Wood Density of Chinese Fir Provenances, Southern China. For. Ecol. Manag. 2022, 521, 120428. [Google Scholar] [CrossRef]
  34. Gompertz, B. XXIV. On the Nature of The Function Expressive of The Law of Human Mortality, and on a New Mode of Determining the Value of Life Contingencies. In a letter to Francis Baily, Esq. F. R. S. &c. Philos. Trans. R. Soc. Lond. 1825, 115, 513–583. [Google Scholar] [CrossRef]
  35. Curtis, R.O. Height-Diameter and Height-Diameter-Age Equations for Second-Growth Douglas-Fir. For. Sci. 1967, 13, 365–375. [Google Scholar]
  36. Robinson, A.P.; Wykoff, W.R. Imputing Missing Height Measures using a Mixed-Effects Modeling Strategy. Can. J. For. Res. 2004, 34, 2492–2500. [Google Scholar] [CrossRef]
  37. Mensah, S.; Pienaar, O.L.; Kunneke, A.; Toit, B.; Seydack, A.; Uhl, E.; Pretzsch, H.; Seifert, T. Height-Diameter Allometry in South Africa’s Indigenous High Forests: Assessing Generic Models Performance and Function Forms. For. Ecol. Manag. 2018, 410, 1–11. [Google Scholar] [CrossRef]
  38. Lumbres, R.L.C.; Lee, Y.J.; Calora, F.G., Jr.; Parao, M.R. Model Fitting and Validation of Six Height–DBH Equations for Pinus kesiya Royle ex Gordon in Benguet Province, Philippines. For. Sci. Technol. 2013, 9, 45–50. [Google Scholar] [CrossRef]
  39. Ng’andwe, P.; Chungu, D.; Yambayamba, A.M.; Chilambwe, A. Modeling the Height-Diameter Relationship of Planted Pinus kesiya in Zambia. For. Ecol. Manag. 2019, 447, 1–11. [Google Scholar] [CrossRef]
  40. Patrício, M.S.; Dias, C.R.G.; Nunes, L. Mixed-effects Generalized Height-Diameter Model: A Tool for Forestry Management of Young Sweet Chestnut Stands. For. Ecol. Manag. 2022, 514, 120209. [Google Scholar] [CrossRef]
  41. Wu, H.; Duan, A.; Zhang, J. Long-Term Growth Variation and Selection of Geographical Provenances of Cunninghamia lanceolata (Lamb.) Hook. Forests 2019, 10, 876. [Google Scholar] [CrossRef]
  42. Noordermeer, L.; Bollandsås, O.M.; Gobakken, T.; Næsset, E. Direct and Indirect Site Index Determination for Norway Spruce and Scots pine using Bitemporal Airborne Laser Scanner Data. For. Ecol. Manag. 2018, 428, 104–114. [Google Scholar] [CrossRef]
  43. Sharma, M.; Parton, J. Height-Diameter Equations for Boreal Tree Species in Ontario using a Mixed-Effects Modeling Approach. For. Ecol. Manag. 2007, 249, 187–198. [Google Scholar] [CrossRef]
  44. Wu, H.; Duan, A.; Zhang, J. Growth Variation and Selection Effect of Cunninghamia lanceolata Provenances at Different Stand Ages. For. Sci. 2019, 55, 181–192. (In Chinese) [Google Scholar]
  45. Wang, H.; Duan, A.; Zhang, J. Intraspecific Responses to Climate Change in Cunninghamia lanceolata (Lamb.) Hook.: Local May Not be the Best. For. Ecol. Manag. 2025, 590, 122784. [Google Scholar] [CrossRef]
  46. Köhler, P.; Huth, A. The Effects of Tree Species Grouping in Tropical Rainforest Modelling: Simulations with The Individual-Based Model FORMIND. Ecol. Model. 1998, 109, 301–321. [Google Scholar] [CrossRef]
  47. Phillips, P.D.; Yasman, I.; Brash, T.E.; Gardingen, P.R. Grouping Tree Species for Analysis of Forest Data in Kalimantan (Indonesian Borneo). For. Ecol. Manag. 2022, 157, 205–216. [Google Scholar] [CrossRef]
  48. Özçelik, R.; Diamantopoulou, M.J.; Crecente-Campo, F.; Eler, U. Estimating Crimean Juniper Tree Height Using Nonlinear Regression and Artificial Neural Network Models. For. Ecol. Manag. 2013, 306, 52–60. [Google Scholar] [CrossRef]
  49. Ou, Y.; Quiñónez-Barraza, G. Modeling Height–Diameter Relationship Using Artificial Neural Networks for Durango Pine (Pinus durangensis Martínez) Species in Mexico. Forests 2023, 14, 1544. [Google Scholar] [CrossRef]
Figure 1. Provenance trial site and geographic origins of the provenances.
Figure 1. Provenance trial site and geographic origins of the provenances.
Biology 14 01301 g001
Figure 2. Scatterplot of height–DBH observational data.
Figure 2. Scatterplot of height–DBH observational data.
Biology 14 01301 g002
Figure 3. A schematic flowchart of the methodological workflow employed in this study.
Figure 3. A schematic flowchart of the methodological workflow employed in this study.
Biology 14 01301 g003
Figure 4. Comparison of residual distributions by tree age for different models. (a) The base model (M1), (b) the height-clustering mixed model (M3), (c) the age-parameterized model (M5), (d) the optimal model incorporating age and height-clustering (M7). M7 exhibits the narrowest residual distribution and a stable median, indicating the highest predictive accuracy and reliability, particularly for mature trees. The boxplot elements are defined as follows: central line (median), box boundaries (quartiles), whiskers (1.5 times the interquartile range), and points (outliers).
Figure 4. Comparison of residual distributions by tree age for different models. (a) The base model (M1), (b) the height-clustering mixed model (M3), (c) the age-parameterized model (M5), (d) the optimal model incorporating age and height-clustering (M7). M7 exhibits the narrowest residual distribution and a stable median, indicating the highest predictive accuracy and reliability, particularly for mature trees. The boxplot elements are defined as follows: central line (median), box boundaries (quartiles), whiskers (1.5 times the interquartile range), and points (outliers).
Biology 14 01301 g004
Figure 5. Height growth prediction for the three provenance clusters based on the optimal model (M7).
Figure 5. Height growth prediction for the three provenance clusters based on the optimal model (M7).
Biology 14 01301 g005
Table 1. Candidate height–diameter base models for Chinese fir.
Table 1. Candidate height–diameter base models for Chinese fir.
ModelModel Equation
Logistic H = 1.3 + a 1 + b e c D + ε
Richards H = 1.3 + a ( 1 e b D ) c + ε
Gompertz H = 1.3 + e b e c D
Curtis H = 1.3 + a D 1 + D b + ε
Meyer H = 1.3 + a 1 e b D + ε
Wykoff H = 1.3 + e ( a + b / ( D + 1 ) ) + ε
Note: H represents tree height; D represents diameter at breast height (DBH); a , b , and c are model parameters; ε denotes the error term.
Table 2. Fitting results of the six candidate base models.
Table 2. Fitting results of the six candidate base models.
ModelParameter EstimationFitting Statistic
abcAICBIC−2LogL
Logistic15.9630 ***8.3112 ***0.1979 ***1616.8891633.6421608.889
Richards18.4873 ***0.0810 ***1.3807 ***1679.1221695.8751671.122
Gompertz17.1237 ***2.7269 ***0.1250 ***1644.0141660.7671636.014
Curtis21.1884 ***9.0065 *** 1734.2801746.8441728.280
Meyer26.8212 ***0.0348 *** 1705.7521718.3171699.752
Wykoff3.0956 ***−10.1254 *** 1718.1971730.7621712.197
Note: *** indicates statistical significance at p < 0.001.
Table 3. Optimal mixed-effects model for different clustering approaches.
Table 3. Optimal mixed-effects model for different clustering approaches.
No.ModelRandom ParametersFixed ParameterFitting Statistic
abcAICBIC−2LogL
M1Base modelNone15.96308.31120.19791616.8891633.6421608.889
M2DBH-clustering mixed modelb, c15.84908.58900.20401605.6701634.9871591.670
M3Height-clustering mixed modela, b14.68669.93880.25541490.4291519.7471476.429
M4Combined clustering mixed modela, b15.19539.23210.22421568.4301597.7481554.430
Table 4. Model fitting with age and clustering-based mixed effects.
Table 4. Model fitting with age and clustering-based mixed effects.
No.ModelFixed ParameterFitting Statistic
abcc0AICBIC−2LogL
M5Incorporating age13.743317.3032−1.08670.27001315.0501335.9921305.050
M6Incorporating age and DBH-clustering13.838115.5125−1.06940.26181107.4001140.9061091.400
M7Incorporating age and height-clustering13.775912.7433−1.07370.2557643.648677.154627.648
M8Incorporating age and combined clustering13.652917.3881−1.11690.2760818.134851.640802.134
Table 5. Cross-validation results for the different models.
Table 5. Cross-validation results for the different models.
No.ModelValidation CriterionRanking
RMSEMAEMAPE/%R2
M1Base model1.26541.083317.86290.94048
M2DBH-clustering mixed model1.24261.053117.17870.94257
M3Height-clustering mixed model1.09360.869615.40750.95555
M4Combined clustering mixed model1.19960.980715.72970.94646
M5Base model incorporating age0.92460.58525.04780.96824
M6Incorporating age and DBH-clustering0.74030.47084.27670.97963
M7Incorporating age and height-clustering0.45820.31453.27070.99221
M8Incorporating age and combined clustering0.55100.36833.56480.98872
Table 6. Parameter estimates of the optimal mixed-effects model.
Table 6. Parameter estimates of the optimal mixed-effects model.
ParameterEstimated Value
a 13.7760
b 12.7461
c −1.0737
c 0 0.2557
Random effect variance–covariance structure 1.7007 2.0700 2.0700 2.7865
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, X.; Wang, Y.; Lyu, Y.; Chen, W.; Li, M.; Sun, S. Provenance-Specific Height–Diameter Modeling for Chinese Fir: A Clustered Mixed-Effects Approach. Biology 2025, 14, 1301. https://doi.org/10.3390/biology14091301

AMA Style

Wu X, Wang Y, Lyu Y, Chen W, Li M, Sun S. Provenance-Specific Height–Diameter Modeling for Chinese Fir: A Clustered Mixed-Effects Approach. Biology. 2025; 14(9):1301. https://doi.org/10.3390/biology14091301

Chicago/Turabian Style

Wu, Xiangrong, Yuhan Wang, Yanjuan Lyu, Wanrong Chen, Ming Li, and Shuaichao Sun. 2025. "Provenance-Specific Height–Diameter Modeling for Chinese Fir: A Clustered Mixed-Effects Approach" Biology 14, no. 9: 1301. https://doi.org/10.3390/biology14091301

APA Style

Wu, X., Wang, Y., Lyu, Y., Chen, W., Li, M., & Sun, S. (2025). Provenance-Specific Height–Diameter Modeling for Chinese Fir: A Clustered Mixed-Effects Approach. Biology, 14(9), 1301. https://doi.org/10.3390/biology14091301

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop