1. Introduction
Accurate tree species identification through bark characteristics is a vital skill for foresters, enabling precise differentiation critical for effective forest management. There are many ways to identify trees, including leaves, fruit, and bark [
1]. Traditionally, this expertise demands years of experience and advanced training, limiting its accessibility. Recent advancements in artificial intelligence (AI), particularly deep learning and convolutional neural networks (CNNs), offer the potential to replicate this precision with automated, rapid, and remote identification tools [
2]. This project aims to develop AI-based tree bark identification tools to streamline digital forestry and support conservation efforts without requiring extensive professional training.
Deep learning has enabled human-like pattern recognition by learning from large, labeled datasets [
3]. In tree bark identification, CNNs have demonstrated accuracy comparable to or surpassing traditional texture classification methods, with straightforward implementation and end-to-end training [
1,
4,
5]. However, existing datasets, such as BarkTex [
6], Trunk12 [
7], and BarkNet 1.0 [
4], which provide labeled bark images of hardwood species across the United States, Canada, and Europe, are limited by variability in scope, species diversity, and accessibility [
8]. These constraints hinder comprehensive AI training, resulting in identification systems that currently underperform compared to experienced foresters.
To address these limitations, this study leverages the CentralBark dataset [
9], which includes over 19,000 images of 25 commercially valuable hardwood species from the Central Hardwood and Central Appalachian regions, accompanied by metadata such as diameter at breast height (DBH), bark moisture content (wet or dry), GPS location, timestamp, date, and camera settings [
9]. We are focusing on a new, smaller three-species dataset separate from the CentralBark dataset—northern red oak, hackberry, and bitternut hickory. We are calling this new dataset CBDS_Small. This research investigates the impact of three key variables contributing to inaccuracies in AI-based bark identification: (1) time of day (affecting lighting conditions), (2) bark moisture content, and (3) cardinal direction of observation (north, south, east, or west). Identifying primary sources of error will guide improvements in data collection and preprocessing, such as standardizing image capture protocols or augmenting datasets with underrepresented conditions.
By enhancing the robustness of deep learning models, this work aims to develop reliable AI tools for real-world forestry applications, where environmental variability is a common occurrence. Improved bark identification systems will streamline forest inventory processes, enable precise monitoring of species diversity and health, and support sustainable forest management.
3. Results and Discussion
Analysis of the CBDS_Small dataset revealed classification accuracies that both aligned with and diverged from trends observed in our prior research using the larger CentralBark dataset [
9]. As presented in
Table 1, hackberry and northern red oak exhibited a reversal in relative performance compared to earlier studies. Specifically, hackberry achieved a classification accuracy of 90.00%, outperforming northern red oak at 86.46% by a margin of 3.54%. This shift is notable, given that northern red oak previously demonstrated higher accuracy (91.28%) in the CentralBark dataset. However, the smaller sample size of the current dataset—comprising 96 images for northern red oak, 84 for hackberry, and 72 for bitternut hickory—warrants cautious interpretation. The dataset involved repeated imaging of the same trees under varying light conditions, moisture levels, and cardinal directions, potentially amplifying specific environmental influences not fully captured in the larger dataset. Consistent with expectations, bitternut hickory exhibited the lowest classification accuracy at 26.76%, aligning with its prior performance (63.38%) and underscoring persistent challenges in accurately classifying this species. The low accuracy for bitternut hickory may stem from its subtle bark features, which are less visually distinct than those of hackberry or northern red oak, particularly under variable conditions. These findings suggest that while hackberry’s distinctive features may confer an advantage in smaller, controlled datasets, scaling up data collection and diversifying environmental conditions are critical to stabilizing model performance across species.
Further analysis examined the influence of three environmental variables—cardinal direction of image capture, bark moisture condition, and time of day—on classification accuracy.
Table 2 shows that no consistent pattern emerged to suggest that cardinal direction significantly impacted model performance. The maximum observed difference in accuracy was 4.72%, with east-facing images yielding the highest accuracy, 88.42%, and west-facing images the lowest, 83.70%. north- and south-facing images had accuracies of 84.61% and 85.95%, respectively. These results align with our hypothesis that directional effects, particularly those tied to sunrise (east) and sunset (west), may introduce variability due to differences in lighting and shadow patterns. However, the modest variation suggests that cardinal direction is a less dominant factor than anticipated, possibly because the EfficientNet-B3 model [
10] is robust to subtle lighting differences caused by directional orientation. This finding has practical implications for field data collection, suggesting that strict control of cardinal direction may not be necessary, thereby allowing for greater flexibility in image capture protocols. Nonetheless, the slight advantage of east-facing images may warrant further investigation into whether morning light enhances bark feature visibility, particularly for species with complex textures.
Analysis of bark moisture conditions revealed a pronounced impact on classification accuracy, as shown in
Table 3. Images captured under dry conditions yielded an accuracy of 89.32%, surpassing wet conditions by 8.19% (81.13% accuracy). This disparity was anticipated for the selected species—northern red oak, hackberry, and bitternut hickory—which exhibit slate-gray bark when dry but darken significantly when wet, altering their visual appearance. For northern red oak, the characteristic “ski track” furrows [
14] transitioning from slate gray to off-white become less distinct under wet conditions, potentially confusing the model. Similarly, hackberry’s wart-like protuberances [
14] a defining feature, may appear subdued when wet, reducing contrast. Bitternut hickory, with its tightly packed bark and narrow slits, is particularly susceptible to misclassification when wet, as moisture masks its subtle peeling texture [
15].
The classification accuracies for northern red oak, hackberry, and bitternut hickory across morning (07:00–10:00), afternoon (12:00–15:00), and evening (17:00–20:00), as shown in
Table 4, reveal the impact of lighting conditions on the EfficientNet-B3 model’s performance in bark-based tree identification. Northern red oak achieved the highest accuracies in morning (91.67%) and evening (90.00%) but dropped to 80.80% in the afternoon, likely due to harsh midday light creating high-contrast shadows that obscure its deep furrows. Hackberry exhibited a progressive increase, from 85.00% in the morning to 95.56% in the evening, where low-angle light enhanced its bumpy protuberances, supported by a larger evening sample (45 images vs. 30 for morning/afternoon). Bitternut hickory consistently performed poorly (20.83–33.33%), with its faint bark slits and peeling challenging the model across all conditions, exacerbated by a smaller evening sample (25 images).
These results align with prior findings, where bark moisture significantly reduced accuracy by 8.19% in wet conditions (
Table 3), likely interacting with lighting effects (e.g., wet bark in afternoon light worsening northern red oak’s performance). Cardinal direction had minimal impact (4.72% maximum difference,
Table 2), emphasizing time of day as a more critical variable. The species’ morphological differences—northern red oak’s prominent furrows, hackberry’s distinctive protuberances, and bitternut hickory’s fine features—explain their varying sensitivities to lighting, with hackberry’s robustness contrasting with bitternut hickory’s persistent challenges.
The findings underscore the need for robust models to handle lighting variations in AI-based tree identification. Augmenting the CentralBark dataset [
9] with diverse lighting conditions, particularly afternoon images for northern red oak and varied conditions for bitternut hickory, is essential [
3]. Techniques such as illumination normalization or attention mechanisms can enhance feature detection, while species-specific strategies (e.g., oversampling for bitternut hickory) address morphological challenges. These improvements are crucial for practical applications, such as UAV-based monitoring, which supports precision forestry and conservation in dynamic forest environments.
The Chi-Square test results in
Table 5 evaluate the association between time of day (morning, afternoon, evening) and classification outcomes (correct or incorrect) for hackberry, northern red oak, and bitternut hickory using the EfficientNet-B3 model for AI-based tree bark identification. Hackberry
2 = 10.9877,
p = 0.0041) and northern red oak (
2 = 7.6633,
p = 0.0217) show significant associations (
p < 0.05), confirming that lighting conditions significantly influence their classification accuracy. Hackberry’s accuracy peaks at 95.56% in the evening, likely due to low-angle light enhancing its nodular appearance, while northern red oak’s accuracy drops to 80.80% in the afternoon, where harsh light obscures its grooved furrow pattern, compared to 91.67% in the morning and 90.00% in the evening. Bitternut hickory’s non-significant result ((
2 = 4.8251), (
p = 0.0896)) aligns with its consistently low accuracies (20.83–33.33%), reflecting challenges in detecting its subtle bark slits across all lighting conditions, compounded by a smaller evening sample (25 images vs. 30 for others).
These findings complement prior results found in Warner et al. (2024) [
9], where bark moisture reduced accuracy by 8.19% in wet conditions, likely exacerbating lighting effects (e.g., wet bark in afternoon light for northern red oak), and cardinal direction had minimal impact (4.72% difference). The significant results for hackberry and northern red oak highlight their sensitivity to lighting due to distinct morphological features, while bitternut hickory’s non-significant result highlights its morphological challenges. The study suggests augmenting the CentralBark dataset [
9] with more afternoon images for northern red oak and diverse conditions for bitternut hickory, alongside preprocessing techniques like illumination normalization [
2]. Hackberry’s robustness suggests less augmentation for distinct species, while bitternut hickory needs species-specific strategies.
Limitations
We can confirm that no data leakage occurred in this study. Our CBDS_Small dataset, comprised of 252 images, was used exclusively for testing and not included in the training of the EfficientNet-B3 model. Furthermore, both Centralbark and CBDS_Small, the training is split by trees and not by images.
The moisture-induced color shift highlights a critical limitation in current deep learning models, which rely heavily on color and texture cues [
3]. Addressing this challenge requires targeted augmentation of training datasets with wet bark images and potential preprocessing techniques, such as color normalization, to mitigate the impact of moisture on feature detection. These findings underscore the need for robust models capable of generalizing across environmental conditions encountered in real-world forestry applications.
4. Conclusions
The novelty of this paper is the evaluation of the EfficientNet-B3 model’s robustness in AI-based bark classification using CBDS_Small, focusing on environmental impacts–time of day, bark moisture, and cardinal direction on northern red oak, hackberry, and bitternut hickory classification. The analysis of a CBDS_Small dataset revealed that hackberry outperformed northern red oak (90.00% vs. 86.46% accuracy), reversing prior trends from the CentralBark dataset [
9], while bitternut hickory consistently exhibited low accuracy (26.76%), aligning with its prior performance (63.38%). These results were influenced by the smaller sample size and repeated imaging under varied conditions. Furthermore, the observed reversal in trends for hackberry vs. northern red oak is likely to stem from its distinct physical characteristics, which facilitate identification across all life stages. Hackberry’s characteristic wart-like protrusions are not as prominent on young trees, unlike red oak, where diagnostic features like “ski tracks” show pretty early in the tree’s life cycle. At the 10-inch DBH threshold used in Warner et al., 2024 northern red oak is more readily distinguishable than hackberry [
9]. This difference becomes more pronounced in our study, which targets a 16–18-inch DBH range, where hackberry’s prominent features enhance its recognizability in our smaller dataset. and underscore the interplay of species morphology and environmental factors in model performance.
Bark moisture condition emerged as a significant factor, with dry conditions yielding 8.19% higher accuracy than wet conditions (89.32% vs. 81.13%), as wet bark darkens the normal appearance like northern red oak’s “ski tracks”, hackberry’s wart-like texture, and bitternut hickory’s imperceptible bark characteristics. Time of day also significantly affected accuracy, with Chi-Square tests confirming strong associations for hackberry (2 = 10.99, p = <0.05) and northern red oak 2 = 7.66, p = 0.02), driven by evening (95.56% for hackberry) and morning (91.67% for northern red oak) light enhancing feature visibility, while afternoon’s harsh light reduced northern red oak’s accuracy (80.80%). Bitternut hickory’s non-significant result (2 = 4.83, p = 0.09) reflects its morphological challenges, as its subtle features remain difficult to classify across all lighting conditions. Cardinal direction had minimal impact (4.72% maximum difference), suggesting that lighting intensity and angle are more critical than directional shadows.
These findings highlight the need for robust deep learning models to address environmental variability in AI-based tree identification. Targeted augmentation of the CentralBark dataset with wet bark and afternoon images, particularly for northern red oak, and diverse conditions for bitternut hickory, is essential to improve model generalization. Preprocessing techniques, such as illumination normalization and attention mechanisms, could enhance feature detection under varying light and moisture conditions [
2]. Species-specific strategies, including oversampling or transfer learning for bitternut hickory, are crucial to overcome morphological challenges. These improvements are vital for practical applications like UAV-based monitoring [
16], enabling reliable automated species identification in dynamic forest environments.