Figure 1.
Distribution map of vegetation types on the Qinghai–Tibet Plateau.
Figure 1.
Distribution map of vegetation types on the Qinghai–Tibet Plateau.
Figure 2.
Schematic diagram of pollen sampling points. Black triangles indicate historical sediment profiles samples, whereas red circles mark the newly discovered stratigraphic section used in this study.
Figure 2.
Schematic diagram of pollen sampling points. Black triangles indicate historical sediment profiles samples, whereas red circles mark the newly discovered stratigraphic section used in this study.
Figure 3.
Modern pollen standard glass slide.
Figure 3.
Modern pollen standard glass slide.
Figure 4.
Shaqu profile: (a) Profile sedimentation (vertical height of the profile); (b) Slanted view of the same profile (slope distance of the profile); (c) surrounding environment). The vertical height in (a) is 2 m, while the slanted length in (b) is 5.3 m. The scale bars in both images correspond to the respective measurements.
Figure 4.
Shaqu profile: (a) Profile sedimentation (vertical height of the profile); (b) Slanted view of the same profile (slope distance of the profile); (c) surrounding environment). The vertical height in (a) is 2 m, while the slanted length in (b) is 5.3 m. The scale bars in both images correspond to the respective measurements.
Figure 5.
Representative imaging results of the TPPOL23 pollen database. (a) Apiaceae; (b) Asteraceae; (c) Betula; (d) Brassicaceae; (e) Caprifoliaceae; (f) Chenopodiaceae; (g) Corylus; (h) Cyperaceae; (i) Elaeagnaceae; (j) Gentianaceae; (k) Hippophae; (l) Juglandaceae; (m) Lamiaceae; (n) Lycopodiaceae; (o) Ostrya; (p) Picea; (q) Pinus; (r) Poaceae; (s) Ranunculaceae; (t) Salix; (u) Thymelaeaceae—Daphne; (v) Ulmus; (w) Zygophyllaceae.
Figure 5.
Representative imaging results of the TPPOL23 pollen database. (a) Apiaceae; (b) Asteraceae; (c) Betula; (d) Brassicaceae; (e) Caprifoliaceae; (f) Chenopodiaceae; (g) Corylus; (h) Cyperaceae; (i) Elaeagnaceae; (j) Gentianaceae; (k) Hippophae; (l) Juglandaceae; (m) Lamiaceae; (n) Lycopodiaceae; (o) Ostrya; (p) Picea; (q) Pinus; (r) Poaceae; (s) Ranunculaceae; (t) Salix; (u) Thymelaeaceae—Daphne; (v) Ulmus; (w) Zygophyllaceae.
Figure 6.
Examples of data augmentation applied to Picea pollen images in the TPPOL23 database. (a) Original image; (b) HSV color adjustment; (c) Histogram equalization with color normalization; (d) Random cropping; (e) Horizontal flip; (f) Vertical flip; (g) Gaussian blur; (h) Perspective transformation; (i) Multi-scale Gaussian pyramid fusion; (j) Random scaling.
Figure 6.
Examples of data augmentation applied to Picea pollen images in the TPPOL23 database. (a) Original image; (b) HSV color adjustment; (c) Histogram equalization with color normalization; (d) Random cropping; (e) Horizontal flip; (f) Vertical flip; (g) Gaussian blur; (h) Perspective transformation; (i) Multi-scale Gaussian pyramid fusion; (j) Random scaling.
Figure 7.
Manual annotation of pollen grains in the TPPOL23 database by palynological experts. (a) Picea; (b) Chenopodiaceae. Yellow bounding boxes indicate expert-annotated pollen grains used as ground-truth labels for model training.
Figure 7.
Manual annotation of pollen grains in the TPPOL23 database by palynological experts. (a) Picea; (b) Chenopodiaceae. Yellow bounding boxes indicate expert-annotated pollen grains used as ground-truth labels for model training.
Figure 8.
Architecture of the Dual-Path Excitation Block (DPEB) embedded in PollenSENet. The input channel descriptor undergoes channel recalibration (Equations (6)–(8)) and spatial modulation (Equations (9) and (10)) to produce the modulated feature map . Different colors and arrows are used only to visually distinguish processing steps within the block and do not represent additional categorical or functional meanings.
Figure 8.
Architecture of the Dual-Path Excitation Block (DPEB) embedded in PollenSENet. The input channel descriptor undergoes channel recalibration (Equations (6)–(8)) and spatial modulation (Equations (9) and (10)) to produce the modulated feature map . Different colors and arrows are used only to visually distinguish processing steps within the block and do not represent additional categorical or functional meanings.
Figure 9.
Visualization of channel attention in PollenSENet. (Left): input pollen micro-image; (Right): learned channel-attention distribution , highlighting discriminative responses associated with germinal apertures and exine ornamentation.
Figure 9.
Visualization of channel attention in PollenSENet. (Left): input pollen micro-image; (Right): learned channel-attention distribution , highlighting discriminative responses associated with germinal apertures and exine ornamentation.
Figure 10.
Characteristic curves of activation functions used in PollenSENet: ReLU, LeakyReLU (slope = 0.2), Sigmoid, and Hard-Sigmoid.
Figure 10.
Characteristic curves of activation functions used in PollenSENet: ReLU, LeakyReLU (slope = 0.2), Sigmoid, and Hard-Sigmoid.
Figure 11.
Structure of the adjustment module in PollenSENet (based on a ResNet-152 residual block). The module takes multi-scale feature maps as input, generates scale-specific weights through convolution and Softmax normalization (Equations (11) and (12)), fuses them into (Equation (13)), and applies deformable convolution calibration (Equation (14)) with residual connection (Equation (15)).
Figure 11.
Structure of the adjustment module in PollenSENet (based on a ResNet-152 residual block). The module takes multi-scale feature maps as input, generates scale-specific weights through convolution and Softmax normalization (Equations (11) and (12)), fuses them into (Equation (13)), and applies deformable convolution calibration (Equation (14)) with residual connection (Equation (15)).
Figure 12.
Architecture of the Pollen-YOLO model, where the YOLOv11 backbone is replaced with PollenSENet while the Neck (PANet) and Head modules remain unchanged. Arrows indicate the direction of feature propagation, and different colors are used only to visually distinguish network modules; they do not represent additional functional or categorical meanings.
Figure 12.
Architecture of the Pollen-YOLO model, where the YOLOv11 backbone is replaced with PollenSENet while the Neck (PANet) and Head modules remain unchanged. Arrows indicate the direction of feature propagation, and different colors are used only to visually distinguish network modules; they do not represent additional functional or categorical meanings.
Figure 13.
Training and validation loss curves of Pollen-YOLO. (a) Training losses, including box_loss, cls_loss, and dfl_loss; (b) Validation losses, including val/box_loss, val/cls_loss, and val/dfl_loss. Epoch denotes one complete pass through the training dataset; the x-axis represents the number of training epochs.
Figure 13.
Training and validation loss curves of Pollen-YOLO. (a) Training losses, including box_loss, cls_loss, and dfl_loss; (b) Validation losses, including val/box_loss, val/cls_loss, and val/dfl_loss. Epoch denotes one complete pass through the training dataset; the x-axis represents the number of training epochs.
Figure 14.
Performance curves of Pollen-YOLO during training on the TPPOL23 dataset across 200 epochs. An epoch denotes one complete training cycle over the entire training dataset. The horizontal axis represents the epoch number, while the vertical axis represents the corresponding performance metric values (ranging from 0 to 1). (a) Precision; (b) Recall; (c) mAP@0.5; (d) mAP@0.5:0.95.
Figure 14.
Performance curves of Pollen-YOLO during training on the TPPOL23 dataset across 200 epochs. An epoch denotes one complete training cycle over the entire training dataset. The horizontal axis represents the epoch number, while the vertical axis represents the corresponding performance metric values (ranging from 0 to 1). (a) Precision; (b) Recall; (c) mAP@0.5; (d) mAP@0.5:0.95.
Figure 15.
Evolution of the F1-score of Pollen-YOLO during training on the TPPOL23 dataset across 200 epochs. An epoch denotes one complete training cycle over the entire training dataset. The horizontal axis represents the epoch number, and the vertical axis represents the F1-score value (ranging from 0 to 1).
Figure 15.
Evolution of the F1-score of Pollen-YOLO during training on the TPPOL23 dataset across 200 epochs. An epoch denotes one complete training cycle over the entire training dataset. The horizontal axis represents the epoch number, and the vertical axis represents the F1-score value (ranging from 0 to 1).
Figure 16.
Recognition performance of Pollen-YOLO evaluated on the validation set of the TPPOL23 dataset, comprising 5159 images (20% of the augmented dataset), displayed as a heatmap. Recognition accuracy is calculated based on expert-annotated pollen grains within each class. Darker blue indicates higher recognition accuracy.
Figure 16.
Recognition performance of Pollen-YOLO evaluated on the validation set of the TPPOL23 dataset, comprising 5159 images (20% of the augmented dataset), displayed as a heatmap. Recognition accuracy is calculated based on expert-annotated pollen grains within each class. Darker blue indicates higher recognition accuracy.
Figure 17.
Representative examples of pollen detection and classification by Pollen-YOLO on the TPPOL23 validation set. All images have a resolution of 2048 × 2048 pixels. (a) Apiaceae; (b) Betula; (c) Brassicaceae; (d) Caprifoliaceae; (e) Chenopodiaceae; (f) Corylus; (g) Thymelaeaceae—Daphne; (h) Elaeagnaceae; (i) Gentianaceae; (j) Hippophae; (k) Lamiaceae; (l) Ostrya; (m) Picea; (n) Pinus; (o) Ranunculaceae; (p) Ulmus; (q) Zygophyllaceae; (r) Asteraceae; (s) Cyperaceae; (t) Juglandaceae; (u) Lycopodiaceae; (v) Poaceae; (w) Salix.
Figure 17.
Representative examples of pollen detection and classification by Pollen-YOLO on the TPPOL23 validation set. All images have a resolution of 2048 × 2048 pixels. (a) Apiaceae; (b) Betula; (c) Brassicaceae; (d) Caprifoliaceae; (e) Chenopodiaceae; (f) Corylus; (g) Thymelaeaceae—Daphne; (h) Elaeagnaceae; (i) Gentianaceae; (j) Hippophae; (k) Lamiaceae; (l) Ostrya; (m) Picea; (n) Pinus; (o) Ranunculaceae; (p) Ulmus; (q) Zygophyllaceae; (r) Asteraceae; (s) Cyperaceae; (t) Juglandaceae; (u) Lycopodiaceae; (v) Poaceae; (w) Salix.
Figure 18.
Grad-CAM visualization on full microscopic images. High-response regions (red) correspond to pollen grains, while background and debris are represented by low-response regions (blue), demonstrating the model’s ability to localize pollen against complex backgrounds. All images have a resolution of 2048 × 2048 pixels. (a) Asteraceae; (b) Lycopodiaceae.
Figure 18.
Grad-CAM visualization on full microscopic images. High-response regions (red) correspond to pollen grains, while background and debris are represented by low-response regions (blue), demonstrating the model’s ability to localize pollen against complex backgrounds. All images have a resolution of 2048 × 2048 pixels. (a) Asteraceae; (b) Lycopodiaceae.
Figure 19.
Grad-CAM visualization on individual pollen grains. (a) Juglandaceae, (b) Apiaceae, (c) Picea. Red regions highlight critical morphological features such as apertures and exine ornamentation, illustrating how Pollen-YOLO focuses on biologically relevant structures during classification.
Figure 19.
Grad-CAM visualization on individual pollen grains. (a) Juglandaceae, (b) Apiaceae, (c) Picea. Red regions highlight critical morphological features such as apertures and exine ornamentation, illustrating how Pollen-YOLO focuses on biologically relevant structures during classification.
Figure 20.
Age–depth model for the Shaqu profile based on six OSL dates using Bacon Bayesian modeling. The black shaded envelope represents the posterior age distributions estimated by the Bacon Bayesian age–depth model, with darker tones indicating higher probability density. The red line denotes the modeled mean age–depth curve, and the green violin-shaped distributions represent the OSL ages with their associated uncertainty ranges.
Figure 20.
Age–depth model for the Shaqu profile based on six OSL dates using Bacon Bayesian modeling. The black shaded envelope represents the posterior age distributions estimated by the Bacon Bayesian age–depth model, with darker tones indicating higher probability density. The red line denotes the modeled mean age–depth curve, and the green violin-shaped distributions represent the OSL ages with their associated uncertainty ranges.
Figure 21.
Comparison of pollen assemblages from the Shaqu profile obtained by traditional manual identification (“artificially identified”) and automated recognition using the Pollen-YOLO framework (“model-identified”). Both assemblages are based on the same sediment samples and prepared slides. (
A) Artificially identified pollen concentration diagram; (
B) Model-identified pollen concentration diagram generated by Pollen-YOLO. Both diagrams were produced using Tilia (version 2.6.1, Illinois State Museum, Springfield, IL, USA) [
60].
Figure 21.
Comparison of pollen assemblages from the Shaqu profile obtained by traditional manual identification (“artificially identified”) and automated recognition using the Pollen-YOLO framework (“model-identified”). Both assemblages are based on the same sediment samples and prepared slides. (
A) Artificially identified pollen concentration diagram; (
B) Model-identified pollen concentration diagram generated by Pollen-YOLO. Both diagrams were produced using Tilia (version 2.6.1, Illinois State Museum, Springfield, IL, USA) [
60].
Figure 22.
Heatmap of normalized relative errors between artificial and model-based pollen identification results for the Shaqu profile. Relative errors were calculated as |Model−Artificial|/Artificial and normalized to [0, 1]. White represents low errors (close agreement), whereas red indicates large deviations. Crosses (×) denote taxa with zero counts in artificial results where errors could not be computed. The heatmap highlights that major discrepancies are concentrated in rare or morphologically similar taxa (e.g., Betulaceae vs. Corylus), while dominant taxa exhibit relatively low errors.
Figure 22.
Heatmap of normalized relative errors between artificial and model-based pollen identification results for the Shaqu profile. Relative errors were calculated as |Model−Artificial|/Artificial and normalized to [0, 1]. White represents low errors (close agreement), whereas red indicates large deviations. Crosses (×) denote taxa with zero counts in artificial results where errors could not be computed. The heatmap highlights that major discrepancies are concentrated in rare or morphologically similar taxa (e.g., Betulaceae vs. Corylus), while dominant taxa exhibit relatively low errors.
Table 1.
Names and detailed information of pollen sampling sites. “Pollen sample count” indicates the number of sediment samples analyzed for pollen at each site. 14C denotes radiocarbon dating, and OSL denotes optically stimulated luminescence dating.
Table 1.
Names and detailed information of pollen sampling sites. “Pollen sample count” indicates the number of sediment samples analyzed for pollen at each site. 14C denotes radiocarbon dating, and OSL denotes optically stimulated luminescence dating.
| Section Name | Sediment Type | Stratigraphic Age (Dating Method) | Pollen Sample Count |
|---|
| Daiqu Site | Aeolian loess–fluvial deposit | 12.1–10.9 ka (OSL) | 65 |
| Laodaqiao Site | Aeolian loess–fluvial deposit | 13–3.3 ka (OSL) | 97 |
| Donggicuona Lake | Aeolian loess–lacustrine deposit | 2.01–8.77 ka (OSL) | 45 |
| Shalongka | Aeolian loess–flood deposit | 8.5–3.9 ka (14C) | 208 |
| Nankanyan Site | Aeolian loess–fluvial deposit | 13.7–0.8 ka (OSL) | 66 |
| Xiadawu Site | Aeolian loess–fluvial deposit | 0–6.3 ka (14C) | 62 |
| Eling Lake Site | Aeolian loess–lacustrine deposit | 2.1–14.6 ka (OSL) | 105 |
| Zhongda Site | Aeolian loess–fluvial deposit | 6.2–19 ka (OSL) | 31 |
Table 2.
Composition of the TPPOL23 dataset. “Original image count” denotes the number of raw microscopic images acquired prior to data augmentation for each pollen taxon. These images constitute the primary dataset from which individual pollen grains were annotated and subsequently expanded through class-specific augmentation strategies.
Table 2.
Composition of the TPPOL23 dataset. “Original image count” denotes the number of raw microscopic images acquired prior to data augmentation for each pollen taxon. These images constitute the primary dataset from which individual pollen grains were annotated and subsequently expanded through class-specific augmentation strategies.
| Pollen Type | Original Image Count | Augmentation Factor | Augmented Image Count | Key Augmentation Methods (Intensity Parameters) | Augmentation Purpose |
|---|
| Apiaceae | 1084 | 1.2 | 1300 | Horizontal flip (p = 0.5) | Improve robustness to rotational symmetry features |
| HSV color jitter (ΔH = ±15°, ΔS = ±0.1, ΔV = ±0.1) |
| Random crop (scale 0.8–1.0) |
| Asteraceae | 135 | 8 | 1080 | Vertical flip (p = 0.5) | Simulate different section views, enhance radial pattern recognition |
| Gaussian blur (σ = 1.5) |
| Perspective transform (rotation ± 20°, scaling ± 15%) |
| Betulaceae | 250 | 4 | 1000 | Random scaling (0.7–1.3×) | Enhance invariance to aperture position |
| Color inversion (p = 0.3) |
| Local cropping (retain ≥ 70% area) |
| Brassicaceae | 203 | 6 | 1218 | Perspective transform (tilt ± 15°) | Simulate microscope slide tilt effects |
| Gaussian noise (SNR = 25 dB) |
| Motion blur (length = 5 px) |
| Caprifoliaceae | 345 | 4 | 1380 | Color normalization (histogram matching) | Suppress color bias, enhance mesh pattern consistency |
| Center crop (80%) |
| Random flip (p = 0.5) |
| Chenopodiaceae | 315 | 4 | 1260 | Elastic deformation (α = 50, σ = 5) | Improve generalization for irregular edge structures |
| Color enhancement (saturation ± 20%) |
| Multi-scale scaling (0.5–2.0×) |
| Corylus (Betulaceae) | 181 | 7 | 1267 | Affine transform (translation ± 10%) | Reduce overfitting, improve robustness of aperture ring detection |
| Gaussian blur (σ = 0.5–2.0) |
| Random occlusion (≤15%) |
| Cyperaceae | 92 | 8 | 736 | Perspective distortion (grid warp) | Simulate optical diffraction in thin-walled pollen |
| Color channel shift (RGB offset ± 3 px) |
| Frequency domain filtering (hybrid high/low pass) |
| Elaeagnaceae | 726 | 1.5 | 1089 | Random flip + crop combination | Improve multi-angle recognition of surface tubercles |
| Color balance (white point correction) |
| Motion blur (random angle) |
| Gentianaceae | 91 | 8 | 728 | Scaling + perspective joint transform | Enhance stability of striate patterns across resolutions |
| CLAHE (Contrast Limited Adaptive Histogram Equalization) |
| Local Gaussian blur |
| Hippophae | 198 | 6 | 1188 | Color space conversion (LAB enhancement) | Optimize representation of oily layer optical properties |
| Random grid deformation |
| Directional blur (along long axis) |
| Juglandaceae | 353 | 4 | 1412 | Non-rigid deformation (thin-plate spline) | Handle morphological variation due to folding |
| Color jitter (brightness ± 10%) |
| Random rotation (±180°) |
| Lamiaceae | 375 | 3 | 1125 | Multi-view perspective projection | Enhance symmetry-breaking feature extraction |
| Color channel weighted blending |
| Anisotropic Gaussian filtering |
| Lycopodiaceae | 312 | 4 | 1248 | Frequency domain enhancement (high-frequency boost) | Increase contrast of tiny germination structures |
| Random cropping (retain spiny structures) |
| Color normalization |
| Ostrya (Betulaceae) | 103 | 8 | 824 | Light Gaussian noise (σ = 0.5) | Prevent overfitting, preserve aperture ring authenticity |
| Random flip (p = 0.5) |
| Local color perturbation |
| Picea (Pinaceae) | 527 | 2.5 | 1317 | 3D volume rendering view simulation | Correct morphological distortion due to angle in air sac structures |
| Multi-scale Gaussian pyramid fusion |
| Color channel shift |
| Pinus (Pinaceae) | 177 | 6.5 | 1150 | Selective enhancement of air sac regions | Improve classification specificity of bisaccate structures |
| Polarized light effect simulation |
| Non-uniform illumination synthesis |
| Poaceae | 329 | 3.5 | 1151 | Morphological dilation/erosion (kernel = 3 × 3) | Enhance rotational invariance of monoporate pollen |
| Random perspective deformation |
| Color space enhancement |
| Ranunculaceae | 103 | 8 | 824 | Focus stack simulation (multi-plane blending) | Handle randomness in surface granule distribution |
| Color inversion (p = 0.2) |
| Elastic deformation |
| Salix (Salicaceae) | 238 | 5 | 1190 | Frequency-spatial joint enhancement | Improve generalization for tricolpate pollen |
| Random occlusion (≤10%) |
| HSV color space perturbation |
| Ulmus | 135 | 8 | 1080 | Selective Gaussian blur | Compensate for spherical projection distortion, highlight aperture features |
| Elastic deformation |
| Spherical projection transform |
| Zygophyllaceae | 209 | 5.5 | 1149 | Morphological skeleton enhancement | Optimize edge feature extraction of spiny protrusions |
| Color channel separation enhancement |
| Motion blur synthesis |
| Daphne (Thymelaeaceae) | 135 | 8 | 1080 | Multi-directional lighting rendering | Enhance recognition of reticulate patterns under varying illumination |
| Gaussian-Poisson noise mixing |
| Non-rigid deformation |
Table 3.
Training parameters and experimental setup for Pollen-YOLO model.
Table 3.
Training parameters and experimental setup for Pollen-YOLO model.
| Parameter | Value |
|---|
| Epochs | 200 |
| Batch size | 16 |
| Workers | 4 |
| Learning rate | 0.001 |
| Optimizer | Adam |
| Input image size | 640 |
| Ratio of training set to validation set | 8:2 |
Table 4.
Evaluation metrics used in this study, including their mathematical definitions and analytical purposes.
Table 4.
Evaluation metrics used in this study, including their mathematical definitions and analytical purposes.
| Metric | Formula | Definition | Purpose |
|---|
| Loss function (L) | | Measures the error between predicted and ground-truth values across C classes. In YOLO, it combines bounding box regression loss, classification loss, and distribution focal loss. | Evaluates training stability and effectiveness of learning rate scheduling. |
| Precision | | Proportion of correctly predicted positives among all predicted positives. | Reflects the accuracy of positive predictions. |
| Recall | | Proportion of actual positives correctly identified by the model. | Reflects the completeness of positive detection. |
| Mean Average Precision (mAP) | | The mean of AP values across all N classes. Includes (IoU = 0.5) and (averaged across IoU thresholds from 0.5 to 0.95). | Provides a comprehensive measure of detection precision and localization. |
| F1 Score | | Harmonic mean of precision and recall. | Provides a balanced evaluation when precision and recall are uneven. |
Table 5.
Ablation study results on the TPPOL23 dataset.
Table 5.
Ablation study results on the TPPOL23 dataset.
| Experiment ID | Model Variant | Precision (%) | Recall (%) | F1-Score (%) | mAP@0.5 (%) | mAP@0.5:0.95 (%) |
|---|
| 1 | YOLOv11 (Original Backbone) | 78.32 | 88.2 | 83.26 | 80.56 | 70.25 |
| 2 | +ResNet-152 Backbone | 78.59 | 89.5 | 84.05 | 82.32 | 72.1 |
| 3 | +ResNet-152 + Dual-Modal Pooling (Ours) | 84.5 | 91.1 | 87.8 | 81.69 | 73.95 |
| 4 | +ResNet-152 + Standard SE Attention | 88.8 | 90.3 | 89.55 | 93.2 | 73.2 |
| 5 | +ResNet-152 + DPEB (Ours) | 90.1 | 92.4 | 91.25 | 94.8 | 74.9 |
| 6 | +PollenSENet (Full, Ours) | 90.67 | 93.16 | 91.92 | 95.29 | 75.79 |
Table 6.
Comparison of different models on the TPPOL23 dataset.
Table 6.
Comparison of different models on the TPPOL23 dataset.
| Model | Precision (%) | Recall (%) | F1-Score (%) | mAP@0.5 (%) | mAP@0.5:0.95 (%) |
|---|
| YOLOv3 | 82.5 | 87.3 | 84.8 | 86.2 | 71.4 |
| YOLOv4 | 86.9 | 89.2 | 88 | 90.5 | 73.6 |
| YOLOv7 | 88.3 | 90.1 | 89.2 | 92.7 | 74.5 |
| EfficientDet-D3 | 85.6 | 88.7 | 87.1 | 89.8 | 72.1 |
| Pollen-YOLO (Ours) | 90.7 | 93.2 | 91.9 | 95.3 | 75.8 |
Table 7.
Quantitative performance metrics of Pollen-YOLO for individual pollen taxa.
Table 7.
Quantitative performance metrics of Pollen-YOLO for individual pollen taxa.
| Class | Precision | Recall | mAP50 | mAP50-95 | F1-Score |
|---|
| Apiaceae | 0.864 | 0.806 | 0.888 | 0.514 | 0.834 |
| Asteraceae | 0.872 | 0.976 | 0.955 | 0.742 | 0.921 |
| Brassicaceae | 0.917 | 0.89 | 0.952 | 0.718 | 0.903 |
| Betulaceae | 0.853 | 0.937 | 0.951 | 0.762 | 0.893 |
| Caprifoliaceae | 0.951 | 0.964 | 0.992 | 0.928 | 0.957 |
| Chenopodiaceae | 0.938 | 0.913 | 0.968 | 0.83 | 0.925 |
| Corylus | 0.929 | 0.902 | 0.956 | 0.806 | 0.915 |
| Cyperaceae | 0.903 | 0.953 | 0.968 | 0.827 | 0.927 |
| Daphne | 0.948 | 0.925 | 0.981 | 0.789 | 0.936 |
| Elaeagnaceae | 0.896 | 0.947 | 0.937 | 0.703 | 0.921 |
| Gentianaceae | 0.95 | 0.923 | 0.983 | 0.778 | 0.936 |
| Hippophae | 0.843 | 0.965 | 0.906 | 0.765 | 0.900 |
| Juglandaceae | 0.972 | 0.997 | 0.993 | 0.927 | 0.984 |
| Lamiaceae | 0.762 | 0.877 | 0.839 | 0.512 | 0.815 |
| Lycopodiaceae | 0.968 | 0.993 | 0.994 | 0.784 | 0.980 |
| Ostrya | 0.946 | 0.913 | 0.979 | 0.78 | 0.929 |
| Picea | 0.942 | 0.996 | 0.994 | 0.945 | 0.968 |
| Pinus | 0.953 | 0.886 | 0.934 | 0.809 | 0.918 |
| Poaceae | 0.923 | 0.98 | 0.985 | 0.839 | 0.951 |
| Ranunculaceae | 0.838 | 0.881 | 0.919 | 0.585 | 0.859 |
| Salix | 0.938 | 0.953 | 0.979 | 0.778 | 0.945 |
| Ulmus | 0.842 | 0.941 | 0.903 | 0.581 | 0.889 |
| Zygophyllaceae | 0.902 | 0.909 | 0.963 | 0.81 | 0.905 |