Object Detection of Northern Chinese Rock Art Images Using YOLOv8 with Omni-Dimensional Dynamic Convolution
Abstract
1. Introduction
- To address discontinuous carving lines, abstract symbols, and subtle inter-class differences in Northern rock art images, we propose a dynamic feature enhancement algorithm based on YOLOv8. Differing from prior dynamic convolution applications in regular, continuous textures, the method embeds Omni-Dimensional Dynamic Convolution (ODConv) into key Backbone and Neck layers. This enables adaptive operations across spatial, channel, and kernel dimensions, overcoming static convolution limitations and extending dynamic convolution to highly abstract symbolic visual structures.
- A dedicated dataset for Northern rock art recognition was constructed, consisting of three categories: Deer, Human_Horse, and Anthropomorphic_Face. Data augmentation techniques such as brightness adjustment, noise simulation, and geometric transformations were applied to improve model robustness under varying illumination, image noise, and structural degradation.
- Baseline models including SSD, YOLOv5, YOLOv8, and YOLOv8–Backbone–ODConv were used for comparison experiments. Evaluation metrics included F1-score, Recall, mean Average Precision, and FPS. Experimental results demonstrate that YOLOv8-ODConv achieves superior recognition accuracy, stability, and adaptability on complex cultural heritage images.
2. Related Work
2.1. Rock Art Research and Digital Development
2.2. Cultural Heritage Image Recognition
2.3. YOLOv8 and Dynamic Convolution Improvements
2.4. Summary of Current Research and Problem Formulation
3. Materials and Methods
3.1. Dataset Construction and Preprocessing
3.1.1. Image Acquisition and Category Definition
3.1.2. Image Preprocessing and Data Augmentation
- 1.
- Image Darkening
- 2.
- Image Brightening
- 3.
- Salt-and-Pepper Noise
- 4.
- Rotation
3.2. Key Techniques and Model Construction
3.2.1. Applicability of YOLOv8 in Rock Art Image Analysis
3.2.2. Omni-Dimensional Dynamic Convolution (ODConv)
- 1.
- Spatial Attention ()
- 2.
- Input Channel Attention ()
- 3.
- Output Channel Attention (Filter Attention, )
- 4.
- Kernel Attention ()
3.2.3. YOLOv8-ODConv Network Architecture and Integration Strategy
3.3. Experimental Environment and Training Settings
3.4. Evaluation Metrics
4. Results
4.1. Training Process and Detection Performance of YOLOv8-ODConv
4.2. Comparison with Different Models and Ablation Analysis
4.3. Visualization of Detection Results
5. Discussion
5.1. Mechanism of Performance Improvement in YOLOv8-ODConv
5.2. Performance Differences Across Categories and Visual Interpretation
5.3. Role of Dynamic Convolution in Complex Visual Environments
5.4. Limitations and Future Work
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Bednarik, R.G. The Main Problems in Rock Art Research. Man India 2008, 88, 199–213. [Google Scholar]
- Ilmi, M.M.; Maryanti, E.; Nurdini, N.; Setiawan, P.; Kadja, G.T.M.; Ismunandar. A review of radiometric dating and pigment characterizations of rock art in Indonesia. Archaeol. Anthropol. Sci. 2021, 13, 120. [Google Scholar] [CrossRef]
- Jalandoni, A.; Zhang, Y.S.; Zaidi, N.A. On the use of Machine Learning methods in rock art research with application to automatic painted rock art identification. J. Archaeol. Sci. 2022, 144, 105629. [Google Scholar] [CrossRef]
- Zhang, W.J. Types and Distribution of Yinshan Rock Art. Res. Front. Archaeol. 2012, 265–273. Available online: https://d.wanfangdata.com.cn/thesis/J0066231 (accessed on 25 May 2026).
- Wei, J.Z.Y. Types and Chronology of Rock Art in the Great Bend Region of the Yellow River from an Archaeological Perspective. Qinghai Natl. Stud. 2024, 127–133. [Google Scholar] [CrossRef]
- Liu, Y. Image Modeling and Cultural Implications of Regional Rock Art in Inner Mongolia. Ph.D. Thesis, Fudan University, Shanghai, China, 2012. Available online: https://kns.cnki.net/kcms2/article/abstract?v=A2Z-m-A1gckbeDxy4NRd7sjyho4pIkb_uglRx_C4lPFnWUieEzi1c1uOCNRYkxi3efrzaUnOBjlDq195cqPNWeLHzHZvmudlawZ_lYh-MbsDn10Z4IYjpbd5e2JLSpO4rSZ-zC3HCQRcXSPpTK8-yxLhWDCLZAWUTHo05bFxuxkh2NJwnTV4qqKfRu8K5ooy&uniplatform=NZKPT&language=CHS (accessed on 1 May 2026).
- Ning, Y. A Study of Stylized Deer Rock Art in Northwest China. Res. Front. Archaeol. 2024, 338–352. Available online: https://kns.cnki.net/kcms2/article/abstract?v=CP5zkKmU_7eAnGDn6J5TmZPlpkP04Sbd94PbZ7qfzS9DPOJ8oq5_uFvUnbhp-UcGqfcw2bV5R9QsdWbo-kDK27yRLUVPndosWkxGFvfAMxN1IpI-YbFwrHemTznt0SrPMd6LPLF3D1CBXpep7mIB-0YU2_0rMZEe934GgNg7RTfoZsHsZT2zdKpy2VS7l4XG&uniplatform=NZKPT&captchaId=d5ef201c-10e5-4704-812a-340e80281f1e (accessed on 25 May 2026).
- Zhang, L.M. The Inspiration of “Human Face Images” in Northern Rock Art for Contemporary Visual Design. Master’s Thesis, Lanzhou University of Finance and Economics, Lanzhou, China, 2024. Available online: https://link.cnki.net/doi/10.27732/d.cnki.gnzsx.2024.000145 (accessed on 25 May 2026).
- Wang, X.; Cheng, Y.; Zhang, R.Y.; Fan, Y.; Huang, J.Z.; Zhang, Y.; Yan, H.B. Classification of Weathering Environments for Stone Cultural Heritage Based on Hyperspectral Imaging Technology. Laser Optoelectron. Prog. 2025, 62, 1037005. [Google Scholar] [CrossRef]
- Guo, T.; Xu, X. Salient object detection from low contrast images based on local contrast enhancing and non-local feature learning. Vis. Comput. 2021, 37, 2069–2081. [Google Scholar] [CrossRef]
- Sultan, W.; Anjum, N.; Stansfield, M.; Ramzan, N. Hybrid Local and Global Deep-Learning Architecture for Salient-Object Detection. Appl. Sci. 2020, 10, 8754. [Google Scholar] [CrossRef]
- Wei, X.-S.; Song, Y.-Z.; Mac Aodha, O.; Wu, J.; Peng, Y.; Tang, J.; Yang, J.; Belongie, S. Fine-grained image analysis with deep learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 8927–8948. [Google Scholar] [CrossRef] [PubMed]
- Ji, N.; Ju, F.; Wang, Q. An Application Study on Digital Image Classification and Recognition of Yunnan Jiama Based on a YOLO-GAM Deep Learning Framework. Appl. Sci. 2026, 16, 1551. [Google Scholar] [CrossRef]
- Gao, C.; Zhao, G.F.; Gao, S.; Du, S.X.; Kim, E.; Shen, T. Advancing architectural heritage: Precision decoding of East Asian timber structures from Tang dynasty to traditional Japan. Herit. Sci. 2024, 12, 219. [Google Scholar] [CrossRef]
- Zhang, S.Z.; Liu, Z.H.; Wang, K.P.; Huang, W.W.; Li, P. OBC-YOLOv8: An improved road damage detection model based on YOLOv8. Peerj Comput. Sci. 2025, 11, e2593. [Google Scholar] [CrossRef] [PubMed]
- Xu, S.X.; Huang, W.J.; Wang, D.C.; Zhang, B.Y.; Sun, H.; Yan, J.Y.; Ding, J.L.; Wang, J.J.; Yang, Q.L.; Huang, T.C.; et al. Automatic pine wilt disease detection based on improved YOLOv8 UAV multispectral imagery. Ecol. Inform. 2024, 84, 102846. [Google Scholar] [CrossRef]
- Huang, Z.B.; Li, X.H.; Fan, S.T.; Liu, Y.; Zou, H.; He, X.C.; Xu, S.; Zhao, J.H.; Li, W.F. ORD-YOLO: A Ripeness Recognition Method for Citrus Fruits in Complex Environments. Agriculture 2025, 15, 1711. [Google Scholar] [CrossRef]
- Zhang, Y.J.; Gao, G.F.; Chen, Y.D.; Yang, Z.J. ODD-YOLOv8: An algorithm for small object detection in UAV imagery. J. Supercomput. 2025, 81, 202, Correction in J. Supercomput. 2025, 81, 384. https://doi.org/10.1007/s11227-024-06829-9. [Google Scholar] [CrossRef]
- Li, Y.; Zhao, M.; Mao, J.; Chen, Y.; Zheng, L.; Yan, L. Detection and recognition of Chinese porcelain inlay images of traditional Lingnan architectural decoration based on YOLOv4 technology. Herit. Sci. 2024, 12, 137. [Google Scholar] [CrossRef]
- Prasomphan, S. Toward Fine-grained Image Retrieval with Adaptive Deep Learning for Cultural Heritage Image. Comput. Syst. Sci. Eng. 2023, 44, 1295–1307. [Google Scholar] [CrossRef]
- Wu, H.Y.; Kong, L.Y.; Liu, D.H. Crack Detection on Road Surfaces Based on Improved YOLOv8. IEEE Access 2024, 12, 190850–190864. [Google Scholar] [CrossRef]
- Xi, X.; Wang, J.; Li, F.; Li, D. IRSDet: Infrared small-object detection network based on sparse-skip connection and guide maps. Electronics 2022, 11, 2154. [Google Scholar] [CrossRef]
- Bai, L.; Xia, C.; Liu, F.; Yang, X.; Zhang, T. Full-dimensional dynamic convolution and progressive learning strategy for strawberry recognition based on YOLOv8. Front. Plant Sci. 2025, 16, 1541365. [Google Scholar] [CrossRef]
- Mao, Q.; Wang, Y.; Xue, X.; Zhou, T.; Su, Y. Intelligent Identification Method for Abnormal Pipelines in Coal Mine Roadways Based on Improved YOLOv8n. Ind. Mine Autom. 2026, 52, 63–72. [Google Scholar]
- Chen, Z.X.; Feng, J.J.; Zhu, X.Y.; Wang, B. YOLOv8-OCHD: A Lightweight Wood Surface Defect Detection Method Based on Improved YOLOv8. IEEE Access 2025, 13, 84435–84450. [Google Scholar] [CrossRef]
- Ning, T.; Fu, M.; Wang, Y.Z.; Duan, X.D.; Abedin, M.Z. Application of deep learning for automatic detection of table tennis balls from an intelligent serving machine. Appl. Soft Comput. 2024, 167, 112329. [Google Scholar] [CrossRef]
- Jalandoni, A.; Taçon, P.S.C. A New Recording and Interpretation of the Rock Art of Angono, Rizal, Philippines. Rock Art Res. 2018, 35, 47–61. [Google Scholar]
- Li, L.; Huang, W.; Gu, I.Y.-H.; Tian, Q. Statistical modeling of complex backgrounds for foreground object detection. IEEE Trans. Image Process. 2004, 13, 1459–1472. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.; Yin, G.; Sun, K.; Dong, Y. Multi-view robust discriminative feature learning for remote sensing image with noisy labels. Mob. Netw. Appl. 2022, 27, 2487–2505. [Google Scholar] [CrossRef]
- Jin, W.N.; Zhang, Y.S.; Xing, J. Complete Works of Chinese Art Rock Art and Engraving Prints; Huangshan Publishing House: Hefei, China, 2010. [Google Scholar]
- Editorial Committee of Complete Collection of Chinese Rock Art. Complete Collection of Chinese Rock Art Northern Rock Art; Liaoning Fine Arts Publishing House: Shenyang, China, 2006. [Google Scholar]
- Choi, Y.; Seo, S.; Jang, H.; Yoon, D. Improving the Performance of Deep-Learning-Based Ground-Penetrating Radar Cavity Detection Model using Data Augmentation and Ensemble Techniques. Geophys. Geophys. Explor. 2023, 26, 211–228. [Google Scholar]
- Lerch, L.; Huber, L.S.; Kamath, A.; Pöllinger, A.; Pahud de Mortanges, A.; Obmann, V.C.; Dammann, F.; Senn, W.; Reyes, M. DreamOn: A data augmentation strategy to narrow the robustness gap between expert radiologists and deep learning classifiers. Front. Radiol. 2024, 4, 1420545. [Google Scholar] [CrossRef]
- Tran, T.L.C.; Huang, Z.-C.; Tseng, K.-H.; Chou, P.-H. Detection of bottle marine debris using unmanned aerial vehicles and machine learning techniques. Drones 2022, 6, 401. [Google Scholar] [CrossRef]
- Hao, S.; Han, X.; Guo, Y.; Wang, M. Decoupled low-light image enhancement. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 2022, 18, 1–19. [Google Scholar] [CrossRef]
- Singh, S.; Kumari, R.; Pallavi, P.; Saurabh, P. A systematic review of deep learning methods for low-light image enhancement and object detection. Discov. Appl. Sci. 2026, 8, 241. [Google Scholar] [CrossRef]
- Djurović, I. BM3D filter in salt-and-pepper noise removal. EURASIP J. Image Video Process. 2016, 2016, 13. [Google Scholar] [CrossRef]
- Li, C.; Zhou, A.; Yao, A. Omni-dimensional dynamic convolution. arXiv 2022, arXiv:2209.07947. [Google Scholar] [CrossRef]
- Kuang, X.; Tao, B. ODGNet: Robotic grasp detection network based on omni-dimensional dynamic convolution. Appl. Sci. 2024, 14, 4653. [Google Scholar] [CrossRef]









| Rock Art Category | Example Images | |||
|---|---|---|---|---|
| Anthropomorphic_Face | ![]() | ![]() | ![]() | ![]() |
| Deer | ![]() | ![]() | ![]() | ![]() |
| Human_Horse | ![]() | ![]() | ![]() | ![]() |
| Original Image | Darkening | Brightening | Salt-and-Pepper Noise | Rotation |
|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() |
![]() | ![]() | ![]() | ![]() | ![]() |
![]() | ![]() | ![]() | ![]() | ![]() |
| Modules | F1 | Recall | mAP@0.5 | mAP@0.5–0.95 | FPS | FLOPs/G |
|---|---|---|---|---|---|---|
| SSD | 82.5% | 79.8% | 85.6% | 70.8% | 102.5 | 23.7 |
| YOLOv5 | 92.7% | 86.7% | 95.8% | 82.1% | 94.3 | 10.5 |
| YOLOv8 | 93.0% | 88.5% | 97.3% | 83.9% | 91.4 | 8.6 |
| YOLOv8–Backbone–ODConv | 93.8% | 88.4% | 97.6% | 83.7% | 90.2 | 8.1 |
| YOLOv8–Neck–ODConv | 94.1% | 88.6% | 98.1% | 83.8% | 90.0 | 7.9 |
| YOLOv8-ODConv | 95.0% | 89.3% | 98.9% | 84.6% | 89.7 | 7.2 |
| YOLOv8 | YOLOv8-ODConv |
|---|---|
![]() | ![]() |
![]() | ![]() |
![]() | ![]() |
![]() | ![]() |
![]() | ![]() |
![]() | ![]() |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Guo, L.; Ju, F. Object Detection of Northern Chinese Rock Art Images Using YOLOv8 with Omni-Dimensional Dynamic Convolution. Appl. Sci. 2026, 16, 5522. https://doi.org/10.3390/app16115522
Guo L, Ju F. Object Detection of Northern Chinese Rock Art Images Using YOLOv8 with Omni-Dimensional Dynamic Convolution. Applied Sciences. 2026; 16(11):5522. https://doi.org/10.3390/app16115522
Chicago/Turabian StyleGuo, Lizhong, and Fei Ju. 2026. "Object Detection of Northern Chinese Rock Art Images Using YOLOv8 with Omni-Dimensional Dynamic Convolution" Applied Sciences 16, no. 11: 5522. https://doi.org/10.3390/app16115522
APA StyleGuo, L., & Ju, F. (2026). Object Detection of Northern Chinese Rock Art Images Using YOLOv8 with Omni-Dimensional Dynamic Convolution. Applied Sciences, 16(11), 5522. https://doi.org/10.3390/app16115522








































