Deformable 1D Directional Convolution with Bidirectional Offsets for Oriented Object Detection
Highlights
- A deformable 1D directional convolution is proposed to implement rotated 1D convolution for adaptively extracting the features of oriented objects.
- A tri-branch convolution layer is desinged by combining the deformable 1D directional convolution with the standard square-shaped convolution in a parallel manner.
- An orientation-aware feature pyramid network is presented by integrating the tri-branch convolution layer with the feature pyramid network.
- The proposed deformable 1D directional convolution only requires simple bidirectional offsets to efficiently implement a rotated 1D convolution, avoiding the time-consuming rotation operation.
- The oriented features of objects can be effectively extracted by the orientation-aware feature pyramid network.
- The performance of oriented object detection can be improved by adopting the orientation-aware feature pyramid network.
Abstract
1. Introduction
- We design a simple and effective deformable 1D directional convolution with bidirectional offsets (D1DD-Conv), which can work as a rotated 1D convolution to adaptively extract the features of oriented objects. Different from existing rotated convolutions, our D1DD-Conv only requires a simple bidirectional offset while avoiding time-consuming rotation and interpolation operations.
- We present an orientation-aware feature pyramid network by integrating D1DD-Conv with the commonly used feature pyramid network (FPN) and also design a tri-branch convolution layer by combining the standard convolution with a horizontal convolution and a vertical convolution in a parallel manner. Building upon them, we propose an effective two-stage model for oriented object detection.
- We introduce an additional deep supervision loss function for model training, which adopts the angles of oriented objects estimated by the offsets of D1DD-Conv. In addition, some experiments are conducted on three popular datasets to verify the effectiveness of the aforementioned components.
2. Related Work
2.1. CNN-Based Oriented Object Detection
2.2. Oriented Convolution
3. Method
3.1. Architecture Overview
3.2. Orientation-Aware Feature Pyramid Network
3.3. D1DD-Conv with Bidirectional Offsets
3.4. Detection Head
3.4.1. Oriented RPN Module
3.4.2. Oriented Detection Module
3.5. Loss Functions
4. Results and Discussion
4.1. Experimental Setup
4.1.1. Datasets
4.1.2. Implementation Details
4.1.3. Performance Evaluation Metrics
4.2. Ablation Study
4.3. Experimental Results
4.3.1. Comparisons on DOTA-v1.0 Dataset
4.3.2. Comparisons on HRSC2016 Dataset
4.3.3. Comparisons on UCAS-AOD Dataset
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Xie, X.; Cheng, G.; Wang, J.; Li, K.; Yao, X.; Han, J. Oriented R-CNN and beyond. Int. J. Comput. Vis. 2024, 132, 2420–2442. [Google Scholar] [CrossRef]
- Wang, X.; Han, C.; Huang, L.; Nie, T.; Liu, X.; Liu, H.; Li, M. AG-Yolo: Attention-guided yolo for efficient remote sensing oriented object detection. Remote Sens. 2025, 17, 1027. [Google Scholar] [CrossRef]
- Zhang, C.; Chen, Z.; Xiong, B.; Ji, K.; Kuang, G. EOOD: End-to-end oriented object detection. Neurocomputing 2025, 621, 129251. [Google Scholar] [CrossRef]
- Ahmad, I.; Lu, W.; Chen, S.-B.; Tang, J.; Luo, B. Lightweight oriented object detection with dynamic smooth feature fusion network. Neurocomputing 2025, 628, 129725. [Google Scholar] [CrossRef]
- Pu, Y.; Wang, Y.; Xia, Z.; Han, Y.; Wang, Y.; Gan, W.; Wang, Z.; Song, S.; Huang, G. Adaptive rotated convolution for rotated object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 6589–6600. [Google Scholar]
- Xu, H.; Liu, X.; Xu, H.; Ma, Y.; Zhu, Z.; Yan, C.; Dai, F. Rethinking boundary discountinuity problem for oriented object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–22 June 2024; pp. 17406–17415. [Google Scholar]
- Ma, J.; Shao, W.; Ye, H.; Wang, L.; Wang, H.; Zheng, Y. Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimed. 2018, 20, 3111–3122. [Google Scholar] [CrossRef]
- Pan, X.; Ren, Y.; Sheng, K.; Dong, W.; Yuan, H.; Guo, X.; Ma, C.; Xu, C. Dynamic refinement network for oriented and densely packed object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 11207–11216. [Google Scholar]
- Yang, X.; Yan, J. On the arbitrary-oriented object detection: Classification based approaches revisited. Int. J. Comput. Vis. 2022, 130, 1340–1365. [Google Scholar] [CrossRef]
- Ding, J.; Xue, N.; Long, Y.; Xia, G.-S.; Lu, Q. Learning RoI transformer for oriented object detection in aerial images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 2849–2858. [Google Scholar]
- Yu, Y.; Da, F. On boundary discontinuity in angle regression based arbitrary oriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 6494–6508. [Google Scholar] [CrossRef]
- Wei, L.; Zheng, C.; Hu, Y. Oriented object detection in aerial images based on the scaled smooth l1 loss function. Remote Sens. 2023, 15, 1350. [Google Scholar]
- Chen, Z.; Chen, K.; Lin, W.; See, J.; Yu, H.; Ke, Y.; Yang, C. PIoU loss: Towards accurate oriented object detection in complex environments. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020; pp. 195–211. [Google Scholar]
- Yang, X.; Yang, J.; Yan, J.; Zhang, Y.; Zhang, T.; Guo, Z.; Sun, X.; Fu, K. SCRDet: Towards more robust detection for small, cluttered and rotated objects. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 8232–8241. [Google Scholar]
- Yang, X.; Yan, J.; Ming, Q.; Wang, W.; Zhang, X.; Tian, Q. Rethinking rotated object detection with Gaussian Wasserstein distance loss. In Proceedings of the 38th International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 11830–11841. [Google Scholar]
- Yang, X.; Yang, X.; Yang, J.; Ming, Q.; Wang, W.; Tian, Q.; Yan, J. Learning high-precision bounding box for rotated object detection via Kullback-Leibler divergence. In Proceedings of the 35th Conference on Neural Information Processing Systems, Virtual, 6–14 December 2021; pp. 18381–18394. [Google Scholar]
- Yang, X.; Zhou, Y.; Zhang, G.; Yang, J.; Wang, W.; Yan, J.; Zhang, X.; Tian, Q. The KFIoU loss for rotated object detection. In Proceedings of the International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
- Xu, Y.; Fu, M.; Wang, Q.; Wang, Y.; Chen, K.; Xia, G.-S.; Bai, X. Gliding vertex on the horizental bounding box for multi-oriented object detection. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 43, 1452–1459. [Google Scholar]
- Nie, G.; Huang, H. Multi-oriented object detection in arerial images with double horizontal rectangles. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 4932–4944. [Google Scholar]
- Guo, Z.; Zhang, X.; Liu, C.; Ji, X.; Jiao, J.; Ye, Q. Convex-hull feature adpation for oriented and densely packed object detection. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 5252–5265. [Google Scholar] [CrossRef]
- Zhang, M.; Ouyang, Y.; Yang, M.; Guo, J.; Li, Y. ORPSD: Outer rectangular projection-based representation for oriented ship detection in SAR images. Remote Sens. 2025, 17, 1511. [Google Scholar]
- Zhou, S.; Liu, Z.; Luo, H.; Qi, G.; Liu, Y.; Zuo, H.; Zhang, J.; Wei, Y. GCA2Net: Global-consolidation and angle-adaptive network for oriented object detection in aerial imagery. Remote Sens. 2025, 17, 1077. [Google Scholar] [CrossRef]
- Dang, M.; Xu, Q.; Liu, G.; Li, H.; Wang, X. RA2Net: Rotated alignment and aggregation network for oriented object detection in aerial images. Neurocomputing 2026, 659, 131798. [Google Scholar] [CrossRef]
- Han, J.; Ding, J.; Li, J.; Xia, G.-S. Align deep features for oriented object detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5602511. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-end object detection with transformers. In Proceedings of the European Conference on Computer Vision, Glasgow, UK, 23–28 August 2020; pp. 213–229. [Google Scholar]
- Sun, H.; Liu, T.; Zhang, X.; Guo, Q. Resolution-aware criss-cross attention detector for small object detection in aerial images. In Proceedings of the International Conference on Multimedia Retrieval, Chicago, IL, USA, 30 June–3 July 2025; pp. 1219–1227. [Google Scholar]
- Hui, S.; Guo, Q.; Geng, X.; Zhang, C. Multi-guidance CNNs for salient object detection. Acm Trans. Multimed. Comput. Commun. Appl. 2023, 19, 117. [Google Scholar] [CrossRef]
- Zhao, M.; Guo, Q. Reconstruction-based distillation for anomaly detection. Comput. Graph. 2025, 132, 104328. [Google Scholar] [CrossRef]
- Yang, X.; Yan, J.; Feng, Z.; He, T. R3Det: Refined single-stage detector with feature refinement for rotating object. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; pp. 3163–3171. [Google Scholar]
- Cheng, G.; Wang, J.; Li, K.; Xie, X.; Lang, C.; Yao, Y.; Han, J. Anchor-free oriented proposal generator for object detection. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5625411. [Google Scholar] [CrossRef]
- Zhao, J.; Ding, Z.; Zhou, Y.; Zhu, H.; Du, W.-L.; Yao, R. OrientedFormer: An end-to-end transformer-based oriented object detector in remote sensing images. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5640816. [Google Scholar] [CrossRef]
- Zhao, J.; Ding, Z.; Zhou, Y.; Zhu, H.; Du, W.-L.; Yao, R. ReDiffDet: Rotation-equivariant diffusion model for oriented object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 10–17 June 2025; pp. 24429–24439. [Google Scholar]
- Ou, Z.; Chen, Z.; Shen, S.; Fan, L.; Yao, S.; Song, M.; Hui, P. Free3Net: Gliding free, orientation free, and anchor free network for oriented object detection. IEEE Trans. Multimed. 2023, 25, 7089–7100. [Google Scholar]
- Wang, L.; Zhan, Y.; Liu, W.; Yu, B.; Tao, D. Bounding box vectorization for oriented object detection with tanimoto coefficient regression. IEEE Trans. Multimed. 2024, 26, 5181–5193. [Google Scholar] [CrossRef]
- Dang, M.; Liu, G.; Li, H.; Wang, D.; Pan, R. PRA-Det: Anchor-free oriented object detection with polar radius representation. IEEE Trans. Multimed. 2025, 27, 145–157. [Google Scholar] [CrossRef]
- Zhou, K.; Zhang, M.; Dong, Y.; Tan, J.; Zhao, S.; Wang, H. Vector decomposition-based arbitrary-oriented object detection for optical remote sensing images. Remote Sens. 2023, 15, 4738. [Google Scholar] [CrossRef]
- Xu, C.; Ding, J.; Wang, J.; Yang, W.; Yu, H.; Yu, L.; Xia, G.-S. Dynamic coarse-to-fine learning for oriented tiny object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 7318–7328. [Google Scholar]
- Yuan, X.; Zheng, Z.; Li, Y.; Liu, X.; Liu, L.; Li, X.; Hou, Q.; Cheng, M.-M. Strip R-CNN: Large strip convolution for remote sensing object detection. arXiv 2025, arXiv:2501.03775. [Google Scholar] [CrossRef]
- Chen, C.; Ling, Q. Adaptive convolution for object detection. IEEE Trans. Multimed. 2019, 21, 3205–3217. [Google Scholar] [CrossRef]
- Freeman, W.T.; Adelson, E.H. The design and use of steerable filters. IEEE Trans. Pattern Anal. Mach. Intell. 1991, 13, 891–906. [Google Scholar] [CrossRef]
- Weiler, M.; Hamprecht, F.A.; Storath, M. Learning steerable filters for rotation equivalent CNNs. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 849–858. [Google Scholar]
- Worrall, D.E.; Garbin, S.J.; Turmukhambetov, D.; Brostow, G.J. Harmonic networks: Deep translation and rotation equivariance. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 5028–5037. [Google Scholar]
- Dai, J.; Qi, H.; Xiong, Y.; Li, Y.; Zhang, G.; Hu, H.; Wei, Y. Deformable convolutional networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 764–773. [Google Scholar]
- Jeon, Y.; Kim, J. Active convolution: Learning the shape of convolution for image classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 4201–4209. [Google Scholar]
- Kirchmeyer, A.; Deng, J. Convolutional networks with oriented 1d kernels. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 6222–6232. [Google Scholar]
- Dang, M.; Liu, G.; Kong, A.W.-K.; Zheng, Z.; Luo, N. RO2-DETR: Rotation-equivariant oriented object detection transformer with 1D rotated convolution kernel. Isprs J. Photogramm. Remote Sens. 2025, 228, 166–178. [Google Scholar]
- Lin, T.-Y.; Dollár, P.; Girshick, R. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Ding, X.; Guo, Y.; Ding, G.; Han, J. ACNet: Strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October 2019–2 November 2019; pp. 1911–1920. [Google Scholar]
- Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable DETR: Deformable transformers for end-to-end object detection. In Proceedings of the International Conference on Learning Representations, Virtual, 3–7 May 2021. [Google Scholar]
- Xia, G.-S.; Bai, X.; Ding, J.; Zhu, Z.; Belongie, S.; Luo, J.; Datcu, M.; Pelillo, M.; Zhang, L. DOTA: A large-scale dataset for object detection in aerial images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 3974–3983. [Google Scholar]
- Liu, Z.; Wang, H.; Weng, L.; Yang, Y. Ship rotated bounding box space for ship extraction from high-resolution optical satellite images with complex backgrounds. IEEE Geosci. Remote Sens. Lett. 2016, 13, 1074–1078. [Google Scholar] [CrossRef]
- Zhu, H.; Chen, X.; Dai, W.; Fu, K.; Ye, Q.; Jiao, J. Orientation robust object detection in aerial images using deep convolutional neural network. In Proceedings of the IEEE International Conference on Image Processing, Quebec City, QC, Canada, 27–30 September 2015; pp. 3735–3739. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.-J.; Li, K.; Fei-Fei, L. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar]
- Zhou, Y.; Yang, X.; Zhang, G.; Wang, J.; Liu, Y.; Hou, L.; Jiang, X.; Liu, X.; Yan, J.; Lyu, C.; et al. MMRotate: A rotated object detection benchmark using PyTorch. In Proceedings of the ACM International Conference on Multimedia, Lisboa, Portugal, 10–14 October 2022; pp. 7331–7334. [Google Scholar]
- Han, J.; Ding, J.; Xue, N.; Xia, G.-S. Redet: A rotation-equivariant detector for aerial object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 2786–2795. [Google Scholar]
- Guo, Q.; Zhang, Y.; Qiu, S.; Zhang, C. Accelerating patch-based low-rank image restoration using kd-forest and Lanczos approximation. Inf. Sci. 2021, 556, 177–193. [Google Scholar] [CrossRef]
- Pu, X.; Xu, F. Low-rank adaption on transformer-based oriented object detector for satellite onboard processing of remote sensing images. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5202213. [Google Scholar] [CrossRef]







| Method | Backbone | Orientation-Aware | Tri-Branch | Deep Supervision | DOTA-v1.0 | UCAS-AOD | HRSC2016 | |
|---|---|---|---|---|---|---|---|---|
| FPN | Convolution Layer | Loss | mAP | mAP | mAP2007 | mAP2012 | ||
| Oriented R-CNN | ResNet-101 | ✗ | ✗ | ✗ | 76.26 | 90.11 | 90.50 | 97.60 |
| Ours | ResNet-101 | ✓ | ✗ | ✗ | 76.93 | 90.24 | 90.54 | 97.66 |
| ✓ | ✓ | ✗ | 77.58 | 90.29 | 96.56 | 97.71 | ||
| ✓ | ✓ | ✓ | 78.02 | 90.35 | 90.57 | 97.74 | ||
| Method | Backbone | PL | BD | BR | GTF | SV | LV | SH | TC | BC | ST | SBF | RA | HA | SP | HC | mAP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RoI Transformer | ResNet-101 | 88.57 | 78.21 | 43.51 | 75.77 | 68.64 | 73.56 | 83.31 | 90.66 | 77.00 | 81.29 | 58.31 | 53.40 | 62.65 | 58.71 | 47.68 | 69.42 |
| Gliding Vertex | ResNet-101 | 89.61 | 84.87 | 52.06 | 77.13 | 72.81 | 73.07 | 86.65 | 90.59 | 78.82 | 86.63 | 59.41 | 70.68 | 72.76 | 70.64 | 57.18 | 74.86 |
| ReDet | ReRNet-50 | 88.68 | 82.61 | 53.85 | 74.02 | 78.06 | 84.00 | 87.97 | 90.74 | 87.69 | 85.57 | 61.77 | 60.53 | 75.99 | 68.01 | 63.31 | 76.19 |
| Oriented R-CNN | ResNet-101 | 88.81 | 83.46 | 55.25 | 76.81 | 74.19 | 82.06 | 87.50 | 90.87 | 85.43 | 85.35 | 65.50 | 66.79 | 74.37 | 70.16 | 57.30 | 76.26 |
| Ours | ResNet-101 | 88.89 | 85.27 | 55.88 | 76.51 | 78.67 | 83.22 | 87.99 | 90.87 | 87.69 | 86.85 | 66.23 | 67.11 | 76.12 | 74.29 | 64.71 | 78.02 |
| Method | Backbone | mAP2007 | mAP2012 |
|---|---|---|---|
| RoI Transformer | ResNet-101 | 86.00 | – |
| Gliding Vertex | ResNet-101 | 88.20 | – |
| ReDet | ReRNet-50 | 90.46 | 97.63 |
| Oriented R-CNN | ResNet-101 | 90.50 | 97.60 |
| Ours | ResNet-101 | 90.57 | 97.74 |
| Method | Backbone | Airplane | Car | mAP |
|---|---|---|---|---|
| RoI Transformer | ResNet-101 | 90.01 | 88.03 | 89.03 |
| Gliding Vertex | ResNet-101 | 90.15 | 89.42 | 89.97 |
| ReDet | ReRNet-50 | 90.24 | 89.73 | 90.02 |
| Oriented R-CNN | ResNet-101 | 90.32 | 89.90 | 90.11 |
| Ours | ResNet-101 | 90.47 | 90.22 | 90.35 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Li, Y.; Li, X.; Zhang, C. Deformable 1D Directional Convolution with Bidirectional Offsets for Oriented Object Detection. Remote Sens. 2026, 18, 934. https://doi.org/10.3390/rs18060934
Li Y, Li X, Zhang C. Deformable 1D Directional Convolution with Bidirectional Offsets for Oriented Object Detection. Remote Sensing. 2026; 18(6):934. https://doi.org/10.3390/rs18060934
Chicago/Turabian StyleLi, Ying, Xuemei Li, and Caiming Zhang. 2026. "Deformable 1D Directional Convolution with Bidirectional Offsets for Oriented Object Detection" Remote Sensing 18, no. 6: 934. https://doi.org/10.3390/rs18060934
APA StyleLi, Y., Li, X., & Zhang, C. (2026). Deformable 1D Directional Convolution with Bidirectional Offsets for Oriented Object Detection. Remote Sensing, 18(6), 934. https://doi.org/10.3390/rs18060934

