MDPI - Publisher of Open Access Journals

41 pages, 4814 KB

Open AccessArticle

CORE-Net: A Collaborative Optimization Framework for Rotated Ship Detection in Complex SAR Scenes

by Yongqi Kang and Haiping Qu

Sensors 2026, 26(12), 3707; https://doi.org/10.3390/s26123707 - 10 Jun 2026

Viewed by 209

Rotated ship detection in complex synthetic aperture radar (SAR) scenes remains a critical yet challenging task for maritime remote sensing applications. Existing methods are plagued by three core bottlenecks: inconsistent directional responses across multi-scale features, unstable rotation angle regression, and non-uniform supervision quality [...] Read more.

Rotated ship detection in complex synthetic aperture radar (SAR) scenes remains a critical yet challenging task for maritime remote sensing applications. Existing methods are plagued by three core bottlenecks: inconsistent directional responses across multi-scale features, unstable rotation angle regression, and non-uniform supervision quality of positive samples during training, which collectively lead to elevated false alarms, missed detections, and severe localization degradation, especially under high IoU thresholds in complex inshore environments. To address these challenges, we propose CORE-Net, a collaborative optimization framework integrating three dedicated modules in the forward detection stage: a Rotation-Consistent Feature Pyramid (RCFP) to alleviate cross-scale directional mismatch, a Progressive Cascade Rotation Head (PCR Head) to improve progressive angle prediction stability, and an Orientation-Aware Regression Enhancement Unit (OAREU) to strengthen directional geometric representation in regression features, alongside an Uncertainty-Aware Sample Reliability Steering (UARS) module for training-stage optimization to softly downweight the regression contribution of positive samples with high classification confidence but low geometric consistency. Extensive experiments on three public SAR ship detection datasets (RSDD-SAR, SSDD+, and RSAR) demonstrate that the proposed method consistently improves AP_50:95 while maintaining high Recall and Precision, validating that joint optimization of feature representation, rotated regression, and sample reliability is an effective strategy to enhance both the robustness and fine-grained localization capability of rotated ship detection in complex SAR scenes. In addition, large-scene inference experiments on uncropped Sentinel-1 and RSDD-SAR images further demonstrate that CORE-Net can be extended from patch-based evaluation to high-resolution SAR scene interpretation using a sliding-window inference strategy. Full article

(This article belongs to the Special Issue Application of SAR and Remote Sensing Technology in Earth Observation)

20 pages, 5808 KB

Open AccessTechnical Note

LMRD: A Large-Scale Multi-Source Rotated Dataset for SAR Ship Detection

by Yujia Cheng, Zhaocheng Wang, Yu Chen, Yu Zhang, Yong Chen and Hongdong Zhao

Remote Sens. 2026, 18(10), 1639; https://doi.org/10.3390/rs18101639 - 20 May 2026

Viewed by 154

Abstract

The rapid development of synthetic aperture radar (SAR) imaging technology has significantly enhanced maritime monitoring capabilities; however, SAR ship detection remains constrained by the limited scale and representation capacity of existing rotated bounding box datasets. Most publicly available datasets rely on horizontal annotations, [...] Read more.

The rapid development of synthetic aperture radar (SAR) imaging technology has significantly enhanced maritime monitoring capabilities; however, SAR ship detection remains constrained by the limited scale and representation capacity of existing rotated bounding box datasets. Most publicly available datasets rely on horizontal annotations, which introduce redundancy and localization ambiguity in densely distributed and nearshore scenarios. Although rotated bounding boxes provide more precise geometric representation, large-scale multi-source rotated SAR datasets are still insufficient to support robust model training. To address this limitation, we construct a large-scale multi-source rotated SAR ship dataset (LMRD) consisting of 13,024 high-resolution image chips with over 38,000 annotated ship instances, covering multiple satellite sources, polarization modes, and diverse maritime environments, including offshore, nearshore, complex coastal, and densely distributed port scenes, thereby enhancing scene diversity and annotation precision. Furthermore, independent of the dataset construction, we propose a multi-domain feature fusion (MDF) framework built upon Oriented RCNN, which integrates high-frequency information and visual saliency cues to improve feature representation under complex backgrounds. Experimental results on the LMRD demonstrate that, compared with the baseline Oriented RCNN, the proposed MDF framework achieves a 2.7% improvement in average precision. Additional analysis indicates that the dataset characteristics and the multi-domain fusion strategy contribute to performance enhancement at different stages of the detection pipeline, validating the effectiveness of the proposed dataset for rotated ship detection while demonstrating the complementary role of multi-domain feature enhancement. Full article

(This article belongs to the Special Issue SAR Monitoring of Marine and Coastal Environments)

► Show Figures

Figure 1

27 pages, 18185 KB

Open AccessArticle

SAR-Based Rotated Ship Detection in Coastal Regions Combining Attention and Dynamic Angle Loss

by Ning Wang, Wenxing Mu, Yixuan An and Tao Liu

Electronics 2026, 15(8), 1557; https://doi.org/10.3390/electronics15081557 - 8 Apr 2026

Viewed by 470

Abstract

With the expanding application of synthetic aperture radar (SAR) in ocean monitoring and port regulation, nearshore ship detection based on SAR image faces notable challenges arising from strong background scattering, dense target occlusion, and large pose variations. Therefore, this paper proposes a two-stage [...] Read more.

With the expanding application of synthetic aperture radar (SAR) in ocean monitoring and port regulation, nearshore ship detection based on SAR image faces notable challenges arising from strong background scattering, dense target occlusion, and large pose variations. Therefore, this paper proposes a two-stage oriented detection network named EARS-Net to improve the accuracy of ship detection in complex nearshore environments. Specifically, a lightweight convolutional block attention module (CBAM) is embedded into the high-level semantic stages of ResNet50 to enhance discriminative ship features while suppressing interference from port infrastructures and shoreline structures. Then, the dynamic angle regression loss (DAL) is proposed, and the angle weight function is designed according to the ship direction distribution characteristics, which allocates higher regression weight to the ship target with larger tilt angle, improving the defect of insufficient positioning accuracy for large angle ships. Moreover, a training strategy that combines focal loss, multi-scale training, and rotated online hard example mining (ROHEM) is employed to alleviate sample imbalance and improve generalization in dense scenes. Experimental results on the nearshore subset of the SSDD show that EARS-Net achieves an average precision (AP) of 0.903 on the test set, demonstrating reliable detection capability under complex backgrounds and dense target distributions. These results validate the effectiveness of our method and highlight its potential as a practical engineering solution for enhancing port situational awareness and coastal security monitoring. Full article

(This article belongs to the Topic Radar Signal and Data Processing with Applications, 2nd Edition)

► Show Figures

Figure 1

24 pages, 3448 KB

Open AccessArticle

Gaussian-Guided Stage-Aware Deformable FPN with Coarse-to-Fine Unit-Circle Resolver for Oriented SAR Ship Detection

by Liangjie Meng, Qingle Guo, Danxia Li, Jinrong He and Zhixin Li

Remote Sens. 2026, 18(7), 1019; https://doi.org/10.3390/rs18071019 - 29 Mar 2026

Viewed by 413

Abstract

Synthetic Aperture Radar (SAR) enables all-weather maritime surveillance, yet ship-oriented bounding box (OBB) detection remains challenging in complex scenes. Strong sea clutter and dense harbor scatterers often mask the slender characteristics of ships as well as the weak responses of small ships. Meanwhile, [...] Read more.

Synthetic Aperture Radar (SAR) enables all-weather maritime surveillance, yet ship-oriented bounding box (OBB) detection remains challenging in complex scenes. Strong sea clutter and dense harbor scatterers often mask the slender characteristics of ships as well as the weak responses of small ships. Meanwhile, the periodicity of angle parameterization introduces regression discontinuities, and near-symmetric, bright-scatterer-dominated signatures further cause heading ambiguity, undermining the stability of orientation prediction. Moreover, in most detectors, multi-scale feature fusion and angle estimation lack explicit coordination, and rotated-box localization performance is often jointly affected by feature degradation and unstable orientation prediction. To this end, we propose a unified framework that simultaneously strengthens multi-scale representations and stabilizes orientation modeling. Specifically, we design a Gaussian-Guided Stage-Aware Deformable Feature Pyramid Network (GSDFPN) and a Coarse-to-Fine Unit-Circle Resolver (CF-UCR). GSDFPN enhances multi-scale fusion with two plug-in components: (i) a Gaussian-guided High-level Semantic Refinement Module (GHSRM) that suppresses clutter-dominated semantics while strengthening ship-responsive cues, and (ii) a Stage-aware Deformable Fusion Module (SDFM) for low-level features, which disentangles channels into a geometry-preserving spatial stream and a clutter-resistant semantic stream, and couples them via deformable interaction with bidirectional cross-stream gating to better capture the inherent slender characteristics of ships and localize small ships. For orientation, CF-UCR decomposes angle prediction into direction-cluster classification and intra-cluster residual regression on the unit circle, effectively mitigating periodicity-induced discontinuities and stabilizing rotated-box estimation. On SSDD+ and RSDD, our method achieves AP/AP₅₀/AP₇₅ of 0.5390/0.9345/0.4529 and 0.4895/0.9210/0.4712, respectively, while reaching AP_s75/AP_m75/AP_l75 of 0.5614/0.8300/0.8392 and 0.4986/0.8163/0.8934, evidencing strong rotated-box localization across target scales in complex maritime scenes. Full article

(This article belongs to the Topic Computer Vision and Image Processing, 3rd Edition)

► Show Figures

Figure 1

27 pages, 10703 KB

Open AccessArticle

WE-KAN: SAR Image Rotated Object Detection Method Based on Wavelet Domain Feature Enhancement and KAN Prediction Head

by Mingchun Li, Yang Liu, Qiang Wang and Dali Chen

Sensors 2026, 26(7), 2011; https://doi.org/10.3390/s26072011 - 24 Mar 2026

Cited by 1 | Viewed by 479

Abstract

Synthetic aperture radar (SAR) imagery plays a vital role in critical applications such as military reconnaissance and disaster monitoring. These applications require high detection accuracy. Therefore, rotated object detection has gained increasing attention. By predicting an object orientation angle, it offers advantages over [...] Read more.

Synthetic aperture radar (SAR) imagery plays a vital role in critical applications such as military reconnaissance and disaster monitoring. These applications require high detection accuracy. Therefore, rotated object detection has gained increasing attention. By predicting an object orientation angle, it offers advantages over horizontal bounding boxes, especially for elongated structures such as ships and bridges in SAR scenes. However, challenges such as speckle noise and complex backgrounds in SAR imagery still hinder high-precision detection. To address this, we propose WE-KAN, a novel rotated object detection framework based on wavelet features and Kolmogorov–Arnold network (KAN) prediction. First, we enhance the backbone by incorporating wavelet domain features from SAR grayscale images. The extracted wavelet domain features and image features are fused by a proposed attention module. Second, considering the sensitivity to angle prediction, we design a angle predictor based on KAN. This architecture provides a powerful and dedicated solution for accurate angle regression. Finally, for precise rotated bounding box regression, we employ a joint loss function combining a rotated intersection over union (RIoU) with a Gaussian distance loss function. These designs improve the model’s robustness to noise and its perception of fine object structures. When evaluated on the large-scale public RSAR dataset, our method achieves an AP50 of 70.1 and a mAP of 35.9 under the same training schedule and backbone network, significantly outperforming existing baselines. This demonstrates the effectiveness and robustness of our method for dense, small, and highly oriented objects in complex SAR scenes. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

23 pages, 3059 KB

Open AccessArticle

Research on Ship Target Detection in Complex Sea Surface Scenarios Based on Improved YOLOv7

by Zhuang Cai and Weina Zhou

Appl. Sci. 2026, 16(4), 1769; https://doi.org/10.3390/app16041769 - 11 Feb 2026

Cited by 1 | Viewed by 538

Abstract

Ships target detection plays a crucial role in safeguarding maritime transportation. However, affected by factors such as ocean waves, extreme weather, and target diversity (e.g., large size differences, arbitrary rotation, and occlusion), existing deep learning-based detection methods struggle to achieve a satisfactory balance [...] Read more.

Ships target detection plays a crucial role in safeguarding maritime transportation. However, affected by factors such as ocean waves, extreme weather, and target diversity (e.g., large size differences, arbitrary rotation, and occlusion), existing deep learning-based detection methods struggle to achieve a satisfactory balance among accuracy, speed, and model size in complex marine environments. To address this challenge, this paper proposes a real-time ship detection algorithm (C-YOLO) integrating global perception and multi-scale feature enhancement. First, a Transformer encoder is added before the detection head, which suppresses interference from sea clutter and cloud mist occlusion through long-range dependency modeling, improving the detection of small and occluded ships. Second, a Dual-Effect Focused Residual Fusion Module is designed to replace the backbone’s multi-scale pooling structure, combining the advantages of CBAM (background noise suppression) and SK-Net (dynamic scale adaptation) to simultaneously capture features of ships of different sizes. Finally, a CZIoU loss function is proposed, which integrates constraints on angle, center point, vertex, and area to address rotation, deformation, and multi-scale issues in ship detection. Experimental results on the SeaShips 7000 dataset show that the proposed C-YOLO achieves a Recall of 0.842, mAP@50 of 0.797, and mAP@50:95 of 0.552, outperforming mainstream algorithms such as YOLOv7 (Recall = 0.785, mAP@50 = 0.781), YOLOv9s (Recall = 0.819, mAP@50 = 0.755), and SSD (Recall = 0.802, mAP@50 = 0.833). With 76.75 M parameters and an inference speed of 119 FPS, the model maintains efficient real-time performance while ensuring detection accuracy. This method effectively reduces false detection and missed detection rates in complex scenarios such as port monitoring and maritime traffic control, providing a reliable technical solution for intelligent maritime surveillance and safe navigation—with significant practical value for improving maritime transportation efficiency and reducing safety risks. Full article

(This article belongs to the Special Issue Advances in Computer Vision and Digital Image Processing)

► Show Figures

Figure 1

23 pages, 14239 KB

Open AccessArticle

Dense Representative Points-Guided Rotated-Ship Detection in Remote Sensing Images

by Ning Zhao, Yongfei Xian, Tairan Zhou, Jiawei Shi, Zhiguo Jiang and Haopeng Zhang

Remote Sens. 2026, 18(3), 458; https://doi.org/10.3390/rs18030458 - 1 Feb 2026

Cited by 1 | Viewed by 625

Abstract

Withcontinuous advancements in remote sensing technology, object detection in remote sensing images has emerged as a critical research direction in maritime surveillance, port management, and national defense. Among these applications, ship detection is a key task. Due to the fact that ships in [...] Read more.

Withcontinuous advancements in remote sensing technology, object detection in remote sensing images has emerged as a critical research direction in maritime surveillance, port management, and national defense. Among these applications, ship detection is a key task. Due to the fact that ships in images typically exhibit arbitrary rotations, multi-scale distributions, and complex backgrounds, conventional detection methods based on horizontal or rotated bounding boxes often fail to adequately capture the fine-grained information of the targets, thereby compromising detection accuracy. This paper proposes the Dense Representative Points-Guided Rotated-Ship Detection (DenseRRSD) method. The proposed approach represents ship objects using dense representative points (RepPoints) to effectively capture local semantic information, thereby avoiding the background noise issues associated with traditional rectangular bounding box representations. To further enhance detection accuracy, an edge region sampling strategy is devised to uniformly sample RepPoints from critical ship parts, and a Weighted Residual Feature Pyramid Network (WRFPN) is introduced to efficiently fuse the multi-scale features through residual connections and learnable weights. In addition, a Weighted Chamfer Loss (WCL) combined with a staged localization loss strategy is employed to progressively refine localization from coarse to fine stages. Experimental results on both the HRSC2016 dataset and the newly constructed DOTA-SHIP dataset demonstrate that DenseRRSD achieves state-of-the-art detection accuracy, with mean Average Precision (mAP) scores of 91.2% and 83.2%, respectively, significantly outperforming existing methods. These results verify the effectiveness and robustness of the proposed approach in rotated-ship detection under diverse conditions. Full article

(This article belongs to the Special Issue Intelligent Processing and Analysis of Multi-Modal Remote Sensing Data)

► Show Figures

Figure 1

36 pages, 35595 KB

Open AccessArticle

Robust ISAR Autofocus for Maneuvering Ships Using Centerline-Driven Adaptive Partitioning and Resampling

by Wenao Ruan, Chang Liu and Dahu Wang

Remote Sens. 2026, 18(1), 105; https://doi.org/10.3390/rs18010105 - 27 Dec 2025

Viewed by 754

Abstract

Synthetic aperture radar (SAR) is a critical enabling technology for maritime surveillance. However, maneuvering ships often appear defocused in SAR images, posing significant challenges for subsequent ship detection and recognition. To address this problem, this study proposes an improved iteration phase gradient resampling [...] Read more.

Synthetic aperture radar (SAR) is a critical enabling technology for maritime surveillance. However, maneuvering ships often appear defocused in SAR images, posing significant challenges for subsequent ship detection and recognition. To address this problem, this study proposes an improved iteration phase gradient resampling autofocus (IIPGRA) method. First, we extract the defocused ships from SAR images, followed by azimuth decompression and translational motion compensation. Subsequently, a centerline-driven adaptive azimuth partitioning strategy is proposed: the geometric centerline of the vessel is extracted from coarsely focused images using an enhanced RANSAC algorithm, and the target is partitioned into upper and lower sub-blocks along the azimuth direction to maximize the separation of rotational centers between sub-blocks, establishing a foundation for the accurate estimation of spatially variant phase errors. Next, phase gradient autofocus (PGA) is employed to estimate the phase errors of each sub-block and compute their differential. Then, resampling the original echoes based on this differential phase error linearizes non-uniform rotational motion. Furthermore, this study introduces the Rotational Uniformity Coefficient (β) as the convergence criterion. This coefficient can stably and reliably quantify the linearity of the rotational phase, thereby ensuring robust termination of the iterative process. Simulation and real airborne SAR data validate the effectiveness of the proposed algorithm. Full article

(This article belongs to the Special Issue Advances in Imaging Radar Signal Processing, Target Feature Extraction and Recognition)

► Show Figures

Figure 1

23 pages, 11094 KB

Open AccessArticle

RSDB-Net: A Novel Rotation-Sensitive Dual-Branch Network with Enhanced Local Features for Remote Sensing Ship Detection

by Danshu Zhou, Yushan Xiong, Shuangming Yu, Peng Feng, Jian Liu, Nanjian Wu, Runjiang Dou and Liyuan Liu

Remote Sens. 2025, 17(23), 3925; https://doi.org/10.3390/rs17233925 - 4 Dec 2025

Cited by 1 | Viewed by 822

Abstract

Ship detection in remote sensing imagery is hindered by cluttered backgrounds, large variations in scale, and random orientations, limiting the performance of detectors designed for natural images. We propose RSDB-Net, a Rotation-Sensitive Dual-Branch Detection Network that introduces innovations in feature extraction, fusion, and [...] Read more.

Ship detection in remote sensing imagery is hindered by cluttered backgrounds, large variations in scale, and random orientations, limiting the performance of detectors designed for natural images. We propose RSDB-Net, a Rotation-Sensitive Dual-Branch Detection Network that introduces innovations in feature extraction, fusion, and detection. The Swin Transformer–CNN Backbone (STCBackbone) combines a Swin Transformer for global semantics with a CNN branch for local spatial detail, while the Feature Conversion and Coupling Module (FCCM) aligns and fuses heterogeneous features to handle multi-scale objects, and a Rotation-sensitive Cross-branch Fusion Head (RCFHead) enables bidirectional interaction between classification and localization, improving detection of randomly oriented targets. Additionally, an enhanced Feature Pyramid Network (eFPN) with learnable transposed convolutions restores semantic information while maintaining spatial alignment. Experiments on DOTA-v1.0 and HRSC2016 show that RSDB-Net performs better than the state of the art (SOTA), with mAP-ship values of 89.13% and 90.10% (+5.54% and +44.40% over the baseline, respectively), and reaches 72 FPS on an RTX 3090. RSDB-Net also demonstrates strong generalization and scalability, providing an effective solution for rotation-aware ship detection. Full article

► Show Figures

Figure 1

21 pages, 13473 KB

Open AccessArticle

Ship Ranging Method in Lake Areas Based on Binocular Vision

by Tengwen Zhang, Xin Liu, Mingzhi Shao, Yuhan Sun and Qingfa Zhang

Sensors 2025, 25(20), 6477; https://doi.org/10.3390/s25206477 - 20 Oct 2025

Cited by 1 | Viewed by 925

Abstract

The unique hollowed-out catamaran hulls and complex environmental conditions in lake areas hinder traditional ranging algorithms (combining target detection and stereo matching) from accurately obtaining depth information near the center of ships. This not only impairs the navigation of electric tourist boats but [...] Read more.

The unique hollowed-out catamaran hulls and complex environmental conditions in lake areas hinder traditional ranging algorithms (combining target detection and stereo matching) from accurately obtaining depth information near the center of ships. This not only impairs the navigation of electric tourist boats but also leads to high computing resource consumption. To address this issue, this study proposes a ranging method integrating improved ORB (Oriented FAST and Rotated BRIEF) with stereo vision technology. Combined with traditional optimization techniques, the proposed method calculates target distance and angle based on the triangulation principle, providing a rough alternative solution for the “gap period” of stereo matching-based ranging. The method proceeds as follows: first, it acquires ORB feature points with relatively uniform global distribution from preprocessed binocular images via a local feature weighting approach; second, it further refines feature points within the ROI (Region of Interest) using a quadtree structure; third, it enhances matching accuracy by integrating the FLANN (Fast Library for Approximate Nearest Neighbors) and PROSAC (Progressive Sample Consensus) algorithms; finally, it applies the screened matching point pairs to the triangulation method to obtain the position and distance of the target ship. Experimental results show that the proposed algorithm improves processing speed by 6.5% compared with the ORB-PROSAC algorithm. Under ideal conditions, the ranging errors at 10m and 20m are 2.25% and 5.56%, respectively. This method can partially compensate for the shortcomings of stereo matching in ranging under the specified lake area scenario. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

27 pages, 26151 KB

Open AccessArticle

EfficientRDet: An EfficientDet-Based Framework for Precise Ship Detection in Remote Sensing Imagery

by Weikang Zuo and Shenghui Fang

Remote Sens. 2025, 17(18), 3160; https://doi.org/10.3390/rs17183160 - 11 Sep 2025

Cited by 2 | Viewed by 1610

Abstract

Detecting arbitrarily oriented ships in remote sensing images remains challenging due to diverse orientations, complex backgrounds, and scale variations, leading to a struggle in balancing detector accuracy with efficiency. We propose EfficientRDet, an enhanced rotated-ship detection algorithm built upon the EfficientDet framework. EfficientRDet [...] Read more.

Detecting arbitrarily oriented ships in remote sensing images remains challenging due to diverse orientations, complex backgrounds, and scale variations, leading to a struggle in balancing detector accuracy with efficiency. We propose EfficientRDet, an enhanced rotated-ship detection algorithm built upon the EfficientDet framework. EfficientRDet adapts to rotated objects via an angle prediction branch and then significantly boosts accuracy with a novel pseudo-two-stage paradigm comprising a Rotated-Bounding-Box Refinement Branch (RRB) and a Class-Score Refinement Branch (CRB). Further precision is gained through an optimized Enhanced BiFPN (E-BiFPN), an Attention Head, and Distribution Focal (DF) angle representation. Extensive experiments on the HRSC2016 (optical) and RSDD-SAR datasets show that EfficientRDet consistently outperforms state-of-the-art methods, achieving 97.60% AP50 on HRSC2016 and 93.58% AP50 on RSDD-SAR. Comprehensive ablation studies confirm the effectiveness of all proposed mechanisms. EfficientRDet thus offers a promising and practical solution for precise, efficient ship detection across diverse remote sensing imagery. Full article

(This article belongs to the Special Issue Target Recognition and Detection Based on High Resolution Radar Images)

► Show Figures

Figure 1

22 pages, 4922 KB

Open AccessArticle

PDE-Guided Diverse Feature Learning for SAR Rotated Ship Detection

by Mingjin Zhang, Zhongkai Yang, Jie Guo and Yunsong Li

Remote Sens. 2025, 17(17), 2998; https://doi.org/10.3390/rs17172998 - 28 Aug 2025

Cited by 5 | Viewed by 1240

Abstract

Detecting ships in Synthetic Aperture Radar (SAR) images poses a complex challenge, with recent progress primarily attributed to the development of rotated detectors. However, existing methods often neglect the crucial influence of inherent characteristics in SAR images, such as common speckle noise. Moreover, [...] Read more.

Detecting ships in Synthetic Aperture Radar (SAR) images poses a complex challenge, with recent progress primarily attributed to the development of rotated detectors. However, existing methods often neglect the crucial influence of inherent characteristics in SAR images, such as common speckle noise. Moreover, a notable gap exists in modeling diverse features, particularly the fusion of rotational and high-frequency features. To address these challenges, this paper introduces a high-accuracy detector called PRDet, which builds on two key innovations: partial differential equation (PDE)-Guided Wavelet Transform (PGWT) and Diverse Feature Learning Block (DFLB). The PGWT enhances high-frequency features, such as edges and textures, while eliminating speckle noise by optimizing wavelet transform with PDE, leveraging the ability of PDE to model local variations and preserve structural details. The DFLB, with strong expressive capability, extracts and fuses multi-form ship features through three branches, enabling more accurate ship localization. Extensive experimental evaluations on the publicly available RSSDD and SRSDD-V1.0 benchmarks demonstrate PRDet’s superiority over other SAR rotated ship detectors. For example, on the RSSDD dataset, PRDet achieves an offshore precision of 0.938 and an mAP of 0.908, confirming its effectiveness for practical maritime surveillance applications. Full article

(This article belongs to the Special Issue Synthetic Aperture Radar (SAR) Image Object Detection and Information Extraction: Methods and Applications (Second Edition))

► Show Figures

Figure 1

19 pages, 7851 KB

Open AccessArticle

Ship Plate Detection Algorithm Based on Improved RT-DETR

by Lei Zhang and Liuyi Huang

J. Mar. Sci. Eng. 2025, 13(7), 1277; https://doi.org/10.3390/jmse13071277 - 30 Jun 2025

Cited by 3 | Viewed by 1792

Abstract

To address the challenges in ship plate detection under complex maritime scenarios—such as small target size, extreme aspect ratios, dense arrangements, and multi-angle rotations—this paper proposes a multi-module collaborative detection algorithm, RT-DETR-HPA, based on an enhanced RT-DETR framework. The proposed model integrates three [...] Read more.

To address the challenges in ship plate detection under complex maritime scenarios—such as small target size, extreme aspect ratios, dense arrangements, and multi-angle rotations—this paper proposes a multi-module collaborative detection algorithm, RT-DETR-HPA, based on an enhanced RT-DETR framework. The proposed model integrates three core components: an improved High-Frequency Enhanced Residual Block (HFERB) embedded in the backbone to strengthen multi-scale high-frequency feature fusion, with deformable convolution added to handle occlusion and deformation; a Pinwheel-shaped Convolution (PConv) module employing multi-directional convolution kernels to achieve rotation-adaptive local detail extraction and accurately capture plate edges and character features; and an Adaptive Sparse Self-Attention (ASSA) mechanism incorporated into the encoder to automatically focus on key regions while suppressing complex background interference, thereby enhancing feature discriminability. Comparative experiments conducted on a self-constructed dataset of 20,000 ship plate images show that, compared to the original RT-DETR, RT-DETR-HPA achieves a 3.36% improvement in mAP@50 (up to 97.12%), a 3.23% increase in recall (reaching 94.88%), and maintains real-time detection speed at 40.1 FPS. Compared with mainstream object detection models such as the YOLO series and Faster R-CNN, RT-DETR-HPA demonstrates significant advantages in high-precision localization, adaptability to complex scenarios, and real-time performance. It effectively reduces missed and false detections caused by low resolution, poor lighting, and dense occlusion, providing a robust and high-accuracy solution for intelligent ship supervision. Future work will focus on lightweight model design and dynamic resolution adaptation to enhance its applicability on mobile maritime surveillance platforms. Full article

(This article belongs to the Section Ocean Engineering)

► Show Figures

Figure 1

22 pages, 11308 KB

Open AccessArticle

TIAR-SAR: An Oriented SAR Ship Detector Combining a Task Interaction Head Architecture with Composite Angle Regression

by Yu Gu, Minding Fang and Dongliang Peng

Remote Sens. 2025, 17(12), 2049; https://doi.org/10.3390/rs17122049 - 13 Jun 2025

Cited by 6 | Viewed by 1444

Abstract

Oriented ship detection in Synthetic Aperture Radar (SAR) images has broad applications in maritime surveillance and other fields. While deep learning advancements have significantly improved ship detection performance, persistent challenges remain for existing methods. These include the inherent misalignment between regression and classification [...] Read more.

Oriented ship detection in Synthetic Aperture Radar (SAR) images has broad applications in maritime surveillance and other fields. While deep learning advancements have significantly improved ship detection performance, persistent challenges remain for existing methods. These include the inherent misalignment between regression and classification tasks and the boundary discontinuity problem in oriented object detection. These issues hinder efficient and accurate ship detection in complex scenarios. To address these challenges, we propose TIAR-SAR, a novel oriented SAR ship detector featuring a task interaction head and composite angle regression. First, we propose a task interaction detection head (Tihead) capable of predicting both oriented bounding boxes (OBBs) and horizontal bounding boxes (HBBs) simultaneously. Within the Tihead, a “decompose-then-interact” structure is designed. This structure not only mitigates feature misalignment but also promotes feature interaction between regression and classification tasks, thereby enhancing prediction consistency. Second, we propose a joint angle refinement mechanism (JARM). The JARM addresses the non-differentiability problem of the traditional rotated Intersection over Union (IoU) loss through the design of a composite angle regression loss (CARL) function, which strategically combines direct and indirect angle regression methods. A boundary angle correction mechanism (BACM) is then designed to enhance angle estimation accuracy. During inference, BACM dynamically replaces an object’s OBB prediction with its corresponding HBB if the OBB exhibits excessive angle deviation when the angle of the object is near the predefined boundary. Finally, the performance and applicability of the proposed methods are evaluated through extensive experiments on multiple public datasets, including SRSDD, HRSID, and DOTAv1. Experimental results derived from the use of the SRSDD dataset demonstrate that the mAP50 of the proposed method reaches 63.91%, an improvement of 4.17% compared with baseline methods. The detector achieves 17.42 FPS on 1024 × 1024 images using an RTX 2080 Ti GPU, with a model size of only 21.92 MB. Comparative experiments with other state-of-the-art methods on the HRSID dataset demonstrate the proposed method’s superior detection performance in complex nearshore scenarios. Furthermore, when further tested on the DOTAv1 dataset, the mAP50 can reach 79.1%. Full article

(This article belongs to the Special Issue Synthetic Aperture Radar (SAR) Image Object Detection and Information Extraction: Methods and Applications (Second Edition))

► Show Figures

Figure 1

30 pages, 8985 KB

Open AccessArticle

Dynamic Cascade Detector for Storage Tanks and Ships in Optical Remote Sensing Images

by Tong Wang, Bingxin Liu and Peng Chen

Remote Sens. 2025, 17(11), 1882; https://doi.org/10.3390/rs17111882 - 28 May 2025

Viewed by 1210

Abstract

Regional Convolutional Neural Network (RCNN)−based detectors have played a crucial role in object detection in remote sensing images due to their exceptional detection capabilities. Some studies have shown that different stages should have different Intersections of Union (IoU) thresholds to distinguish positive and [...] Read more.

Regional Convolutional Neural Network (RCNN)−based detectors have played a crucial role in object detection in remote sensing images due to their exceptional detection capabilities. Some studies have shown that different stages should have different Intersections of Union (IoU) thresholds to distinguish positive and negative samples because each stage has different IoU distributions. However, these studies have overlooked the fact that the IoU distribution at each stage changes continuously during the training process. Therefore, the IoU threshold at each stage should also be adjusted continuously to adapt to the changes in the IoU distribution. We realized that the IoU distribution at each stage is very similar to a Gaussian skewed distribution. In this paper, we introduce a novel dynamic IoU threshold method based on the Cascade RCNN architecture, called the Dynamic Cascade detector, with reference to the Gaussian skewed distribution. We tested the effectiveness of this method by detecting horizontal storage tanks and rotated ships in optical remote sensing images. Our experiments demonstrated that this technique can significantly improve detection results, as evaluated based on the COCO metric. In addition, the threshold range of the last stage impacts other stages, so the threshold range of one stage may change significantly when the number of stages changes. Furthermore, the threshold may not always increase during the training process and may decrease when the IoU distribution resembles a negatively skewed distribution. Full article

► Show Figures

Graphical abstract

Search Results (93)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (93)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI