MDPI - Publisher of Open Access Journals

11 pages, 525 KB

Open AccessArticle

Assessment of Stage Two Hypertension Treatment Plans Written by Generative AI

by Tai Metzger, Zaheen Hossain, Kody Park, Stephen Vu, Simon Dixon and Tracey A. H. Taylor

J. Clin. Med. 2026, 15(8), 3103; https://doi.org/10.3390/jcm15083103 (registering DOI) - 18 Apr 2026

Background/Objectives: As use of large language models (LLMs) in clinical practice, in medical education, and by patients increases, it is essential to ensure that information provided is accurate and safe. Our objective was to compare stage two hypertension treatment plans generated by [...] Read more.

Background/Objectives: As use of large language models (LLMs) in clinical practice, in medical education, and by patients increases, it is essential to ensure that information provided is accurate and safe. Our objective was to compare stage two hypertension treatment plans generated by popular LLMs. Methods: ChatGPT (GPT-4o), Claude (Claude 4 Sonnet), ClinicalKey AI, Microsoft Copilot (Wave 2), DeepSeek-V3-0324, Dyna AI, Google Gemini (2.5 Flash), Grok (version 3), Meta AI assistant (Llama 4 Maverick), OpenEvidence (version 2.0), Perplexity (Sonar backend model), and Pi (Inflection-2.5) were prompted to generate a treatment plan for stage two hypertension. Six blinded reviewers scored each response in three domains: adherence to clinical guidelines, detail/clarity, and reliability/safety. Results: Perplexity received the highest composite score (8.17 out of 9), followed by OpenEvidence (7.92 out of 9). Dyna AI had the lowest overall score (3.75 out of 9). Perplexity (3.00 out of 3), Grok (2.83 out of 3), and OpenEvidence (2.75 out of 3) had the highest scores for detail/clarity, while Dyna AI had the lowest for both detail/clarity (1.00 out of 3) and reliability/safety (1.00 out of 3). ChatGPT had the highest score for adherence to guidelines (2.75 out of 3) while Pi had the lowest (1.58 out of 3). Kruskal–Wallis test showed p < 0.05 across sub-score domains and composite scores. Conclusions: LLMs tended to adhere to clinical guidelines and provide detailed responses but often did not provide sources or instruct users to see a healthcare professional. There was notable variability in quality, and medicine-specific LLMs were not superior to popular LLMs. Full article

(This article belongs to the Section Machine Learning and Artificial Intelligence in Clinical Medicine)

► Show Figures

Figure 1

16 pages, 6822 KB

Open AccessArticle

Fish Resource Assessment in the Huoyanshan Waters of Poyang Lake Using DIDSON and Deep Learning Models

by Wei Shen, Zhaowei Yin, Bao Zhang, Lekang Li, Enze Qian and Xiaoling Gong

Fishes 2026, 11(4), 236; https://doi.org/10.3390/fishes11040236 - 16 Apr 2026

Viewed by 5

Abstract

To scientifically assess the fish resource status and spatial distribution in the Huoyanshan waters of Poyang Lake for the conservation of endangered species like Coilia nasus, an acoustic survey was conducted using a dual-frequency identification sonar (DIDSON) in July 2024. Fish targets [...] Read more.

To scientifically assess the fish resource status and spatial distribution in the Huoyanshan waters of Poyang Lake for the conservation of endangered species like Coilia nasus, an acoustic survey was conducted using a dual-frequency identification sonar (DIDSON) in July 2024. Fish targets were identified and extracted by combining an Echoview-based identification and deep learning models. Catch statistics were integrated to estimate fish density, abundance, biomass, and spatial distribution patterns. A total of 1891 fish targets were detected. The Echoview model achieved an average accuracy of 90.83%, while the YOLO model attained average precision and recall of 0.941 and 0.869, and the DeepSORT model attained precision and recall of 0.887 and 0.911. The total fish abundance was estimated at approximately 223,775 individuals, with a total biomass of about 199,742 kg. Spatially, fish were predominantly distributed in nearshore areas horizontally and concentrated at depths of 5–15 m vertically. The integrated approach combining DIDSON, Echoview and deep learning models proved effective for high-accuracy fish target identification and resource estimation, with deep learning models offering greater objectivity and processing efficiency. This study provides a technical reference for intelligent fish target identification in sonar images and provides baseline data and a technical reference for subsequent fish resource monitoring and management in the Huoyanshan waters of Poyang Lake. Full article

(This article belongs to the Special Issue Technology for Fish and Fishery Monitoring—2nd Edition)

► Show Figures

Figure 1

23 pages, 1940 KB

Open AccessArticle

Prediction of Sound Speed Profiles Under Disturbance of Strong Internal Solitary Waves Using Bidirectional Long Short-Term Memory Network

by Hong Yin, Ke Qu, Han Wang and Guangming Li

J. Mar. Sci. Eng. 2026, 14(8), 735; https://doi.org/10.3390/jmse14080735 - 15 Apr 2026

Viewed by 169

Abstract

Time-series machine learning models represented by long short-term memory (LSTM) networks provide an effective way to obtain high-precision sound speed profiles (SSPs) quickly and at low cost, which can meet the practical application requirements of underwater sonar systems. However, in sea areas with [...] Read more.

Time-series machine learning models represented by long short-term memory (LSTM) networks provide an effective way to obtain high-precision sound speed profiles (SSPs) quickly and at low cost, which can meet the practical application requirements of underwater sonar systems. However, in sea areas with frequent strong internal solitary waves, the large-amplitude sound speed anomalies caused by them will seriously interfere with model learning in the form of strong outlier features, resulting in a sharp drop in SSP prediction accuracy and significant degradation of the generalization stability and robustness of the model. To address this problem, this paper proposes a time-series SSP prediction method based on a bidirectional long short-term memory (Bi-LSTM) network. First, Empirical Orthogonal Function (EOF) decomposition is used to realize the low-dimensional feature representation of SSPs, and then the bidirectional time-series feature capture capability of Bi-LSTM is used to predict the SSP sequence with large disturbances caused by strong internal solitary waves. Multiple groups of comparative experiments based on the measured temperature chain data in the continental slope area of the South China Sea show that the Bi-LSTM model has a significant improvement in prediction accuracy and robustness compared with the classical LSTM model. Among them, the Bi-LSTM model with EOF decomposition achieves a correlation coefficient of 0.995 and an average Root Mean Square Error (RMSE) as low as 0.387 m/s. Under the condition of internal solitary wave disturbance, the classical LSTM is difficult to effectively capture the large abrupt change in sound speed, while the proposed Bi-LSTM model can still achieve accurate prediction of the SSP in the disturbance section, and has both the feature recognition and evolution prediction capabilities for the strongly nonlinear internal solitary wave process. This method provides effective technical support for the rapid and large-scale reconstruction of the sound speed field under the disturbance of strong internal solitary waves. Full article

(This article belongs to the Section Ocean Engineering)

24 pages, 1954 KB

Open AccessArticle

Feasibility Analysis of Underwater Vehicle Detection Based on Homogeneous Ellipsoidal Hull Model Using Gravity Gradient

by Hexing Zheng, Jinguo Liu and Haitao Gu

J. Mar. Sci. Eng. 2026, 14(8), 734; https://doi.org/10.3390/jmse14080734 - 15 Apr 2026

Viewed by 104

Abstract

In recent years, as underwater vehicles continue to improve their noise reduction capabilities, sonar-based detection has faced significant challenges, and non-acoustic detection has become a research focus. Gravity gradient detection, owing to its excellent concealment and anti-interference capability, is regarded as an important [...] Read more.

In recent years, as underwater vehicles continue to improve their noise reduction capabilities, sonar-based detection has faced significant challenges, and non-acoustic detection has become a research focus. Gravity gradient detection, owing to its excellent concealment and anti-interference capability, is regarded as an important non-acoustic means for underwater target detection. Based on the structural characteristics of an underwater vehicle, this paper establishes a homogeneous ellipsoidal hull (HEH) model composed of two similar rotating ellipsoids. This model assumes that the mass of an underwater vehicle is completely uniformly distributed over the outer hull. Analytical formulas for the gravity anomaly and gravity gradient anomaly generated by this model are derived, and their spatial distribution characteristics are analyzed. Furthermore, based on the HEH model, the feasibility underwater vehicle detection using the vertical gravity gradient component is analyzed. Results show that when the accuracy of the gravity gradiometer reaches 10⁻⁴ E, the detection distance for a large underwater vehicle with a displacement of 18,750 t can reach 570 m. Full article

(This article belongs to the Special Issue Advanced Modeling and Intelligent Control of Marine Vehicles)

30 pages, 1499 KB

Open AccessArticle

Environment-Aware Optimal Placement and Dynamic Reconfiguration of Underwater Robotic Sonar Networks Using Deep Reinforcement Learning

by Qiming Sang, Yu Tian, Jin Zhang, Yuyang Xiao, Zhiduo Tan, Jiancheng Yu and Fumin Zhang

J. Mar. Sci. Eng. 2026, 14(8), 733; https://doi.org/10.3390/jmse14080733 - 15 Apr 2026

Viewed by 96

Abstract

Underwater dynamic target detection, classification, localization, and tracking (DCLT) is central to maritime surveillance and monitoring and increasingly relies on distributed AUV-based robotic sonar networks operating in passive listening and, when required, cooperative multistatic modes. Achieving a robust performance in realistic oceans remains [...] Read more.

Underwater dynamic target detection, classification, localization, and tracking (DCLT) is central to maritime surveillance and monitoring and increasingly relies on distributed AUV-based robotic sonar networks operating in passive listening and, when required, cooperative multistatic modes. Achieving a robust performance in realistic oceans remains challenging, because sensor placement must adapt to time-varying acoustic conditions and target priors while preserving acoustic communication connectivity, and because frequent reconfiguration under dynamic currents makes classical large-scale planning computationally expensive. This paper presents an integrated deep reinforcement learning (DRL)-based framework for passive-stage sonar placement and dynamic reconfiguration in distributed AUV networks. First, we cast placement as a constructive finite-horizon Markov decision process (MDP) and train a Proximal Policy Optimization (PPO) agent to sequentially build a collision-free layout on a discretized surveillance grid. The terminal reward is formulated to jointly optimize the environment-aware detection performance, computed from BELLHOP-based transmission loss models, and global network connectivity, quantified using algebraic connectivity. Second, to enable time-critical reconfiguration, we estimate flow-aware motion costs for all AUV–destination pairs using a PPO with a Long Short-Term Memory (LSTM) trajectory policy trained for partial observability. The learned policy can be deployed onboard, allowing each AUV to refine its path online using locally sensed currents, improving robustness to ocean-model uncertainty. The resulting cost matrix is solved via an efficient zero-element assignment method to obtain the optimal one-to-one reassignment. In the reported simulation studies, the proposed Sequential PPO placement method achieves a final reward 16–21% higher than Particle Swarm Optimization (PSO) and 2–3.7% higher than the Genetic Algorithm (GA), while the proposed PPO + LSTM planner reduces average travel time by 30.44% compared with A*. The proposed closed-loop architecture supports frequent re-optimization, scalable fleet operation, and a seamless transition to communication-supported cooperative multistatic tracking after detection, enabling efficient, adaptive DCLT in dynamic marine environments. Full article

(This article belongs to the Section Ocean Engineering)

17 pages, 3201 KB

Open AccessArticle

Underwater Acoustic Target Detection Using a Miniaturized MEMS Hydrophone Array

by Xiao Chen and Ying Zhang

Micromachines 2026, 17(4), 468; https://doi.org/10.3390/mi17040468 - 12 Apr 2026

Viewed by 155

Abstract

Sonar is a fundamental tool for underwater target detection. However, conventional detection systems often suffer from poor sensor consistency and high fabrication costs. More critically, for low-frequency operation, the required array aperture becomes prohibitively large, limiting their deployment on small, mobile underwater platforms. [...] Read more.

Sonar is a fundamental tool for underwater target detection. However, conventional detection systems often suffer from poor sensor consistency and high fabrication costs. More critically, for low-frequency operation, the required array aperture becomes prohibitively large, limiting their deployment on small, mobile underwater platforms. To address the demand for compact, high-performance sensing solutions, this paper presents a miniaturized Micro-electromechanical Systems (MEMS) hydrophone array designed for underwater target detection. The array consists of six elements with a spacing of 0.25 m. Each element is approximately 22 mm in diameter and encapsulated in polyurethane via a casting and curing process. The core sensing element, a MEMS acoustic pressure hydrophone, exhibits a sensitivity of −177.2 ± 1.5 dB (re: 1 V/µPa) across the 20 Hz to 4 kHz frequency range and a noise resolution of approximately 59.5 dB (re: 1 µPa/√Hz) at 1 kHz. A key challenge in array-based detection is the phase mismatch among acquisition channels, which degrades algorithm performance. To mitigate this, we propose a phase self-correction method based on interleaved ADC acquisition control, enabling synchronous multi-channel sampling and effectively eliminating system-level phase errors. Furthermore, to overcome the inherent aperture limitations of conventional beamforming (CBF) applied to a miniaturized array, a differential beamforming (DBF) algorithm is adopted. This approach is less frequency-dependent and can approximate a frequency-invariant beam pattern, making it well-suited for miniaturized arrays. Simulation results confirm the theoretical validity of the DBF algorithm for the proposed MEMS hydrophone array. Sea trial data further demonstrate that this method achieves higher target detection accuracy compared to CBF techniques. Full article

(This article belongs to the Special Issue Acoustic Transducers and Their Applications, 3rd Edition)

22 pages, 4045 KB

Open AccessArticle

Optimization-Based Mismatched-Channel Filtering Using ADMM for Continuous Active Sonar

by Zitao Su, Juan Yang and Lu Yan

J. Mar. Sci. Eng. 2026, 14(8), 711; https://doi.org/10.3390/jmse14080711 - 11 Apr 2026

Viewed by 193

Abstract

Generalized Sinusoidal Frequency Modulation (GSFM) signals can enhance Continuous Active Sonar (CAS) performance by providing high sub-signal processing gain while achieving high target update rates. However, conventional processing methods for GSFM often exhibit high sidelobe levels arising from the waveform’s autocorrelation which degrade [...] Read more.

Generalized Sinusoidal Frequency Modulation (GSFM) signals can enhance Continuous Active Sonar (CAS) performance by providing high sub-signal processing gain while achieving high target update rates. However, conventional processing methods for GSFM often exhibit high sidelobe levels arising from the waveform’s autocorrelation which degrade detection performance, especially in severe multipath environments. To address this issue, a Mismatched-Channel Filtering (MMCF) method for GSFM in CAS is proposed to focus multipath energy while suppressing sidelobe levels. Adopting the sub-pulse processing scheme, we incorporate the orthogonality of GSFM sub-signals (optimized via a genetic algorithm) and sparse channel estimates into the MMCF design for each sub-signal. The design is formulated as a Quadratically Constrained Quadratic Program (QCQP) and solved iteratively using the Alternating Direction Method of Multipliers (ADMM) for long-duration signal processing in CAS. Numerical simulations demonstrate that, compared with the matched filtering and matched channel filtering methods, the proposed MMCF method effectively suppresses sidelobe levels by approximately 20 dB and produces a Dirack-like main-lobe peak, while efficiently focusing multipath energy. The method’s effectiveness is further validated using experimental data from a lake trial. Therefore, this algorithm has distinct advantages for signal processing in multipath environments. Full article

(This article belongs to the Special Issue Underwater Acoustics: Advances in Modelling, Measurement, and Technological Applications)

13 pages, 459 KB

Open AccessArticle

An Adaptive Binary Particle Swarm Optimization with Hybrid Learning for Feature Selection

by Lan Ma, Pei Hu and Jeng-Shyang Pan

Electronics 2026, 15(7), 1523; https://doi.org/10.3390/electronics15071523 - 5 Apr 2026

Viewed by 298

Abstract

Particle swarm optimization (PSO) improves classification performance and reduces computational complexity in feature selection. However, it frequently experiences from premature convergence and insufficient exploration. To address these constraints, this paper suggests an adaptive binary PSO (ABPSO) algorithm specifically designed for feature selection. First, [...] Read more.

Particle swarm optimization (PSO) improves classification performance and reduces computational complexity in feature selection. However, it frequently experiences from premature convergence and insufficient exploration. To address these constraints, this paper suggests an adaptive binary PSO (ABPSO) algorithm specifically designed for feature selection. First, an adaptive transfer function and two adaptive learning coefficients are introduced to achieve a better balance between exploration and exploitation during the search process. Second, a hybrid learning mechanism that integrates personal best, global best, and elite solutions is utilized to enhance population diversity. Finally, a simulated annealing (SA)–based local search strategy is employed to further refine candidate solutions and improve convergence behavior. Experimental results demonstrate that ABPSO outperforms binary PSO (BPSO), harris hawks optimization (HHO), whale optimization algorithm (WOA), and ant colony optimization (ACO) in classification accuracy. In particular, ABPSO achieves the lowest classification error rates on the Dermatology (0.0106), Ionosphere (0.0705), Lung (0.1521), Sonar (0.0996), Spambase (0.0758), Statlog (0.1446), and Wine (0.0280) datasets. Full article

(This article belongs to the Special Issue Machine Learning Meets Large-Scale Model: Current Trends and Future Challenges)

► Show Figures

Figure 1

28 pages, 5258 KB

Open AccessArticle

Dual-View Entropy-Driven AIS–Sonar Fusion for Surface and Underwater Target Discrimination

by Xiaoshuang Zhang, Jiayi Che, Xiaodan Xiong, Yucheng Zhang, Xinbo He, Mengsha Deng and Dezhi Wang

J. Mar. Sci. Eng. 2026, 14(7), 675; https://doi.org/10.3390/jmse14070675 - 4 Apr 2026

Viewed by 312

Abstract

Distinguishing surfaces from underwater targets in complex marine environments is challenging when relying solely on physical sonar features. To address the high uncertainty inherent in single-modal features and the conflicts arising from heterogeneous data, we propose a Dual-View Entropy-Driven Negation Dempster–Shafer (DVE-NDS) fusion [...] Read more.

Distinguishing surfaces from underwater targets in complex marine environments is challenging when relying solely on physical sonar features. To address the high uncertainty inherent in single-modal features and the conflicts arising from heterogeneous data, we propose a Dual-View Entropy-Driven Negation Dempster–Shafer (DVE-NDS) fusion method that integrates AIS kinematic priors with passive sonar signals. First, a heterogeneous recognition framework is constructed. LOFAR and DEMON features are extracted via convolutional neural networks (CNNs), while a Negation Basic Probability Assignment (Negation BPA) strategy is introduced to transform AIS spatiotemporal mismatches into effective "negation support" for non-cooperative underwater targets. Instead of relying on a single conflict coefficient, the proposed method jointly considers evidence self-information and inter-source consistency. Evidence quality is quantified using improved Deng entropy and negation belief entropy, while mutual trust is evaluated via the Jousselme distance. Heterogeneous evidence is weighted and corrected by generated coupling weights, effectively suppressing low-quality evidence and sharpening decision boundaries. Simulation results confirm that DVE-NDS improves macro-F1 over classical fusion, indicating the framework’s potential for handling conflicting evidence, though the current validation remains simulation-based and should be regarded as a methodological proof-of-concept. Full article

(This article belongs to the Special Issue Emerging Computational Methods in Intelligent Marine Vehicles)

► Show Figures

Figure 1

18 pages, 10428 KB

Open AccessArticle

T2C-DETR: A Transformer + Convolution Dual-Channel Backbone Network for Underwater Sonar Image Object Detection

by Xiaobing Wu, Panlong Tan, Xiaoyu Zhang and Hao Sun

Algorithms 2026, 19(4), 281; https://doi.org/10.3390/a19040281 - 3 Apr 2026

Viewed by 304

Abstract

Underwater sonar object detection is challenging because targets are often small, boundaries are blurred, background clutter is strong, and labeled sonar data are limited. To address these issues, we propose T2C-DETR, a detector built on RT-DETR with three task-oriented improvements: (i) a Transformer–Convolution [...] Read more.

Underwater sonar object detection is challenging because targets are often small, boundaries are blurred, background clutter is strong, and labeled sonar data are limited. To address these issues, we propose T2C-DETR, a detector built on RT-DETR with three task-oriented improvements: (i) a Transformer–Convolution dual-channel backbone (TCDCNet) for complementary global-context and local-detail modeling, (ii) a Noise Filtering Module (NFM) inserted before neck fusion to suppress noise-dominated activations, and (iii) a stage-wise transfer-learning strategy tailored to small sonar datasets. We evaluate the method under three pre-training sources (COCO 2017, DOTA, and an infrared dataset) and then fine-tune on a self-built sonar dataset. Experimental results show that T2C-DETR achieves AP50 of 97.8%, 98.2%, and 98.5% at 72–73 FPS, consistently outperforming the RT-DETR baseline, YOLOv5-Imp, and MLFFNet in the accuracy–speed trade-off. These results indicate that combining global–local representation learning with targeted noise suppression is effective for practical real-time sonar detection. Full article

► Show Figures

Figure 1

36 pages, 1163 KB

Open AccessArticle

A Multicriteria Framework for Evaluation and Selection of Conversational AI Assistants in Mental Health

by Constanta Zoie Radulescu, Marius Radulescu and Alexandra Ioana Mihailescu

Future Internet 2026, 18(4), 191; https://doi.org/10.3390/fi18040191 - 1 Apr 2026

Viewed by 493

Abstract

The rapid proliferation of Conversational Artificial Intelligence Assistants (CAIs) has transformed access to mental health information through freely accessible web interfaces, mobile applications, and public APIs (Application Programming Interfaces), yet systematic methodologies for their evaluation remain limited. This paper introduces SELCAI-MH, a multicriteria [...] Read more.

The rapid proliferation of Conversational Artificial Intelligence Assistants (CAIs) has transformed access to mental health information through freely accessible web interfaces, mobile applications, and public APIs (Application Programming Interfaces), yet systematic methodologies for their evaluation remain limited. This paper introduces SELCAI-MH, a multicriteria framework for CAI evaluation and selection. This framework integrates four complementary multicriteria methods: Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS), VIseKriterijumska Optimizacija I Kompromisno Resenje (VIKOR), Complex Proportional Assessment Method (COPRAS), and Combinative Distance-based Assessment (CODAS), capturing distance-based, compromise-based, proportional, and negative-ideal logics, and proposes SOLAG, an aggregation method that produces a consensus ranking across methods. SELCAI-MH employs a dual evaluation mechanism combining psychiatric expert assessment with AI-based scoring, expert-derived criterion weights, and domain-relevant conversational datasets. The framework is applied to nine internet-accessible CAIs: proprietary platforms (ChatGPT 5.2, Claude Sonnet 4.5, Gemini 1.5 Flash, Perplexity Sonar, Bing AI/Copilot) and open-source Llama variants deployed via cloud inference endpoints. Using a set of anxiety-related questions and CAI responses, evaluated across seven criteria, Claude Sonnet 4.5 emerged optimal, followed by ChatGPT 5.2 and Gemini 1.5 Flash. SOLAG produced highly consistent rankings across the four multicriteria decision-making (MCDM) methods (Spearman ρ ≥ 0.98). Overall, SELCAI-MH provides a structured and reproducible decision-support framework for selecting accessible CAIs in sensitive mental health contexts. Full article

(This article belongs to the Special Issue Artificial Intelligence-Enabled Smart Healthcare)

► Show Figures

Graphical abstract

25 pages, 16006 KB

Open AccessArticle

Underwater Target Recognition with Fusion of Multi-Domain Temporal Features

by Xiaochun Liu, Chenyu Wang, Yunchuan Yang, Xiangfeng Yang, Youfeng Hu and Jianguo Liu

Acoustics 2026, 8(2), 22; https://doi.org/10.3390/acoustics8020022 - 25 Mar 2026

Viewed by 368

Abstract

The dynamic nature of acoustic environments—particularly the fluctuation of underwater channels and time-varying target observation angles—poses significant challenges for active sonar target recognition, a problem further aggravated by the scarcity of labeled training samples. To address these limitations, this paper proposes a novel [...] Read more.

The dynamic nature of acoustic environments—particularly the fluctuation of underwater channels and time-varying target observation angles—poses significant challenges for active sonar target recognition, a problem further aggravated by the scarcity of labeled training samples. To address these limitations, this paper proposes a novel recognition method enabling deep fusion of multi-domain temporal features extracted from target echoes. First, complementary features are extracted across spatial, time–frequency, and Doppler domains to achieve a comprehensive and discriminative representation of targets. Subsequently, we introduce a feature vector-level fusion mechanism designed specifically for few-shot learning, integrating a meta-knowledge-driven multi-stream feature extractor with an internal memory module within the feature tensor framework. This architecture constitutes the Multi-domain Temporal Feature Fusion Recognition Network (MTFF-RNet). The proposed approach is evaluated on a hybrid dataset combining simulated and experimental data, achieving a high recognition accuracy of 96.2% for both targets and interferents. Experimental results demonstrate that MTFF-RNet significantly enhances robustness and adaptability under varying underwater acoustic conditions and dynamic viewing geometries. Full article

► Show Figures

Figure 1

16 pages, 4555 KB

Open AccessArticle

3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm

by Fan Zhang, Shaobo Li, Haolong Gao and Yunlong Wu

J. Mar. Sci. Eng. 2026, 14(7), 597; https://doi.org/10.3390/jmse14070597 - 24 Mar 2026

Viewed by 259

Abstract

Three-dimensional (3D) sonar overcomes the limitations of traditional measurement methods regarding imaging coverage and accuracy, making it indispensable for underwater structure monitoring. However, complex underwater environments often introduce significant noise into 3D sonar data, degrading monitoring performance. To address this, we propose a [...] Read more.

Three-dimensional (3D) sonar overcomes the limitations of traditional measurement methods regarding imaging coverage and accuracy, making it indispensable for underwater structure monitoring. However, complex underwater environments often introduce significant noise into 3D sonar data, degrading monitoring performance. To address this, we propose a geometry-based filtering method. First, Total Least Squares (TLS) is employed to construct local spatial features, which guides a region-growing segmentation based on normal vector attributes. Subsequently, the resulting clusters are refined using these local geometric characteristics. Finally, statistical filtering is applied to eliminate residual outliers from a local to a global scale. Experimental results demonstrate that the proposed method achieves F1 scores of 78.65% and 84.49% in outlier removal, effectively suppressing noise while preserving structural integrity. Full article

(This article belongs to the Special Issue Advanced Studies in Marine Structures)

► Show Figures

Figure 1

25 pages, 13561 KB

Open AccessArticle

An Underwater Target Recognition Method Based on Feature Fusion and Balanced Ensemble Transfer Learning

by Haoqian Zhang, Hong Liang, Linfeng Zhu and Wenbo Gou

J. Mar. Sci. Eng. 2026, 14(6), 579; https://doi.org/10.3390/jmse14060579 - 20 Mar 2026

Viewed by 258

Abstract

In underwater target recognition scenarios, challenges arise as a result of the limited representational capability of acoustic images with single time-frequency features and poor recognition performance due to class imbalances in sample numbers. To tackle these issues, this paper proposes an underwater target [...] Read more.

In underwater target recognition scenarios, challenges arise as a result of the limited representational capability of acoustic images with single time-frequency features and poor recognition performance due to class imbalances in sample numbers. To tackle these issues, this paper proposes an underwater target recognition method based on feature fusion and balanced ensemble transfer learning. A LiT-INN dual-branch auto-encoder network architecture is employed for time-frequency image feature fusion to solve the weak feature representation capability of single time–frequency features. The Restormer network serves as a shared feature encoder to extract fundamental features, enabling feature fusion of underwater target echo time–frequency image data and generating a fusion image dataset with richer feature information. In order to address class imbalance in sample sizes, a balanced ensemble transfer learning method is constructed using a two-stage decoupled fine-tuning learning method. The first stage employs a uniform sampler strategy to fine-tune the feature extraction module of a pre-trained transfer learning model. The second stage uses multiple balanced sampling optimization methods to fine-tune the classifier. Then, a weight averaging ensemble learning method performs decision-level fusion of multiple weak classifiers. Field test data from three target classes validated the performance of the algorithm, demonstrating a 3% improvement in average recognition accuracy compared to deep transfer learning methods under different imbalance ratios. This method effectively enhances recognition performance for classes with limited samples while significantly boosting overall recognition accuracy, offering a novel solution for underwater target recognition. Full article

(This article belongs to the Section Ocean Engineering)

► Show Figures

Figure 1

27 pages, 28242 KB

Open AccessArticle

Physics-Informed Side-Scan Sonar Perception: Tackling Weak Targets and Sparse Debris via Geometric and Frequency Decoupling

by Bojian Yu, Rongsheng Lin, Hanxiang Zhou, Jianxiong Zhang and Xinwei Zhang

Sensors 2026, 26(6), 1938; https://doi.org/10.3390/s26061938 - 19 Mar 2026

Viewed by 318

Abstract

Side-scan sonar (SSS) serves as the primary perceptual instrument for Autonomous Underwater Vehicles (AUVs) in large-scale marine search and rescue (SAR) operations. However, the detection of critical targets is frequently hindered by severe hydro-acoustic noise, the spatial discontinuity of wreckage, and the weak [...] Read more.

Side-scan sonar (SSS) serves as the primary perceptual instrument for Autonomous Underwater Vehicles (AUVs) in large-scale marine search and rescue (SAR) operations. However, the detection of critical targets is frequently hindered by severe hydro-acoustic noise, the spatial discontinuity of wreckage, and the weak visual signatures of small targets. To surmount these challenges, this paper presents WPG-DetNet. First, we introduce a Wavelet-Embedded Residual Backbone (WERB) to reconstruct the conventional downsampling paradigm. By substituting standard pooling with the Discrete Wavelet Transform (DWT), this architecture explicitly disentangles high-frequency noise from structural information in the frequency domain, thereby achieving the adaptive preservation of edge fidelity for large human-made targets while filtering out speckle interference. Then, addressing the distinct challenge of discontinuous aircraft wreckage, the framework further incorporates a Debris Graph Reasoning Module (D-GRM). This module models scattered fragments as nodes in a topological graph to capture long-range semantic dependencies, transforming isolated instance recognition into context-aware scene understanding. Finally, to bridge the gap between AI and underwater physics, we design a Shadow-Aided Decoupling Head (SADH) equipped with a physics-informed geometric loss. By enforcing mathematical consistency between target height and acoustic shadow length, this mechanism establishes a rigorous discriminative criterion capable of distinguishing weak-echo human bodies from seabed rocks based on shadow geometry. Experiments on the SCTD dataset demonstrate that WPG-DetNet achieves a mean Average Precision (

m A P_{50}

) of 97.5% and a Recall of 96.9%. Quantitative analysis reveals that our framework outperforms the classic Faster R-CNN by a margin of 12.8% in

m A P_{50}

and surpasses the Transformer-based RT-DETR-R18 by 5.6% in high-precision localization metrics (

m A P_{50 : 95}

). Simultaneously, WPG-DetNet maintains superior efficiency with an inference speed of 62.5 FPS and a lightweight parameter count of 16.8 M, striking an optimal balance between robust perception and the real-time constraints of AUV operations. Full article

(This article belongs to the Section Physical Sensors)

► Show Figures

Figure 1

Search Results (970)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (970)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI