MDPI - Publisher of Open Access Journals

23 pages, 1302 KiB

Open Access Article

Deep Learning-Enhanced Ocean Acoustic Tomography: A Latent Feature Fusion Framework for Hydrographic Inversion with Source Characteristic Embedding

by Jiawen Zhou, Zikang Chen, Yongxin Zhu and Xiaoying Zheng

Information 2025, 16(8), 665; https://doi.org/10.3390/info16080665 - 4 Aug 2025

Viewed by 110

Abstract

Ocean Acoustic Tomography (OAT) is an important marine remote sensing technique for inverting large-scale ocean environmental parameters, but traditional methods face challenges in computational complexity and environmental interference. This paper proposes a causal analysis-driven AI for Science method for high-precision, rapid inversion of oceanic hydrological parameters in complex underwater environments. Based on the open-source VTUAD (Vessel Type Underwater Acoustic Data) dataset, the method first uses a fine-tuned Paraformer (a fast and accurate parallel transformer) model to classify sound source targets precisely. Then, using structural causal models (SCMs) and the potential outcomes framework, it constructs causal embedding vectors with physical significance. Finally, a cross-modal Transformer network fuses acoustic features, sound source priors, and environmental variables to invert temperature and salinity in the Georgia Strait of Canada. Experimental results show that the method achieves accuracies of 97.77% and 95.52% for the temperature and salinity inversion tasks, respectively, significantly outperforming traditional methods. Additionally, GPU acceleration improves inference speed by more than sixfold, with the aim of enabling real-time OAT on edge computing platforms as smart hardware, which supports the method’s practicality. By incorporating causal inference and cross-modal data fusion, this study not only enhances inversion accuracy and model interpretability but also provides new insights for real-time applications of OAT.

(This article belongs to the Special Issue Advances in Intelligent Hardware, Systems and Applications)

32 pages, 9845 KiB

Open Access Article

Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification

by Mojgan Mirzaei Hotkani, Bruce Martin, Jean Francois Bousquet and Julien Delarue

Acoustics 2025, 7(3), 44; https://doi.org/10.3390/acoustics7030044 - 24 Jul 2025

Viewed by 328

Abstract

This study introduces an algorithm for quantifying oceanic wind speed and identifying sound sources in the local underwater soundscape. Utilizing low-complexity metrics like one-minute spectral kurtosis and power spectral density levels, the algorithm categorizes different soundscapes and estimates wind speed. It detects rain, vessels, fin and blue whales, as well as clicks and whistles from dolphins. Positioned as a foundational tool for implementing the Ocean Sound Essential Ocean Variable (EOV), it contributes to understanding long-term trends in climate change for sustainable ocean health and predicting threats through forecasts. The proposed soundscape classification algorithm, validated using extensive acoustic recordings (≥32 kHz) collected at various depths and latitudes, demonstrates high performance, achieving an average precision of 89% and an average recall of 86.59% through optimized parameter tuning via a genetic algorithm. Here, wind speed is determined using a cubic function with power spectral density (PSD) at 6 kHz and the MASLUW method, exhibiting strong agreement with satellite data below 15 m/s. Designed for compatibility with low-power electronics, the algorithm can be applied to both archival datasets and real-time data streams. It provides a straightforward metric for ocean monitoring and sound source identification.
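The abstract above describes wind speed as a cubic function of the PSD level at 6 kHz. A minimal sketch of how such a relation could be fitted is shown here, assuming paired PSD and reference wind measurements are available. The data are synthetic, and the coefficients in `true_coeffs`, the PSD range, and the function names are hypothetical placeholders for illustration; they are not the values or the MASLUW procedure from the paper.

```python
import numpy as np

def fit_cubic_wind_model(psd_db, wind_ms):
    """Least-squares fit of wind speed as a cubic polynomial in PSD level."""
    return np.polyfit(psd_db, wind_ms, deg=3)

def estimate_wind_speed(coeffs, psd_db):
    """Evaluate the fitted cubic at the given PSD levels."""
    return np.polyval(coeffs, psd_db)

# Synthetic stand-in for PSD levels at 6 kHz (dB) and reference wind speeds (m/s).
rng = np.random.default_rng(0)
psd = np.linspace(40.0, 70.0, 200)
true_coeffs = np.array([1e-4, -0.01, 0.6, -14.0])  # hypothetical, illustration only
wind = np.polyval(true_coeffs, psd) + rng.normal(0.0, 0.2, psd.size)

coeffs = fit_cubic_wind_model(psd, wind)
estimate = estimate_wind_speed(coeffs, psd)
rmse = float(np.sqrt(np.mean((estimate - wind) ** 2)))
print(f"RMSE of cubic fit: {rmse:.3f} m/s")
```

In practice the reference wind speeds would come from an independent source, such as the satellite data mentioned in the abstract, and the fit would be checked only within the range where agreement is reported.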

35 pages, 8048 KiB

Open Access Article

Characterization and Automated Classification of Underwater Acoustic Environments in the Western Black Sea Using Machine Learning Techniques

by Maria Emanuela Mihailov

J. Mar. Sci. Eng. 2025, 13(7), 1352; https://doi.org/10.3390/jmse13071352 - 16 Jul 2025

Viewed by 215

Abstract

Growing concern over anthropogenic underwater noise, highlighted by initiatives like the Marine Strategy Framework Directive (MSFD) and its Technical Group on Underwater Noise (TG Noise), emphasizes regions like the Western Black Sea, where increasing activities threaten marine habitats. This region is experiencing rapid growth in maritime traffic and resource exploitation, which is intensifying concerns over the noise impacts on its unique marine habitats. While machine learning offers promising solutions, a research gap persists in comprehensively evaluating diverse ML models within an integrated framework for complex underwater acoustic data, particularly concerning real-world data limitations like class imbalance. This paper addresses this by presenting a multi-faceted framework using passive acoustic monitoring (PAM) data from fixed locations (50–100 m depth). Acoustic data are processed using advanced signal processing (broadband Sound Pressure Level (SPL), Power Spectral Density (PSD)) for feature extraction (Mel-spectrograms for deep learning; PSD statistical moments for classical/unsupervised ML). The framework evaluates Convolutional Neural Networks (CNNs), Random Forest, and Support Vector Machines (SVMs) for noise event classification, alongside Gaussian Mixture Models (GMMs) for anomaly detection. Our results demonstrate that the CNN achieved the highest classification accuracy of 0.9359, significantly outperforming Random Forest (0.8494) and SVM (0.8397) on the test dataset. These findings emphasize the capability of deep learning in automatically extracting discriminative features, highlighting its potential for enhanced automated underwater acoustic monitoring. Full article

(This article belongs to the Section Ocean Engineering)

23 pages, 8011 KiB

Open Access Article

Efficient Prediction of Shallow-Water Acoustic Transmission Loss Using a Hybrid Variational Autoencoder–Flow Framework

by Bolin Su, Haozhong Wang, Xingyu Zhu, Penghua Song and Xiaolei Li

J. Mar. Sci. Eng. 2025, 13(7), 1325; https://doi.org/10.3390/jmse13071325 - 10 Jul 2025

Viewed by 241

Abstract

Efficient prediction of shallow-water acoustic transmission loss (TL) is crucial for underwater detection, recognition, and communication systems. Traditional physical modeling methods require repeated calculations for each new scenario in practical waveguide environments, leading to low computational efficiency. Deep learning approaches, based on data-driven principles, enable accurate input–output approximation and batch processing of large-scale datasets, significantly reducing computation time and cost. To establish a rapid prediction model mapping sound speed profiles (SSPs) to acoustic TL through controllable generation, this study proposes a hybrid framework that integrates a variational autoencoder (VAE) and a normalizing flow (Flow) through a two-stage training strategy. The VAE network is employed to learn latent representations of TL data on a low-dimensional manifold, while the Flow network is additionally used to establish a bijective mapping between the latent variables and underwater physical parameters, thereby enhancing the controllability of the generation process. Combining the trained normalizing flow with the VAE decoder could establish an end-to-end mapping from SSPs to TL. The results demonstrated that the VAE–Flow network achieved higher computational efficiency, with a computation time of 4 s for generating 1000 acoustic TL samples, versus the over 500 s required by the KRAKEN model, while preserving accuracy, with median structural similarity index measure (SSIM) values over 0.90. Full article

(This article belongs to the Special Issue Data-Driven Methods for Marine Structures)

46 pages, 5911 KiB

Open Access Article

Leveraging Prior Knowledge in Semi-Supervised Learning for Precise Target Recognition

by Guohao Xie, Zhe Chen, Yaan Li, Mingsong Chen, Feng Chen, Yuxin Zhang, Hongyan Jiang and Hongbing Qiu

Remote Sens. 2025, 17(14), 2338; https://doi.org/10.3390/rs17142338 - 8 Jul 2025

Viewed by 355

Abstract

Underwater acoustic target recognition (UATR) is challenged by complex marine noise, scarce labeled data, and inadequate multi-scale feature extraction in conventional methods. This study proposes DART-MT, a semi-supervised framework that integrates a Dual Attention Parallel Residual Network Transformer with a mean teacher paradigm, enhanced by domain-specific prior knowledge. The architecture employs a Convolutional Block Attention Module (CBAM) for localized feature refinement, a lightweight New Transformer Encoder for global context modeling, and a novel TriFusion Block to synergize spectral–temporal–spatial features through parallel multi-branch fusion, addressing the limitations of single-modality extraction. Leveraging the mean teacher framework, DART-MT optimizes consistency regularization to exploit unlabeled data, effectively mitigating class imbalance and annotation scarcity. Evaluations on the DeepShip and ShipsEar datasets demonstrate state-of-the-art accuracy: with 10% labeled data, DART-MT achieves 96.20% (DeepShip) and 94.86% (ShipsEar), surpassing baseline models by 7.2–9.8% in low-data regimes, while reaching 98.80% (DeepShip) and 98.85% (ShipsEar) with 90% labeled data. Under varying noise conditions (−20 dB to 20 dB), the model maintained a robust performance (F1-score: 92.4–97.1%) with 40% lower variance than its competitors, and ablation studies validated each module’s contribution (TriFusion Block alone improved accuracy by 6.9%). This research advances UATR by (1) resolving multi-scale feature fusion bottlenecks, (2) demonstrating the efficacy of semi-supervised learning in marine acoustics, and (3) providing an open-source implementation for reproducibility. In future work, we will extend cross-domain adaptation to diverse oceanic environments. Full article
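The mean teacher paradigm mentioned in the abstract above keeps a teacher model whose weights are an exponential moving average (EMA) of the student weights, and penalizes disagreement between the two models on unlabeled inputs. The sketch below illustrates only that generic mechanism with NumPy. The parameter dictionaries, the decay value, and the mean-squared consistency loss are assumptions for illustration and are not taken from the DART-MT implementation.

```python
import numpy as np

def ema_update(teacher, student, decay=0.99):
    """Move each teacher parameter toward the student by an exponential moving average."""
    return {name: decay * teacher[name] + (1.0 - decay) * student[name]
            for name in teacher}

def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)

def consistency_loss(student_logits, teacher_logits):
    """Mean squared difference between the two predicted class distributions."""
    diff = softmax(student_logits) - softmax(teacher_logits)
    return float(np.mean(diff ** 2))

# Toy parameters (hypothetical): one weight vector per model.
student = {"w": np.array([1.0, 2.0])}
teacher = {"w": np.array([0.0, 0.0])}
teacher = ema_update(teacher, student, decay=0.9)

# Identical predictions on an unlabeled sample give zero consistency loss.
loss_same = consistency_loss(np.array([[2.0, 0.5]]), np.array([[2.0, 0.5]]))
loss_diff = consistency_loss(np.array([[2.0, 0.5]]), np.array([[0.5, 2.0]]))
```

During training, the consistency term would typically be added to the supervised loss on the labeled subset, with the teacher updated after each student optimization step.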

(This article belongs to the Special Issue Remote Sensing Target Recognition and Detection: Theory and Applications (Second Edition))

36 pages, 8664 KiB

Open Access Article

A Novel Transfer Learning-Based OFDM Receiver Design for Enhanced Underwater Acoustic Communication

by Muhammad Adil, Songzuo Liu, Suleman Mazhar, Ayman Alharbi, Honglu Yan and Muhammad Muzzammil

J. Mar. Sci. Eng. 2025, 13(7), 1284; https://doi.org/10.3390/jmse13071284 - 30 Jun 2025

Viewed by 291

Abstract

The underwater acoustic (UWA) communication system faces challenges due to environmental factors, extensive multipath spread, and rapidly changing propagation conditions. Deep learning-based solutions, especially for orthogonal frequency division multiplexing (OFDM) receivers, have been shown to improve performance. However, the UWA channel characteristics are highly dynamic and depend on the specific underwater conditions. Therefore, these models suffer from model mismatch when deployed in environments different from those used for training, leading to performance degradation and requiring costly, time-consuming retraining. To address these issues, we propose a transfer learning (TL)-based pre-trained model for OFDM-based UWA communication. Rather than training separate models for each underwater channel, we aggregate received signals from five distinct WATERMARK channels, across varying signal-to-noise ratios (SNRs), into a unified dataset. This diverse training set enables the model to generalize across various underwater conditions, ensuring robust performance without extensive retraining. We evaluate the pre-trained model using real-world data from Qingdao Lake in Hangzhou, China, which serves as the target environment. Our experiments show that the model adapts well to this challenging environment, overcoming model mismatch and minimizing computational costs. The proposed TL-based OFDM receiver outperforms traditional methods in terms of bit error rate (BER) and other evaluation metrics. It demonstrates strong adaptability to varying channel conditions, including scenarios where training and testing occur on the same channel, under channel mismatch, and with or without fine-tuning on target data. At 10 dB SNR, it achieves an approximately 80% improvement in BER compared to other methods.

(This article belongs to the Special Issue Advances in Underwater Acoustic Communication and Ocean Sensor Networks)

33 pages, 3207 KiB

Open Access Article

Machine Learning Ship Classifiers for Signals from Passive Sonars

by Allyson A. da Silva, Lisandro Lovisolo and Tadeu N. Ferreira

Appl. Sci. 2025, 15(13), 6952; https://doi.org/10.3390/app15136952 - 20 Jun 2025

Viewed by 419

Abstract

The accurate automatic classification of underwater acoustic signals from passive SoNaR is vital for naval operational readiness, enabling timely vessel identification and real-time maritime surveillance. This study evaluated seven supervised machine learning algorithms for ship identification using passive SoNaR recordings collected by the Brazilian Navy. The dataset encompassed 12 distinct ship classes and was processed in two ways—full-resolution and downsampled inputs—to assess the impacts of preprocessing on the model accuracy and computational efficiency. The classifiers included standard Support Vector Machines, K-Nearest Neighbors, Random Forests, Neural Networks and two less conventional approaches in this context: Linear Discriminant Analysis (LDA) and the XGBoost ensemble method. Experimental results indicate that data decimation significantly affects classification accuracy. LDA and XGBoost delivered the strongest performance overall, with XGBoost offering particularly robust accuracy and computational efficiency suitable for real-time naval applications. These findings highlight the promise of advanced machine learning techniques for complex multiclass ship classification tasks, enhancing acoustic signal intelligence for military maritime surveillance and contributing to improved naval situational awareness. Full article

(This article belongs to the Section Marine Science and Engineering)

31 pages, 6761 KiB

Open Access Article

Improved Modulation Classification Based on Hough Transforms of Constellation Diagrams Using CNN for the UWA-OFDM Communication System

by Mohamed A. Abdel-Moneim, Mohamed K. M. Gerwash, El-Sayed M. El-Rabaie, Fathi E. Abd El-Samie, Khalil F. Ramadan and Nariman Abdel-Salam

Eng 2025, 6(6), 127; https://doi.org/10.3390/eng6060127 - 14 Jun 2025

Viewed by 430

Abstract

Automatic Modulation Classification (AMC) for underwater acoustic signals enables more efficient utilization of the acoustic spectrum. Deep learning techniques significantly improve classification performance. Hence, they can be applied in AMC work to improve underwater acoustic (UWA) communication. This paper adopts the Hough Transform (HT) and Edge Detection (ED) to enhance modulation classification, especially for a small dataset. Deep neural models based on a basic Convolutional Neural Network (CNN), Visual Geometry Group-16 (VGG-16), and VGG-19, trained on constellation diagrams transformed using HT, are adopted. The objective is to extract features from constellation diagrams projected onto the Hough space. In addition, we use Orthogonal Frequency Division Multiplexing (OFDM) technology, which is frequently utilized in UWA systems because of its ability to avoid multipath fading and enhance spectrum utilization. We use an OFDM system with the Discrete Cosine Transform (DCT), Cyclic Prefix (CP), and equalization over the UWA communication channel under the effect of estimation errors. Seven modulation types are considered for classification, including Phase Shift Keying (PSK) and Quadrature Amplitude Modulation (QAM) (2/8/16-PSK and 4/8/16/32-QAM), with a Signal-to-Noise Ratio (SNR) ranging from −5 to 25 dB. Simulation results indicate that our CNN model with HT and ED, under perfect channel estimation, achieves a 94% classification accuracy at 10 dB SNR, outperforming benchmark models by approximately 40%.

16 pages, 1439 KiB

Open Access Article

An Underwater Acoustic Communication Signal Modulation-Style Recognition Algorithm Based on Dual-Feature Fusion and ResNet–Transformer Dual-Model Fusion

by Fanyu Zhou, Haoran Wu, Zhibin Yue and Han Li

Appl. Sci. 2025, 15(11), 6234; https://doi.org/10.3390/app15116234 - 1 Jun 2025

Cited by 1 | Viewed by 508

Abstract

Traditional underwater acoustic reconnaissance technologies are limited in directly detecting underwater acoustic communication signals. This paper proposes a dual-feature ResNet–Transformer model with two innovative breakthroughs: (1) A dual-modal fusion architecture of ResNet and Transformer is constructed using residual connections to alleviate gradient degradation in deep networks and combining multi-head self-attention to enhance long-distance dependency modeling. (2) The time–frequency representation obtained from the smooth pseudo-Wigner–Ville distribution is used as the first input branch, and higher-order statistics are introduced as the second input branch to enhance phase feature extraction and cope with channel interference. Experiments on the Danjiangkou measured dataset show that the model improves the accuracy by 6.67% compared with the existing Convolutional Neural Network (CNN)–Transformer model in long-distance ranges, providing an efficient solution for modulation recognition in complex underwater acoustic environments. Full article

(This article belongs to the Special Issue Emerging Technologies for Underwater Acoustic Sensing and Communication)

22 pages, 4360 KiB

Open Access Feature Paper Article

Underwater Target Recognition Method Based on Singular Spectrum Analysis and Channel Attention Convolutional Neural Network

by Fang Ji, Shaoqing Lu, Junshuai Ni, Ziming Li and Weijia Feng

Sensors 2025, 25(8), 2573; https://doi.org/10.3390/s25082573 - 18 Apr 2025

Viewed by 524

Abstract

In order to improve the efficiency of the deep network model in processing the radiated noise signals of underwater acoustic targets, this paper introduces a Singular Spectrum Analysis and Channel Attention Convolutional Neural Network (SSA-CACNN) model. The front end of the model is designed as an SSA filter, and its input is the time-domain signal that has undergone simple preprocessing. The SSA method is utilized to separate the noise efficiently and reliably from useful signals. The first three orders of useful signals are then fed into the CACNN model, which has a convolutional layer set up at the beginning of the model to further remove noise from the signal. Then, the attention of the model to the feature signal channels is enhanced through the combination of multiple groups of convolutional operations and the channel attention mechanism, which facilitates the model’s ability to discern the essential characteristics of the underwater acoustic signals and improve the target recognition rate. Experimental Results: The signal reconstructed by the first three-order waveforms at the front end of the SSA-CACNN model proposed in this paper can retain most of the features of the target. In the experimental verification using the ShipsEar dataset, the model achieved a recognition accuracy of 98.64%. The model’s parameter count of 0.26 M was notably lower than that of other comparable deep models, indicating a more efficient use of resources. Additionally, the SSA-CACNN model had a certain degree of robustness to noise, with a correct recognition rate of 84.61% maintained when the signal-to-noise ratio (SNR) was −10 dB. Finally, the pre-trained SSA-CACNN model on the ShipsEar dataset was transferred to the DeepShip dataset with a recognition accuracy of 94.98%. Full article
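The SSA front end described in the abstract above separates a signal into ordered components and keeps only the leading ones. A minimal sketch of basic singular spectrum analysis is given here, assuming the textbook formulation (trajectory matrix, SVD, diagonal averaging). The window length, the test signal, and the function name `ssa_reconstruct` are illustrative assumptions; the paper's actual filter settings and preprocessing are not given in the abstract.

```python
import numpy as np

def ssa_reconstruct(signal, window, n_components):
    """Reconstruct a 1-D signal from its leading SSA components."""
    x = np.asarray(signal, dtype=float)
    n = x.size
    k = n - window + 1
    # Trajectory (Hankel) matrix: entry (i, j) holds x[i + j].
    traj = np.column_stack([x[j:j + window] for j in range(k)])
    u, s, vt = np.linalg.svd(traj, full_matrices=False)
    # Low-rank approximation from the leading singular triples.
    approx = (u[:, :n_components] * s[:n_components]) @ vt[:n_components]
    # Diagonal averaging maps the matrix back to a time series.
    recon = np.zeros(n)
    counts = np.zeros(n)
    for i in range(window):
        recon[i:i + k] += approx[i]
        counts[i:i + k] += 1
    return recon / counts

# Synthetic example: a sinusoid buried in white noise.
rng = np.random.default_rng(1)
t = np.arange(200)
clean = np.sin(2 * np.pi * t / 25)
noisy = clean + 0.3 * rng.normal(size=t.size)

denoised = ssa_reconstruct(noisy, window=40, n_components=3)
err_noisy = float(np.mean((noisy - clean) ** 2))
err_denoised = float(np.mean((denoised - clean) ** 2))
```

Keeping three components mirrors the "first three orders" wording of the abstract, although how the paper orders and selects components may differ from this sketch.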

(This article belongs to the Section Sensor Networks)

12 pages, 2593 KiB

Open Access Article

Multiclass CNN Approach for Automatic Classification of Dolphin Vocalizations

by Francesco Di Nardo, Rocco De Marco, Daniel Li Veli, Laura Screpanti, Benedetta Castagna, Alessandro Lucchetti and David Scaradozzi

Sensors 2025, 25(8), 2499; https://doi.org/10.3390/s25082499 - 16 Apr 2025

Cited by 1 | Viewed by 897

Abstract

Monitoring dolphins in the open sea is essential for understanding their behavior and the impact of human activities on the marine ecosystems. Passive Acoustic Monitoring (PAM) is a non-invasive technique for tracking dolphins, providing continuous data. This study presents a novel approach for classifying dolphin vocalizations from a PAM acoustic recording using a convolutional neural network (CNN). Four types of common bottlenose dolphin (Tursiops truncatus) vocalizations were identified from underwater recordings: whistles, echolocation clicks, burst pulse sounds, and feeding buzzes. To enhance classification performances, edge-detection filters were applied to spectrograms, with the aim of removing unwanted noise components. A dataset of nearly 10,000 spectrograms was used to train and test the CNN through a 10-fold cross-validation procedure. The results showed that the CNN achieved an average accuracy of 95.2% and an F1-score of 87.8%. The class-specific results showed a high accuracy for whistles (97.9%), followed by echolocation clicks (94.5%), feeding buzzes (94.0%), and burst pulse sounds (92.3%). The highest F1-score was obtained for whistles, exceeding 95%, while the other three vocalization typologies maintained an F1-score above 80%. This method provides a promising step toward improving the passive acoustic monitoring of dolphins, contributing to both species conservation and the mitigation of conflicts with fisheries. Full article

(This article belongs to the Section Intelligent Sensors)

12 pages, 1295 KiB

Open Access Article

A Novel ViT Model with Wavelet Convolution and SLAttention Modules for Underwater Acoustic Target Recognition

by Haoran Guo, Biao Wang, Tao Fang and Biao Liu

J. Mar. Sci. Eng. 2025, 13(4), 634; https://doi.org/10.3390/jmse13040634 - 22 Mar 2025

Cited by 2 | Viewed by 636

Abstract

Underwater acoustic target recognition (UATR) technology plays a significant role in marine exploration, resource development, and national defense security. To address the limitations of existing methods in computational efficiency and recognition performance, this paper proposes an improved WS-ViT model based on Vision Transformers (ViTs). By introducing the Wavelet Transform Convolution (WTConv) module and the Simplified Linear Attention (SLAttention) module, WS-ViT can effectively extract spatiotemporal complex features, enhance classification accuracy, and significantly reduce computational costs. The model is validated using the ShipsEar dataset, and the results demonstrate that WS-ViT significantly outperforms ResNet18, VGG16, and the classical ViT model in classification accuracy, with improvements of 7.3%, 4.9%, and 2.1%, respectively. Additionally, its training efficiency is improved by 28.4% compared to ViT. This study demonstrates that WS-ViT not only enhances UATR performance but also maintains computational efficiency, providing an innovative solution for efficient and accurate underwater acoustic signal processing. Full article

15 pages, 4634 KiB

Open Access Article

Efficient One-Dimensional Network Design Method for Underwater Acoustic Target Recognition

by Qing Huang, Xiaoyan Zhang, Anqi Jin, Menghui Lei, Mingmin Zeng, Peilin Cao, Zihan Na and Xiangyang Zeng

J. Mar. Sci. Eng. 2025, 13(3), 599; https://doi.org/10.3390/jmse13030599 - 18 Mar 2025

Viewed by 432

Abstract

Many studies have used various time-frequency feature extraction methods to convert ship-radiated noise into three-dimensional (3D) data suitable for computer vision (CV) models, which have shown good results on public datasets. However, traditional feature engineering (FE) has been enhanced to interface matching–feature engineering (IM-FE). This approach requires considerable effort in feature design, a longer sample duration, or a higher upper frequency limit. In this context, this paper proposes a one-dimensional network design for underwater acoustic target recognition (UATR-ND1D), combined only with the fast Fourier transform (FFT), which can effectively alleviate the problem of IM-FE. This method is abbreviated as FFT-UATR-ND1D. FFT-UATR-ND1D was applied to the design of a one-dimensional network, named ResNet1D. Experiments were conducted on two mainstream datasets, using ResNet1D in 4320 and 360 tests, respectively. The lightweight model ResNet1D_S, with only 0.17 M parameters and 3.4 M floating point operations (FLOPs), achieved average accuracies of 97.2% and 95.20%. The larger model, ResNet1D_B, with 2.1 M parameters and 5.0 M FLOPs, reached the best accuracies on both datasets, 98.81% and 98.42%, respectively. Existing methods with similar parameter sizes performed 3–5% worse than the methods proposed in this paper. Additionally, methods achieving similar recognition rates require one to two orders of magnitude more parameters and FLOPs.

16 pages, 1813 KiB

Open Access Article

Innovative Regression Model for Frequency-Dependent Acoustic Source Strength in the Aquatic Environment: Bridging Scientific Insight and Practical Applications

by Moshe Greenberg, Uri Kushnir and Vladimir Frid

Sensors 2025, 25(5), 1560; https://doi.org/10.3390/s25051560 - 3 Mar 2025

Cited by 1 | Viewed by 1150

Abstract

This study addresses the challenge of predicting acoustic source strength in freshwater environments, focusing on frequencies between 100–400 kHz. Acoustic signal attenuation is inherently frequency-dependent and influenced by water properties as well as the total propagation path of the acoustic wave, complicating the accurate determination of source strength. To address this challenge, we developed a non-linear regression model for solving the inverse problem of attenuation correction in reflected signals from typical aquatic reflectors, addressing the current absence of robust correction tools in this frequency range. The novelty of our approach lies in designing a non-linear regression framework that incorporates key physical parameters—signal energy, propagation distance, and frequency—enabling accurate source strength prediction. Using an experimental setup comprising ultrasonic transducers and a signal generator under controlled conditions, we collected a comprehensive dataset of 366 samples. The results demonstrate that our proposed model achieves reliable source strength prediction by simplifying Thorpe’s equation for freshwater environments. This research represents a significant advancement in underwater acoustics, providing a practical and reliable tool for source strength estimation in freshwater systems. The developed methodology may have broad applications across sonar technology, environmental monitoring, and aquatic research domains. Full article
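For context on the attenuation correction discussed in the abstract above, the sketch below evaluates a commonly quoted form of Thorp's empirical absorption formula (frequency in kHz, absorption in dB/km) and uses it to back out a source level from a received level. Several points are assumptions rather than the paper's method: the formula is the seawater version and is usually applied at frequencies well below the 100–400 kHz band studied, spherical spreading is assumed, and the simplified freshwater form and regression model from the paper are not reproduced because the abstract does not give them.

```python
import math

def thorp_absorption_db_per_km(f_khz):
    """Commonly quoted Thorp absorption formula for seawater, in dB/km (f in kHz)."""
    f2 = f_khz ** 2
    return 0.11 * f2 / (1.0 + f2) + 44.0 * f2 / (4100.0 + f2) + 2.75e-4 * f2 + 0.003

def transmission_loss_db(range_m, f_khz):
    """One-way loss: spherical spreading plus absorption (assumed model)."""
    spreading = 20.0 * math.log10(range_m)
    absorption = thorp_absorption_db_per_km(f_khz) * range_m / 1000.0
    return spreading + absorption

def estimate_source_level_db(received_db, range_m, f_khz):
    """Invert the assumed propagation model: source level = received level + loss."""
    return received_db + transmission_loss_db(range_m, f_khz)

alpha_1khz = thorp_absorption_db_per_km(1.0)
source_level = estimate_source_level_db(received_db=100.0, range_m=1000.0, f_khz=1.0)
print(f"absorption at 1 kHz: {alpha_1khz:.4f} dB/km, source level: {source_level:.2f} dB")
```

A regression approach such as the one the abstract describes would replace the fixed coefficients here with parameters fitted to measured signal energy, propagation distance, and frequency.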

(This article belongs to the Section Physical Sensors)

21 pages, 28941 KiB

Open Access Article

A Spatially Informed Machine Learning Method for Predicting Sound Field Uncertainty

by Xiangmei Chen, Chao Li, Haibin Wang, Yupeng Tai, Jun Wang and Cyrille Migniot

J. Mar. Sci. Eng. 2025, 13(3), 429; https://doi.org/10.3390/jmse13030429 - 25 Feb 2025

Viewed by 617

Abstract

Predicting the uncertain distribution of underwater acoustic fields, influenced by dynamic oceanic parameters, is critical for acoustic applications that rely on sound field characteristics to generate predictions. Traditional methods, such as the Monte Carlo method, are computationally intensive and thus unsuitable for applications requiring high real-time performance and flexibility. Current machine learning methods excel at improving computational efficiency but face limitations in predictive performance, especially in shadow areas. In response, this paper proposes a machine learning method that balances accuracy and efficiency for predicting uncertainties in deep ocean acoustics by decoupling the scene representation into two components: (a) a local radiance model related to environmental factors, and (b) a global representation of the overall scene context. Specifically, the internal relationships within the local radiance are first exploited, aiming to capture fine-grained details within the acoustic field. Subsequently, local clues are combined with receiver location information for joint learning. To verify the effectiveness of the proposed approach, a dataset of historical oceanographic data has been compiled. Extensive experiments validate its efficiency compared to traditional Monte Carlo techniques and its superior accuracy compared to existing learning methods.

(This article belongs to the Section Ocean Engineering)

Search Results (96)
