Next Article in Journal
Effects of Anthropogenic Vibratory Noise on Plant Development and Herbivory
Previous Article in Journal
Optimizing Museum Acoustics: How Absorption Magnitude and Surface Location of Finishing Materials Influence Acoustic Performance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification

by
Mojgan Mirzaei Hotkani
1,2,*,†,
Bruce Martin
2,†,
Jean Francois Bousquet
1,† and
Julien Delarue
2,†
1
Department of Electrical and Computer Engineering, Dalhousie University, Halifax, NS B3H 4R2, Canada
2
JASCO Applied Sciences Ltd., 20 Mount Hope Avenue, Dartmouth, NS B2Y 4S3, Canada
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Acoustics 2025, 7(3), 44; https://doi.org/10.3390/acoustics7030044
Submission received: 31 March 2025 / Revised: 23 June 2025 / Accepted: 3 July 2025 / Published: 24 July 2025

Abstract

This study introduces an algorithm for quantifying oceanic wind speed and identifying sound sources in the local underwater soundscape. Utilizing low-complexity metrics like one-minute spectral kurtosis and power spectral density levels, the algorithm categorizes different soundscapes and estimates wind speed. It detects rain, vessels, fin and blue whales, as well as clicks and whistles from dolphins. Positioned as a foundational tool for implementing the Ocean Sound Essential Ocean Variable (EOV), it contributes to understanding long-term trends in climate change for sustainable ocean health and predicting threats through forecasts. The proposed soundscape classification algorithm, validated using extensive acoustic recordings (≥32 kHz) collected at various depths and latitudes, demonstrates high performance, achieving an average precision of 89 % and an average recall of 86.59 % through optimized parameter tuning via a genetic algorithm. Here, wind speed is determined using a cubic function with power spectral density (PSD) at 6 kHz and the MASLUW method, exhibiting strong agreement with satellite data below 15 m/s. Designed for compatibility with low-power electronics, the algorithm can be applied to both archival datasets and real-time data streams. It provides a straightforward metric for ocean monitoring and sound source identification.

1. Introduction

In 2018, the Global Ocean Observing System (GOOS) formally introduced Ocean Sound as a cross-disciplinary Essential Ocean Variable (EOV) [1], which officially recognized sound’s role as a parameter for assessing ecosystem health, climate change, and human impact on the ocean [2]. Ocean sound is inherently multidisciplinary, as it can be analyzed to assess physical parameters like wind speed [3,4,5], rain rate [6,7], and sea-ice presence [8] and detect the presence of sound-producing animals including marine mammals, many fish, some invertebrates [9] and sounds from human activities. Human sounds such as shipping noise [10,11], seismic airgun surveys, sonars, and pile driving are known to injure and disturb marine life of all taxa [12,13,14] as well as mask the sounds animals use for communication, socialization, foraging, and navigation [15,16]. Thus, tracking the contribution of human sounds to ecosystems around the globe is an indicator of the acoustic quality of an ecosystem for the animals. For example, the quantity and quality of sound on shallow water coral reefs is correlated with the health of those reefs [17,18,19].
The Ocean Sound EOV was established to provide a framework for detecting and interpreting ecosystem changes. It facilitates this by comparing acoustic metrics in various periods and geographical contexts. Specifically, a systematic analysis of soundscape variations allows researchers to track changes in biodiversity, ecosystem health, and overall ecological balance [20]. By tracking the dynamics of marine environments, the global community will develop the ability to predict future changes, contributing to informed conservation and management strategies. This EOV will offer valuable information on the scale and impacts of anthropogenic activity by analyzing global Ocean Sound data over extended periods [21]. This knowledge will guide us in creating sustainable practices and policies, thus promoting the harmony between human activities and marine life.
Changes in human activity during the COVID-19 pandemic highlight how systematic Ocean Sound reporting enables the community to measure environmental changes.Although a systematic EOV is not yet in place, several groups have studied changes in marine acoustic environments during lockdowns and restrictions, revealing the direct impact of reduced human activity on underwater soundscapes [22]. The measures enacted to mitigate the COVID-19 pandemic led to significant changes in human activity, resulting in a prolonged reduction in seismic noise. The seismic noise quiet period experienced in 2020 stands as a notable global anthropogenic seismic noise reduction on record [23,24]. In addition, measurements of power spectral density (PSD) at 100 Hz in the Port of Vancouver indicated a decline in shipping activity during the COVID-19 pandemic [25]. It was also observed that the investigation of changes in the acoustic environment due to COVID-19 was hampered by a lack of accessible and standardized data products from around the globe [26,27].
In 2023, the Ocean Sound EOV Implementation Plan [28] was published. It documents the role of Ocean Sound in supporting the GOOS core mandates of measuring climate change and ocean health and monitoring threats. The plan identifies 11 tasks to fully implement the EOV, which are divided into four broad groups: (1) maintaining the existing Ocean Sound measurement sites and gathering essential oceanic data; (2) developing standardized metrics to report Ocean Sound; (3) funding openly accessible databases to host and distribute Ocean Sound data; and (4) developing hindcast, nowcast, and forecast Ocean Sound models. Numerous international workshops have been held to establish a minimum set of soundscape metrics [29,30]. These have been demonstrated and further refined in large-scale monitoring projects [31,32] and are now being formalized in the first revision of the ISO Standard 7605 (Underwater acoustics—Measurement of underwater ambient sound standard) [33] for the measurement and reporting of ambient sound. These initial recommendations focus on documenting the sound pressure levels in time and frequency by reporting, as a minimum, the one-minute decidecade and decade sound pressure levels. In general, it is also recommended to report the per-second values to increase the temporal resolution and hybrid millidecade values [34] to increase the frequency resolution.
Currently, there are no standardized methods for measuring wind speed, rain, or human and biological contributions to the soundscape. The works of [3,4] provide a means of quantifying wind and rain rates; however, they are presented as appropriate for specific environments (the Mediterranean Sea and the Greek Sea, respectively). Based on the successful quantification of the diversity of species in the air, attempts have been made to employ metrics such as entropy, diversity, and complexity to describe the underwater soundscape [35,36,37]. However, these have been unsuccessful because of the long underwater acoustic propagation ranges, as well as the temporal and frequency overlap between human, natural, and biological sound sources. However, some metrics have shown the ability to distinguish between soundscapes [38]. However, detecting individual sound sources using various detectors has proven more effective than relying on general-purpose metrics for underwater soundscapes [39,40,41].
An effective algorithm for oceanic data analysis must meet several key requirements to ensure its utility. Oceanic data analysis involves processing and interpreting various environmental and biological signals collected from the ocean to monitor weather patterns, assess marine biodiversity, and evaluate human impacts on underwater ecosystems. One critical aspect of this analysis is the ability to assess atmospheric conditions. For example, accurately estimating wind speed and detecting the presence of rain can provide valuable in situ measurements that contribute to weather forecasting and storm mitigation [42]. Oceanic gliders and drifters (e.g., ARGOS floats) can enhance meteorological models and improve disaster preparedness if these data are transmitted in near-real-time.
Beyond environmental factors, the algorithm should facilitate biological monitoring by identifying sentinel species, such as fin and blue whales, and recognizing delphinid clicks and whistles. In addition, it should be capable of detecting human-induced sound sources, particularly vessels, to provide information on anthropogenic impacts on the underwater environment. To ensure data reliability, the algorithm must also recognize and account for flow noise as part of a quality assurance process. The reliability of the algorithm is crucial; It should maintain high accuracy in detecting and estimating environmental and biological parameters across various water depths and latitudes, ensuring its applicability in various oceanic environments. Finally, to improve practicality and accessibility, the algorithm must be compatible with low-power embedded systems, allowing efficient computation in resource-constrained environments [43].
An algorithm was introduced by Nystuen et al. [4] to measure the wind speed and classify sources in underwater soundscapes. In their algorithm, shipping and odontocete echolocation sound sources are detected; then those samples are excluded, and rainfall categories are established. Finally, the wind speed was measured using the third-order polynomial of the PSD at 8 kHz.
This paper proposes an algorithm that extends the work of Nystuen et al. [4] and can serve as an easily accessible means of detecting sound sources for the Ocean Sound EOV. Regarding physical ocean parameters, the algorithm estimates the wind speed and the type of rain that occurs. Concerning biological sound sources, it detects the presence of fin and blue whale choruses as well as odontocete clicks and whistles. The algorithm also detects the presence of shipping activities as an indicator of human activity.
In this study, analyses were performed on 19 different datasets, totaling 1,016,890 min of data, stored in the form of millisecond millidiameter spectral data. An empirical algorithm was developed, based on [4], to analyze each one-minute millidecade spectrum to identify rain, vessels, fin and blue whales, and odontocete clicks and whistles. The algorithm first identified sound sources, which were then validated using an automated data selection and validation algorithm [44]. This algorithm selected 100 min from each dataset for the manual verification of the detected sound sources. The classifier was then refined through the application of the Genetic Optimization algorithm.
After the classification process, the wind speed is calculated using a cubic function of PSD at 6 kHz based on the recording depth. The resulting wind speed estimation shows good agreement with satellite data, particularly for speeds below 15 m/s, and maintains a low complexity suitable for embedding on remote nodes.
The subsequent sections of this paper are organized as follows: Section 2 describes the datasets employed. Section 3 presents an overview of the dataset pre-processing, including the depth correction process for the sound pressure level. In Section 4, the classification algorithm is explained, and its performance is documented. Section 5 describes the quantification of the wind speed rate and compares the estimates to wind speeds derived from hindcast models. Finally, Section 6 provides the conclusion.

2. Data Collection

The one-minute acoustic data utilized in this paper were collected between March 2013 and October 2021 using autonomous multichannel acoustic recorders (AMARs) with a nominal sensitivity of 164   dB   V / μ Pa , manufactured by JASCO Applied Science Ltd. The AMARs utilize hydrophones from GeoSpectrum Technologies Inc. (Dartmouth, NS, Canada). The recordings encompass various water depths and a broad range of latitudes, with all data sampled at 32 kHz or higher. For all recordings used in the wind speed analysis, the system noise floor at 6 kHz was 35 dB re 1 μ Pa 2 / Hz or lower.
The data locations are shown in Figure 1. Table 1 details the location, deployment duration, water depth, sample rates, and total minutes of acoustic data collected for each dataset.
Wind speed information incorporated in this study is derived from the ERA5 dataset for meteorological data. ERA5, a global atmospheric reanalysis tool developed by the European Center for Medium-Range Weather Forecasts (ECMWF), provides a detailed and high-resolution representation of historical weather conditions [45]. Wind speed data are retrieved from the Copernicus Climate Change Service (C3S) web service, serving as an accessible interface for ERA5 data [46].
The Grand Passage (Nova Scotia, Canada) dataset was measured in an area with high currents. The current speeds in this area were obtained using an acoustic doppler current profiler (ADCP). For this purpose, the current data is instrumental in enabling the analysis of flow noise. To support the analysis of shipping detection, automatic identification system (AIS) data near the Gaspé stations [47] was obtained for the corresponding acoustic recordings from September to October 2019.
Figure 2 shows the process of analyzing millidecade PSD data. Initially, the raw PSD data undergo offset correction. After correction, the data are fed into two distinct algorithms. One algorithm estimates the wind speed, while the other classifies the soundscape into different sources.

3. The Offset Correction Process

Underwater ambient noise measurements are influenced by various factors, such as source directionality, spreading losses, attenuation, refraction, and multipath propagation, all of which vary with frequency. To ensure reliable and comparable measurements, the PSD must be standardized for depth and seawater absorption. In an isovelocity sound speed profile—where sound speed remains constant with depth—a correction factor compensates for signal loss due to absorption and path geometry. Higher frequencies require more correction because they experience greater absorption and spreading losses. The absorption coefficient is calculated using Ainslie and McColm’s model [48]. This correction improves the robustness of PSD measurement, adapting it to varying environmental conditions such as depth, salinity, and temperature. This method is particularly valuable for estimating oceanic wind speed and analyzing other underwater acoustic phenomena.

4. The Classifier Algorithm

An optimized algorithm was designed to classify a broad range of underwater acoustic signatures. Each one-minute segment, formatted in hybrid millidecades of recorded underwater acoustics, undergoes processing to determine its class and soundscape type. Adopting hybrid millidecade-formatted data proves to be an effective solution to significantly reduce the size of the spectral data. The hybrid format utilizes a 1 Hz resolution up to 435 Hz and millidecade frequency bands beyond 435 Hz. Millidecades represent logarithmically spaced frequency bands with a bandwidth equivalent to 1/1000 th of a decade [34].
Figure 3 shows an example of the classification of different acoustic sources. This classification was validated through automated and manual analyses, which will be explained in further detail.
As evident in the examples presented in Figure 3, the PSD for flow noise shows a prominent increase at frequencies below 30 Hz, surpassing 75 dB re 1 μ Pa 2 / Hz . A shallow slope in frequencies between 2 kHz to 6 kHz is observed in the PSD for moderate rain. Also, the PSD level within this frequency range exceeds 70 dB re 1 μ Pa 2 / Hz . The distinctive features of the PSD include a sharp peak at a high frequency of 30 kHz, indicative of dolphin clicks, and a peak at 10 kHz, representing dolphin whistles. Identifying seismic activity requires both temporal and spectral analyses, with discernible peaks in the PSD occurring in the frequency range of 20 to 120 Hz.
The presence of heavy shipping (single-screw vessels) is characterized by a PSD peak around 50 Hz, while light shipping exhibits a peak around 200 Hz, with PSD levels exceeding 80 dB re 1 μ Pa 2 / Hz for any shipping activities. The presence of fin whales is discernible by a peak at 20 Hz.
Section 4.1 provides an overview of key acoustic characteristics crucial for the proposed classification of underwater soundscapes. Following this, the classification approach is described in Section 4.2. The process of manual data analysis is explained in Section 4.3. Subsequently, Section 4.4 introduces the proposed classification algorithm. Finally, the classifier’s performance is documented in Section 4.5.

4.1. Feature Extraction and Employed Metrics

The proposed algorithm classifies underwater soundscapes by identifying discriminative features in the frequency spectrum.
Spectral features play a vital role in revealing the energy distribution across diverse frequencies, with PSD standing out as the most important metric in this domain. PSD provides valuable insights into how power or energy is distributed across the various frequency components of a signal. Distinctive patterns can be identified through the analysis of PSD, including the presence of harmonics, peaks, or specific frequency bands associated with unique characteristics of underwater acoustics.
Spectral kurtosis is a statistical metric that quantifies the sharpness of peaks in the frequency spectrum. It is mathematically represented by β f and is calculated using
β f = 1 / N f f ( P S D ( f ) P S D ¯ ) 4 σ f 2 ,
where N f is the number of points for the fast Fourier transform (FFT), P S D ( f ) represents the PSD value at frequency f, P S D ¯ is the mean of the PSD, and σ f is the standard deviation of the PSD. Kurtosis [49] theoretically spans an unbounded range. However, practical interpretation commonly focuses on values around 3. A kurtosis of 3 indicates a mesokurtic distribution, the standard or normal level. Values greater than 3 indicate leptokurtic distributions with heavier tails and a more peaked shape (impulsive or tonal features, e.g., dolphin clicks). In contrast, values less than 3 suggest platykurtic distributions characterized by lighter tails and a flatter shape (uniformly distributed spectral content).
The slope of the spectrum serves various purposes within the algorithm. Meanwhile, the coefficient of determination ( R 2 ) is employed to assess the goodness of fit of a linear regression model, ensuring that the data accurately represent the parameter being characterized. R 2 is calculated using
R 2 = 1 i = 1 n ( PSD i P S D ^ i ) 2 i = 1 n ( P S D i P S D ¯ ) 2
where P S D i represents the observed power spectral density at frequency i, P S D ¯ denotes the mean power spectral density, and P S D ^ i signifies the predicted power spectral density at frequency i. Also, n is the number of frequencies.

4.2. Optimized Classification Algorithm

The presented classification algorithm introduces an optimization approach by fine-tuning parameters that serve as thresholds for key features in the classification process, such as the kurtosis threshold, the R-squared range ( R 2 ), the slope range, the frequency range (either singular or multiple), and the PSD threshold for a specified frequency range. These thresholds are dynamically optimized using a genetic algorithm (GA) to achieve the best possible classification performance [50].
The algorithm commences with the initialization of these parameters, and the 1-min millidecade data undergoes offset correction as specified in Section 3. The subsequent step involves feature extraction, specifically focusing on temporal and spectral characteristics like kurtosis, regression features, and the mean PSD, as detailed for each source in later sections.
The iterative optimization process shown in Figure 4 begins, guided by the genetic algorithm, which systematically refines the thresholds ( a , b , c , d , …) by comparing the model predictions with known truth data (see Section 4.3) to enhance the performance of the classification model. The genetic algorithm generates a population of potential threshold configurations and evaluates them using the F1 score, which balances precision and recall in binary classification tasks. The GA then selects solutions for reproduction, performs crossover to exchange genetic material, introduces variations through mutations, and updates the population with new offsprings [51,52,53]. The termination condition is met when the cost function, derived from the F1 score, falls below a predefined threshold, indicating satisfactory classification performance.
It is important to note that the initial population of threshold values was seeded using an expert-based approach. Specifically, the threshold ranges were informed by prior domain knowledge and preliminary analyses of characteristic spectral features associated with various sound sources. The algorithm’s modular design allows easy adaptation to different datasets and requirements. Integrating pre-processing, feature extraction, classification, and parameter optimization in a unified framework provides an effective solution for tailored classification optimization.
The Algorithm 1 outlines the steps of the proposed optimized classification process. Detailed explanations of the truth dataset and the automated classifier will follow in subsequent sections.
Algorithm 1 Optimized Classification Algorithm
1:
function PreprocessData(Raw Data)
2:
      Implement offset correction algorithm
3:
      return Pre-processed Data
4:
end function
5:
function ExtractFeatures( a , b , c , d , Pre-processed Data)
6:
      Implement feature extraction algorithm
7:
      return Features
8:
end function
9:
function RunClassifier(Features)
10:
      Implement classifier algorithm
11:
      return Predictions
12:
end function
13:
function CalculateF1Score(Predictions, Truth Data)
14:
      Implement F1 score calculation logic
15:
      return F1_Score
16:
end function
17:
function CostFunction(F1_Score)
18:
       F opt 1 F1_Score
19:
      return  F opt
20:
end function
21:
function OptimizeParameters( a , b , c , d , F opt )
22:
       [ output , population , scores ] ga ( F opt , N , l b , u b , o p t i o n s )
23:
      Update parameters based on optimization strategy
24:
      return  a new , b new , c new , d new
25:
end function
26:
procedure  Main
27:
      Raw Data ← Load Raw Data
28:
      Pre-processed Data ← PreprocessData(Raw Data)
29:
       a , b , c , d InitialParameters()
30:
      while True do
31:
          Features ←ExtractFeatures( a , b , c , d , Pre-processed Data)
32:
          Predictions ← RunClassifier(Features)
33:
          F1_Score ← CalculateF1Score(Predictions, Truth Data)
34:
           F opt  CostFunction(F1_Score)
35:
          if  F opt < threshold then
36:
               break
37:
          end if
38:
           a , b , c , d OptimizeParameters( a , b , c , d , F opt )
39:
      end while
40:
end procedure

4.3. Manual Data Analysis

To assess the classifier’s performance, a rigorous manual analysis was undertaken to establish a broad truth dataset that includes a diverse set of acoustic events amid local noise. Using the automated data selection for validation (ADSV) algorithm [44], a carefully curated subset of one-minute acoustic files was selected from the dataset shown in Table 1. These files were chosen to represent the detailed acoustic conditions within the dataset, allowing for manual review and validation. Trained analysts examined these files using PAMlab (JASCO Applied Sciences’tool). The analyst annotated each file, discerning the presence or absence of each source in the annotation sets based on the time and frequency behavior of the acoustic signal. The annotations for each one-minute file may encompass fin whales, blue whales, delphinid clicks and whistles, seismic activity, light shipping, heavy shipping, light rain, moderate rain, heavy rain, and flow noise. Within PAMLab, the analysts employ specific functionalities for file annotation, utilizing tools such as the average power spectral density (Box (e) in Figure 4) and a spectrogram with user-defined FFT settings. Figure 4 shows a spectrogram of a recorded file sampled on a one-minute interval. The data is from the Scotian Shelf and was recorded for the Environmental Studies Research Fund client in November 2016.
Various soundscape sources in this figure were annotated as follows: Box (a) denotes drizzle, attributed to a distinct peak at 15 kHz in Box (e). Box (c) includes clicks produced by sperm whales, while Box (b) captures the effects of shipping activities. Additionally, normalizing the spectrogram across time brought to light impulsive sounds, such as whistles, within this soundscape. To annotate Box (d), the analyst fine-tuned the frame length and time step settings to discern details at low frequencies, conclusively labeling it as flow noise and the presence of fin whales, as shown in Figure 5.
Referring to Figure 5, the observed peaks in the 18 to 25 Hz range strongly indicate the potential presence of fin whales. Furthermore, harmonic tones detected within the 10 to 35 Hz range are likely associated with flow noise.
In this study, 1900 files were selected for annotation, comprising 100 files per dataset chosen by the ADSV algorithm. After completing the annotation process, a .csv file was created, capturing details about time, recorder location, and annotations for each file at specific time points. This file served as the truth data for optimizing the algorithm’s performance.

4.4. Automated Classification Algorithm

The typical sound levels of ocean background noise across different frequencies, as originally measured by Wenz [54], form the foundation for understanding the variability in underwater acoustic environments. Building upon this, an automated classification algorithm was developed for identifying various acoustic sources within the ocean soundscape. This algorithm leverages multivariate analysis, considering combinations of acoustic PSD levels at specific frequencies, spectral slopes, and R-squared values obtained from linear regression model fits to PSD levels across selected frequency bands. It further incorporates key factors such as spectral kurtosis and sound pressure levels within specific bands to enhance accuracy.
The classification algorithm automatically categorizes the soundscape into three distinct classifications, namely geophysical, biological, and anthropological, as shown in Figure 6. Geophysical sounds include those generated by natural processes such as flow noise, wind, and rainfall. Biological sounds encompass acoustic signals from marine mammals, including fin whales, blue whales, and dolphins, while anthropological sounds refer to noise created by human activities, such as shipping and seismic surveys. This comprehensive algorithm provides a detailed classification system that not only identifies the acoustic sources but also helps in monitoring the underwater environment effectively by categorizing soundscapes based on their origin.

4.4.1. Geophysicallt Generated Sound Sources

This section delves into the detection mechanism for flow noise [55], presenting models to identify its characteristics. Then, an improved wind speed estimation algorithm is described. Finally, the proficiency of the algorithm in detecting and categorizing rainfall sounds is detailed, offering insights into heavy, moderate, and light rainfall.
Flow Noise
To identify flow noise in the proposed algorithm, the boolean variable F p (flow present) is expressed as
F p = ( R flow 2 > 0.95 ) ( α 1 < m flow < α 2 ) ( PSD ¯ 30   Hz > α 3 ) ( PSD ¯ 30   Hz > PSD ¯ HS + α 4 )
where ∧ is the logical AND operator and ∨ is the logical OR operator. R flow 2 represents the coefficient of determination resulting from linear regression on PSDs within the frequency range of 5 to 50 Hz. m flow is the slope of the linear regression line derived from fitting PSDs within the GA-determined frequency range. PSD ¯ 30   Hz signifies the mean PSD within the frequency range of 25 to 35 Hz. PSD ¯ HS is the mean PSD for heavy shipping activities within the frequency range of 49 to 79 Hz. The values for thresholds ( α 1 = 20 , α 2 = 15 , α 3 = 75 , and α 4 = 10 ) were optimized by the GA.
The logical operators ∧ (logical AND) and ∨ (logical OR) combine these conditions to determine the presence of flow noise. Specifically, Equation (3) asserts that if the regression fit is strong ( R Flow 2 > 0.95 ) and the slope falls within a specified range, or if the mean PSD at 30 Hz exceeds a threshold and is greater than the mean PSD for heavy shipping with an additional threshold, then flow noise is considered present.
Figure 7 illustrates the correlation between flow velocity and the associated flow noise. The observed negative slope in PSD from 5 to 50 Hz and evident PSD levels exceeding 70 dB re 1 μ Pa 2 / Hz indicate an increased likelihood of flow noise with higher flow velocity.
Wind
In this work, PSD levels of wind noise, corresponding to wind speeds ranging from 0 to 14 m/s, were categorized into seven groups, with each category spanning a 2 m/s interval. An eighth category is incorporated for wind speeds exceeding 14 m/s. Figure 8 displays the average PSD estimates for each wind speed category at deep locations. The shaded area surrounding each average PSD visually represents the frequency-dependent standard deviation. To enhance clarity, the shaded regions around the average PSDs depict one-quarter of the standard deviation. The correlation between PSD levels within the 4–15 kHz frequency range and varying wind speeds, as inspired by [5], is expressed as PSD ^ w and is modeled by
PSD ^ w = m w · log 10 ( f ) + b w ,
where PSD is the power spectral density level, f is the frequency, and m w and b w are the linear regression coefficients, which were fitted to the data.
The regression curves are shown as dashed lines in Figure 8. In this figure, the PSD demonstrates a general increase in level with rising wind speeds for all frequencies. Simultaneously, the slope of the linear regression model for the 4–15 kHz frequency range experiences a decline. Applying Equation (4) to the PSD within this frequency range reveals a superior fit compared to samples of high wind speeds. To quantify the alignment of this model with Equation (4), the parameters of the regression model were measured and are detailed in Table 2.
As wind speed increases, the R-Squared values generally increase, indicating an improvement in the model fit. The highest R-squared value is 0.9951 for wind speeds over 14 m/s, suggesting a strong fit in that range. The negative slope values suggest a decreasing trend in PSD as wind speed increases. The steepest decline is observed for wind speeds over 14 m/s, with a slope of −23.08 dB/decade.
To ensure the reliability of the chosen frequency range for categorizing wind speed, a comparison is made between the R-squared values of our regression model and Schwock’s work [5], which was conducted in deep water within the frequency range of 3–25 kHz, as depicted in Figure 9. It can be observed that the proposed algorithm provides better accuracy in fitting the regression model of the PSD for wind speeds exceeding 4 m/s when compared to our chosen frequency range. This emphasizes the superiority of the proposed algorithm for the categorization of wind speed in the 4–15 kHz range.
Drizzle
Raindrop size naturally influences splash and sound. Studies identified different acoustics by different raindrop sizes, with small drops (<0.8 mm) producing gentle splashes, (0.8–1.2 mm) generating loud sounds due to bubbles, and large drops (>3.5 mm) creating energetic splashes with a loud, low-frequency sound. [56]. These findings contribute to understanding the distinct underwater “sound of drizzle” between 13 and 25 kHz, associated with small raindrops across various rainfall types, as shown in Figure 3.
The automated classification for drizzle involves examining the following conditions to identify the peak PSD at 15 kHz ( PSD 15 kHz ).
If   R 2 < 0.95   and   PSD 15 kHz     PSD 10 kHz   >   α , PSD 15 kHz   >   PSD 20 kHz , then   drizzle   is   present .
Here, α d was determined to be 7 dB re 1 μ Pa 2 / Hz using the GA to minimize the loss function between the manual and automated algorithms.
Figure 10 illustrates the PSD as a function of frequency for three recorded cases in Cook Strait, as collected in 2016, each associated with different wind speeds. It can be noted that peaks at a frequency of 15 kHz are observed in all three cases, indicating the occurrence of drizzle during these specific timeframes. The classification has been validated through both manual and automated analyses. Furthermore, a discernible relationship between wind speed and drizzle is apparent. With increasing wind speed, the 15 kHz PSD peak is observed to have higher PSD values. This phenomenon is attributed to the intensified turbulence generated by stronger winds, which leads to increased agitation and the mixing of water masses. Consequently, there is greater acoustic energy at higher frequencies, such as 15 kHz.
Heavy Rain
To assess the presence of heavy rain, the boolean variable HR (heavy rain) is asserted as expressed by
HR = ( m < α 1 ) ( R r a i n 2 > α 2 ) ( β 8 15 kHz   <   α 3 ) ( m r a i n < α 4 ) ( m < α 5 × m r a i n )
m represents the linear regression slope on the PSD levels in the frequency range of 4 kHz to 15 kHz, while R rain 2 is the R-squared value for the linear regression on the PSD level in the frequency range of high rain (2 kHz to 6 kHz), and m r a i n is the slope of the linear regression in the same high rain frequency range as shown in Figure 3. Additionally, β 8 15 kHz represents the spectral kurtosis in the frequency range of 8 kHz to 15 kHz. The conditions for heavy rain involve comparisons between these parameters and optimized constants ( α 1 , α 2 , α 3 , α 4 , α 5 ) using logical operations, with the constants having been determined through optimization by the GA and assigned the unitless values of −19, 0.955, 2.57, −5.2, and 1.5, respectively.
Moderate Rain
To identify moderate rain, similar analyses are conducted as those for heavy rain, involving the frequency range of 4 kHz to 15 kHz and 2 kHz to 6 kHz, as well as specific frequencies of 6 kHz, 8 kHz, and 10 kHz. Specifically, the boolean variable MR indicates that moderate rain is detected and is expressed as
MR = ( R R a i n 2 > α 1 ) ( m r a i n > α 2 ) ( m r a i n < α 3 ) ( β 8 15 kHz   <   α 4 ) ( min ( PSD 6 kHz , PSD 8 kHz , PSD 10 kHz ) > α 5 ) ( max ( PSD 6 kHz , PSD 8 kHz , PSD 10 kHz ) < α 6 ) ,
where the parameters of the slope, R Rain 2 , m rain , β 8 15 kHz are defined the same as for Equation (6). The conditions for moderate rain involve specific comparisons between these parameters and the optimized constants ( α 1 , α 2 , α 3 , α 4 , α 5 , α 6 ), which are assigned the values of 0.95, −10, 0, 2.5, 52, and 70, respectively. Furthermore, PSD 6 kHz , PSD 8 kHz , and PSD 10 kHz represent the PSD levels at frequencies of 6 kHz, 8 kHz, and 10 kHz, respectively. The moderate rain condition is satisfied when all specified conditions are simultaneously met.
The rain classification from the Scotian Shelf dataset is presented in Figure 11. Samples for moderate rain consistently showed insufficient representation across all datasets. Despite this commonality, distinct patterns emerge among different rain classes. Subfigure (a) depicts the rain classification patterns based on the relationships between the PSD levels at 8 kHz and 15 kHz.
The analysis suggests that a two-linear regression model approach is effective in rain classification, demonstrating minimal outliers. Notably, the PSD level at 8 kHz is observed to be higher for heavy rain compared to light rain (drizzle). This distinction arises from the predominant occurrence of PSD levels at the frequency of 15 kHz during light rain conditions.
In Figure 11b, rain classification is achieved based on the slope of the regression line fitted to the PSD level for the frequency range of 8–15 kHz and the PSD level at 15 kHz. Interestingly, this classification also follows a two-horizontal regression model, indicating a powerful method for categorizing rain based on the mentioned features.
The validity of Figure 11 is established through verification by both manual and automated classification algorithms. Consequently, it is inferred that the slope of the regression of PSD levels for the frequency range of 8–15 kHz exhibits distinct characteristics for different rain intensities. Specifically, a slope higher than −10 dB/dec indicates drizzle, while a slope between −10 dB/dec and −20 dB/dec corresponds to moderate rain, and a slope lower than −20 dB/dec characterizes heavy rain. This analysis contributes valuable insights into the nuanced relationship between rain characteristics and underwater acoustics, paving the way for enhanced rain classification methodologies in underwater environments.

4.4.2. Biologically Generated Sound Sources

In this study, the underwater biological soundscape is categorized into three distinct sources: clicks and whistles produced by dolphins and the moan-type sounds emitted by fin whales and blue whales. In this section, we will examine these various types of underwater biological sound sources.
Click
Although click peak frequencies vary among dolphin species, many dolphin species produce clicks with peak energy in the 25–40 kHz range [57,58]. As such, this work confirms the detection of dolphin clicks (DCs) if the boolean variable DC is asserted. It is defined as
DC = PSD 30   kHz > PSD 20   kHz PSD 30   kHz > α 1 β 20 30   kHz > α 2 .
Suppose the PSD at 30 kHz ( PSD 30   kHz ) surpasses that at 20 kHz ( PSD 20   kHz ), and PSD 30   kHz is also higher than a specified threshold ( α 1 ), while also having the spectral kurtosis in the frequency range of 20 to 30 kHz ( β 20 30   kHz ) exceed the threshold ( α 2 ); then it is inferred that Dolphin clicks have been observed. In this work, α 1 and α 2 are optimized to be 48 dB and 2.06, respectively.
Whistle
The analysis of the whistle soundscape is performed within the frequency range of 8 to 15 kHz [59,60]. This is achieved by assessing if the boolean variable DW meets the following conditions:
DW = ( R W 2 < α 1 ) ( max PSD > α 2 ) ( β 8 15   kHz   >   α 3 ) ( m w > α 4 )
In the context of this analysis, R W 2 represents the R-squared value derived from the regression model for PSD levels within the frequency range of 5–15 kHz. The term m w refers to the slope of the regression model applied to the PSD level on this frequency range. Additionally, β 8 15   kHz signifies the spectral kurtosis of PSD within the frequency range of 8–15 kHz. The variable max PSD is defined as the maximum value among PSD 10   kHz , PSD 12   kHz , and PSD 15   kHz . These parameters collectively contribute to whistle detection. The optimized values for α 1 , α 2 , α 3 , and α 4 are 0.95, 46 dB, 2, and −12 dB/dec, respectively.
A comparative analysis was conducted to assess the effectiveness of both manual analysis and an automatic algorithm for whistle detection, as depicted in Figure 12, based on the manual analysis of the Gaspe (Site B) dataset. The x-axis represents the date and time when the user observed the whistle through PAMlab analysis. Static calculations of the millidecade recorded data determine the y-axis. Figure 12a illustrates the spectral kurtosis variations within the frequency range of 8–15 kHz during the observation of the whistle. This figure validates the condition of the whistle detection algorithm, where β 8 15   kHz > 2 . Additionally, Figure 12b displays the linear regression slope of PSD within the 5–15 kHz frequency range observed during the whistle. This figure demonstrates that the slope must be greater than −12 dB/Dec for the whistle detection algorithm, as shown in Equation (9). This helps distinguish whistle data from other sources.
Fin and Blue Whale
Fin whales worldwide produce songs whose dominant notes have a peak frequency in the 20–22 Hz range [61]. Blue whales show more variability in song features worldwide, with note peak frequencies ranging from 17 Hz to 65 Hz [62]. To identify fin and blue whales, it is necessary to exclude samples related to shipping activity that occurred at the same timestamp. Subsequently, the following set of equations is examined. The blue whale (BW) is observed if the condition in Equation (10) is met.
BW = ( bw 1 > α 1 ) ( bw 2 > α 2 ) ,
The presence of the fin whale (FW) is confirmed if the condition in Equation (11) is satisfied.
FW = ( fw > α 3 ) ( PSD ¯ 20   Hz > ( PSD ¯ 18   Hz + α 4 ) ( PSD ¯ 20   Hz > PSD ¯ 25   Hz ) ( PSD ¯ 20   Hz < α 5 ) ,
The intermediate variables bw 1 , bw 2 , and fw are calculated based on
fw = 3 × ( PSD 21   Hz + PSD 22   Hz ) ( PSD 11   Hz + PSD 12   Hz + PSD 13   Hz + PSD 33   Hz + PSD 34   Hz + PSD 35   Hz ) , bw 1 = 3 × ( PSD 16   Hz + PSD 17   Hz ) ( PSD 11   Hz + PSD 12   Hz + PSD 13   Hz + PSD 23   Hz + PSD 24   Hz + PSD 25   Hz ) , bw 2 = 3 × ( PSD 43   Hz + PSD 44   Hz ) ( PSD 36   Hz + PSD 37   Hz + PSD 38   Hz + PSD 49   Hz + PSD 50   Hz + PSD 51   Hz ) ,
where the symbol PSD ¯ i   Hz represents the PSD at frequencies around i. The logical OR operation is denoted by ∨. The parameters α 1 , α 2 , α 3 , α 4 , and α 5 are optimized thresholds with the following values: 2, 1.2, 1.04, 4, and 105 dB, respectively.

4.4.3. Anthropogenically Generated Sound Sources

This study classifies shipping activities into two types: heavy shipping and light shipping. As illustrated in Figure 3, heavy shipping exhibits a peak frequency within the range of 50 to 80 Hz, while light shipping is analyzed in the frequency range of 100 to 400 Hz. This section explains the methodologies employed to discern the type of shipping activity.
Shipping activities are categorized as “Heavy” or “Light” based on conditions involving average PSD in different frequency ranges and spectral kurtosis for the frequency range of 49 to 500 Hz. Satisfying (13) confirms the occurrence of shipping activities, whether heavy or light.
( PSD ¯ 30   Hz < α 1 ) ( β 50 500   >   α 2 ) ( β 50 500   <   α 3 ) ,
where the average PSD level for the frequency range of 25 to 33 Hz is denoted as PSD ¯ 30   Hz . The spectral kurtosis in the frequency range of 50 to 500 Hz is denoted as β 50 500 . Firstly, the mentioned conditions must be assessed for both “Heavy” and “Light”. If these conditions are satisfied, the identification of either heavy shipping or light shipping is further examined using the corresponding conditions. To determine the presence of heavy shipping, the boolean variable HS is defined as
HS = ( d 1 > α 4 ) ( d 1 < α 5 ) ( d 2 > α 6 ) ( d 2 < α 7 ) ( PSD ¯ HS > α 8 ) ( PSD ¯ HS > PSD ¯ LS ) .
Also, by taking into account the boolean variable LS for light shipping, the detection of light shipping activity occurs when LS is asserted. LS is defined as
LS = ( d 1 > α 4 ) ( d 1 < α 5 ) ( d 2 < α 6 ) ( d 2 > α 7 ) ( PSD ¯ LS > α 9 ) ( PSD ¯ LS > PSD ¯ HS ) ,
where parameters α 1 to α 9 were optimized to the best value by the GA as detailed in Table 3. The calculations for d 1 , d 2 , d 1 , and d 2 are outlined as
d 1 = PSD ¯ HS PSD ¯ 30   Hz ,
d 2 = PSD ¯ HS PSD ¯ 100   Hz ,
d 1 = PSD ¯ LS PSD ¯ 100   Hz ,
d 2 = PSD ¯ LS PSD ¯ 100 400   Hz .
where PSD ¯ HS is the mean PSD for heavy shipping activities within the frequency range of 49 to 79 Hz (optimized by the GA), and PSD ¯ LS is the mean PSD for light shipping activities within the frequency range of 360 to 448 Hz (optimized by GA). PSD ¯ 100   Hz and PSD ¯ 100 400   Hz present the average PSD level for the frequency ranges of 89 to 103 Hz and 107 to 408 Hz, respectively (both optimized by the GA).
To assess the accuracy of shipping activity detection, three ships were chosen for tracking using the AIS dataset. The PSDs when the ships passed around Gaspé Station B in 2019 were recorded, as shown in Figure 13b. Relevant AIS data for these ships is compiled in Table 4. The prominent peak observed in the frequency range of 49 to 79 Hz serves as a clear indicator of heavy shipping activity in the region, aligning with established patterns of maritime traffic. Interestingly, the identified peaks around 20 Hz offer compelling evidence for the presence of fin whales, which are known for emitting acoustic signals within this frequency range. This demonstrates that the algorithm can detect multiple sources within the one-minute spectrum. These distinctive spectral features not only confirm the heavy shipping dynamics but also highlight the coexistence of marine life, specifically fin whales, in the study area. This detailed frequency analysis enhances our understanding of the acoustic environment, enabling an assessment of both shipping and marine life dynamics in the vicinity of Gaspé Station B.

4.5. Classifier Performance

The performance evaluation of the proposed classification algorithm requires the computation of precision (positive predictive value), which is given by
Precision = n T P n T P + n F P ,
where n T P represents the number of true positives, n F P represents the number of false positives, and n F N represents the number of false negatives. Recall (sensitivity or the true predictive rate) is expressed as
Recall = n T P n T P + n F N ,
while the F γ score (the harmonic mean of precision and recall) is expressed as
F γ = ( 1 + γ 2 ) · Precision · Recall ( γ 2 · Precision ) + Recall
Note that γ is a parameter that controls the relative importance of precision and recall in the evaluation of a model’s performance. When γ = 1 , it is equivalent to the F1 score, giving equal weight to precision and recall. When γ < 1 , it emphasizes precision more; when γ > 1 , it emphasizes recall more. The Matthews Correlation Coefficient (MCC) is specifically designed to account for the four essential components of a confusion matrix and is calculated using
MCC = n T P · n T N n F P · n F N ( n T P + n F P ) ( n T P + n F N ) · 1 ( n T N + n F P ) ( n T N + n F N ) ,
where n T N represents the number of true negatives.
Table 5 provides a wide overview of the performance percentages achieved by our classification algorithm across various sound source types. The evaluation metrics include precision, recall, the F 1 score, the Matthews Correlation Coefficient (MCC), and the F 0.5 Score. Additionally, the column “Occurrence” displays the frequency of each source in the manual data collection.
The algorithm demonstrates good precision in identifying several sound sources. Notably, it excels in precision for “Fin whale” (100%), “Clicks” (93.75%), “Whistle” (92.38%), and “Flow noise” (91.30%). This indicates a high degree of accuracy in classifying instances as positive for these specific sound sources.
“Heavy rain” and “Light rain” achieve perfect recall (100%), implying that the algorithm successfully identifies all positive instances of classifying rain sounds.
“Clicks” exhibit a balanced performance with an F 1 score of 96.77%, showcasing a harmonious trade-off between precision and recall. This suggests that the algorithm maintains a good balance between minimizing false positives and false negatives for “Clicks.”
The Matthews Correlation Coefficient (MCC) values further emphasize the algorithm’s effectiveness. “Clicks” and “Light rain” stand out with MCC values of 96.26% and 89.47%, respectively. These values indicate a strong correlation between the algorithm’s predictions and the actual classifications, reinforcing its reliability.
The F 0.5 score, which places more emphasis on precision, highlights the algorithm’s proficiency across different sound sources. “Fin whale” and “Clicks” particularly excel in this metric, with scores of 94.59% and 94.93%, respectively.
In summary, Table 5 reflects a good overall performance of the proposed classification algorithm in sound source identification. The high precision, recall, and balanced F 1 scores across various sound sources indicate the algorithm’s versatility and effectiveness in diverse audio environments. These results underscore its potential for practical applications in sound classification scenarios. We acknowledge that performance metrics for certain classes are not reported in the table because of the low sample sizes for those classes in those datasets.

5. Quantifying the Wind Speed

While satellite-based measurements are expected to play a significant role in monitoring essential climate variables, certain factors necessitate in situ observations, such as their availability to aid in the interpretation and calibration of satellite data. The measurement of surface wind speed encounters challenges posed by destructive surface wave fields on buoys. An alternative method involves leveraging wind-generated ambient sound at ocean depth for more powerful observations. This concept was introduced by Vagle et al. [63]. Their quantitative algorithm utilizes the sound level at 8 kHz, and wind speed estimation, denoted as v w , v a g e l in m/s, is expressed as
v w , v a g l e = 10 SPL 8 20 + 104.3 53.91 ,
where SPL 8 denotes the sound level at 8 kHz. This relationship is valid within the range of 3.3 < v w , v a g l e < 15 m/s. Above 15 m/s, the influence of bubble clouds absorbing sound causes an underestimation of wind speed in the 8 kHz signal. Conversely, in wind conditions below 2 m/s, where wave breaking is absent, no acoustic signal is available for measuring wind speed. The most recently proposed quantitative wind speed relationship by Nystuen is calculated as a third-order polynomial in terms of SPL 8 , and the speed estimate v w , n y s t u e n in m/s is expressed as
v w , n y s t u e n = a 0 · ( SPL 8 ) 3 + a 1 · ( SPL 8 ) 2 + a 2 · SPL 8 + a 3 ,
where the coefficients were defined to be a 0 = 0.0005 , a 1 = 0.0310 , a 2 = 0.4904 , and a 3 = 2.0871 [4].
In this work, a relationship between wind speed and a third-order polynomial in terms of the PSD at 6 kHz is proposed, and the value also depends on the depth of the recorder. Specifically, the estimated wind speed v w , p r o p o s e d is expressed as
v w , p r o p o s e d = b 0 · ( PSD 6 ) 3 + b 1 · ( PSD 6 ) 2 + b 2 · PSD 6 + b 3 · h ,
where h represents the depth of the recorder. The coefficients b 0 , b 1 , b 2 , and b 3 are estimated to be 2.309178 × 10 5 , 5.852897 × 10 3 , 1.102547 × 10 1 , and 1.661 × 10 4 , respectively.
Figure 14 compares the PSD trends at 6 kHz using Equation (26) and at 8 kHz using Equation (25) with the corresponding actual wind speed. The red curves delineate the estimated wind speed derived from both the proposed algorithm (Equation (26)) and Nystuen’s algorithm (Equation (25)). As observed from Figure 14, the PSD at 6 kHz performs better in tracking wind speed trends than that at 8 kHz, which displays outliers, causing the estimated wind speed to deviate from the actual data. This observation validates that the proposed algorithm can outperform Nystuen’s algorithm in deep water conditions.
The proposed algorithm, incorporating a depth-dependent coefficient, introduces a level of flexibility for fitting, enhancing its performance. Additionally, the PSDs at 6 kHz capture more features of the wind speed, enhancing the algorithm’s performance, as illustrated in Figure 15. This figure shows a comparison between the 24 h averaged wind speed as estimated by the proposed algorithm and the actual 24 h averaged wind speed obtained from the ERA5 data spanning from February to August 2019 in the Blake Plateau. The comparison of root mean square (RMS) error among three wind speed estimation algorithms for deep water sites reveals distinct levels of accuracy. The proposed algorithm stands out with an RMS error of 1.1 m/s, indicating the smallest average discrepancy between predicted and observed wind speeds. In contrast, Vagle’s algorithm exhibits a higher error of 1.5 m/s, suggesting less precise estimations. Nystuen’s algorithm falls in between with an RMS error of 1.22 m/s, offering a moderate level of accuracy.

Model for the Ambient Sound Level Underwater from Wind (MASLUW)

To estimate wind speed from the PSD, it is necessary to remove PSD data influenced by specific soundscapes. PSDs predicted to contain heavy shipping, light shipping, light rain, heavy rain, moderate rain, whistles, sperm clicks, and whale clicks are excluded, as these soundscapes, particularly in the 6 to 8 kHz range, can interfere with accurate wind speed estimation. Consequently, an alternative method is employed, which does not rely on soundscape prediction. This approach estimates wind speed based on parameters such as water depth, sensor depth, sound speed profile, and sediment type, represented by grain size.
MASLUW [64] is a model for estimating ambient sound levels and spatial coherence using Harrison’s frequency domain formulas [65]. It generates time-series data through Fourier synthesis, incorporating surface and seabed reflection loss, ray-based propagation, and random time-series generation. The model maintains spatial coherence crucial for sonar array processing. MASLUW’s surface noise-level model uses Ainslie’s formulas [66] based on wind speed. It includes seabed reflection loss from Jensen et al. [67], ray-based propagation for accurate sound-level predictions, and seawater absorption models. The spatial coherence is computed using singular value decomposition (SVD), ensuring realistic time-series data for sonar performance modeling. The spectral density of the areic dipole source factor, K Wind , is calculated based on the frequency in kilohertz (F) [66]:
K Wind = 10 4.12 × v ^ APL 2.24 ( 1.5 + F 1.59 ) × 10 0.1 × δ μ Pa 2   Hz 1
δ = 0 if   Δ T < 1 0.26 × ( Δ T 1 ) 2 if   Δ T 1
where Δ T is the temperature difference in degrees Celsius:
Δ T = T ^ air T ^ water
v ^ APL is related to wind speed v 10 (wind speed at 10 m above the water surface in m/s) according to
v ^ APL = max ( v 10 , 1 )
The model compares predicted sound levels with ERA5 data, incorporating environmental factors such as water depth, sediment type, and sound speed profile. It is designed to accurately predict wind-related noise and spatial correlations, which is essential for improving sonar system performance.
First, the slopes of the averaged spectrum against log 10 ( Frequency ) between 4 and 15 kHz are calculated, along with the coefficient of determination ( R 2 ) for the data and the fitted line. The analysis proceeds only for hours where the R 2 value exceeds 0.95, indicating a strong linear fit. Additionally, the slope must fall between −13 and −19 dB/decade, allowing for a tolerance around the expected slope of −16 dB/decade. For the minutes that meet these criteria, the mean PSD at 6 kHz, 5 kHz, and 4 kHz is extracted for comparative analysis.
To generate the reference PSD data, MASLUW was executed for wind speeds ranging from 3 to 15 m/s in increments of 0.25 m/s, with ϕ values from −2 to 6 [68], also spaced at 0.25 increments.
The analysis utilized the average monthly sound speeds, sound speed profiles (SSPs), and sediment-type data extracted from the DECs 2 Beta tool for September and October 2019, as shown in Figure 16. The bathymetry subfigure indicates that the sediment in the area is predominantly sandy, with a grain size parameter of ϕ = 1.5 . These characteristics play a crucial role in understanding sound propagation in the region, as both sound speed and sediment type significantly influence acoustic interactions with the seafloor.
Figure 17 presents a comparison between the hourly averaged wind speed estimated by the MASLUW model and the actual hourly averaged wind speed recorded at Gaspe (Site B) during the period from September to October 2019. The estimated wind speed values from the MASLUW model demonstrate a high level of agreement with the actual measurements, with an overall error of 1.08 m/s. While the overall alignment between the MASLUW estimates and actual data is strong, slight discrepancies may be observed at higher wind speeds or during rapid fluctuations in atmospheric conditions. These deviations could be due to limitations in model resolution or external factors influencing wind patterns, which are not captured in the PSD data.

6. Conclusions

This study introduced an algorithm designed to identify common sound sources in underwater soundscapes, providing valuable insights into their presence within the marine environment. This algorithm employs easily computed metrics, specifically one-minute spectral kurtosis and power spectral density levels, offering a practical and efficient means of identifying and quantifying specific features in the local soundscape.
Aligned with the core mission of the Global Ocean Observation System (GOOS), the proposed algorithm offers an accessible means of quantifying the contributors to the marine soundscape, which will contribute to monitoring climate change and sustaining ocean health. The performance of the algorithm was evaluated through an automated soundscape classification system developed from extensive, long-term acoustic recordings across diverse depths and locations, employing sampling frequencies of at least 64 kHz. The algorithm employed an empirically tuned approach, initially detecting peaks in the one-minute spectrum indicative of various acoustic phenomena such as rainfall, drizzle, heavy and light shipping, and biological signals. The subsequent manual review of a sample of detections further refined the algorithm through the Genetic Optimization algorithm, enhancing its accuracy and reliability.
The classification results demonstrate the algorithm’s high accuracy, affirming its effectiveness in accurately categorizing underwater soundscapes. This study incorporated a range of performance metrics, including precision, recall, F1 scores, and the Matthews Correlation Coefficient (MCC), to evaluate the algorithm’s efficacy across different sound source types. The detailed analysis revealed noteworthy achievements in specific metrics for diverse sources, demonstrating the algorithm’s strong overall performance in sound source classification. Additionally, the wind speed estimation component utilized a cubic function that incorporated the PSD at 6 kHz and the recording depth, along with the MASLUW method, showing strong agreement with satellite data, particularly for wind speeds below 15 m/s.

Author Contributions

Conceptualization, methodology, software, validation and review and editing, M.M.H., B.M. and J.F.B.; writing—original draft preparation and formal analysis, M.M.H.; supervision, B.M. and J.F.B.; project administration, B.M.; funding acquisition, B.M.; validation and review and editing, J.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by Mitacs through the Mitacs Elevate program (Award Number: IT23432).

Data Availability Statement

The data used in this study are confidential and cannot be shared publicly due to the data-sharing policy of JASCO Applied Sciences Ltd.

Acknowledgments

The authors extend their sincere gratitude to Katie Kowarski, Allison Richardson, Wolfgang Friedrich, and Laird Bolt from JASCO for their invaluable guidance, which played a crucial role in advancing this project. We also appreciate Briand J. Gaudet for his modifications to ARK/PAMlab to support the manual analysis.

Conflicts of Interest

Authors Bruce Martin and Julien Delarue were employed by the company JASCO Applied Sciences Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Tyack, P. Essential Ocean Variables (EOV) for Either the GOOS Physics and Climate Panel or Biology and Ecosystems Panel. 2018. Available online: https://www.marineboard.eu/sites/marineboard.eu/files/public/New_website/Science%20webinars/tyack_ocean_sound_eov_implementation_plan_for_emb_20_mar24_1.pdf (accessed on 30 March 2025).
  2. Miksis-Olds, J.L.; Martin, B.; Tycka, F.L. Exploring the ocean through soundscapes. Acoust. Today 2018, 14, 26–34. [Google Scholar]
  3. Hiroyuki, H.; Pensieri, S.; Bozzano, R.; Nystuen, J.A.; Anagnostou, E.N.; Anagnostou, M.N.; Bechini, R. Underwater Acoustic Measurements to Estimate Wind and Rainfall in the Mediterranean Sea. Adv. Meteorol. 2015, 2015, 612512. [Google Scholar] [CrossRef]
  4. Nystuen, J.A.; Anagnostou, M.N.; Anagnostou, E.N.; Papadopoulos, A. Monitoring Greek Seas Using Passive Underwater Acoustics. J. Atmos. Ocean. Technol. 2015, 32, 334–349. [Google Scholar] [CrossRef]
  5. Schwock, F.; Abadi, S. Statistical analysis and modeling of underwater wind noise at the northeast pacific continental margin. J. Acoust. Soc. Am. 2021, 150, 4166–4177. [Google Scholar] [CrossRef] [PubMed]
  6. Schwock, F.; Abadi, S. Characterizing underwater noise during rain at the northeast Pacific continental margin. J. Acoust. Soc. Am. 2021, 149, 4579–4595. [Google Scholar] [CrossRef]
  7. Nystuen, J.A.; Moore, S.E.; Stabeno, P.J. A sound budget for the southeastern Bering Sea: Measuring wind, rainfall, shipping, and other sources of underwater sound. J. Acoust. Soc. Am. 2010, 128, 58–65. [Google Scholar] [CrossRef]
  8. Lalande, C.; Grebmeier, J.M.; Wassmann, P.; Cooper, L.W.; Flint, M.V.; Sergeeva, V.M. Export fluxes of biogenic matter in the presence and absence of seasonal sea ice cover in the Chukchi Sea. Cont. Shelf Res. 2007, 27, 2051–2065. [Google Scholar] [CrossRef]
  9. Cato, J.C.; Brown, C.L. Marine Ornamental Species: Collection, Culture and Conservation; John Wiley & Sons: Hoboken, NJ, USA, 2008. [Google Scholar]
  10. Hotkani, M.M.; Seyedin, S.A.; Bousquet, J. Estimation of the Bistatic Echolocation from Underwater Target Using Ship Noise based on Normal-Mode Model. Signal Process. Renew. Energy 2021, 5, 1–14. [Google Scholar]
  11. Hotkani, M.M.; Seyedin, S.A. Ship Tracking Utilizing Propeller Noise with a Compact Hydro-phone Array. Signal Process. Renew. Energy 2023, 7, 59–67. [Google Scholar]
  12. Southall, B.L.; Donovan, G.P.; Racca, R.; Reeves, R.R.; Vedenev, A.I.; Weller, D.W.; Nowacek, D.P. Data collection and analysis methods to evaluate potential impacts of seismic surveys and other marine industrial activities on baleen whales. Ocean Coast. Manag. 2023, 245, 106799. [Google Scholar] [CrossRef]
  13. Arranz, G.J. Acoustic Impacts of Seismic Surveys in Complex Underwater Environments; University of Bath: Bath, UK, 2023. [Google Scholar]
  14. Hawkins, A.D.; Popper, A.N. Effects of man-made sound on fishes. In Effects of Anthropogenic Noise on Animals; Springer: New York, NY, USA, 2018; pp. 145–177. [Google Scholar]
  15. Erbe, C.; Reichmuth, C.; Cunningham, K.; Lucke, K.; Dooling, R. Communication masking in marine mammals: A review and research strategy. Mar. Pollut. Bull. 2016, 103, 15–38. [Google Scholar] [CrossRef] [PubMed]
  16. Ehrhardt, M.; Pereselkov, S.; Kuz’kin, V.; Kaznacheev, I.; Rybyanets, P. Experimental observation and theoretical analysis of the low-frequency source interferogram and hologram in shallow water. J. Sound Vib. 2023, 544, 117388. [Google Scholar] [CrossRef]
  17. Bertucci, F.; Parmentier, E.; Lecellier, G.; Hawkins, A.D.; Lecchini, D. Acoustic indices provide information on the status of coral reefs: An example from Moorea Island in the South Pacific. Sci. Rep. 2016, 6, 33326. [Google Scholar] [CrossRef] [PubMed]
  18. Puebla-Aparicio, M.; Ascencio-Elizondo, C.; Vieira, M.; Amorim, M.C.P.; Duarte, R.; Fonseca, P.J. Characterization of the fish acoustic communities in a Mozambican tropical coral reef. Mar. Ecol. Prog. Ser. 2024, 727, 143–158. [Google Scholar] [CrossRef]
  19. Lin, T.-H.; Sinniger, F.; Harii, S.; Akamatsu, T. Using Soundscapes to Assess Changes in Coral Reef Social-Ecological Systems. Oceanography 2023, 36, 20–27. [Google Scholar] [CrossRef]
  20. Cominelli, S.; Bellin, N.; Brown, C.D.; Rossi, V.; Lawson, J. Acoustic features as a tool to visualize and explore marine soundscapes: Applications illustrated using marine mammal passive acoustic monitoring datasets. Ecol. Evol. 2024, 14, e10951. [Google Scholar] [CrossRef]
  21. Leone, M.T.; Warren, J.D. Quantifying Variability and Sources of Anthropogenic Noise in Coastal Marine and Terrestrial Soundscapes of Long Island, New York Following the Covid Pandemic. Available online: https://ssrn.com/abstract=4696232 (accessed on 30 March 2025).
  22. Tyack, P.L.; Miksis-Olds-Olds, J.; Urban, E.R., Jr.; Ausubel, J. Measuring ambient ocean sound during the COVID-19 pandemic. Eos Trans. Am. Geophys. Union 2021, 102, 155447. [Google Scholar] [CrossRef]
  23. Lecocq, T.; Hicks, S.P.; Van Noten, K.; Van Wijk, K.; Koelemeijer, P.; De Plaen, R.S.M.; Massin, F.; Hillers, G.; Anthony, R.E.; Apoloner, M.-T.; et al. Global quieting of high-frequency seismic noise due to COVID-19 pandemic lockdown measures. Science 2020, 369, 1338–1343. [Google Scholar] [CrossRef]
  24. Roy, K.S.; Sharma, J.; Kumar, S.; Kumar, M.R. Effect of coronavirus lockdowns on the ambient seismic noise levels in Gujarat, northwest India. Sci. Rep. 2021, 11, 7148. [Google Scholar] [CrossRef]
  25. Thomson, D.J.M.; Barclay, D.R. Real-time observations of the impact of COVID-19 on underwater noise. J. Acoust. Soc. Am. 2020, 147, 3390–3396. [Google Scholar] [CrossRef]
  26. Afonso, A.S. Humans influence shark behavior: Evidence from the COVID-19 lockdown. Ocean Coast. Manag. 2024, 248, 106965. [Google Scholar] [CrossRef]
  27. Butts, A.E.; Primack, R.B.; Morgan, P.; Diario, C.M. COVID-19 pandemic and associated lockdown as a Global Human Confinement Experiment to investigate biodiversity conservation. Biol. Conserv. 2020, 248, 108665. [Google Scholar] [CrossRef] [PubMed]
  28. Tyack, P.L.; Akamatsu, T.; Boebel, O.; Chapuis, L.; Debusschere, E.; Jong, C.D.; Erbe, C.; Evans, K.; Gedamke, J.; Gridley, T.; et al. Ocean Sound Essential Ocean Variable Implementation Plan; International Quiet Ocean Experiment, Scientific Committee on Oceanic Research: Newark, DE, USA, 2023. [Google Scholar]
  29. Hatch, L.; Gedamke, J.; Dekeling, R.; Porter, M.; Erbe, C.; Tyack, P.; Frisk, G.; Williams, R.; Ainslie, M.; Donovan, G. Joint Workshop Report: Predicting Sound Fields—Global Soundscape Modeling to Inform Management of Cetaceans and Anthropogenic Noise; Joint Workshop Sponsored by the IWC, IQOE, US NOAA, ONRG, and Netherlands TNO; Ministry of Infrastructure and the Environment: Amsterdam, The Netherlands, 2014.
  30. Ainslie, M.; de Jong, C.; Miksis-Olds, J. IQOE Workshop Report: Guidelines for Observation of Ocean Sound, 13 July 2019, Park Hotel, Den Haag, Netherlands; International Quiet Ocean Experiment (IQOE): Newark, DE, USA, 2019. [Google Scholar]
  31. Kastelein, R.A.; Helder-Hoek, L.; Van de Voorde, S.; von Benda-Beckmann, A.M.; Lam, F.-P.A.; Jansen, E.; de Jong, C.A.F.; Ainslie, M.A. Temporary hearing threshold shift in a harbor porpoise (Phocoena phocoena) after exposure to multiple airgun sounds. J. Acoust. Soc. Am. 2017, 142, 2430–2442. [Google Scholar] [CrossRef] [PubMed]
  32. Wang, L.; Robinson, S. JOMOPANS Standard: Terminology for Ambient Ocean Noise Monitoring; Joint Monitoring Programme for Ambient Noise North Sea (JOMOPANS): Copenhagen, Denmark, 2020. [Google Scholar]
  33. ISO 7605:2023; Underwater Acoustics—Measurement of Underwater Ambient Sound. International Organization for Standardization (ISO): Geneva, Switzerland, 2023. Available online: https://www.iso.org/standard/82844.html#lifecycle (accessed on 30 March 2025).
  34. Martin, B.S.; Gaudet, B.J.; Klinck, H.; Dugan, P.J.; Miksis-Olds, J.L.; Mellinger, D.K.; Mann, D.A.; Boebel, O.; Wilson, C.C.; Ponirakis, D.W.; et al. Erratum: Hybrid millidecade spectra: A practical format for exchange of long-term ambient sound data. JASA Express Lett. 2021, 1, 081201. [Google Scholar] [CrossRef] [PubMed]
  35. Minello, M.; Calado, L.; Xavier, F.C. Ecoacoustic indices in marine ecosystems: A review on recent developments, challenges, and future directions. ICES J. Mar. Sci. 2021, 78, 3066–3074. [Google Scholar] [CrossRef]
  36. Roca, I.T.; Van Opzeeland, I. Using acoustic metrics to characterize underwater acoustic biodiversity in the Southern Ocean. Remote Sens. Ecol. Conserv. 2020, 6, 262–273. [Google Scholar] [CrossRef]
  37. Towsey, M.; Zhang, L.; Cottman-Fields, M.; Wimmer, J.; Zhang, J.; Roe, P. Visualization of Long-duration Acoustic Recordings of the Environment. Procedia Comput. Sci. 2014, 29, 703–712. [Google Scholar] [CrossRef]
  38. Wilford, D.C.; Miksis-Olds, J.L.; Martin, S.B. Multidimensional comparison of underwater soundscapes using the soundscape code. J. Acoust. Soc. Am. 2023, 154, 3438–3453. [Google Scholar] [CrossRef]
  39. Marques, T.A.; Thomas, L.; Martin, S.W.; Mellinger, D.K.; Ward, J.A.; Moretti, D.J.; Harris, D.; Tyack, P.L. Estimating animal population density using passive acoustics. Biol. Rev. 2013, 88, 287–309. [Google Scholar] [CrossRef]
  40. White, E.L.; White, P.R.; Bull, J.M.; Risch, D.; Beck, S.; Edwards, E.W.J. More than a whistle: Automated detection of marine sound sources with a convolutional neural network. Front. Mar. Sci. 2022, 9, 879145. [Google Scholar] [CrossRef]
  41. Duc, P.N.H. Development of Artificial Intelligence Methods for Marine Mammal Detection and Classification of Underwater Sounds in a Weak Supervision (But) Big Data-Expert Context. Ph.D. Thesis, Sorbonne Université, Paris, France, 2020. [Google Scholar]
  42. Lima, E.; Sun, X.; Dong, J.; Wang, H.; Yang, Y.; Liu, L. Learning and transferring convolutional neural network knowledge to ocean front recognition. IEEE Geosci. Remote Sens. Lett. 2017, 14, 354–358. [Google Scholar] [CrossRef]
  43. Laun, A.; Pittman, E. Development of a small, low-cost, networked buoy for persistent ocean monitoring and data acquisition. In Proceedings of the OCEANS 2018 MTS/IEEE Charleston, Charleston, SC, USA, 22–25 October 2018; pp. 1–6. [Google Scholar]
  44. Kowarski, K.A.; Delarue, J.J.-Y.; Gaudet, B.J.; Bruce Martin, S. Automatic data selection for validation: A method to determine cetacean occurrence in large acoustic data sets. JASA Express Lett. 2021, 1, 051201. [Google Scholar] [CrossRef]
  45. Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Schepers, D.; et al. The ERA5 global reanalysis. Q. J. R. Meteorol. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
  46. Copernicus Climate Change Service (C3S). Copernicus Climate Change Service; European Centre for Medium-Range Weather Forecasts (ECMWF): Reading, UK, 2020; Available online: https://climate.copernicus.eu/ (accessed on 30 March 2025).
  47. Gehrmann, R.A.S.; Barclay, D.R.; Johnson, H.; Shajahan, N.; Nolet, V.; Davies, K.T.A. Ambient noise levels with depth from an underwater glider survey across shipping lanes in the Gulf of St. Lawrence, Canada. J. Acoust. Soc. Am. 2023, 154, 1735–1745. [Google Scholar] [CrossRef]
  48. Ainslie, M.A.; McColm, J.G. A simplified formula for viscous and chemical absorption in sea water. J. Acoust. Soc. Am. 1998, 103, 1671–1672. [Google Scholar] [CrossRef]
  49. Ruppert, D. What is kurtosis? An influence function approach. Am. Stat. 1987, 41, 1–5. [Google Scholar] [PubMed]
  50. Tao, P.; Sun, Z.; Sun, Z. An improved intrusion detection algorithm based on GA and SVM. IEEE Access 2018, 6, 13624–13631. [Google Scholar] [CrossRef]
  51. McCall, J. Genetic algorithms for modelling and optimisation. J. Comput. Appl. Math. 2005, 184, 205–222. [Google Scholar] [CrossRef]
  52. Fernandes, R.P.; Apolinário, J.A., Jr. Underwater target classification with optimized feature selection based on Genetic Algorithms. In Proceedings of the Simpósio Brasileiro de Telecomunicações e Processamento de Sinais, Florianópolis, Brazil, 13–16 September 2020. [Google Scholar]
  53. Reeves, C.; Rowe, J.E. Genetic Algorithms: Principles and Perspectives: A Guide to GA Theory; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2002. [Google Scholar]
  54. Wenz, G.M. Acoustic ambient noise in the ocean: Spectra and sources. J. Acoust. Soc. Am. 1962, 34, 1936–1956. [Google Scholar] [CrossRef]
  55. Strasberg, M. Nonacoustic noise interference in measurements of infrasonic ambient noise. J. Acoust. Soc. Am. 1979, 66, 1487–1493. [Google Scholar] [CrossRef]
  56. Ma, B.B.; Nystuen, J.A. Passive acoustic detection and measurement of rainfall at sea. J. Atmos. Ocean. Technol. 2005, 22, 1225–1248. [Google Scholar] [CrossRef]
  57. Cohen, R.E.; Frasier, K.E.; Baumann-Pickering, S.; Wiggins, S.M.; Rafter, M.A.; Baggett, L.M.; Hildebrand, J.A. Identification of western North Atlantic odontocete echolocation click types using machine learning and spatiotemporal correlates. PLoS ONE 2022, 17, e0264988. [Google Scholar] [CrossRef] [PubMed]
  58. Frasier, K.E.; Roch, M.A.; Soldevilla, M.S.; Wiggins, S.M.; Garrison, L.P.; Hildebrand, J.A. Automated classification of dolphin echolocation click types from the Gulf of Mexico. PLoS Comput. Biol. 2017, 13, e1005823. [Google Scholar] [CrossRef]
  59. Oswald, J.N.; Rankin, S.; Barlow, J.; Lammers, M.O. A tool for real-time acoustic species identification of delphinid whistles. J. Acoust. Soc. Am. 2007, 122, 587–595. [Google Scholar] [CrossRef]
  60. Rendell, L.E.; Gordon, J.C.D. Vocal response of long-finned pilot whales (Globicephala melas) to military sonar in the Ligurian Sea. Mar. Mammal Sci. 1999, 15, 198–204. [Google Scholar] [CrossRef]
  61. Weirathmueller, M.J.; Stafford, K.M.; Wilcock, W.S.D.; Hilmo, R.S.; Dziak, R.P.; Tréhu, A.M. Spatial and temporal trends in fin whale vocalizations recorded in the NE Pacific Ocean between 2003–2013. PLoS ONE 2017, 12, e0186127. [Google Scholar] [CrossRef]
  62. McDonald, M.A.; Hildebrand, J.A.; Mesnick, S. Worldwide decline in tonal frequencies of blue whale songs. Endanger. Species Res. 2009, 9, 13–21. [Google Scholar] [CrossRef]
  63. Vagle, S.; Large, W.G.; Farmer, D.M. An evaluation of the WOTAN technique of inferring oceanic winds from underwater ambient sound. J. Atmos. Ocean. Technol. 1990, 7, 576–595. [Google Scholar] [CrossRef]
  64. Martin, B.; Siderius, M.; Mirzaei Hotkani, M. Using the Model for Ambient Sound Levels Underwater from Wind (MUSLAW) to Estimate Wind Speeds. In Proceedings of the OCEANS 2024-Halifax, Halifax, NS, Canada, 23–26 September 2024; pp. 1–7. [Google Scholar]
  65. Harrison, C.H. Formulas for ambient noise level and coherence. J. Acoust. Soc. Am. 1996, 99, 2055–2066. [Google Scholar] [CrossRef]
  66. Ainslie, M.A. Principles of Sonar Performance Modeling; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
  67. Jensen, F.B.; Kuperman, W.A.; Porter, M.B.; Schmidt, H.; Tolstoy, A. Computational Ocean Acoustics; Springer: New York, NY, USA, 2011. [Google Scholar]
  68. Heaney, K.D.; Ainslie, M.; Murray, J.J.; Heaney, A.J.; Miksis-Olds, J.; Martin, B. Regional soundscape modeling of the Atlantic Outer Continental Shelf. J. Acoust. Soc. Am. 2024, 156, 378–390. [Google Scholar] [CrossRef]
Figure 1. Locations of the AMAR passive acoustic observatories.
Figure 1. Locations of the AMAR passive acoustic observatories.
Acoustics 07 00044 g001
Figure 2. Diagram of millidecade PSD data analysis.
Figure 2. Diagram of millidecade PSD data analysis.
Acoustics 07 00044 g002
Figure 3. Power spectral densities corresponding to various underwater soundscapes collected from different recording sites, validated by both manual and automated analyses.
Figure 3. Power spectral densities corresponding to various underwater soundscapes collected from different recording sites, validated by both manual and automated analyses.
Acoustics 07 00044 g003
Figure 4. Spectrogram extracted from a recorded file at the Scotian Shelf station on 19 November 2016 at 07:38:56 (UTC) that underwent processing in PAMLab. The processing involved applying specific settings, including a sampling frequency of 250 kHz, a frame length of 0.1 s, a time step of 0.01 s, a frequency step of 1 Hz, and a file duration of 60 s, with the use of a Hanning window. Boxes (ad) represent the presence of different soundscapes, while Box (e) depicts a capture of the spectrum tool in PAMlab.
Figure 4. Spectrogram extracted from a recorded file at the Scotian Shelf station on 19 November 2016 at 07:38:56 (UTC) that underwent processing in PAMLab. The processing involved applying specific settings, including a sampling frequency of 250 kHz, a frame length of 0.1 s, a time step of 0.01 s, a frequency step of 1 Hz, and a file duration of 60 s, with the use of a Hanning window. Boxes (ad) represent the presence of different soundscapes, while Box (e) depicts a capture of the spectrum tool in PAMlab.
Acoustics 07 00044 g004
Figure 5. Spectrogram of a recorded file from the Scotian Shelf station on 19 November 2016 at 07:38:56 (UTC), processed in PAMLab with distinct settings: 1 s frame length, 0.1 s time step, and a frequency range of 0 to 100 Hz. Box (a) depicts the fin whale soundscape. Box (b) shows the flow noise soundscape.
Figure 5. Spectrogram of a recorded file from the Scotian Shelf station on 19 November 2016 at 07:38:56 (UTC), processed in PAMLab with distinct settings: 1 s frame length, 0.1 s time step, and a frequency range of 0 to 100 Hz. Box (a) depicts the fin whale soundscape. Box (b) shows the flow noise soundscape.
Acoustics 07 00044 g005
Figure 6. Proposed acoustic classifier.
Figure 6. Proposed acoustic classifier.
Acoustics 07 00044 g006
Figure 7. One-minute PSD corresponding to various flow velocities from Grand Passage sites for flow noise analysis. The blue curve represents data recorded on 12 November 2020 at 17:36:00 (UTC). The red curve corresponds to 18 October 2020 at 09:11:00 (UTC), and the purple curve corresponds to 9 October 2020 at 09:26:00 (UTC).
Figure 7. One-minute PSD corresponding to various flow velocities from Grand Passage sites for flow noise analysis. The blue curve represents data recorded on 12 November 2020 at 17:36:00 (UTC). The red curve corresponds to 18 October 2020 at 09:11:00 (UTC), and the purple curve corresponds to 9 October 2020 at 09:26:00 (UTC).
Acoustics 07 00044 g007
Figure 8. Average power spectral density (PSD) estimates for different wind speed categories based on data collected from the Blake Plateau site (color online). The wind speed categories include 25,777 1 min PSD samples. The dashed lines represent the result of linear regression applied to average PSDs within the 4–15 kHz frequency range. Details regarding the slopes and scores of the linear regression lines can be found in Table 2. Shaded areas surrounding the average PSDs depict one-quarter of the standard deviation.
Figure 8. Average power spectral density (PSD) estimates for different wind speed categories based on data collected from the Blake Plateau site (color online). The wind speed categories include 25,777 1 min PSD samples. The dashed lines represent the result of linear regression applied to average PSDs within the 4–15 kHz frequency range. Details regarding the slopes and scores of the linear regression lines can be found in Table 2. Shaded areas surrounding the average PSDs depict one-quarter of the standard deviation.
Acoustics 07 00044 g008
Figure 9. Comparison of R-squared values between the regression model for wind PSD levels in deep water developed in this study for the frequency range of 4–15 kHz and Schwock’s work, which spans the frequency range of 3–25 kHz in deep water.
Figure 9. Comparison of R-squared values between the regression model for wind PSD levels in deep water developed in this study for the frequency range of 4–15 kHz and Schwock’s work, which spans the frequency range of 3–25 kHz in deep water.
Acoustics 07 00044 g009
Figure 10. One-minute PSD corresponding to various wind speeds from Cook Strait sites for drizzle analysis. The blue curve represents data recorded on 27 June 2016 at 8:40:55 (UTC). The red curve corresponds to 7 December 2016 at 19:40:55 (UTC), and the orange curve corresponds to 24 October 2016 at 23:10:55 (UTC).
Figure 10. One-minute PSD corresponding to various wind speeds from Cook Strait sites for drizzle analysis. The blue curve represents data recorded on 27 June 2016 at 8:40:55 (UTC). The red curve corresponds to 7 December 2016 at 19:40:55 (UTC), and the orange curve corresponds to 24 October 2016 at 23:10:55 (UTC).
Acoustics 07 00044 g010
Figure 11. Scatter plots illustrating the relationship between PSD levels at different frequencies and slope variations under varying rain classes (heavy, moderate, and light rain). (a) PSD at 15 kHz vs. PSD at 8 kHz. (b) Slope between 8–15 kHz vs. PSD at 8 kHz.
Figure 11. Scatter plots illustrating the relationship between PSD levels at different frequencies and slope variations under varying rain classes (heavy, moderate, and light rain). (a) PSD at 15 kHz vs. PSD at 8 kHz. (b) Slope between 8–15 kHz vs. PSD at 8 kHz.
Acoustics 07 00044 g011aAcoustics 07 00044 g011b
Figure 12. Scatter plots illustrating the varying slope regression of the PSD level within the 5–15 kHz range and spectral kurtosis within the 8–15 kHz frequency range over time. These visualizations are derived from manual analysis, pinpointing the time and date of whistle observations in the Gaspe (Site B) dataset. (a) Spectral kurtosis. (b) Regression slope.
Figure 12. Scatter plots illustrating the varying slope regression of the PSD level within the 5–15 kHz range and spectral kurtosis within the 8–15 kHz frequency range over time. These visualizations are derived from manual analysis, pinpointing the time and date of whistle observations in the Gaspe (Site B) dataset. (a) Spectral kurtosis. (b) Regression slope.
Acoustics 07 00044 g012aAcoustics 07 00044 g012b
Figure 13. Ship routes and PSDs around Gaspé Station B in 2019. (a) Routes that specified ships have passed. (b) PSDs after three ships have passed. (a) depicts the trajectories of three ships on the map during specified frame times, offering a visual representation of their paths near the recording station in the vicinity of Gaspé based on the AIS dataset. (b) depicts the PSDs corresponding to the specified frame times. The PSDs are presented with averages and standard variations, offering insights into the frequency characteristics of the ship signals captured by the recorder.
Figure 13. Ship routes and PSDs around Gaspé Station B in 2019. (a) Routes that specified ships have passed. (b) PSDs after three ships have passed. (a) depicts the trajectories of three ships on the map during specified frame times, offering a visual representation of their paths near the recording station in the vicinity of Gaspé based on the AIS dataset. (b) depicts the PSDs corresponding to the specified frame times. The PSDs are presented with averages and standard variations, offering insights into the frequency characteristics of the ship signals captured by the recorder.
Acoustics 07 00044 g013
Figure 14. Distribution of wind speed measurement as a function of the energy level for the whole dataset recorded over 50,581 min at depths exceeding 500 m in the Blake Plateau site. (a) As a function of the PSD at 6 kHz. (b) As a function of the PSD at 8 kHz. In (a), the depicted results utilize the proposed wind speed estimator, while (b) reflects outcomes obtained through Nystuen’s algorithm.
Figure 14. Distribution of wind speed measurement as a function of the energy level for the whole dataset recorded over 50,581 min at depths exceeding 500 m in the Blake Plateau site. (a) As a function of the PSD at 6 kHz. (b) As a function of the PSD at 8 kHz. In (a), the depicted results utilize the proposed wind speed estimator, while (b) reflects outcomes obtained through Nystuen’s algorithm.
Acoustics 07 00044 g014
Figure 15. Comparison of the 24 h averaged wind speed estimated by the proposed algorithm with the actual 24 h averaged wind speed. The analysis is grounded in recorded data from February to August 2019 in the Blake Plateau.
Figure 15. Comparison of the 24 h averaged wind speed estimated by the proposed algorithm with the actual 24 h averaged wind speed. The analysis is grounded in recorded data from February to August 2019 in the Blake Plateau.
Acoustics 07 00044 g015
Figure 16. Data extracted from the DECs 2 Beta tool for Gaspe Station B was used to estimate wind speed using the MASLUW model. (a) Sound speed profiles for different months. (b) Average, minimum, and maximum wind speeds. (c) Bathymetry data of the desired area (indicated by the red dot) (d) Sediment analysis of the area.
Figure 16. Data extracted from the DECs 2 Beta tool for Gaspe Station B was used to estimate wind speed using the MASLUW model. (a) Sound speed profiles for different months. (b) Average, minimum, and maximum wind speeds. (c) Bathymetry data of the desired area (indicated by the red dot) (d) Sediment analysis of the area.
Acoustics 07 00044 g016
Figure 17. Comparison of the hourly averaged wind speed estimated by MASLUW with the actual hourly averaged wind speed. The analysis is grounded in recorded data from September to October 2019 in Gaspe (Site B) with an error of 1.08 m/s.
Figure 17. Comparison of the hourly averaged wind speed estimated by MASLUW with the actual hourly averaged wind speed. The analysis is grounded in recorded data from September to October 2019 in Gaspe (Site B) with an error of 1.08 m/s.
Acoustics 07 00044 g017
Table 1. Deployment information for various oceanic stations.
Table 1. Deployment information for various oceanic stations.
Location Name of the StationDuration of DeploymentDepth (m)Sampling Rate (Hz)Number of MinutesLatitudeLongitude
Prince Rupert22 March 2013 to 25 September 20138064,00062,75554.2087°−130.34087°
Blake Plateau6 November 2018 to 16 November 2019900512,00054,02029.2506°−78.35106667°
Chukchi Sea *5 August 2014 to 14 October 201450375,00013,400 71 . 33516667 ° 163 . 1243667 °
Scotian Shelf *17 July 2016 to 19 July 20171547250,00026,422 44 . 19273333 ° 53 . 27476667 °
Gippsland Basin26 March 2021 to 13 April 202175128,00025,370 38 . 326275 ° 147 . 7111 °
Scotian Shelf (FCH)8 October 2019 to 2 September 20201452256,000190,146 42 . 041 ° 65 . 3622 °
Ireland Trough *19 March 2016 to 9 August 20161750250,00028,540 49 . 54771 ° 13 . 37298 °
Gowgaia Shelf *25 July 2019 to 25 July 2020744250,00035,130 52 . 39333333 ° −131.7137°
Grand Passage14 October 2020 to 12 November 202043512,0008157 44 . 269182 ° 66 . 339397 °
North Sea21 October 2020 to 24 January 202110064,000136,861 57 . 530018 ° 1 . 1263334 °
Cook Strait (Station 4) *28 April 2016 to 20 December 20161022250,00037,680 41 . 61233 ° 174 . 7353 °
Cook Strait (Station 6) *28 April 2016 to 21 December 20161486250,00038,154 41 . 805017 ° 175 . 081 °
North Baffin (PondInlet)8 August 2021 to 14 September 202113164,00050,386 72 . 70773 ° 77 . 98278 °
Gaspé (Site A)4 September 2019 to 30 October 2019335128,00080,530 49 . 1042 ° 64 . 28408333 °
Gaspé (Site B)4 September 2019 to 30 October 2019384128,00080,540 49 . 25856667 ° 64 . 1588 °
North Baffin (Ragged Island)2 August 2021 to 13 September 2021130512,0004047 72 . 46352 ° 80 . 07005 °
Sept-Îles26 March 2021 to 13 April 20218532,000102,522 50 . 03606 ° 66 . 36876 °
Haro Strait (Southbound)26 March 2021 to 13 April 2021250128,0008640 48 . 51811667 ° 123 . 1915833 °
Haro Strait (Northbound)26 March 2021 to 13 April 2021205128,0008640 48 . 51646667 ° 123 . 2076667 °
The data indicated by * were not used for estimating the wind speed.
Table 2. Results of the linear regression model for the frequency range of 4–15 kHz in deep water at the Blake Plateau site.
Table 2. Results of the linear regression model for the frequency range of 4–15 kHz in deep water at the Blake Plateau site.
Wind Speed (m/s)0–22–44–66–88–1010–1212–14Over 14
R-Squared0.89040.94770.97750.98490.98670.98880.99030.9951
Slope (dB/decade)−7.632−10.08−13.73−15.82−16.73−17.72−18.51−23.08
Table 3. Optimized parameter values by the GA.
Table 3. Optimized parameter values by the GA.
Parameters α 1 α 2 α 3 α 4 α 5 α 6 α 7 α 8 α 9
Values1032.814.5−4.512.6−1.914.57369
Table 4. AIS information of three ships.
Table 4. AIS information of three ships.
NumberDurationClass
114 September 2019 17:03:31 to 6 October 2019 21:39:09Cargo
211 October 2019 08:48:48 to 11 October 2019 19:12:38Passenger
318 October 2019 11:23:07 to 18 October 2019 18:22:41Tanker
Table 5. Classification results in % for the proposed algorithm.
Table 5. Classification results in % for the proposed algorithm.
Source TypePrecisionRecall F 1 MCC F 0.5 Occurrence
Flow noise91.3079.2484.8471.3788.60510
Heavy rain72.0910083.7877.4076.35112
Light rain81.811009089.4784.90150
Fin whale10077.7878.977.594.5930
Whistle92.3870.588077.5786.95173
Clicks93.7599.896.7796.2694.93112
Shipping activity90.0378.7383.0469.8387.551048
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mirzaei Hotkani, M.; Martin, B.; Bousquet, J.F.; Delarue, J. Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification. Acoustics 2025, 7, 44. https://doi.org/10.3390/acoustics7030044

AMA Style

Mirzaei Hotkani M, Martin B, Bousquet JF, Delarue J. Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification. Acoustics. 2025; 7(3):44. https://doi.org/10.3390/acoustics7030044

Chicago/Turabian Style

Mirzaei Hotkani, Mojgan, Bruce Martin, Jean Francois Bousquet, and Julien Delarue. 2025. "Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification" Acoustics 7, no. 3: 44. https://doi.org/10.3390/acoustics7030044

APA Style

Mirzaei Hotkani, M., Martin, B., Bousquet, J. F., & Delarue, J. (2025). Real-Time Analysis of Millidecade Spectra for Ocean Sound Identification and Wind Speed Quantification. Acoustics, 7(3), 44. https://doi.org/10.3390/acoustics7030044

Article Metrics

Back to TopTop