1. Introduction
With the rapid expansion of global shipping trade, ocean noise levels continue to rise annually [
1,
2,
3,
4,
5]. Substantial research indicates that underwater radiated noise poses a significant threat to marine life, particularly marine mammals [
6,
7,
8]. It can interfere with essential behaviors such as navigation, communication, and foraging. Prolonged exposure to elevated noise levels may also cause hearing damage and even cause fatalities in marine life. To mitigate these impacts of ocean noise pollution on marine ecosystems, the International Maritime Organization and various nations have implemented regulations limiting the radiated noise levels from marine vessels [
9,
10,
11]. Consequently, measuring the radiated noise of ships and underwater vehicles prior to delivery is essential to assess whether their noise levels comply with design requirements and regulatory standards [
12,
13,
14,
15,
16]. As a fundamental physical characteristic, the power of radiated noise provides the basis for calculating other quantities that describe the characteristics of underwater radiated noise, such as the source level, source band level, and source spectral level [
17].
Due to cost and operational constraints, early measurements of underwater radiated noise predominantly employed single hydrophone systems, where the received sound pressure level was corrected via acoustic propagation loss to deduce the source level of the underwater radiated noise [
18,
19,
20,
21,
22]. However, as underwater radiated noise levels decline while the ocean ambient noise level increases year by year, the Signal-to-Noise Ratio (SNR) of data received by a single hydrophone has progressively declined, leading to a gradual inability to meet measurement accuracy requirements [
23,
24,
25,
26]. To address this issue, methods based on hydrophone arrays have been developed that leverage the spatial directivity and array gain to reduce the influence of ambient noise on radiated noise measurements, thereby improving measurement accuracy [
27,
28,
29]. Nevertheless, these methods typically assume plane-wave acoustic propagation, which differs from the actual shallow-water waveguide environment, leading to degraded performance on radiated noise power estimation in practical applications. Subsequently, a radiated noise measurement method based on conventional Matched-Field Processing (MFP) is proposed, which calculates the channel transfer function from the sound source to each element of the array via acoustic field models, exploiting multipath signals for improved accuracy [
30]. However, this method remains susceptible to ambient noise effects under low SNR conditions. For such scenarios, a denoising method is used to estimate the signal covariance matrix prior to beamforming, effectively improving measurement accuracy [
17,
31,
32].
Maximum Likelihood Estimation (MLE) is one of the most fundamental and widely used parameter estimation methods in statistics [
33,
34]. Its core principle is to find the parameter value that maximizes the probability of the observed data. This paper applies it to the measurement of underwater radiated noise and proposes a maximum likelihood estimation method for underwater radiated noise power. Unlike conventional methods for radiated noise measurement based on array signal processing, the proposed method firstly establishes the probability density function of the received data under the assumption of white Gaussian noise. Subsequently, an estimation of the radiated noise power, which conforms to the maximum likelihood criterion, is derived and validated as the minimum variance unbiased estimation. Compared to conventional radiated noise measurement methods, the proposed MLE is the optimal estimation, providing the optimal estimation accuracy for the underwater radiated noise power.
The rest of the paper is organized as follows.
Section 2 introduces the receiving data model with a Vertical Linear Array (VLA).
Section 3 reviews several common radiated noise power estimation methods based on beamforming.
Section 4 establishes the probability density function of the array-received data and develops the MLE method, and
Section 5 derives the Cramér–Rao Lower Bound (CRLB). In
Section 6, the effects of different parameters on MLE performance are investigated through numerical simulations, with performance comparisons against the methods introduced in
Section 3.
Section 7 further validates the practical efficacy of MLE using experimental data. Finally,
Section 8 provides a summary.
2. Data Model
Assuming that the underwater radiated noise source can be approximated as a point source, and considering the effects of shallow-water sound propagation, the time-domain data
received by a single hydrophone can be expressed as the convolution of the channel impulse response and the radiated noise [
35]:
where
denotes convolution,
represents the radiated noise at a distance of 1 m from the source,
denotes the channel impulse response from the source to the hydrophone at the position
, and
represents the ambient noise received by the hydrophone. In this study, it is assumed that both the radiated noise and the ambient noise are characterized as white Gaussian noise and are mutually independent.
By segmenting the received time-domain data and applying the Discrete Fourier Transform (DFT), the frequency-domain received data is obtained and expressed as
where
denotes the time-segment index,
denotes the signal frequency,
,
and
represent the DFTs of the hydrophone data
, the underwater radiated noise
, and the ambient noise
, respectively, and
is the channel transfer function from the radiated noise source to the hydrophone at frequency
.
In practice, an array composed of multiple hydrophones is employed for data acquisition, thereby achieving beamforming gain to enhance the SNR of the received data. For an
M-element hydrophone array, the received data can be expressed as
where the subscript
. indicates the index of each hydrophone.
Due to the influence of the ocean channel, the signal power received by various array hydrophones may vary. Consequently, the SNR of the array received data is defined by utilizing the average power of the received signal and the ambient noise across the entire array as
where
denotes the square of the
norm of a vector,
represents the power of the radiated noise, and
represents the average power of the ambient noise received by the
M-element array.
3. Beamforming-Based Power Estimation Methods
For comparison, the commonly used beamforming-based power estimation methods for underwater radiated noise are first reviewed. First, the beamforming is performed to the array received data, and the output signal is expressed as [
35]
where
is a simplified notation of
,
denotes the weighting vector, and the superscript
represents the conjugate transpose. Then the beamforming output power is expressed as
In Equation (6),
denotes the covariance matrix of the received data, where
is the expectation operator,
is the covariance matrix of the received radiated noise, and
is the covariance matrix of the ambient noise. The approximation means that the power of ambient noise is negligible, which is reasonable when the SNR of the beamforming output is high. So the power of the radiated noise is given by
3.1. Conventional Beamforming
Under the far-field assumption, the estimated channel transfer function and the weighting vector can be expressed as
where
denotes the reference hydrophone index,
is the distance between the reference hydrophone and the sound source,
is the wavenumber,
is the inter-element spacing, and
is the bearing angle of the source relative to the array. Substituting Equation (8) to Equation (7) yields the Conventional BeamForming (CBF) estimate of radiated noise power
3.2. Near-Field Focused Beamforming
In the near-field, the estimated channel transfer function and the weighting vector can be expressed as
where
is the distance between the
m-th hydrophone and the source. Substituting Equation (10) to Equation (7), the radiated noise power estimate with Near-field Focused BeamForming (NFBF) is given by
3.3. Matched-Field Processing
For matched-field processing, the weighting vector is typically represented as the normalized form of the channel transfer function:
where
denotes the
norm of a vector. Substituting the weighting vector to Equation (7), the power estimate with MFP is expressed as [
30]
3.4. RPCA-Based Denoising Approach
Since the derivation of Equation (6) neglects the influence of ambient noise, the measurement error becomes significant when the beamforming output SNR is low. Consequently, denoising methods can be applied first to suppress the ambient noise before estimating the radiated noise power. The Robust Principal Component Analysis (RPCA) method can be used to estimate the signal covariance matrix and the noise covariance matrix simultaneously. The optimization formulation can be expressed as [
31]
where
and
denote the nuclear norm and the
norm, respectively,
denotes the regularization parameter, and
means the matrix is a positive semi-definite matrix. Then substituting the estimated signal covariance matrix into Equation (11), the radiated noise power estimate is expressed as
4. Maximum Likelihood Estimation Method
Differing from beamforming-based approaches, this section introduces a radiated noise power estimation method based on probability density maximization. Since the array elements are mutually independent, the signal received by the array follows a joint normal distribution. In the complex domain, the Probability Density Function (PDF) is expressed as [
36]
where
denotes the determinant of a matrix.
For the received
independent and identically distributed frequency snapshots, the corresponding joint PDF is
For the matrix
, it can be transformed into
where
is the identity matrix. For the rank-1 matrix
in Equation (18), it has only one non-zero eigenvalue
. Additionally, since the sum of all eigenvalues of a matrix equals its trace, it follows that
where
denotes the trace of a matrix. Moreover, as the determinant of a matrix is equal to the product of all its eigenvalues, then
According to the matrix inversion lemma, the inverse of the matrix
can be expressed as
Substituting Equations (20) and (21) into Equation (17), and additionally, to facilitate computations, Equation (17) is represented in logarithmic form as
In Equation (22),
where
denotes the sampling covariance matrix of the received data.
To find the parameter
that maximizes Equation (22), by substituting Equation (23) into Equation (22) and taking the partial derivative of the resulting expression with respect to the parameter
[
36], one obtains
By setting Equation (24) as equal to 0, the estimation of
can be obtained, expressed as
For Equation (25), it can be interpreted as the outputs of Minimum Variance Distortionless Response (MVDR) beamforming of the covariance matrix difference between the received data and the ambient noise, thereby eliminating the influence of ambient noise on the estimation. Additionally, the ambient noise covariance matrix
for radiated noise power estimation can be obtained by collecting ambient noise before radiated noise measurement. The channel transfer function
can be calculated from marine environmental parameters using acoustic field simulation toolbox, such as the KrakenC program based on normal mode [
37,
38] in the Acoustics Toolbox [
39]. The toolbox we are using was updated on 4 November 2020. Combining these with the sampling covariance matrix of the received data
, the radiated noise power can be accurately determined.
5. The Cramér–Rao Lower Bound Derivation
To analyze the estimation performance of the proposed method, we theoretically verified whether its estimation variance achieves the CRLB. First, the theoretical value of CRLB is derived. For Equation (24), taking the second partial derivative with respect to the parameter
yields
According to Reference [
36], the Fisher information matrix
can be obtained by taking the negative expectation of Equation (26), denoted as
where
. Subsequently, the CRLB can be calculated and represented as
Subsequently, we prove that the proposed power estimation method achieves the CRLB. For Equation (24), it can be transformed into
According to CRLB theorem [
36], the variance of estimation meets the CRLB. In addition, the expectation of Equation (25) is given by
Thus, it can be inferred that is the Minimum Variance Unbiased Estimation (MVUE) of .
6. Numerical Simulation Results Analysis
The simulated shallow-water waveguide environment is depicted in
Figure 1. The 100 m water column overlies a 10 m sediment layer, with the sound speed profile annotated in the figure. Key environmental parameters affecting acoustic propagation, including sound speed, density, and attenuation coefficients, are specified in the diagram. The radiated noise is assumed to be a single-frequency signal at 750 Hz with the power of 90 dB. The sound source marked by a black pentagram in the figure is located at a depth of 50 m. The receiving hydrophone array marked by a set of black solid circles is a 31-element VLA with half-wavelength inter-element spacing. The central depth of the array is 50 m, and it is horizontally situated 100 m away from the radiated noise source. The ambient noise is set as white Gaussian noise with a power of 60 dB.
In the following simulations, we compare the proposed MLE method against conventional beamforming approaches: CBF, NFBF, MFP, and an RPCA-based denoising method. Performance is systematically evaluated across varying conditions using the estimated variance, calculated as
where
represents the average of
N estimated values, and
N is the number of Monte Carlo trials. In the following simulations, the number of Monte Carlo trials
N is taken as 10,000. For the convenience of numerical display, the logarithm of the calculated variance is represented in the picture.
6.1. Impact of SNR on Estimation Results
The SNR of the received data is a critical factor influencing the radiated noise estimation results. It can be altered by adjusting the power of the radiated noise. The power of the radiated noise varies within the range of 70 to 110 dB, and the corresponding SNR of the received data is shown in
Figure 2. Furthermore, the estimation results and estimation variances through different methods are shown in
Figure 3.
It can be seen that the SNR of the received data varies linearly with the radiated noise power. The estimation errors of different methods decrease as the radiated noise power increases, and the estimation variances tend towards CRLB with the increasing power. Among these methods, the CBF method exhibits the highest estimation error and variance, primarily due to the failure of the far-field assumption at close ranges. Consequently, the NFBF method effectively reduces the estimation error caused by range mismatch compared to CBF, but its measurement error remains relatively large. The MFP method achieves the CRLB in terms of estimation variance under different radiated noise power, while its estimation error is relatively larger at lower radiated noise level due to the influence of ambient noise. The RPCA method reduces the impact of ambient noise through denoising, thereby decreasing the estimation error, but its estimation variance remains higher than CRLB. In contrast, the MLE method is least affected by variations in the radiated noise power, providing the most accurate estimation results. Moreover, its estimation variance reaches the CRLB, demonstrating the optimal estimation performance.
6.2. Impact of Snapshot Number on Estimation Results
The proposed radiated noise power estimation method necessitates the use of the ambient noise covariance matrix, which can be estimated through prolonged monitoring of ambient noise. However, when the number of snapshots
in Equation (17) is limited, discrepancies may arise between the actual noise covariance matrix and the sampling covariance matrix, potentially affecting the estimation accuracy.
Figure 4 presents the estimation results and variances of radiated noise for different snapshot numbers.
It can be observed that, except for the RPCA method, the estimation results of the other methods remain constant regardless of the number of snapshots, while their estimation variances decrease as the number of snapshots increases. Among these methods, the CBF and NFBF methods exhibit the largest estimation errors and variances. The estimation variance of the MFP method achieves CRLB, but its estimation error remains relatively large. The denoising performance of the RPCA method improves with the increasing number of snapshots [
32], thereby enhancing the estimation accuracy, although its estimation variance remains relatively large. In contrast, the MLE method achieves optimal performance in both estimation result and variance, reaching the CRLB and demonstrating the best estimation performance.
6.3. Impact of Signal Frequency on Estimation Results
For an array with fixed inter-element spacing, the beam patterns of different beamforming methods vary with frequency, raising the question of whether this variability affects the performance of beamforming-based methods for estimating radiated noise power. Additionally, it is also worth discussing whether the MLE method is subject to this issue. Due to variations in the channel transfer functions at different frequencies, the SNR of the received data fluctuates. For a VLA with 1 m inter-element spacing,
Figure 5 presents the SNR of the received data under different frequency conditions, and
Figure 6 provides the estimation results and variances.
It can be observed that the SNR of the received data remains nearly constant at different frequencies. Except for the CBF method, the estimation results and variances of the other methods are largely unaffected by frequency variations. For the CBF method, based on the Fraunhofer distance condition under the far-field assumption, it becomes increasingly challenging to satisfy the far-field condition as the frequency rises at a fixed distance. This leads to a rise in both estimation error and variance at higher frequencies. The NFBF method exhibits relatively large estimation error and variance, while the MFP method achieves the CRLB but still suffers from significant estimation error. The RPCA method demonstrates relatively smaller estimation error and variance. In contrast, the MLE method achieves the smallest estimation error, and its estimation variance reaches CRLB, thereby demonstrating optimal performance in radiated noise estimation.
6.4. Impact of the Unknown Sound Position on Estimation Results
In previous simulations, it was assumed that the source position was known. However, in practical scenarios, deviation in the target position can occur. Under such conditions, it is necessary to evaluate the performance of radiated noise estimation at different assumed positions.
Figure 7 presents the estimation results of the radiated noise power using channel transfer functions at different positions.
It can be observed that the position with the highest estimated power corresponds to the actual position of the sound source, and the estimation result matches the true power of the radiated noise. It indicates that the proposed MLE method remains applicable even when the source position is unknown.
7. Experiment Validation
In this section, the performance of MLE on underwater radiated noise measurement is validated using experimental data. The experiment was implemented in a reservoir in December 2019 [
32].
Figure 8 illustrates the reservoir environment, as well as the geometric configuration of the source and receiving array. The sound speed profile was measured using a CTD sensor. The properties of the sediment layer and semi-infinite layer, including sound speed, density, and attenuation coefficients, were determined based on seabed material. A UW-350 transducer submerged at a depth of 50 m emitted two types of signals: (1) a 750 Hz Continuous Wave (CW) tone, and (2) a 10 s duration Linear Frequency Modulated (LFM) pulse with a frequency range of 100–900 Hz. A nested vertical hydrophone array, comprising a 24-element uniform linear subarray with a 1 m spacing and a 17-element subarray with a 0.5 m spacing, was deployed vertically at a horizontal range of 34.5 m from the source, spanning depths from 73 to 97 m. Additionally, a reference hydrophone, located at the same 50 m depth as the source but horizontally offset by 5.2 m, was used to monitor the source level. Due to the short propagation distance, the recorded signals exhibited high SNRs, allowing the single hydrophone-derived power estimates to serve as real value for radiated signal validation. All data were sampled at 48 kHz.
During post-processing, data from the central 15 hydrophones of the nested array with 1 m inter-element spacing were analyzed. To investigate the impact of snapshot number on measurement performance, the CW signal duration was varied from 2 s to 10 s. Frequency-domain conversion was performed via DFT with a 0.2 s rectangular window and 50% overlap.
Figure 9a presents the estimated powers of CW signal across different snapshot numbers, while
Figure 9b shows the corresponding estimation biases. These experimental results are basically consistent with the simulated trends in
Figure 4a. It can be seen that the estimation bias of RPCA exhibits a decreasing trend as snapshot numbers increase. Moreover, the MLE method demonstrates the best estimation performance, highlighting its superior robustness to snapshot limitations compared to other methods.
To comparatively evaluate frequency-dependent performance, the power and associated estimation biases of the LFM signal across the 500–900 Hz are analyzed. The results are presented in
Figure 10. The comparative analysis reveals that MLE achieves the closest agreement with real values across the entire frequency band. However, it should be noted that under specific spectral regions, MLE exhibits a slight negative bias, likely caused by noise covariance matrix mismatch. In contrast, other methods demonstrate bias fluctuations exceeding 3 dB.
8. Conclusions
Underwater radiated noise measurement plays a crucial role in assessing the radiated noise levels of marine vessels. Unlike measurement methods based on array beamforming, this paper approaches the problem from a probabilistic perspective, proposing a MLE method for underwater radiated noise power. Assuming white Gaussian noise, the PDF representation of the hydrophone array received data is established, and the radiated noise power estimation under the maximum likelihood criterion is derived.
Theoretical derivation proves that this estimate is a MVUE, satisfying the CRLB. Numerical simulation results indicate that the proposed method exhibits reduced susceptibility to ambient noise and maintains optimal power estimation performance even under low SNR conditions. Experimental data further verifies the effectiveness of this estimation method in practical measurement applications.
Additionally, when the source location is unknown, the method can still locate the source and estimate its power by searching for the peak value within a specified area. However, due to the necessity of employing the ambient noise covariance matrix, the method is more suitable for fixed measurement sites with stationary ambient noise, as the statistical covariance matrix of ambient noise can be obtained through prolonged monitoring.
For future work, (1) it is necessary to consider that ship-radiated noise sources are not a point source in realistic scenarios, but rather volume sources composed of multipoint sources; and (2) considering the fluctuations in ocean environmental parameters, the robustness of this method to changes in environmental parameters needs to be analyzed.