A Fast Sound Source Mapping by Morphological Operations on Acoustic Images

Wu, Yue Ivan; Song, Jiahao; Yin, Hang; Quan, Qinhao

doi:10.3390/math14111865

Open AccessArticle

A Fast Sound Source Mapping by Morphological Operations on Acoustic Images

by

Yue Ivan Wu

^1,*

,

Jiahao Song

¹,

Hang Yin

² and

Qinhao Quan

²

¹

College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China

²

College of Computer Science, Sichuan University, Chengdu 610065, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(11), 1865; https://doi.org/10.3390/math14111865

Submission received: 27 February 2026 / Revised: 24 May 2026 / Accepted: 25 May 2026 / Published: 27 May 2026

(This article belongs to the Section E1: Mathematics and Computer Science)

Download

Browse Figures

Versions Notes

Abstract

The deconvolution approach for the mapping of acoustic sources (DAMAS) based on the microphone array is proved effective in various acoustic imaging applications. Generally, DAMAS and its variations result in heavy computation load due to the nature of large-scale linear equations and the iterative solver, which prevent the deployment of DAMAS to platforms with limited resources, such as the edge devices of the internet of things (IoT). In order to enhance the computational efficiency of DAMAS, a fast algorithm based on DAMAS with grid compression by the morphological operations on the acoustic images is proposed in this work. The proposed approach intentionally neglects the physics behind the acoustic imaging, but emphasizes the general visual features of acoustic images, as if they were natural images. A low computation load can be guaranteed regardless of the complicated acoustic environments, which alternatively ensures the robustness of proposed algorithm. Numerical simulations demonstrate that the proposed algorithm effectively accelerates the acoustic image reconstruction. In practical experiments, the proposed method reduces the algorithm time to be within 26% of DAMAS. In certain scenarios, both the algorithm time and localization accuracy of the proposed method outperform the conventional methods.

Keywords:

acoustic cameras; acoustic imaging; sound source localization; microphone arrays; morphological operations

MSC:

60G35

1. Introduction

In recent years, acoustic cameras have become increasingly popular in various fields such as air pump experiments [1], aircraft noise control [2], and port noise monitoring [3]. The acoustic cameras capture sounds with microphone arrays and map the sound intensity on the natural images obtained by the optical camera for sound source localization. As acoustic cameras are typically desired in the real-time applications, it is vital that the sound source localization algorithm is simple and fast with low computation burden. For example, if an acoustic camera aims to achieve approximately 30 frames per second, the acoustic imaging must be done within 0.04 s. The conventional delay-and-sum (DAS) beamforming [4,5,6] is widely used for acoustic imaging due to its advantages of simplicity. However, DAS is data-independent, resulting high sidelobes and narrow dynamic range. To explore the signal’s characteristics, various data-dependent beamforming techniques are utilized: the orthogonal beamforming performs eigenvalue decomposition on the cross-spectral matrix [7,8]; the functional beamforming leverages the incoherence of source signals and matrix functions to suppress sidelobes, and improves the spatial resolution [9,10]; and optimized beamforming, such as the minimum variance distortionless response (MVDR) beamformer [11] and the linearly constrained minimum variance (LCMV) [12] beamformer, calculates the best weight vector based on the statistics of the array signals.

The spatial resolutions of the above beamforming methods are limited by their beampatterns. To achieve higher spatial resolution, the deconvolution sound source localization approaches require close attention. Ref. [13] develops the CLEAN algorithm for sound source localization, and [14] extends CLEAN to CLEAN-SC for coherent sources localization. Ref. [15] directly solves the deconvolution problem by covariance matrix fitting with sparsity constraints, and [16] extends the method to tackle the coherent sources. Ref. [17] introduces the orthogonal matching pursuit (OMP) method to solve the problem, based on which [18] develops the non-negative matrix factorization and the hierarchical clustering to ensure the algorithm speed. However, the OMP-based methods are prone to local optimum convergence.

Ref. [19] proposes the deconvolution approach for the mapping of acoustic sources (DAMAS). It removes the effects of the point spread functions, thereby significantly improving the spatial resolution. Based on DAMAS, ref. [20] proposes the DAMAS-C algorithm for coherent sources. The DAMAS-based methods are considered as a major breakthrough in sound source localization and acoustic imaging [21]. Since DAMAS iteratively solves the linear equation systems, the major drawback is the substantial computational burden [22,23]. The high demand of computation resources prevents DAMAS from being effective in real-time acoustic imaging.

To reduce the algorithm complexity, two major strategies are proposed. One strategy is based on the assumption of shift-invariant point spread function. Ref. [24] proposes DAMAS2 and DAMAS3. Refs. [25,26] develop the non-negative least squares (NNLS) algorithm. Ref. [26] proposes the FFT-NNLS algorithm. Ref. [27] proposes the FFT-OMP-DAMAS algorithm. Ref. [28] proposes the DAMAS2-v and FFT-NNLS-v algorithms. Although the aforementioned methods optimize DAMAS, they do not reduce the scale of the linear equations, which is the key factor aggravating the computation load of deconvolution approaches.

In recent years, data-driven methods have been incorporated into acoustic imaging algorithms. Ref. [29] proposes an autoencoder structure model, and the trained network can achieve source localization with significant faster speed than DAMAS. Ref. [30] proposes the DAMAS-FISTA-Net, which applies the model learned from the simulated data to real-world data. Ref. [31] proposes a grid-based acoustic source localization method via the deconvolution through mean-reverting stochastic differential equations with a score-based generative model. To extract more comprehensive features, ref. [32] proposes a dual-encoder U-net deep learning model, converting beamforming maps into high-resolution maps of sources’ strength distribution. And ref. [33] proposes a diffusion-based framework for acoustic source mapping. The above data-driven methods greatly enhance the deconvolution approach performance in terms of accuracy with lower computation loads. However, these approaches heavily rely on the large amount of data for the model training process, and thus the performance naturally depends on the specific datasets and environments.

To overcome the above drawback, the other strategy based on the selection of grid points to reduce the scale of linear equations for deconvolution, a.k.a the grid compression, is developed. Ref. [34] proposes DAMAS-CG1 to reduce the grid points based on the wavelet compression. To mitigate the spatial aliasing, ref. [35] proposes DAMAS-CG2, which updates the DAS beamformer outputs by applying diagonal removal on the spatial covariance matrix. Ref. [36] proposes DAMAS-CG3 to accommodate the functional beamforming [9] and further improves algorithm efficiency.

The above grid compression methods are performed based on the physical principle of acoustic imaging. In adverse scenarios, such as complicated channels, low signal-to-noise ratio (SNR), and spatially close sources, these methods may perform conservatively. That is, their improvements of computation efficiency may be limited compared to the original DAMAS. In this work, an entirely different grid compression philosophy is proposed. Instead of signal processing with the principle of acoustic imaging, the proposed method simply and brutally takes the acoustic images as natural images and applies the morphological operations to implement the grid compression. The proposed method implicitly neglects the physics behind the acoustic imaging but relies on the general visual features of acoustic images, e.g., the peaks are likely to be round or oval due to the beamforming. Thus, a heavy grid compression (hence the low computation load) can be guaranteed regardless of the complicated acoustic environments, which alternatively ensures the robustness of the proposed algorithm.

2. Problem Formulation

As shown in Figure 1, a microphone array consists of M microphones geometrically located at

p_{m}, \forall m \in \{1, 2, \dots, M\}

. Suppose an unknown number of static point sources emitting wide-sense stationary sound signals in the three dimensional space. Suppose that an imaginary grid in the three-dimensional space has N grid points locating at

g_{n}

,

\forall n \in \{1, 2, \dots, N\}

.

Without a loss of generality, take the geometric center of the microphone array as the Cartesian coordinates’ origin, i.e.,

p_{0} = \frac{1}{M} \sum_{m = 1}^{M} p_{m} = {[0, 0, 0]}^{T}

. Thus, the distance from each grid point

g_{n}

to

p_{m}

can be defined as

d_{m, n} = {∥ g_{n} - p_{m} ∥}_{2}

, where

{∥ \cdot ∥}_{2}

denotes the Euclidean norm. The time difference of arrival (TDOA) between the received signals at

p_{m}

and

p_{0}

equals

τ_{m, n} = \frac{d_{m, n} - d_{0, n}}{c}

, where c denotes the speed of sound.

The microphone array’s steering vector to

g_{n}

can be written as

\begin{matrix} a_{n} (f) & = & {[d_{1, n}^{- 1} e^{- j 2 π f τ_{1, n}}, \dots, d_{M, n}^{- 1} e^{- j 2 π f τ_{M, n}}]}^{T} \\ = & e^{j \frac{2 π}{λ} d_{0, n}} {[d_{1, n}^{- 1} e^{- j \frac{2 π}{λ} d_{1, n}}, \dots, d_{M, n}^{- 1} e^{- j \frac{2 π}{λ} d_{M, n}}]}^{T}, \end{matrix}

(1)

where f and

λ

denote the signal’s frequency and corresponding wavelength, respectively.

The array signal in the frequency domain can be expressed as

\begin{matrix} x (f) & = & \sum_{n = 1}^{N} a_{n} (f) s_{n} (f) + n (f) \\ = & A (f) s (f) + n (f), \end{matrix}

(2)

where

s (f) = {[s_{1} (f), \dots, s_{N} (f)]}^{T}

stand for the frequency spectral vector of N uncorrelated sound signals at

g_{n}

;

\forall n \in \{1, \dots, N\}

,

A (f) = [a_{1} (f), \dots, a_{N} (f)]

denotes the M-by-N array manifold matrix; and

n (f)

denotes the additive noise on the microphones that is uncorrelated with

s_{n} (f)

, and has the spatially identical power spectral

σ^{2} (f)

.

With

x (f)

in Equation (2), the cross-spectral matrix (CSM) of the array signal equals

\begin{matrix} C (f) & = & E \{x (f) x^{H} (f)\} \\ = & A (f) \underset{: = C_{s} (f)}{\underset{︸}{E \{s (f) s^{H} (f)\}}} A^{H} (f) + \underset{: = C_{n} (f)}{\underset{︸}{E \{n (f) n^{H} (f)\}}}, \end{matrix}

(3)

where

C_{s} (f) = diag [ρ_{1} (f), ρ_{2} (f), \dots, ρ_{N} (f)]

,

C_{n} (f) = σ^{2} (f) I_{M}

are the CSM’s of the source signals and the noise, respectively.

ρ_{n}

stands for the signal power at the n-th grid point.

diag [\cdot]

denotes the diagonal matrix, and

I_{M}

represents the M-order identity matrix.

Since the theoretical

x (f)

in Equation (2) can be hardly obtained, it is generally estimated by a certain number of consecutive snapshots (a frame) in the time domain.

{\hat{x}}_{k} (f)

denotes the array signal spectrum estimated from the k-th frame, for

k = 1, 2, \dots, K

. Thus, the CSM of the array signal can be estimated by

\begin{matrix} \hat{C} (f) & = & \frac{1}{K} \sum_{k = 1}^{K} {\hat{x}}_{k} (f) {\hat{x}}_{k}^{H} (f) \\ = & A (f) {\hat{C}}_{s} (f) A^{H} (f) + {\hat{C}}_{n} (f) \\ = & \sum_{n = 1}^{N} {\hat{ρ}}_{n} (f) a_{n} (f) a_{n}^{H} (f) + {\hat{C}}_{n} (f), \end{matrix}

(4)

where

{\hat{C}}_{s} (f) : = diag [{\hat{ρ}}_{1} (f), \dots, {\hat{ρ}}_{N} (f)]

and

{\hat{C}}_{n} (f) : = {\hat{σ}}^{2} (f) I_{M}

are unknown.

The output of the DAS beamformer steering towards the n-th grid point equals

\begin{matrix} b_{n} (f) & = & w_{n}^{H} (f) \hat{C} (f) w_{n} (f) \\ = & w_{n}^{H} (f) [\sum_{n^{'} = 1}^{N} {\hat{ρ}}_{n^{'}} (f) a_{n^{'}} (f) a_{n^{'}}^{H} (f)] w_{n} (f) \\ + w_{n}^{H} (f) {\hat{C}}_{n} (f) w_{n} (f) \\ = & \sum_{n^{'} = 1}^{N} {\hat{ρ}}_{n^{'}} (f) \underset{: = p_{n, n^{'}} (f)}{\underset{︸}{w_{n}^{H} (f) a_{n^{'}} (f) a_{n^{'}}^{H} (f) w_{n} (f)}} \\ + w_{n}^{H} (f) {\hat{C}}_{n} (f) w_{n} (f), \end{matrix}

(5)

where

p_{n, n^{'}} (f) \geq 0

is known as the point spread function, and

\begin{matrix} w_{n} (f) & = & \frac{1}{M} {[d_{1, n} e^{- j \frac{2 π}{λ} d_{1, n}}, \dots, d_{M, n} e^{- j \frac{2 π}{λ} d_{M, n}}]}^{T} \end{matrix}

(6)

is the DAS beamformer weight vector constrained by

| w_{n}^{H} (f) a_{n} (f) | = 1

.

With

b_{n} (f)

in Equation (5), stacking the beamfomrer outputs towards all of

g_{n}, \forall n \in {1, \dots, N}

gives

\begin{matrix} b (f) & = & {[b_{1} (f), \dots, b_{N} (f)]}^{T} \\ = & \underset{: = P (f)}{\underset{︸}{[\begin{matrix} p_{1, 1} (f), & p_{1, 2} (f), & \dots & p_{1, N} (f) \\ p_{2, 1} (f), & p_{2, 2} (f), & \dots & p_{2, N} (f) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ p_{N, 1} (f), & p_{N, 2} (f), & \dots & p_{N, N} (f) \end{matrix}]}} \underset{: = \hat{ρ} (f)}{\underset{︸}{[\begin{matrix} {\hat{ρ}}_{1} (f) \\ {\hat{ρ}}_{2} (f) \\ ⋮ \\ {\hat{ρ}}_{N} (f) \end{matrix}]}} \\ + [\begin{matrix} w_{1}^{H} (f) {\hat{C}}_{n} (f) w_{1} (f) \\ w_{2}^{H} (f) {\hat{C}}_{n} (f) w_{2} (f) \\ ⋮ \\ w_{N}^{H} (f) {\hat{C}}_{n} (f) w_{N} (f) \end{matrix}] \\ = & P (f) \hat{ρ} (f) + \frac{{\hat{σ}}^{2} (f)}{M^{2}} d, \end{matrix}

(7)

where

d = {[{\overset{˘}{d}}_{1}^{2}, {\overset{˘}{d}}_{2}^{2}, \dots, {\overset{˘}{d}}_{N}^{2}]}^{T}

and

{\overset{˘}{d}}_{n}^{2} = \sum_{m = 1}^{M} d_{m, n}^{2}

. Note that

d

is a constant vector determined by the locations of all grid points

g_{n}, \forall n

and the locations of all microphones

p_{m}, \forall m

.

When the noise power

{\hat{σ}}^{2} (f)

is sufficiently low, Equation (7) implies that

\begin{matrix} b (f) & \approx & P (f) \hat{ρ} (f) . \end{matrix}

(8)

Generally in acoustic imaging and sound source localization, the number of grid points is much larger than the number of sources, i.e.,

N ≫ M

. The sounds sources are presumed sparsely distributed on the grid. Thus, the vector

\hat{ρ} (f)

in Equation (7) is generally sparse. The general problem is to determine

\hat{ρ} (f)

from the DAS beamformer outputs

b (f)

.

3. Proposed Method

3.1. Acoustic Imaging by Natural Image Processing

It is well known that the DAS beamforming suffers from limited spatial resolution and dynamic range due to the point spread function. For higher resolution and faster processing, it is desired to decide which grid points are more likely to have contributions for sound sources localization. Inspired by segmentation algorithms in natural image processing, the morphological watershed method is applied to the DAS beamformer output

b (f)

. This approach yields a global threshold, below which the corresponding grid points are considered redundant and discarded for fast deconvolution in acoustic imaging.

3.1.1. Erosion and Dilation

In natural image processing, the structuring element plays an essential role in morphological dilation and erosion operations. A flat structuring element is a binary-valued neighborhood where true pixels are considered in the morphological computation, and false pixels are excluded. The center pixel of the structuring element identifies the pixel being processed.

To accommodate the morphological operations, the DAS beamformer output

b (f)

and its corresponding grid points

g_{n}, \forall n \in {1, \dots, N}

can be regarded as a two-dimensional natural image, denoted by the set B. Thus, the size of this image B naturally corresponds to the rectangular spatial grid in Figure 1. Note the

(i, j)

-th grid point (pixel)

g_{i, j}

in B has the value

B_{i, j}

, and the flat structuring element centered at

g_{i, j}

is denoted as the set K.

Denote

ε_{K} (B)

as the erosion of B by the structuring element K, where its

(i, j)

-th entry is equal to

\begin{matrix} {[ε_{K} (B)]}_{i, j} & = & min_{g_{k, l} \in K} \{B_{i + k, j + l}\} . \end{matrix}

(9)

With S denoting the mask image, the one-step geodesic erosion of B with respect to S is defined as

\begin{matrix} ε_{K, S}^{(1)} (B) & = & ε_{K} (B) \lor S, \end{matrix}

(10)

where ∨ denotes the point-wise maximum operator. Thus, the n-step geodesic erosion can be obtained by repeating Equation (10) as

\begin{matrix} ε_{K, S}^{(n)} (B) & = & ε_{K, S}^{(1)} (ε_{K, S}^{(n - 1)} (B)) . \end{matrix}

(11)

Denote

δ_{K} (B)

as the dilation of B by the structuring element K, where its

(i, j)

-th entry is equal to

\begin{matrix} {[δ_{K} (B)]}_{i, j} & = & max_{g_{k, l} \in K} \{B_{i + k, j + l}\} . \end{matrix}

(12)

With S denoting the mask image, the one-step geodesic dilation of B with respect to S is defined as

\begin{matrix} δ_{K, S}^{(1)} (B) & = & δ_{K} (B) \land S, \end{matrix}

(13)

where ∧ denotes the point-wise minimum operator. Thus, the n-step geodesic dilation can be obtained by repeating Equation (13) as

\begin{matrix} δ_{K, S}^{(n)} (B) & = & δ_{K, S}^{(1)} (δ_{K, S}^{(n - 1)} (B)) . \end{matrix}

(14)

3.1.2. Morphological Reconstruction

With the geodesic dilation and geodesic erosion defined in Equations (11) and (14), the opening by reconstruction can be defined as

\begin{matrix} γ (B) & = & δ_{K, B}^{(n)} (ε_{K} (B)), \end{matrix}

(15)

with the convergence condition of

δ_{K, B}^{(n + 1)} (ε_{K} (B)) = δ_{K, B}^{(n)} (ε_{K} (B))

.

Similarly, the closing by reconstruction can be defined as

\begin{matrix} φ (B) & = & ε_{K, B}^{(n)} (δ_{K} (B)), \end{matrix}

(16)

with the convergence condition of

ε_{K, B}^{(n + 1)} (δ_{K} (B)) = ε_{K, B}^{(n)} (δ_{K} (B))

.

Through

γ (B)

, the undesired side lobes and noisy spikes in B, i.e., the original acoustic image generated by DAS, are expected to be removed. On the other hand,

φ (B)

is applied to fill the undesired holes in B. Thus, the original acoustic image generated by the DAS beamformer and processed by morphological reconstruction can be expressed by

\begin{matrix} \overset{˚}{B} & = & φ (γ (B)) . \end{matrix}

(17)

Since the cardinality

card (\overset{˚}{B}) = card (B) = N

,

\overset{˚}{B}

can be converted back to a N-by-1 column vector

\overset{˚}{b} (f)

for the subsequent processing.

3.2. Grid Points Selection

After the morphological reconstruction of B in Section 3.1.2, the Otsu’s method [37] is applied to threshold

\overset{˚}{b} (f)

in order to determine the

\tilde{N}

-by-1 vector

\tilde{b} (f)

, corresponding to the

\tilde{N} < N

grid points. These

\tilde{N}

grid points are more likely to be the locations of sound sources.

Normalize and quantify

{\overset{˚}{b}}_{n} (f), \forall n

with L gray levels

{0, 1, \dots, L - 1}

(generally

L = 256

in natural image processing) to build

\bar{b} (f)

. Thus, the empirical probability of having the gray level l can be determined by the histogram as

μ_{l}

.

Let

η

be a variable threshold divide N grid points into two classes

\begin{matrix} C_{0} (η) & : = & \{\forall g_{n} | {\bar{b}}_{n} (f) \in [0, η]\}, \end{matrix}

(18)

\begin{matrix} C_{1} (η) & : = & \{\forall g_{n} | {\bar{b}}_{n} (f) \in [η + 1, L]\}, \end{matrix}

(19)

where the probabilities of

C_{0} (η)

and

C_{0} (η)

are

\begin{matrix} μ_{C_{0}} (η) & : = & \sum_{l = 0}^{η} μ_{l}, \end{matrix}

(20)

\begin{matrix} μ_{C_{1}} (η) & : = & 1 - μ_{C_{0}} = \sum_{η + 1}^{L} μ_{l} . \end{matrix}

(21)

The between-class variance of

C_{0} (η)

and

C_{1} (η)

is calculated as

\begin{matrix} σ_{C}^{2} (η) & = & μ_{C_{0}} (η) μ_{C_{1}} (η) \frac{\sum_{\forall n, g_{n} \in C_{0} (η)} {\bar{b}}_{n} (f)}{card (C_{0} (η))} \frac{\sum_{\forall n, g_{n} \in C_{1} (η)} {\bar{b}}_{n} (f)}{card (C_{1} (η))}, \end{matrix}

(22)

and the optimal threshold

η^{*}

is determined as

\begin{matrix} η^{*} & = & \underset{η \in {0, \dots, L - 1}}{arg} max σ_{C}^{2} (η) . \end{matrix}

(23)

Lastly, the reserved grid points can be determined by

\begin{matrix} \tilde{B} & = & \{\forall g_{n} | {\bar{b}}_{n} (f) > η^{*}\}, \end{matrix}

(24)

where

\tilde{N} : = card (\tilde{B})

, and

\tilde{b} (f)

is determined by selecting the

\tilde{N}

entries of

\overset{˚}{b} (f)

corresponding to

g_{n} \in \tilde{B}

.

3.3. Dimension-Reduced Linear Equation System

With

\tilde{b} (f)

, Equation (8) can be simplified as

\begin{matrix} \tilde{b} (f) & = & \tilde{P} (f) \tilde{ρ} (f), \end{matrix}

(25)

where

\tilde{P} (f) \in C^{\tilde{N} \times \tilde{N}}

and

\tilde{ρ} (f) \in C^{\tilde{N} \times 1}

are dimension-reduced versions of

P (f)

and

\hat{ρ} (f)

by selecting the grid points in

\tilde{B}

.

If

\tilde{N} ≪ N

, then the linear equation system in Equation (25) has a much smaller scale compared to Equation (8). In such a case, the computational complexity of the devolution approach for sound source localization can be significantly reduced. Consequentially, much faster processing can be expected in real-time acoustic imaging.

In Equation (25),

\tilde{P} (f)

is generally singular, i.e., typically

rank (\tilde{P} (f)) ≪ \tilde{N}

. Therefore, the Gauss–Seidel iterative method [21] is applied to solve Equation (25). The i-th iteration is performed as

\begin{matrix} {\tilde{ρ}}_{n}^{(i)} (f) & = & max \{0, {\tilde{b}}_{n} (f) - [\sum_{n^{'} = 1}^{n - 1} {\tilde{p}}_{n, n^{'}} (f) {\tilde{ρ}}_{n^{'}}^{(i)} (f) + \sum_{n^{'} = n + 1}^{\tilde{N}} {\tilde{p}}_{n, n^{'}} (f) {\tilde{ρ}}_{n^{'}}^{(i - 1)} (f)]\}, \end{matrix}

(26)

where

max \{0, \cdot\}

is due to the fact of

{\tilde{ρ}}_{n} (f) \geq 0, \forall n

, since they represent the sound source powers. Generally, the initialization can be set as

\tilde{ρ} (f) = 0

.

For the selected

\tilde{N}

grid points, the sufficient I iterations in Equation (26) are expected for convergence. For the other

N - \tilde{N}

grid points,

{\tilde{ρ}}_{n} (f)

is simply set to 0.

3.4. Algorithm Summary

The algorithmic steps of the proposed method is summarized in Algorithm 1.

Algorithm 1: algorithmic steps of the proposed method

1 Obtain

b (f)

in Equation (7) by the DAS beamforming output

b_{n} (f), \forall n

in

Equation (5);

2 Use

g_{n}, b_{n} (f), \forall n

to form the image B as defined in Section 3.1.1;

3 Perform morphological reconstruction in Equation (17) to get

\overset{˚}{B}

and

\overset{˚}{b} (f)

;

4 Construct

\bar{b} (f)

from

\overset{˚}{b} (f)

as stated in Section 3.2;

5 Determine the optimal threshold

η^{*}

in Equation (23);

6 Obtain

\tilde{B}

and

\tilde{b} (f)

via Equation (24);

7 Apply the Gauss–Seidel iterations in Equation (26) to solve the linear system

Equation (25);

4. Numerical Simulations

In the numerical simulations, a circular microphone array of 1 m radius with

M = 64

microphones is used. A

51 \times 51

square grid (

N = 2601

grid points) spanning a 4 m × 4 m plane parallel to the circular array at a distance of 2 m is set. The proposed algorithm adopts a disk-shaped structuring element of the radius equal to 2 grid points. The simulations are conducted on a laptop with an AMD Ryzen 7 5800H 3.20 GHz processor.

For the i-th iteration in Equation (26), the per-grid-point standard deviation of source mapping error is defined as [26]

\begin{matrix} ϵ^{(i)} (f) & = & {[\frac{1}{N} \sum_{n = 1}^{N} {({\tilde{ρ}}_{n}^{(i)} (f) - {\hat{ρ}}_{n} (f))}^{2}]}^{1 / 2}, \end{matrix}

(27)

Define the total sound power on all grid points before applying the proposed algorithm as

\begin{matrix} ρ_{Σ} (f) & = & \sum_{n = 1}^{N} {\hat{ρ}}_{n} (f) . \end{matrix}

(28)

Define the total sound power on all grid points after applying the proposed algorithm as

\begin{matrix} {\tilde{ρ}}_{Σ} (f) & = & \sum_{n = 1}^{N} {\tilde{ρ}}_{n}^{(I)} (f) . \end{matrix}

(29)

Define the total sound power on the grid points within a circle

C_{g_{n}}

centered at a specific

g_{n}

as

\begin{matrix} {\tilde{ρ}}_{g_{n}} (f) & = & \sum_{\forall n \in C_{g_{n}}} {\tilde{ρ}}_{n}^{(I)} (f) . \end{matrix}

(30)

With the above definitions in Equations (28) and (30), respectively, define the overall level error, the specific level error, and the inverse level error as [23]

\begin{matrix} Δ_{Σ} (f) & : = & | {\tilde{ρ}}_{Σ} (f) - ρ_{Σ} (f) |, \end{matrix}

(31)

\begin{matrix} Δ_{g_{n}} (f) & : = & | {\tilde{ρ}}_{g_{n}} (f) - ρ_{Σ} (f) |, \end{matrix}

(32)

\begin{matrix} {\tilde{Δ}}_{g_{n}} (f) & : = & | {\tilde{ρ}}_{g_{n}} (f) - {\tilde{ρ}}_{Σ} (f) |, \end{matrix}

(33)

which evaluates the performance of the proposed algorithm to pinpoint all sources, to pinpoint the major sources, and to separate the major sources.

To evaluate the performance of the proposed algorithm, DAMAS, DAMAS-CG2, DAMAS-CG3 and DAMAS2-v are simulated for comparison. Note that DAMAS-CG2, DAMAS-CG3 and the proposed algorithm set

{\hat{ρ}}_{n} (f) = 0

for the non-selected grid points, which inherently improves performance in terms of

Δ_{Σ} (f)

,

Δ_{g_{n}} (f)

and

{\tilde{Δ}}_{g_{n}} (f)

.

I = 1000

Gauss–Seidel iterations in Equation (26) and

ϵ^{(i)} (f) = 10^{- 5}

in Equation (27) are applied to ensure the algorithm convergence. Define an algorithm’s running time relative to that of DAMAS as T. That is,

T = 100 %

for DAMAS.

4.1. Scenario 1: Single Source

In this scenario, only a single sound source with

ρ_{0} (f)

is presumed. In each of the 1000 Monte Carlo realizations, the sound source locates at the grid point

g_{1301} = {[0, 0, 2]}^{T}

. The constructed acoustic images by the DAS beamforming in Figure 2a, and the proposed algorithm in Figure 2b are shown. The

\tilde{N}

selected grid points by the morphological reconstruction of a proposed algorithm are shown as blue circles in Figure 2a.

The performance metrics are summarized in Table 1. Taking the algorithm time of DAMAS as the reference (100%), DAMAS-CG2 has over 40% algorithm time, DAMAS-CG3 has 11.44% algorithm time, and DAMAS2-v has 15.55% algorithm time (and the proposed algorithm reduces this number to 8.06%). Apparently, the proposed algorithm generates the acoustic image with the localization accuracy comparable to the other algorithms, but with the algorithm time lower than the others.

4.2. Scenario 2: Triple Sources with Unequal Power

In this scenario, three sources are set. In each of the 1000 Monte Carlo realizations, the sound sources are fixed at the grid points

g_{1041} = {[- 0.4, 0.4, 2]}^{T}

,

g_{1301} = {[0, 0, 2]}^{T}

and

g_{1561} = {[0.4, - 0.4, 2]}^{T}

with intensity level

0.7 ρ_{0} (f)

,

ρ_{0} (f)

and

0.5 ρ_{0} (f)

, respectively.

The algorithm performance is shown in Figure 3 and Table 2, similarly to that in Section 4.1. This simulation confirms that the proposed algorithm outputs an accurate acoustic image by not neglecting the weaker sources.

4.3. Scenario 3: Many Sources

In this scenario, 22 spatially distributed sources with center frequency of 2 kHz and identical power

ρ_{0} (f)

are employed, as indicated by the black ’x’ icons in Figure 4. The algorithm performance is shown in Figure 4 and Table 3, similarly to that in Section 4.1. In this very adverse scenario with many sources, the proposed algorithm has a comparable accuracy in acoustic imaging to DAMAS-CG3, but with only about 38% of computation load of DAMAS-CG3. Although DAMAS2-v reduces the algorithm time to 7.85%, which is lower than that of the proposed method, its localization performance drops substantially in this scenario.

4.4. Ablation Experiments

The proposed method comprises three modules: opening by reconstruction, closing by reconstruction, and Otsu’s method. In scenario 3, ablation experiments are performed in three configurations: without opening by reconstruction, without closing by reconstruction, and without both of the two reconstruction operations.

The performance of the algorithm under different configurations is shown in Table 4. All schemes achieve comparable localization performance and exhibit only slight difference in computational time.

The method without opening by reconstruction has no obvious change in running time, yet it results in incomplete removal of non-sound-source regions, as displayed in Figure 5a. The method without closing by reconstruction requires less running time than the proposed method, but it causes hollow cavities to emerge inside sound-source regions, as displayed in Figure 5b. When both reconstruction morphological operations are discarded and only Otsu’s method is applied, the running time decreases. Nevertheless, this approach simultaneously induces hollow cavities inside sound-source areas and fails to fully eliminate non-sound-source regions, as displayed in Figure 5c.

Consequently, the combination of all three steps guarantees that the extracted sound-source regions are the most complete and accurate, as displayed in Figure 5d.

5. Empirical Experiments

Practical experiments are conducted in both the indoor and the outdoor scenarios. A

M = 4 \times 4 = 16

square microphone array is used for real-data acquisition, with a inter-microphone distance of 0.1 m.

I = 1000

Gauss–Seidel iterations in Equation (26) and

ϵ^{(i)} (f) = 10^{- 5}

in Equation (27) are applied to ensure the algorithm convergence. Four NI-9234 data acquisition cards together with an NI-9184 CompactDAQ build the A/D conversion system with the 16-channel simultaneous sampling rate of 51.2 kHz.

In the real environment experiments, the sound source power can be hardly determined due to the background noise, the noticeable reverberation, and the nonideal measurements. Consequently, the metrics

Δ_{Σ} (f)

,

Δ_{g_{n}} (f)

,

{\tilde{Δ}}_{g_{n}} (f)

, and

ϵ^{(I)} (f)

in Section 4 cannot be obtained. Instead, the average source localization error

Δ_{q} : = \frac{1}{J} {∥ q_{j} - g_{j} ∥}_{2}^{2}

is used to assess the accuracy of acoustic imaging, where

q_{j}

represents the source position, J denotes the source number, and

g_{j}

signifies the grid point position as the estimate of

q_{j}

. To assess the grid compression performance, the proposed algorithm is compared with DAMAS-CG2 and DAMAS-CG3 using the empirical data. DAMAS with no grid compression is taken as the reference.

5.1. Scenario 4: Indoor Experiment

The indoor experiment is carried out in a shoebox-shape classroom at the Wangjiang Campus of Sichuan University, with

12.35

m in length,

7.29

m in width, and

3.15

m in height, as shown in Figure 6. A handheld smartphone playing a 2 kHz pure tone signal simulates a single source. The microphone array faces the wall at 1 m distance. The primary background noise comes from the central air conditioning system and the bird calls outside the windows. The sound level meter shows the average environmental noise level is around 45 dB. The virtual grid of

N = 51 \times 51 = 2601

points is on the wall plane spanning an area of

0.94

m in length and

0.67

m in width.

The acoustic image by the DAS beamformer and the proposed algorithm are shown in Figure 7a,b. The source localization error and algorithm time of the competing algorithms are shown in Table 5. It can be seen that the localization errors of various algorithms are at the same level, while the proposed algorithm has the lowest relative algorithm time of

17.1 %

, which is only about

\frac{1}{5}

of DAMAS-CG3, and

\frac{1}{6}

of DAMAS and DAMAS-CG2. Apparently, the significantly lower

\tilde{N}

is one major reason for this reduction.

5.2. Scenario 5: Outdoor Experiment

The outdoor experiment with the same microphone array in Section 5.1 is conducted on the rooftop of a teaching building, as shown in Figure 8. The outdoor environment has a 55 dB background noise, primarily due to the wind weather. A wireless loudspeaker controlled by a smartphone via the Bluetooth connection plays a 2 kHz pure tone signal. Meanwhile, another handheld smartphone playing the same pure tone signal acts as another sound source. The same grid as in Section 5.1 is set on a rectangular plane of

0.93 \times 0.67 =

m² at 1 m distance from the microphone array.

The acoustic image by the DAS beamformer and the proposed algorithm are shown in Figure 9a,b. Apparently, the DAS beamformer cannot separate and locate the two sources in Figure 9a due to the single broad peak of

b (f)

. On the other hand, the proposed algorithm successfully separates the two sound sources with a high spatial resolution. The average localization error and relative algorithm time of the competing algorithms are summarized in Table 6. Surprisingly, the proposed algorithm achieves the lowest localization error with only

\frac{1}{4}

to

\frac{1}{3}

algorithm time of the other deconvolution approaches.

6. Conclusions

A deconvolution approach for sound source localization based on the morphological operations is proposed in this work. Morphological operations are generally used in natural image processing but not acoustic image processing. By incorporating the deconvolution approach, the proposed algorithm implicitly neglects the physics behind the acoustic imaging principle but explores the visual features of acoustic image. This method turns out to be a direct and efficient way to narrow down the grid points on which the sources are likely to locate. Thus, the scale of the linear system relating to the array measurements and the source power can be significantly reduced. Compared to the conventional deconvolution approaches, the proposed method significantly reduces the algorithm time without sacrificing the localization accuracy. Both numerical simulations and in-/outdoor experiments validate the efficacy of the proposed algorithm.

Author Contributions

Conceptualization, Y.I.W.; software, J.S., H.Y. and Q.Q.; validation, J.S.; formal analysis, Y.I.W.; investigation, Y.I.W., J.S. and H.Y.; data curation, Y.I.W. and H.Y.; writing—original draft, H.Y.; writing—review & editing, Y.I.W. and J.S.; visualization, J.S. and Q.Q.; supervision, Y.I.W.; project administration, Y.I.W.; and funding acquisition, Y.I.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Stable Supporting Fund of Acoustic Science and Technology Laboratory under grant number JCKYS2024604SSJS017 and the National Natural Science Foundation of China under grant number 62271333.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare that they have no conficts of interest.

References

Lanslots, J.; Deblauwe, F.; Janssens, K. Selecting sound source localization techniques for industrial applications. J. Sound Vib. 2010, 44, 6. Available online: http://www.sandv.com/jun10.shtml (accessed on 26 February 2026).
Merino-Martínez, R.; Sijtsma, P.; Snellen, M.; Ahlefeldt, T.; Antoni, J.; Bahr, C.J.; Blacodon, D.; Ernst, D.; Finez, A.; Funke, S.; et al. A review of acoustic imaging methods using phased microphone arrays. CEAS Aeronaut. J. 2019, 10, 197–230. [Google Scholar] [CrossRef]
Bocanegra, J.A.; Borelli, D.; Gaggero, T.; Rizzuto, E.; Schenone, C. A novel approach to port noise characterization using an acoustic camera. Sci. Total Environ. 2022, 808, 151903. [Google Scholar] [CrossRef]
Soderman, P.T.; Noble, S.C. Directional microphone array for acoustic studies of wind tunnel models. J. Aircr. 1975, 12, 168–173. [Google Scholar] [CrossRef]
Hald, J. Array designs optimized for both low-frequency NAH and high-frequency Beamforming. In Proceedings of the 33rd International Conference on Noise Control Engineering, Minneapolis, MN, USA, 15–18 August 2004; pp. 1–8. Available online: https://ince.publisher.ingentaconnect.com/contentone/ince/incecp/2004/00002004/00000005/art00010 (accessed on 26 February 2026).
Malboeuf, A.; Snellen, M.; Sijtsma, P.; Simons, D. Improving beamforming by optimization of acoustic array microphone positions. In Proceedings of the 6th Berlin Beamforming Conference, Berlin, Germany, 29 February 2016; pp. 1–24. Available online: https://www.bebec.eu/fileadmin/bebec/downloads/bebec-2016/papers/BeBeC-2016-S5.pdf (accessed on 26 February 2026).
Sarradj, E.; Schulze, C.; Zeibig, A. Identification of noise source mechanisms using orthogonal beamforming. In Proceedings of the Deutsche Jahrestagung für Akustik (DAGA), Dresden, Germany, 14–17 March 2005; Available online: https://www-docs.b-tu.de/fg-akustik/public/veroeffentlichungen/sarradj_orthogonal_novem2005.pdf (accessed on 26 February 2026).
Sarradj, E. A fast signal subspace approach for the determination of absolute levels from phased microphone array measurements. J. Sound Vib. 2010, 329, 1553–1569. [Google Scholar] [CrossRef]
Dougherty, R.P. Functional beamforming for aeroacoustic source distributions. In Proceedings of the 20th AIAA/CEAS Aeroacoustics Conference, Atlanta, GA, USA, 16–20 June 2014; p. 3066. [Google Scholar] [CrossRef]
Yang, Y.; Chu, Z.; Shen, L.; Xu, Z. Functional delay and sum beamforming for three-dimensional acoustic source identification with solid spherical arrays. J. Sound Vib. 2016, 373, 340–359. [Google Scholar] [CrossRef]
Capon, J. High-resolution frequency-wavenumber spectrum analysis. Proc. IEEE 1969, 57, 1408–1418. [Google Scholar] [CrossRef]
Frost, O.L. An algorithm for linearly constrained adaptive array processing. Proc. IEEE 1972, 60, 926–935. [Google Scholar] [CrossRef]
Dougherty, R.P.; Stoker, R.W. Sidelobe suppression for phased array aeroacoustic measurements. In Proceedings of the 4th AIAA/CEAS Aeroacoust Conference, Toulouse, France, 2–4 June 1998; p. 2242. [Google Scholar] [CrossRef]
Sijtsma, P. CLEAN based on spatial source coherence. Int. J. Aeroacoust. 2007, 6, 357–374. [Google Scholar] [CrossRef]
Yardibi, T.; Li, J.; Stoica, P.; Cattafesta, L.N. Sparsity constrained deconvolution approaches for acoustic source mapping. J. Acoust. Soc. Am. 2008, 123, 2631–2642. [Google Scholar] [CrossRef]
Yardibi, T.; Li, J.; Stoica, P.; Zawodny, N.S.; Cattafesta, L.N. A covariance fitting approach for correlated acoustic source mapping. J. Acoust. Soc. Am. 2010, 127, 2920–2931. [Google Scholar] [CrossRef]
Padois, T.; Berry, A. Orthogonal matching pursuit applied to the deconvolution approach for the mapping of acoustic sources inverse problem. J. Acoust. Soc. Am. 2015, 138, 3678–3685. [Google Scholar] [CrossRef] [PubMed]
Bergh, T.F.; Hafizovic, I.; Holm, S. Acoustic imaging of sparse Sources with Orthogonal Matching Pursuit and clustering of basis vectors. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, New Orleans, LA, USA, 5–9 March 2017; pp. 6030–6034. [Google Scholar] [CrossRef]
Brooks, T.F.; Humphreys, W.M. A deconvolution approach for the mapping of acoustic sources (DAMAS) determined from phased microphone arrays. J. Sound Vib. 2006, 294, 856–879. [Google Scholar] [CrossRef]
Brooks, T.F.; Humphreys, W.M. Extension of DAMAS phased array processing for spatial coherence determination (DAMAS-C). In Proceedings of the 12th AIAA/CEAS Aeroacoustics Conference, Cambridge, MA, USA, 8–10 May 2006; p. 2654. [Google Scholar] [CrossRef]
Chardon, G.; Picheral, J.; Ollivier, F. Theoretical analysis of the DAMAS algorithm and efficient implementation of the covariance matrix fitting method for large-scale problems. J. Sound Vib. 2021, 508, 116208. [Google Scholar] [CrossRef]
Padois, T.; Berry, A. Two and three-dimensional sound source localization with beamforming and several deconvolution techniques. Acta Acust. United Acust 2017, 103, 392–400. [Google Scholar] [CrossRef]
Herold, G.; Saradj, E. Performance analysis of microphone array methods. J. Sound Vib. 2017, 401, 152–168. [Google Scholar] [CrossRef]
Dougherty, R.P. Extensions of DAMAS and benefits and limitations of deconvolution in beamforming. In Proceedings of the 11th AIAA/CEAS Aeroacoustics Conference, Monterey, CA, USA, 23–25 May 2005; p. 2961. [Google Scholar] [CrossRef]
Lawson, C.L.; Hanson, R.J. Solving Least Square Problems. In Classics in Applied Mathematics; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1995; pp. 158–173. [Google Scholar] [CrossRef]
Ehrenfried, K.; Koop, L. Comparison of iterative deconvolution algorithms for the mapping of acoustic sources. AIAA J. 2007, 45, 1584–1595. [Google Scholar] [CrossRef]
Zhang, J.; Wen, Y.; Yan, J.; Yang, X.; Chu, Z. Improvement of orthogonal matching pursuit deconvolution beamforming method for acoustic source identification. J. Low Freq. Noise Vib. Act. Control 2023, 42, 209–221. [Google Scholar] [CrossRef]
Li, W.; Zhao, S.; Zhou, C.; Qin, Y.; Zhu, H.; Li, S. Improved fast deconvolution algorithms based on functional beamforming for gas leakage sound source imaging. Measurement 2025, 242, 116238. [Google Scholar] [CrossRef]
Lobato, T.; Sottek, R.; Vorländer, M. Deconvolution with neural grid compression: A method to accurately and quickly process beamforming results. J. Acoust. Soc. Am. 2023, 153, 2073. [Google Scholar] [CrossRef]
Liang, H.; Zhou, G.; Tu, X.; Jakobsson, A.; Ding, X.; Huang, Y. Learning an interpretable end-to-end network for real-time acoustic beamforming. J. Sound Vib. 2024, 591, 118620. [Google Scholar] [CrossRef]
Lyu, M.; Yu, L.; Wang, R.; Fang, Y. Deconvolution of acoustic beamforming maps in interference environments with mean-reverting stochastic differential equations. Mech. Syst. Signal Process. 2025, 237, 113091. [Google Scholar] [CrossRef]
Jia, H.; Yang, F.; Hu, X.; Yang, J. A dual-encoder U-net architecture with prior knowledge embedding for acoustic source mapping. J. Acoust. Soc. Am. 2025, 158, 1767–1782. [Google Scholar] [CrossRef] [PubMed]
Jia, H.; Yang, F.; Tong, J.; Yang, J. A conditional diffusion-based model for high-resolution acoustic source mapping. J. Acoust. Soc. Am. 2026, 159, 1917–1929. [Google Scholar] [CrossRef] [PubMed]
Ma, W.; Liu, X. Improving the efficiency of DAMAS for sound source localization via wavelet compression computational grid. J. Sound Vib. 2017, 395, 341–353. [Google Scholar] [CrossRef]
Ma, W.; Liu, X. DAMAS with compression computational grid for acoustic source mapping. J. Sound Vib. 2017, 410, 473–484. [Google Scholar] [CrossRef]
Ma, W.; Liu, X. Compression computational grid based on functional beamforming for acoustic source localization. Appl. Acoust. 2018, 134, 75–87. [Google Scholar] [CrossRef]
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]

Figure 1. A microphone array and a spatial grid for sound source localization and imaging.

Figure 2. Single source with frequency of 2 kHz. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, and the selected grid points by the proposed algorithm are shown as pink ‘o’; (b) the proposed algorithm:

\tilde{B}

.

Figure 2. Single source with frequency of 2 kHz. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, and the selected grid points by the proposed algorithm are shown as pink ‘o’; (b) the proposed algorithm:

\tilde{B}

.

Figure 3. Triple 2 kHz sources with unequal power. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, while the selected grid points by the proposed algorithm are shown as pink ‘o’; and (b) proposed algorithm:

\tilde{B}

.

Figure 3. Triple 2 kHz sources with unequal power. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, while the selected grid points by the proposed algorithm are shown as pink ‘o’; and (b) proposed algorithm:

\tilde{B}

.

Figure 4. 22 sources with center frequency of 2kHz and equal power. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, while the selected grid points by the proposed algorithm are shown as pink ‘o’; and (b) proposed algorithm:

\tilde{B}

.

Figure 4. 22 sources with center frequency of 2kHz and equal power. The black ‘x’ icon signifies the true source location in acoustic image by (a) DAS: B, while the selected grid points by the proposed algorithm are shown as pink ‘o’; and (b) proposed algorithm:

\tilde{B}

.

Figure 5. Ablation experiments for Scenario 3 in Section 4.3: The selected grid points (pink ‘o’) by the proposed algorithm (a) w/o

γ (B)

in Equation (15), (b) w/o

φ (B)

in Equation (16), (c) w/o

φ (γ (B))

in Equation (17), and (d) as it is.

Figure 5. Ablation experiments for Scenario 3 in Section 4.3: The selected grid points (pink ‘o’) by the proposed algorithm (a) w/o

γ (B)

in Equation (15), (b) w/o

φ (B)

in Equation (16), (c) w/o

φ (γ (B))

in Equation (17), and (d) as it is.

Figure 6. A 16-elements square microphone array with an optical camera is deployed in a classroom.

Figure 7. A smartphone emitting the 2 kHz puretone signal. The white ‘x’ icon signifies the source position in the natural image captured by the optical camera, with the acoustic image output by (a) DAS, and (b) the proposed algorithm.

Figure 8. A 16-elements square microphone array with an optical camera is deployed on the rooftop of a teaching building.

Figure 9. A loudspeaker and a smartphone emitting the 2 kHz puretone signals act as two sound sources in space. The white ‘x’ icon signifies the source positions in the natural image captured by the optical camera, with the acoustic image output by (a) DAS, and (b) the proposed algorithm.

Table 1. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 1: a single source of intensity level

ρ_{0} (f)

.

Table 1. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 1: a single source of intensity level

ρ_{0} (f)

.

Algorithm	T	$\frac{Δ_{Σ} (f)}{ρ_{0} (f)}$	$\frac{Δ_{g_{n}} (f)}{ρ_{0} (f)}$	$\frac{{\tilde{Δ}}_{g_{n}} (f)}{ρ_{0} (f)}$	$ϵ^{(I)} (f)$
DAMAS	$(100.00 \pm 0.00)$ %	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$1.54 \times 10^{- 4} \pm 1.47 \times 10^{- 5}$
DAMAS-CG2	$(42.73 \pm 5.63)$ %	$7.91 \times 10^{- 3} \pm 7.45 \times 10^{- 4}$	$7.91 \times 10^{- 3} \pm 7.45 \times 10^{- 4}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$1.55 \times 10^{- 4} \pm 1.45 \times 10^{- 5}$
DAMAS-CG3	$(11.44 \pm 1.87)$ %	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$1.54 \times 10^{- 4} \pm 1.45 \times 10^{- 5}$
DAMAS2-v	$(15.55 \pm 1.12)$ %	$8.57 \times 10^{- 2} \pm 2.38 \times 10^{- 4}$	$8.57 \times 10^{- 2} \pm 2.38 \times 10^{- 4}$	$5.35 \times 10^{- 10} \pm 1.98 \times 10^{- 11}$	$8.25 \times 10^{- 4} \pm 1.01 \times 10^{- 5}$
Proposed	$(8.06 \pm 1.27) %$	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$7.84 \times 10^{- 3} \pm 7.38 \times 10^{- 4}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$1.54 \times 10^{- 4} \pm 1.45 \times 10^{- 5}$

Table 2. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 2: triple sources of intensity level

0.7 ρ_{0} (f)

,

ρ_{0} (f)

and

0.5 ρ_{0} (f)

, respectively.

Table 2. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 2: triple sources of intensity level

0.7 ρ_{0} (f)

,

ρ_{0} (f)

and

0.5 ρ_{0} (f)

, respectively.

Algorithm	T	$\frac{Δ_{Σ} (f)}{ρ_{0} (f)}$	$\frac{Δ_{g_{n}} (f)}{ρ_{0} (f)}$	$\frac{{\tilde{Δ}}_{g_{n}} (f)}{ρ_{0} (f)}$	$ϵ^{(I)} (f)$
DAMAS	$(100.00 \pm 0.00)$ %	$9.39 \times 10^{- 3} \pm 1.27 \times 10^{- 3}$	$9.82 \times 10^{- 3} \pm 1.28 \times 10^{- 3}$	$4.28 \times 10^{- 4} \pm 8.53 \times 10^{- 5}$	$2.08 \times 10^{- 4} \pm 1.52 \times 10^{- 5}$
DAMAS-CG2	$(73.87 \pm 8.65)$ %	$9.40 \times 10^{- 2} \pm 1.30 \times 10^{- 3}$	$9.40 \times 10^{- 2} \pm 1.30 \times 10^{- 3}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$1.08 \times 10^{- 3} \pm 1.48 \times 10^{- 5}$
DAMAS-CG3	$(10.19 \pm 1.91)$ %	$9.81 \times 10^{- 3} \pm 1.28 \times 10^{- 3}$	$9.81 \times 10^{- 3} \pm 1.28 \times 10^{- 3}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$2.08 \times 10^{- 4} \pm 1.52 \times 10^{- 5}$
DAMAS2-v	$(14.23 \pm 0.93)$ %	$2.93 \times 10^{- 1} \pm 2.57 \times 10^{- 3}$	$2.93 \times 10^{- 1} \pm 2.57 \times 10^{- 3}$	$2.94 \times 10^{- 6} \pm 6.69 \times 10^{- 7}$	$4.90 \times 10^{- 3} \pm 1.97 \times 10^{- 5}$
Proposed	$(7.41 \pm 1.34) %$	$9.81 \times 10^{- 3} \pm 1.28 \times 10^{- 3}$	$9.81 \times 10^{- 3} \pm 1.28 \times 10^{- 3}$	$0.00 \times 10^{0} \pm 0.00 \times 10^{0}$	$2.08 \times 10^{- 4} \pm 1.52 \times 10^{- 5}$

Table 3. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 3: 22 sources of equal intensity level

ρ_{0} (f)

.

Table 3. Algorithm performance comparison with 1000 Monte Carlo runs under scenario 3: 22 sources of equal intensity level

ρ_{0} (f)

.

Algorithm	T	$\frac{Δ_{Σ} (f)}{ρ_{0} (f)}$	$\frac{Δ_{g_{n}} (f)}{ρ_{0} (f)}$	$\frac{{\tilde{Δ}}_{g_{n}} (f)}{ρ_{0} (f)}$	$ϵ^{(I)} (f)$
DAMAS	$(100.00 \pm 0.00)$ %	$4.72 \times 10^{- 1} \pm 3.49 \times 10^{- 3}$	$5.48 \times 10^{- 1} \pm 3.61 \times 10^{- 3}$	$7.87 \times 10^{- 2} \pm 1.33 \times 10^{- 3}$	$4.62 \times 10^{- 2} \pm 1.25 \times 10^{- 5}$
DAMAS-CG2	$(71.45 \pm 1.11)$ %	$4.77 \times 10^{0} \pm 3.16 \times 10^{- 3}$	$4.77 \times 10^{0} \pm 3.16 \times 10^{- 3}$	$2.66 \times 10^{- 15} \pm 2.17 \times 10^{- 15}$	$4.04 \times 10^{- 2} \pm 1.89 \times 10^{- 5}$
DAMAS-CG3	$(52.62 \pm 3.13)$ %	$4.69 \times 10^{- 1} \pm 3.40 \times 10^{- 3}$	$5.48 \times 10^{- 1} \pm 3.48 \times 10^{- 3}$	$7.89 \times 10^{- 2} \pm 1.12 \times 10^{- 3}$	$4.62 \times 10^{- 2} \pm 1.44 \times 10^{- 5}$
DAMAS2-v	$(7.85 \pm 0.24)$ %	$1.83 \times 10^{2} \pm 3.24 \times 10^{- 1}$	$1.42 \times 10^{2} \pm 2.43 \times 10^{- 1}$	$4.10 \times 10 \pm 8.84 \times 10^{- 2}$	$1.46 \times 10^{- 1} \pm 1.61 \times 10^{- 4}$
Proposed	$(19.86 \pm 2.38) %$	$4.84 \times 10^{- 1} \pm 3.54 \times 10^{- 3}$	$4.95 \times 10^{- 1} \pm 3.57 \times 10^{- 3}$	$1.12 \times 10^{- 2} \pm 1.02 \times 10^{- 3}$	$4.75 \times 10^{- 2} \pm 1.67 \times 10^{- 5}$

Table 4. Algorithm performance of ablation experiments with 1000 Monte Carlo under scenario 3.

Algorithm	T	$\frac{Δ_{Σ} (f)}{ρ_{0} (f)}$	$\frac{Δ_{g_{n}} (f)}{ρ_{0} (f)}$	$\frac{{\tilde{Δ}}_{g_{n}} (f)}{ρ_{0} (f)}$	$ϵ^{(I)} (f)$
DAMAS	$(100.00 \pm 0.00)$ %	$4.695 \times 10^{- 1} \pm 3.5 \times 10^{- 3}$	$5.483 \times 10^{- 1} \pm 3.6 \times 10^{- 3}$	$7.87 \times 10^{- 2} \pm 1.3 \times 10^{- 3}$	$4.622 \times 10^{- 2} \pm 1 \times 10^{- 5}$
w/o $γ (B)$ in Equation (15)	$(21.76 \pm 1.99)$ %	$5.04 \times 10^{- 1} \pm 3.52 \times 10^{- 3}$	$5.22 \times 10^{- 1} \pm 3.56 \times 10^{- 3}$	$1.74 \times 10^{- 2} \pm 9.85 \times 10^{- 4}$	$4.62 \times 10^{- 2} \pm 1.31 \times 10^{- 5}$
w/o $φ (B)$ in Equation (16)	$(16.29 \pm 1.14)$ %	$5.00 \times 10^{- 1} \pm 3.54 \times 10^{- 3}$	$5.14 \times 10^{- 1} \pm 3.57 \times 10^{- 3}$	$1.35 \times 10^{- 2} \pm 9.78 \times 10^{- 4}$	$4.75 \times 10^{- 2} \pm 1.67 \times 10^{- 5}$
w/o $φ (γ (B))$ in Equation (17)	$(17.66 \pm 1.46)$ %	$5.14 \times 10^{- 1} \pm 3.49 \times 10^{- 3}$	$5.33 \times 10^{- 1} \pm 3.66 \times 10^{- 3}$	$1.91 \times 10^{- 2} \pm 1.43 \times 10^{- 3}$	$4.63 \times 10^{- 2} \pm 1.31 \times 10^{- 5}$
Proposed	$(19.86 \pm 2.38) %$	$4.84 \times 10^{- 1} \pm 3.54 \times 10^{- 3}$	$4.95 \times 10^{- 1} \pm 3.57 \times 10^{- 3}$	$1.12 \times 10^{- 2} \pm 1.02 \times 10^{- 3}$	$4.75 \times 10^{- 2} \pm 1.67 \times 10^{- 5}$

Table 5. Scenario 4: Single source indoor.

Algorithm	$\tilde{N}$	T	$Δ_{q}$
DAMAS	2601	100%	0.03
DAMAS-CG2	2601	102.6%	0.03
DAMAS-CG3	2490	86.3%	0.03
Proposed	901	17.1%	0.03

Table 6. Scenario 5: double sources outdoor.

Algorithm	$\tilde{N}$	T	$Δ_{q}$
DAMAS	2601	100%	0.07
DAMAS-CG2	2601	101.1%	0.06
DAMAS-CG3	2408	81.5%	0.16
Proposed	1652	25.7%	0.05

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, Y.I.; Song, J.; Yin, H.; Quan, Q. A Fast Sound Source Mapping by Morphological Operations on Acoustic Images. Mathematics 2026, 14, 1865. https://doi.org/10.3390/math14111865

AMA Style

Wu YI, Song J, Yin H, Quan Q. A Fast Sound Source Mapping by Morphological Operations on Acoustic Images. Mathematics. 2026; 14(11):1865. https://doi.org/10.3390/math14111865

Chicago/Turabian Style

Wu, Yue Ivan, Jiahao Song, Hang Yin, and Qinhao Quan. 2026. "A Fast Sound Source Mapping by Morphological Operations on Acoustic Images" Mathematics 14, no. 11: 1865. https://doi.org/10.3390/math14111865

APA Style

Wu, Y. I., Song, J., Yin, H., & Quan, Q. (2026). A Fast Sound Source Mapping by Morphological Operations on Acoustic Images. Mathematics, 14(11), 1865. https://doi.org/10.3390/math14111865

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Fast Sound Source Mapping by Morphological Operations on Acoustic Images

Abstract

1. Introduction

2. Problem Formulation

3. Proposed Method

3.1. Acoustic Imaging by Natural Image Processing

3.1.1. Erosion and Dilation

3.1.2. Morphological Reconstruction

3.2. Grid Points Selection

3.3. Dimension-Reduced Linear Equation System

3.4. Algorithm Summary

4. Numerical Simulations

4.1. Scenario 1: Single Source

4.2. Scenario 2: Triple Sources with Unequal Power

4.3. Scenario 3: Many Sources

4.4. Ablation Experiments

5. Empirical Experiments

5.1. Scenario 4: Indoor Experiment

5.2. Scenario 5: Outdoor Experiment

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI