3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm

Zhang, Fan; Li, Shaobo; Gao, Haolong; Wu, Yunlong

doi:10.3390/jmse14070597

Open AccessArticle

3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm

by

Fan Zhang

¹,

Shaobo Li

^2,3,*

,

Haolong Gao

² and

Yunlong Wu

^2,3

¹

School of Architectural Engineering, Wuhan Railway Bridge Vocational College, Wuhan 430090, China

²

School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, China

³

Hubei Key Laboratory of Information Technology, China University of Geosciences, Wuhan 430074, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2026, 14(7), 597; https://doi.org/10.3390/jmse14070597

Submission received: 24 February 2026 / Revised: 17 March 2026 / Accepted: 22 March 2026 / Published: 24 March 2026

(This article belongs to the Special Issue Advanced Studies in Marine Structures)

Download

Browse Figures

Versions Notes

Abstract

Three-dimensional (3D) sonar overcomes the limitations of traditional measurement methods regarding imaging coverage and accuracy, making it indispensable for underwater structure monitoring. However, complex underwater environments often introduce significant noise into 3D sonar data, degrading monitoring performance. To address this, we propose a geometry-based filtering method. First, Total Least Squares (TLS) is employed to construct local spatial features, which guides a region-growing segmentation based on normal vector attributes. Subsequently, the resulting clusters are refined using these local geometric characteristics. Finally, statistical filtering is applied to eliminate residual outliers from a local to a global scale. Experimental results demonstrate that the proposed method achieves F1 scores of 78.65% and 84.49% in outlier removal, effectively suppressing noise while preserving structural integrity.

Keywords:

3D sonar; denoising; spatial feature; region growth; structural feature

1. Introduction

With the advancement of marine science and engineering, the demand for underwater structure monitoring has been steadily increasing. Traditional monitoring methods are primarily based on optical measurement techniques, often requiring diver-assisted operations [1]. These methods impose strict water quality requirements and pose risks to personnel. At present, multibeam sonar enables three-dimensional underwater point cloud mapping; however, the density of the measured points depends on the number of beams, typically 512 or 1024, which fails to meet high-precision monitoring requirements [2]. Other sonar systems, such as mechanically scanned sonar and side-scan sonar, provide high-resolution two-dimensional imaging but lack three-dimensional monitoring capabilities [3].

The emergence of 3D (three-dimensional) sonar technology has addressed the limitations of optical imaging and conventional sonar measurement techniques, such as multibeam and side-scan sonar, in terms of imaging coverage and measurement accuracy [2]. This advancement provides essential technical support for high-precision three-dimensional monitoring of underwater structures. However, 3D sonar measurements are susceptible to interference from complex underwater environments, often resulting in substantial noise, which hinders its broader application. For instance, suspended particles in the water generate clustered outliers, multiple reflections of acoustic waves between underwater objects introduce spurious noise points, and surface scattering from object surfaces and volume scattering from within objects further degrade measurement accuracy [4]. Thus, effective data filtering is urgently needed. However, challenges such as acoustic shadows and complex underwater structure geometries complicate filtering processes [5]. Overall, current 3D sonar filtering techniques remain underdeveloped relative to their extensive application demands, making it difficult to satisfy the growing need for underwater monitoring. In particular, with the increasing construction of offshore wind farms and the rising demand for underwater infrastructure maintenance, enhancing 3D sonar data filtering capabilities is crucial for advancing automation and intelligent applications in underwater structure monitoring [6,7].

To address the filtering challenges in 3D sonar data, we review the existing research on underwater point cloud filtering.

Traditional Methods

Early international studies on bathymetric data outlier detection predominantly employed methods such as median/mean filtering [8], threshold filtering [9], and angle and gradient filtering [10]. These methods cannot remove cluster noise, since they cannot consider a large area information.

Filtering techniques based on trend surfaces have been widely applied in sonar data processing, which can take a wide range of information into consideration [11]. These methods fit a quadratic polynomial surface function to approximate the actual seabed topography and identify outliers based on deviations from the fitted values. While easy to implement, these algorithms perform poorly in areas with complex terrain variations. To improve trend surface filtering, an enhanced algorithm was presented incorporating the influence domain of natural neighboring points, achieving localized trend surface fitting and filtering. However, this approach suffers from high computational costs when processing large datasets [12].

Based on the surface filtering, assuming that outliers are all above the surface, a special method was proposed, the cloth simulation filtering algorithm, which was introduced for removing gross errors from underwater point clouds [13,14]. When all outliers are above the surface, it can achieve good performance. But it cannot deal with more complex cases. To better address complex noise in 3D sonar data, He et al. proposed a partitioned filtering method that constructs local coordinate systems for point cloud blocks, fits a trend surface, and applies Grubbs’ test for adaptive threshold-based noise removal. However, this method struggles with clustered noise [15]. Overall, despite the success of statistical methods in large-scale point cloud denoising, they are highly sensitive to parameter selection, making them challenging to apply in complex environments.

Apart from the above surface-based methods, there are some statistical methods. Traditional LiDAR point cloud filtering methods often include slope-based filtering algorithms, morphological filtering algorithms, fitting-based filtering algorithms, and filtering based on irregular triangular networks (TIN) [16,17,18,19]. Slope-based filtering algorithms differentiate outliers based on slope variations between neighboring points. Morphological filtering algorithms apply mathematical morphology operations (e.g., erosion, dilation, and opening) to remove outliers. Fitting-based filtering models surfaces using mathematical functions (e.g., plane or surface fitting) to determine whether a point belongs to a structural surface or is an outlier. TIN-based filtering constructs a triangulated irregular network (TIN) to analyze relative height relationships for noise removal. However, most of these methods focus on terrain data and struggle with complex underwater structures in point cloud data.

2.: Deep Learning-Based Methods

Recent advancements in deep learning have introduced innovative approaches to noise removal. Neural network architectures [20,21,22] can be trained to learn complex patterns in point clouds and effectively identify outliers, enabling end-to-end data filtering. Rakotosaona et al. [23] leveraged PCPNet [24] for the robust processing of densely sampled point clouds with significant noise. Hu et al. introduced RandLA-Net [25], an efficient and lightweight architecture for point-wise classification in large-scale point clouds, effectively identifying and removing anomalies. However, deep learning for underwater point cloud processing requires large-scale annotated underwater datasets, necessitating extensive underwater operations and auxiliary judgment using underwater imaging equipment, which significantly increases the costs. As a result, despite the widespread adoption of 3D sonar in underwater structure monitoring, no open datasets currently exist, and limited sample availability hinders the development of deep learning models, increasing the difficulty of intelligent processing [26].

Aimed at that, we propose a method to capture the structure information of the 3D sonar point cloud data, and on this basis, we achieve the removal of both far-surface and near-surface outliers, as well as a cluster of outliers in seafloor and structures with near vertical surface. The contributions of our work are listed as follows:

We employ plane features to effectively characterize the geometric attributes of the point cloud data.
These features are leveraged to precisely describe the inherent properties of the structural components within the scene.
By exploiting these distinct characteristics, our method successfully differentiates the structural point cloud from noise, enabling robust separation.

The core contribution of our work lies in the adaptation and systematic integration of these techniques specifically for 3D sonar data.

The rest of this paper is organized as follows. Section 2 introduces the theory and characteristics of 3D sonar data. Section 3 gives a detailed description of the proposed method. Section 4 and Section 5 describe and analyze the experimental results. Finally, our conclusions are drawn in Section 6. The flowchart of the proposed method is shown in Figure 1.

2. Background

2.1. Imaging Process of 3D Sonar

The BlueView 3D sonar system performs underwater scanning by leveraging the fundamental principle of acoustic ranging. In operation, it emits high-frequency acoustic pulses at 1.35 MHz with a pulse repetition rate of 40 Hz, forming a narrow sectorial scan field of 45° × 1°. Each pulse comprises 256 vertically spaced acoustic beams, with an angular separation of 0.178° between adjacent beams. Upon reception of echoes reflected from underwater targets and topographical features, the system employs beam forming techniques in conjunction with amplitude and phase analysis to determine the spatial coordinates of the detected points. A computer-controlled mechanism then drives the sonar transducer to execute a full 360° horizontal rotation, progressively capturing spatial data from all directions. The result is a high-resolution 3D point cloud that reconstructs the structural profile of the submerged environment or object.

2.2. Noise Characteristics

Water Column Interference: During data acquisition, the 3D sonar system captures a continuous series of acoustic returns, including signals reflected not only from the intended target but also from scatterers within the water column. The presence of biological entities such as plankton and fish within the water column can introduce erroneous high-intensity reflections. These spurious returns, if mis-classified as valid targets, manifest as outliers in the resulting dataset.
Scattering-Induced Imaging Artifacts: Sonar imagery is inherently affected by multi-path and scattering effects, particularly over uneven or rough seafloor terrains. These interactions generate speckle-like anomalies, scattering artifacts, that contaminate the measurements. Unlike the relatively uniform noise seen in airborne LiDAR data, sonar signals are susceptible to a richer variety of interference, leading to higher complexity in outlier suppression and data interpretation.
Acoustic Shadows: In contrast to bathymetric or airborne LiDAR systems, 3D sonar typically operates at shorter ranges from the surveyed surface. As a result, occlusion effects—referred to as acoustic shadows—are more prevalent. These regions of missing data introduce structural discontinuities, further complicating the interpretation and processing of the point cloud.

In summary, the first factor is the predominant source of noise in 3D sonar datasets, while the second and third introduce significant challenges in outlier identification. Noise is very complex, and the structural characteristics of the structure are significantly different from noise. We will carry out subsequent work based on this point.

3. Methods

3.1. 3D Spatial Feature

In point cloud processing, traditional methods typically employ Least Squares (LS) fitting to estimate local surface normal vectors by computing the covariance matrix of the point cloud and performing eigenvalue decomposition. However, LS assumes that errors exist only in the z-direction, neglecting the fact that measurement errors can occur in all three coordinate directions. To achieve a more robust normal estimation, this study adopts the Total Least Squares (TLS) method [15], which solves for local plane parameters in three-dimensional space using Singular Value Decomposition (SVD).

The point set A = {p₁(x₁, y₁, z₁), p₂(x₂, y₂, z₂),…, p_m(x_m, y_m, z_m)} can also be represented as

P = \{p_{i} | p_{i} = {(x_{i}, y_{i}, z_{i})}^{T} \in R^{3}, i = 1,2, \dots, m\}

[15]. We select the k neighbor set of

p_{j} \in P

:

{P_{i}} = {{(x_{i}, y_{i}, z_{i})}^{T}, i \in P (p_{j}, k)}

(1)

P (p_{j}, k)

is the index of

p_{j}

in point set

P

. Then, the geometric center of the neighboring points of

p_{j}

is

\bar{P}

, where

\bar{P} = \frac{1}{k} \sum_{i = 1}^{k} {{P}_{i}} = [\begin{matrix} \bar{x} \\ \bar{y} \\ \bar{z} \end{matrix}]

(2)

In TLS, {P_i} is used to construct the decentralized matrix M

M = {[\begin{matrix} \begin{matrix} x_{1} - \bar{x} & y_{1} - \bar{y} & z_{1} - \bar{z} \end{matrix} \\ \begin{matrix} x_{2} - \bar{x} & y_{2} - \bar{y} & z_{2} - \bar{z} \end{matrix} \\ \begin{matrix} ⋮ & ⋮ & ⋮ \end{matrix} \\ \begin{matrix} x_{k} - \bar{x} & y_{k} - \bar{y} & z_{k} - \bar{z} \end{matrix} \end{matrix}]}_{k \times 3}

(3)

SVD is then applied to M:

M = U Σ V^{T}

(4)

where

Σ = d i a g (σ_{1}, σ_{2}, σ_{3})

is the singular matrix, satisfying

σ_{1} > σ_{2} > σ_{3}

. The last column of the right singular matrix (corresponding to

σ_{3}

) is the normal vector of

p_{j}

.

A matrix based on local three-dimensional point cloud statistics can divide LiDAR data into spherical, linear and planar structures, as shown in Figure 2. Therefore, singular values were used to describe the structural features of the neighborhood space of points:

P_{s c a t t e r} = \frac{σ_{3}}{σ_{1}}, P_{l i n e a r} = \frac{σ_{1} - σ_{2}}{σ_{1}}, P_{s u r f a c e} = \frac{σ_{2} - σ_{3}}{σ_{1}}

(5)

If $σ_{1} \approx σ_{2} {\approx σ}_{3}$ , then $P_{s c a t t e r} \approx 1$ , a spherical structure exists;

If $σ_{1} ≫ σ_{2} {\approx σ}_{3}$ , then $P_{l i n e a r} \approx 1$ , a line-like structure exists;

If $σ_{1} \approx σ_{2} {≫ σ}_{3}$ , then $P_{s u r f a c e} \approx 1$ , a plane-like structure exists.

As shown in (6), we select the most like structure as the

F_{p_{j}}

of

p_{j}

.

F_{p_{j}} = argmax {P_{s c a t t e r}, P_{l i n e a r}, P_{s u r f a c e}}

(6)

At this point, the TLS method is used to calculate the normal vector and structural features of the points in 3D space, and the above operations are performed on all points, providing a basis for the growth segmentation of the subsequent point cloud region and noise removal. In the next section, we will aggregate similar structures.

3.2. Region Growth Segmentation of Point Cloud Region Based on Spatial Features

The region growing algorithm is adopted to segment a point cloud dataset A = {p₁(x₁, y₁, z₁), p₂(x₂, y₂, z₂),…, p_m(x_m, y_m, z_m)} into n clusters with distinct attributes, forming a set of point cloud clusters C{p₁, p₂, …} and Clusters{C₁, C₂, …, C_n}. For a point cloud dataset A containing m points obtained from object scanning, we extract the target region, while removing the surrounding noise requires region growing segmentation based on the 3D spatial distribution of the points.

The region growing algorithm progressively aggregates points with similar attributes to identify and segment different objects in the point cloud, thereby isolating the target region, namely the area where interested structures exist. In point cloud A, there are often spatial distribution differences between surfaces of structures or between surfaces and noise points, whereas points belonging to the same surface or adjacent regions typically exhibit smaller characteristics. To extract surfaces and separate noise points, we introduce the definition of fitting residuals S_d. For any point ∀ p(x, y, z) ∈ A{p₁, p₂, …, p_m}, the distance from p to the plane S fitted by its k-nearest neighbors is given by

S_{d} = \frac{| n_{x} (x - \bar{x}) + n_{y} (y - \bar{y}) + n_{z} (z - \bar{z}) |}{\sqrt{{n_{x}}^{2} + {n_{y}}^{2} + {n_{z}}^{2}}}

(7)

where

(\bar{x}, \bar{y}, \bar{z})

is the average coordinate of the neighboring of p. (

n_{x}

,

n_{y}

,

n_{z}

) is the normal vector of p.

At the same time, different plane objects are segmented according to the angle difference of normal vectors. For the normal vector

N_{p i}

,

N_{p j}

of any point p_i and p_j in A, the angle difference between them is

θ_{ij} = a r c c o s (\frac{|N_{p i} \times N_{p j}|}{‖N_{p i}‖ \times ‖N_{p j}‖})

(8)

Then, the regional growth segmentation process is as follows:

1.: Set three thresholds according to the spatial distribution of the midpoint cloud of A, $T_{S_{d}}$ , $T_{θ_{1}}$ , $T_{θ_{2}}$ , $T_{S_{d}}$ is the residual threshold fitted for the point to the plane; $T_{θ_{1}}$ is the included angle difference threshold (small), and $T_{θ_{2}}$ is the included angle difference threshold (large), as shown in Figure 3.
2.: Find any point p from the undivided points in A, incorporate the point into the empty seed point set seeds{} and point cloud cluster C₁{}, and mark p as segmented.
3.: Starting from point p, find all points in its neighborhood P{p₁, p₂, …, p_k}, if there is point p_i in P, and the attribute difference between point p_i and point p is $θ_{i}$ < $T_{θ_{2}}$ , then p_i is added in C₁{}. If $S_{d_{i}}$ < $T_{S_{d}}$ , p_i is added to seeds{}. Remove p from seeds{}.
4.: Iterate over the remaining points in seeds{}, and repeat step 3 until seeds{} is empty, at which point cloud cluster C₁ is stored.
5.: Repeat steps (2)–(4) until all points in A have been processed. The storage of Clusters{C₁, C₂, …, C_n} is completed.

After the regional growth segmentation of all points, according to the point characteristics given by Equation (6), we can analyze any point cloud cluster C_i{p₁, p₂, …}, and its features are expressed as

F_{C_{i}} = m a x \{\sum F_{p}, p \in C_{i}\}

(9)

Then, we can describe the structure features of Clusters{C₁, C₂, …, C_n} further.

3.3. Noise Removal Considering the Spatial Distribution and Structural Features of Point Clouds

The spatial characteristics of the 3D point cloud have been achieved, the regional growth algorithm makes the point clouds with the same spatial distribution gather together, and then, it is necessary to accurately extract the target object and remove noise according to its spatial distribution and structural characteristics.

Since point cloud dataset A is obtained by scanning structures, which contain a lot of planar information and surrounding noise points, some noise points will inevitably form a few planar structures, but these noises are often sparse. Therefore, it is necessary not only to adopt structural feature constraints but also to remove them by a filtering method.

Then, the regional growth segmentation process is as follows:

Statistical analysis of spatial distance of point cloud dataset A. Perform spatial calculation for the neighborhood space of all points in A. For ∀p_i ∈ A, the average neighborhood space distance of point p_i (x_i, y_i, z_i) to its k nearest neighbor set {P_i} is

p_{r}^{i} = \frac{\sum_{j}^{k} {‖p_{i} - p_{j}‖}_{2}}{k}

(10)

where ||p_i − p_j||₂ represents the distance between p_i and p_j.

2.: The spatial distance calculation of Clusters{C₁, C₂, …, C_n}. As for any C_i{p₁, p₂, …} ∈ Clusters{C₁, C₂, …, C_n}, the neighboring distance of C_i is

C_{r}^{i} = \frac{\sum_{j}^{m} p_{r}^{j}}{m}

(11)

3.: Object extraction and noise removal with structural features and spatial distance constraints. The point cloud clusters that meet the characteristics of the planar structure and are relatively clustered in spatial distribution are extracted (in our case, the structures are planar like structures):

\{\begin{matrix} F_{C_{i}} = S u r f a c e \\ C_{r}^{i} < T_{r} \end{matrix}

(12)

where T_r is the threshold of the neighboring distance.

If

C_{i}

satisfies (12), it is the point cluster of structures. This parameter is related to the performance parameters of the sonar, and usually it can be calculated from the angular resolution of the sonar and the detection distance. All the point clouds that meet the conditions are clustered together; that is, the structure point cloud is extracted, and the noise has been removed.

4. Experiments

4.1. 3D Sonar Point Cloud Data

The BV5000 3D Mechanical Scanning Sonar (Teledyne BlueView, Bothell, WA, USA) was used to observe each cabin, and the observation results are shown in the table below. There is a circular blind spot at the bottom, which is caused by the way the 3D sonar works. Take BV5000 as an example to introduce the three-dimensional sonar system: BV5000 has two models, and they work at the frequencies of 2.25 MHz and 1.35 MHz, respectively. Table 1 describes the main system parameters of the BV5000.

The data used in this study are from Changjiang underwater caisson observation, and the dataset contains caissons. Caisson foundations are widely employed in engineering construction, typically installed by excavating soil from beneath the structure to facilitate sinking. For this study, two representative regions within the research area were selected for experimentation, encompassing both structural data and surrounding noise (see Figure 4; noise clusters are highlighted in the red box). The two datasets, denoted as Data 1 and Data 2, comprise 2,624,672 and 2,417,866 points, respectively.

4.2. The Experimental Process

In this experiment, C++ and PCL library are used for point cloud data processing. The experimental process of Data 1 was analyzed to verify the effectiveness of the proposed method. The comparative analysis mainly focused on the following core steps: (1) extracting the three-dimensional spatial features of the point cloud based on the TLS; (2) point cloud segmentation based on region growth algorithm; (3) target extraction by combining the structural characteristics and spatial distribution information of the point cloud. For our method, the 0.1 m setting of the point-to-plane fitting residual, the angle threshold is 20°.

According to the method proposed in Section 2.1 of this paper, the three-dimensional sonar point cloud is processed in the experiment. First, the TLS method is adopted, the number of neighborhood points k = 30 is set, and the local features of the point cloud in three-dimensional space are solved by singular value decomposition. As shown in Figure 5a, yellow points represent scattered points, red points represent linear structure points, and blue points represent planar structure points. The experimental results show that this method can identify the spatial structure features of different points accurately. However, there are still some mis-classification points in the planar area, which actually constitute the structure of the structure, so preserving these points is essential for the complete reconstruction of the structure area.

In fact, it can be observed that the main structure of the caisson presents a regular geometric shape: the bottom surface of the caisson’s shaft is square, the sides are approximately rectangular, the bottom is wider, tapering towards the middle section, the top is slightly narrower, and the length and width of the bottom are approximately 10 m, with a shaft height of about 10 m. The bottom of the data is the underwater terrain point cloud. This part contains circular voids, which are caused by the blind area of the sonar scanning. At areas with terrain undulations, there will also be sonar shadows. When the terrain rises, echo data will be displayed on the side close to the sonar equipment in the raised part, while the other side at the back is affected by terrain obstruction and has data missing, forming a shadow. Similarly, in the terrain depression area, shadows will also be produced. Unlike the raised part, in the depression area of the terrain, a shadow will appear on the side close to the sonar equipment.

In order to further optimize the point cloud segmentation, the region growth algorithm is introduced, and the point to plane fitting residual and normal vector angle are combined to optimize the point cloud clustering. In view of the adjustment of the mis-classification points in Figure 5a, the experiment set three times the standard deviation of the mean value of the fitting residual threshold and set the angle threshold of the normal vector = 5° to effectively distinguish different planes and noise points. The segmentation results are shown in Figure 5b. Different colors represent different point cloud clusters. The purple point cloud area represents the structural point cloud, and its recognition effect is significantly improved, while there are still some discrete point cloud clusters around.

According to the obtained point cloud structural characteristics (Figure 5a), the spatial structure characteristics of the point cloud cluster are analyzed based on the equation. The structural characteristics of the point cloud after regional growth are shown in Figure 5c. This shows that the method can strike a balance between local structural features (Figure 5a) and global region growth segmentation (Figure 5b) and obtain optimal feature characterization. However, there are still limitations in selecting target point clouds only by relying on spatial structure features. For example, the noise points in the red box area in Figure 5c are easily misidentified as planar point clouds due to their dense distribution and proximity to the surface. Therefore, in order to further remove interference points, the filtering method described in Section 3.3 was used to optimize the experiment.

The neighborhood spatial analysis of point cloud dataset is carried out, the average neighborhood spatial distance of each point p is calculated, and the whole neighborhood distance of each point cloud cluster C_i is calculated. Based on the spatial distribution difference between the discrete point cloud cluster and the target point cloud cluster, the threshold of neighborhood spatial distance is set as the exclusion criterion.

The experimental results are shown in Figure 5d. The black point cloud represents the noise point cloud identified by filtering method in Section 3.3, and the remaining color point clouds meet the neighborhood spatial distance threshold. These point clouds will be further combined with Equation (12) for feature verification, so as to screen the final target point convergence.

Finally, according to the results (Figure 5d) and the structural characteristics of point clouds after regional growth (Figure 5c), the structural point clouds (purple point clouds in Figure 6a) matching the planar characteristics are screened, and the noise point clouds (black point clouds in Figure 6a) are eliminated. The final target object extraction results are shown in Figure 6b, indicating that the proposed method can effectively remove the target point while maintaining the integrity and structural consistency of the target point cloud.

We also carried out the same process for Data 2, and the processing result is shown in Figure 7. A large amount of noise in the data has been well filtered out, and the main structure has been well retained. There are some missing point clouds in the lower part of the data. This is mainly due to the missed measurement of some point clouds during the measurement process, rather than over-filtering caused by the algorithm in this paper.

4.3. Accuracy Evaluation

In order to evaluate the denoising effect of point clouds combined with spatial features and regional growth, Precision, Recall and F1 Score were used as evaluation indicators in this study. The proportion of the point cloud that the precision measurement algorithm determines as the target point is actually correctly identified among all points. The recall rate reflects the proportion of the actual target point cloud successfully detected and removed by the algorithm. The F1 score is the harmonic average of the accuracy rate and the recall rate, which is used to comprehensively evaluate the accuracy and coverage ability of the algorithm.

P r e c i s i o n = \frac{T P}{T P + F P}

(13)

R e c a l l = \frac{T P}{T P + F N}

(14)

F 1 S c o r e = 2 \times \frac{P r e c i s i o n \times R e c a l l}{P r e c i s i o n + R e c a l l}

(15)

TP (True Positive) indicates the number of targets correctly recognized, FP (False Positive) indicates the number of noise points incorrectly recognized as targets, and FN (False Negative) indicates the number of targets that are not detected.

The ground truth data are shown in Figure 8 and Table 2. It can be seen from the experimental results in Table 1 that the method proposed in this paper achieved an accuracy of 78.93% and 85.64%, respectively, in the experiments of Data 1 and 2, demonstrating an excellent ability to suppress mis-classification. Especially in terms of Recall, the method proposed in this paper achieves 83.38% on Data 2, indicating that this method retains the real target points when extracting the target point cloud.

The results differ significantly between the two datasets. In fact, the dataset is categorized into two distinct regions: the wellbore wall and the bottom surface. Initial evaluations indicate that all methods exhibit relatively lower accuracy on the first batch of data. Upon detailed comparative analysis, we identified that the primary discrepancy lies within the point clouds of the bottom surface. The second batch contains a denser distribution of bottom points, with a significant number of points appearing outside the wellbore wall. During the manual generation of ground truth, these points were retained as valid structural features rather than being classified as noise. In contrast, the first batch contained fewer such points outside the wall, which were predominantly filtered out as noise during manual processing. This suggests a potential human error in the annotation process, where valid points in the outer region were mistakenly labeled as noise. Consequently, the observed variations in data processing accuracy are primarily attributed to deviations in the ground truth.

The proposed method utilizes TLS to extract 3D spatial features and optimizes the process through a combination of region growing and statistical filtering. This approach achieves a balance between local feature preservation and global segmentation, ensuring greater stability and generalization in complex environments. As a result, the method enhances the completeness of the target point cloud while reducing noise interference, demonstrating significant application value in 3D sonar point cloud processing.

5. Discussion

5.1. The Stability of the Algorithm for Threshold Setting

The threshold parameters were determined by analyzing data from small regions and observing the experimental results, after which they were extended to larger areas. Although underwater structures exhibit a certain degree of data complexity, the scenarios themselves are relatively uniform. Consequently, thresholds established in small regions demonstrate significant potential for application across larger domains.

To verify the stability of the algorithm for threshold setting, we conducted a threshold setting experiment with the angle threshold as 50°. The included angle threshold is the most important and influential threshold; so, it was mainly adjusted. The final results are shown in Figure 9 and Table 3. Overall, some data indicators have improved, and some have decreased, but the basic accuracy is all good, indicating that the method proposed in this paper is stable under different parameter settings.

5.2. Comparison and Ablation Experiments

To verify the performance of the algorithm, we conducted comparison experiments. The statistical filtering method and the distance clustering method, as comparison methods [15,27,28,29], were also used in the experiment. The core principle of statistical filtering is based on the local statistical characteristics of point cloud data. By calculating the average distance between each point and its neighboring points and comparing it with the global average distance, outliers that are far from the surrounding points can be identified and removed. The distance clustering method is a classification method based on the distance between points. The commonly used algorithm is Euclidean distance clustering. It classifies points whose distance is less than the set threshold into one category by calculating the Euclidean distance between points. The results are shown in Figure 10 and Figure 11 and Table 4. Overall, the accuracy of the distance clustering algorithm is higher than that of statistical filtering, but both are lower than the method in this paper.

It can be seen that statistical filtering is always prone to under-filtering, and some clustered small blocks are difficult to handle well. This is because statistical filtering has difficulty distinguishing aggregated noise. The distance clustering method can handle some aggregated noise points, and the overall result is good. However, compared with the method in this paper, there is still the problem of insufficient accuracy. The main reason is that a single distance feature is insufficient to describe the complex structure of underwater 3D sonar.

Since our algorithm also incorporates a statistical filtering strategy, it significantly outperforms the standard statistical filtering method in terms of accuracy metrics. This also can be considered as an ablation experiment, which shows that the adopted structural information is important for filtering. Both of the statistical filtering and distance clustering methods only utilize the attribute information of a single point cloud, making it difficult to comprehensively describe the differences between structural point clouds and noise points. Therefore, their accuracy is also relatively low. As a comparison, the proposed method achieves F1 scores of 78.65% and 84.49% in outlier removal, effectively eliminating noise while largely preserving structural features. For Data 1, our method achieved precision improvements of 0.38% and 0.07% compared to the statistical filtering and distance clustering, respectively. In terms of recall, our method outperformed these baselines by 1.63% and 0.93%, respectively. Consequently, the F1 score saw gains of 1.02% and 0.51% over the two methods. For Data 2, our method improved precision by 0.13% over statistical filtering. Regarding recall, we observed improvements of 0.44% and 4.05% compared to statistical filtering and distance clustering, respectively. Finally, the F1 score increased by 0.28% and 0.14% relative to the respective baselines. The comparison figure is shown in Figure 12.

5.3. Limitations

We evaluated the method on two datasets acquired from Changjiang underwater caisson observations, both from the same structural category. This may be narrow to support claims of robustness or generalization in complex underwater environments. However, this limitation stems from the scarcity of 3D underwater sonar data across different structural types and the lack of publicly available datasets. From a structural perspective, although we utilized data exclusively from caissons, other similar scenarios, such as bridge pier inspections and bank slope surveys, share comparable geometric characteristics, primarily consisting of planar or curved surfaces. Furthermore, unlike optical imaging, the underwater environment has a relatively minor impact on acoustic imaging. Therefore, we maintain that our method possesses a certain degree of generalization capability and practical value for these analogous structures. In terms of the calculation speed, the method presented in this paper is inferior to the methods being compared. However, we believe that the additional time cost spent on calculating the structural features is worthwhile for obtaining better feature description performance.

Finally, while our method demonstrates improvements over existing approaches, these gains are incremental rather than overwhelming; nevertheless, our approach exhibits a consistent overall advantage. More importantly, we believe this work validates a promising research direction: explicitly incorporating structural characteristics is an effective strategy for processing 3D sonar point clouds.

6. Conclusions

This paper proposes a novel 3D sonar point cloud denoising method that integrates local spatial feature constraints with global region growing. Initially, local spatial features are extracted using a TLS approach. These features are then combined with vector attribute parameters to guide the region growing segmentation, effectively distinguishing between different point cloud clusters. Subsequently, the initial segmentation results are refined through feature correction based on local spatial characteristics, followed by statistical filtering to optimize point cloud quality and mitigate mis-classification errors. The experimental results demonstrate that the proposed method achieves F1 scores of 78.65% and 84.49% on 3D sonar denoising tasks. Compared with traditional methods, our approach not only removes noise more effectively but also better preserves structural details. In complex environments, the method strikes an optimal balance between local feature extraction and global region growing, thereby maintaining the integrity and structural consistency of the point clouds. Consequently, the proposed method exhibits strong adaptability for noise removal in complex underwater scenarios, offering a reliable solution for high-precision underwater structure monitoring. Future work will focus on optimizing the region growing strategy and exploring the integration of deep learning with traditional techniques to further enhance the segmentation accuracy and denoising performance.

Author Contributions

Conceptualization, F.Z. and S.L.; Methodology, F.Z. and H.G.; Software, F.Z. and S.L.; Writing—original draft, F.Z., S.L. and H.G.; Writing—review & editing, F.Z., S.L. and Y.W.; Supervision, Y.W.; Funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grant Nos. 42304049, 42274111, 42574073, the China Postdoctoral Science Foundation under Grant Nos. 2023M743282, 2024T170856, and the Key Laboratory of Submarine Geosciences, Ministry of Natural Resources under Grant No. KLSG2406.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Guerneve, T.; Petillot, Y. Underwater 3D Reconstruction Using Blueview Imaging Sonar. In OCEANS 2015-Genova; IEEE: New York, NY, USA, 2015; pp. 1–7. [Google Scholar]
Lurton, X. An Introduction to Underwater Acoustics: Principles and Applications; Springer Science & Business Media: Berlin, Germany, 2002. [Google Scholar]
Gerigk, M.K.; Gerigk, M. Application of Unmanned USV Surface and AUV Underwater Maritime Platforms for the Monitoring of Offshore Structures at Sea. Sci. J. Marit. Univ. Szczec. 2023, 148, 89–100. [Google Scholar] [CrossRef]
Long, J.; Zhang, H.; Zhao, J. A Comprehensive Deep Learning-Based Outlier Removal Method for Multibeam Bathymetric Point Cloud. IEEE Trans. Geosci. Remote Sens. 2023, 61, 4201622. [Google Scholar] [CrossRef]
Trevorrow, M.V. Statistics of Fluctuations in High-Frequency Low-Grazing-Angle Backscatter from a Rocky Sea Bed. IEEE J. Ocean. Eng. 2004, 29, 236–245. [Google Scholar] [CrossRef]
Kim, B.; Joe, H.; Yu, S.-C. High-Precision Underwater 3D Mapping Using Imaging Sonar for Navigation of Autonomous Underwater Vehicle. Int. J. Control Autom. Syst. 2021, 19, 3199–3208. [Google Scholar] [CrossRef]
Charroud, A.; El Moutaouakil, K.; Palade, V.; Yahyaouy, A.; Onyekpe, U.; Eyo, E.U. Localization and Mapping for Self-Driving Vehicles: A Survey. Machines 2024, 12, 118. [Google Scholar] [CrossRef]
Mann, M.; Agathoklis, P.; Antoniou, A. Automatic Outlier Detection in Multibeam Data Using Median Filtering. In 2001 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (IEEE Cat. No. 01CH37233); IEEE: New York, NY, USA, 2001; Volume 2, pp. 690–693. [Google Scholar]
Rezvani, M.-H.; Sabbagh, A.; Ardalan, A.A. Robust Automatic Reduction of Multibeam Bathymetric Data Based on M-Estimators. Mar. Geod. 2015, 38, 327–344. [Google Scholar] [CrossRef]
Bourillet, J.F.; Edy, C.; Rambert, F.; Satra, C.; Loubrieu, B. Swath Mapping System Processing: Bathymetry and Cartography. Mar. Geophys. Res. 1996, 18, 487–506. [Google Scholar] [CrossRef]
Zhu, Q.; Li, D.R. Error Analysis and Processing of Multibeam Soundings. Geomat. Inf. Sci. Wuhan Univ. 1998, 23, 3–6+48. [Google Scholar]
Zhang, Z.; Peng, R.; Huang, W.; Dong, J. An Improved Algorithm of Trend Surface Filtering Based on the Natural Neighboring Points Range. In Electromechanical Control Technology and Transportation; CRC Press: Boca Raton, FL, USA, 2017; pp. 415–420. [Google Scholar]
Yang, A.; Wu, Z.; Yang, F.; Su, D.; Ma, Y.; Zhao, D.; Qi, C. Filtering of Airborne LiDAR Bathymetry Based on Bidirectional Cloth Simulation. ISPRS J. Photogramm. Remote Sens. 2020, 163, 49–61. [Google Scholar] [CrossRef]
Zhang, W.; Qi, J.; Wan, P.; Wang, H.; Yan, G. An Easy-to-Use Airborne LiDAR Data Filtering Method Based on Cloth Simulation. Remote Sens. 2016, 8, 501. [Google Scholar] [CrossRef]
He, Z.; Wu, Y.; Li, S.; Zhang, S.; Li, H.; Bian, S. A Partition Filtering Method for 3D Sonar Point Cloud Data Considering Horizontal Deviation. Geomat. Inf. Sci. Wuhan Univ. 2024, 49, 1639–1649. [Google Scholar]
Wang, W.; Li, Z.; Fu, Y.; He, H.; Xiong, F. A Multi-Scale Adaptive Slope Filtering Algorithm for Point Cloud. Geomat. Inf. Sci. Wuhan Univ. 2022, 47, 438–446. [Google Scholar]
Zhao, X.; Guo, Q.; Su, Y.; Xue, B. Improved Progressive TIN Densification Filtering Algorithm for Airborne LiDAR Data in Forested Areas. ISPRS J. Photogramm. Remote Sens. 2016, 117, 79–91. [Google Scholar] [CrossRef]
Xing, S.; Li, P.; Xu, Q.; Wang, D. Surface Fitting Filtering of LiDAR Point Cloud with Waveform Information. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, 4, 179–184. [Google Scholar] [CrossRef]
Zhang, J.; Lin, X. Filtering Airborne LiDAR Data by Embedding Smoothness-Constrained Segmentation in Progressive TIN Densification. ISPRS J. Photogramm. Remote Sens. 2013, 81, 44–59. [Google Scholar] [CrossRef]
Kolodiazhnyi, M.; Vorontsova, A.; Konushin, A.; Rukhovich, D. Oneformer3d: One Transformer for Unified Point Cloud Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–22 June 2024; IEEE: New York, NY, USA, 2024; pp. 20943–20953. [Google Scholar]
Qi, C.R.; Su, H.; Mo, K.; Guibas, L.J. Pointnet: Deep Learning on Point Sets for 3D Classification and Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; IEEE: New York, NY, USA, 2017; pp. 652–660. [Google Scholar]
Zhao, H.; Jiang, L.; Jia, J.; Torr, P.H.; Koltun, V. Point Transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Nashville, TN, USA, 20–25 June 2021; IEEE: New York, NY, USA, 2021; pp. 16259–16268. [Google Scholar]
Rakotosaona, M.-J.; La Barbera, V.; Guerrero, P.; Mitra, N.J.; Ovsjanikov, M. Pointcleannet: Learning to Denoise and Remove Outliers from Dense Point Clouds. Comput. Graph. Forum 2020, 39, 185–203. [Google Scholar] [CrossRef]
Guerrero, P.; Kleiman, Y.; Ovsjanikov, M.; Mitra, N.J. Pcpnet Learning Local Shape Properties from Raw Point Clouds. Comput. Graph. Forum 2018, 37, 75–85. [Google Scholar] [CrossRef]
Hu, Q.; Yang, B.; Xie, L.; Rosa, S.; Guo, Y.; Wang, Z.; Trigoni, N.; Markham, A. Randla-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; IEEE: New York, NY, USA, 2020; pp. 11108–11117. [Google Scholar]
Vangi, M.; Topini, E.; Liverani, G.; Topini, A.; Ridolfi, A.; Allotta, B. Design, Development, and Testing of an Innovative Autonomous Underwater Reconfigurable Vehicle for Versatile Applications. IEEE J. Ocean. Eng. 2025, 50, 509–526. [Google Scholar] [CrossRef]
Hu, C.; Pan, Z.; Li, P. A 3D Point Cloud Filtering Method for Leaves Based on Manifold Distance and Normal Estimation. Remote Sens. 2019, 11, 198. [Google Scholar] [CrossRef]
Han, X.-F.; Jin, J.S.; Wang, M.-J.; Jiang, W.; Gao, L.; Xiao, L. A Review of Algorithms for Filtering the 3D Point Cloud. Signal Process. Image Commun. 2017, 57, 103–112. [Google Scholar] [CrossRef]
Zhao, Q.; Gao, X.; Li, J.; Luo, L. Optimization Algorithm for Point Cloud Quality Enhancement Based on Statistical Filtering. J. Sens. 2021, 2021, 7325600. [Google Scholar] [CrossRef]

Figure 1. The flowchart of the method.

Figure 2. Structural feature of point cloud clusters. In the figure, the green areas mark the spherical structures, the blue areas mark the rod-like structures, and the red areas mark the planar structures.

Figure 3. Illustration for normal vectors and corresponding angles.

Figure 4. Point cloud data of 3D sonar. Red rectangles indicate the noise points.

Figure 5. Flowchart of the experimental process: (a) point cloud structural features; (b) point cloud regional growth results; (c) point cloud structural features after regional growth; (d) statistical filter separation of regional growth results.

Figure 6. (a) Proposed extraction of structural point clouds and noisy spot clouds; (b) final extraction of structural point clouds. Black points indicate the noise points.

Figure 7. The filtered result of Data 2.

Figure 8. The ground truth data of (a) Data 1, and (b) Data 2.

Figure 9. The result data of (a) Data 1 and (b) Data 2 after parameter adjustment.

Figure 10. (a) Sampling statistical filtering result of Data 1; (b) distance clustering of Data 1.

Figure 11. (a) Statistical filtering result of Data 2; (b) distance clustering result of Data 2.

Figure 12. Performance data. (a) Performance on Data 1; (b) performance on Data 2.

Table 1. Parameters of BV 5000.

Parameters	BV5000-1350	BV5000-2250
Sonar Head Field-of-view	1° × 45°	1° × 45°
Pan Scan Angle Coverage	45–360°	45–360°
Tilt Scan Angle Coverage	−65–65°	−65–65°
Up-data Rate	Up to 40 Hz	Up to 40 Hz
Frequency	1.35 MHz	2.25 MHz
Maximum Range	27 m (90 ft)	9 m (30 ft)
Number of Beams	256	256
Beam Width	1° × 1°	1° × 1°
Beam Spacing	0.18°	0.18°
Time Resolution	1.23 min	0.74 min
Size (L × W × H)	10.5 in × 9.2 in × 15.4 in	10.5 in × 9.2 in × 15.4 in
Weight in Air	21.7 lbs	19.1 lbs
Weight in Water	8.2 lbs	6.0 lbs
Depth Rating	300 m (1000 ft)	300 m (1000 ft)
Power Consumption	45 W	45 W
SPT Junction Box	120–240 VDC	120–240 VDC
Voltage Directly to Head (no SPT)	12–48 VDC	12–48 VDC

Table 2. Experimental comparative evaluation.

Method	Area	TP	FP	FN	Precision	Recall	F1 Score
The proposed method	Data 1	2,039,060	544,286	562,817	78.93%	78.37%	78.65%
The proposed method	Data 2	1,875,740	314,573	373,857	85.64%	83.38%	84.49%

Table 3. Experimental comparative evaluation after adjusting the parameters.

Method	Area	TP	FP	FN	Precision	Recall	F1 Score
The proposed method	Data 1	2,046,400	547,122	555,475	78.90%	78.65%	78.78%
The proposed method	Data 2	1,932,900	410,861	316,696	82.47%	85.92%	84.16%

Table 4. The results of other methods.

Method	Area	TP	FP	FN	Precision	Recall	F1 Score
Statistical filtering	Data 1	2,003,849	548,027	598,027	78.52%	77.02%	77.76%
Statistical filtering	Data 2	1,922,996	412,561	326,599	82.34%	85.48%	83.88%
Distance clustering	Data 1	2,022,176	543,060	579,700	78.83%	77.72%	78.27%
Distance clustering	Data 2	1,841,713	292,734	407,883	86.29%	81.87%	84.02%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, F.; Li, S.; Gao, H.; Wu, Y. 3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm. J. Mar. Sci. Eng. 2026, 14, 597. https://doi.org/10.3390/jmse14070597

AMA Style

Zhang F, Li S, Gao H, Wu Y. 3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm. Journal of Marine Science and Engineering. 2026; 14(7):597. https://doi.org/10.3390/jmse14070597

Chicago/Turabian Style

Zhang, Fan, Shaobo Li, Haolong Gao, and Yunlong Wu. 2026. "3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm" Journal of Marine Science and Engineering 14, no. 7: 597. https://doi.org/10.3390/jmse14070597

APA Style

Zhang, F., Li, S., Gao, H., & Wu, Y. (2026). 3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm. Journal of Marine Science and Engineering, 14(7), 597. https://doi.org/10.3390/jmse14070597

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

3D Sonar Point Cloud Denoising Constrained by Local Spatial Features and Global Region Growth Algorithm

Abstract

1. Introduction

2. Background

2.1. Imaging Process of 3D Sonar

2.2. Noise Characteristics

3. Methods

3.1. 3D Spatial Feature

3.2. Region Growth Segmentation of Point Cloud Region Based on Spatial Features

3.3. Noise Removal Considering the Spatial Distribution and Structural Features of Point Clouds

4. Experiments

4.1. 3D Sonar Point Cloud Data

4.2. The Experimental Process

4.3. Accuracy Evaluation

5. Discussion

5.1. The Stability of the Algorithm for Threshold Setting

5.2. Comparison and Ablation Experiments

5.3. Limitations

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI