1. Introduction
Surveying the shallow water topography in lakes, reservoirs, and other coastal zones helps research in hydrology, fishery, coastal erosion, terrestrial water resource planning, and water body protection [
1], which is of great significance to the sustainable terrestrial water resource development of human society [
2]. Researchers usually use shipborne sonar systems [
3,
4] or airborne LiDAR systems [
5,
6] to collect accurate bathymetric data. The limitations of these two approaches in surveyed efficiency and range lead to the fact that most shallow water areas still need to be mapped, and the data around the coastal zones on bathymetric maps still need to be completed. The critical gap between the lack of fine shallow water bathymetric data and its vast demand must be filled.
The Ice, Cloud, and Land Elevation Satellite-2 (ICESat-2) is the new generation of spaceborne laser altimeter. The onboard Advanced Topographic Laser Altimeter System (ATLAS) equips six laser beams arranged in three pairs [
7]. The energy of each laser pulse emitted by strong beams is 0.12 mJ, and the ratio of the pulse energy of strong beam to weak beam is about 4:1. ATLAS emits 10,000 laser pulses every second, while the diameter of the footprints formed by these pulses is 11 m, and the distance between adjacent footprints is 0.7 m, much higher than that of its predecessor, ICESat [
8]. Compared with the near-infrared laser, the 532 nm blue-green laser emitted by ATLAS delves deeper into the water. Although the primary goal of ICESat-2 is to monitor the changes in glaciers, sea levels, and forests, its strong pulses can penetrate up to ~40 m in clear water [
7]. Therefore, ICESat-2 is regarded as an effective bathometer in satellite-derived bathymetry (SDB) [
9,
10,
11]. The coordination of bathymetric photons can be used as prior results to participate in bathymetry, thus replacing in situ data acquired by shipborne or airborne platforms and improving efficiency [
12,
13,
14,
15].
However, due to the high sensitivity of the photon-counting LiDAR [
16], many noise photons are recorded in the ICESat-2 Level 2 datasets. These noise photons caused by solar background, atmospheric scattering, and water scattering flooded the original data, challenging the extraction of bathymetric photons. In bathymetric data, the spatial distribution of water surface bathymetric photons is dense, while the spatial distribution of underwater bathymetric photons is related to water depth. Because of the absorption and scattering of pulse energy by water, the density of underwater bathymetric photons decreases with the increase of water depth. When reaching the extreme depth measured by ICESat-2, it is difficult to distinguish the underwater bathymetric photons from the noise photons.
At the beginning of the launch, the scientific team did not expect the performance of ICESat-2 in bathymetry [
13], so they did not develop a special method for extracting bathymetric photons in nearshore environments. Nevertheless, ATL03 still provided a confidence label
for extracting signal photons from ocean data [
17] (after this, referred to as CONF). Photons in the original data are labeled with 0–4 categories, corresponding to noise, buffer, weak, medium, and strong signals. Photons with
are the basis for producing advanced marine product ATL12, which can be used for global sea level detection [
18]. Besides, the existing bathymetric photon extraction methods usually use elevation frequency histograms [
12,
19,
20,
21] to divide the data into three parts: above the water surface, the water surface, and under the water surface, then remove the noises, respectively. Density-based noise removal methods have been proven to extract water surface photons [
22,
23]. The Joint North Sea Wave Project (JONSWAP) wave spectrum has also been introduced to describe the undulating shape of the water surface photons and extract them [
22,
24]. Density-based spatial clustering method of applications with noise (DBSCAN) [
21,
25,
26], which has been widely used in the noise removal of land data, has been improved for underwater bathymetric photon extraction. To deal with the characteristics of photon density changing with water depth, DBSCAN is improved in two ways: an improved adaptive variable ellipse filtering bathymetric method (AVEBM) increases the size of the filter kernel with water depth [
19]. Alternatively, by decreasing the density threshold with water depth, the underwater bathymetric photons can be identified by reducing the requirement on the adaptive minimum number of photons (AMNP) in the filter kernel [
12,
20]. However, the direction of the improved DBSCAN filter kernel is fixed, and it cannot adapt to the fluctuation of the underwater terrain. With the increase of water depth, the photon distribution becomes sparser, and the distinction between underwater bathymetric photons and noise photons becomes blurred. Special steps are needed to distinguish the underwater bathymetric photons from discrete noise clusters, especially when processing data acquired during the daytime.
Zhang et al. proposed the quadtree isolation (QI) method [
27], which converted photon coordination into a quadtree hierarchical structure by iterative space segmentation to remove noise. It is not affected by terrain fluctuation and does not need input parameters, so it has potential in bathymetric photon extraction. However, the original QI lacks the consideration of photon density changing with water depth, and the total threshold used in QI cannot wholly extract underwater bathymetric photons. Besides, QI cannot identify noise clusters around underwater bathymetric photons.
In this research, we proposed pre-pruning quadtree isolation (PQI) with changing threshold, which can be used to extract bathymetric photons. The improvement is mainly concentrated in two aspects. Firstly, the spatial distribution of photons is judged before each layer of isolated quadtree grows to avoid extracting noise clusters. Secondly, considering the underwater bathymetric photon density decreases with water depth, the frequency histogram of isolated level (IL, that is the photon level in isolated quadtree) and photon elevation is calculated. The relationship between IL and water depth is quantitatively described, which is helpful to determine the threshold of IL in different depth ranges.
3. Methods
The spatial distribution of signal photons is denser than that of noise photons, but the noise removal methods are still challenged in processing ICESat-2 bathymetric data. Affected by the fluctuation of the water surface, the noise removal methods without directional adaptive ability cannot extract whole water surface photons. Under the water surface, the density of underwater bathymetric photons decreases with water depth, so IL also needs to change with photon elevation. The spatial segmentation of noise clusters distributed around underwater bathymetric photons will overestimate the density of these noise photons, leading to potential errors. Therefore, PQI with changing threshold, shown in
Figure 2, is proposed, which includes two main steps: pre-pruning quadtree isolation and isolated level threshold determination.
3.1. Pre-Pruning Quadtree Isolation
QI removes noise photons by iterative quad space segmentation. The space of photon distribution is divided into four subspaces, and it is required that each subspace can only contain one photon or not. If this limit is exceeded, four subspaces will be further segmented until the requirement is met. This process of spatial segmentation is regarded as photon isolation, which can correspond to the growth process of the isolated quadtree, and the density of photons affects their level in the quadtree. Since the spatial distribution of signal photons is denser than noise photons, isolating signal photons requires more spatial segmentation, and their ILs are larger than the noise photons. Comparing the ILs can remove the noise photons contained in the raw data.
QI has strong adaptability to the fluctuation of signal photons along with terrain or water surface and does not need to be provided user-defined parameters, which has an advantage in signal photon extraction [
27]. However, when processing the bathymetric data shown in
Figure 3c, the IL of the noise photon marked by the purple dotted line is 4, the IL of the signal photon marked by the yellow dotted line is 4, and the IL of the signal photon marked by the purple dotted line is 3. The IL of the noise photon is greater than or equal to the IL of the signal photon. QI has an error.
Although the spatial distribution of noise photons is generally more dispersed than signal photons, the noise clusters formed by the aggregation of noise photons make this assumption inaccurate locally. As shown in
Figure 3c, the noise photon marked by the purple dotted line is close to the noise photon on its left. To isolate them, QI made an extra space segmentation, which overestimated the IL of the noise photon. QI does not consider the local photon density anomaly caused by noise clusters, but the backscattering of water makes noise clusters ubiquitous in bathymetric data.
Further analysis of the results in
Figure 3c shows that the noise photon marked by the purple dotted line cannot be separated from the noise photon on its left by the second and third space segmentation. The noise photon is isolated only after the fourth space segmentation. This means that the second and third space segmentation are meaningless, which leads to the photon being overestimated by two levels in the quadtree.
To avoid excessive space segmentation, the pre-pruning step is proposed to optimize the quadtree structure. The core idea of pre-pruning is to check whether photons are further isolated before and after space segmentation. If photons are still in the same subspace before and after segmentation, these photons are not further isolated. The growth of the quadtree will cause overestimations of IL, so this part of the quadtree needs to be cut off. Therefore, when PQI processes bathymetric data, the whole space of photon distribution is taken as the root node in the quadtree, the non-leaf nodes represent the divided subspace, and each non-empty leaf node contains isolated photons. With the introduction of the pre-pruning step, the following conditions will cause the quad tree to stop growing:
(1) The growth of quadtree stops naturally. In this condition, space segmentation further isolates photons, and the number of photons contained in subspaces does not exceed one.
(2) The overgrown quadtree is cut off. In this condition, the pre-pruning step finds that photons are not further isolated before and after space segmentation and are still in the same subspace. The quadtree structure grown by this segmentation will be cut off, and the quadtree growth will stop to avoid excessive space segmentation.
The result of PQI is shown in
Figure 3d. Before the third spatial segmentation, the pre-pruning step found that the noise photon marked by the purple dotted line was not further isolated from its surrounding photon, so the quadtree stopped growing, and the noise photon was in the second level of the quadtree. Therefore, the pre-pruning step can avoid excessive segmentation caused by local spatial photon aggregation, limit the influence of underwater noise clusters on bathymetric photon extraction, and enlarge the difference between bathymetric photons and noise photons in the quadtree.
3.2. Isolated Level Threshold Determination
After using PQI to obtain ILs of the photons, it is necessary to calculate the elevation range of bathymetric photons and determine the appropriate IL threshold. The water absorption of pulse energy makes IL decrease with the water depth (as shown in
Figure 4a). The IL of water surface photons and some shallow underwater bathymetric photons is the largest, while noise photons have the smallest ILs. In the same elevation range, bathymetric photons are denser than noise photons. Using the same IL threshold cannot wholly extract bathymetric photons. According to Equation (1) to Equation (4), the photon space is divided, and the frequency histogram with elevation and IL as the plane is constructed to quantify the change of IL with elevation, realizing range demarcation and automatic extraction of bathymetric photons.
where
and
are the elevation and IL of each photon,
and
are the interval of the histogram in the elevation direction and the IL direction,
and
are the segment number in the elevation direction and the IL direction, and
and
define the center position in the elevation direction and IL direction, respectively, which together determine the grid coordinates of the histogram.
The generated histogram is shown in
Figure 4b, and the IL distribution in different elevation ranges shows pattern. The upper and lower boundaries of bathymetric photon are defined according to the histogram, and the data are divided into noise photons in the air (
), bathymetric photons, and noise photons in the water (
).
The elevation range corresponding to the position with the highest frequency in the histogram (assuming , where is the lower bound of the range and is the upper bound of the range) is regarded as the elevation range where the water surface photons are located, and its upper bound represents the upper bound of bathymetric photons. Photons with elevation higher than the upper bound are marked as the . Since the difference in IL between the deeper underwater bathymetric photons and the surrounding noises is fuzzy, it is difficult to determine the lower elevation bound of bathymetric photons by using the same threshold. Therefore, the mean IL of the noise photons in the air is calculated. When the mean IL of photons in a certain elevation range (assuming , where is the lower bound of the range and is the upper bound of the range) is less than or equal to , the photon density is considered to be close to the . Photons with elevation smaller than the upper bound are marked as the . Both AP and WP are removed.
With the increase of water depth, the IL of photons decreases, and the threshold of IL in different elevation ranges also changes. The Otsu method [
28] is introduced to calculate the optimal thresholds in different elevation ranges. It calculates the threshold by maximizing the inter-class variance of IL between bathymetric photons and noise photons and has been applied to the noise removal of photon-counting LiDAR data [
23]. The Otsu method is used to calculate the variance between classes in different elevation ranges, as shown in Equation (5):
where
is the inter-class variance,
is the proportion of potential bathymetric photons,
is the proportion of potential noise photons,
is the mean IL of bathymetric photons,
is the mean IL of potential noise photons, and
is the mean IL of all photons, as shown in Equation (6) to Equation (10):
In each elevation range, by changing , and are recalculated, and is renewed. Finding the largest , the corresponding IL is selected as the threshold, and the photons with an IL greater than the threshold are extracted as bathymetric photons.
3.3. Performance Verification
Bathymetric photons include water surface and underwater bathymetric photons, which record the underwater topography of shallow water environments. However, the lack of reference data makes evaluating the method’s performance challenging. Because ATL03 data does not provide the confidence label of bathymetric photons, some studies have conducted refraction correction of underwater bathymetric photons and compared the corrected results with high-precision underwater topography [
19,
29]. This method cannot measure the completeness of underwater bathymetric photon extraction results. Therefore, in this study, the reference data of bathymetric photons are obtained by visual interpretation. All bathymetric photons used as reference are selected through human-computer interaction. The extraction results of bathymetric photons are compared with reference data, and precision
, recall rate
, and F1 score
are calculated.
In ATL03 data, photons with
are introduced as a comparison, and this method is marked as CONF in the results [
17]. AVEBM [
19] and AMNP [
12,
20], two widely used bathymetric photon extraction methods, are also introduced as comparison methods. In addition, the extraction results are divided according to the elevation to evaluate the method’s performance in extracting water surface and underwater bathymetric photons, respectively.
where
indicates the number of correctly extracted photons,
indicates the number of incorrectly extracted photons, and
indicates the number of unextracted bathymetric photons.
5. Discussion
In this part, we first identify the factors that affect the extraction of bathymetric photons by analyzing the results and then discuss the laws of the noise rate and water depth influencing the methods. Finally, we evaluate the performance of PQI for extracting bathymetric photons.
5.1. Analysis of Influencing Factors
As shown in
Figure 7, when different algorithms extract bathymetric photons in the same scenario, PQI performs best. However, when comparing the processing results of the same method in different scenarios, it is difficult to directly understand the influence of factors on the performance of the bathymetric photon extraction method.
The change in noise rate will lead to a change in photon density, and the number of noise clusters close to bathymetric photons will increase. Therefore, a higher noise rate usually makes it more challenging to extract bathymetric photons accurately. Take the results of PQI and AVEBM in
Figure 7 as an example. The results in
Figure 7a,b,e,f are obtained in adjacent areas at different times, which allows us to analyze the results of the bathymetric photon extraction method under different noise rates. By comparison, the increase in noise rate does not affect the extraction of water surface photons, and both methods can obtain water surface photons entirely and accurately. During the extraction of underwater bathymetric photons, it is found that with the increase in noise rate, the noise clusters around underwater bathymetric photons increase. PQI effectively eliminates these noise clusters, and the noise rate has no apparent influence on performance, while AVEBM is affected by surrounding noise clusters.
The box diagram of the F1 scores of the calculations of PQI and AVEBM when processing data with different noise rates is shown in
Figure 8. It can be found that the F1 score of PQI and AVEBM will decrease with the increase in photon density. With the increase of noise photons, the median value of PQI’s F1 score decreased slowly from 94% to 92%. This result shows that although noise influences quad space segmentation, this influence is weak with the help of the pre-pruning step. The performance of AVEBM decreases more obviously with the increase of noise rate, but even when processing data of 6–8 pts/m, it can still achieve more than 85% F1 score, which is good performance. Comparing the trend of the box diagram between PQI and AVEBM, it is found that the boxes of PQI are narrower than those of AVEBM under the same noise rate. Therefore, the performance of PQI is less affected by the change in noise rate, and the results obtained by PQI have a better consistency.
The extraction of underwater bathymetric photons is more challenging. Compared with water surface photons, the number of underwater bathymetric photons is smaller, and the spatial density of underwater bathymetric photons decreases with the increase of water depth. Take the results of PQI and AVEBM as an example. Therefore, underwater bathymetric photons are divided into four parts, namely 0–10 m, 10–20 m, 20–30 m, and 30–40 m.
Figure 9 shows the box diagram of PQI and AVEBM at different water depths. The performance of PQI and AVEBM decreases with the increase in water depth, and PQI is better than AVEBM in different water depth ranges. Specifically, the F1 score of PQI in 0–30 m water depth decreases slowly with the increased water depth. When the water depth exceeds 30 m, the change in the F1 score corresponding to PQI is not apparent. The pre-pruning step can still effectively identify and remove noise clusters around bathymetric photons in the deep-water environment. The F1 score corresponding to AVEBM decreases with the increase in water depth. When the water depth is 30–40 m, the median F1 score of AVEBM is lower than 40%, which indicates that AVEBM has failed at this time.
The analysis shows that noise rate and water depth are the main factors affecting the performance of the bathymetric photon extraction methods. Whether the noise rate increases or the water depth increases, the extraction algorithm’s performance will decline to vary degrees. In addition, we also found that the influence of the change of influencing factors on different methods is inconsistent, and PQI is not as affected by noise rate and water depth as AVEBM. In order to understand the influence of noise rate and water depth change on PQI and the performance of the comparison methods, more detailed analyses were designed and carried out.
5.2. The Influence of Noise Rate
Table 3 divides the bathymetric data into 2–4 pts/m (low noise rate), 4–6 pts/m (medium noise rate), and 6–8 pts/m (high noise rate). The evaluation indexes obtained by the four methods when dealing with data with different noise rates are calculated. PQI achieves a 93.17% F1 score when dealing with data at a low noise rate, a 92.40% F1 score when dealing with data at a medium noise rate, and a 92.28% F1 score when dealing with data at a high noise rate. With the increase in noise rate, the performance of PQI decreases by 1% at maximum. The results show that it is insensitive to the change in noise rate. The pre-pruning step can make PQI effectively identify the noise clusters around the bathymetric photons. Although these noise clusters may be adjacent to the bathymetric photons in space, their differences at the quadtree level are apparent. In addition, as can be known from the analysis in
Section 4.1, PQI can feel the change in noise rate. More dense photons will cause each photon to be isolated after more times of spatial segmentation. Therefore, the change in noise rate will be reflected in the histogram generated by PQI, and the isolation threshold will be different with different noise rates. The performance of other methods under different noise rates is inferior to PQI’s. AVEBM’s F1 score decreases with the increase in noise rate; however, the decrease rate is not more than 3%, which shows that the noise rate has no apparent influence on it. The adaptability of the two methods to the noise rate makes them have no requirement for data acquisition time. Even if the data to be processed contains more noise, the performance of the two methods will remain relatively high.
5.3. The Influence of Water Depth
The indexes of underwater bathymetric photon extraction at different water depths are shown in
Table 4. PQI achieved a higher F1 score in the water depth range of 0–10 m, 10–20 m, 20–30 m, and 30–40 m. PQI has the highest F1 score (83.92%) when extracting underwater bathymetric photons at 0–10 m and the lowest F1 score (77.35%) when extracting underwater bathymetric photons at 30–40 m. CONF did not extract underwater bathymetric photons with a water depth of more than 10 m, and the extraction results of bathymetric photons from 0 to 10 m were unsatisfactory. AVEBM’s F1 score decreases with water depth, gradually decreasing from 74.52% to 61.75%. The performance of all methods decreased with the increase in water depth. Although the difference between underwater bathymetric photons and noise photons gradually decreased with the increase of water depth, PQI’s performance was still better than AVEBM and AMNP in the same water depth range. The increased water depth also led to a more noticeable difference between the methods. When extracting underwater bathymetric photons with a water depth of 0–10 m, PQI’s F1 score is 9.20% higher than AVEBM’s F1 score, and the difference increases to 15.60% when processing data with a water depth of 30–40 m. We can reach the same conclusion by comparing the results of PQI and AMNP. Therefore, PQI’s performance in extracting underwater bathymetric photons is better than AVEBM’s and AMNP’s, and PQI has more robust adaptability to the spatial distribution changes of photons caused by the increase in water depth. Considering that ICESat-2 can penetrate about 40 m into deep water at most,
Table 4 enables us to fully understand the changes in PQI, AVEBM, and AMNP performance with water depth.