Few-Shot Classification of Shallow-Water Seabed Sediment and Benthic Cover by Fusing Airborne LiDAR Bathymetry and Multispectral Imagery
Highlights
- A robust cross-modal alignment between airborne LiDAR bathymetry and multispectral imagery is achieved using SIFT-PROSAC and perspective transformation, ensuring high geometric consistency.
- The proposed FCA-Relief-F feature selection and GAT-PN model enable accurate classification of five sediment and benthic-cover types under few-shot conditions, significantly improving discriminative performance.
- The study demonstrates that effective feature fusion and selection can substantially reduce redundancy and enhance cross-modal representation for shallow-water seabed mapping.
- The proposed framework provides a practical solution for sediment and benthic-cover classification in data-scarce scenarios, with strong potential for coastal applications.
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Preprocessing and Feature Extraction
2.2. Muti-Source Features Registration and Fusion Model Construction
- (1)
- Scale space construction and feature point detection. Image locations were sought at all scales, with feature points unresponsive to scale and rotation identified using Gaussian derivative functions. The location and scale of the feature points are then determined through the application of a fitted model [26].
- (2)
- Feature point main orientation estimation and descriptor creation. The main direction is determined by calculating the gradient direction of each extreme point in the image. Gradient calculation and direction assignment were performed on the pixels around the extreme points to generate the feature descriptor of each extreme point. The gradient direction and modulus were calculated using the following equations [27]:where L(x,y) denotes the feature point, m denotes the gradient modulus, and θ denotes the gradient direction.
- (3)
- Erroneous feature point-pair elimination. For the matching points obtained after the SIFT transformation, all matching points were sorted in descending order according to the Hamming distance similarity. The first n groups of high-quality matching point pairs were selected, and m groups of matching point pairs were chosen to form a sample set. Then, the basic matrix F is obtained according to Equation (3) [28,29]. The inliers are defined as the number of matching point pairs that satisfy the matrix F. When the number of inliers exceeds the set maximum threshold or the number of inliers after two consecutive samplings does not increase, the above cycle is stopped, and the corresponding inliers are output.where (x, y, 1) and (x′, y′, 1) are the homogeneous coordinates of a pair of correctly matched points a(x, y) and a′(x′, y′), respectively.
- (4)
- Perspective transformation matrix calculation and multi-source feature fusion. To address the geometric discrepancies between ALB intensity range images and multispectral blue-band range images caused by different imaging geometries and viewing perspectives, the perspective transformation model was adopted. Compared with affine transformation, the perspective model can better handle non-linear projective distortions. Therefore, the transformation matrix for pixel coordinate conversion between the two datasets was calculated based on the perspective transformation model. Using this matrix, ALB features were then aligned and fused with multispectral features. The perspective transformation model is expressed as follows [30]:where q and ω are the coordinate points in the original multispectral image, (q′, ω′) are the corresponding two-dimensional coordinates after the perspective transformation, and is the perspective transformation parameter matrix, which can be divided into four parts. a11, a12 and a13 are the parameters that control the scaling, rotation, and translation in the horizontal direction; a21, a22 and a23 are the same in the vertical direction; a31, a32 and a33 are the parameters of the perspective projection; a31 and a32 determine the nonlinear scaling of the q and ω coordinates, and a33 is the scale factor.
2.3. Cross-Modal Feature Optimization Model Construction
2.4. Few-Shot Sediment and Benthic-Cover Classification Model Construction
3. Results
3.1. Study Data
3.2. Result of Feature Extraction, Registration and Fusion
- (1)
- Waveform and terrain features extraction from the ALB data
- (2)
- Texture and spectral features extraction from multispectral image
- (3)
- Results of ALB data and multispectral data registration and fusion
3.3. Result of Classification and Accuracy Assessment
4. Discussion
4.1. Correlation Analysis of Features
4.2. Feature Contribution Analysis
4.3. Limitations and Future Work
5. Conclusions
- (1)
- Based on ALB and multispectral features, the constructed SIFT-PROSAC and perspective transformation models were used for feature registration and fusion. The experimental results demonstrate that the algorithm can obtain higher registration accuracy, leading to the generation of more precise fusion features.
- (2)
- To select an effective classifier, after feature optimization, traditional classifiers, such as support vector machines, BP neural networks, random forests, prototype networks, graph convolutional networks, GraphSAGE, graph attention networks, and the GAT-PN classifier proposed in this article, were used to extract five sediment and benthic covers. The experimental results demonstrate that the GAT-PN classifier has the best classification effect, with an overall accuracy and kappa coefficients of 97.50% and 0.969, respectively. The results can provide effective technical support for the classification of seabed sediment and benthic covers in marine engineering and other fields.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| ALB | Airborne LiDAR Bathymetry |
| SIFT | Scale-Invariant Feature Transform |
| PROSAC | Progressive Sample Consensus |
| FCA | Formal Concept Analysis |
| GAT | Graph Attention Network |
| PN | Prototype Network |
| OA | Overall Accuracy |
| MBES | Multi-Beam Echo Sounder |
| RANSAC | Random Sample Consensus |
| NDVI | Normalized Difference Vegetation Index |
| NDWI | Normalized Difference Water Index |
| UTM | Universal Transverse Mercator |
| RMSE | Root Mean Square Error |
| GraphSAGE | Graph Sample and Aggregate |
| PCA | Principal Component Analysis |
| KNN | K-nearest neighbor |
References
- Anokye, M.; Cui, X.; Yang, F.; Wang, P.; Sun, Y.; Ma, H.; Amoako, E.O. Optimizing multi-classifier fusion for seabed sediment classification using machine learning. Int. J. Digit. Earth 2024, 17, 2295988. [Google Scholar] [CrossRef]
- Zhao, Y.; Zhu, K.; Zhao, T.; Zheng, L.; Deng, X. Small-sample seabed sediment and benthic-cover classification based on deep learning. Remote Sens. 2023, 15, 2178. [Google Scholar] [CrossRef]
- Foglini, F.; Rovere, M.; Tonielli, R.; Castellan, G.; Prampolini, M.; Budillon, F.; Cuffaro, M.; Di Martino, G.; Grande, V.; Innangi, S.; et al. A new multi-grid bathymetric dataset of the Gulf of Naples (Italy) from complementary multi-beam echo sounders. Earth Syst. Sci. Data 2025, 17, 181–203. [Google Scholar] [CrossRef]
- Guenther, G.C. Airborne Laser Hydrography System Design and Performance Factors; NOAA Professional Paper Series; Nat. Ocean Service 1; National Oceanic and Atmospheric Administration, U.S. Department of Commerce: Washington, DC, USA, 1985.
- Szafarczyk, A.; Toś, C. The use of green laser in LiDAR bathymetry: State of the art and recent advancements. Sensors 2022, 23, 292. [Google Scholar] [CrossRef] [PubMed]
- Amani, M.; Moghimi, A.; Mirmazloumi, S.M.; Ranjgar, B.; Ghorbanian, A.; Ojaghi, S.; Ebrahimy, H.; Naboureh, A.; Nazari, M.E.; Mahdavi, S.; et al. Ocean remote sensing techniques and applications: A review (part I). Water 2022, 14, 3400. [Google Scholar] [CrossRef]
- Amani, M.; Mehravar, S.; Asiyabi, R.M.; Moghimi, A.; Ghorbanian, A.; Ahmadi, S.A.; Ebrahimy, H.; Moghaddam, S.H.A.; Naboureh, A.; Ranjgar, B.; et al. Ocean remote sensing techniques and applications: A review (part II). Water 2022, 14, 3401. [Google Scholar] [CrossRef]
- Pan, Z.; Glennie, C.L.; Fernandez-Diaz, J.C.; Legleiter, C.J.; Overstreet, B. Fusion of LiDAR orthowaveforms and hyperspectral imagery for shallow river bathymetry and turbidity estimation. IEEE Trans. Geosci. Remote Sens. 2016, 54, 4165–4177. [Google Scholar] [CrossRef]
- He, X.; Tang, C.; Liu, X.; Zhang, W.; Sun, K.; Xu, J. Object detection in hyperspectral image via unified spectral–spatial feature aggregation. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5521213. [Google Scholar] [CrossRef]
- Umeda, K.; Godin, G.; Rioux, M. Registration of range and color images using gradient constraints and range intensity images. In Proceedings of the 17th International Conference on Pattern Recognition, Cambridge, UK, 23–26 August 2004. [Google Scholar] [CrossRef]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Cavalli, L.; Barath, D.; Pollefeys, M.; Larsson, V. Consensus-adaptive ransac. arXiv 2023, arXiv:2307.14030. [Google Scholar] [CrossRef]
- Zhang, L.; Wang, Y.; Chen, H. Optimal feature selection from non-stationary signals for early anomaly detection using adaptive sparse regression. Mech. Syst. Signal Process. 2024, 208, 111045. [Google Scholar] [CrossRef]
- Le, T.T.; Simmons, W.K.; Misaki, M.; Bodurka, J.; White, B.C.; Savitz, J.; McKinney, B.A. Differential privacy-based evaporative cooling feature selection and classification with relief-F and random forests. Bioinformatics 2017, 33, 2906–2913. [Google Scholar] [CrossRef] [PubMed]
- Su, D.; Yang, F.; Ma, Y.; Zhang, K.; Huang, J.; Wang, M. Classification of coral reefs in the South China Sea by combining airborne LiDAR bathymetry bottom waveforms and bathymetric features. IEEE Trans. Geosci. Remote Sens. 2018, 57, 815–828. [Google Scholar] [CrossRef]
- Zhang, J.; Liu, L.; Silvén, O.; Pietikäinen, M.; Hu, D. Few-shot class-incremental learning for classification and object detection: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 2924–2945. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Shi, D.; Lin, H. Few-shot learning with representative global prototype. Neural Netw. 2024, 180, 106600. [Google Scholar] [CrossRef] [PubMed]
- Sun, Q.; Liu, Y.; Chua, T.S.; Schiele, B. Meta-transfer learning for few-shot learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, Y.; Liu, X. Task-adaptive meta-metric learning for few-shot image classification. Pattern Recognit. 2024, 146, 110025. [Google Scholar] [CrossRef]
- Zhang, J.; Huang, J.; Jin, S.; Lu, S. Vision-language models for vision tasks: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 5625–5644. [Google Scholar] [CrossRef] [PubMed]
- Wang, S.; Wu, J.; Zhang, L. Adaptive global-local prototypical networks for few-shot representation learning. Neural Netw. 2024, 171, 215–228. [Google Scholar] [CrossRef] [PubMed]
- Corso, G.; Stark, H.; Jegelka, S.; Jaakkola, T.; Barzilay, R. Graph neural networks. Nat. Rev. Methods Prim. 2024, 4, 17. [Google Scholar] [CrossRef]
- Li, W.; Du, L.; Meng, X.; Liu, J.; Li, Y.; Zhang, X.; Wan, D. Adaptive denoising for airborne lidar bathymetric full waveforms using emd-based multiresolution analysis. IEEE Geosci. Remote Sens. Lett. 2024, 21, 6500705. [Google Scholar] [CrossRef]
- Chao, Q.; Dianpeng, S.; Xiankun, W.; Mingwei, W.; Bo, S.; Fanlin, Y. Fitting algorithm for airborne laser bathymetric waveforms based on layered heterogeneous model. Infrared Laser Eng. 2019, 48, 0206004. [Google Scholar] [CrossRef]
- Chao, Q.; Fengnian, Z.; Jingwen, W.; Dianpeng, S.; Xiankun, W.; Fanlin, Y. Extraction method for diffuse attenuation coefficient based on airborne LiDAR bathymetric water column waveform. Acta Oceanol. Sin. 2021, 43, 147–154. [Google Scholar] [CrossRef]
- Guastella, M.; Pisciotta, A.; Martorana, R.; D’aLessandro, A. Automated Detection of Recent Mud Extrusions Using UAV Imagery and Deep Learning: A Comparative Analysis of Traditional and CNN-Based Approaches. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2025, 48, 169–179. [Google Scholar] [CrossRef]
- Rahman, M.D.; Matta, G.R. PySIFT: GPU-Resident Deterministic SIFT for Deep Learning Vision Pipelines. arXiv 2026, arXiv:2605.17869. [Google Scholar] [CrossRef]
- Luo, Y.; Wang, X.; Liao, Y.; Fu, Q.; Shu, C.; Wu, Y.; He, Y. A review of homography estimation: Advances and challenges. Electronics 2023, 12, 4977. [Google Scholar] [CrossRef]
- Wei, X.; Sen, G.; Chengzhi, C.; Zhenwei, J.; Changhai, R. Real-time correction of image drift in scanning electron microscope. Opto-Electron. Eng. 2018, 45, 180198–180201. [Google Scholar] [CrossRef]
- Zhenyu, S.; Lihui, W.; Liqiang, L. Three-dimensional reconstruction algorithm based on inverse perspective transformation. In Proceedings of the 2020 International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), Fuzhou, China, 12–14 June 2020. [Google Scholar] [CrossRef]
- Sun, X.; Guo, S.; Liu, S.; Guo, J.; Du, B. A Correlation-Redundancy Guided Evolutionary Algorithm and Its Application to High-Dimensional Feature Selection in Classification. Neural Process. Lett. 2024, 56, 86. [Google Scholar] [CrossRef]
- Veličković, P.; Cucurull, G.; Casanova, A.; Romero, A.; Lio, P.; Bengio, Y. Graph attention networks. arXiv 2017, arXiv:1710.10903. [Google Scholar]
- Zhang, S.; Li, Z.; Zhang, K.; Lu, Y.; Deng, Y.; Tang, L.; Jiang, X.; Ma, J. Deep learning reforms image matching: A survey and outlook. arXiv 2025, arXiv:2506.04619. [Google Scholar] [CrossRef]















| Registration Method | Reprojection Deviation/m | RMSE/m |
|---|---|---|
| SIFT-RANSAC | 0.341 | 0.419 |
| SIFT-PROSAC (proposed) | 0.205 | 0.263 |
| Method | Overall Accuracy | Kappa |
|---|---|---|
| Support Vector Machines | 43.75% | 0.326 |
| Backpropagation neural network | 71.25% | 0.637 |
| Random Forest | 88.78% | 0.861 |
| Prototypical Network | 86.90% | 0.836 |
| Graph Convolutional Networks | 87.50% | 0.840 |
| Graph Sample and Aggregate (GraphSAGE) | 92.50% | 0.904 |
| GAT | 96.25% | 0.953 |
| GAT-PN (proposed) | 97.50% | 0.969 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Chen, S.; Song, X.; Mao, J.; Huang, Y.; Yang, A.; Shan, R.; Gao, H.; Su, D. Few-Shot Classification of Shallow-Water Seabed Sediment and Benthic Cover by Fusing Airborne LiDAR Bathymetry and Multispectral Imagery. Remote Sens. 2026, 18, 2128. https://doi.org/10.3390/rs18132128
Chen S, Song X, Mao J, Huang Y, Yang A, Shan R, Gao H, Su D. Few-Shot Classification of Shallow-Water Seabed Sediment and Benthic Cover by Fusing Airborne LiDAR Bathymetry and Multispectral Imagery. Remote Sensing. 2026; 18(13):2128. https://doi.org/10.3390/rs18132128
Chicago/Turabian StyleChen, Shuohao, Xueshan Song, Jinfeng Mao, Yu Huang, Anxiu Yang, Rui Shan, Han Gao, and Dianpeng Su. 2026. "Few-Shot Classification of Shallow-Water Seabed Sediment and Benthic Cover by Fusing Airborne LiDAR Bathymetry and Multispectral Imagery" Remote Sensing 18, no. 13: 2128. https://doi.org/10.3390/rs18132128
APA StyleChen, S., Song, X., Mao, J., Huang, Y., Yang, A., Shan, R., Gao, H., & Su, D. (2026). Few-Shot Classification of Shallow-Water Seabed Sediment and Benthic Cover by Fusing Airborne LiDAR Bathymetry and Multispectral Imagery. Remote Sensing, 18(13), 2128. https://doi.org/10.3390/rs18132128

