Lightweight Spatial-Frequency Collaborative Interaction Network for RGB-D Salient Object Detection
Abstract
1. Introduction
- We propose SFCINet, an efficient and unified framework for RGB-D SOD that enables spatial-frequency information interaction and synergistic fusion. By establishing a robust global-to-local guidance mechanism, it injects a global frequency-synergized prior into the decoding procedure, effectively mitigating structural drift during multi-scale feature fusions.
- We present the SFS module, which shifts the perspective to the complex Fourier domain to capture holistic contextual features. By exploiting the distinct physical properties of amplitude and phase frequency spectral components, SFS suppresses depth noise while capturing sharp RGB structural boundaries, resolving the multi-modal spectral heterogeneity. It crystallizes a purified global frequency-synergized prior, and simultaneously calibrates the spatial features.
- We introduce the CMGI module to achieve cross-modal fusion without contamination. Driven by a reciprocal anchoring mechanism, it utilizes the spatial confidence derived from the SFS-enhanced features as an invariant structural constraint to mutually gate both modalities, successfully blocking localized noise and deceptive semantic clutter.
2. Related Work
2.1. RGB Salient Object Detection
2.2. RGB-D Salient Object Detection
2.3. Lightweight RGB-D Salient Object Detection
2.4. Applications of Frequency Information
3. Proposed Method
3.1. Overall Architecture
3.2. Spatial-Frequency Synergy (SFS) Module
3.3. Cross-Modal Guidance Interaction (CMGI) Module
3.4. Calibrated Hierarchical Decoder (CHD)
3.5. Loss Function
4. Experimental Results and Analysis
4.1. Datasets
4.2. Evaluation Metrics
4.3. Implementation Details
4.4. Performance Comparison
- (1)
- Quantitative Comparison
- (2)
- Qualitative Comparison
4.5. Ablation Studies
- (1)
- The effectiveness of the SFS module
- (2)
- The effectiveness of the CMGI module
- (3)
- The effectiveness of the CHD module
- (4)
- Efficiency Analysis
4.6. Failure Case Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Yang, N.; Zhong, Q.; Li, K.; Cong, R.; Zhao, Y.; Kwong, S. A reference-free underwater image quality assessment metric in frequency domain. Signal Process. Image Commun. 2021, 94, 116218. [Google Scholar] [CrossRef]
- Lee, H.; Yoon, J.; Jeong, Y.; Yi, K. Moving object detection and tracking based on interaction of static obstacle map and geometric model-free approach for urban autonomous driving. IEEE Trans. Intell. Transp. Syst. 2021, 22, 3275–3284. [Google Scholar] [CrossRef]
- Guo, Y.; Liu, Y.; Georgiou, T.; Lew, M.S. A review of semantic segmentation using deep neural networks. Int. J. Multimed. Inf. Retr. 2018, 7, 87–93. [Google Scholar] [CrossRef]
- Zhu, X.-F.; Wu, X.-J.; Xu, T.; Feng, Z.-H.; Kittler, J. Robust visual object tracking via adaptive attribute-aware discriminative correlation filters. IEEE Trans. Multimed. 2022, 24, 301–312. [Google Scholar] [CrossRef]
- Oprea, S.; Martinez-Gonzalez, P.; Garcia-Garcia, A.; Castro-Vargas, J.A.; Orts-Escolano, S.; Garcia-Rodriguez, J.; Argyros, A. A review on deep learning techniques for video prediction. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 2806–2826. [Google Scholar] [CrossRef]
- Yi, K.; Li, Y.; Tang, H.; Xu, J. Adaptive Depth Enhancement Network for RGB-D Salient Object Detection. IEEE Signal Process. Lett. 2025, 32, 176–180. [Google Scholar] [CrossRef]
- Wei, L.; Zhu, Z.; Mi, Y.; Hu, W. PDNet: Pluralistic depth-aware network for RGB-D salient object detection. Signal Process. 2026, 239, 110271. [Google Scholar] [CrossRef]
- Achanta, R.; Hemami, S.; Estrada, F.; Susstrunk, S. Frequency-tuned salient region detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2009; pp. 1597–1604. [Google Scholar] [CrossRef]
- Ma, X.; Xie, X.; Lam, K.-M.; Zhong, Y. Efficient saliency analysis based on wavelet transform and entropy theory. J. Vis. Commun. Image Represent. 2015, 30, 201–207. [Google Scholar] [CrossRef]
- Jin, X.; Guo, C.; He, Z.; Xu, J.; Wang, Y.; Su, Y. FCMNet: Frequency-aware cross-modality attention networks for RGB-D salient object detection. Neurocomputing 2022, 491, 414–425. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, R.; Fan, X.; Wang, T.; He, X. Pixels, regions, and objects: Multiple enhancement for salient object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2023; pp. 10031–10040. [Google Scholar] [CrossRef]
- Liu, N.; Zhang, N.; Wan, K.; Shao, L.; Han, J. Visual saliency transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: Piscataway, NJ, USA, 2021; pp. 4702–4712. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, Y.; Liu, Y.; Qin, C.; Coleman, S.A.; Kerr, D. LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection. IEEE Trans. Multimed. 2024, 26, 5207–5222. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, Y.; Liu, Y.; Zhu, D.; Coleman, S.A.; Kerr, D. ELWNet: An Extremely Lightweight Approach for Real-Time Salient Object Detection. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 6404–6417. [Google Scholar] [CrossRef]
- Wen, F.; Wang, Q.; Zou, R.; Wang, Y.; Liu, F.; Chen, Y.; Yu, L.; Du, S.; Yuan, C. A Salient Object Detection Method Based on Boundary Enhancement. Sensors 2023, 23, 7077. [Google Scholar] [CrossRef]
- Peng, Y.; Zhai, Z.; Feng, M. SLMSF-Net: A Semantic Localization and Multi-Scale Fusion Network for RGB-D Salient Object Detection. Sensors 2024, 24, 1117. [Google Scholar] [CrossRef]
- Wei, L.; Zhu, Z. Modal-Aware Interaction Network for RGB-D Salient Object Detection. IEEE Trans. Instrum. Meas. 2025, 74, 5026212. [Google Scholar] [CrossRef]
- Zhou, W.; Zhu, Y.; Lei, J.; Wan, J.; Yu, L. IRFR-Net: Interactive recursive feature-reshaping network for detecting salient objects in RGB-D images. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 7820–7833. [Google Scholar] [CrossRef] [PubMed]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Salt Lake City, UT, USA, 2018; pp. 4510–4520. [Google Scholar] [CrossRef]
- Zhang, X.; Zhou, X.; Lin, M.; Sun, J. ShuffleNet: An extremely efficient convolutional neural network for mobile devices. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Salt Lake City, UT, USA, 2018; pp. 6848–6856. [Google Scholar] [CrossRef]
- Mehta, S.; Rastegari, M. MobileViT: Light-weight, general-purpose, and mobile-friendly vision transformer. arXiv 2021, arXiv:2110.02178. [Google Scholar] [CrossRef]
- Jin, X.; Yi, K.; Xu, J. MoADNet: Mobile asymmetric dual-stream networks for real-time and lightweight RGB-D salient object detection. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 7632–7645. [Google Scholar] [CrossRef]
- Zhang, J.; Dai, Y.; Zhang, T.; Harandi, M.; Barnes, N.; Hartley, R. FasterSal: Robust and real-time single-stream architecture for RGB-D salient object detection. IEEE Trans. Multimed. 2025, 27, 2477–2488. [Google Scholar] [CrossRef]
- Zhang, C.; Chen, F.; Huang, L.; Peng, Z.; Hu, X. Multiple cross-modal complementation network for lightweight RGB-D salient object detection. J. Vis. Commun. Image Represent. 2025, 113, 104622. [Google Scholar] [CrossRef]
- Mao, X.; Liu, Y.; Liu, F.; Li, Q.; Shen, W.; Wang, Y. Intriguing findings of frequency selection for image deblurring. In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI); AAAI: Washington, DC, USA, 2023; pp. 1905–1913. [Google Scholar] [CrossRef]
- Qiao, Y.; Shao, M.; Wang, L.; Zuo, W. Learning depth-density priors for Fourier-based unpaired image restoration. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 2604–2618. [Google Scholar] [CrossRef]
- Zhao, R.; Wang, M.; Wang, F.; Sun, F.; Li, H. Spatial-frequency collaborative learning for camouflaged object detection. IEEE Trans. Multimed. 2025, 27, 7756–7768. [Google Scholar] [CrossRef]
- Zhou, H.; Hong, W.; Zhang, Z.; Liu, X.; Wu, X.-J. Lightweight Spatial-Channel-Frequency Network for RGB-Thermal Salient Object Detection. IEEE Signal Process. Lett. 2025, 32, 4009–4013. [Google Scholar] [CrossRef]
- Fan, D.-P.; Lin, Z.; Zhang, Z.; Zhu, M.; Cheng, M.-M. Rethinking RGB-D salient object detection: Models, datasets, and large-scale benchmarks. IEEE Trans. Neural Netw. Learn. Syst. 2021, 32, 2075–2089. [Google Scholar] [CrossRef] [PubMed]
- Ju, R.; Ge, L.; Geng, W.; Ren, T.; Wu, G. Depth saliency based on anisotropic center-surround difference. In Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP); IEEE: Piscataway, NJ, USA, 2014; pp. 1115–1119. [Google Scholar] [CrossRef]
- Peng, H.; Li, B.; Xiong, W.; Hu, W.; Ji, R. RGBD salient object detection: A benchmark and algorithms. In Computer Vision—ECCV 2014; Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T., Eds.; Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2014; Volume 8691, pp. 92–109. [Google Scholar] [CrossRef]
- Niu, Y.; Geng, Y.; Li, X.; Liu, F. Leveraging stereopsis for saliency analysis. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2012; pp. 454–461. [Google Scholar] [CrossRef]
- Piao, Y.; Ji, W.; Li, J.; Zhang, M.; Lu, H. Depth-induced multi-scale recurrent attention network for saliency detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: Piscataway, NJ, USA, 2019; pp. 7253–7262. [Google Scholar] [CrossRef]
- Perazzi, F.; Krähenbühl, P.; Pritch, Y.; Hornung, A. Saliency filters: Contrast based filtering for salient region detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: Piscataway, NJ, USA, 2012; pp. 733–740. [Google Scholar] [CrossRef]
- Fan, D.-P.; Cheng, M.-M.; Liu, Y.; Li, T.; Borji, A. Structure-measure: A new way to evaluate foreground maps. In Proceedings of the IEEE International Conference on Computer Vision (ICCV); IEEE: Piscataway, NJ, USA, 2017; pp. 4548–4557. [Google Scholar] [CrossRef]
- Fan, D.-P.; Gong, C.; Cao, Y.; Ren, B.; Cheng, M.-M.; Borji, A. Enhanced-alignment measure for binary foreground map evaluation. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI); IJCAI: Stockholm, Sweden, 2018; pp. 698–704. [Google Scholar] [CrossRef]
- Jin, W.-D.; Xu, J.; Han, Q.; Zhang, Y.; Cheng, M.-M. CDNet: Complementary depth network for RGB-D salient object detection. IEEE Trans. Image Process. 2021, 31, 3376–3390. [Google Scholar] [CrossRef]
- Zhou, W.; Zhu, Y.; Lei, J.; Wan, J.; Yu, L. CCAFNet: Crossflow and cross-scale adaptive fusion network for detecting salient objects in RGB-D images. IEEE Trans. Multimed. 2022, 24, 2192–2204. [Google Scholar] [CrossRef]
- Pang, Y.; Zhao, X.; Zhang, L.; Lu, H. CAVER: Cross-modal view-mixed transformer for bi-modal salient object detection. IEEE Trans. Image Process. 2023, 32, 892–904. [Google Scholar] [CrossRef] [PubMed]
- Zhang, X.; Xu, Y.; Wang, T.; Liao, T. Multi-prior driven network for RGB-D salient object detection. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 9209–9222. [Google Scholar] [CrossRef]
- Wang, R.; Wang, F.; Su, Y.; Sun, J.; Sun, F.; Li, H. Attention-guided multi-modality interaction network for RGB-D salient object detection. ACM Trans. Multimed. Comput. Commun. Appl. 2024, 20, 68. [Google Scholar] [CrossRef]
- Wu, Z.; Allibert, G.; Meriaudeau, F.; Ma, C.; Demonceaux, C. HiDAnet: RGB-D Salient Object Detection via Hierarchical Depth Awareness. IEEE Trans. Image Process. 2023, 32, 2160–2173. [Google Scholar] [CrossRef] [PubMed]
- Wu, J.; Hao, F.; Liang, W.; Xu, J. Transformer fusion and pixel-level contrastive learning for RGB-D salient object detection. IEEE Trans. Multimed. 2024, 26, 1011–1026. [Google Scholar] [CrossRef]
- Tang, Y.; Li, M. DMGNet: Depth mask guiding network for RGB-D salient object detection. Neural Netw. 2024, 180, 106751. [Google Scholar] [CrossRef] [PubMed]
- Huang, N.; Jiao, Q.; Zhang, Q.; Han, J. Middle-level feature fusion for lightweight RGB-D salient object detection. IEEE Trans. Image Process. 2022, 31, 6621–6634. [Google Scholar] [CrossRef]
- Zhou, W.; Zhu, Y.; Lei, J.; Yang, R.; Yu, L. LSNet: Lightweight spatial boosting network for detecting salient objects in RGB-thermal images. IEEE Trans. Image Process. 2023, 32, 1329–1340. [Google Scholar] [CrossRef]
- Zeng, Z.; Liu, H.; Chen, F.; Tan, X. AirSOD: A lightweight network for RGB-D salient object detection. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 1656–1669. [Google Scholar] [CrossRef]
- Gao, H.; Wang, F.; Wang, M.; Sun, F.; Li, H. Highly efficient RGB-D salient object detection with adaptive fusion and attention regulation. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 3104–3118. [Google Scholar] [CrossRef]
- Zhong, M.; Sun, J.; Ren, P.; Wang, F.; Sun, F. MAGNet: Multi-scale awareness and global fusion network for RGB-D salient object detection. Knowl. Based Syst. 2024, 299, 112126. [Google Scholar] [CrossRef]
- Ren, P.; Bai, T.; Sun, F. Bio-inspired two-stage network for efficient RGB-D salient object detection. Neural Netw. 2025, 185, 107244. [Google Scholar] [CrossRef]







| Datasets | Metrics | TIP21 | TMM22 | TIP23 | TCSVT23 | TOMM23 | TIP23 | TMM24 | NN24 | Ours |
|---|---|---|---|---|---|---|---|---|---|---|
| CDNet | CCAF | CAVER | MPDNet | AMINet | HiDANet | TPCL | DMGNet | |||
| Params (M) | 32.9 | 41.8 | 55.8 | 52.0 | 199.1 | 523.0 | 129.47 | - | 10.08 | |
| FLOPs (G) | 72.0 | 76.6 | 21.86 | 52.0 | 124.7 | 71.5 | 212.02 | - | 4.27 | |
| SIP | 0.805 | 0.864 | 0.884 | 0.884 | - | 0.893 | 0.903 | 0.877 | 0.911 | |
| 0.076 | 0.054 | 0.043 | 0.048 | - | 0.043 | 0.038 | 0.049 | 0.041 | ||
| 0.823 | 0.876 | 0.893 | 0.904 | - | 0.892 | 0.900 | 0.878 | 0.899 | ||
| 0.880 | 0.916 | 0.927 | 0.925 | - | 0.930 | 0.937 | 0.924 | 0.943 | ||
| NJU2K | 0.866 | 0.897 | 0.874 | 0.892 | 0.909 | 0.921 | 0.918 | 0.884 | 0.921 | |
| 0.048 | 0.037 | 0.032 | 0.041 | 0.036 | 0.029 | 0.028 | 0.035 | 0.031 | ||
| 0.885 | 0.910 | 0.920 | 0.912 | 0.906 | 0.926 | 0.926 | 0.913 | 0.920 | ||
| 0.908 | 0.942 | 0.922 | 0.937 | 0.943 | 0.952 | 0.928 | 0.930 | 0.952 | ||
| NLPR | 0.848 | 0.881 | 0.895 | 0.897 | 0.893 | 0.910 | 0.908 | 0.893 | 0.921 | |
| 0.032 | 0.026 | 0.022 | 0.023 | 0.025 | 0.021 | 0.017 | 0.022 | 0.023 | ||
| 0.902 | 0.922 | 0.929 | 0.932 | 0.907 | 0.929 | 0.935 | 0.924 | 0.929 | ||
| 0.935 | 0.953 | 0.959 | 0.951 | 0.950 | 0.959 | 0.965 | 0.956 | 0.965 | ||
| STERE | 0.873 | 0.869 | 0.872 | 0.884 | 0.882 | 0.897 | 0.896 | 0.903 | 0.907 | |
| 0.042 | 0.044 | 0.034 | 0.039 | 0.040 | 0.035 | 0.031 | 0.032 | 0.037 | ||
| 0.896 | 0.891 | 0.914 | 0.915 | 0.890 | 0.911 | 0.916 | 0.917 | 0.910 | ||
| 0.922 | 0.933 | 0.931 | 0.936 | 0.937 | 0.944 | 0.928 | 0.950 | 0.951 | ||
| DUT-RGBD | 0.874 | 0.904 | 0.919 | - | 0.950 | 0.920 | 0.935 | - | 0.947 | |
| 0.048 | 0.038 | 0.029 | - | 0.022 | 0.032 | 0.024 | - | 0.026 | ||
| 0.880 | 0.903 | 0.930 | - | 0.944 | 0.924 | 0.936 | - | 0.935 | ||
| 0.918 | 0.944 | 0.955 | - | 0.968 | 0.951 | 0.960 | - | 0.968 | ||
| Datasets | Metrics | TIP22 | TIP22 | TIP23 | TCSVT24 | TCSVT24 | KBS24 | TMM25 | NN25 | Ours |
|---|---|---|---|---|---|---|---|---|---|---|
| MoADN | MMF | LSNet | AirSOD | HENet | MAGNet | FasterSal | BTNet | |||
| Params (M) | 5.0 | 3.9 | 5.4 | 2.4 | 2.7 | 5.2 | 5.2 | 3.8 | 10.08 | |
| FLOPs (G) | 1.58 | 10.9 | 1.21 | 0.9 | 2.2 | 2.2 | 0.9 | 7.2 | 4.27 | |
| SIP | 0.850 | 0.871 | 0.881 | 0.855 | 0.894 | 0.894 | 0.870 | 0.891 | 0.911 | |
| 0.058 | 0.048 | 0.050 | 0.060 | 0.041 | 0.041 | 0.049 | 0.046 | 0.041 | ||
| 0.865 | 0.882 | 0.886 | 0.859 | 0.899 | 0.899 | 0.870 | 0.890 | 0.899 | ||
| 0.911 | 0.919 | 0.920 | 0.904 | 0.933 | 0.932 | 0.929 | 0.923 | 0.943 | ||
| NJU2K | 0.892 | 0.885 | 0.899 | 0.889 | 0.899 | 0.900 | 0.906 | 0.911 | 0.921 | |
| 0.041 | 0.042 | 0.039 | 0.039 | 0.034 | 0.034 | 0.034 | 0.035 | 0.031 | ||
| 0.906 | 0.898 | 0.911 | 0.908 | 0.918 | 0.918 | 0.908 | 0.925 | 0.920 | ||
| 0.935 | 0.925 | 0.939 | 0.934 | 0.912 | 0.942 | 0.949 | 0.944 | 0.952 | ||
| NLPR | 0.875 | 0.887 | 0.891 | 0.884 | 0.892 | 0.904 | 0.902 | 0.911 | 0.921 | |
| 0.027 | 0.027 | 0.025 | 0.023 | 0.025 | 0.021 | 0.022 | 0.021 | 0.023 | ||
| 0.915 | 0.917 | 0.919 | 0.925 | 0.916 | 0.932 | 0.920 | 0.927 | 0.929 | ||
| 0.947 | 0.943 | 0.951 | 0.957 | 0.958 | 0.962 | 0.960 | 0.956 | 0.965 | ||
| STERE | 0.868 | 0.881 | 0.850 | 0.865 | 0.895 | 0.895 | 0.875 | 0.893 | 0.907 | |
| 0.042 | 0.039 | 0.055 | 0.043 | 0.036 | 0.036 | 0.040 | 0.038 | 0.037 | ||
| 0.898 | 0.903 | 0.871 | 0.895 | 0.916 | 0.916 | 0.888 | 0.920 | 0.910 | ||
| 0.935 | 0.936 | 0.909 | 0.932 | 0.933 | 0.943 | 0.939 | 0.938 | 0.951 | ||
| DUT-RGBD | 0.923 | 0.920 | 0.844 | - | 0.922 | 0.921 | 0.925 | 0.929 | 0.947 | |
| 0.031 | 0.033 | 0.061 | - | 0.031 | 0.030 | 0.030 | 0.028 | 0.026 | ||
| 0.927 | 0.907 | 0.867 | - | 0.930 | 0.929 | 0.918 | 0.927 | 0.935 | ||
| 0.959 | 0.945 | 0.887 | - | 0.950 | 0.955 | 0.958 | 0.956 | 0.968 | ||
| Variant | SFS | CMGI | CHD | DUT-RGBD | SIP | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| (a) | х | ✓ | х | 0.939 | 0.029 | 0.929 | 0.963 | 0.902 | 0.046 | 0.892 | 0.935 |
| (b) | ✓ | ✓ | х | 0.942 | 0.027 | 0.933 | 0.965 | 0.905 | 0.044 | 0.893 | 0.938 |
| (c) | ✓ | х | ✓ | 0.943 | 0.028 | 0.932 | 0.964 | 0.902 | 0.043 | 0.894 | 0.936 |
| (d) | ✓ | ✓ | ✓ | 0.947 | 0.026 | 0.935 | 0.967 | 0.911 | 0.041 | 0.899 | 0.943 |
| Variant | DUT-RGBD | SIP | ||||||
|---|---|---|---|---|---|---|---|---|
| (1) CA + SA | 0.944 | 0.027 | 0.932 | 0.964 | 0.902 | 0.043 | 0.892 | 0.937 |
| (2) w/o amplitude | 0.942 | 0.029 | 0.931 | 0.962 | 0.906 | 0.043 | 0.894 | 0.939 |
| (c) w/o phase | 0.943 | 0.028 | 0.933 | 0.965 | 0.907 | 0.045 | 0.890 | 0.935 |
| Ours | 0.947 | 0.026 | 0.935 | 0.967 | 0.911 | 0.041 | 0.899 | 0.943 |
| Variant | SFS | CMGI | CHD | Params | FLOPS | FPS |
|---|---|---|---|---|---|---|
| (a) | х | ✓ | х | 9.88 | 4.02 | 277 |
| (b) | ✓ | ✓ | х | 10.05 | 4.24 | 232 |
| (c) | ✓ | х | ✓ | 10.07 | 4.26 | 238 |
| (d) | ✓ | ✓ | ✓ | 10.08 | 4.27 | 236 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Lu, Y.; Cui, Z. Lightweight Spatial-Frequency Collaborative Interaction Network for RGB-D Salient Object Detection. Sensors 2026, 26, 3708. https://doi.org/10.3390/s26123708
Lu Y, Cui Z. Lightweight Spatial-Frequency Collaborative Interaction Network for RGB-D Salient Object Detection. Sensors. 2026; 26(12):3708. https://doi.org/10.3390/s26123708
Chicago/Turabian StyleLu, Yitong, and Ziguan Cui. 2026. "Lightweight Spatial-Frequency Collaborative Interaction Network for RGB-D Salient Object Detection" Sensors 26, no. 12: 3708. https://doi.org/10.3390/s26123708
APA StyleLu, Y., & Cui, Z. (2026). Lightweight Spatial-Frequency Collaborative Interaction Network for RGB-D Salient Object Detection. Sensors, 26(12), 3708. https://doi.org/10.3390/s26123708

