FSSM: Frequency-Enhanced State Space Modeling with FFT-Based Two-Sided Non-Causal Convolution for Image Dehazing
Abstract
1. Introduction
- We propose an FFT-based State Space Block (FFTSSB), which performs two-sided non-causal convolution in the frequency domain to replace conventional recursive state propagation. This design enables implicit state propagation with reduced computational redundancy, facilitating efficient global dependency modeling while preserving physical interpretability.
- We design a Frequency-Aware State Interaction (FASI) Block, which tightly couples FFTSSB with the Frequency-Aware Discriminative Enhancement Block (FDEB). This unified spatial–frequency modeling unit enhances texture and edge restoration and improves robustness to complex haze structures.
- We construct a hierarchical multi-scale encoder–decoder framework with skip connections, allowing effective cross-scale feature interaction and fusion. This design significantly improves detail recovery and structural consistency in dehazed images.
- Extensive experiments on the HazyDet dataset demonstrate that FSSM achieves favorable quantitative and qualitative dehazing performance, with improvements in PSNR and SSIM over representative comparison methods. Ablation studies further verify the effectiveness of the proposed FFTSSB and FDEB modules in enhancing global structural consistency and local detail restoration.
2. Related Work
2.1. Traditional Image-Dehazing Methods
2.2. Deep Learning-Based Dehazing Methods
2.3. State Space Models for Image Dehazing
3. Methodology
3.1. Notation
3.2. Overall Architecture
3.3. State Space Modeling and Its Extensions
3.3.1. State Space Model Foundation
3.3.2. FFT-Based State Space Block (FFTSSB)
3.3.3. Frequency-Aware Discriminative Enhancement Block (FDEB)
3.3.4. Frequency-Aware State Interaction Block (FASI Block)
3.4. Loss Function
4. Experiments
4.1. Experimental Settings
4.1.1. Dataset
4.1.2. Comparative Methods
4.1.3. Implementation Details
4.1.4. Evaluation Metrics
- PSNR (Peak Signal-to-Noise Ratio): measures the pixel-level reconstruction accuracy between the dehazed image and the ground-truth image;
- SSIM (Structural Similarity Index): evaluates the consistency of structural information;
- NIQE (Natural Image Quality Evaluator): reflects the perceptual naturalness of images, where lower values indicate better visual quality.
4.2. Quantitative Evaluations
4.3. Computational Complexity Analysis
4.4. Qualitative Evaluations
4.5. Ablation Study
4.5.1. Ablation Study on FFTSSB
4.5.2. Ablation Study on FDEB
4.5.3. Qualitative Analysis of Ablation Results
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Qiu, Z.; Gong, T.; Liang, Z.; Chen, T.; Cong, R.; Bai, H.; Zhao, Y. Perception-Oriented UAV Image Dehazing Based on Superpixel Scene Prior. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5913519. [Google Scholar]
- Liu, Y.; Wang, X.; Hu, E.; Wang, A.; Shiri, B.; Lin, W. VNDHR: Variational Single Nighttime Image Dehazing for Enhancing Visibility in Intelligent Transportation Systems via Hybrid Regularization. IEEE Trans. Intell. Transp. Syst. 2025, 26, 10189–10203. [Google Scholar] [CrossRef]
- Shao, Y.; Li, L.; Ren, W.; Gao, C.; Sang, N. Domain Adaptation for Image Dehazing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 14–19 June 2020; pp. 2808–2817. [Google Scholar]
- He, K.; Sun, J.; Tang, X. Single Image Haze Removal Using Dark Channel Prior. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 2341–2353. [Google Scholar]
- Cui, Y.; Ren, W.; Cao, X.; Knoll, A. Focal Network for Image Restoration. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–6 October 2023; pp. 13001–13011. [Google Scholar]
- Cui, Y.; Tao, Y.; Jing, L.; Knoll, A. Strip Attention for Image Restoration. In Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI 2023), Cape Town, South Africa, 19–25 August 2023; pp. 645–653. [Google Scholar]
- Tong, L.; Liu, Y.; Ye, T.; Li, W.; Chen, L.; Chen, E. Parallel Cross Strip Attention Network for Single Image Dehazing. In Proceedings of the 2024 5th International Conference on Computing, Networks and Internet of Things (CNIOT 2024), Tokyo, Japan, 23–25 May 2024; pp. 148–154. [Google Scholar]
- Liu, X.; Ma, Y.; Shi, Z.; Chen, J. GridDehazeNet: Attention-Based Multi-Scale Network for Image Dehazing. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 7314–7323. [Google Scholar]
- Dong, H.; Pan, J.; Xiang, L.; Hu, Z.; Yang, M.-H. Multi-Scale Boosted Dehazing Network with Dense Feature Fusion. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 14–19 June 2020. [Google Scholar]
- Guo, C.-L.; Yan, Q.; Anwar, S.; Cong, R.; Ren, W.; Li, C. Image Dehazing Transformer with Transmission-Aware 3D Position Embedding. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 19–24 June 2022; pp. 5812–5820. [Google Scholar]
- Song, Y.; He, Z.; Qian, H.; Du, X. Vision Transformers for Single Image Dehazing. IEEE Trans. Image Process. 2023, 32, 1927–1941. [Google Scholar] [CrossRef] [PubMed]
- Pushpalatha, D.; Prithvi, P. LDFormer: Lightweight Dehazing Transformer. Neural Comput. Appl. 2025, 37, 20049–20068. [Google Scholar] [CrossRef]
- Zhang, Z.; Feng, Z.; Long, A.; Wang, Z. LWTD: A Novel Lightweight Transformer-like CNN Architecture for Driving Scene Dehazing. Int. J. Mach. Learn. Cybern. 2025, 16, 1303–1326. [Google Scholar] [CrossRef]
- Sun, J.; Liu, H.; Wang, Y.; Zhang, X.P.; Wei, M. WDMamba: When Wavelet Degradation Prior Meets Vision Mamba for Image Dehazing. arXiv 2025, arXiv:2505.04369. [Google Scholar] [CrossRef]
- Guo, H.; Guo, Y.; Zha, Y.; Zhang, Y.; Li, W.; Dai, T.; Xia, S.-T.; Li, Y. MambaIRv2: Attentive State Space Restoration. In Proceedings of the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 11–15 June 2025; pp. 28124–28133. [Google Scholar]
- Somvanshi, S.; Islam, M.M.; Mimi, M.S.; Polock, S.B.B.; Chhetri, G.; Das, S. From S4 to Mamba: A Comprehensive Survey on Structured State Space Models. arXiv 2025, arXiv:2503.18970. [Google Scholar]
- Gu, A.; Dao, T. Mamba: Linear-Time Sequence Modeling with Selective State Spaces. In Proceedings of the 2024 Conference on Language Modeling (COLM), Philadelphia, PA, USA, 7–9 October 2024. [Google Scholar]
- Guo, H.; Li, J.; Dai, T.; Ouyang, Z.; Ren, X.; Xia, S.T. MambaIR: A Simple Baseline for Image Restoration with State Space Model. In Proceedings of the 2024 European Conference on Computer Vision (ECCV 2024), Milan, Italy, 29 September–4 October 2024; pp. 222–241. [Google Scholar]
- Liu, Y.; Tian, Y.; Zhao, Y.; Yu, H.; Xie, L.; Wang, Y.; Ye, Q.; Jiao, J.; Liu, Y. VMamba: Visual State Space Model. Adv. Neural Inf. Process. Syst. 2024, 37, 103031–103063. [Google Scholar]
- Shi, Y.; Xia, B.; Jin, X.; Wang, X.; Zhao, T.; Xia, X.; Xiao, X.; Yang, W. VMambaIR: Visual State Space Model for Image Restoration. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 5560–5574. [Google Scholar] [CrossRef]
- Zhu, Q.; Mai, J.; Shao, L. A Fast Single Image Haze Removal Algorithm Using Color Attenuation Prior. IEEE Trans. Image Process. 2015, 24, 3522–3533. [Google Scholar] [CrossRef]
- Berman, D.; Avidan, S. Non-Local Image Dehazing. In Proceedings of the 2016 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2016), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 1674–1682. [Google Scholar]
- Su, Y.Z.; He, C.; Cui, Z.G.; Li, A.H.; Wang, N. Physical Model and Image Translation Fused Network for Single-Image Dehazing. Pattern Recognit. 2023, 142, 109700. [Google Scholar] [CrossRef]
- Li, B.; Peng, X.; Wang, Z.; Xu, J.; Feng, D. AOD-Net: All-in-One Dehazing Network. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV 2017), Venice, Italy, 22–29 October 2017; pp. 4770–4778. [Google Scholar]
- Qin, X.; Wang, Z.; Bai, Y.; Xie, X.; Jia, H. FFA-Net: Feature Fusion Attention Network for Single Image Dehazing. In Proceedings of the 2020 AAAI Conference on Artificial Intelligence (AAAI-20), New York, NY, USA, 7–12 February 2020; Volume 34, pp. 11908–11915. [Google Scholar]
- Lu, L.; Xiong, Q.; Xu, B.; Chu, D. MixDehazeNet: Mix Structure Block for Image Dehazing Network. In Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN 2024), Yokohama, Japan, 30 June–5 July 2024; pp. 1–10. [Google Scholar]
- Cui, Y.; Ren, W.; Knoll, A. Learning Efficient Image Restoration with Advanced Attention Mechanisms. In Proceedings of the 2024 AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, BC, Canada, 20–27 February 2024; Volume 38, pp. 1426–1434. [Google Scholar]
- Guan, X.; He, R.; Wang, L.; Zhou, H.; Liu, Y.; Xiong, H. DWTMA-Net: Discrete Wavelet Transform and Multi-Dimensional Attention Network for Remote Sensing Image Dehazing. Remote Sens. 2025, 17, 2033. [Google Scholar] [CrossRef]
- Qiu, Y.; Zhang, K.; Wang, C.; Luo, W.; Li, H.; Jin, Z. MB-TaylorFormer: Multi-Branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV 2023), Paris, France, 2–6 October 2023; pp. 12802–12813. [Google Scholar]
- Feng, X.; Ji, H.; Pei, W.; Li, J.; Lu, G.; Zhang, D. U2-Former: Nested U-Shaped Transformer for Image Restoration via Multi-View Contrastive Learning. IEEE Trans. Circuits Syst. Video Technol. 2023, 34, 168–181. [Google Scholar] [CrossRef]
- Wang, C.; Pan, J.; Lin, W.; Dong, J.; Wang, W.; Wu, X.M. SelfPromer: Self-Prompt Dehazing Transformers with Depth Consistency. In Proceedings of the 2024 AAAI Conference on Artificial Intelligence (AAAI-24), Vancouver, BC, Canada, 20–27 February 2024; Volume 38, pp. 5327–5335. [Google Scholar]
- Chi, K.; Yuan, Y.; Wang, Q. Trinity-Net: Gradient-Guided Swin Transformer-Based Remote Sensing Image Dehazing and Beyond. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–14. [Google Scholar] [CrossRef]
- Liu, H.; Li, X.; Tan, T. Interaction-Guided Two-Branch Image Dehazing Network. In Proceedings of the 2024 Asian Conference on Computer Vision (ACCV 2024), Hanoi, Vietnam, 8–12 December 2024; pp. 4069–4084. [Google Scholar]
- Wang, Y.; Yan, X.; Wang, F.L.; Xie, H.; Yang, W.; Zhang, X.P.; Wei, M. UCL-Dehaze: Toward Real-World Image Dehazing via Unsupervised Contrastive Learning. IEEE Trans. Image Process. 2024, 33, 1361–1374. [Google Scholar] [CrossRef]
- Wang, X.; Yang, G.; Ye, T.; Liu, Y. Dehaze-RetinexGAN: Real-World Image Dehazing via Retinex-Based Generative Adversarial Network. In Proceedings of the 2025 AAAI Conference on Artificial Intelligence (AAAI-25), Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 7997–8005. [Google Scholar]
- Huang, C.; Li, S.; Chen, X.; Liu, J.; Li, D. Haze Has Many Faces: Multi-Domain Haze Style Transfer for Diverse Haze Removal. Pattern Recognit. 2026, 175, 113094. [Google Scholar] [CrossRef]
- Wang, Y.; Qu, Q.; He, K.; Liu, T.; Du, X.; Lei, T.; Nandi, A.K. PRDiff-Dehaze: Toward Non-Homogeneous Haze Image Restoration via Progressive Refinement Diffusion. Pattern Recognit. 2026, 179, 113688. [Google Scholar] [CrossRef]
- Murtaza, A.; Khairuddin, U.; Mohd Fudzi, A.A.; Hamamoto, K.; Fang, Y.; Omar, Z. WaveLiteDehaze-Net: A Low-Parameter Wavelet-Based Method for Real-Time Dehazing. CAAI Trans. Intell. Technol. 2025, 10, 1033–1048. [Google Scholar] [CrossRef]
- Zhang, Q.; Shao, M.; Chen, X.; Lv, X.; Xu, K. Wave-MambaAD: Wavelet-Driven State Space Model for Multi-Class Unsupervised Anomaly Detection. In Proceedings of the 2025 IEEE/CVF International Conference on Computer Vision (ICCV 2025), Honolulu, HI, USA, 19–25 October 2025; pp. 20868–20877. [Google Scholar]
- Zheng, Z.; Wu, C. U-Shaped Vision Mamba for Single Image Dehazing. arXiv 2024, arXiv:2402.04139. [Google Scholar] [CrossRef]
- Wang, Y.; Chen, L.; Hu, B.; Liu, H.; Zhang, X.P.; Wei, M. Laplace-Mamba: Laplace Frequency Prior-Guided Mamba-CNN Fusion Network for Image Dehazing. arXiv 2025, arXiv:2507.00501. [Google Scholar]
- Su, X.; Li, S.; Cui, Y.; Cao, M.; Zhang, Y.; Chen, Z.; Wu, Z.; Wang, Z.; Zhang, Y.; Yuan, X. Prior-Guided Hierarchical Harmonization Network for Efficient Image Dehazing. In Proceedings of the 2025 AAAI Conference on Artificial Intelligence (AAAI-25), Philadelphia, PA, USA, 25 February–4 March 2025; Volume 39, pp. 7042–7050. [Google Scholar]
- Gu, A.; Goel, K.; Ré, C. Efficiently Modeling Long Sequences with Structured State Spaces. arXiv 2021, arXiv:2111.00396. [Google Scholar]
- Poli, M.; Massaroli, S.; Nguyen, E.; Fu, D.Y.; Dao, T.; Baccus, S.; Bengio, Y.; Ermon, S.; Ré, C. Hyena Hierarchy: Towards Larger Convolutional Language Models. In Proceedings of the 2023 International Conference on Machine Learning (ICML 2023), Honolulu, HI, USA, 23–29 July 2023; pp. 28043–28078. [Google Scholar]
- Kong, L.; Dong, J.; Tang, J.; Yang, M.H.; Pan, J. Efficient Visual State Space Model for Image Deblurring. In Proceedings of the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2025), Nashville, TN, USA, 11–15 June 2025; pp. 12710–12719. [Google Scholar]
- Kong, L.; Dong, J.; Tang, J.; Yang, M.H.; Pan, J. Efficient Frequency Domain-Based Transformers for High-Quality Image Deblurring. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023), Vancouver, BC, Canada, 18–22 June 2023. [Google Scholar]
- Feng, C.; Chen, Z.; Li, X.; Wang, C.; Yang, J.; Cheng, M.M.; Dai, Y.; Fu, Q. HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth Cues in Hazy Scenes. arXiv 2024, arXiv:2409.19833. [Google Scholar] [CrossRef]




| Methods | Publication | PSNR (dB) | SSIM | NIQE |
|---|---|---|---|---|
| DCP | TPAMI 2011 | 17.03 | 0.8024 | 12.30 |
| AOD-Net | ICCV 2017 | 18.99 | 0.7808 | 12.27 |
| GridDehaze-Net | ICCV 2019 | 26.66 | 0.8801 | 11.33 |
| FFA-Net | AAAI 2020 | 27.12 | 0.8782 | 11.31 |
| MixDehaze-Net | IJCNN 2024 | 28.75 | 0.9068 | 11.30 |
| OK-Net | AAAI 2024 | 27.76 | 0.8875 | 11.36 |
| DWTMA-Net | Sensors 2025 | 29.02 | 0.9108 | 11.23 |
| Ours | – | 29.97 | 0.9396 | 11.20 |
| Method | FLOPs | Parameters |
|---|---|---|
| DCP | – | – |
| AOD-Net | 457.70 M | 1.76 K |
| GridDehaze-Net | 85.72 G | 955.75 K |
| FFA-Net | 624.20 G | 4.68 M |
| MixDehaze-Net | 114.30 G | 3.17 M |
| OK-Net | 158.20 G | 4.43 M |
| DWTMA-Net | 188.72 G | 8.34 M |
| Ours | 246.66 G | 16.74 M |
| Method | Global Modeling Module | PSNR (dB) | SSIM |
|---|---|---|---|
| Linear Attention | Linear Attention | 29.0133 | 0.9295 |
| Multi-Head Attention | Self-Attention (MHSA) | 29.9250 | 0.9236 |
| Proposed Method | FFTSSB | 29.9695 | 0.9396 |
| Method | FFT Branch | FFN Type | PSNR (dB) | SSIM |
|---|---|---|---|---|
| w/o FFT Branch | × | Spatial Gated FFN | 29.7928 | 0.9389 |
| Replace FDEB with Standard FFN | × | Standard FFN (MLP) | 29.2409 | 0.9365 |
| Proposed Method | ✓ | FDEB | 29.9695 | 0.9396 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zeng, L.; Huang, Y. FSSM: Frequency-Enhanced State Space Modeling with FFT-Based Two-Sided Non-Causal Convolution for Image Dehazing. J. Imaging 2026, 12, 260. https://doi.org/10.3390/jimaging12060260
Zeng L, Huang Y. FSSM: Frequency-Enhanced State Space Modeling with FFT-Based Two-Sided Non-Causal Convolution for Image Dehazing. Journal of Imaging. 2026; 12(6):260. https://doi.org/10.3390/jimaging12060260
Chicago/Turabian StyleZeng, Li, and Yinqing Huang. 2026. "FSSM: Frequency-Enhanced State Space Modeling with FFT-Based Two-Sided Non-Causal Convolution for Image Dehazing" Journal of Imaging 12, no. 6: 260. https://doi.org/10.3390/jimaging12060260
APA StyleZeng, L., & Huang, Y. (2026). FSSM: Frequency-Enhanced State Space Modeling with FFT-Based Two-Sided Non-Causal Convolution for Image Dehazing. Journal of Imaging, 12(6), 260. https://doi.org/10.3390/jimaging12060260
