MAENet: A Multi-Scale Attention Efficient Network for Coherent Noise Suppression in Digital Holographic Microscopy
Abstract
1. Introduction
- (1)
- We propose a dual-branch encoding architecture composed of an enlarged scale efficient (ESE) encoder and a basic scale detail (BSD) encoder. This design aims to efficiently extract complementary multi-scale features without excessively increasing network depth. The ESE encoder uses depthwise separable convolutions with a 7 × 7 kernel to capture broad noise information and spatially correlated noise patterns at low computational cost. The BSD encoder employs standard 3 × 3 convolutions, focusing on preserving high-frequency details and fine image textures. This dual-path parallel design enables the model to achieve a robust balance between effective global noise modeling and high-fidelity local detail reconstruction.
- (2)
- We present a dual-branch dense attention fusion (DDAF) module for efficiently integrating complementary features from the dual-branch encoders. The core functionality of DDAF is collaboratively achieved by two branches: an adaptive fusion (AF) module that utilizes an attention mechanism to dynamically assign weights to features from each branch, thus generating an optimal combination. The fused features are then processed by a dense residual enhancement (DRE) module, which significantly improves the model’s denoising performance by promoting feature reuse and enhancing representational capability.
2. Related Work
2.1. Deep Neural Networks for Image Denoising
2.2. Attention Mechanism and Skip Connections
3. The Proposed Denoising Model
3.1. Architecture of MAENet
3.2. Dual-Branch Encoder
3.3. Dual-Branch Dense Attention Fusion (DDAF)
3.4. Loss Function
4. Experimental Results and Discussion
4.1. Dataset
4.2. Metrics
4.3. Experimental Setup
4.4. Ablation Research
4.5. Simulated Image Denoising
4.6. Real-World Image Denoising
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Correction Statement
Abbreviations
| AF | Adaptive Fusion |
| DDAF | Dual-Branch Dense Attention Fusion |
| BSD | Basic Scale Detail |
| ESE | Enlarged Scale Efficient |
| DRE | Dense Residual Enhancement |
| DHM | Digital Holographic Microscopy |
References
- Gupta, G. Algorithm for image processing using improved median filter and comparison of mean, median and improved median filter. Int. J. Soft Comput. Eng. (IJSCE) 2011, 1, 304–311. [Google Scholar]
- Pitas, I. Digital Image Processing Algorithms and Applications; John Wiley & Sons: Hoboken, NJ, USA, 2000. [Google Scholar]
- Chen, R.; Yu, W.; Wang, R.; Liu, G.; Shao, Y. Interferometric phase denoising by pyramid nonlocal means filter. IEEE Geosci. Remote Sens. Lett. 2013, 10, 826–830. [Google Scholar] [CrossRef]
- Dabov, K.; Foi, A.; Katkovnik, V.; Egiazarian, K. Image denoising with block-matching and 3D filtering. In Proceedings of the SPIE Conference, Image Processing: Algorithms and Systems, Neural Networks, and Machine Learning, San Jose, CA, USA, 15–19 January 2006; Volume 6064, pp. 354–365. [Google Scholar]
- Kemao, Q. Two-dimensional windowed fourier transform for fringe pattern analysis: Principles, applications and implementations. Opt. Lasers Eng. 2007, 45, 304–317. [Google Scholar] [CrossRef]
- Wu, J.; Tang, J.; Zhang, J.; Di, J. Coherent noise suppression in digital holographic microscopy based on label-free deep learning. Front. Phys. 2022, 10, 880403. [Google Scholar] [CrossRef]
- Yin, D.; Gu, Z.; Zhang, Y.; Gu, F.; Nie, S.; Feng, S.; Ma, J.; Yuan, C. Speckle noise reduction in coherent imaging based on deep learning without clean data. Opt. Lasers Eng. 2022, 133, 106151. [Google Scholar] [CrossRef]
- Yan, K.; Yu, Y.; Huang, C.; Sui, L.; Qian, K.; Asundi, A. Fringe pattern denoising based on deep learning. Opt. Commun. 2019, 437, 148–152. [Google Scholar] [CrossRef]
- Lin, B.; Fu, S.; Zhang, C.; Wang, F.; Li, Y. Optical fringe patterns filtering based on multi-stage convolution neural network. Opt. Lasers Eng. 2020, 126, 105853. [Google Scholar] [CrossRef]
- Montresor, S.; Tahon, M.; Laurent, A.; Picart, P. Computational denoising based on deep learning for phase data in digital holographic interferometry. APL Photonics 2020, 5, 030802. [Google Scholar] [CrossRef]
- Fang, Q.; Xia, H.-T.; Song, Q.; Zhang, M.; Guo, R.; Montresor, S.; Picart, P. Speckle denoising based on deep learning via a conditional generative adversarial network in digital holographic interferometry. Opt. Express 2022, 30, 20666–20683. [Google Scholar] [CrossRef]
- Tahon, M.; Montresor, S.; Picart, P. Towards reduced CNNs for denoising phase images corrupted with speckle noise. Photonics 2021, 8, 255. [Google Scholar] [CrossRef]
- Zhang, J.; Huang, L.; Chen, B.; Yan, L. Accurate extraction of the +1 term spectrum with spurious spectrum elimination in off-axis digital holography. Opt. Express 2022, 30, 28142–28157. [Google Scholar] [CrossRef]
- Lin, Z.; Jia, S.; Zhou, X.; Zhang, H.; Wang, L.; Li, G.; Wang, Z. Digital holographic microscopy phase noise reduction based on an over-complete chunked discrete cosine transform sparse dictionary. Opt. Lasers Eng. 2023, 166, 107571. [Google Scholar] [CrossRef]
- Fu, Z.; Li, J.; Hua, Z. DEAU-Net: Attention Networks Based on Dual Encoder for Medical Image Segmentation. Comput. Biol. Med. 2022, 150, 106197. [Google Scholar] [CrossRef] [PubMed]
- Liang, B.; Tang, C.; Zhang, W.; Xu, M.; Wu, T. N-Net: An UNet Architecture with Dual Encoder for Medical Image Segmentation. Signal Image Video Process. 2023, 17, 3073–3081. [Google Scholar] [CrossRef] [PubMed]
- Li, H.; Zhai, D.-H.; Xia, Y. ERDUnet: An Efficient Residual Double-Coding Unet for Medical Image Segmentation. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 2083–2096. [Google Scholar] [CrossRef]
- Yao, Z.; Bi, J.; Deng, W.; He, W.; Wang, Z.; Kuang, X.; Zhou, M.; Gao, Q.; Tong, T. DEUNet: Dual-Encoder UNet for Simultaneous Denoising and Reconstruction of Single HDR Image. Comput. Graph. 2024, 119, 103882. [Google Scholar] [CrossRef]
- Zhang, K.; Zuo, W.; Chen, Y.; Meng, D.; Zhang, L. Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Trans. Image Process 2017, 26, 3142–3155. [Google Scholar] [CrossRef]
- Gu, S.; Xie, Q.; Meng, D.; Zuo, W.; Feng, X.; Zhang, L. Weighted nuclear norm minimization and its applications to low level vision. Int. J. Comput. Vis. 2017, 121, 183–208. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Yadav, M.; Tiwari, N. Performance comparison of image restoration techniques using CNN and their applications. In Proceedings of the 2021 5th International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 8–10 April 2021; pp. 1146–1151. [Google Scholar]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer: Berlin/Heidelberg, Germany, 2015; pp. 234–241. [Google Scholar]
- Yao, Z.; Hao, L.; Qin, L.; Chen, J.; Li, W. Efficient seismic data denoising via multi-scale attention network with depthwise separable and residual dilated convolutions. Sci. Rep. 2025, 16, 2818. [Google Scholar] [CrossRef]
- Li, J.; Lu, G.; Zhang, B.; You, J.; Zhang, D. Shared linear encoder-based multikernel Gaussian process latent variable model for visual classification. IEEE Trans. Cybern. 2021, 51, 534–547. [Google Scholar] [CrossRef]
- Li, Y.; Chen, X.; Zhu, Z.; Xie, L.; Huang, G.; Du, D.; Wang, X. Attention-guided unified network for panoptic segmentation. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 7026–7035. [Google Scholar]
- Du, B.; Wei, Q.; Liu, R. An improved quantum-behaved particle swarm optimization for endmember extraction. IEEE Trans. Geosci. Remote Sens. 2019, 57, 6003–6017. [Google Scholar] [CrossRef]
- Woo, S.; Park, J.; Lee, J.; Kweon, I.S. CBAM: Convolutional block attention module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar]
- Zhang, Y.; Li, K.; Li, K.; Wang, L.; Zhong, B.; Fu, Y. Image super-resolution using very deep residual channel attention networks. In Proceedings of the Computer Vision—ECCV 2018: 15th European Conference, Munich, Germany, 8–14 September 2018; pp. 294–310. [Google Scholar]
- Dai, T.; Cai, J.; Zhang, Y.; Xia, S.-T.; Zhang, L. Second-order attention network for single image super-resolution. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 11065–11074. [Google Scholar]
- Lyn, J.; Yan, S. Non-local second-order attention network for single image super resolution. Mach. Learn. Knowl. Extr. 2020, 12279, 267–279. [Google Scholar]
- Tian, C.; Xu, Y.; Li, Z.; Zuo, W.; Fei, L.; Liu, H. Attention-guided CNN for image denoising. Neural Netw. 2020, 124, 117–129. [Google Scholar] [CrossRef] [PubMed]
- Anwar, S.; Barnes, N. Real image denoising with feature attention. In Proceedings of the IEEE International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 3155–3164. [Google Scholar]
- Li, G.; Fang, Q.; Zha, L.; Gao, X.; Zheng, N. Ham: Hybrid attention module in deep convolutional neural networks for image classification. Pattern Recognit. 2022, 129, 108785. [Google Scholar] [CrossRef]
- Peng, Y.; Zhang, L.; Liu, S.; Wu, X.; Zhang, Y.; Wang, X. Dilated residual networks with symmetric skip connection for image denoising. Neurocomputing 2019, 345, 67–76. [Google Scholar] [CrossRef]
- Zhang, Y.; Tian, Y.; Kong, Y.; Zhong, B.; Fu, Y. Residual dense network for image super-resolution. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 2472–2481. [Google Scholar]
- Song, Y.; Zhu, Y.; Du, X. Dynamic residual dense network for image denoising. Sensors 2019, 19, 3809. [Google Scholar] [CrossRef]
- Choi, Y.; Park, S.; Kim, S. Development of Point Cloud Data-Denoising Technology for Earthwork Sites Using Encoder-Decoder Network. KSCE J. Civ. Eng. 2022, 26, 4380–4389. [Google Scholar] [CrossRef]
- Wang, Q.; Wu, B.; Zhu, P.; Li, P.; Zuo, W.; Hu, Q. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 14–19 June 2020. [Google Scholar]
- Tang, J.; Chen, B.; Yan, L.; Huang, L. Continuous Phase Denoising via Deep Learning Based on Perlin Noise Similarity in Digital Holographic Microscopy. IEEE Trans. Ind. Inf. 2024, 20, 8707–8716. [Google Scholar] [CrossRef]
- Montrésor, S.; Picart, P. On the assessment of de-noising algorithms in digital holographic interferometry and related approaches. Appl. Phys. B 2022, 128, 59. [Google Scholar] [CrossRef]
- Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef]
- Reyes-Figueroa, A.; Flores, V.H.; Rivera, M. Deep neural network for fringe pattern filtering and normalization. Appl. Opt. 2021, 60, 2022–2036. [Google Scholar] [CrossRef]
- Zhang, K.; Zuo, W. FFDNet: Toward a fast and flexible solution for CNN-based image denoising. IEEE Trans. Image Process. 2018, 27, 4608–4622. [Google Scholar] [CrossRef]











| Parameter | Symbol | Range | Unit |
|---|---|---|---|
| Medium thickness | [0.008, 0.0016] | m | |
| Propagation distance | [0.09, 0.095] | m | |
| Angle | [0.06, 0.08] | rad |
| Models | Dual-Encoder | DDAF | Performance | |||
|---|---|---|---|---|---|---|
| ESE Encoder | BSD Encoder | AF Module | DRE Module | PSNR | SSIM | |
| MAENetv0 | √ | 31.58 | 0.90880 | |||
| MAENetv1 | √ | 32.83 | 0.92628 | |||
| MAENetv2 | √ | √ | 32.91 | 0.92683 | ||
| MAENetv3 | √ | √ | √ | 33.11 | 0.92866 | |
| MAENet | √ | √ | √ | √ | 33.25 | 0.93042 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhu, Y.; Yu, J.; Zhang, Z.; Kong, M.; Feng, Y.; Hou, F.; Tang, Z.; Liu, W. MAENet: A Multi-Scale Attention Efficient Network for Coherent Noise Suppression in Digital Holographic Microscopy. Photonics 2026, 13, 303. https://doi.org/10.3390/photonics13030303
Zhu Y, Yu J, Zhang Z, Kong M, Feng Y, Hou F, Tang Z, Liu W. MAENet: A Multi-Scale Attention Efficient Network for Coherent Noise Suppression in Digital Holographic Microscopy. Photonics. 2026; 13(3):303. https://doi.org/10.3390/photonics13030303
Chicago/Turabian StyleZhu, Yifan, Jing Yu, Zihao Zhang, Ming Kong, Yushuo Feng, Feixue Hou, Zihan Tang, and Wei Liu. 2026. "MAENet: A Multi-Scale Attention Efficient Network for Coherent Noise Suppression in Digital Holographic Microscopy" Photonics 13, no. 3: 303. https://doi.org/10.3390/photonics13030303
APA StyleZhu, Y., Yu, J., Zhang, Z., Kong, M., Feng, Y., Hou, F., Tang, Z., & Liu, W. (2026). MAENet: A Multi-Scale Attention Efficient Network for Coherent Noise Suppression in Digital Holographic Microscopy. Photonics, 13(3), 303. https://doi.org/10.3390/photonics13030303

