Generative Adversarial Optical Networks Using Diffractive Layers for Digit and Action Generation
Abstract
1. Introduction
2. Methods
2.1. Forward Propagation Model of D2NN
2.2. Gaussian Beam Parameters
2.3. Discriminator Network Parameters
2.4. Network Training Parameters
2.5. FID Calculation Method
2.5.1. Feature Extraction with Inception-v3
2.5.2. Statistical Computation
2.5.3. Fréchet Distance Calculation
3. Results
3.1. GAN Principle Based on D2NN
3.2. D2NN-GAN for the MNIST Dataset
3.3. D2NN-GAN for the KTH Dataset
4. Experimental Error Discussion
4.1. Quantitative Tolerance to Inter-Layer Translational Misalignment
4.2. Temperature/Mechanical Variations
4.3. Manufacturing Precision
4.4. Real-Time Optimization
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Lake Tahoe, NV, USA, 3–6 December 2012; Volume 25, pp. 1097–1105. [Google Scholar]
- Silver, D.; Huang, A.; Maddison, C.J.; Guez, A.; Sifre, L.; van den Driessche, G.; Schrittwieser, J.; Antonoglou, I.; Panneershelvam, V.; Lanctot, M.; et al. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484–489. [Google Scholar] [CrossRef]
- Strubell, E.; Ganesh, A.; McCallum, A. Energy and policy considerations for deep learning in NLP. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 28 July–2 August 2019; pp. 3645–3650. [Google Scholar]
- Thompson, N.C.; Greenewald, K.; Lee, K.; Manso, G.F. The computational limits of deep learning. arXiv 2020, arXiv:2007.05558. [Google Scholar]
- Patterson, D.; Gonzalez, J.; Le, Q.V.; Liang, C.; Munguia, L.-M.; Rothchild, D.; So, D.; Texier, M.; Dean, J. Carbon emissions and large neural network training. arXiv 2021, arXiv:2104.10350. [Google Scholar] [CrossRef]
- Schwartz, R.; Dodge, J.; Smith, N.A.; Etzioni, O. Green AI. Green AI. Commun. ACM 2020, 63, 54–63. [Google Scholar]
- Shastri, B.J.; Tait, A.N.; Ferreira de Lima, T.; Pernice, W.H.P.; Bhaskaran, H.; Wright, C.D. Photonics for artificial intelligence and neuromorphic computing. Nat. Photonics 2021, 15, 102–114. [Google Scholar]
- Wetzstein, G.; Ozcan, A.; Gigan, S.; Fan, S.; Englund, D.; Soljačić, M.; Denz, C.; Miller, D.A.B.; Psaltis, D. Inference in artificial intelligence with deep optics and photonics. Nature 2020, 588, 39–47. [Google Scholar] [CrossRef]
- Kitayama, K.; Notomi, M.; Naruse, M.; Inoue, K.; Kawakami, S.; Uchida, A. Novel frontier of photonics for data processing—Photonic accelerator. APL Photonics 2019, 4, 090901. [Google Scholar] [CrossRef]
- Wang, Y.; Liao, K.; Zhang, K.; Du, Z.; Wang, Z.; Ni, B.; Xu, T.; Feng, S.; Yang, Y.; Yang, Q.-F.; et al. Reconfigurable versatile integrated photonic computing chip. eLight 2025, 5, 20. [Google Scholar] [CrossRef]
- Li, D.; Zhang, K.; Hu, X.; Feng, S. Integrated convolutional kernel based on two-dimensional photonic crystals. Opt. Lett. 2024, 49, 6297–6300. [Google Scholar] [CrossRef]
- Tian, X.; Li, R.; Peng, T.; Xue, Y.; Min, J.; Li, X.; Bai, C.; Yao, B. Multi-prior physics-enhanced neural network enables pixel super-resolution and twin-image-free phase retrieval from single-shot hologram. Opto-Electron. Adv. 2024, 7, 240060. [Google Scholar] [CrossRef]
- Gigan, S. Data-driven polarimetric approaches fuel computational imaging expansion. Opto-Electron. Adv. 2024, 7, 240158. [Google Scholar] [CrossRef]
- Miller, D.A.B. Attojoule optoelectronics for low-energy information processing and communications. J. Light. Technol. 2017, 35, 346–396. [Google Scholar] [CrossRef]
- Prucnal, P.R.; Shastri, B.J. Neuromorphic Photonics; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
- Feldmann, J.; Youngblood, N.; Wright, C.D.; Bhaskaran, H.; Pernice, W.H.P. All-optical spiking neurosynaptic networks with self-learning capabilities. Nature 2019, 569, 208–214. [Google Scholar] [CrossRef] [PubMed]
- Larger, L.; Baylón-Fuentes, A.; Martinenghi, R.; Udaltsov, V.S.; Chembo, Y.K.; Jacquot, M. High-speed photonic reservoir computing using a time-delay-based architecture: Million words per second classification. Phys. Rev. X 2017, 7, 011015. [Google Scholar] [CrossRef]
- Vandoorne, K.; Mechet, P.; Van Vaerenbergh, T.; Fiers, M.; Morthier, G.; Verstraeten, D.; Schrauwen, B.; Dambre, J.; Bienstman, P. Experimental demonstration of reservoir computing on a silicon photonics chip. Nat. Commun. 2014, 5, 3541. [Google Scholar] [CrossRef]
- Lin, X.; Rivenson, Y.; Yardimci, N.T.; Veli, M.; Luo, Y.; Jarrahi, M.; Ozcan, A. All-optical machine learning using diffractive deep neural networks. Science 2018, 361, 1004–1008. [Google Scholar] [CrossRef]
- Zhang, K.; Liao, K.; Cheng, H.; Feng, S.; Hu, X. Advanced all-optical classification using orbital-angular-momentum-encoded diffractive networks. Adv. Photonics Nexus 2023, 2, 066006. [Google Scholar] [CrossRef]
- Lin, X.; Fu, Y.; Zhang, K.; Huang, H.; Xu, F.; Fan, J.; Lin, X.; Dai, Q.; Hu, X. Polarization and wavelength routers based on diffractive neural network. Front. Optoelectron. 2024, 17, 22. [Google Scholar] [CrossRef]
- Lin, X.; Zhang, K.; Liao, K.; Huang, H.; Fu, Y.; Zhang, X.; Feng, S.; Hu, X. Polarization-based all-optical logic gates using diffractive neural networks. J. Opt. 2024, 26, 035701. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhang, K.; Hu, P.; Li, D.; Feng, S. Multi-wavelength diffractive optical neural network integrated with 2D photonic crystals for joint optical classification. Nanophotonics 2025, 14, 2891–2899. [Google Scholar] [CrossRef] [PubMed]
- Luo, Y.; Mengu, D.; Yardimci, N.T.; Rivenson, Y.; Veli, M.; Jarrahi, M.; Ozcan, A. Design of task-specific optical systems using broadband diffractive neural networks. Light Sci. Appl. 2019, 8, 112. [Google Scholar] [CrossRef]
- Mengu, D.; Luo, Y.; Rivenson, Y.; Ozcan, A. Analysis of diffractive optical neural networks and their integration with electronic neural networks. IEEE J. Sel. Top. Quantum Electron. 2020, 26, 1–14. [Google Scholar] [CrossRef]
- Rahman, M.S.S.; Ozcan, A. Computer-free, all-optical reconstruction of holograms using diffractive networks. ACS Photonics 2021, 8, 3375–3384. [Google Scholar] [CrossRef]
- He, C.; Zhao, D.; Fan, F.; Zhou, H.; Li, X.; Li, Y.; Li, J.; Dong, F.; Miao, Y.-X.; Wang, Y.; et al. Pluggable multitask diffractive neural networks based on cascaded metasurfaces. Opto-Electron. Adv. 2024, 7, 230005. [Google Scholar] [CrossRef]
- Yan, T.; Wu, J.; Zhou, T.; Xie, H.; Xu, F.; Fan, J.; Fang, L.; Lin, X.; Dai, Q. Fourier-space diffractive deep neural network. Phys. Rev. Lett. 2019, 123, 023901. [Google Scholar] [CrossRef] [PubMed]
- Chen, H.; Feng, J.; Jiang, M.; Wang, Y.; Lin, J.; Tan, J.; Jin, P. Diffractive deep neural networks at visible wavelengths. Engineering 2021, 7, 1483–1491. [Google Scholar] [CrossRef]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Montreal, QC, Canada, 8–11 December 2014; Volume 27, pp. 2672–2680. [Google Scholar]
- Creswell, A.; White, T.; Dumoulin, V.; Arulkumaran, K.; Sengupta, B.; Bharath, A.A. Generative adversarial networks: An overview. IEEE Signal Process. Mag. 2018, 35, 53–65. [Google Scholar] [CrossRef]
- Karras, T.; Laine, S.; Aila, T. A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4401–4410. [Google Scholar]
- Karras, T.; Aittala, M.; Laine, S.; Härkönen, E.; Hellsten, J.; Lehtinen, J.; Aila, T. Alias-free generative adversarial networks. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Online, 6–14 December 2021; Volume 34, pp. 852–863. [Google Scholar]
- Brock, A.; Donahue, J.; Simonyan, K. Large scale GAN training for high fidelity natural image synthesis. In Proceedings of the International Conference on Learning Representations (ICLR), New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Méndez-Lucio, O.; Baillif, B.; Clevert, D.-A.; Rouquié, D.; Wichard, J. De novo generation of hit-like molecules from gene expression signatures using artificial intelligence. Nat. Commun. 2020, 11, 10. [Google Scholar] [CrossRef]
- Dan, Y.; Zhao, Y.; Li, X.; Li, S.; Hu, M.; Hu, J. Generative adversarial networks (GAN) based efficient sampling of chemical composition space for inverse design of inorganic materials. npj Comput. Mater. 2020, 6, 84. [Google Scholar] [CrossRef]
- Arjovsky, M.; Bottou, L. Towards principled methods for training generative adversarial networks. In Proceedings of the International Conference on Learning Representations (ICLR), Toulon, France, 24–26 April 2017. [Google Scholar]
- Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein generative adversarial networks. In Proceedings of the International Conference on Machine Learning (ICML), Sydney, Australia, 6–11 August 2017; pp. 214–223. [Google Scholar]
- Chen, S.; Li, Y.; Wang, Y.; Chen, H.; Ozcan, A. Optical generative models. Nature 2025, 644, 903–911. [Google Scholar] [CrossRef] [PubMed]
- Zhan, Z.; Wang, H.; Liu, Q.; Fu, X. Photonic diffractive generators through sampling noises from scattering media. Nat. Commun. 2024, 15, 10643. [Google Scholar] [CrossRef] [PubMed]
- Qiu, J.; Lu, G.; Liu, T.; Zhang, D.; Xiao, S.; Yu, T. Optoelectronic generative adversarial networks. Commun. Phys. 2025, 8, 162. [Google Scholar] [CrossRef]
- Dong, Y.; Bai, Y.; Zhang, Q.; Luan, H.; Gu, M. High-throughput optical neuromorphic graphic processing at millions of images per second. eLight 2025, 5, 29. [Google Scholar] [CrossRef]
- Goodman, J.W. Introduction to Fourier Optics; Roberts and Company Publishers: Greenwood Village, CO, USA, 2005. [Google Scholar]
- Allen, L.; Beijersbergen, M.W.; Spreeuw, R.J.C.; Woerdman, J.P. Orbital angular momentum of light and the transformation of Laguerre–Gaussian laser modes. Phys. Rev. A 1992, 45, 8185–8189. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Schuldt, C.; Laptev, I.; Caputo, B. Recognizing human actions: A local SVM approach. In Proceedings of the 17th International Conference on Pattern Recognition, Cambridge, UK, 23–26 August 2004; Volume 3, pp. 32–36. [Google Scholar]
- Dong, Y.; Lin, D.; Chen, L.; Li, B.; Chen, X.; Zhang, Q.; Luan, H.; Fang, X.; Gu, M. Compact eternal diffractive neural network chip for extreme environments. Commun. Eng. 2024, 3, 64. [Google Scholar] [CrossRef]
- Cheng, J.; Huang, C.; Zhang, J.; Wu, B.; Zhang, W.; Liu, X.; Zhang, J.; Tang, Y.; Zhou, H.; Zhang, Q.; et al. Multimodal deep learning using on-chip diffractive optics with in situ training capability. Nat. Commun. 2024, 15, 6189. [Google Scholar] [CrossRef] [PubMed]
- Li, Y.; Chen, S.; Gong, T.; Ozcan, A. Model-free optical processors using in situ reinforcement learning with proximal policy optimization. Light Sci. Appl. 2026, 15, 32. [Google Scholar] [CrossRef] [PubMed]





| Misaligned Layers | Shift | Axis | Final FID | SSIM (Mean ± Std) |
|---|---|---|---|---|
| 1 | 5% (10 px) | Horizontal | 250.79 | 0.9530 ± 0.0192 |
| 5% (10 px) | Vertical | 255.58 | 0.9530 ± 0.0163 | |
| 10% (20 px) | Horizontal | 245.38 | 0.9480 ± 0.0279 | |
| 10% (20 px) | Vertical | 181.91 | 0.9505 ± 0.0208 | |
| 2 | 5% (10 px) | Horizontal | 143.66 | 0.9523 ± 0.0239 |
| 5% (10 px) | Vertical | 157.96 | 0.9540 ± 0.0150 | |
| 10% (20 px) | Horizontal | 242.57 | 0.9504 ± 0.0212 | |
| 10% (20 px) | Vertical | 233.92 | 0.9530 ± 0.0201 | |
| 3 | 5% (10 px) | Horizontal | 167.73 | 0.9551 ± 0.0179 |
| 5% (10 px) | Vertical | 157.89 | 0.9542 ± 0.0174 | |
| 10% (20 px) | Horizontal | 215.57 | 0.9438 ± 0.0262 | |
| 10% (20 px) | Vertical | 253.72 | 0.9355 ± 0.0299 | |
| 4 | 5% (10 px) | Horizontal | 221.43 | 0.9449 ± 0.0217 |
| 5% (10 px) | Vertical | 241.58 | 0.9511 ± 0.0223 | |
| 10% (20 px) | Horizontal | 72.96 | 0.9739 ± 0.0077 | |
| 10% (20 px) | Vertical | 73.97 | 0.9733 ± 0.0119 | |
| 5 | 5% (10 px) | Horizontal | 202.62 | 0.9325 ± 0.0235 |
| 5% (10 px) | Vertical | 233.88 | 0.9354 ± 0.0304 | |
| 10% (20 px) | Horizontal | 108.33 | 0.9586 ± 0.0116 | |
| 10% (20 px) | Vertical | 154.31 | 0.8963 ± 0.0514 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Hu, P.; Cui, T.; Zhang, Y.; Feng, S. Generative Adversarial Optical Networks Using Diffractive Layers for Digit and Action Generation. Photonics 2026, 13, 94. https://doi.org/10.3390/photonics13010094
Hu P, Cui T, Zhang Y, Feng S. Generative Adversarial Optical Networks Using Diffractive Layers for Digit and Action Generation. Photonics. 2026; 13(1):94. https://doi.org/10.3390/photonics13010094
Chicago/Turabian StyleHu, Pei, Tengyu Cui, Yuanyuan Zhang, and Shuai Feng. 2026. "Generative Adversarial Optical Networks Using Diffractive Layers for Digit and Action Generation" Photonics 13, no. 1: 94. https://doi.org/10.3390/photonics13010094
APA StyleHu, P., Cui, T., Zhang, Y., & Feng, S. (2026). Generative Adversarial Optical Networks Using Diffractive Layers for Digit and Action Generation. Photonics, 13(1), 94. https://doi.org/10.3390/photonics13010094

