Next Article in Journal
Recent Advances in Sliding Mode Control Techniques for Permanent Magnet Synchronous Motor Drives
Previous Article in Journal
A Hybrid Neural Network for Efficient Rectilinear Steiner Minimum Tree Construction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Skeleton-Guided Diffusion for Font Generation

1
School of Artificial Intelligence and Computer Science, Shaanxi Normal University, Xi’an 710119, China
2
Key Laboratory of Intelligent Computing and Service Technology for Folk Song, Ministry of Culture and Tourism, Xi’an 710119, China
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(19), 3932; https://doi.org/10.3390/electronics14193932
Submission received: 10 September 2025 / Revised: 24 September 2025 / Accepted: 28 September 2025 / Published: 3 October 2025

Abstract

Generating non-standard fonts, such as running script (e.g., XingShu), poses significant challenges due to their high stroke continuity, structural flexibility, and stylistic diversity, which traditional component-based prior knowledge methods struggle to model effectively. While diffusion models excel at capturing continuous feature spaces and stroke variations through iterative denoising, they face critical limitations: (1) style leakage, where large stylistic differences lead to inconsistent outputs due to noise interference; (2) structural distortion, caused by the absence of explicit structural guidance, resulting in broken strokes or deformed glyphs; and (3) style confusion, where similar font styles are inadequately distinguished, producing ambiguous results. To address these issues, we propose a novel skeleton-guided diffusion model with three key innovations: (1) a skeleton-constrained style rendering module that enforces semantic alignment and balanced energy constraints to amplify critical skeletal features, mitigating style leakage and ensuring stylistic consistency; (2) a cross-scale skeleton preservation module that integrates multi-scale glyph skeleton information through cross-dimensional interactions, effectively modeling macro-level layouts and micro-level stroke details to prevent structural distortions; (3) a contrastive style refinement module that leverages skeleton decomposition and recombination strategies, coupled with contrastive learning on positive and negative samples, to establish robust style representations and disambiguate similar styles. Extensive experiments on diverse font datasets demonstrate that our approach significantly improves the generation quality, achieving superior style fidelity, structural integrity, and style differentiation compared to state-of-the-art diffusion-based font generation methods.
Keywords: skeleton guide; font generation; style confusion; structural distortion; contrastive learning; style fidelity skeleton guide; font generation; style confusion; structural distortion; contrastive learning; style fidelity

Share and Cite

MDPI and ACS Style

Zhao, L.; Dong, S.; Liu, J.; Zhang, X.; Gao, X.; Wu, X. Skeleton-Guided Diffusion for Font Generation. Electronics 2025, 14, 3932. https://doi.org/10.3390/electronics14193932

AMA Style

Zhao L, Dong S, Liu J, Zhang X, Gao X, Wu X. Skeleton-Guided Diffusion for Font Generation. Electronics. 2025; 14(19):3932. https://doi.org/10.3390/electronics14193932

Chicago/Turabian Style

Zhao, Li, Shan Dong, Jiayi Liu, Xijin Zhang, Xiaojiao Gao, and Xiaojun Wu. 2025. "Skeleton-Guided Diffusion for Font Generation" Electronics 14, no. 19: 3932. https://doi.org/10.3390/electronics14193932

APA Style

Zhao, L., Dong, S., Liu, J., Zhang, X., Gao, X., & Wu, X. (2025). Skeleton-Guided Diffusion for Font Generation. Electronics, 14(19), 3932. https://doi.org/10.3390/electronics14193932

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop