Previous Article in Journal
Optimization of Neural Network Models of Computer Vision for Biometric Identification on Edge IoT Devices
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Multi-Level Attribute-Guided-Based Adaptive Multi-Dilated Convolutional Network for Image Aesthetic Assessment

1
School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
2
School of Computing, Engineering and Mathematical Sciences, La Trobe University, Melbourne, VIC 3086, Australia
3
College of Science and Engineering, James Cook University, Cairns, QLD 4878, Australia
*
Author to whom correspondence should be addressed.
J. Imaging 2025, 11(12), 420; https://doi.org/10.3390/jimaging11120420
Submission received: 16 September 2025 / Revised: 14 November 2025 / Accepted: 19 November 2025 / Published: 21 November 2025
(This article belongs to the Section Computer Vision and Pattern Recognition)

Abstract

Image aesthetic assessment (IAA) is crucial for both scientific research and practical applications, and numerous studies have achieved promising performance. However, they still exhibit two major limitations: the neglect of hierarchical interactions between attribute features and aesthetic features, and the distortion of the original aspect ratio during image preprocessing, which leads to a loss of aesthetic information. To address these issues, we propose a Multi-level Attribute-Guided Adaptive Multi-Dilated Convolutional Network (MAADN), which leverages multi-level attribute features to guide aesthetic assessment and reduces the negative impact of image preprocessing through adaptive dilated convolution. Specifically, we employ a dual-branch architecture: one branch extracts multi-level attribute features, while the other learns aesthetic features under the guidance of these attributes. We further design an Attention-based Attribute-Guided Aesthetic Module (AGAM), which utilizes visual attention mechanisms to enhance the guidance of attributes. Additionally, we design an Adaptive Multi-Dilate Rate Convolution Module (AMDM) that generates weights adaptively through the network to fuse dilated convolution features with different dilation rates, rather than simply calculating weights based on aspect ratios. This approach effectively alleviates the negative effects of image preprocessing while maintaining training flexibility. Extensive experimental results demonstrate that the proposed model outperforms current state-of-the-art approaches. Furthermore, visual analysis confirms MAADN’s precise localization capability for aesthetically critical regions.
Keywords: image aesthetics assessment; aesthetic attributes; attention mechanism; dilated convolution image aesthetics assessment; aesthetic attributes; attention mechanism; dilated convolution

Share and Cite

MDPI and ACS Style

Li, S.; Xie, M.; Xiang, W. Multi-Level Attribute-Guided-Based Adaptive Multi-Dilated Convolutional Network for Image Aesthetic Assessment. J. Imaging 2025, 11, 420. https://doi.org/10.3390/jimaging11120420

AMA Style

Li S, Xie M, Xiang W. Multi-Level Attribute-Guided-Based Adaptive Multi-Dilated Convolutional Network for Image Aesthetic Assessment. Journal of Imaging. 2025; 11(12):420. https://doi.org/10.3390/jimaging11120420

Chicago/Turabian Style

Li, Sumei, Mingxuan Xie, and Wei Xiang. 2025. "Multi-Level Attribute-Guided-Based Adaptive Multi-Dilated Convolutional Network for Image Aesthetic Assessment" Journal of Imaging 11, no. 12: 420. https://doi.org/10.3390/jimaging11120420

APA Style

Li, S., Xie, M., & Xiang, W. (2025). Multi-Level Attribute-Guided-Based Adaptive Multi-Dilated Convolutional Network for Image Aesthetic Assessment. Journal of Imaging, 11(12), 420. https://doi.org/10.3390/jimaging11120420

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop