Submit to Special Issue Submit Abstract to Special Issue Review for Applied Sciences Propose a Special Issue

Journal Menu

Journal Browser

Deep Learning for Image Processing and Computer Vision

Special Issue Editors
Special Issue Information
Keywords
Benefits of Publishing in a Special Issue
Published Papers

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 30 January 2026 | Viewed by 6171

Share This Special Issue

Special Issue Editors

Dr. Honggang Chen

E-Mail Website
Guest Editor

College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China
Interests: image/video restoration; image/video understanding; image/video coding
Special Issues, Collections and Topics in MDPI journals

Dr. Yun Liu

E-Mail Website
Guest Editor

College of Artificial Intelligence, Southwest University, Chongqing 400715, China
Interests: image enhancement; image restoration; deep learning; artificial intelligence
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Image processing, computer vision, and deep learning are pivotal in driving technological innovation, enabling machines to interpret, enhance, and analyze visual data with unprecedented precision. These advancements not only support critical applications in fields such as autonomous vehicles, healthcare, and smart cities, but also significantly accelerate the development of artificial intelligence, empowering systems to perform complex visual tasks autonomously and intelligently across diverse real-world environments.

This Special Issue aims to explore recent advances in the fields of image processing, computer vision, and deep learning.

This Special Issue aims to gather researchers and experts, sharing their innovative approaches, methodologies, and findings in addressing the challenges and advancing the state of the art in the fields of image processing, computer vision, and deep learning.

Dr. Honggang Chen
Dr. Yun Liu
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

image enhancement and restoration
visual recognition
image quality assessment
medical data processing
deep learning models
intelligent transportation systems

Benefits of Publishing in a Special Issue

Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (5 papers)

Download All Papers

Order results

Result details

Show export options Show export options

Select all

Export citation of selected articles as:

Research

23 pages, 3703 KB

Open AccessArticle

Multi-Scale Context Fusion Method with Spatial Attention for Accurate Crop Disease Detection

by Yang Zhao, Liangchen Hu and Sen Xu

Appl. Sci. 2025, 15(17), 9341; https://doi.org/10.3390/app15179341 - 26 Aug 2025

Viewed by 882

Abstract

Crop diseases not only severely affect crop quality but also lead to significant reductions in yield. To address the challenge of accurate crop disease detection in complex environments, we propose a novel detection method based on the YOLO11 model, termed YOLO-MSCM. To enhance the extraction of small-object features, we introduce the MCSA module, which improves the model’s spatial feature perception. Additionally, a SimRepHMS module is designed to leverage local contextual information and strengthen multi-scale feature fusion. To improve the model’s adaptability and generalization capability, we employ DynamicConv, which dynamically adjusts channel weights based on input-specific patterns. For more accurate bounding box localization, we incorporate the WIoUv3 loss function to optimize box regression. Experimental results demonstrate that YOLO-MSCM achieves improvements of 6.1% in precision, 9.1% in recall, 6.2% in mAP@50, and 3.7% in mAP@50:95 compared to the baseline YOLO11n model. Comparative evaluations with several mainstream and state-of-the-art models further validate the superior detection performance of YOLO-MSCM, offering a reliable and effective solution for accurate crop disease detection in complex scenarios. Full article

(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)

► Show Figures

Figure 1

29 pages, 4725 KB

Open AccessArticle

Feature Fusion Using Deep Learning Algorithms in Image Classification for Security Purposes by Random Weight Network

by Mustafa Servet Kiran, Gokhan Seyfi, Merve Yilmaz, Engin Esme and Xizhao Wang

Appl. Sci. 2025, 15(16), 9053; https://doi.org/10.3390/app15169053 - 17 Aug 2025

Viewed by 819

Abstract

Automated threat detection in X-ray security imagery is a critical yet challenging task, where conventional deep learning models often struggle with low accuracy and overfitting. This study addresses these limitations by introducing a novel framework based on feature fusion. The proposed method extracts features from multiple and diverse deep learning architectures and classifies them using a Random Weight Network (RWN), whose hyperparameters are optimized for maximum performance. The results show substantial improvements at each stage: while the best standalone deep learning model achieved a test accuracy of 83.55%, applying the RWN to a single feature set increased accuracy to 94.82%. Notably, the proposed feature fusion framework achieved a state-of-the-art test accuracy of 97.44%. These findings demonstrate that a modular approach combining multi-model feature fusion with an efficient classifier is a highly effective strategy for improving the accuracy and generalization capability of automated threat detection systems. Full article

(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)

► Show Figures

Figure 1

19 pages, 2622 KB

Open AccessArticle

An Improved Lightweight Model for Defect Detection on Paths in Images

by Zhaoning Cui, Yuejia Xu, Xinyi Jin and Yu Li

Appl. Sci. 2025, 15(13), 7014; https://doi.org/10.3390/app15137014 - 21 Jun 2025

Viewed by 827

Abstract

To address the challenges of detecting multi-scale road defects and the lack of lightweight designs in conventional detection models, we propose ACD-YOLOv8, an enhanced model based on YOLOv8s. Our model enhances baseline architecture by integrating three key components: a lightweight Cross-Scale Feature Fusion Module (CCFM), an ADown sampling operation, and a Dynamic Head (DyHead). Experimental results on the RDD2022 dataset demonstrate the superiority of our approach. Compared to the baseline YOLOv8s, ACD-YOLOv8 achieves a 0.9% increase in mAP@0.5 and a 1.6% increase in the more stringent mAP@0.5:0.95 metric. Simultaneously, the model’s parameter count is reduced by 3.72 million (a 33.3% reduction) and its size is reduced by 7.4 MB. This work provides a practical and scalable solution for deploying high-accuracy defect detection on resource-constrained mobile platforms, offering significant potential to enhance traffic safety and maintenance efficiency. Full article

(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)

► Show Figures

Figure 1

26 pages, 15489 KB

Open AccessArticle

Weighted Feature Fusion Network Based on Multi-Level Supervision for Migratory Bird Counting in East Dongting Lake

by Haojie Zou, Hai Zhou, Guo Liu, Yingchun Kuang, Qiang Long and Haoyu Zhou

Appl. Sci. 2025, 15(5), 2317; https://doi.org/10.3390/app15052317 - 21 Feb 2025

Viewed by 846

Abstract

East Dongting Lake is an important habitat for migratory birds. Accurately counting the number of migratory birds is crucial to assessing the health of the wetland ecological environment. Traditional manual observation and low-precision methods make it difficult to meet this demand. To this end, this paper proposes a weighted feature fusion network based on multi-level supervision (MS-WFFNet) to count migratory birds. MS-WFFNet consists of three parts: an EEMA-VGG16 sub-network, a multi-source feature aggregation (MSFA) module, and a density map regression (DMR) module. Among them, the EEMA-VGG16 sub-network cross-injects enhanced efficient multi-scale attention (EEMA) into the truncated VGG16 structure. It uses multi-head attention to nonlinearly learn the relative importance of different positions in the same direction. With only a few parameters added, EEMA effectively suppresses the noise interference caused by a cluttered background. The MSFA module integrates a weighted mechanism to fully preserve low-level detail information and high-level semantic information. It achieves this by aggregating multi-source features and enhancing the expression of key features. The DMR module applies density map regression to the output of each path in the MSFA module. It ensures local consistency and spatial correlation among multiple regression results by using distributed supervision. In addition, this paper presents the migratory bird counting dataset DTH, collected using local monitoring equipment in East Dongting Lake. It is combined with other object counting datasets for extensive experiments, showcasing the proposed method’s excellent performance and generalization capability. Full article

(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)

► Show Figures

Figure 1

18 pages, 2656 KB

Open AccessArticle

Multimodal Natural Disaster Scene Recognition with Integrated Large Model and Mamba

by Yuxuan Shao and Liwen Xu

Appl. Sci. 2025, 15(3), 1149; https://doi.org/10.3390/app15031149 - 23 Jan 2025

Viewed by 1803

Abstract

The accurate identification of natural disasters is crucial in ensuring effective post-disaster relief efforts. However, the existing models for disaster classification often incur high costs. To address this, we propose leveraging the most advanced pre-trained large language models, which offer superior generative and multimodal understanding capabilities. Using a question-answering approach, we extract textual descriptions and category prediction probabilities for disaster scenarios, which are then used as input to our proposed Mamba Multimodal Disaster Recognition Network (Mamba-MDRNet). This model integrates a large pre-trained model with the Mamba mechanism, enabling the selection of the most reliable modality information as a robust basis for scene classification. Extensive experiments demonstrate consistent performance improvements across various visual models with heterogeneous architectures. Notably, integrating EfficientNet within Mamba-MDRNet yielded 97.82% accuracy for natural scene classification, surpassing the performance of the CNN (91.75%), ViT (94.50%), and ResNet18 (97.25%). These results highlight the potential of multimodal models combining large models and the Mamba mechanism for disaster type prediction. Full article

(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)

► Show Figures

Journal Menu

Journal Browser

Deep Learning for Image Processing and Computer Vision

Share This Special Issue

Special Issue Editors

Special Issue Information

Keywords

Benefits of Publishing in a Special Issue

Published Papers (5 papers)

Research

Further Information

Guidelines

MDPI Initiatives

Follow MDPI