applsci-logo

Journal Browser

Journal Browser

Deep Learning for Image Processing and Computer Vision

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 30 June 2025 | Viewed by 1693

Special Issue Editors


E-Mail Website
Guest Editor
College of Artificial Intelligence, Southwest University, Chongqing 400715, China
Interests: image enhancement; image restoration; deep learning; artificial intelligence
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Image processing, computer vision, and deep learning are pivotal in driving technological innovation, enabling machines to interpret, enhance, and analyze visual data with unprecedented precision. These advancements not only support critical applications in fields such as autonomous vehicles, healthcare, and smart cities, but also significantly accelerate the development of artificial intelligence, empowering systems to perform complex visual tasks autonomously and intelligently across diverse real-world environments.

This Special Issue aims to explore recent advances in the fields of image processing, computer vision, and deep learning.

This Special Issue aims to gather researchers and experts, sharing their innovative approaches, methodologies, and findings in addressing the challenges and advancing the state of the art in the fields of image processing, computer vision, and deep learning.

Dr. Honggang Chen
Dr. Yun Liu
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • image enhancement and restoration
  • visual recognition
  • image quality assessment
  • medical data processing
  • deep learning models
  • intelligent transportation systems

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (2 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

26 pages, 15489 KiB  
Article
Weighted Feature Fusion Network Based on Multi-Level Supervision for Migratory Bird Counting in East Dongting Lake
by Haojie Zou, Hai Zhou, Guo Liu, Yingchun Kuang, Qiang Long and Haoyu Zhou
Appl. Sci. 2025, 15(5), 2317; https://doi.org/10.3390/app15052317 - 21 Feb 2025
Viewed by 428
Abstract
East Dongting Lake is an important habitat for migratory birds. Accurately counting the number of migratory birds is crucial to assessing the health of the wetland ecological environment. Traditional manual observation and low-precision methods make it difficult to meet this demand. To this [...] Read more.
East Dongting Lake is an important habitat for migratory birds. Accurately counting the number of migratory birds is crucial to assessing the health of the wetland ecological environment. Traditional manual observation and low-precision methods make it difficult to meet this demand. To this end, this paper proposes a weighted feature fusion network based on multi-level supervision (MS-WFFNet) to count migratory birds. MS-WFFNet consists of three parts: an EEMA-VGG16 sub-network, a multi-source feature aggregation (MSFA) module, and a density map regression (DMR) module. Among them, the EEMA-VGG16 sub-network cross-injects enhanced efficient multi-scale attention (EEMA) into the truncated VGG16 structure. It uses multi-head attention to nonlinearly learn the relative importance of different positions in the same direction. With only a few parameters added, EEMA effectively suppresses the noise interference caused by a cluttered background. The MSFA module integrates a weighted mechanism to fully preserve low-level detail information and high-level semantic information. It achieves this by aggregating multi-source features and enhancing the expression of key features. The DMR module applies density map regression to the output of each path in the MSFA module. It ensures local consistency and spatial correlation among multiple regression results by using distributed supervision. In addition, this paper presents the migratory bird counting dataset DTH, collected using local monitoring equipment in East Dongting Lake. It is combined with other object counting datasets for extensive experiments, showcasing the proposed method’s excellent performance and generalization capability. Full article
(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)
Show Figures

Figure 1

18 pages, 2656 KiB  
Article
Multimodal Natural Disaster Scene Recognition with Integrated Large Model and Mamba
by Yuxuan Shao and Liwen Xu
Appl. Sci. 2025, 15(3), 1149; https://doi.org/10.3390/app15031149 - 23 Jan 2025
Viewed by 909
Abstract
The accurate identification of natural disasters is crucial in ensuring effective post-disaster relief efforts. However, the existing models for disaster classification often incur high costs. To address this, we propose leveraging the most advanced pre-trained large language models, which offer superior generative and [...] Read more.
The accurate identification of natural disasters is crucial in ensuring effective post-disaster relief efforts. However, the existing models for disaster classification often incur high costs. To address this, we propose leveraging the most advanced pre-trained large language models, which offer superior generative and multimodal understanding capabilities. Using a question-answering approach, we extract textual descriptions and category prediction probabilities for disaster scenarios, which are then used as input to our proposed Mamba Multimodal Disaster Recognition Network (Mamba-MDRNet). This model integrates a large pre-trained model with the Mamba mechanism, enabling the selection of the most reliable modality information as a robust basis for scene classification. Extensive experiments demonstrate consistent performance improvements across various visual models with heterogeneous architectures. Notably, integrating EfficientNet within Mamba-MDRNet yielded 97.82% accuracy for natural scene classification, surpassing the performance of the CNN (91.75%), ViT (94.50%), and ResNet18 (97.25%). These results highlight the potential of multimodal models combining large models and the Mamba mechanism for disaster type prediction. Full article
(This article belongs to the Special Issue Deep Learning for Image Processing and Computer Vision)
Show Figures

Figure 1

Back to TopTop