Image Analysis and Processing

A special issue of Technologies (ISSN 2227-7080). This special issue belongs to the section "Information and Communication Technologies".

Deadline for manuscript submissions: closed (31 December 2025)

Special Issue Editors

Guest Editor
School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Interests: artificial intelligence and the Internet of Things

Guest Editor
School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Interests: robot theory and algorithms; multimodal perception and learning; Lie groups; Lie algebra

Guest Editor
School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Interests: computer vision; image analysis and processing; deep learning

Guest Editor
School of Electrical and Information Engineering, Wuhan Institute of Technology, Wuhan 430205, China
Interests: computer vision; image analysis and processing; deep learning

Special Issue Information

Dear Colleagues,

Image analysis and processing is an important branch of computer vision and artificial intelligence. It focuses on using computational techniques to process and analyze images efficiently, extracting useful information to meet the needs of specific applications, and encompasses image preprocessing, feature extraction, image segmentation, target recognition, and image reconstruction, among other areas. Drawing on methods from mathematics, physics, and statistics, image analysis and processing plays an important role in medical diagnosis, remote sensing, security surveillance, industrial automation, digital entertainment, and other fields, bringing convenience to everyday life while advancing related disciplines.

This Special Issue focuses on recent developments in computer vision, artificial intelligence, image preprocessing, feature extraction, image segmentation, target recognition, and image reconstruction.

Potential topics of this Special Issue include but are not limited to the following:

  • Multimodal image denoising and enhancement;
  • Multimodal image fusion;
  • Image classification and semantic segmentation;
  • Object detection and segmentation;
  • Robot dynamics and control;
  • Human–robot interaction;
  • Robot learning;
  • AI for teaching.

Dr. Xi Li
Dr. Zhongtao Fu
Dr. Yu Shi
Dr. Zhenghua Huang
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once registered, go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the Special Issue website. Research articles, review articles, and short communications are invited. For planned papers, a title and short abstract (about 250 words) can be sent to the Editorial Office for assessment.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Technologies is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 1800 CHF (Swiss Francs). Submitted papers should be well formatted and written in good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • computer vision
  • artificial intelligence
  • image preprocessing
  • feature extraction
  • image segmentation
  • target recognition
  • image reconstruction

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found on the MDPI website.

Published Papers (14 papers)


Research


25 pages, 233246 KB  
Article
Seamlessly Natural: Image Stitching with Natural Appearance Preservation
by Gaetane Lorna N. Tchana, Damaris Belle M. Fotso, Antonio Hendricks and Christophe Bobda
Technologies 2026, 14(3), 186; https://doi.org/10.3390/technologies14030186 - 19 Mar 2026
Abstract
Conventional image stitching pipelines predominantly rely on homographic alignment, whose planar assumption often breaks down in dual-camera configurations capturing non-planar scenes, producing geometric warping, bulging, and structural distortion. To address these limitations, this paper presents SENA (Seamlessly Natural), a geometry-driven image stitching approach with three complementary contributions. First, we propose a hierarchical affine-based warping strategy that combines global affine initialization, local affine refinement, and a smooth free-form deformation field regulated by seamguard adaptive smoothing. This multi-scale design preserves local shape, parallelism, and aspect ratios, thereby reducing the hallucinated distortions commonly associated with homography-based models. Second, SENA incorporates a geometry-driven adequate zone detection mechanism that identifies regions with reduced parallax directly from the disparity consistency of correspondences filtered by RANSAC, without relying on semantic segmentation or depth estimation. Third, within this zone, anchor-based seamline cutting and segmentation enforce one-to-one geometric correspondence between image pairs, reducing ghosting and smearing artifacts. Extensive experiments demonstrate that SENA achieves 26.2 dB PSNR and 0.84 SSIM, obtains the lowest BRISQUE score (33.4) among compared methods, and reduces runtime by 79% on average across resolutions. These results confirm improved structural fidelity and computational efficiency while maintaining competitive alignment accuracy.
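
The global affine initialization that anchors SENA's warping hierarchy can be illustrated with standard tools. The sketch below is an assumption-laden illustration rather than the authors' implementation: it estimates a 6-DoF affine transform from ORB correspondences filtered by RANSAC, mirroring how an affine model preserves parallelism and aspect ratios where a full homography would not. Feature type, match strategy, and thresholds are illustrative choices.

```python
# Illustrative sketch (not the authors' code): global affine initialization
# from RANSAC-filtered feature correspondences, analogous to SENA's first stage.
import cv2
import numpy as np

def global_affine(img_ref, img_mov):
    """Estimate a global affine warp aligning img_mov to img_ref."""
    orb = cv2.ORB_create(4000)
    k1, d1 = orb.detectAndCompute(img_ref, None)
    k2, d2 = orb.detectAndCompute(img_mov, None)
    matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches = sorted(matcher.match(d2, d1), key=lambda m: m.distance)
    src = np.float32([k2[m.queryIdx].pt for m in matches])
    dst = np.float32([k1[m.trainIdx].pt for m in matches])
    # RANSAC rejects parallax-heavy outliers; an affine model (6 DoF) preserves
    # parallelism and aspect ratios, unlike a full homography (8 DoF).
    A, inliers = cv2.estimateAffine2D(src, dst, method=cv2.RANSAC,
                                      ransacReprojThreshold=3.0)
    h, w = img_ref.shape[:2]
    return cv2.warpAffine(img_mov, A, (w, h)), inliers
```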

28 pages, 11222 KB  
Article
Robustness Enhancement of Self-Localization for Drone-View Mixed Reality via Adaptive RGB-Thermal Integration
by Ryuto Fukuda and Tomohiro Fukuda
Technologies 2026, 14(1), 74; https://doi.org/10.3390/technologies14010074 - 22 Jan 2026
Abstract
Drone-view mixed reality (MR) in the Architecture, Engineering, and Construction (AEC) sector faces significant self-localization challenges in low-texture environments, such as bare concrete sites. This study proposes an adaptive sensor fusion framework integrating thermal and visible light (RGB) imagery to enhance tracking robustness for diverse site applications. We introduce the Effective Inlier Count (Neff) as a lightweight gating mechanism to evaluate the spatial quality of feature points and dynamically weight sensor modalities in real time. By employing a 20×16 grid-based spatial filtering algorithm, the system effectively suppresses the influence of geometric burstiness without significant computational overhead on server-side processing. Validation experiments across various real-world scenarios demonstrate that the proposed method maintains high geometric registration accuracy where traditional RGB-only methods fail. In texture-less and specular conditions, the system consistently maintained an average Intersection over Union (IoU) above 0.72, while the baseline suffered from complete tracking loss or significant drift. These results confirm that thermal–RGB integration ensures operational availability and improves long-term stability by mitigating modality-specific noise. This approach offers a reliable solution for various drone-based AEC tasks, particularly in GPS-denied or adverse environments.
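
The Effective Inlier Count can be read as an occupancy measure over the 20×16 grid: bursts of matches concentrated in one region should not inflate the tracking-quality score. The sketch below implements that reading; the occupancy rule and gating threshold are assumptions, not the authors' exact formulation.

```python
# Minimal sketch of an "effective inlier count": inlier keypoints are binned
# into a 20x16 grid and occupied cells are counted, so a burst of matches in
# one small region contributes no more than a single cell.
import numpy as np

def effective_inlier_count(points, img_w, img_h, cols=20, rows=16):
    """points: (N, 2) array of inlier (x, y) pixel coordinates."""
    pts = np.asarray(points, dtype=float)
    if pts.size == 0:
        return 0
    cx = np.clip((pts[:, 0] / img_w * cols).astype(int), 0, cols - 1)
    cy = np.clip((pts[:, 1] / img_h * rows).astype(int), 0, rows - 1)
    return len(set(zip(cx.tolist(), cy.tolist())))

# Gating example: fall back to the thermal stream when RGB tracking is weak.
n_eff_rgb = effective_inlier_count(np.random.rand(200, 2) * [640, 480], 640, 480)
use_thermal = n_eff_rgb < 40  # threshold is illustrative
```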

13 pages, 961 KB  
Communication
Impact of Background Removal on Cow Identification with Convolutional Neural Networks
by Gergana Balieva, Alexander Marazov, Dimitar Tanchev, Ivanka Lazarova and Ralitsa Rankova
Technologies 2026, 14(1), 50; https://doi.org/10.3390/technologies14010050 - 9 Jan 2026
Abstract
Individual animal identification is a cornerstone of animal welfare practices and is of crucial importance for food safety and the protection of humans from zoonotic diseases. It is also a key prerequisite for enabling automated processes in modern dairy farming. With newly emerging technologies, visual animal identification based on machine learning offers a more efficient and non-invasive method with high automation potential, accuracy, and practical applicability. However, a common challenge is the limited variability of training datasets, as images are typically captured in controlled environments with uniform backgrounds and fixed poses. This study investigates the impact of foreground segmentation and background removal on the performance of convolutional neural networks (CNNs) for cow identification. A dataset was created in which training images of dairy cows exhibited low variability in pose and background for each individual, whereas the test dataset introduced significant variation in both pose and environment. Both a fine-tuned CNN backbone and a model trained from scratch were evaluated using images with and without background information. The results demonstrate that although training on segmented foregrounds encourages the model to learn intrinsic biometric features, background cues carry more information for individual recognition.
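
The core manipulated variable in this study, presence or absence of background, comes down to a masking step before training. A minimal sketch, assuming a binary foreground mask is already available (e.g., from a pretrained segmentation model):

```python
# Illustrative preprocessing sketch: zero out the background with a binary
# foreground mask so the model can only use intrinsic (coat-pattern) cues.
# How the mask is obtained is outside this sketch.
import numpy as np

def remove_background(image, mask):
    """image: (H, W, 3) uint8; mask: (H, W) boolean foreground mask."""
    mask3 = np.repeat(np.asarray(mask, dtype=bool)[..., None], 3, axis=2)
    return np.where(mask3, image, 0)  # background pixels -> black

# Comparing a model trained on `image` against one trained on
# remove_background(image, mask) isolates how much identity information
# the background itself carries.
```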

27 pages, 4631 KB  
Article
Multimodal Minimal-Angular-Geometry Representation for Real-Time Dynamic Mexican Sign Language Recognition
by Gerardo Garcia-Gil, Gabriela del Carmen López-Armas and Yahir Emmanuel Ramirez-Pulido
Technologies 2026, 14(1), 48; https://doi.org/10.3390/technologies14010048 - 8 Jan 2026
Abstract
Current approaches to dynamic sign language recognition commonly rely on dense landmark representations, which impose high computational cost and hinder real-time deployment on resource-constrained devices. To address this limitation, this work proposes a computationally efficient framework for real-time dynamic Mexican Sign Language (MSL) recognition based on a multimodal minimal angular-geometry representation. Instead of processing complete landmark sets (e.g., MediaPipe Holistic with up to 468 keypoints), the proposed method encodes the relational geometry of the hands, face, and upper body into a compact set of 28 invariant internal angular descriptors. This representation substantially reduces feature dimensionality and computational complexity while preserving linguistically relevant manual and non-manual information required for grammatical and semantic discrimination in MSL. A real-time end-to-end pipeline is developed, comprising multimodal landmark extraction, angular feature computation, and temporal modeling using a Bidirectional Long Short-Term Memory (BiLSTM) network. The system is evaluated on a custom dataset of dynamic MSL gestures acquired under controlled real-time conditions. Experimental results demonstrate that the proposed approach achieves 99% accuracy and 99% macro F1-score, matching state-of-the-art performance while using dramatically fewer features. The compactness, interpretability, and efficiency of the minimal angular descriptor make the proposed system suitable for real-time deployment on low-cost devices, contributing toward more accessible and inclusive sign language recognition technologies.
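
Each of the 28 descriptors is an internal angle over a landmark triplet, which makes the representation invariant to translation and scale. A minimal sketch of one such descriptor follows; the specific triplets used in the paper are not reproduced here.

```python
# Sketch of one angular descriptor: the internal angle at landmark b formed
# by segments b->a and b->c. A set of such angles over hand/face/upper-body
# landmark triplets yields a compact, translation/scale-invariant feature
# vector for the downstream BiLSTM.
import numpy as np

def internal_angle(a, b, c):
    """a, b, c: (3,) landmark coordinates; returns the angle at b in degrees."""
    v1, v2 = np.asarray(a) - np.asarray(b), np.asarray(c) - np.asarray(b)
    cosang = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2) + 1e-9)
    return np.degrees(np.arccos(np.clip(cosang, -1.0, 1.0)))

# Example: elbow flexion from shoulder/elbow/wrist landmark positions.
print(internal_angle([0, 0, 0], [1, 0, 0], [1, 1, 0]))  # 90.0
```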

16 pages, 1956 KB  
Article
Post Hoc Error Correction for Missing Classes in Deep Neural Networks
by Andrey A. Lebedev, Victor B. Kazantsev and Sergey V. Stasenko
Technologies 2026, 14(1), 8; https://doi.org/10.3390/technologies14010008 - 22 Dec 2025
Abstract
This paper presents a novel post hoc error correction method that enables deep neural networks to recognize classes that were completely excluded during training. Unlike traditional approaches requiring full model retraining, our method uses hidden layer representations from any pre-trained classifier to detect and correct errors on missing categories. We demonstrate the approach on facial emotion recognition using the RAF-DB dataset, systematically excluding each of the seven emotion classes from training. The results show correction gains of up to 0.811 for excluded classes while maintaining 99% retention on known classes in the best setup. The method provides a computationally efficient alternative to retraining when new categories emerge after deployment.
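
One way to picture the corrector, under loose assumptions since the paper's exact construction is not reproduced here, is as a lightweight detector on hidden-layer features that flags likely members of the excluded class and overrides the base prediction:

```python
# Conceptual sketch of a post hoc corrector: a detector trained on hidden-layer
# features flags inputs that likely belong to a class absent from the base
# model's training. Feature extractor, detector choice, and threshold are
# illustrative, not the paper's exact construction.
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_corrector(hidden_feats, is_missing_class):
    """hidden_feats: (N, D) penultimate activations; is_missing_class: (N,) bool."""
    det = LogisticRegression(max_iter=1000)
    det.fit(hidden_feats, is_missing_class.astype(int))
    return det

def corrected_predict(base_pred, hidden_feat, detector, missing_label, thr=0.5):
    """Override the base classifier when the detector fires."""
    p_missing = detector.predict_proba(hidden_feat.reshape(1, -1))[0, 1]
    return missing_label if p_missing >= thr else base_pred
```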

19 pages, 444 KB  
Article
Enhancing Cascade Object Detection Accuracy Using Correctors Based on High-Dimensional Feature Separation
by Andrey V. Kovalchuk, Andrey A. Lebedev, Olga V. Shemagina, Irina V. Nuidel, Vladimir G. Yakhno and Sergey V. Stasenko
Technologies 2025, 13(12), 593; https://doi.org/10.3390/technologies13120593 - 16 Dec 2025
Abstract
This study addresses the problem of correcting systematic errors in classical cascade object detectors under severe data scarcity and distribution shift. We focus on the widely used Viola–Jones framework enhanced with a modified Census transform and propose a modular “corrector” architecture that can be attached to an existing detector without retraining it. The key idea is to exploit the blessing of dimensionality: high-dimensional feature vectors constructed from multiple cascade stages are transformed by PCA and whitening into a space where simple linear Fisher discriminants can reliably separate rare error patterns from normal operation using only a few labeled examples. The approach involves image partitioning through a sliding window of fixed aspect ratio and a modified census transform in which pixel intensity is compared to the mean value within a rectangular neighborhood. Training samples for false negative and false positive correctors are selected using dual Intersection-over-Union (IoU) thresholds and probabilistic sampling of true positive and true negative fragments. Corrector models are trained based on the principles of high-dimensional separability within the paradigm of one- and few-shot learning, utilizing features derived from cascade stages of the detector. Decision boundaries are optimized using Fisher’s rule, with adaptive thresholding to guarantee zero false acceptance. Experimental results indicate that the proposed correction scheme enhances object detection accuracy by effectively compensating for classifier errors, particularly under conditions of scarce training data. On two railway image datasets with only about one thousand images each, the proposed correctors increase Precision from 0.36 to 0.65 on identifier detection while maintaining high Recall (0.98 → 0.94), and improve digit detection Recall from 0.94 to 0.98 with negligible loss in Precision (0.92 → 0.91). These results demonstrate that even under scarce training data, high-dimensional feature separation enables effective one-/few-shot error correction for cascade detectors with minimal computational overhead.
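
The modified census transform described here compares each pixel against the mean of its rectangular neighborhood rather than against the center pixel, yielding one binary code per location. A minimal sketch assuming a 3×3 window (9-bit codes):

```python
# Sketch of a modified census transform: each pixel in a 3x3 neighborhood is
# compared against the neighborhood mean, producing a 9-bit code per location.
# The window size is an assumption; the paper specifies only a rectangular
# neighborhood.
import numpy as np

def modified_census(img):
    """img: (H, W) float grayscale -> (H-2, W-2) 9-bit MCT codes."""
    H, W = img.shape
    # The nine shifted views cover every 3x3 neighborhood in the valid region.
    shifts = [img[dy:dy + H - 2, dx:dx + W - 2]
              for dy in range(3) for dx in range(3)]
    mean = sum(shifts) / 9.0
    codes = np.zeros((H - 2, W - 2), dtype=np.uint16)
    for patch in shifts:
        codes = (codes << 1) | (patch > mean).astype(np.uint16)
    return codes
```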

19 pages, 2027 KB  
Article
Novel End-to-End CNN Approach for Fault Diagnosis in Electromechanical Systems Based on Relevant Heating Areas in Thermography
by Gilberto Alvarado-Robles, Angel Perez-Cruz, Isac Andres Espinosa-Vizcaino, Arturo Yosimar Jaen-Cuellar and Juan Jose Saucedo-Dorantes
Technologies 2025, 13(12), 551; https://doi.org/10.3390/technologies13120551 - 26 Nov 2025
Abstract
The reliability of electromechanical systems is a critical factor in modern Industry 4.0, as unexpected failures in induction motors or gearboxes can cause costly downtime, productivity losses, and increased maintenance demands. Infrared thermography offers a non-invasive and real-time means of monitoring thermal behavior, yet its effective use for fault diagnosis remains challenging due to sensitivity to noise, environmental variability, and the need for robust feature extraction. This work proposes a novel end-to-end convolutional neural network (CNN) methodology for detecting and classifying faults in electromechanical systems through the processing of infrared thermography images. The method integrates an automatic preprocessing stage that isolates the Relevant Heating Areas (RHAs), preserving their geometric and thermal descriptors while filtering irrelevant background information. A tailored data augmentation strategy, including controlled noise injection, was designed to improve robustness under realistic acquisition conditions. The CNN architecture combines 3 × 3 and 5 × 5 kernels to capture both fine-grained and global heating patterns. Experimental validation is carried out under nine different faulty conditions, achieving 99.7% accuracy and demonstrating strong resilience against Gaussian blur and additive Gaussian noise. The results suggest that the method provides a scalable, interpretable, and efficient approach for fault diagnosis in electromechanical systems within Industry 4.0 environments.
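
The architectural idea of pairing 3 × 3 and 5 × 5 kernels can be sketched as a parallel-branch block whose outputs are concatenated along the channel axis. The PyTorch sketch below is illustrative only; channel counts, depth, and the classifier head are assumptions, not the paper's architecture.

```python
# Minimal sketch of a block combining 3x3 (fine) and 5x5 (coarse) kernels in
# parallel, in the spirit of the dual-kernel design described in the abstract.
import torch
import torch.nn as nn

class DualKernelBlock(nn.Module):
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.fine = nn.Conv2d(in_ch, out_ch // 2, kernel_size=3, padding=1)
        self.coarse = nn.Conv2d(in_ch, out_ch // 2, kernel_size=5, padding=2)
        self.act = nn.ReLU(inplace=True)
        self.pool = nn.MaxPool2d(2)

    def forward(self, x):
        # Concatenate fine (3x3) and coarse (5x5) responses along channels.
        y = torch.cat([self.fine(x), self.coarse(x)], dim=1)
        return self.pool(self.act(y))

model = nn.Sequential(
    DualKernelBlock(1, 32),   # single-channel thermal input assumed
    DualKernelBlock(32, 64),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(64, 9),         # nine fault conditions, per the abstract
)
print(model(torch.randn(1, 1, 128, 128)).shape)  # torch.Size([1, 9])
```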

27 pages, 1220 KB  
Article
Robust Supervised Deep Discrete Hashing for Cross-Modal Retrieval
by Xiwei Dong, Fei Wu, Junqiu Zhai, Fei Ma, Guangxing Wang, Tao Liu, Xiaogang Dong and Xiao-Yuan Jing
Technologies 2025, 13(9), 383; https://doi.org/10.3390/technologies13090383 - 29 Aug 2025
Abstract
The exponential growth of multi-modal data in the real world poses significant challenges to efficient retrieval, and traditional single-modal methods are no longer suitable for such data. To address this issue, hashing retrieval methods play an important role in cross-modal retrieval tasks involving large amounts of multi-modal data. However, effectively embedding multi-modal data into a common low-dimensional Hamming space remains challenging. A critical issue is that feature redundancies in existing methods lead to suboptimal hash codes, severely degrading retrieval performance; yet, selecting optimal features remains an open problem in deep cross-modal hashing. In this paper, we propose an end-to-end approach, named Robust Supervised Deep Discrete Hashing (RSDDH), which can accomplish feature learning and hashing learning simultaneously. RSDDH has a hybrid deep architecture consisting of a convolutional neural network and a multilayer perceptron adaptively learning modality-specific representations. Moreover, it utilizes a non-redundant feature selection strategy to select optimal features for generating discriminative hash codes. Furthermore, it employs a direct discrete hashing scheme (SVDDH) to solve the binary constraint optimization problem without relaxation, fully preserving the intrinsic properties of hash codes. Additionally, RSDDH employs inter-modal and intra-modal consistency preservation strategies to reduce the gap between modalities and improve the discriminability of the learned Hamming space. Extensive experiments on four benchmark datasets demonstrate that RSDDH significantly outperforms state-of-the-art cross-modal hashing methods.
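
The retrieval side of any such method reduces to binarizing embeddings into the common Hamming space and ranking by Hamming distance. The sketch below shows only that generic step, not RSDDH's networks or its SVDDH discrete optimization:

```python
# Generic cross-modal hashing retrieval sketch: real-valued embeddings from
# each modality are binarized into codes in a shared Hamming space, and
# retrieval ranks database items by Hamming distance to the query.
import numpy as np

def to_hash(embeddings):
    """(N, B) real-valued embeddings -> (N, B) codes in {0, 1}."""
    return (embeddings > 0).astype(np.uint8)

def hamming_rank(query_code, db_codes):
    """Rank database items by Hamming distance to the query code."""
    dists = np.count_nonzero(db_codes != query_code, axis=1)
    return np.argsort(dists), dists

# Example: a 64-bit image-query code against text-item codes in the shared space.
rng = np.random.default_rng(0)
db = to_hash(rng.standard_normal((1000, 64)))
order, d = hamming_rank(to_hash(rng.standard_normal(64)), db)
print(order[:5], d[order[:5]])
```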

15 pages, 7157 KB  
Article
RADAR: Reasoning AI-Generated Image Detection for Semantic Fakes
by Haochen Wang, Xuhui Liu, Ziqian Lu, Cilin Yan, Xiaolong Jiang, Runqi Wang and Efstratios Gavves
Technologies 2025, 13(7), 280; https://doi.org/10.3390/technologies13070280 - 2 Jul 2025
Abstract
As modern generative models advance rapidly, AI-generated images exhibit higher resolution and lifelike details. However, the generated images may not adhere to world knowledge and common sense, as there is no such awareness and supervision in the generative models. For instance, the generated images could feature a penguin walking in the desert or a man with three arms, scenarios that are highly unlikely to occur in real life. Current AI-generated image detection methods mainly focus on low-level features, such as detailed texture patterns and frequency domain inconsistency, which are specific to certain generative models, making it challenging to identify the above-mentioned general semantic fakes. In this work, (1) we propose a new task, reasoning AI-generated image detection, which focuses on identifying semantic fakes in generative images that violate world knowledge and common sense. (2) To benchmark the new task, we collect a new dataset Spot the Semantic Fake (STSF). STSF contains 358 images with clear semantic fakes generated by three different modern diffusion models and provides bounding boxes as well as text annotations to locate the fakes. (3) We propose RADAR, a reasoning AI-generated image detection assistor, to locate semantic fakes in the generative images and output corresponding text explanations. Specifically, RADAR contains a specialized multimodal LLM to process given images and detect semantic fakes. To improve the generalization ability, we further incorporate ChatGPT as an assistor to detect unrealistic components in grounded text descriptions. The experiments on the STSF dataset show that RADAR effectively detects semantic fakes in modern generative images.

16 pages, 5373 KB  
Article
Design and Development of an Electronic Interface for Acquiring Signals from a Piezoelectric Sensor for Ultrasound Imaging Applications
by Elizabeth Espitia-Romero, Adriana Guzmán-López, Micael Gerardo Bravo-Sánchez, Juan José Martínez-Nolasco, José Alfredo Padilla Medina and Francisco Villaseñor-Ortega
Technologies 2025, 13(7), 270; https://doi.org/10.3390/technologies13070270 - 25 Jun 2025
Abstract
The increasing demand for accurate and accessible medical imaging has driven efforts to develop technologies that overcome limitations associated with conventional imaging techniques, such as MRI and CT scans. This study presents the design and implementation of an electronic interface for acquiring signals from a piezoelectric ultrasound sensor with the aim of improving image reconstruction quality by addressing electromagnetic interference and speckle noise, two major factors that degrade image fidelity. The proposed interface is installed between the ultrasound transducer and acquisition system, allowing real-time signal capture without altering the medical equipment’s operation. Using a printed circuit board with 110-pin connectors, signals from individual piezoelectric elements were analyzed using an oscilloscope. Results show that noise amplitudes occasionally exceed those of the acoustic echoes, potentially compromising image quality. By enabling direct observation of these signals, the interface facilitates the future development of analog filtering solutions to mitigate high-frequency noise before digital processing. This approach reduces reliance on computationally expensive digital filtering, offering a low-cost, real-time alternative. The findings underscore the potential of the interface to enhance diagnostic accuracy and support further innovation in medical imaging technologies.

19 pages, 16547 KB  
Article
A New Method for Camera Auto White Balance for Portrait
by Sicong Zhou, Kaida Xiao, Changjun Li, Peihua Lai, Hong Luo and Wenjun Sun
Technologies 2025, 13(6), 232; https://doi.org/10.3390/technologies13060232 - 5 Jun 2025
Abstract
Accurate skin color reproduction under varying correlated color temperature (CCT) remains a critical challenge in the graphic arts, impacting applications such as face recognition, portrait photography, and human–computer interaction. Traditional AWB methods like gray-world or max-RGB often rely on statistical assumptions, which limit their accuracy under complex or extreme lighting. We propose SCR-AWB, a novel algorithm that leverages real skin reflectance data to estimate the scene illuminant’s spectral power distribution (SPD) and CCT, enabling accurate skin tone reproduction. The method integrates prior knowledge of human skin reflectance, basis vectors, and camera sensitivity to perform pixel-wise spectral estimation. Experimental results on a difficult skin color reproduction task demonstrate that SCR-AWB significantly outperforms traditional AWB algorithms. It achieves lower reproduction angle errors and more accurate CCT predictions, with deviations below 300 K in most cases. These findings validate SCR-AWB as an effective and computationally efficient solution for robust skin color correction.
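
For reference, the gray-world baseline that SCR-AWB is compared against scales each channel so that the image's mean color becomes neutral. A minimal sketch of that baseline (not of SCR-AWB's spectral estimation):

```python
# Classical gray-world AWB baseline mentioned in the abstract: assume the
# average scene color is gray and scale each channel accordingly. SCR-AWB
# instead estimates the illuminant SPD/CCT from skin reflectance, which this
# sketch does not attempt.
import numpy as np

def gray_world(img):
    """img: (H, W, 3) float RGB in [0, 1] -> white-balanced image."""
    means = img.reshape(-1, 3).mean(axis=0)
    gains = means.mean() / (means + 1e-9)  # push each channel mean to gray
    return np.clip(img * gains, 0.0, 1.0)
```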

21 pages, 8188 KB  
Article
New Approach to Dominant and Prominent Color Extraction in Images with a Wide Range of Hues
by Yurii Kynash and Mariia Semeniv
Technologies 2025, 13(6), 230; https://doi.org/10.3390/technologies13060230 - 4 Jun 2025
Abstract
Dominant colors significantly influence visual image perception and are widely used in computer vision and design. Traditional extraction methods often neglect visually salient colors that occupy small areas yet possess high aesthetic relevance. This study introduces a method for detecting both dominant and visually prominent colors in images with a wide range of hues. We analyzed the color gamut of images in the CIE L*a*b* color space and concluded that it is difficult to identify the dominant and prominent colors due to high color variability. To address these challenges, the proposed approach transforms images into the orthogonal ICaS color space, integrating the properties of RGB and CMYK models, followed by K-means clustering. A spectral residual saliency map is applied to exclude background regions and emphasize perceptually significant objects. Experimental evaluation on an image database shows that the proposed method yields color palettes with broader gamut coverage, preserved luminance, and visually balanced combinations. A comparative analysis was conducted using the ΔE00 metric, which accounts not only for differences in lightness, chroma, and hue but also for the perceptual interactions between colors, based on their proximity in the color space. The results confirm that the proposed method exhibits greater color stability and aesthetic coherence than existing approaches. These findings highlight the effectiveness of the orthogonal saliency mean method for delivering a more perceptually accurate and visually consistent representation of the dominant colors in an image. This outcome validates the method’s applicability for image analysis and design.
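
The clustering stage is standard K-means over pixel colors; the paper's contribution lies in the ICaS color space and saliency masking applied around it, which this plain-RGB sketch deliberately omits:

```python
# Sketch of the K-means stage of dominant-color extraction, run on raw RGB
# pixels for illustration only (the paper clusters in its ICaS space after
# saliency-based background exclusion).
import numpy as np
from sklearn.cluster import KMeans

def dominant_colors(img, k=5):
    """img: (H, W, 3) uint8 RGB -> (k, 3) cluster centers sorted by share."""
    pixels = img.reshape(-1, 3).astype(float)
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(pixels)
    counts = np.bincount(km.labels_, minlength=k)
    order = np.argsort(counts)[::-1]  # most frequent color first
    return km.cluster_centers_[order].astype(np.uint8), counts[order]
```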

23 pages, 5095 KB  
Article
Human-Machine Interaction: A Vision-Based Approach for Controlling a Robotic Hand Through Human Hand Movements
by Gerardo García-Gil, Gabriela del Carmen López-Armas and José de Jesús Navarro, Jr.
Technologies 2025, 13(5), 169; https://doi.org/10.3390/technologies13050169 - 23 Apr 2025
Abstract
An anthropomorphic robot is a mechanical device designed to perform human-like tasks, such as manipulating objects, and has been one of the significant contributions in robotics over the past 60 years. This paper presents an advanced system for controlling a robotic arm using user hand gestures and movements. It eliminates the need for traditional sensors or physical controls by implementing an intuitive approach based on MediaPipe and computer vision. The system recognizes the user’s hand movements and translates them into commands sent to a microcontroller, which operates a robotic hand equipped with six servomotors: five for the fingers and one for the wrist. The wrist stands out for its orthonormal design, which avoids occlusion problems in rotations of up to 180° and guarantees precise wrist control. Unlike conventional systems, this approach uses only a 2D camera to capture movements, simplifying design and reducing costs. The proposed system allows replicating the user’s activity with high precision, expanding the possibilities of human–robot interaction. Notably, the system has been able to replicate the user’s hand gestures with an accuracy of up to 95%.
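
The pipeline in this paper, camera to hand landmarks to servo commands, can be outlined with MediaPipe and a serial link. In the sketch below the landmark indices follow MediaPipe's hand model, but the flexion-to-servo mapping, port name, baud rate, and wire protocol are illustrative assumptions rather than the authors' design:

```python
# Hedged sketch of a vision-to-servo loop: MediaPipe hand landmarks from a 2D
# camera are mapped to a servo angle sent over serial (pyserial).
import cv2
import mediapipe as mp
import numpy as np
import serial

def flexion_angle(lm, a, b, c):
    """Angle (deg) at landmark b formed by landmarks a and c."""
    p = lambda i: np.array([lm[i].x, lm[i].y, lm[i].z])
    v1, v2 = p(a) - p(b), p(c) - p(b)
    cosang = v1 @ v2 / (np.linalg.norm(v1) * np.linalg.norm(v2) + 1e-9)
    return float(np.degrees(np.arccos(np.clip(cosang, -1, 1))))

port = serial.Serial("/dev/ttyUSB0", 115200)  # assumed port and baud rate
hands = mp.solutions.hands.Hands(max_num_hands=1)
cap = cv2.VideoCapture(0)
while cap.isOpened():
    ok, frame = cap.read()
    if not ok:
        break  # loop ends when the camera stops delivering frames
    res = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
    if res.multi_hand_landmarks:
        lm = res.multi_hand_landmarks[0].landmark
        # Index-finger flexion (MCP=5, PIP=6, TIP=8) mapped to a 0-180 servo.
        servo = int(np.clip(flexion_angle(lm, 5, 6, 8), 0, 180))
        port.write(f"S1:{servo}\n".encode())  # illustrative wire protocol
```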

Other


22 pages, 574 KB  
Systematic Review
Measurement Error of Markerless Motion Capture Systems Applied to Tracking Movements in Human–Object Interaction Tasks: A Systematic Review with Best Evidence Synthesis
by Nicole Unsihuay, Rene F. Clavo and Luiz H. Palucci Vieira
Technologies 2026, 14(1), 28; https://doi.org/10.3390/technologies14010028 - 1 Jan 2026
Abstract
This systematic review focused on the validity of markerless motion capture (MMC) systems used for human movement assessment during tasks that involve physical interaction with objects. Five electronic databases were searched until May 2025. Eligible studies (i) assessed the validity of an MMC system, (ii) required human participants to perform tasks that involved physical interaction with objects (e.g., lifts, carrying, gait with loads), (iii) employed a marker-based reference system, and (iv) reported at least one kinematic metric. Risk of bias was assessed using the SURE checklist. A best-evidence synthesis was conducted to classify the level of evidence across included studies. Fifteen studies met eligibility (median = 21 participants per study). In general, MMC systems performed well in capturing movement waveforms (i.e., high associations with reference systems), but their precision (i.e., the magnitude of differences from the reference systems) still requires improvement for tasks involving human–object interactions. Most tasks analyzed were lifts, gait with load, squatting, reaching/manipulation, and technical gestures. There was strong evidence for the validity of MMC during lifting tasks. In summary, MMC systems exhibit promising evidence of validity for some human–object interaction tasks, especially lifting, for which strong evidence was observed across studies. In contrast, evidence for tasks including gait under load, squatting, reaching, or touchscreen interaction is limited, moderate, or conflicting. Notwithstanding these limitations, most studies were observed to have moderate- to high-quality methodology. Additional research is required to optimize protocols for studying the measurement error of MMC in human–object interaction within real-world environments.
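
The review's distinction between waveform association and magnitude of differences maps onto two familiar statistics, correlation and RMSE, computed between time-synchronized markerless and marker-based signals. A minimal sketch of that comparison (the review's actual synthesis procedure is more involved):

```python
# Sketch of a typical waveform-level agreement analysis: correlation captures
# how well a markerless system tracks the movement's shape, while RMSE
# captures the magnitude of its error against a marker-based reference.
import numpy as np

def agreement(markerless, marker_based):
    """Both: (T,) joint-angle waveforms sampled at matching instants."""
    x, y = np.asarray(markerless, float), np.asarray(marker_based, float)
    r = np.corrcoef(x, y)[0, 1]                   # waveform association
    rmse = float(np.sqrt(np.mean((x - y) ** 2)))  # magnitude of differences
    return r, rmse

t = np.linspace(0, 1, 100)
ref = 40 * np.sin(2 * np.pi * t)                  # synthetic lifting cycle
print(agreement(ref + np.random.normal(0, 3, 100), ref))
```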
