Advances in Image Recognition, Image Segmentation, Image Fusion, and Singal Processing

A special issue of Electronics (ISSN 2079-9292). This special issue belongs to the section "Computer Science & Engineering".

Deadline for manuscript submissions: 15 October 2025 | Viewed by 871

Special Issue Editor

Foundation Model Group, Artificial Intelligence Department, Brookhaven National Laboratory, Upton, NY 11741, USA
Interests: weakly supervised Image segmentation; domain generalization; information theoretical learning; visual-language model; LLM agent

Special Issue Information

Dear Colleagues,

We are pleased to invite you to contribute to our Special Issue titled "Advances in Image Recognition, Image Segmentation, Image Fusion, and Singal Processing". This research area stands at the forefront of modern computational techniques, offering transformative applications in medical imaging, remote sensing, autonomous vehicles, and more. With rapid advancements in machine learning and deep learning, innovative approaches in image processing and analysis are becoming increasingly critical. In particular, the integration of visual-language models is revolutionizing how systems understand and relate visual content to natural language, while novel grounding segmentation techniques are enhancing object localization by leveraging textual and contextual cues. This Special Issue aims to highlight the latest methodologies and breakthroughs that address both the theoretical and practical challenges in these domains.

This Special Issue aims to gather a collection of high-quality articles that explore advanced techniques and novel applications in image recognition, segmentation, fusion, and signal processing. We aim to foster interdisciplinary collaborations and present research that bridges cutting-edge approaches with real-world applications.

In this Special Issue, original research articles and reviews are welcome. Research areas may include (but not limited to) the following:

  • Advanced Algorithms for Image Recognition and Classification: Cutting-edge approaches in feature extraction, deep learning models, and performance optimization.
  • Innovative Techniques in Image Segmentation: Including traditional methods and deep learning-based segmentation, with a focus on both semantic and instance segmentation.
  • Visual-Language Models: Research on models that integrate visual data with natural language processing, enabling richer interpretation of scenes and enhanced human-machine interaction.
  • Grounding Segmentation Techniques: Approaches that leverage textual cues and contextual information to improve segmentation accuracy and object localization.
  • Methods and Applications in Image Fusion: Integrating data from multiple sources to generate comprehensive images for improved decision-making.
  • Novel Developments in Signal Processing: Advanced methodologies for image and multimedia analysis across diverse applications.
  • Case Studies and Comparative Evaluations: Empirical studies demonstrating the practical impacts and performance comparisons of state-of-the-art methods.

We look forward to receiving your contributions.

Dr. Xi Yu
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Electronics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • image and video segmentation
  • image and video understanding
  • visual-language model
  • grounding segmentation
  • multi-modality (e.g., image, text, and video) learning

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • Reprint: MDPI Books provides the opportunity to republish successful Special Issues in book format, both online and in print.

Further information on MDPI's Special Issue policies can be found here.

Published Papers (3 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

23 pages, 27054 KB  
Article
ActionMamba: Action Spatial–Temporal Aggregation Network Based on Mamba and GCN for Skeleton-Based Action Recognition
by Jinglong Wen, Dan Liu and Bin Zheng
Electronics 2025, 14(18), 3610; https://doi.org/10.3390/electronics14183610 - 11 Sep 2025
Abstract
Skeleton-based action recognition networks have widely adopted the approach of Graph Convolutional Networks (GCN) due to their superior capabilities in modeling data topology, but several key issues still require further investigation. Firstly, the graph convolutional network extracts action features by applying temporal convolution [...] Read more.
Skeleton-based action recognition networks have widely adopted the approach of Graph Convolutional Networks (GCN) due to their superior capabilities in modeling data topology, but several key issues still require further investigation. Firstly, the graph convolutional network extracts action features by applying temporal convolution to each key point, which causes the model to ignore the temporal connections between different important points. Secondly, the local receptive field of graph convolutional networks limits their ability to capture correlations between non-adjacent joints. Motivated by the State Space Model (SSM), we propose an Action Spatio-temporal Aggregation Network, named ActionMamba. Specifically, we introduce a novel embedding module called the Action Characteristic Encoder (ACE), which enhances the coupling of temporal and spatial information in skeletal features by combining intrinsic spatio-temporal encoding with extrinsic space encoding. Additionally, we design an Action Perception Model (APM) based on Mamba and GCN. By effectively combining the excellent feature processing capabilities of GCN with the outstanding global information modeling capabilities of Mamba, APM is able to comprehend the hidden features between different joints and selectively filter information from various joints. Extensive experimental results demonstrate that ActionMamba achieves highly competitive performance on three challenging benchmark datasets: NTU-RGB+D 60, NTU-RGB+D 120, and UAV–Human. Full article
Show Figures

Figure 1

30 pages, 59872 KB  
Article
Advancing 3D Seismic Fault Identification with SwiftSeis-AWNet: A Lightweight Architecture Featuring Attention-Weighted Multi-Scale Semantics and Detail Infusion
by Ang Li, Rui Li, Yuhao Zhang, Shanyi Li, Yali Guo, Liyan Zhang and Yuqing Shi
Electronics 2025, 14(15), 3078; https://doi.org/10.3390/electronics14153078 - 31 Jul 2025
Viewed by 334
Abstract
The accurate identification of seismic faults, which serve as crucial fluid migration pathways in hydrocarbon reservoirs, is of paramount importance for reservoir characterization. Traditional interpretation is inefficient. It also struggles with complex geometries, failing to meet the current exploration demands. Deep learning boosts [...] Read more.
The accurate identification of seismic faults, which serve as crucial fluid migration pathways in hydrocarbon reservoirs, is of paramount importance for reservoir characterization. Traditional interpretation is inefficient. It also struggles with complex geometries, failing to meet the current exploration demands. Deep learning boosts fault identification significantly but struggles with edge accuracy and noise robustness. To overcome these limitations, this research introduces SwiftSeis-AWNet, a novel lightweight and high-precision network. The network is based on an optimized MedNeXt architecture for better fault edge detection. To address the noise from simple feature fusion, a Semantics and Detail Infusion (SDI) module is integrated. Since the Hadamard product in SDI can cause information loss, we engineer an Attention-Weighted Semantics and Detail Infusion (AWSDI) module that uses dynamic multi-scale feature fusion to preserve details. Validation on field seismic datasets from the Netherlands F3 and New Zealand Kerry blocks shows that SwiftSeis-AWNet mitigates challenges like the loss of small-scale fault features and misidentification of fault intersection zones, enhancing the accuracy and geological reliability of automated fault identification. Full article
Show Figures

Figure 1

16 pages, 6397 KB  
Article
Heterogenous Image Matching Fusion Based on Cumulative Structural Similarity
by Nan Zhu, Shiman Yang and Zhongxun Wang
Electronics 2025, 14(13), 2693; https://doi.org/10.3390/electronics14132693 - 3 Jul 2025
Viewed by 294
Abstract
To solve the problem of the limited capability of multimodal image feature descriptors constructed by gradient information and the phase consistency principle, a method of cumulative structure feature descriptor construction with rotation invariance is proposed in this paper. Firstly, we extract the direction [...] Read more.
To solve the problem of the limited capability of multimodal image feature descriptors constructed by gradient information and the phase consistency principle, a method of cumulative structure feature descriptor construction with rotation invariance is proposed in this paper. Firstly, we extract the direction of multi-scale and multi-direction feature point edges using the Log-Gabor odd-symmetric filter and calculate the amplitude of pixel edges based on the phase consistency principle. Then, the main direction of the key points is determined based on the edge direction feature map, and the coordinates are established according to the main direction to ensure that the feature point descriptor has rotation invariance. Finally, the Log-Gabor odd-symmetric filter calculates the cumulative structural response in the maximum direction and constructs a highly identifiable descriptor with rotation invariance. We select several representative heterogeneous images as test data and compare the matching performance of the proposed algorithm with several excellent descriptors. The results indicate that the descriptor constructed in this paper is more robust than other descriptors for heterosource images with rotation changes. Full article
Show Figures

Figure 1

Back to TopTop