Topic Editors

Dr. Jun Xu
School of Statistics and Data Science, Nankai University, Tianjin 300071, China
College of Software, Northeastern University, Shenyang 110819, China

Image Processing, Signal Processing and Their Applications

Abstract submission deadline
closed (16 May 2026)
Manuscript submission deadline
16 July 2026
Viewed by
21139

Topic Information

Dear Colleagues,

Signal processing involves the analysis, modification, and synthesis of signals, such as sound, images, and scientific measurements. A variety of techniques are used to improve, extract, or compress information from raw data. Image processing, a subset of signal processing, focuses on visual data, such as photographs or video frames, and aims to enhance image quality, detect features, and transform images for interpretation or analysis. This Topic presents a wide range of research on image processing and signal processing, as well as their applications, covering the following subjects: Medical imaging; Speech and audio processing; Machine Learning for signal processing; Image and video processing; Image enhancement; Image restoration; Segmentation; Edge detection; Compression; Color image processing; Communications and networking; Computer vision; Multimedia.

Dr. Jun Xu
Prof. Dr. Lianbo Ma
Topic Editors

Keywords

  • medical imaging
  • speech and audio processing
  • machine learning for signal processing
  • image and video processing
  • image enhancement
  • image restoration
  • segmentation
  • edge detection
  • compression
  • color image processing
  • communications and networking
  • computer vision
  • multimedia

Participating Journals

Journal Name Impact Factor CiteScore Launched Year First Decision (median) APC
Applied Sciences
applsci
2.5 5.5 2011 16 Days CHF 2400 Submit
Information
information
2.9 6.5 2010 20.9 Days CHF 1800 Submit
Journal of Imaging
jimaging
3.3 6.7 2015 18 Days CHF 1800 Submit
Remote Sensing
remotesensing
4.1 8.6 2009 24.3 Days CHF 2700 Submit
Signals
signals
2.6 4.6 2020 21.8 Days CHF 1200 Submit
Symmetry
symmetry
2.2 5.3 2009 15.8 Days CHF 2400 Submit

Preprints.org is a multidisciplinary platform offering a preprint service designed to facilitate the early sharing of your research. It supports and empowers your research journey from the very beginning.

MDPI Topics is collaborating with Preprints.org and has established a direct connection between MDPI journals and the platform. Authors are encouraged to take advantage of this opportunity by posting their preprints at Preprints.org prior to publication:

  1. Share your research immediately: disseminate your ideas prior to publication and establish priority for your work.
  2. Safeguard your intellectual contribution: Protect your ideas with a time-stamped preprint that serves as proof of your research timeline.
  3. Boost visibility and impact: Increase the reach and influence of your research by making it accessible to a global audience.
  4. Gain early feedback: Receive valuable input and insights from peers before submitting to a journal.
  5. Ensure broad indexing: Web of Science (Preprint Citation Index), Google Scholar, Crossref, SHARE, PrePubMed, Scilit and Europe PMC.

Published Papers (23 papers)

Order results
Result details
Journals
Select all
Export citation of selected articles as:
20 pages, 1096 KB  
Article
Wavelet Basis Selection in Signal Denoising Based on Wavelet-Coefficient Distribution Shape
by Mladen Tomic and Marko Gulic
Signals 2026, 7(3), 39; https://doi.org/10.3390/signals7030039 - 2 May 2026
Viewed by 369
Abstract
Denoising one-dimensional signals by wavelet shrinkage critically depends on the choice of wavelet basis, yet basis selection is often guided by heuristics rather than explicit statistical criteria. This paper investigates the relationship between wavelet-basis properties and the shape of the probability density function [...] Read more.
Denoising one-dimensional signals by wavelet shrinkage critically depends on the choice of wavelet basis, yet basis selection is often guided by heuristics rather than explicit statistical criteria. This paper investigates the relationship between wavelet-basis properties and the shape of the probability density function (PDF) of the detail coefficients in the coarsest retained detail subband. On this basis, it proposes the shape of this PDF as a criterion for wavelet-basis selection. We hypothesize that, for a fixed decomposition depth, noise model, and shrinkage rule, a basis better matched to the signal’s local regularity produces a narrower and more sharply peaked coefficient PDF in this subband than a mismatched basis and can therefore serve as a data-driven indicator for basis selection. To evaluate the consistency of this proposal, we perform controlled hard-thresholding experiments on six canonical test signals, five wavelet bases, and additive white Gaussian noise. Although the test signals differ significantly in local regularity and features, the relationship between basis suitability and PDF shape is confirmed for each of them. Therefore, the results suggest that the proposed PDF-shape criterion is a valuable indicator for wavelet-basis selection. Full article
Show Figures

Figure 1

11 pages, 3992 KB  
Article
Cauchy Norm-Constrained Nonstationary High-Resolution Processing for Seismic Data
by Shengjun Li, Jinyong Gui, Bingyang Liu, Xin Guo and Hui Pan
Appl. Sci. 2026, 16(9), 4075; https://doi.org/10.3390/app16094075 - 22 Apr 2026
Viewed by 211
Abstract
Due to the intrinsic Q attenuation of geological formations, seismic waves experience amplitude and frequency attenuation during propagation, which results in reduced resolution and pronounced discrepancies between shallow and deep seismic data. Specifically, deep reflections exhibit weakened amplitudes and diminished high-frequency content. To [...] Read more.
Due to the intrinsic Q attenuation of geological formations, seismic waves experience amplitude and frequency attenuation during propagation, which results in reduced resolution and pronounced discrepancies between shallow and deep seismic data. Specifically, deep reflections exhibit weakened amplitudes and diminished high-frequency content. To mitigate these effects, a Q-compensation method based on nonstationary inversion is proposed to enhance seismic resolution and improve vertical consistency. A nonstationary reflectivity inversion framework is first established using a nonstationary convolution model with Cauchy norm regularization. The formation quality factor (Q) is then estimated from seismic data via the spectral ratio method using selected shallow and deep time windows. By incorporating the estimated Q value and an initial seismic wavelet, the proposed inversion simultaneously compensates for Q attenuation and wavelet effects, yielding a high-resolution reflectivity series. Q-compensated seismic data are subsequently reconstructed through the convolution of this reflectivity and an appropriate seismic wavelet. Both the model test and the field data application results demonstrate that the proposed method effectively compensates for Q attenuation, significantly enhances seismic resolution, and restores amplitude and frequency consistency between shallow and deep data. Full article
Show Figures

Figure 1

17 pages, 12185 KB  
Article
Adjustable Complexity Transformer Architecture for Image Denoising
by Jan-Ray Liao, Wen Lin and Li-Wen Chang
Signals 2026, 7(2), 33; https://doi.org/10.3390/signals7020033 - 6 Apr 2026
Viewed by 757
Abstract
In recent years, image denoising has seen a shift from traditional non-local self-similarity methods like BM3D to deep-learning based approaches that use learnable convolutions and attention mechanisms. While pixel-level attention is effective at capturing long-range relationships similar to non-local self-similarity based methods, it [...] Read more.
In recent years, image denoising has seen a shift from traditional non-local self-similarity methods like BM3D to deep-learning based approaches that use learnable convolutions and attention mechanisms. While pixel-level attention is effective at capturing long-range relationships similar to non-local self-similarity based methods, it incurs extremely high computational costs that scale quadratically with image resolution. As an alternative, channel-wise attention is resolution-independent and computationally efficient but may miss crucial spatial details. In this paper, an adjustable attention mechanism is introduced that bridges the gap between pixel and channel attentions. In the proposed model, average pooling and variable-size convolutions are added before attention calculation to adjust spatial resolution and, thus, allow dynamical adjustment of computational complexity. This adjustable attention is applied in a transformer-based U-Net architecture and achieves performance comparable to state-of-the-art methods in both real and Gaussian blind denoising tasks. To be more concrete, the proposed method achieves a Peak Signal-to-Noise Ratio of 39.65 dB and a Structural Similarity Index Measure of 0.913 on the Smartphone Image Denoising Dataset. Therefore, the proposed method demonstrates a balance between efficiency and denoising quality. Full article
Show Figures

Figure 1

24 pages, 11445 KB  
Article
SIMRET: A Similarity-Guided Retinex Approach for Low-Light Enhancement
by Abdülmuttalip Öztürk and Ferzan Katırcıoğlu
Appl. Sci. 2026, 16(7), 3517; https://doi.org/10.3390/app16073517 - 3 Apr 2026
Viewed by 333
Abstract
Standard Retinex-based algorithms typically rely on gradient constraints to decompose an image, assuming that illumination is spatially smooth while reflectance contains sharp details. However, strictly gradient-based priors frequently produce halo artifacts or over-smoothing because they are unable to differentiate between intrinsic structural edges [...] Read more.
Standard Retinex-based algorithms typically rely on gradient constraints to decompose an image, assuming that illumination is spatially smooth while reflectance contains sharp details. However, strictly gradient-based priors frequently produce halo artifacts or over-smoothing because they are unable to differentiate between intrinsic structural edges and high-frequency noise. In this paper, we propose a novel Similarity Image-Guided Retinex (SIMRET) model that fundamentally diverges from traditional derivative-based regularization. We present a color-based pixel-level similarity analysis to build a global guidance matrix rather than merely depending on local gradients. This Similarity Image functions as a reliable weight map during the decomposition process by mathematically encoding the chromatic relationships and spatial coherence between pixels. The model strictly maintains consistency across structural boundaries to avoid halo effects while adaptively enforcing smoothness in homogeneous regions to suppress noise by incorporating this similarity guidance into the optimization objective. We solve the proposed SIMRET model using an alternating optimization framework, where the similarity constraints effectively regularize the ill-posed decomposition problem. Extensive tests on various low-light datasets show that the suggested model successfully overcomes the trade-off between noise reduction and detail preservation, achieving better visual naturalness and signal fidelity than state-of-the-art techniques. Full article
Show Figures

Figure 1

22 pages, 26198 KB  
Article
Virial Extension for Discrete Data Series
by Dino Otero, Ariel Amadio, Leandro Robles Dávila, Marcos Maillot, Cristian Bonini and Walter Legnani
Signals 2026, 7(2), 29; https://doi.org/10.3390/signals7020029 - 1 Apr 2026
Viewed by 568
Abstract
The Virial theorem has been applied with considerable success in various fields of natural sciences. This work proposes an extension of the theorem applied to discrete data series. This application will be called the Virial theorem extension and can be applied to the [...] Read more.
The Virial theorem has been applied with considerable success in various fields of natural sciences. This work proposes an extension of the theorem applied to discrete data series. This application will be called the Virial theorem extension and can be applied to the numerical solution of nonlinear dynamic systems represented by difference equations, such as logistic, discubic and random number generators, the numerical solution of differential equations like the nonlinear double pendulum and a series of pseudorandom numbers and its reciprocals. For this purpose, a coefficient was derived from the discrete Virial formalism. This coefficient can be used to detect when a time series is obtained as the solution of a differential equation, in which case the coefficient is close to 1, and when the data come from other sources, in which case it takes different values. With reference to chaotic dynamic systems, the discrete Virial coefficient shows the feasibility in the detection of a change in behavior, as an alternative to the traditional calculation of Lyapunov exponents, and it is a thousand times faster. The convergence speed of the final value of the discrete Virial coefficient of a dynamic system in a non-chaotic regime is between one and five orders of magnitude greater than in the chaotic regime, thus extending results in non-Hamiltonian systems, previously found by another author in Hamiltonian systems. The results obtained show that the proposal characterizes and distinguishes different types of behavior from the series under study. It also shows great sensitivity to the evolution of the series, even anticipating critical points. The proposed method to construct the discrete Virial extension does not require the existence of a Hamiltonian, which allows its application to a series obtained experimentally or from any differential equation. From a general point of view, this research shows a series of properties that can be reinterpreted in light of the discrete Virial coefficient, providing a novel and versatile tool, given its minimal applicability requirements. For pseudorandom number series, the extension reveals a consistent, quasi-mirror behavior between its kinetic and potential factors, suggesting an underlying structural property. Full article
Show Figures

Figure 1

24 pages, 3028 KB  
Article
A Spectral Entropy-Based Metric for Evaluating Speech Perceptual Quality with Emphasis on Spectral Coherence
by Ali Sarafnia, M. Omair Ahmad and M.N.S. Swamy
Signals 2026, 7(2), 27; https://doi.org/10.3390/signals7020027 - 16 Mar 2026
Viewed by 720
Abstract
Distortion of speech in real-life communication is inevitable, affecting its quality. Conventionally, the effectiveness of a speech system in terms of the perceptual quality of the speech it produces has been assessed using a time-consuming subjective metric, the mean opinion score. There are [...] Read more.
Distortion of speech in real-life communication is inevitable, affecting its quality. Conventionally, the effectiveness of a speech system in terms of the perceptual quality of the speech it produces has been assessed using a time-consuming subjective metric, the mean opinion score. There are a number of objective metrics that can be used instead of the mean opinion score to assess the perceptual quality of the speech signal. The objective of this paper is to propose and validate a new objective metric, the spectral entropy-based metric (SEM), designed to evaluate the perceptual quality of speech and perceptual naturalness by quantifying spectral coherence. While other metrics focus on intelligibility, this study aims to fill a gap in naturalness assessment. The core novelty of this work lies in offering a diagnostic perspective on spectral coherence, an indicator of speech naturalness that is often not explicitly addressed by other metrics. To demonstrate the effectiveness of the proposed metric in evaluating the perceptual quality, we consider fixed-beam and steerable-beam first-order differential microphone arrays. Compared with other objective metrics, it is shown that the proposed SEM is more sensitive to spectral coherence, a predominant indicator of the naturalness of the output speech signal of a speech system. Full article
Show Figures

Figure 1

30 pages, 10668 KB  
Article
MambaLIC: State-Space Models for Efficient Remote Sensing Image Compression
by Haobo Xiong, Kai Liu, Huachao Xiao, Chongyang Ding and Feiyang Wang
Remote Sens. 2026, 18(6), 881; https://doi.org/10.3390/rs18060881 - 12 Mar 2026
Viewed by 604
Abstract
Remote sensing (RS) images, characterized by their large size and rich texture, require algorithms capable of effectively integrating both global and local features for compression. However, existing Learned Image Compression (LIC) approaches face distinct bottlenecks. While Transformer-based architectures typically suffer from heavy computational [...] Read more.
Remote sensing (RS) images, characterized by their large size and rich texture, require algorithms capable of effectively integrating both global and local features for compression. However, existing Learned Image Compression (LIC) approaches face distinct bottlenecks. While Transformer-based architectures typically suffer from heavy computational loads, standard State Space Models (SSMs) often incur prohibitive memory costs when processing high-resolution inputs. To address these limitations, we propose MambaLIC, a novel RS image compression network that integrates the efficient long-range modeling of SSMs with the local modeling ability of CNNs. In this paper, we introduce an innovative Remote Sensing State Space Model (RS-SSM) module, which combines visual SSM with dynamic convolution for remote sensing image compression. This integration facilitates effective interaction between local and global information, thereby enhancing the performance of RS image compression. Furthermore, we propose an SSM attention-based (SSA-based) spatial-channel context model for better entropy modeling. Compared to Transformer-CNN mixed architectures, MambaLIC reduces computational complexity by 63.9% and achieves superior rate-distortion (RD) performance. Consequently, compared to the latest SS2D-based method MambaIC, MambaLIC achieves substantial efficiency gains, saving 78.8% in memory usage. Experimental results demonstrate that MambaLIC achieves state-of-the-art (SOTA) performance, outperforming VVC (VTM-17.0) by 14.22%, 18.48%, and 17.47% in BD-rate on UC-Merced, LoveDA, and xView datasets, respectively. Full article
Show Figures

Figure 1

20 pages, 14718 KB  
Article
Selective Trace Mix: A New Processing Tool to Enhance Seismic Imaging of Complex Subsurface Structures
by Mohamed Rashed, Nassir Al-Amri, Riyadh Halawani, Ali Atef and Hussein Harbi
J. Imaging 2026, 12(3), 124; https://doi.org/10.3390/jimaging12030124 - 12 Mar 2026
Viewed by 424
Abstract
In seismic imaging, the trace mixing process involves merging neighboring traces in seismic data to enhance the signal-to-noise ratio and improve the continuity and spatial coherence of seismic data. In regions with complex subsurface structures, current trace mix filters are often ineffective as [...] Read more.
In seismic imaging, the trace mixing process involves merging neighboring traces in seismic data to enhance the signal-to-noise ratio and improve the continuity and spatial coherence of seismic data. In regions with complex subsurface structures, current trace mix filters are often ineffective as they introduce artifacts that reduce interpretability and obscure the signatures of important structures, such as faults and folds. We introduce the selective trace mix as a novel, data-dependent filter. This filter enhances amplitude consistency, spatial coherence, and the definition of reflections, while it preserves complex structures and maintains their clarity. Selective trace mix uses sequential steps of evaluation, referencing, exclusion, weighting, and normalization of all samples within the filter operator. As a result, selective trace mix is a temporally and spatially variable, data-dependent filter. The filter’s effectiveness is validated using both synthetic and real field seismic data. Synthetic data is a portion of the Marmousi seismic model, while real data include land and marine seismic datasets imaging complex subsurface fault/fold structures. When compared to three of the commonly used conventional filters, the selective trace mix yields far better results in terms of horizon integrity and fault clarity. Full article
Show Figures

Figure 1

22 pages, 34457 KB  
Article
Agentic Vision Framework for Real-Time Manufacturing Contamination Detection Using Patch-Based Lightweight Convolutional Neural Networks
by Yuan Xing, Xuedong Ding and Haowen Pan
Signals 2026, 7(2), 21; https://doi.org/10.3390/signals7020021 - 3 Mar 2026
Viewed by 821
Abstract
Modern manufacturing quality control demands intelligent, adaptive inspection systems capable of real-time contamination detection with minimal computational overhead. We present a five-agent vision framework for material-aware contamination detection in manufacturing environments. The system comprises: a Material Classification Agent that identifies contamination type (fiber, [...] Read more.
Modern manufacturing quality control demands intelligent, adaptive inspection systems capable of real-time contamination detection with minimal computational overhead. We present a five-agent vision framework for material-aware contamination detection in manufacturing environments. The system comprises: a Material Classification Agent that identifies contamination type (fiber, sand, or mixed), three Material-Specific Detection Agents, each employing patch-based CNNs optimized for their respective material with dynamic patch size selection (128 px, 256 px, 384 px), and an Adaptation Agent that monitors performance and eliminates consistently failing patch size configurations. This hierarchical architecture enables intelligent routing to specialized detectors and continuous refinement through performance-driven adaptation. The Material Classification Agent achieves 98% accuracy in contamination type identification. Material-specific agents demonstrate F1-scores of 0.968 (fiber), 0.977 (sand), and 0.977 (mixed) with real-time inference (2.40–11.11 ms per 512 × 512 image). The Adaptation Agent implements selective patch size elimination: configurations failing quality thresholds (F1 < 0.5) across multiple evaluation cycles are removed from the detection pipeline. On the synthetic test split used in this study, comparative evaluation against PatchCore, WinCLIP, and PaDiM shows 3–45× higher F1-scores with superior accuracy–latency trade-offs, validating the efficacy of specialized material-aware architectures for manufacturing contamination detection. Full article
Show Figures

Figure 1

20 pages, 1202 KB  
Article
Adaptive ORB Accelerator on FPGA: High Throughput, Power Consumption, and More Efficient Vision for UAVs
by Hussam Rostum and József Vásárhelyi
Signals 2026, 7(1), 13; https://doi.org/10.3390/signals7010013 - 2 Feb 2026
Viewed by 1399
Abstract
Feature extraction and description are fundamental components of visual perception systems used in applications such as visual odometry, Simultaneous Localization and Mapping (SLAM), and autonomous navigation. In resource-constrained platforms, such as Unmanned Aerial Vehicles (UAVs), achieving real-time hardware acceleration on Field-Programmable Gate Arrays [...] Read more.
Feature extraction and description are fundamental components of visual perception systems used in applications such as visual odometry, Simultaneous Localization and Mapping (SLAM), and autonomous navigation. In resource-constrained platforms, such as Unmanned Aerial Vehicles (UAVs), achieving real-time hardware acceleration on Field-Programmable Gate Arrays (FPGAs) is challenging. This work demonstrates an FPGA-based implementation of an adaptive ORB (Oriented FAST and Rotated BRIEF) feature extraction pipeline designed for high-throughput and energy-efficient embedded vision. The proposed architecture is a completely new design for the main algorithmic blocks of ORB, including the FAST (Features from Accelerated Segment Test) feature detector, Gaussian image filtering, moment computation, and descriptor generation. Adaptive mechanisms are introduced to dynamically adjust thresholds and filtering behavior, improving robustness under varying illumination conditions. The design is developed using a High-Level Synthesis (HLS) approach, where all processing modules are implemented as reusable hardware IP cores and integrated at the system level. The architecture is deployed and evaluated on two FPGA platforms, PYNQ-Z2 and KRIA KR260, and its performance is compared against CPU and GPU implementations using a dedicated C++ testbench based on OpenCV. Experimental results demonstrate significant improvements in throughput and energy efficiency while maintaining stable and scalable performance, making the proposed solution suitable for real-time embedded vision applications on UAVs and similar platforms. Notably, the FPGA implementation increases DSP utilization from 11% to 29% compared to the previous designs implemented by other researchers, effectively offloading computational tasks from general purpose logic (LUTs and FFs), reducing LUT usage by 6% and FF usage by 13%, while maintaining overall design stability, scalability, and acceptable thermal margins at 2.387 W. This work establishes a robust foundation for integrating the optimized ORB pipeline into larger drone systems and opens the door for future system-level enhancements. Full article
Show Figures

Figure 1

36 pages, 11446 KB  
Article
SIFT-SNN for Traffic-Flow Infrastructure Safety: A Real-Time Context-Aware Anomaly Detection Framework
by Munish Rathee, Boris Bačić and Maryam Doborjeh
J. Imaging 2026, 12(2), 64; https://doi.org/10.3390/jimaging12020064 - 31 Jan 2026
Viewed by 716
Abstract
Automated anomaly detection in transportation infrastructure is essential for enhancing safety and reducing the operational costs associated with manual inspection protocols. This study presents an improved neuromorphic vision system, which extends the prior SIFT-SNN (scale-invariant feature transform–spiking neural network) proof-of-concept by incorporating temporal [...] Read more.
Automated anomaly detection in transportation infrastructure is essential for enhancing safety and reducing the operational costs associated with manual inspection protocols. This study presents an improved neuromorphic vision system, which extends the prior SIFT-SNN (scale-invariant feature transform–spiking neural network) proof-of-concept by incorporating temporal feature aggregation for context-aware and sequence-stable detection. Analysis of classical stitching-based pipelines exposed sensitivity to motion and lighting variations, motivating the proposed temporally smoothed neuromorphic design. SIFT keypoints are encoded into latency-based spike trains and classified using a leaky integrate-and-fire (LIF) spiking neural network implemented in PyTorch. Evaluated across three hardware configurations—an NVIDIA RTX 4060 GPU, an Intel i7 CPU, and a simulated Jetson Nano—the system achieved 92.3% accuracy and a macro F1 score of 91.0% under five-fold cross-validation. Inference latencies were measured at 9.5 ms, 26.1 ms, and ~48.3 ms per frame, respectively. Memory footprints were under 290 MB, and power consumption was estimated to be between 5 and 65 W. The classifier distinguishes between safe, partially dislodged, and fully dislodged barrier pins, which are critical failure modes for the Auckland Harbour Bridge’s Movable Concrete Barrier (MCB) system. Temporal smoothing further improves recall for ambiguous cases. By achieving a compact model size (2.9 MB), low-latency inference, and minimal power demands, the proposed framework offers a deployable, interpretable, and energy-efficient alternative to conventional CNN-based inspection tools. Future work will focus on exploring the generalisability and transferability of the work presented, additional input sources, and human–computer interaction paradigms for various deployment infrastructures and advancements. Full article
Show Figures

Figure 1

25 pages, 9860 KB  
Article
Symmetry-Aware SXA-YOLO: Enhancing Tomato Leaf Disease Recognition with Bidirectional Feature Fusion and Task Decoupling
by Guangyue Du, Shuyu Fang, Lianbin Zhang, Wanlu Ren and Biao He
Symmetry 2026, 18(1), 178; https://doi.org/10.3390/sym18010178 - 18 Jan 2026
Viewed by 500
Abstract
Tomatoes are an important economic crop in China, and crop diseases often lead to a decline in their yield. Deep learning-based visual recognition methods have become an approach for disease identification; however, challenges remain due to complex background interference in the field and [...] Read more.
Tomatoes are an important economic crop in China, and crop diseases often lead to a decline in their yield. Deep learning-based visual recognition methods have become an approach for disease identification; however, challenges remain due to complex background interference in the field and the diversity of disease manifestations. To address these issues, this paper proposes the SXA-YOLO (an improvement based on YOLO, where S stands for the SAAPAN architecture, X represents the XIoU loss function, and A denotes the AsDDet module) symmetric perception recognition model. First, a comprehensive symmetry architecture system is established. The backbone network creates a hierarchical feature foundation through C3k2 (Cross-stage Partial Concatenated Bottleneck Convolution with Dual-kernel Design) and SPPF (the Fast Pyramid Pooling module) modules; the neck employs a SAAPAN (Symmetry-Aware Adaptive Path Aggregation Architecture) bidirectional feature pyramid architecture, utilizing multiple modules to achieve equal fusion of multi-scale features; and the detection head is based on the AsDDet (Adaptive Symmetry-aware Decoupled Detection Head) module for functional decoupling, combining dynamic label assignment and the XIoU (Extended Intersection over Union) loss function to collaboratively optimize classification, regression, and confidence prediction. Ultimately, a complete recognition framework is formed through triple symmetric optimization of “feature hierarchy, fusion path, and task functionality.” Experimental results indicate that this method effectively enhances the model’s recognition performance, achieving a P (Precision) value of 0.992 and an mAP50 (mean Average Precision at 50% IoU threshold) of 0.993. Furthermore, for ten categories of diseases, the SXA-YOLO symmetric perception recognition model outperforms other comparative models in both p value and mAP50. The improved algorithm enhances the recognition of foliar diseases in tomatoes, achieving a high level of accuracy. Full article
Show Figures

Figure 1

24 pages, 12718 KB  
Article
Proposed Methodology for Correcting Fourier-Transform Infrared Spectroscopy Field-of-View Scene-Change Artifacts
by Kody A. Wilson, Michael L. Dexter, Benjamin F. Akers and Anthony L. Franz
Remote Sens. 2026, 18(2), 317; https://doi.org/10.3390/rs18020317 - 17 Jan 2026
Cited by 1 | Viewed by 570
Abstract
Fourier-transform spectrometers are widely used for spectral measurements. Changes in the field of view during measurement introduce oscillations into the measured spectra known as scene-change artifacts. Field-of-view changes also introduce uncertainty about which target the measured spectrum represents. Though scene-change artifacts are often [...] Read more.
Fourier-transform spectrometers are widely used for spectral measurements. Changes in the field of view during measurement introduce oscillations into the measured spectra known as scene-change artifacts. Field-of-view changes also introduce uncertainty about which target the measured spectrum represents. Though scene-change artifacts are often present in dynamic data, their significance is disputed in the current literature. This work presents a theoretical framework and experimental validation for scene-change artifacts. Field-of-view changes introduce variable interferogram offsets, which standard processing techniques assume are constant. The error between the interferogram offset and its estimate is Fourier-transformed, yielding scene-change artifacts, often confused with noise, in the calibrated spectrum. Previous theoretical models ignored the effect of the interferogram offset in generating SCAs, leading to an underestimation of the scene-change artifact significance. Smooth offset correction removes these artifacts by estimating the variable interferogram offset using locally weighted scatter-plot smoothing. Updating the interferogram offset estimate resulted in the same accuracy expected for static conditions. The resulting spectra resemble the zero path difference spectra, similar to earlier theoretical predictions. These results indicate that Fourier-transform spectroscopy accuracy with variable scenes can be significantly improved with minor modifications to data processing. Full article
Show Figures

Graphical abstract

19 pages, 1885 KB  
Article
A Hierarchical Multi-Resolution Self-Supervised Framework for High-Fidelity 3D Face Reconstruction Using Learnable Gabor-Aware Texture Modeling
by Pichet Mareo and Rerkchai Fooprateepsiri
J. Imaging 2026, 12(1), 26; https://doi.org/10.3390/jimaging12010026 - 5 Jan 2026
Viewed by 907
Abstract
High-fidelity 3D face reconstruction from a single image is challenging, owing to the inherently ambiguous depth cues and the strong entanglement of multi-scale facial textures. In this regard, we propose a hierarchical multi-resolution self-supervised framework (HMR-Framework), which reconstructs coarse-, medium-, and fine-scale facial [...] Read more.
High-fidelity 3D face reconstruction from a single image is challenging, owing to the inherently ambiguous depth cues and the strong entanglement of multi-scale facial textures. In this regard, we propose a hierarchical multi-resolution self-supervised framework (HMR-Framework), which reconstructs coarse-, medium-, and fine-scale facial geometry progressively through a unified pipeline. A coarse geometric prior is first estimated via 3D morphable model regression, followed by medium-scale refinement using a vertex deformation map constrained by a global–local Markov random field loss to preserve structural coherence. In order to improve fine-scale fidelity, a learnable Gabor-aware texture enhancement module has been proposed to decouple spatial–frequency information and thus improve sensitivity for high-frequency facial attributes. Additionally, we employ a wavelet-based detail perception loss to preserve the edge-aware texture features while mitigating noise commonly observed in in-the-wild images. Extensive qualitative and quantitative evaluation of benchmark datasets indicate that the proposed framework provides better fine-detail reconstruction than existing state-of-the-art methods, while maintaining robustness over pose variations. Notably, the hierarchical design increases semantic consistency across multiple geometric scales, providing a functional solution for high-fidelity 3D face reconstruction from monocular images. Full article
Show Figures

Figure 1

30 pages, 482 KB  
Article
Chromatic Asymmetry in Visual Attention: Dissociable Effects of Background Color on Capture and Processing During Reading—An Eye-Tracking Study
by Ana Teixeira, Pedro Martins, Sónia Brito-Costa and Maryam Abbasi
Symmetry 2026, 18(1), 76; https://doi.org/10.3390/sym18010076 - 2 Jan 2026
Viewed by 791
Abstract
Visual attention mechanisms are modulated by chromatic properties of the environment, with significant implications for human–computer interaction, interface design, and cognitive ergonomics. Despite extensive research on color perception, a critical gap remains in understanding how background colors differentially affect initial attentional capture versus [...] Read more.
Visual attention mechanisms are modulated by chromatic properties of the environment, with significant implications for human–computer interaction, interface design, and cognitive ergonomics. Despite extensive research on color perception, a critical gap remains in understanding how background colors differentially affect initial attentional capture versus sustained processing efficiency during text reading. This study investigates how seven different background colors (yellow, orange, red, green, blue, purple, and black) influence visual attention and cognitive load during standardized reading tasks with white text, revealing a fundamental asymmetry in chromatic processing stages. Using high-frequency eye-tracking at 120 Hz with 30 participants in a within-subjects design, we measured time-to-first fixation, total viewing duration, fixation count, and revisitation frequency across chromatic conditions. Non-parametric statistical analyses (Friedman test for omnibus comparisons, Wilcoxon signed-rank test for pairwise comparisons) revealed a systematic dissociation between preattentive capture and sustained processing. Yellow backgrounds enabled the fastest initial attentional capture (0.65 s), while black backgrounds produced the slowest detection (1.75 s). However, this pattern reversed during sustained processing: black backgrounds enabled the shortest total viewing time (0.88 s) through efficient information sampling (median 5.0 fixations), while yellow required the longest viewing duration (1.75 s) with fewer fixations (median 3.0). Statistical comparisons confirmed significant differences across conditions (Friedman test: χ2(6)=138.4154.2, all p<0.001; pairwise comparisons with Bonferroni correction: α=0.0024). We note that luminance and chromatic contrast were not independently controlled, as colors inherently vary in both dimensions in realistic interface design. Consequently, the observed effects reflect the combined influence of hue, saturation, and luminance contrast as they naturally co-occur. These findings reveal a descriptive pattern consistent with functionally distinct mechanisms, where chromatic salience appears to facilitate preattentive capture while luminance contrast appears to determine sustained processing efficiency, with optimal colors for one stage being suboptimal for the other under the present experimental conditions. This observed chromatic asymmetry suggests potential implications for interface design: warm colors like yellow may optimize rapid attention capture for alerts and warnings, while high-contrast combinations like white-on-black may optimize sustained reading efficiency, though these preliminary patterns require validation across diverse contexts. Green and purple backgrounds offer balanced performance across both processing stages, representing near-symmetric solutions suitable for mixed-task interfaces. Given the controlled laboratory setting, university student sample, and 15 s exposure duration, design recommendations should be considered preliminary and validated in diverse real-world contexts. Full article
Show Figures

Figure 1

29 pages, 11637 KB  
Article
Scene Heatmap-Guided Adaptive Tiling and Dual-Model Collaboration-Based Object Detection in Ultra-Wide-Area Remote Sensing Images
by Fuwen Hu, Yeda Li, Jiayu Zhao and Chunping Min
Symmetry 2025, 17(12), 2158; https://doi.org/10.3390/sym17122158 - 15 Dec 2025
Cited by 1 | Viewed by 877
Abstract
This work addresses computational inefficiency in ultra-wide-area remote sensing image (RSI) object detection. Traditional homogeneous tiling strategies enforce computational symmetry by processing all image regions uniformly, ignoring the intrinsic spatial asymmetry of target distribution where target-dense coexist with vast target-sparse areas (e.g., deserts, [...] Read more.
This work addresses computational inefficiency in ultra-wide-area remote sensing image (RSI) object detection. Traditional homogeneous tiling strategies enforce computational symmetry by processing all image regions uniformly, ignoring the intrinsic spatial asymmetry of target distribution where target-dense coexist with vast target-sparse areas (e.g., deserts, farmlands), thereby wasting computational resources. To overcome symmetry mismatch, we propose a heat-guided adaptive blocking and dual-model collaboration (HAB-DMC) framework. First, a lightweight EfficientNetV2 classifies initial 1024 × 1024 tiles into semantic scenes (e.g., airports, forests). A target-scene relevance metric converts scene probabilities into a heatmap, identifying high-attention regions (HARs, e.g., airports) and low-attention regions (LARs, e.g., forests). HARs undergo fine-grained tiling (640 × 640 with 20% overlap) to preserve small targets, while LARs use coarse tiling (1024 × 1024) to minimize processing. Crucially, a dual-model strategy deploys: (1) a high-precision LSK-RTDETR-base detector (with Large Selective Kernel backbone) for HARs to capture multi-scale features, and (2) a streamlined LSK-RTDETR-lite detector for LARs to accelerate inference. Experiments show 23.9% faster inference on 30k-pixel images and reduction in invalid computations by 72.8% (from 50% to 13.6%) versus traditional methods, while maintaining competitive mAP (74.2%). The key innovation lies in repurposing heatmaps from localization tools to dynamic computation schedulers, enabling system-level efficiency for Ultra-Wide-Area RSIs. Full article
Show Figures

Figure 1

18 pages, 2411 KB  
Article
AVD-YOLO: Active Vision-Driven Multi-Scale Feature Extraction for Enhanced Road Anomaly Detection
by Minhong Jin, Zhongjie Zhu, Renwei Tu, Ang Lv and Zhijing Yu
Information 2025, 16(12), 1064; https://doi.org/10.3390/info16121064 - 3 Dec 2025
Cited by 1 | Viewed by 747
Abstract
Deficiencies in road anomaly detection systems precipitate multifaceted risks, including elevated collision probabilities from unidentified hazards, compromised traffic flow efficiency, and exponential maintenance costs. Contemporary methods struggle with complex road environments, dynamic viewing perspectives, and limited datasets. We present AVD-YOLO, an enhanced YOLO [...] Read more.
Deficiencies in road anomaly detection systems precipitate multifaceted risks, including elevated collision probabilities from unidentified hazards, compromised traffic flow efficiency, and exponential maintenance costs. Contemporary methods struggle with complex road environments, dynamic viewing perspectives, and limited datasets. We present AVD-YOLO, an enhanced YOLO variant that synergistically integrates Active Vision-Driven (AVD) multi-scale feature extraction with Position Modulated Attention (PMA) mechanisms. PMA addresses diminished target-background discriminability under variable illumination and weather conditions by capturing long range spatial dependencies, enhancing weak-feature target detection. The AVD technique mitigates missed detections caused by real-time viewing distance variations through adaptive multi-receptive field mechanisms, maintaining conceptual target fixation while dynamically adjusting feature scales. To address data scarcity, a comprehensive Multi-Class Road Anomaly Dataset (MCRAD) comprising 14,208 annotated images across nine anomaly categories is constructed. Experiments demonstrate that AVD-YOLO improves detection accuracy, achieving a 1.6% gain in mAP@0.5 and a 2.9% improvement in F1-score over baseline. These performance gains indicate both more precise localization of abnormal objects and a better balance between precision and recall, thereby enhancing the overall robustness of the detection model. Full article
Show Figures

Figure 1

13 pages, 5816 KB  
Technical Note
Discretization of Digital Controllers Comprising Second-Order Notch Filters
by Alon Kuperman
Signals 2025, 6(4), 69; https://doi.org/10.3390/signals6040069 - 1 Dec 2025
Viewed by 708
Abstract
Second-order notch filters (NFs) with constant coefficients are often used as part of feedback controllers in grid-connected power conversion systems to prevent unwanted harmonic content polluting the closed control loops. In practice, the value of the mains frequency resides within a certain known [...] Read more.
Second-order notch filters (NFs) with constant coefficients are often used as part of feedback controllers in grid-connected power conversion systems to prevent unwanted harmonic content polluting the closed control loops. In practice, the value of the mains frequency resides within a certain known range rather than remaining constant. Hence, the correct selection of NF coefficients is crucial for ensuring that the desired performance is maintained within the whole expected mains frequency range. Bilinear transformation (BLT) with notch frequency prewarping is often adopted to convert an NF from a continuous to a digital form. While accurately preserving the notch frequency location, the method reduces the filter bandwidth. As a remedy, BLT with both notch frequency and damping ratio prewarping may be employed. Nevertheless, some inaccuracy remains under low sampling-to-notch frequency ratios. This technical note demonstrates that the issue may be solved by prewarping the boundary values of the expected harmonic frequency range rather than the notch frequency and/or damping factor before applying the BLT. Simulation results accurately support the presented issue and proposed solution. Full article
Show Figures

Figure 1

24 pages, 43264 KB  
Article
Document Image Verification Based on Paragraph Alignment and Subtle Change Detection
by Daoquan Li, Weifei Jia, Quanlin Yu and Zhaoxu Hu
Appl. Sci. 2025, 15(23), 12430; https://doi.org/10.3390/app152312430 - 23 Nov 2025
Viewed by 993
Abstract
The digitization of paper documents enables rapid sharing and long-term preservation of information, making it a widely adopted approach for efficient document storage and management across various domains. However, the recent advances in image editing software pose an increasing threat to the integrity [...] Read more.
The digitization of paper documents enables rapid sharing and long-term preservation of information, making it a widely adopted approach for efficient document storage and management across various domains. However, the recent advances in image editing software pose an increasing threat to the integrity of document images. Comparing the input with the corresponding reference document image is a direct and effective approach to verification. Nevertheless, this task is challenging due to two key factors, namely, the need for efficient retrieval of the reference document images and the difficulty of detecting subtle content changes under the print–scan (PS) distortions. To address these challenges, this work proposes a document image verification scheme based on paragraph alignment and subtle change detection. It first extracts paragraph structural features from both input and reference document images to achieve efficient image retrieval and accurate paragraph alignment. Based on the alignment results, the proposed scheme employs contrastive learning to reduce the effect of PS distortions in extracting features from the input and reference document images. Finally, an additional verification step is introduced that significantly reduces the false positive detection by addressing the feature misalignment within the extracted paragraphs. To evaluate the proposed scheme, extensive experiments were conducted on databases constructed from public datasets, and various benchmark methods were compared. Experimental results show that the proposed scheme outperforms benchmark methods, achieving an accuracy score of 0.963. Full article
Show Figures

Figure 1

24 pages, 248126 KB  
Article
Image Matching for UAV Geolocation: Classical and Deep Learning Approaches
by Fatih Baykal, Mehmet İrfan Gedik, Constantino Carlos Reyes-Aldasoro and Cefa Karabağ
J. Imaging 2025, 11(11), 409; https://doi.org/10.3390/jimaging11110409 - 12 Nov 2025
Cited by 2 | Viewed by 2162
Abstract
Today, unmanned aerial vehicles (UAVs) are heavily dependent on Global Navigation Satellite Systems (GNSSs) for positioning and navigation. However, GNSS signals are vulnerable to jamming and spoofing attacks. This poses serious security risks, especially for military operations and critical civilian missions. In order [...] Read more.
Today, unmanned aerial vehicles (UAVs) are heavily dependent on Global Navigation Satellite Systems (GNSSs) for positioning and navigation. However, GNSS signals are vulnerable to jamming and spoofing attacks. This poses serious security risks, especially for military operations and critical civilian missions. In order to solve this problem, an image-based geolocation system has been developed that eliminates GNSS dependency. The proposed system estimates the geographical location of the UAV by matching the aerial images taken by the UAV with previously georeferenced high-resolution satellite images. For this purpose, common visual features were determined between satellite and UAV images and matching operations were carried out using methods based on the homography matrix. Thanks to image processing, a significant relationship has been established between the area where the UAV is located and the geographical coordinates, and reliable positioning is ensured even in cases where GNSS signals cannot be used. Within the scope of the study, traditional methods such as SIFT, AKAZE, and Multiple Template Matching were compared with learning-based methods including SuperPoint, SuperGlue, and LoFTR. The results showed that deep learning-based approaches can make successful matches, especially at high altitudes. Full article
Show Figures

Figure 1

20 pages, 11332 KB  
Article
A Fast Nonlinear Sparse Model for Blind Image Deblurring
by Zirui Zhang, Zheng Guo, Zhenhua Xu, Huasong Chen, Chunyong Wang, Yang Song, Jiancheng Lai, Yunjing Ji and Zhenhua Li
J. Imaging 2025, 11(10), 327; https://doi.org/10.3390/jimaging11100327 - 23 Sep 2025
Viewed by 900
Abstract
Blind image deblurring, which requires simultaneous estimation of the latent image and blur kernel, constitutes a classic ill-posed problem. To address this, priors based on L2, L1, and Lp regularizations have been widely adopted. Based on this foundation [...] Read more.
Blind image deblurring, which requires simultaneous estimation of the latent image and blur kernel, constitutes a classic ill-posed problem. To address this, priors based on L2, L1, and Lp regularizations have been widely adopted. Based on this foundation and combining successful experiences of previous work, this paper introduces LN regularization, a novel nonlinear sparse regularization combining the Lp and L norms via nonlinear coupling. Statistical probability analysis demonstrates that LN regularization achieves stronger sparsity than traditional regularizations like L2, L1, and Lp regularizations. Furthermore, building upon the LN regularization, we propose a novel nonlinear sparse model for blind image deblurring. To optimize the proposed LN regularization, we introduce an Adaptive Generalized Soft-Thresholding (AGST) algorithm and further develop an efficient optimization strategy by integrating AGST with the Half-Quadratic Splitting (HQS) strategy. Extensive experiments conducted on synthetic datasets and real-world images demonstrate that the proposed nonlinear sparse model achieves superior deblurring performance while maintaining completive computational efficiency. Full article
Show Figures

Figure 1

17 pages, 8282 KB  
Article
Research on ADPLL for High-Precision Phase Measurement
by Weilai Yao, Chenying Sun, Xindong Liang and Jianjun Jia
Symmetry 2025, 17(9), 1487; https://doi.org/10.3390/sym17091487 - 8 Sep 2025
Cited by 1 | Viewed by 1310
Abstract
The inter-satellite laser interferometer, which functions as a high-performance displacement sensor, will be used in forthcoming space-based gravitational wave detection missions. The readout of these interferometers is typically performed by phasemeters based on all-digital phase-locked loops (ADPLLs) implemented in FPGAs. This paper proposes [...] Read more.
The inter-satellite laser interferometer, which functions as a high-performance displacement sensor, will be used in forthcoming space-based gravitational wave detection missions. The readout of these interferometers is typically performed by phasemeters based on all-digital phase-locked loops (ADPLLs) implemented in FPGAs. This paper proposes a feasible loop parameter design workflow and a comprehensive noise model, providing guidelines for designing and optimizing an ADPLL to meet specified bandwidth and precision requirements. The validity of our analysis is demonstrated through numerical performance measurements based on the modified digital splitting test. Full article
Show Figures

Figure 1

19 pages, 12806 KB  
Article
A Vision Method for Detecting Citrus Separation Lines Using Line-Structured Light
by Qingcang Yu, Song Xue and Yang Zheng
J. Imaging 2025, 11(8), 265; https://doi.org/10.3390/jimaging11080265 - 8 Aug 2025
Viewed by 898
Abstract
The detection of citrus separation lines is a crucial step in the citrus processing industry. Inspired by the achievements of line-structured light technology in surface defect detection, this paper proposes a method for detecting citrus separation lines based on line-structured light. Firstly, a [...] Read more.
The detection of citrus separation lines is a crucial step in the citrus processing industry. Inspired by the achievements of line-structured light technology in surface defect detection, this paper proposes a method for detecting citrus separation lines based on line-structured light. Firstly, a gamma-corrected Otsu method is employed to extract the laser stripe region from the image. Secondly, an improved skeleton extraction algorithm is employed to mitigate the bifurcation errors inherent in original skeleton extraction algorithms while simultaneously acquiring 3D point cloud data of the citrus surface. Finally, the least squares progressive iterative approximation algorithm is applied to approximate the ideal surface curve; subsequently, principal component analysis is used to derive the normals of this ideally fitted curve. The deviation between each point (along its corresponding normal direction) and the actual geometric characteristic curve is then adopted as a quantitative index for separation lines positioning. The average similarity between the extracted separation lines and the manually defined standard separation lines reaches 92.5%. In total, 95% of the points on the separation lines obtained by this method have an error of less than 4 pixels. Experimental results demonstrate that through quantitative deviation analysis of geometric features, automatic detection and positioning of the separation lines are achieved, satisfying the requirements of high precision and non-destructiveness for automatic citrus splitting. Full article
Show Figures

Figure 1

Back to TopTop