Saved Queries

Schlieren imaging is a powerful optical sensing technique that captures flow-induced refractive index gradients, offering valuable visual data for analyzing complex fluid dynamics. However, the large volume and structural complexity of the data generated by this sensor pose significant challenges for extracting key physical insights and performing efficient reconstruction and temporal prediction. In this study, we propose a Spatial-Frequency-Scale variational autoencoder (SFS-VAE), a deep learning framework designed for the unsupervised feature decomposition of Schlieren sensor data. To address the limitations of traditional

β

-variational autoencoder (

β

-VAE) in capturing complex flow regions, the Progressive Frequency-enhanced Spatial Multi-scale Module (PFSM) is designed, which enhances the structures of different frequency bands through Fourier transform and multi-scale convolution; the Feature-Spatial Enhancement Module (FSEM) employs a gradient-driven spatial attention mechanism to extract key regional features. Experiments on flat plate film-cooled jet schlieren data show that SFS-VAE can effectively preserve the information of the mainstream region and more accurately capture the high-gradient features of the jet region, reducing the Root Mean Square Error (RMSE) by approximately 16.9% and increasing the Peak Signal-to-Noise Ratio (PSNR) by approximately 1.6 dB. Furthermore, when integrated with a Transformer for temporal prediction, the model exhibits significantly improved stability and accuracy in forecasting flow field evolution. Overall, the model’s physical interpretability and generalization ability make it a powerful new tool for advanced flow diagnostics through the robust analysis of Schlieren sensor data. Full article

(This article belongs to the Section Optical Sensors)

►▼ Show Figures

Figure 1

23 pages, 3467 KB

Open AccessArticle

Adaptive Neuro-Fuzzy Inference System Framework for Paediatric Wrist Injury Classification

by Olamilekan Shobayo, Reza Saatchi and Shammi Ramlakhan

Multimodal Technol. Interact. 2025, 9(10), 104; https://doi.org/10.3390/mti9100104 - 8 Oct 2025

Viewed by 36

Abstract

An Adaptive Neuro-Fuzzy Inference System (ANFIS) framework for paediatric wrist injury classification (fracture versus sprain) was developed utilising infrared thermography (IRT). ANFIS combines artificial neural network (ANN) learning with interpretable fuzzy rules, mitigating the “black-box” limitation of conventional ANNs through explicit membership functions and Takagi–Sugeno rule consequents. Forty children (19 fractures, 21 sprains, confirmed by X-ray radiograph) provided thermal image sequences from which three statistically discriminative temperature distribution features namely standard deviation, inter-quartile range (IQR) and kurtosis were selected. A five-layer Sugeno ANFIS with Gaussian membership functions were trained using a hybrid least-squares/gradient descent optimisation and evaluated under three premise-parameter initialisation strategies: random seeding, K-means clustering, and fuzzy C-means (FCM) data partitioning. Five-fold cross-validation guided the selection of membership functions standard deviation (σ) and rule count, yielding an optimal nine-rule model. Comparative experiments show K-means initialisation achieved the best balance between convergence speed and generalisation versus slower but highly precise random initialisation and rapidly convergent yet unstable FCM. The proposed K-means–driven ANFIS offered data-efficient decision support, highlighting the potential of thermal feature fusion with neuro-fuzzy modelling to reduce unnecessary radiographs in emergency bone fracture triage. Full article

►▼ Show Figures

Figure 1

34 pages, 13615 KB

Open AccessArticle

Seamless Reconstruction of MODIS Land Surface Temperature via Multi-Source Data Fusion and Multi-Stage Optimization

by Yanjie Tang, Yanling Zhao, Yueming Sun, Shenshen Ren and Zhibin Li

Remote Sens. 2025, 17(19), 3374; https://doi.org/10.3390/rs17193374 - 7 Oct 2025

Viewed by 196

Abstract

Land Surface Temperature (LST) is a critical variable for understanding land–atmosphere interactions and is widely applied in urban heat monitoring, evapotranspiration estimation, near-surface air temperature modeling, soil moisture assessment, and climate studies. MODIS LST products, with their global coverage, long-term consistency, and radiometric calibration, are a major source of LST data. However, frequent data gaps caused by cloud contamination and atmospheric interference severely limit their applicability in analyses requiring high spatiotemporal continuity. This study presents a seamless MODIS LST reconstruction framework that integrates multi-source data fusion and a multi-stage optimization strategy. The method consists of three key components: (1) topography- and land cover-constrained spatial interpolation, which preliminarily fills orbit-induced gaps using elevation and land cover similarity criteria; (2) pixel-level LST reconstruction via random forest (RF) modeling with multi-source predictors (e.g., NDVI, NDWI, surface reflectance, DEM, land cover), coupled with HANTS-based temporal smoothing to enhance temporal consistency and seasonal fidelity; and (3) Poisson-based image fusion, which ensures spatial continuity and smooth transitions without compromising temperature gradients. Experiments conducted over two representative regions—Huainan and Jining—demonstrate the superior performance of the proposed method under both daytime and nighttime scenarios. The integrated approach (Step 3) achieves high accuracy, with correlation coefficients (CCs) exceeding 0.95 and root mean square errors (RMSEs) below 2K, outperforming conventional HANTS and standalone interpolation methods. Cross-validation with high-resolution Landsat LST further confirms the method’s ability to retain spatial detail and cross-scale consistency. Overall, this study offers a robust and generalizable solution for reconstructing MODIS LST with high spatial and temporal fidelity. The framework holds strong potential for broad applications in land surface process modeling, regional climate studies, and urban thermal environment analysis. Full article

►▼ Show Figures

Figure 1

12 pages, 9239 KB

Open AccessArticle

Effects of Motion in Ultrashort Echo Time Quantitative Susceptibility Mapping for Musculoskeletal Imaging

by Sam Sedaghat, Jinil Park, Eddie Fu, Fang Liu, Youngkyoo Jung and Hyungseok Jang

J. Imaging 2025, 11(10), 347; https://doi.org/10.3390/jimaging11100347 - 6 Oct 2025

Viewed by 186

Abstract

Quantitative susceptibility mapping (QSM) is a powerful magnetic resonance imaging (MRI) technique for assessing tissue composition in the human body. For imaging short-T2 tissues in the musculoskeletal (MSK) system, ultrashort echo time (UTE) imaging plays a key role. However, UTE-based QSM (UTE-QSM) often involves repeated acquisitions, making it vulnerable to inter-scan motion. In this study, we investigate the effects of motion on UTE-QSM and introduce strategies to reduce motion-induced artifacts. Eight healthy male volunteers underwent UTE-QSM imaging of the knee joint, while an additional seven participated in imaging of the ankle joint. UTE-QSM was conducted using multiple echo acquisitions, including both UTE and gradient-recalled echoes, and processed using the iterative decomposition of water and fat with echo asymmetry and least-squares estimation (IDEAL) and morphology-enabled dipole inversion (MEDI) algorithms. To assess the impact of motion, datasets were reconstructed both with and without motion correction. Furthermore, we evaluated a two-step UTE-QSM approach that incorporates tissue boundary information. This method applies edge detection, excludes pixels near detected edges, and performs a two-step QSM reconstruction to reduce motion-induced streaking artifacts. In participants exhibiting substantial inter-scan motion, prominent streaking artifacts were evident. Applying motion registration markedly reduced these artifacts in both knee and ankle UTE-QSM. Additionally, the two-step UTE-QSM approach, which integrates tissue boundary information, further enhanced image quality by mitigating residual streaking artifacts. These results indicate that motion-induced errors near tissue boundaries play a key role in generating streaking artifacts in UTE-QSM. Inter-scan motion poses a fundamental challenge in UTE-QSM due to the need for multiple acquisitions. However, applying motion registration along with a two-step QSM approach that excludes tissue boundaries can effectively suppress motion-induced streaking artifacts, thereby improving the accuracy of musculoskeletal tissue characterization. Full article

(This article belongs to the Section Medical Imaging)

►▼ Show Figures

Figure 1

14 pages, 11400 KB

Open AccessArticle

Classification of Blackcurrant Genotypes by Ploidy Levels on Stomata Microscopic Images with Deep Learning: Convolutional Neural Networks and Vision Transformers

by Aleksandra Konopka, Ryszard Kozera, Agnieszka Marasek-Ciołakowska and Aleksandra Machlańska

Appl. Sci. 2025, 15(19), 10735; https://doi.org/10.3390/app151910735 - 5 Oct 2025

Viewed by 216

Abstract

Plants vary in number of chromosomes (ploidy levels), which can influence morphological traits, including the size and density of stomata cells. Although biologists can detect these differences under a microscope, the process is often time-consuming and tedious. This study aims to automate the classification of blackcurrant (Ribes nigrum L.) ploidy levels—diploid, triploid, and tetraploid—by leveraging deep learning techniques. Convolutional Neural Networks and Vision Transformers are employed to perform microscopic image classification across two distinct blackcurrant datasets. Initial experiments demonstrate that these models can effectively classify ploidy levels when trained and tested on subsets derived from the same dataset. However, the primary challenge lies in proposing a model capable of yielding satisfactory classification results across different datasets ensuring robustness and generalization, which is a critical step toward developing a universal ploidy classification system. In this research, a variety of experiments is performed including application of augmentation technique. Model efficacy is evaluated with standard metrics and its interpretability is ensured through Gradient-weighted Class Activation Mapping visualizations. Finally, future research directions are outlined with application of other advanced state-of-the-art machine learning methods to further refine ploidy level prediction in botanical studies. Full article

(This article belongs to the Special Issue Advances in Machine Learning and Data Mining: Emerging Trends and Applications)

►▼ Show Figures

Figure 1

10 pages, 1114 KB

Open AccessArticle

Toward Supportive Decision-Making for Ureteral Stent Removal: Development of a Morphology-Based X-Ray Analysis

by So Hyeon Lee, Young Jae Kim, Tae Young Park and Kwang Gi Kim

Bioengineering 2025, 12(10), 1084; https://doi.org/10.3390/bioengineering12101084 - 5 Oct 2025

Viewed by 180

Abstract

Purpose: Timely removal of ureteral stents is critical to prevent complications such as infection, discomfort and stent encrustation or fragmentation, as well as stone formation associated with neglected stents. Current decisions, however, rely heavily on subjective interpretation of postoperative imaging. This study introduces a semi-automated image-processing algorithm that quantitatively evaluates stent morphology, aiming to support objective and reproducible decision-making in minimally invasive urological care. Methods: Two computational approaches were developed to analyze morphological changes in ureteral stents following surgery. The first method employed a vector-based analysis, using the FitLine function to derive unit vectors for each stent segment and calculating inter-vector angles. The second method applied a slope-based analysis, computing gradients between coordinate points to evaluate global straightening of the ureter over time. Results: The vector-angle method did not demonstrate significant temporal changes (p = 0.844). In contrast, the slope-based method identified significant ureteral straightening (p < 0.05), consistent with clinical observations. These results confirm that slope-based quantitative analysis provides reliable insight into postoperative morphological changes. Conclusions: This study presents an algorithm-based and reproducible imaging analysis method that enhances objectivity in postoperative assessment of ureteral stents. By aligning quantitative image processing with clinical decision support, the approach contributes to precision medicine and addresses the absence of standardized criteria for stent removal. Full article

(This article belongs to the Special Issue Artificial Intelligence-Based Medical Imaging Processing)

►▼ Show Figures

Figure 1

15 pages, 2364 KB

Open AccessArticle

Optimized Lung Nodule Classification Using CLAHE-Enhanced CT Imaging and Swin Transformer-Based Deep Feature Extraction

by Dorsaf Hrizi, Khaoula Tbarki and Sadok Elasmi

J. Imaging 2025, 11(10), 346; https://doi.org/10.3390/jimaging11100346 - 4 Oct 2025

Viewed by 121

Abstract

Lung cancer remains one of the most lethal cancers globally. Its early detection is vital to improving survival rates. In this work, we propose a hybrid computer-aided diagnosis (CAD) pipeline for lung cancer classification using Computed Tomography (CT) scan images. The proposed CAD pipeline integrates ten image preprocessing techniques and ten pretrained deep learning models for feature extraction including convolutional neural networks and transformer-based architectures, and four classical machine learning classifiers. Unlike traditional end-to-end deep learning systems, our approach decouples feature extraction from classification, enhancing interpretability and reducing the risk of overfitting. A total of 400 model configurations were evaluated to identify the optimal combination. The proposed approach was evaluated on the publicly available Lung Image Database Consortium and Image Database Resource Initiative dataset, which comprises 1018 thoracic CT scans annotated by four thoracic radiologists. For the classification task, the dataset included a total of 6568 images labeled as malignant and 4849 images labeled as benign. Experimental results show that the best performing pipeline, combining Contrast Limited Adaptive Histogram Equalization, Swin Transformer feature extraction, and eXtreme Gradient Boosting, achieved an accuracy of 95.8%. Full article

(This article belongs to the Special Issue Advancements in Imaging Techniques for Detection of Cancer)

►▼ Show Figures

Figure 1

25 pages, 2714 KB

Open AccessArticle

Evaluating Municipal Solid Waste Incineration Through Determining Flame Combustion to Improve Combustion Processes for Environmental Sanitation

by Jian Tang, Xiaoxian Yang, Wei Wang and Jian Rong

Sustainability 2025, 17(19), 8872; https://doi.org/10.3390/su17198872 - 4 Oct 2025

Viewed by 178

Abstract

Municipal solid waste (MSW) refers to solid and semi-solid waste generated during human production and daily activities. The process of incinerating such waste, known as municipal solid waste incineration (MSWI), serves as a critical method for reducing waste volume and recovering resources. Automatic online recognition of flame combustion status during MSWI is a key technical approach to ensuring system stability, addressing issues such as high pollution emissions, severe equipment wear, and low operational efficiency. However, when manually selecting optimized features and hyperparameters based on empirical experience, the MSWI flame combustion state recognition model suffers from high time consumption, strong dependency on expertise, and difficulty in adaptively obtaining optimal solutions. To address these challenges, this article proposes a method for constructing a flame combustion state recognition model optimized based on reinforcement learning (RL), long short-term memory (LSTM), and parallel differential evolution (PDE) algorithms, achieving collaborative optimization of deep features and model hyperparameters. First, the feature selection and hyperparameter optimization problem of the ViT-IDFC combustion state recognition model is transformed into an encoding design and optimization problem for the PDE algorithm. Then, the mutation and selection factors of the PDE algorithm are used as modeling inputs for LSTM, which predicts the optimal hyperparameters based on PDE outputs. Next, during the PDE-based optimization of the ViT-IDFC model, a policy gradient reinforcement learning method is applied to determine the parameters of the LSTM model. Finally, the optimized combustion state recognition model is obtained by identifying the feature selection parameters and hyperparameters of the ViT-IDFC model. Test results based on an industrial image dataset demonstrate that the proposed optimization algorithm improves the recognition performance of both left and right grate recognition models, with the left grate achieving a 0.51% increase in recognition accuracy and the right grate a 0.74% increase. Full article

(This article belongs to the Section Waste and Recycling)

►▼ Show Figures

Figure 1

19 pages, 4966 KB

Open AccessArticle

A Study on Geometrical Consistency of Surfaces Using Partition-Based PCA and Wavelet Transform in Classification

by Vignesh Devaraj, Thangavel Palanisamy and Kanagasabapathi Somasundaram

AppliedMath 2025, 5(4), 134; https://doi.org/10.3390/appliedmath5040134 - 3 Oct 2025

Viewed by 172

Abstract

The proposed study explores the consistency of the geometrical character of surfaces under scaling, rotation and translation. In addition to its mathematical significance, it also exhibits advantages over image processing and economic applications. In this paper, the authors used partition-based principal component analysis similar to two-dimensional Sub-Image Principal Component Analysis (SIMPCA), along with a suitably modified atypical wavelet transform in the classification of 2D images. The proposed framework is further extended to three-dimensional objects using machine learning classifiers. To strengthen fairness, we benchmarked against both Random Forest (RF) and Support Vector Machine (SVM) classifiers using nested cross-validation, showing consistent gains when TIFV is included. In addition, we carried out a robustness analysis by introducing Gaussian noise to the intensity channel, confirming that TIFV degrades much more gracefully compared to traditional descriptors. Experimental results demonstrate that the method achieves improved performance compared to traditional hand-crafted descriptors such as measured values and histogram of oriented gradients. In addition, it is found to be useful that this proposed algorithm is capable of establishing consistency locally, which is never possible without partition. However, a reasonable amount of computational complexity is reduced. We note that comparisons with deep learning baselines are beyond the scope of this study, and our contribution is positioned within the domain of interpretable, affine-invariant descriptors that enhance classical machine learning pipelines. Full article

►▼ Show Figures

Figure 1

22 pages, 4598 KB

Open AccessArticle

Machinability of Vitrified Semi-Finished Products: Chip Formation and Heat Development at the Cutting Edge

by Jannick Fuchs, Yehor Kozlovets, Jonathan Alms, Markus Meurer, Christian Hopmann, Thomas Bergs and Mustapha Abouridouane

Polymers 2025, 17(19), 2681; https://doi.org/10.3390/polym17192681 - 3 Oct 2025

Viewed by 254

Abstract

Fibre-reinforced composites are facing new challenges in the context particular in sustainability and recyclability. Vitrimers could be useful as new matrices to support the increase in sustainability. Due to their high strength, which is comparable to that of thermosets often used in composites, and their covalent adaptive networks, which make them reshapeable for scaled-up manufacturing and recycling purposes, they are very useful. Orthogonal cutting is used for precise reshaping and functional integration into carbon fibre reinforced plastics. Vitrimers could improve processing results at the cutting edge as well as surface quality thanks to their self-healing properties compared to brittle matrices, as well as enabling the recycling of formed chips and scrap. This study showcases the manufacturing of a carbon fibre-reinforced vitrimer using 4-aminophenyl disulfide as a hardener, with vacuum-assisted resin infusion. The temperature of chip formation and the cutting parameters are then shown for different fibre orientations, cutting widths and speeds. The observed cutting forces are lower (less than 140 N) and more irregular for fibre orientations 45°/135°, increasing with cutting depth, and fluctuating periodically during machining. Despite varying cutting speeds, the forces remain relatively constant in range between 85 N and 175 N for 0°/90° fibre orientation and 50 N and 120 N for 45°/135° fibre orientation, with no significant tool wear observed and lower-damage depth and overhanging fibres observed for 0°/90° fibre orientation. Damage observation of the cutting tool shows promising results, with lower abrasion observed compared to thermoset matrices. Microscopic images of the broached surface also show good quality, which could be improved by self-healing of the matrix at higher temperatures. Temperature measurements of chip formation using a high-speed camera show a high temperature gradient as cutting speeds increase, but the temperature only ever exceeds 180 °C at cutting speeds of 150 m/min, ensuring reprocessability since this is below the degradation temperature. Therefore, orthogonal cutting of vitrimers can impact sustainable composite processing. Full article

(This article belongs to the Section Polymer Networks and Gels)

►▼ Show Figures

Figure 1

20 pages, 9101 KB

Open AccessArticle

Improved Measurement Method of Human Skin Temperature Based on Human Skin-Like Gradient Standard Radiation Source

by Tianshuo Li, Zhenyuan Zhang, Guojin Feng, Xinhua Chen and Ziqi Hao

Thermo 2025, 5(4), 38; https://doi.org/10.3390/thermo5040038 - 2 Oct 2025

Viewed by 151

Abstract

Infrared thermography for human skin temperature measurement, when calibrated with standard blackbodies, suffers from errors due to the mismatch in emissivity between a blackbody and human skin. This study introduces a novel calibration method utilizing a human skin-like gradient radiation source to enhance measurement accuracy. A custom radiation source with six temperature points and skin-like emissivity was developed. Thermal imagers were calibrated using this source, and their performance was compared against traditional blackbody calibration. The proposed method reduced the calibration error to 0.04 °C, a significant improvement over the 0.15 °C error obtained with blackbody calibration. Calibration with a skin-like radiation source proves superior to the blackbody method, enabling high-accuracy (less than 0.1 °C) human skin temperature measurement for improved fever screening. Full article

►▼ Show Figures

Figure 1

22 pages, 2189 KB

Open AccessArticle

Hybrid CNN-Swin Transformer Model to Advance the Diagnosis of Maxillary Sinus Abnormalities on CT Images Using Explainable AI

by Mohammad Alhumaid and Ayman G. Fayoumi

Computers 2025, 14(10), 419; https://doi.org/10.3390/computers14100419 - 2 Oct 2025

Viewed by 171

Abstract

Accurate diagnosis of sinusitis is essential due to its widespread prevalence and its considerable impact on patient quality of life. While multiple imaging techniques are available for detecting maxillary sinus, computed tomography (CT) remains the preferred modality because of its high sensitivity and spatial resolution. Although recent advances in deep learning have led to the development of automated methods for sinusitis classification, many existing models perform poorly in the presence of complex pathological features and offer limited interpretability, which hinders their integration into clinical workflows. In this study, we propose a hybrid deep learning framework that combines EfficientNetB0, a convolutional neural network, with the Swin Transformer, a vision transformer, to improve feature representation. An attention-based fusion module is used to integrate both local and global information, thereby enhancing diagnostic accuracy. To improve transparency and support clinical adoption, the model incorporates explainable artificial intelligence (XAI) techniques using Gradient-weighted Class Activation Mapping (Grad-CAM). This allows for visualization of the regions influencing the model’s predictions, helping radiologists assess the clinical relevance of the results. We evaluate the proposed method on a curated maxillary sinus CT dataset covering four diagnostic categories: Normal, Opacified, Polyposis, and Retention Cysts. The model achieves a classification accuracy of 95.83%, with precision, recall, and F1 score all at 95%. Grad-CAM visualizations indicate that the model consistently focuses on clinically significant regions of the sinus anatomy, supporting its potential utility as a reliable diagnostic aid in medical practice. Full article

(This article belongs to the Special Issue Application of Artificial Intelligence and Modeling Frameworks in Health Informatics and Related Fields)

►▼ Show Figures

Figure 1

16 pages, 3175 KB

Open AccessArticle

Defects Identification in Ceramic Composites Based on Laser-Line Scanning Thermography

by Yalei Wang, Jianqiu Zhou, Leilei Ding, Xiaohan Liu and Senlin Jin

J. Compos. Sci. 2025, 9(10), 532; https://doi.org/10.3390/jcs9100532 - 1 Oct 2025

Viewed by 264

Abstract

Infrared thermography non-destructive testing technology has been widely used in the defect detection of composite structures due to its advantages, including non-contact operation, rapidity, low cost, and high precision. In this study, a laser-line scanning system combined with an infrared thermography was developed, along with a corresponding dynamic sequence image reconstruction method, enabling rapid localization of surface damages. Then, high-precision quantitative characterization of defect morphology in reconstructed images was achieved by integrating an edge gradient detection algorithm. The reconstruction method was validated through finite element simulations and experimental studies. The results demonstrated that the laser-line scanning thermography effectively enables both rapid localization of surface damages and precise quantitative characterization of their morphology. Experimental measurements of ceramic materials indicate that the relative error in detecting crack width is about 6% when the crack is perpendicular to the scanning direction, and the relative error gradually increases when the angle between the crack and the scanning direction decreases. Additionally, an alumina ceramic plate with micrometer-width cracks is inspected by the continuous laser-line scanning thermography. The morphology detection results are completely consistent with the actual morphology. However, limited by the spatial resolution of the thermal imager in the experiment, the quantitative identification of the crack width cannot be carried out. Finally, the proposed method is also effective for detecting surface damage of wrinkles in ceramic matrix composites. It can localize damage and quantify its geometric features with an average relative error of less than 3%, providing a new approach for health monitoring of large-scale ceramic matrix composite structures. Full article

(This article belongs to the Section Composites Modelling and Characterization)

►▼ Show Figures

Figure 1

19 pages, 7270 KB

Open AccessArticle

A Fast Rotation Detection Network with Parallel Interleaved Convolutional Kernels

by Leilei Deng, Lifeng Sun and Hua Li

Symmetry 2025, 17(10), 1621; https://doi.org/10.3390/sym17101621 - 1 Oct 2025

Viewed by 184

Abstract

In recent years, convolutional neural network-based object detectors have achieved extensive applications in remote sensing (RS) image interpretation. While multi-scale feature modeling optimization remains a persistent research focus, existing methods frequently overlook the symmetrical balance between feature granularity and morphological diversity, particularly when handling high-aspect-ratio RS targets with anisotropic geometries. This oversight leads to suboptimal feature representations characterized by spatial sparsity and directional bias. To address this challenge, we propose the Parallel Interleaved Convolutional Kernel Network (PICK-Net), a rotation-aware detection framework that embodies symmetry principles through dual-path feature modulation and geometrically balanced operator design. The core innovation lies in the synergistic integration of cascaded dynamic sparse sampling and symmetrically decoupled feature modulation, enabling adaptive morphological modeling of RS targets. Specifically, the Parallel Interleaved Convolution (PIC) module establishes symmetric computation patterns through mirrored kernel arrangements, effectively reducing computational redundancy while preserving directional completeness through rotational symmetry-enhanced receptive field optimization. Complementing this, the Global Complementary Attention Mechanism (GCAM) introduces bidirectional symmetry in feature recalibration, decoupling channel-wise and spatial-wise adaptations through orthogonal attention pathways that maintain equilibrium in gradient propagation. Extensive experiments on RSOD and NWPU-VHR-10 datasets demonstrate our superior performance, achieving 92.2% and 84.90% mAP, respectively, outperforming state-of-the-art methods including EfficientNet and YOLOv8. With only 12.5 M parameters, the framework achieves symmetrical optimization of accuracy-efficiency trade-offs. Ablation studies confirm that the symmetric interaction between PIC and GCAM enhances detection performance by 2.75%, particularly excelling in scenarios requiring geometric symmetry preservation, such as dense target clusters and extreme scale variations. Cross-domain validation on agricultural pest datasets further verifies its rotational symmetry generalization capability, demonstrating 84.90% accuracy in fine-grained orientation-sensitive detection tasks. Full article

(This article belongs to the Section Computer)

►▼ Show Figures

Figure 1

25 pages, 13151 KB

Open AccessArticle

Adaptive Energy–Gradient–Contrast (EGC) Fusion with AIFI-YOLOv12 for Improving Nighttime Pedestrian Detection in Security

by Lijuan Wang, Zuchao Bao and Dongming Lu

Appl. Sci. 2025, 15(19), 10607; https://doi.org/10.3390/app151910607 - 30 Sep 2025

Viewed by 107

Abstract

In security applications, visible-light pedestrian detectors are highly sensitive to changes in illumination and fail under low-light or nighttime conditions, while infrared sensors, though resilient to lighting, often produce blurred object boundaries that hinder precise localization. To address these complementary limitations, we propose a practical multimodal pipeline—Adaptive Energy–Gradient–Contrast (EGC) Fusion with AIFI-YOLOv12—that first fuses infrared and low-light visible images using per-pixel weights derived from local energy, gradient magnitude and contrast measures, then detects pedestrians with an improved YOLOv12 backbone. The detector integrates an AIFI attention module at high semantic levels, replaces selected modules with A2C2f blocks to enhance cross-channel feature aggregation, and preserves P3–P5 outputs to improve small-object localization. We evaluate the complete pipeline on the LLVIP dataset and report Precision, Recall, mAP@50, mAP@50–95, GFLOPs, FPS and detection time, comparing against YOLOv8, YOLOv10–YOLOv12 baselines (n and s scales). Quantitative and qualitative results show that the proposed fusion restores complementary thermal and visible details and that the AIFI-enhanced detector yields more robust nighttime pedestrian detection while maintaining a competitive computational profile suitable for real-world security deployments. Full article

(This article belongs to the Special Issue Advanced Image Analysis and Processing Technologies and Applications)

►▼ Show Figures

Figure 1

Show export options Show export options

Select all

Export citation of selected articles as:

Error

Oops... you haven't selected anything for export.

Displaying article 1-50 on page 1 of 67.

Go to page 1 2 3 4 5

Search Results (3,332)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI