MDPI - Publisher of Open Access Journals

25 pages, 2439 KB

Open AccessArticle

Personalized Adaptive Gabor Filtering with Three-Stage Semi-Supervised Domain-Adversarial Learning for Cross-Subject SSVEP Decoding

by Junjun Guo, Xiaonan Pan, Ning Mi, Jianrui Zhang and Ting Huyan

Sensors 2026, 26(12), 3694; https://doi.org/10.3390/s26123694 - 10 Jun 2026

Viewed by 136

Abstract

Improving the decoding accuracy and information transfer rate (ITR) of steady-state visual evoked potential brain–computer interface (SSVEP-BCI) systems, while enhancing cross-subject generalization and reducing calibration cost, is essential for practical deployment. This study proposes an end-to-end framework that integrates adaptive filtering with semi-supervised [...] Read more.

Improving the decoding accuracy and information transfer rate (ITR) of steady-state visual evoked potential brain–computer interface (SSVEP-BCI) systems, while enhancing cross-subject generalization and reducing calibration cost, is essential for practical deployment. This study proposes an end-to-end framework that integrates adaptive filtering with semi-supervised domain adaptation. The framework incorporates a Gabor adaptive filter bank (G-AFB) to optimize time–frequency representations and extract features matched to individual neural responses. It also introduces a three-stage semi-supervised domain-adversarial neural network (TriS-DANN), which combines unsupervised pre-alignment and supervised fine-tuning to align cross-subject feature distributions and enable lightweight calibration. On the 1.0 s public benchmark dataset, G-AFB-tCNN achieved 89.13% accuracy, a 4.63 percentage-point improvement over its conventional filter-bank counterpart. On the 0.4 s in-house dataset, G-AFB-tCNN achieved 91.85% accuracy, a 3.22 percentage-point improvement over the conventional fixed filter bank. In transfer learning, TriS-DANN reached 86.60% accuracy using 0.4 s segments extracted from the stimulation period and only 23.07% of the available target-domain training/calibration trials, demonstrating higher efficiency and stability than conventional fine-tuning. These results support the proposed framework as a feasible route toward reliable, low-calibration SSVEP-BCI systems. Full article

(This article belongs to the Special Issue Advanced Biomedical Imaging and Signal Processing)

21 pages, 20330 KB

Open AccessArticle

A Fault Diagnosis Method for Mobile Communication Networks Based on Improved Convolutional Neural Networks

by Hongliang Tian, Bolin Song and Xiaoke Liu

Telecom 2026, 7(3), 60; https://doi.org/10.3390/telecom7030060 - 28 May 2026

Viewed by 150

Abstract

In response to the shortcomings of current mobile communication network (MCN) fault diagnosis methods, such as the insufficient robustness of time-series-spectrum features and the limited ability to capture long-distance dependencies, an improved convolutional neural network is proposed, along with a hybrid diagnosis method [...] Read more.

In response to the shortcomings of current mobile communication network (MCN) fault diagnosis methods, such as the insufficient robustness of time-series-spectrum features and the limited ability to capture long-distance dependencies, an improved convolutional neural network is proposed, along with a hybrid diagnosis method based on time-frequency perception and a lightweight deep network (TL-FDN). The TL-FDN introduces a time-series-spectrum feature enhancement module (TFN-E) at the input end, and enhances the robustness of features through a learnable Gabor filter bank. The main architecture employs a hybrid module that integrates a lightweight convolution (LiConv-Block) and a broadcast self-attention (BSA) mechanism (Former-Block), effectively balancing the efficiency of local feature extraction with the capture of global time-series dependencies. Additionally, the model uses a multi-task loss function to achieve joint diagnosis of fault type and fault location. The experimental results show that the average accuracy of the proposed TL-FDN method is 98.6%, which is 3.5% higher than that of the standard convolutional + standard attention baseline method. To strictly evaluate the performance improvement, this paper conducted a non-parametric Wilcoxon signed-rank test in 10 independent experiments. The p-values of the core model indicators were all strictly less than 0.05. These results statistically confirm the superiority of TL-FDN in the fault type identification and location tasks, while maintaining a lightweight parameter quantity suitable for edge-end deployment. Full article

(This article belongs to the Special Issue Emerging Technologies in Communications and Machine Learning)

► Show Figures

Figure 1

24 pages, 5111 KB

Open AccessArticle

Evolutionarily Optimized Multi-Scale Gabor Modeling of Directional Lesion Texture in Dermoscopic Images for Interpretable Melanoma Classification

by Raúl Santiago-Montero, Valentin Calzada-Ledesma, David Asael Gutiérrez-Hernández, Lucero de Montserrat Ortiz-Aguilar, Armando Mares-Castro, Luis Angel Xoca-Orozco and José de Jesús Flores-Sierra

Diagnostics 2026, 16(10), 1430; https://doi.org/10.3390/diagnostics16101430 - 8 May 2026

Viewed by 397

Abstract

Background: Melanoma is one of the most aggressive forms of skin cancer, making early and accurate diagnosis essential for improving patient outcomes. Methods: In this work, we propose an Evolutionary Gabor-based Melanoma Descriptor (Evo-GMD), a lightweight and interpretable approach designed under [...] Read more.

Background: Melanoma is one of the most aggressive forms of skin cancer, making early and accurate diagnosis essential for improving patient outcomes. Methods: In this work, we propose an Evolutionary Gabor-based Melanoma Descriptor (Evo-GMD), a lightweight and interpretable approach designed under the principles of Frugal AI. The method integrates multi-scale Gabor filtering with Differential Evolution to automatically learn discriminative texture patterns using a reduced set of parameters. The proposed approach was evaluated on the PH2 dataset, achieving competitive performance (accuracy above 95%) while maintaining low computational complexity and full interpretability. To further assess its robustness, complementary experiments were conducted on the ISIC 2017 dataset, which presents higher variability, class imbalance, and heterogeneous lesion characteristics. Results: The results reveal that multiple methods—including handcrafted descriptors, convolutional neural networks, and transfer learning models—exhibit significant performance degradation or converge to trivial solutions under these conditions. This behavior highlights that increasing model complexity does not necessarily improve classification performance when data constraints are present. Conclusions: Overall, the findings demonstrate that the proposed method provides a robust and efficient alternative for melanoma classification in low-resource scenarios, where data availability, computational capacity, and interpretability are critical factors. Full article

(This article belongs to the Special Issue Artificial Intelligence for Health and Medicine—2nd Edition)

► Show Figures

Figure 1

27 pages, 10843 KB

Open AccessArticle

Optimization of Gabor Filters Based on Quaternions for Image Preprocessing in the Automated Detection of Bemisia tabaci in Yellow Traps

by Ramiro Esquivel-Felix, Mireya Moreno-Lucio, Celina Lizeth Castañeda-Miranda, Héctor Alonso Guerrero-Osuna, Rodrigo Castañeda-Miranda, Carlos A. Olvera-Olvera, Ma. del Rosario Martínez-Blanco and Luis Octavio Solís-Sánchez

Algorithms 2026, 19(5), 360; https://doi.org/10.3390/a19050360 - 4 May 2026

Viewed by 257

Abstract

In precision agriculture, identifying pests such as the whitefly (Bemisia tabaci) is a significant challenge, as precise knowledge of these insects is essential for developing effective Integrated Pest Management (IPM) strategies. Automated daily monitoring within IPM programs optimizes the diagnostic registration [...] Read more.

In precision agriculture, identifying pests such as the whitefly (Bemisia tabaci) is a significant challenge, as precise knowledge of these insects is essential for developing effective Integrated Pest Management (IPM) strategies. Automated daily monitoring within IPM programs optimizes the diagnostic registration stage by reducing logistical expenses and manual errors, enabling early pest treatment interventions and providing quantitative data for informed decision-making. In this study, an image bank was processed using a Quaternionic Gabor Filter (QGF) algorithmto highlight textural features through hypercomplex correlation. The highlighted objects were then processed by a YOLOv8 pretrained model to identify Bemisia tabaci. Experimental results demonstrate that this combination achieves a precision of 0.868 and an mAP@0.5 of 0.950, while a PSNR of 34.10 dB ensures the structural integrity of the enhanced images. Although the total execution time averages 2.3 s per image due to preprocessing complexity, the GPU inference time of 10.3 ms confirms the potential for high-speed detection. This approach significantly enhanced the morphological features of Bemisia tabaci, increasing the robustness of the detection model and narrowing down processing conditions for yellow trap samples to strengthen precision in the semi-arid regions of Zacatecas, Mexico. Full article

(This article belongs to the Special Issue Advances in Computer Vision: Emerging Trends and Applications)

► Show Figures

Graphical abstract

21 pages, 1405 KB

Open AccessArticle

Bionic Corner Detection Based on Cooperative Processing of Simple Cells and End-Stopped Cells

by Shuo Sun and Haiyang Yu

Algorithms 2026, 19(5), 343; https://doi.org/10.3390/a19050343 - 30 Apr 2026

Viewed by 290

Abstract

Corner detection is a fundamental task in computer vision that plays a critical role in applications such as image registration, 3D reconstruction, and object tracking. In biological visual systems, simple cells in the primary visual cortex exhibit high selectivity to edge stimuli of [...] Read more.

Corner detection is a fundamental task in computer vision that plays a critical role in applications such as image registration, 3D reconstruction, and object tracking. In biological visual systems, simple cells in the primary visual cortex exhibit high selectivity to edge stimuli of specific orientations, while end-stopped cells can detect geometric singular structures such as line segment endpoints and corners. Existing corner detection methods based on visual neural computation typically employ a strategy of densely distributed end-stopped cells for corner localization, which suffers from significant localization deviation under small angle conditions due to mutual interference between responses of adjacent neurons. To address this problem, this paper proposes a bionic corner detection method based on cooperative processing of simple cells and end-stopped cells. The method constructs a two-stage cooperative processing framework: the edge filtering stage employs a Gabor filter bank to simulate the orientation selectivity of simple cells, extracting edge positions and orientation information; the dynamic construction stage builds unilateral end-stopped cells only at filtered edge positions based on local orientation information, fundamentally avoiding computational redundancy and response interference caused by global dense distribution; the corner localization stage determines precise corner coordinates through hierarchical clustering and dual-cluster centroid fusion strategies. Experimental results demonstrate that, in the 15° acute-angle regime where dense end-stopped schemes are most severely affected by response interference, the proposed method reduces the mean localization error from 8.76 to 2.34 pixels, corresponding to a 73.3% improvement; averaged across the eight tested angle levels from 15° to 165°, the improvement is approximately 40.9%, and all per-angle differences are statistically significant (paired t-test, p < 0.01 or below, N = 10 independent runs). On standard test images, the method attains the lowest mean localization error among the eight compared detectors (1.58 pixels, versus 1.68–3.42 pixels for Harris, FAST, COSFIRE, KAZE, SuperPoint, Deep Corner, and Wei et al.), while maintaining competitive detection rate, false-alarm rate, and runtime. Physiological plausibility validation experiments show that the correlation coefficient between the detection deviation of this method and human perceptual deviation reaches 0.923, indicating that the output of the framework aligns with previously reported human perceptual bias patterns and supporting its biological plausibility as a biologically inspired—rather than mechanistic—model of corner perception. The source code, dataset, and experimental results are publicly available (see Data Availability Statement). Full article

► Show Figures

Figure 1

27 pages, 6783 KB

Open AccessArticle

A Robust Intelligent CNN Model Enhanced with Gabor-Based Feature Extraction, SMOTE Balancing, and Adam Optimization for Multi-Grade Diabetic Retinopathy Classification

by Asri Mulyani, Muljono, Purwanto and Moch Arief Soeleman

J. Imaging 2026, 12(5), 188; https://doi.org/10.3390/jimaging12050188 - 27 Apr 2026

Viewed by 471

Abstract

Diabetic retinopathy (DR) is a leading cause of vision impairment and permanent blindness worldwide, requiring accurate and automated systems for multi-grade severity classification. However, standard Convolutional Neural Networks (CNNs) often struggle to capture fine, high-frequency microvascular patterns critical for diagnosis. This study proposes [...] Read more.

Diabetic retinopathy (DR) is a leading cause of vision impairment and permanent blindness worldwide, requiring accurate and automated systems for multi-grade severity classification. However, standard Convolutional Neural Networks (CNNs) often struggle to capture fine, high-frequency microvascular patterns critical for diagnosis. This study proposes a Robust Intelligent CNN Model (RICNN) that integrates Gabor-based feature extraction with deep learning to improve DR classification. Specifically, Gabor filters are applied during preprocessing to extract orientation- and frequency-sensitive texture features, which are transformed into feature maps and concatenated with CNN feature representations at the fully connected layer (feature-level fusion). The model also incorporates the Synthetic Minority Oversampling Technique (SMOTE) for data balancing and the Adam optimizer for efficient convergence. This integration enhances sensitivity to microvascular structures such as microaneurysms and hemorrhages. The proposed RICNN was evaluated on the Messidor dataset (1200 images) across four severity levels: Mild, Moderate, Severe, and Proliferative DR. The model achieved an accuracy of 89%, a precision of 88.75%, a recall of 89%, and an F1-score of 89%, with AUCs of 97% for Severe DR and 99% for Proliferative DR. Comparative analysis confirms that the proposed texture-aware Gabor enhancement significantly outperforms LBP and Color Histogram approaches, indicating its potential for reliable clinical decision support. Full article

(This article belongs to the Section Medical Imaging)

► Show Figures

Figure 1

24 pages, 5148 KB

Open AccessArticle

Plant-Leaf Disease Detection Based on Texture Enhancement Using ATD-Net

by Yuheng Li and Xiafen Zhang

AgriEngineering 2026, 8(5), 160; https://doi.org/10.3390/agriengineering8050160 - 22 Apr 2026

Viewed by 776

Abstract

Early plant leaf disease detection and timely control is important for agricultural yield and stability. Yet, it is difficult for manual labor to monitor the health of the plant leaf 24 h a day. Existing detection approach cannot meet the demands of texture [...] Read more.

Early plant leaf disease detection and timely control is important for agricultural yield and stability. Yet, it is difficult for manual labor to monitor the health of the plant leaf 24 h a day. Existing detection approach cannot meet the demands of texture enhancement features. Therefore, this paper proposes a new detection approach which undergoes three-layer transformations: convolutional layer, attention mechanism layer and loss function layer. Firstly, ADown is used to extract fine-grained texture features from suspected leaves to reduce computational load. Secondly, Gabor texture enhancement is proposed to extract and enhance the contour and the directional texture of suspected areas using multi-directional filtering, followed by a combination Transformer to enhance the global context modeling capability. Thirdly, a dynamic boundary loss function (DBL) is employed to dynamically adjust the probability distribution of bounding box regression through adaptive temperature coefficient and information entropy, thereby improving the positioning accuracy of the detection box. The experiments show that ATD-Net achieved an average accuracy of 87.42% (mAP50) and an accuracy of 85.96%, with a computational complexity of 6.5 GFLOPs. The visualization results and ablation experiments show that the collaborative work of the proposed modules significantly improves the detection robustness in complex backgrounds, early diseases, and small target scenes. Compared to the original model, ATD-Net achieves a performance improvement of 1.1% at mAP50 and a speed increase of 17.7%. The model size remains almost unchanged, at 5.2 MB. It is an efficient and promising solution for future real-time disease recognition in complex agricultural environments. Full article

(This article belongs to the Section Computer Applications and Artificial Intelligence in Agriculture)

► Show Figures

Figure 1

13 pages, 750 KB

Open AccessArticle

Evaluating Handcrafted Image Descriptors for Defect Detection in the X-Ray Inspection of Turbine Blade Castings: A Feature Separability Study

by Andrzej Burghardt and Wojciech Łabuński

Appl. Sci. 2026, 16(8), 3905; https://doi.org/10.3390/app16083905 - 17 Apr 2026

Viewed by 282

Abstract

The industrial X-ray inspection of turbine blade castings requires reliable and auditable decision support, yet defect indications are subtle, and data availability is limited. This study quantitatively assesses the diagnostic potential of handcrafted image descriptors by evaluating class separability in feature space, independently [...] Read more.

The industrial X-ray inspection of turbine blade castings requires reliable and auditable decision support, yet defect indications are subtle, and data availability is limited. This study quantitatively assesses the diagnostic potential of handcrafted image descriptors by evaluating class separability in feature space, independently of any trained classifier. The dataset comprises 1600 16-bit DICOM radiograms of 200 blades (eight views per blade), including 156 defective images with 207 localized defects. Standardized 32 × 32 ROI patches were sampled randomly in the vicinity of indications and from defect-free regions to reduce sample correlation and to emulate localization uncertainty. Feature vectors were extracted using five descriptor families—first-order statistics, GLCM/Haralick, FFT and wavelet (DWT) features, Gabor filters, and LBP—and the standardized z-score. Separability was ranked using complementary distribution-based and distance-based metrics grouped into three sets, and the results were min–max-normalized to enable TOP-5 comparisons. Spectral descriptors, particularly DWT wavelets and FFT combined with DWT, consistently achieved the highest scores in distributional metrics, supporting a lightweight screening profile. In contrast, richer combinations dominated multidimensional geometric metrics, indicating benefits from multi-perspective representations for offline analysis. The proposed metric-driven framework provides an interpretable basis for representation selection prior to classifier development under industrial constraints. Full article

(This article belongs to the Special Issue Applications of Vision Measurement System on Product Quality Control: 2nd Edition)

► Show Figures

Figure 1

25 pages, 15197 KB

Open AccessArticle

Semi-Automated Computational Identification of Fibrosis for Enhanced Histopathological Decision Support

by Alexandru-George Berciu, Diana Rus-Gonciar, Teodora Mocan, Lucia Agoston-Coldea, Carmen Cionca and Eva-Henrietta Dulf

J. Imaging 2026, 12(4), 152; https://doi.org/10.3390/jimaging12040152 - 31 Mar 2026

Viewed by 454

Abstract

Myocardial fibrosis is a critical prognostic marker involving a progressive cascade of pathological conditions. Accurate assessment of fibrosis in myocardial samples is a routine but difficult procedure for pathologists. This article presents a semi-automated system designed to ease this task while providing pixel-level [...] Read more.

Myocardial fibrosis is a critical prognostic marker involving a progressive cascade of pathological conditions. Accurate assessment of fibrosis in myocardial samples is a routine but difficult procedure for pathologists. This article presents a semi-automated system designed to ease this task while providing pixel-level accuracy that exceeds manual estimation capabilities. The proposed innovative approach combines Gabor filters with CIELAB color space analysis to ensure the efficiency and interpretability of calculations. Testing on histopathological samples, differentiating between fibrous, healthy, and variant tissues, yielded a promising accuracy of 87.5% for images with fibrosis and 80% for all 45 images tested. This system successfully establishes a solid foundation for automated diagnosis, providing pathologists with a reliable and highly accurate tool for quantitative analysis of cardiac tissue. Full article

(This article belongs to the Section AI in Imaging)

► Show Figures

Figure 1

25 pages, 42196 KB

Open AccessArticle

Frequency–Spatial Domain Jointly Guided Perceptual Network for Infrared Small Target Detection

by Yeteng Han, Minrui Ye, Bohan Liu, Jie Li, Chaoxian Jia, Wennan Cui and Tao Zhang

Remote Sens. 2026, 18(7), 1000; https://doi.org/10.3390/rs18071000 - 26 Mar 2026

Cited by 2 | Viewed by 941

Abstract

Infrared small target detection is a critical task in remote sensing. However, it remains highly challenging due to low contrast, heavy background clutter, and large variations in target scale. Traditional convolutional networks are inadequate for joint modeling, as they cannot effectively capture both [...] Read more.

Infrared small target detection is a critical task in remote sensing. However, it remains highly challenging due to low contrast, heavy background clutter, and large variations in target scale. Traditional convolutional networks are inadequate for joint modeling, as they cannot effectively capture both fine structural details and global contextual dependencies. To address these issues, we propose FSGPNet, a frequency–spatial domain jointly guided perceptual network that explicitly exploits complementary representations in both the frequency and spatial domains. Specifically, a Frequency–Spatial Enhancement Module (FSEM) is introduced to strengthen target details while suppressing background interference through high-frequency enhancement and Perona–Malik diffusion. To enhance global context modeling, we propose a Multi-Scale Global Perception (MSGP) module that integrates non-local attention with multi-scale dilated convolutions, enabling robust background modeling. Furthermore, a Gabor Transformer Attention Module (GTAM) is designed to achieve selective frequency–spatial feature aggregation via self-attention over multi-directional and multi-scale Gabor responses, effectively highlighting discriminative structures of various small targets. Extensive experiments are conducted on two benchmark datasets (IRSTD-1K and NUDT-SIRST) that cover typical remote sensing infrared scenarios. Quantitative and qualitative results demonstrate that FSGPNet consistently outperforms state-of-the-art methods across multiple evaluation metrics. These findings validate the effectiveness and robustness of the proposed FSGPNet for detecting small infrared targets in remote sensing applications. Full article

(This article belongs to the Special Issue Deep Learning-Based Small-Target Detection in Remote Sensing)

► Show Figures

Figure 1

24 pages, 8810 KB

Open AccessArticle

FreqPose: Frequency-Aware Diffusion with Fractional Gabor Filters and Global Pose–Semantic Alignment

by Meng Wang, Bing Wang, Huiling Chen, Jing Ren and Xueping Tang

Sensors 2026, 26(4), 1334; https://doi.org/10.3390/s26041334 - 19 Feb 2026

Viewed by 501

Abstract

The task of pose-guided person image generation has long been confronted with two major challenges: high-frequency texture details tend to blur and be lost during appearance transfer, while the semantic identity of the person is difficult to maintain consistently during pose changes. To [...] Read more.

The task of pose-guided person image generation has long been confronted with two major challenges: high-frequency texture details tend to blur and be lost during appearance transfer, while the semantic identity of the person is difficult to maintain consistently during pose changes. To address these issues, this paper proposes a diffusion-based generative framework that integrates frequency awareness and global semantic alignment. The framework consists of two core modules: a multi-level fractional-order Gabor frequency-aware network, which accurately extracts and reconstructs high-frequency texture features such as hair strands and fabric wrinkles, enhances image detail fidelity through fractional-order filtering and complex domain modeling; and a global semantic-pose alignment module that utilizes a cross-modal attention mechanism to establish a global mapping between pose features and appearance semantics, ensuring pose-driven semantic alignment and appearance consistency. The collaborative function of these two modules ensures that the generated results maintain structural integrity and natural textures even under complex pose variations and large-angle rotations. The experimental results on the DeepFashion and Market1501 datasets demonstrate that the proposed method outperforms existing state-of-the-art approaches in terms of SSIM, FID, and perceptual quality, validating the effectiveness of the model in enhancing texture fidelity and semantic consistency. Full article

(This article belongs to the Section Intelligent Sensors)

► Show Figures

Figure 1

14 pages, 421 KB

Open AccessArticle

Artificial Intelligence-Based Evaluation of Permanent First Molar Extraction Indications in Children Using Panoramic Radiographs

by Serap Gülçin Çetin, Ömer Faruk Ertuğrul, Nursezen Kavasoğlu and Veysel Eratilla

Children 2026, 13(2), 277; https://doi.org/10.3390/children13020277 - 17 Feb 2026

Cited by 1 | Viewed by 593

Abstract

Background: The aim of this study was to develop an artificial intelligence (AI)-based decision support model for evaluating the extraction indication of permanent first molars in pediatric patients using panoramic radiographs, and to investigate the potential contribution of this model to the clinical [...] Read more.

Background: The aim of this study was to develop an artificial intelligence (AI)-based decision support model for evaluating the extraction indication of permanent first molars in pediatric patients using panoramic radiographs, and to investigate the potential contribution of this model to the clinical decision-making process. Methods: This retrospective observational study analyzed 1000 panoramic radiographs obtained from children aged 8–10 years who attended the Clinics of Batman University Faculty of Dentistry for routine dental examination. Among the radiographs meeting the inclusion criteria, a total of 176 panoramic images were selected based on dental maturation assessment using the Demirjian tooth development staging system. Cases in which the permanent second molar was classified as Demirjian stages E–F were labeled as “extraction indication present”, while the remaining stages were labeled as “extraction indication absent”. A balanced dataset was created, consisting of 88 cases in each group. Image features were extracted using Gabor filters and Histogram of Oriented Gradients (HOG). The selected features were analyzed using a Support Vector Machine (SVM) classifier with a radial basis function (RBF) kernel. Model performance was evaluated using accuracy, sensitivity, specificity, F1-score, and area under the receiver operating characteristic curve (ROC–AUC). Results: The proposed Gabor–HOG–SVM-based AI model achieved an overall classification accuracy of 77.78% with an AUC value of 0.77 in distinguishing between “extraction indication present” and “extraction indication absent” cases. For the extraction-indicated group, the sensitivity was 0.81 and the F1-score was 0.79, whereas for the non-indicated group, the sensitivity and F1-score were 0.74 and 0.77, respectively. No statistically significant differences were observed between the groups in terms of age or sex distribution (p > 0.05). Conclusions: This study demonstrates that artificial intelligence-based analysis of panoramic radiographic images can provide an objective and reproducible decision support approach for evaluating extraction indications of permanent first molars in pediatric patients. The proposed model should be considered as an adjunctive tool to reduce observer-dependent variability rather than a replacement for clinical judgment, and its clinical applicability should be further validated through multicenter and multi-parametric studies. Full article

(This article belongs to the Section Pediatric Dentistry & Oral Medicine)

► Show Figures

Figure 1

19 pages, 2617 KB

Open AccessArticle

Topic-Modeling Guided Semantic Clustering for Enhancing CNN-Based Image Classification Using Scale-Invariant Feature Transform and Block Gabor Filtering

by Natthaphong Suthamno and Jessada Tanthanuch

J. Imaging 2026, 12(2), 70; https://doi.org/10.3390/jimaging12020070 - 9 Feb 2026

Viewed by 611

Abstract

This study proposes a topic-modeling guided framework that enhances image classification by introducing semantic clustering prior to CNN training. Images are processed through two key-point extraction pipelines: Scale-Invariant Feature Transform (SIFT) with Sobel edge detection and Block Gabor Filtering (BGF), to obtain local [...] Read more.

This study proposes a topic-modeling guided framework that enhances image classification by introducing semantic clustering prior to CNN training. Images are processed through two key-point extraction pipelines: Scale-Invariant Feature Transform (SIFT) with Sobel edge detection and Block Gabor Filtering (BGF), to obtain local feature descriptors. These descriptors are clustered using K-means to build a visual vocabulary. Bag of Words histograms then represent each image as a visual document. Latent Dirichlet Allocation is applied to uncover latent semantic topics, generating coherent image clusters. Cluster-specific CNN models, including AlexNet, GoogLeNet, and several ResNet variants, are trained under identical conditions to identify the most suitable architecture for each cluster. Two topic guided integration strategies, the Maximum Proportion Topic (MPT) and the Weight Proportion Topic (WPT), are then used to assign test images to the corresponding specialized model. Experimental results show that both the SIFT-based and BGF-based pipelines outperform non-clustered CNN models and a baseline method using Incremental PCA, K-means, Same-Cluster Prediction, and unweighted Ensemble Voting. The SIFT pipeline achieves the highest accuracy of 95.24% with the MPT strategy, while the BGF pipeline achieves 93.76% with the WPT strategy. These findings confirm that semantic structure introduced through topic modeling substantially improves CNN classification performance. Full article

(This article belongs to the Topic Machine Learning and Deep Learning in Medical Imaging)

► Show Figures

Figure 1

24 pages, 5019 KB

Open AccessArticle

A Dual Stream Deep Learning Framework for Alzheimer’s Disease Detection Using MRI Sonification

by Nadia A. Mohsin and Mohammed H. Abdul Ameer

J. Imaging 2026, 12(1), 46; https://doi.org/10.3390/jimaging12010046 - 15 Jan 2026

Cited by 1 | Viewed by 973

Abstract

Alzheimer’s Disease (AD) is an advanced brain illness that affects millions of individuals across the world. It causes gradual damage to the brain cells, leading to memory loss and cognitive dysfunction. Although Magnetic Resonance Imaging (MRI) is widely used in AD diagnosis, the [...] Read more.

Alzheimer’s Disease (AD) is an advanced brain illness that affects millions of individuals across the world. It causes gradual damage to the brain cells, leading to memory loss and cognitive dysfunction. Although Magnetic Resonance Imaging (MRI) is widely used in AD diagnosis, the existing studies rely solely on the visual representations, leaving alternative features unexplored. The objective of this study is to explore whether MRI sonification can provide complementary diagnostic information when combined with conventional image-based methods. In this study, we propose a novel dual-stream multimodal framework that integrates 2D MRI slices with their corresponding audio representations. MRI images are transformed into audio signals using a multi-scale, multi-orientation Gabor filtering, followed by a Hilbert space-filling curve to preserve spatial locality. The image and sound modalities are processed using a lightweight CNN and YAMNet, respectively, then fused via logistic regression. The experimental results of the multimodal achieved the highest accuracy in distinguishing AD from Cognitively Normal (CN) subjects at 98.2%, 94% for AD vs. Mild Cognitive Impairment (MCI), and 93.2% for MCI vs. CN. This work provides a new perspective and highlights the potential of audio transformation of imaging data for feature extraction and classification. Full article

(This article belongs to the Section AI in Imaging)

► Show Figures

Figure 1

19 pages, 2960 KB

Open AccessArticle

Gabor Transform-Based Deep Learning System Using CNN for Melanoma Detection

by S. Deivasigamani, C. Senthilpari, Siva Sundhara Raja. D, A. Thankaraj, G. Narmadha and K. Gowrishankar

Computers 2026, 15(1), 54; https://doi.org/10.3390/computers15010054 - 13 Jan 2026

Cited by 2 | Viewed by 853

Abstract

Melanoma is highly dangerous and can spread rapidly to other parts of the body. It has an increasing fatality rate among different types of cancer. Timely detection of skin malignancies can reduce overall mortality. Therefore, clinical screening methods require more time and accuracy [...] Read more.

Melanoma is highly dangerous and can spread rapidly to other parts of the body. It has an increasing fatality rate among different types of cancer. Timely detection of skin malignancies can reduce overall mortality. Therefore, clinical screening methods require more time and accuracy for diagnosis. An automated, computer-aided system would facilitate earlier melanoma detection, thereby increasing patient survival rates. This paper identifies melanoma images using a Convolutional Neural Network. Skin images are preprocessed using Histogram Equalization and Gabor transforms. A Gabor filter-based Convolutional Neural Network (CNN) classifier trains and classifies the extracted features. We adopt Gabor filters because they are bandpass filters that transform a pixel into a multi-resolution kernel matrix, providing detailed information about the image. This study suggests a method with accuracy, sensitivity, and specificity of 98.58%, 98.66%, and 98.75%, respectively. This research supports SDGs 3 and 4 by facilitating early melanoma detection and enhancing AI-driven medical education. Full article

(This article belongs to the Topic AI, Deep Learning, and Machine Learning in Veterinary Science Imaging)

► Show Figures

Figure 1

Search Results (172)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (172)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI