Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (47)

Search Parameters:
Keywords = Gabor wavelet

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
19 pages, 1885 KB  
Article
A Hierarchical Multi-Resolution Self-Supervised Framework for High-Fidelity 3D Face Reconstruction Using Learnable Gabor-Aware Texture Modeling
by Pichet Mareo and Rerkchai Fooprateepsiri
J. Imaging 2026, 12(1), 26; https://doi.org/10.3390/jimaging12010026 - 5 Jan 2026
Viewed by 729
Abstract
High-fidelity 3D face reconstruction from a single image is challenging, owing to the inherently ambiguous depth cues and the strong entanglement of multi-scale facial textures. In this regard, we propose a hierarchical multi-resolution self-supervised framework (HMR-Framework), which reconstructs coarse-, medium-, and fine-scale facial [...] Read more.
High-fidelity 3D face reconstruction from a single image is challenging, owing to the inherently ambiguous depth cues and the strong entanglement of multi-scale facial textures. In this regard, we propose a hierarchical multi-resolution self-supervised framework (HMR-Framework), which reconstructs coarse-, medium-, and fine-scale facial geometry progressively through a unified pipeline. A coarse geometric prior is first estimated via 3D morphable model regression, followed by medium-scale refinement using a vertex deformation map constrained by a global–local Markov random field loss to preserve structural coherence. In order to improve fine-scale fidelity, a learnable Gabor-aware texture enhancement module has been proposed to decouple spatial–frequency information and thus improve sensitivity for high-frequency facial attributes. Additionally, we employ a wavelet-based detail perception loss to preserve the edge-aware texture features while mitigating noise commonly observed in in-the-wild images. Extensive qualitative and quantitative evaluation of benchmark datasets indicate that the proposed framework provides better fine-detail reconstruction than existing state-of-the-art methods, while maintaining robustness over pose variations. Notably, the hierarchical design increases semantic consistency across multiple geometric scales, providing a functional solution for high-fidelity 3D face reconstruction from monocular images. Full article
Show Figures

Figure 1

89 pages, 1188 KB  
Article
New Frontiers of Fractal Uncertainty
by Saeed Hashemi Sababe and Ismail Nikoufar
Fractal Fract. 2025, 9(12), 808; https://doi.org/10.3390/fractalfract9120808 - 9 Dec 2025
Cited by 1 | Viewed by 540
Abstract
We extend the classical fractal uncertainty principle (FUP) framework in time-frequency analysis by exploring several novel directions. First, we generalize the FUP beyond the classical Gaussian window by investigating non-Gaussian windows and the corresponding generalized Fock space techniques. Second, we develop uncertainty estimates [...] Read more.
We extend the classical fractal uncertainty principle (FUP) framework in time-frequency analysis by exploring several novel directions. First, we generalize the FUP beyond the classical Gaussian window by investigating non-Gaussian windows and the corresponding generalized Fock space techniques. Second, we develop uncertainty estimates in alternative joint representations, including the continuous wavelet transform and directional representations such as shearlets. Third, we study fractal uncertainty on random and anisotropic fractal sets, providing probabilistic and geometric refinements of the FUP. Fourth, we connect these results with semiclassical and microlocal analysis, thereby elucidating the role of fractal geometry in resonance theory and pseudodifferential operators. Finally, we extend the analysis beyond Gaussian Gabor multipliers by considering non-Gaussian generating functions and irregular lattice samplings. Our results yield new operator norm estimates and spectral properties, with potential applications in signal processing, quantum mechanics, and numerical analysis. Full article
Show Figures

Figure 1

21 pages, 3966 KB  
Article
Automatic Fault Diagnosis System for Induction Motors During Transient Conditions
by Wojciech Wroński and Maciej Sułowicz
Energies 2025, 18(24), 6439; https://doi.org/10.3390/en18246439 - 9 Dec 2025
Viewed by 640
Abstract
This paper presents a novel system for the fault diagnosis of induction motors, employing the Transient Motor Current Signature Analysis (TMCSA) method. The developed system operates in a laboratory environment and enables the detection of motor faults during transient conditions, specifically during the [...] Read more.
This paper presents a novel system for the fault diagnosis of induction motors, employing the Transient Motor Current Signature Analysis (TMCSA) method. The developed system operates in a laboratory environment and enables the detection of motor faults during transient conditions, specifically during the startup phase. The diagnostic process relies on tracking characteristic patterns in the time–frequency domain, which are extracted from current signals using advanced signal processing techniques, including the Gabor transform, Short-Time Fourier Transform (STFT), Wigner–Ville distribution, and Continuous Wavelet Transform (CWT). These transformations allow precise identification of fault-related components and their evolution over time. Experimental investigations were conducted for two distinct types of faults: a broken rotor bar and mixed eccentricity. The obtained results demonstrate a high accuracy of fault detection and confirm the robustness of the proposed method. Furthermore, the findings indicate its suitability for practical applications in variable-speed drive systems, where conventional steady-state diagnostic methods are often ineffective. Full article
(This article belongs to the Section E: Electric Vehicles)
Show Figures

Figure 1

23 pages, 6001 KB  
Article
Quantification of Flavonoid Contents in Holy Basil Using Hyperspectral Imaging and Deep Learning Approaches
by Apichat Suratanee, Panita Chutimanukul and Kitiporn Plaimas
Appl. Sci. 2025, 15(13), 7582; https://doi.org/10.3390/app15137582 - 6 Jul 2025
Cited by 1 | Viewed by 1515
Abstract
Holy basil (Ocimum tenuiflorum L.) is a medicinal herb rich in bioactive flavonoids with therapeutic properties. Traditional quantification methods rely on time-consuming and destructive extraction processes, whereas hyperspectral imaging provides a rapid, non-destructive alternative by analysing spectral signatures. However, effectively linking hyperspectral [...] Read more.
Holy basil (Ocimum tenuiflorum L.) is a medicinal herb rich in bioactive flavonoids with therapeutic properties. Traditional quantification methods rely on time-consuming and destructive extraction processes, whereas hyperspectral imaging provides a rapid, non-destructive alternative by analysing spectral signatures. However, effectively linking hyperspectral data to flavonoid levels remains a challenge for developing early detection tools before harvest. This study integrates deep learning with hyperspectral imaging to quantify flavonoid contents in 113 samples from 26 Thai holy basil cultivars collected across diverse regions of Thailand. Two deep learning architectures, ResNet1D and CNN1D, were evaluated in combination with feature extraction techniques, including wavelet transformation and Gabor-like filtering. ResNet1D with wavelet transformation achieved optimal performance, yielding an area under the receiver operating characteristic curve (AUC) of 0.8246 and an accuracy of 0.7702 for flavonoid content classification. Cross-validation demonstrated the model’s robust predictive capability in identifying antioxidant-rich samples. Samples with the highest predicted flavonoid content were identified, and cultivars exhibiting elevated levels of both flavonoids and phenolics were highlighted across various regions of Thailand. These findings demonstrate the predictive capability of hyperspectral data combined with deep learning for phytochemical assessment. This approach offers a valuable tool for non-destructive quality evaluation and supports cultivar selection for higher phytochemical content in breeding programs and agricultural applications. Full article
Show Figures

Figure 1

12 pages, 2801 KB  
Article
Multi-Algorithm Feature Extraction from Dual Sections for the Recognition of Three African Redwoods
by Jiawen Sun, Jiashun Niu, Liren Xu, Jianping Sun and Linhong Zhao
Forests 2025, 16(7), 1043; https://doi.org/10.3390/f16071043 - 21 Jun 2025
Viewed by 608
Abstract
To address the persistent challenge of low recognition accuracy in precious wood species classification, this study proposes a novel methodology for identifying Pterocarpus santalinus, Pterocarpus tinctorius (PTD), and Pterocarpus tinctorius (Zambia). This approach synergistically integrates artificial neural networks (ANNs) with advanced image feature [...] Read more.
To address the persistent challenge of low recognition accuracy in precious wood species classification, this study proposes a novel methodology for identifying Pterocarpus santalinus, Pterocarpus tinctorius (PTD), and Pterocarpus tinctorius (Zambia). This approach synergistically integrates artificial neural networks (ANNs) with advanced image feature extraction techniques, specifically Fast Fourier Transform, Gabor Transform, Wavelet Transform, and Gray-Level Co-occurrence Matrix. Features were extracted from both transverse and longitudinal wood sections. Fifteen distinct ANN models were subsequently developed: hybrid-section models combined features from different sections using a single algorithm, while multi-algorithm models aggregated features from the same section across all four algorithms. The dual-section hybrid wavelet model (LC4) demonstrated superior performance, achieving a perfect 100% recognition accuracy. High accuracies were also observed in the four-parameter combination models for longitudinal (L5) and transverse (C5) sections, yielding 97.62% and 91.67%, respectively. Notably, 92.31% of the LC4 model’s test samples exhibited an absolute error of ≤1%, highlighting its high reliability and precision. These findings confirm the efficacy of integrating image processing with neural networks for fine-grained wood identification and underscore the exceptional discriminative power of wavelet-based features in cross-sectional data fusion. Full article
(This article belongs to the Section Wood Science and Forest Products)
Show Figures

Figure 1

17 pages, 2781 KB  
Article
Enhancing AI-Driven Diagnosis of Invasive Ductal Carcinoma with Morphologically Guided and Interpretable Deep Learning
by Suphakon Jarujunawong and Paramate Horkaew
Appl. Sci. 2025, 15(12), 6883; https://doi.org/10.3390/app15126883 - 18 Jun 2025
Viewed by 1077
Abstract
Artificial intelligence is increasingly shaping the landscape of computer-aided diagnosis of breast cancer. Despite incrementally improved accuracy, pathologist supervision remains essential for verified interpretation. While prior research focused on devising deep model architecture, this study examines the pivotal role of multi-band visual-enhanced features [...] Read more.
Artificial intelligence is increasingly shaping the landscape of computer-aided diagnosis of breast cancer. Despite incrementally improved accuracy, pathologist supervision remains essential for verified interpretation. While prior research focused on devising deep model architecture, this study examines the pivotal role of multi-band visual-enhanced features in invasive ductal carcinoma classification using whole slide imaging. Our results showed that orientation invariant filters achieved an accuracy of 0.8125, F1-score of 0.8134, and AUC of 0.8761, while preserving cellular arrangement and tissue morphology. By utilizing spatial relationships across varying extents, the proposed fusion strategy aligns with pathological interpretation principles. While integrating Gabor wavelet responses into ResNet-50 enhanced feature association, the comparative analysis emphasized the benefits of weighted morphological fusion, further strengthening diagnostic performance. These insights underscore the crucial role of informative filters in advancing DL schemes for breast cancer screening. Future research incorporating diverse, multi-center datasets could further validate the approach and broaden its diagnostic applications. Full article
(This article belongs to the Special Issue Novel Insights into Medical Images Processing)
Show Figures

Figure 1

22 pages, 4582 KB  
Article
Enhanced Object Detection in Thangka Images Using Gabor, Wavelet, and Color Feature Fusion
by Yukai Xian, Yurui Lee, Te Shen, Ping Lan, Qijun Zhao and Liang Yan
Sensors 2025, 25(11), 3565; https://doi.org/10.3390/s25113565 - 5 Jun 2025
Cited by 5 | Viewed by 1377
Abstract
Thangka image detection poses unique challenges due to complex iconography, densely packed small-scale elements, and stylized color–texture compositions. Existing detectors often struggle to capture both global structures and local details and rarely leverage domain-specific visual priors. To address this, we propose a frequency- [...] Read more.
Thangka image detection poses unique challenges due to complex iconography, densely packed small-scale elements, and stylized color–texture compositions. Existing detectors often struggle to capture both global structures and local details and rarely leverage domain-specific visual priors. To address this, we propose a frequency- and prior-enhanced detection framework based on YOLOv11, specifically tailored for Thangka images. We introduce a Learnable Lifting Wavelet Block (LLWB) to decompose features into low- and high-frequency components, while LLWB_Down and LLWB_Up enable frequency-guided multi-scale fusion. To incorporate chromatic and directional cues, we design a Color-Gabor Block (CGBlock), a dual-branch attention module based on HSV histograms and Gabor responses, and embed it via the Color-Gabor Cross Gate (C2CG) residual fusion module. Furthermore, we redesign all detection heads with decoupled branches and introduce center-ness prediction, alongside an additional shallow detection head to improve recall for ultra-small targets. Extensive experiments on a curated Thangka dataset demonstrate that our model achieves 89.5% mAP@0.5, 59.4% mAP@[0.5:0.95], and 84.7% recall, surpassing all baseline detectors while maintaining a compact size of 20.9 M parameters. Ablation studies validate the individual and synergistic contributions of each proposed component. Our method provides a robust and interpretable solution for fine-grained object detection in complex heritage images. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

41 pages, 1802 KB  
Review
A Systematic Review of CNN Architectures, Databases, Performance Metrics, and Applications in Face Recognition
by Andisani Nemavhola, Colin Chibaya and Serestina Viriri
Information 2025, 16(2), 107; https://doi.org/10.3390/info16020107 - 5 Feb 2025
Cited by 19 | Viewed by 10061
Abstract
This study provides a comparative evaluation of face recognition databases and Convolutional Neural Network (CNN) architectures used in training and testing face recognition systems. The databases span from early datasets like Olivetti Research Laboratory (ORL) and Facial Recognition Technology (FERET) to more recent [...] Read more.
This study provides a comparative evaluation of face recognition databases and Convolutional Neural Network (CNN) architectures used in training and testing face recognition systems. The databases span from early datasets like Olivetti Research Laboratory (ORL) and Facial Recognition Technology (FERET) to more recent collections such as MegaFace and Ms-Celeb-1M, offering a range of sizes, subject diversity, and image quality. Older databases, such as ORL and FERET, are smaller and cleaner, while newer datasets enable large-scale training with millions of images but pose challenges like inconsistent data quality and high computational costs. The study also examines CNN architectures, including FaceNet and Visual Geometry Group 16 (VGG16), which show strong performance on large datasets like Labeled Faces in the Wild (LFW) and VGGFace, achieving accuracy rates above 98%. In contrast, earlier models like Support Vector Machine (SVM) and Gabor Wavelets perform well on smaller datasets but lack scalability for larger, more complex datasets. The analysis highlights the growing importance of multi-task learning and ensemble methods, as seen in Multi-Task Cascaded Convolutional Networks (MTCNNs). Overall, the findings emphasize the need for advanced algorithms capable of handling large-scale, real-world challenges while optimizing accuracy and computational efficiency in face recognition systems. Full article
(This article belongs to the Special Issue Machine Learning and Data Mining for User Classification)
Show Figures

Figure 1

25 pages, 2248 KB  
Article
SCMs: Systematic Conglomerated Models for Audio Cough Signal Classification
by Sunil Kumar Prabhakar and Dong-Ok Won
Algorithms 2024, 17(7), 302; https://doi.org/10.3390/a17070302 - 8 Jul 2024
Cited by 2 | Viewed by 2026
Abstract
A common and natural physiological response of the human body is cough, which tries to push air and other wastage thoroughly from the airways. Due to environmental factors, allergic responses, pollution or some diseases, cough occurs. A cough can be either dry or [...] Read more.
A common and natural physiological response of the human body is cough, which tries to push air and other wastage thoroughly from the airways. Due to environmental factors, allergic responses, pollution or some diseases, cough occurs. A cough can be either dry or wet depending on the amount of mucus produced. A characteristic feature of the cough is the sound, which is a quacking sound mostly. Human cough sounds can be monitored continuously, and so, cough sound classification has attracted a lot of interest in the research community in the last decade. In this research, three systematic conglomerated models (SCMs) are proposed for audio cough signal classification. The first conglomerated technique utilizes the concept of robust models like the Cross-Correlation Function (CCF) and Partial Cross-Correlation Function (PCCF) model, Least Absolute Shrinkage and Selection Operator (LASSO) model, elastic net regularization model with Gabor dictionary analysis and efficient ensemble machine learning techniques, the second technique utilizes the concept of stacked conditional autoencoders (SAEs) and the third technique utilizes the concept of using some efficient feature extraction schemes like Tunable Q Wavelet Transform (TQWT), sparse TQWT, Maximal Information Coefficient (MIC), Distance Correlation Coefficient (DCC) and some feature selection techniques like the Binary Tunicate Swarm Algorithm (BTSA), aggregation functions (AFs), factor analysis (FA), explanatory factor analysis (EFA) classified with machine learning classifiers, kernel extreme learning machine (KELM), arc-cosine ELM, Rat Swarm Optimization (RSO)-based KELM, etc. The techniques are utilized on publicly available datasets, and the results show that the highest classification accuracy of 98.99% was obtained when sparse TQWT with AF was implemented with an arc-cosine ELM classifier. Full article
(This article belongs to the Special Issue Quantum and Classical Artificial Intelligence)
Show Figures

Figure 1

22 pages, 3024 KB  
Article
Augmenting Aquaculture Efficiency through Involutional Neural Networks and Self-Attention for Oplegnathus Punctatus Feeding Intensity Classification from Log Mel Spectrograms
by Usama Iqbal, Daoliang Li, Zhuangzhuang Du, Muhammad Akhter, Zohaib Mushtaq, Muhammad Farrukh Qureshi and Hafiz Abbad Ur Rehman
Animals 2024, 14(11), 1690; https://doi.org/10.3390/ani14111690 - 5 Jun 2024
Cited by 12 | Viewed by 2373
Abstract
Understanding the feeding dynamics of aquatic animals is crucial for aquaculture optimization and ecosystem management. This paper proposes a novel framework for analyzing fish feeding behavior based on a fusion of spectrogram-extracted features and deep learning architecture. Raw audio waveforms are first transformed [...] Read more.
Understanding the feeding dynamics of aquatic animals is crucial for aquaculture optimization and ecosystem management. This paper proposes a novel framework for analyzing fish feeding behavior based on a fusion of spectrogram-extracted features and deep learning architecture. Raw audio waveforms are first transformed into Log Mel Spectrograms, and a fusion of features such as the Discrete Wavelet Transform, the Gabor filter, the Local Binary Pattern, and the Laplacian High Pass Filter, followed by a well-adapted deep model, is proposed to capture crucial spectral and spectral information that can help distinguish between the various forms of fish feeding behavior. The Involutional Neural Network (INN)-based deep learning model is used for classification, achieving an accuracy of up to 97% across various temporal segments. The proposed methodology is shown to be effective in accurately classifying the feeding intensities of Oplegnathus punctatus, enabling insights pertinent to aquaculture enhancement and ecosystem management. Future work may include additional feature extraction modalities and multi-modal data integration to further our understanding and contribute towards the sustainable management of marine resources. Full article
(This article belongs to the Special Issue Animal Health and Welfare in Aquaculture)
Show Figures

Figure 1

23 pages, 7093 KB  
Article
Synthetic Aperture Radar Image Change Detection Based on Principal Component Analysis and Two-Level Clustering
by Liangliang Li, Hongbing Ma, Xueyu Zhang, Xiaobin Zhao, Ming Lv and Zhenhong Jia
Remote Sens. 2024, 16(11), 1861; https://doi.org/10.3390/rs16111861 - 23 May 2024
Cited by 49 | Viewed by 7235
Abstract
Synthetic aperture radar (SAR) change detection provides a powerful tool for continuous, reliable, and objective observation of the Earth, supporting a wide range of applications that require regular monitoring and assessment of changes in the natural and built environment. In this paper, we [...] Read more.
Synthetic aperture radar (SAR) change detection provides a powerful tool for continuous, reliable, and objective observation of the Earth, supporting a wide range of applications that require regular monitoring and assessment of changes in the natural and built environment. In this paper, we introduce a novel SAR image change detection method based on principal component analysis and two-level clustering. First, two difference images of the log-ratio and mean-ratio operators are computed, then the principal component analysis fusion model is used to fuse the two difference images, and a new difference image is generated. To incorporate contextual information during the feature extraction phase, Gabor wavelets are used to obtain the representation of the difference image across multiple scales and orientations. The maximum magnitude across all orientations at each scale is then concatenated to form the Gabor feature vector. Following this, a cascading clustering algorithm is developed within this discriminative feature space by merging the first-level fuzzy c-means clustering with the second-level neighbor rule. Ultimately, the two-level combination of the changed and unchanged results produces the final change map. Five SAR datasets are used for the experiment, and the results show that our algorithm has significant advantages in SAR change detection. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Figure 1

20 pages, 3669 KB  
Article
Ultrasonic Detection of Aliased Signal Separation Based on Adaptive Feature Dictionary and K–SVD Algorithm for Protective Coatings of Assembled Steel Structure
by Yiyi Liu, Ruiqi Zhou, Zhigang Wang, Qiufeng Li, Chao Lu and Haitao Wang
Coatings 2023, 13(7), 1239; https://doi.org/10.3390/coatings13071239 - 11 Jul 2023
Cited by 7 | Viewed by 1704
Abstract
When using ultrasound to detect the thickness of protective coatings on assembled steel structures, the coatings are extremely thin, which can cause echo signals to overlap and impair the detection accuracy. Therefore, the study of the separation of the superimposed signals is essential [...] Read more.
When using ultrasound to detect the thickness of protective coatings on assembled steel structures, the coatings are extremely thin, which can cause echo signals to overlap and impair the detection accuracy. Therefore, the study of the separation of the superimposed signals is essential for the precise measurement of the thickness of thinner coatings. A method for signal time domain feature extraction based on an adaptive feature dictionary and K–SVD is investigated. First, the wavelet transform, which is sensitive to singular signal values, is used to identify the extreme values of the signal and use them as the new signal to be processed. Then, the feature signal extracted by wavelet transform is transformed into Hankel matrix form, and the initial feature dictionary is constructed by period segmentation and random extraction. The optimized feature dictionary is subsequently obtained by enhancing the K–SVD algorithm. Finally, the time domain signal is reconstructed using the optimized feature dictionary. Simulations and experiments demonstrate that the method is more accurate in separating mixed signals and extracting signal time domain feature information than the conventional wavelet transform and Gabor dictionary-based MP algorithm, and that it is more advantageous in detecting the thickness of protective coatings. Full article
(This article belongs to the Section Functional Polymer Coatings and Films)
Show Figures

Figure 1

17 pages, 171558 KB  
Communication
Two Filters for Acquiring the Profiles from Images Obtained from Weak-Light Background, Fluorescence Microscope, Transmission Electron Microscope, and Near-Infrared Camera
by Yinghui Huang, Ruoxi Yang, Xin Geng, Zongan Li and Ye Wu
Sensors 2023, 23(13), 6207; https://doi.org/10.3390/s23136207 - 6 Jul 2023
Cited by 4 | Viewed by 2568
Abstract
Extracting the profiles of images is critical because it can bring simplified description and draw special attention to particular areas in the images. In our work, we designed two filters via the exponential and hypotenuse functions for profile extraction. Their ability to extract [...] Read more.
Extracting the profiles of images is critical because it can bring simplified description and draw special attention to particular areas in the images. In our work, we designed two filters via the exponential and hypotenuse functions for profile extraction. Their ability to extract the profiles from the images obtained from weak-light conditions, fluorescence microscopes, transmission electron microscopes, and near-infrared cameras is proven. Moreover, they can be used to extract the nesting structures in the images. Furthermore, their performance in extracting images degraded by Gaussian noise is evaluated. We used Gaussian white noise with a mean value of 0.9 to create very noisy images. These filters are effective for extracting the edge morphology in the noisy images. For the purpose of a comparative study, we used several well-known filters to process these noisy images, including the filter based on Gabor wavelet, the filter based on the watershed algorithm, and the matched filter, the performances of which in profile extraction are either comparable or not effective when dealing with extensively noisy images. Our filters have shown the potential for use in the field of pattern recognition and object tracking. Full article
(This article belongs to the Special Issue Advanced Biomedical Optics and Imaging)
Show Figures

Figure 1

37 pages, 8265 KB  
Article
Quantized Information in Spectral Cyberspace
by Milton A. Garcés
Entropy 2023, 25(3), 419; https://doi.org/10.3390/e25030419 - 26 Feb 2023
Cited by 2 | Viewed by 2687
Abstract
The constant-Q Gabor atom is developed for spectral power, information, and uncertainty quantification from time–frequency representations. Stable multiresolution spectral entropy algorithms are constructed with continuous wavelet and Stockwell transforms. The recommended processing and scaling method will depend on the signature of interest, the [...] Read more.
The constant-Q Gabor atom is developed for spectral power, information, and uncertainty quantification from time–frequency representations. Stable multiresolution spectral entropy algorithms are constructed with continuous wavelet and Stockwell transforms. The recommended processing and scaling method will depend on the signature of interest, the desired information, and the acceptable levels of uncertainty of signal and noise features. Selected Lamb wave signatures and information spectra from the 2022 Tonga eruption are presented as representative case studies. Resilient transformations from physical to information metrics are provided for sensor-agnostic signal processing, pattern recognition, and machine learning applications. Full article
(This article belongs to the Section Signal and Data Analysis)
Show Figures

Figure 1

23 pages, 2745 KB  
Article
Cervical Cancer Diagnosis Based on Multi-Domain Features Using Deep Learning Enhanced by Handcrafted Descriptors
by Omneya Attallah
Appl. Sci. 2023, 13(3), 1916; https://doi.org/10.3390/app13031916 - 2 Feb 2023
Cited by 69 | Viewed by 5854
Abstract
Cervical cancer, among the most frequent adverse cancers in women, could be avoided through routine checks. The Pap smear check is a widespread screening methodology for the timely identification of cervical cancer, but it is susceptible to human mistakes. Artificial Intelligence-reliant computer-aided diagnostic [...] Read more.
Cervical cancer, among the most frequent adverse cancers in women, could be avoided through routine checks. The Pap smear check is a widespread screening methodology for the timely identification of cervical cancer, but it is susceptible to human mistakes. Artificial Intelligence-reliant computer-aided diagnostic (CAD) methods have been extensively explored to identify cervical cancer in order to enhance the conventional testing procedure. In order to attain remarkable classification results, most current CAD systems require pre-segmentation steps for the extraction of cervical cells from a pap smear slide, which is a complicated task. Furthermore, some CAD models use only hand-crafted feature extraction methods which cannot guarantee the sufficiency of classification phases. In addition, if there are few data samples, such as in cervical cell datasets, the use of deep learning (DL) alone is not the perfect choice. In addition, most existing CAD systems obtain attributes from one domain, but the integration of features from multiple domains usually increases performance. Hence, this article presents a CAD model based on extracting features from multiple domains not only one domain. It does not require a pre-segmentation process thus it is less complex than existing methods. It employs three compact DL models to obtain high-level spatial deep features rather than utilizing an individual DL model with large number of parameters and layers as used in current CADs. Moreover, it retrieves several statistical and textural descriptors from multiple domains including spatial and time–frequency domains instead of employing features from a single domain to demonstrate a clearer representation of cervical cancer features, which is not the case in most existing CADs. It examines the influence of each set of handcrafted attributes on diagnostic accuracy independently and hybrid. It then examines the consequences of combining each DL feature set obtained from each CNN with the combined handcrafted features. Finally, it uses principal component analysis to merge the entire DL features with the combined handcrafted features to investigate the effect of merging numerous DL features with various handcrafted features on classification results. With only 35 principal components, the accuracy achieved by the quatric SVM of the proposed CAD reached 100%. The performance of the described CAD proves that combining several DL features with numerous handcrafted descriptors from multiple domains is able to boost diagnostic accuracy. Additionally, the comparative performance analysis, along with other present studies, shows the competing capacity of the proposed CAD. Full article
(This article belongs to the Special Issue Artificial Intelligence (AI) in Healthcare)
Show Figures

Figure 1

Back to TopTop