MDPI - Publisher of Open Access Journals

25 pages, 911 KB

Open AccessArticle

Migraine and Epilepsy Discrimination Using DTCWT and Random Subspace Ensemble Classifier

by Tuba Nur Subasi and Abdulhamit Subasi

Mach. Learn. Knowl. Extr. 2026, 8(2), 35; https://doi.org/10.3390/make8020035 - 4 Feb 2026

Viewed by 429

Migraine and epilepsy are common neurological disorders that share overlapping symptoms, such as visual disturbances and altered consciousness, making accurate diagnosis challenging. Although their underlying mechanisms differ, both conditions involve recurrent irregular brain activity, and traditional EEG-based diagnosis relies heavily on clinical interpretation, [...] Read more.

Migraine and epilepsy are common neurological disorders that share overlapping symptoms, such as visual disturbances and altered consciousness, making accurate diagnosis challenging. Although their underlying mechanisms differ, both conditions involve recurrent irregular brain activity, and traditional EEG-based diagnosis relies heavily on clinical interpretation, which may be subjective and insufficient for clear differentiation. To address this challenge, this study introduces an automated EEG classification framework combining Dual Tree Complex Wavelet Transform (DTCWT) for feature extraction with a Random Subspace Ensemble Classifier for multi-class discrimination. EEG data recorded under photic and nonphotic stimulation were analyzed to capture both temporal and frequency characteristics. DTCWT proved effective in modeling the non-stationary nature of EEG signals and extracting condition-specific features, while the ensemble classifier improved generalization by training multiple models on diverse feature subsets. The proposed system achieved an average accuracy of 99.50%, along with strong F-measure, AUC, and Kappa scores. Notably, although previous studies suggest heightened EEG activity in migraine patients during flash stimulation, findings here indicate that flash stimulation alone does not reliably distinguish migraine from epilepsy. Overall, this research highlights the promise of advanced signal processing and machine learning techniques in enhancing diagnostic precision for complex neurological disorders. Full article

(This article belongs to the Section Learning)

► Show Figures

Figure 1

20 pages, 5660 KB

Open AccessArticle

Fault Diagnosis for Imbalanced Datasets Based on Deep Convolution Fuzzy System

by Junwei Zhu and Linfang Zhu

Machines 2025, 13(4), 326; https://doi.org/10.3390/machines13040326 - 17 Apr 2025

Cited by 1 | Viewed by 1034

Abstract

To address the data imbalance issue in the process of collecting bearing fault data in industrial environments and to enhance the robustness and generalization ability of fault diagnosis, this paper proposes a bearing fault diagnosis method based on a Bidirectional Autoregressive Variational Autoencoder [...] Read more.

To address the data imbalance issue in the process of collecting bearing fault data in industrial environments and to enhance the robustness and generalization ability of fault diagnosis, this paper proposes a bearing fault diagnosis method based on a Bidirectional Autoregressive Variational Autoencoder (BAVAE) and a Deep Convolutional Interval Type-2 Fuzzy System (DCIT2FS). First, the method extracts features from the imbalanced dataset using dual-tree complex wavelet transform (DTCWT), and then feeds the feature dataset into the proposed BAVAE for data augmentation. The BAVAE improves data generation capabilities by introducing autoregressive distributions to learn latent variables, iteratively obtaining complex high-order latent variables, and amplifying inter-class differences through the introduction of feature discrimination loss during training. Given that relying solely on data augmentation under imbalanced data conditions may lead to overfitting or underfitting, this paper combines the generalization approximation ability of Interval Type-2 (IT2) fuzzy systems with the feature extraction capability of deep convolutional networks, achieving a better balance between model complexity and feature transformation, thereby enhancing the stability and accuracy of the final diagnosis. Full article

(This article belongs to the Section Machines Testing and Maintenance)

► Show Figures

Figure 1

19 pages, 8990 KB

Open AccessArticle

Optimizing Image Watermarking with Dual-Tree Complex Wavelet Transform and Particle Swarm Intelligence for Secure and High-Quality Protection

by Abed Al Raoof Bsoul and Alaa Bani Ismail

Appl. Sci. 2025, 15(3), 1315; https://doi.org/10.3390/app15031315 - 27 Jan 2025

Cited by 5 | Viewed by 2212

Abstract

Watermarking is a technique used to address issues related to the widespread use of the internet, such as copyright protection, tamper localization, and authentication. However, most watermarking approaches negatively affect the quality of the original image. In this research, we propose an optimized [...] Read more.

Watermarking is a technique used to address issues related to the widespread use of the internet, such as copyright protection, tamper localization, and authentication. However, most watermarking approaches negatively affect the quality of the original image. In this research, we propose an optimized image watermarking approach that utilizes the dual-tree complex wavelet transform and particle swarm optimization algorithm. Our approach focuses on maintaining the highest possible quality of the watermarked image by minimizing any noticeable changes. During the embedding phase, we break down the original image using a technique called dual-tree complex wavelet transform (DTCWT) and then use particle swarm optimization (PSO) to choose specific coefficients. We embed the bits of a binary logo into the least significant bits of these selected coefficients, creating the watermarked image. To extract the watermark, we reverse the embedding process by first decomposing both versions of the input image using DTCWT and extracting the same coefficients to retrieve those corresponding bits (watermark). In our experiments, we used a common dataset from watermarking research to demonstrate the functionality against various watermarked copies and peak signal-to-noise ratio (PSNR) and normalized cross-correlation (NCC) metrics. The PSNR is a measure of how well the watermarked image maintains its original quality, and the NCC reflects how accurately the watermark can be extracted. Our method gives mean PSNR and NCC of 80.50% and 92.51%, respectively. Full article

(This article belongs to the Special Issue Digital Image Processing: Technologies and Applications)

► Show Figures

Figure 1

15 pages, 5974 KB

Open AccessArticle

Structural Damage Early Warning Method of Quayside Container Crane Based on Fuzzy Entropy Ratio Variation Deviation

by Jiahui Liu, Jian Zhao, Dong Zhao and Xianrong Qin

Sensors 2024, 24(23), 7575; https://doi.org/10.3390/s24237575 - 27 Nov 2024

Cited by 2 | Viewed by 1305

Abstract

Real-time monitoring and early warning of structures are essential for assessing structural health and ensuring safety maintenance. To improve the timeliness of early warnings for structural abnormal states in quayside container cranes (QCCs) with incomplete damage data, a structural abnormal state early warning [...] Read more.

Real-time monitoring and early warning of structures are essential for assessing structural health and ensuring safety maintenance. To improve the timeliness of early warnings for structural abnormal states in quayside container cranes (QCCs) with incomplete damage data, a structural abnormal state early warning method based on fuzzy entropy ratio variation deviation (FERVD) is proposed. First, monitoring data are subjected to dual-tree complex wavelet transform (DTCWT). The adaptive frequency bands obtained from the decomposition, combined with fuzzy entropy (FE), are used to extract response signal features and construct the FERVD warning indicator. Based on this indicator, dynamic thresholds for early warning are established to differentiate between structural health states and various damage conditions. Secondly, a finite element model of structure for QCCs is developed. By simulating damage at various locations and severities through the stiffness reduction of different elements, a comprehensive structural simulation monitoring dataset is generated. The efficacy of the proposed early warning method is validated through numerical experiments and engineering case studies. The numerical results demonstrate that the proposed method effectively distinguishes between different damage conditions and provides timely warnings for various damage states. Furthermore, engineering case analysis shows that when the structure is in a healthy state, the FERVD values at different monitoring points fluctuate within the threshold range, indicating the applicability of the proposed method in the structural health monitoring (SHM) of QCCs. Full article

(This article belongs to the Special Issue Sensor-Based Frequency, Time–Frequency and Higher-Order Signal Processing for Condition Monitoring, Structural Health Monitoring and Non-Destructive Testing (Second Edition))

► Show Figures

Figure 1

14 pages, 3240 KB

Open AccessArticle

Buried PE Pipeline Location Method Based on Double-Tree Complex Wavelet Cross-Correlation Delay

by Yang Li, Hanyu Zhang, Zhuo Xu, Ao Zhang, Xianfa Liu, Pengyao Sun and Xianchao Sun

Sensors 2024, 24(22), 7310; https://doi.org/10.3390/s24227310 - 15 Nov 2024

Cited by 3 | Viewed by 1526

Abstract

This study presents a location method for buried polyethylene (PE) pipelines based on the double-tree complex wavelet cross-correlation delay. Initially, the dual-tree complex wavelet transform (DTCWT) is applied to denoise the acquired signal, followed by extracting the delay time through the cross-correlation function [...] Read more.

This study presents a location method for buried polyethylene (PE) pipelines based on the double-tree complex wavelet cross-correlation delay. Initially, the dual-tree complex wavelet transform (DTCWT) is applied to denoise the acquired signal, followed by extracting the delay time through the cross-correlation function to locate the buried pipeline. A simulation model is established to analyze the peak values of the time-domain signals in both asymmetric and symmetric sensor layouts using COMSOL, determining the relationship between the signal time differences and pipeline positions. Then, an experimental test system is set up, and experiments are carried out under the conditions of asymmetric and symmetrical sensors and different excitation points. The results indicate that the maximum error is 4.6% for asymmetric arrangements and less than 1% for symmetric arrangements. In practical applications, the pipeline’s position can be inferred from the delay time, with higher accuracy observed as the excitation point approaches the sensor. This method addresses the limitations of existing pipeline locating techniques and provides a foundation for the development of pipeline positioning technology. Full article

(This article belongs to the Special Issue Mechanical, Civil and Petroleum Engineering: Advances in Sensors and Measurement Methods of Multi-Scale Systems)

► Show Figures

Figure 1

24 pages, 19107 KB

Open AccessArticle

Enhanced Wavelet Scattering Network for Image Inpainting Detection

by Adrian-Alin Barglazan and Remus Brad

Computation 2024, 12(11), 228; https://doi.org/10.3390/computation12110228 - 13 Nov 2024

Cited by 3 | Viewed by 2911

Abstract

The rapid advancement of image inpainting tools, especially those aimed at removing artifacts, has made digital image manipulation alarmingly accessible. This paper proposes several innovative ideas for detecting inpainting forgeries based on a low-level noise analysis by combining Dual-Tree Complex Wavelet Transform (DT-CWT) [...] Read more.

The rapid advancement of image inpainting tools, especially those aimed at removing artifacts, has made digital image manipulation alarmingly accessible. This paper proposes several innovative ideas for detecting inpainting forgeries based on a low-level noise analysis by combining Dual-Tree Complex Wavelet Transform (DT-CWT) for feature extraction with convolutional neural networks (CNN) for forged area detection and localization, and lastly by employing an innovative combination of texture segmentation with noise variance estimations. The DT-CWT offers significant advantages due to its shift-invariance, enhancing its robustness against subtle manipulations during the inpainting process. Furthermore, its directional selectivity allows for the detection of subtle artifacts introduced by inpainting within specific frequency bands and orientations. Various neural network architectures were evaluated and proposed. Lastly, we propose a fusion detection module that combines texture analysis with noise variance estimation to give the forged area. Also, to address the limitations of existing inpainting datasets, particularly their lack of clear separation between inpainted regions and removed objects—which can inadvertently favor detection—we introduced a new dataset named the Real Inpainting Detection Dataset. Our approach was benchmarked against state-of-the-art methods and demonstrated superior performance over all cited alternatives. Full article

► Show Figures

Figure 1

18 pages, 8989 KB

Open AccessArticle

A Novel Method for Heat Haze-Induced Error Mitigation in Vision-Based Bridge Displacement Measurement

by Xintong Kong, Baoquan Wang, Dongming Feng, Chenchen Yuan, Ruoyu Gu, Weihang Ren and Kaijing Wei

Sensors 2024, 24(16), 5151; https://doi.org/10.3390/s24165151 - 9 Aug 2024

Cited by 1 | Viewed by 2172

Abstract

Vision-based techniques have become widely applied in structural displacement monitoring. However, heat haze poses a great threat to the precision of vision systems by creating distortions in the images. This paper proposes a vision-based bridge displacement measurement technique with heat haze mitigation capability. [...] Read more.

Vision-based techniques have become widely applied in structural displacement monitoring. However, heat haze poses a great threat to the precision of vision systems by creating distortions in the images. This paper proposes a vision-based bridge displacement measurement technique with heat haze mitigation capability. The properties of heat haze-induced errors are illustrated. A dual-tree complex wavelet transform (DT-CWT) is used to mitigate the heat haze in images, and the speeded-up robust features (SURF) algorithm is employed to extract the displacement. The proposed method is validated through indoor experiments on a bridge model. The designed vision system achieves high measurement accuracy in a heat haze-free condition. The proposed mitigation method successfully corrects 61.05% of heat haze-induced errors in static experiments and 95.31% in dynamic experiments. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

20 pages, 10858 KB

Open AccessArticle

PolSAR Image Classification with Active Complex-Valued Convolutional-Wavelet Neural Network and Markov Random Fields

by Lu Liu and Yongxiang Li

Remote Sens. 2024, 16(6), 1094; https://doi.org/10.3390/rs16061094 - 20 Mar 2024

Cited by 6 | Viewed by 2629

Abstract

PolSAR image classification has attracted extensive significant research in recent decades. Aiming at improving PolSAR classification performance with speckle noise, this paper proposes an active complex-valued convolutional-wavelet neural network by incorporating dual-tree complex wavelet transform (DT-CWT) and Markov random field (MRF). In this [...] Read more.

PolSAR image classification has attracted extensive significant research in recent decades. Aiming at improving PolSAR classification performance with speckle noise, this paper proposes an active complex-valued convolutional-wavelet neural network by incorporating dual-tree complex wavelet transform (DT-CWT) and Markov random field (MRF). In this approach, DT-CWT is introduced into the complex-valued convolutional neural network to suppress the speckle noise of PolSAR images and maintain the structures of learned feature maps. In addition, by applying active learning (AL), we iteratively select the most informative unlabeled training samples of PolSAR datasets. Moreover, MRF is utilized to obtain spatial local correlation information, which has been proven to be effective in improving classification performance. The experimental results on three benchmark PolSAR datasets demonstrate that the proposed method can achieve a significant classification performance gain in terms of its effectiveness and robustness beyond some state-of-the-art deep learning methods. Full article

(This article belongs to the Special Issue Artificial Intelligence Algorithm for Remote Sensing Imagery Processing III)

► Show Figures

Figure 1

22 pages, 4983 KB

Open AccessArticle

RiPa-Net: Recognition of Rice Paddy Diseases with Duo-Layers of CNNs Fostered by Feature Transformation and Selection

by Omneya Attallah

Biomimetics 2023, 8(5), 417; https://doi.org/10.3390/biomimetics8050417 - 7 Sep 2023

Cited by 9 | Viewed by 2568

Abstract

Rice paddy diseases significantly reduce the quantity and quality of crops, so it is essential to recognize them quickly and accurately for prevention and control. Deep learning (DL)-based computer-assisted expert systems are encouraging approaches to solving this issue and dealing with the dearth [...] Read more.

Rice paddy diseases significantly reduce the quantity and quality of crops, so it is essential to recognize them quickly and accurately for prevention and control. Deep learning (DL)-based computer-assisted expert systems are encouraging approaches to solving this issue and dealing with the dearth of subject-matter specialists in this area. Nonetheless, a major generalization obstacle is posed by the existence of small discrepancies between various classes of paddy diseases. Numerous studies have used features taken from a single deep layer of an individual complex DL construction with many deep layers and parameters. All of them have relied on spatial knowledge only to learn their recognition models trained with a large number of features. This study suggests a pipeline called “RiPa-Net” based on three lightweight CNNs that can identify and categorize nine paddy diseases as well as healthy paddy. The suggested pipeline gathers features from two different layers of each of the CNNs. Moreover, the suggested method additionally applies the dual-tree complex wavelet transform (DTCWT) to the deep features of the first layer to obtain spectral–temporal information. Additionally, it incorporates the deep features of the first layer of the three CNNs using principal component analysis (PCA) and discrete cosine transform (DCT) transformation methods, which reduce the dimension of the first layer features. The second layer’s spatial deep features are then combined with these fused time-frequency deep features. After that, a feature selection process is introduced to reduce the size of the feature vector and choose only those features that have a significant impact on the recognition process, thereby further reducing recognition complexity. According to the results, combining deep features from two layers of different lightweight CNNs can improve recognition accuracy. Performance also improves as a result of the acquired spatial–spectral–temporal information used to learn models. Using 300 features, the cubic support vector machine (SVM) achieves an outstanding accuracy of 97.5%. The competitive ability of the suggested pipeline is confirmed by a comparison of the experimental results with findings from previously conducted research on the recognition of paddy diseases. Full article

(This article belongs to the Special Issue Biomimetics in Agri-Food: From Preliminary Design to Field Applications)

► Show Figures

Figure 1

14 pages, 6266 KB

Open AccessArticle

Supervised Single Channel Speech Enhancement Method Using UNET

by Md. Nahid Hossain, Samiul Basir, Md. Shakhawat Hosen, A.O.M. Asaduzzaman, Md. Mojahidul Islam, Mohammad Alamgir Hossain and Md Shohidul Islam

Electronics 2023, 12(14), 3052; https://doi.org/10.3390/electronics12143052 - 12 Jul 2023

Cited by 11 | Viewed by 5216

Abstract

This paper proposes an innovative single-channel supervised speech enhancement (SE) method based on UNET, a convolutional neural network (CNN) architecture that expands on a few changes in the basic CNN architecture. In the training phase, short-time Fourier transform (STFT) is exploited on the [...] Read more.

This paper proposes an innovative single-channel supervised speech enhancement (SE) method based on UNET, a convolutional neural network (CNN) architecture that expands on a few changes in the basic CNN architecture. In the training phase, short-time Fourier transform (STFT) is exploited on the noisy time domain signal to build a noisy time-frequency domain signal which is called a complex noisy matrix. We take the real and imaginary parts of the complex noisy matrix and concatenate both of them to form the noisy concatenated matrix. We apply UNET to the noisy concatenated matrix for extracting speech components and train the CNN model. In the testing phase, the same procedure is applied to the noisy time-domain signal as in the training phase in order to construct another noisy concatenated matrix that can be tested using a pre-trained or saved model in order to construct an enhanced concatenated matrix. Finally, from the enhanced concatenated matrix, we separate both the imaginary and real parts to form an enhanced complex matrix. Magnitude and phase are then extracted from the newly created enhanced complex matrix. By using that magnitude and phase, the inverse STFT (ISTFT) can generate the enhanced speech signal. Utilizing the IEEE databases and various types of noise, including stationary and non-stationary noise, the proposed method is evaluated. Comparing the exploratory results of the proposed algorithm to the other five methods of STFT, sparse non-negative matrix factorization (SNMF), dual-tree complex wavelet transform (DTCWT)-SNMF, DTCWT-STFT-SNMF, STFT-convolutional denoising auto encoder (CDAE) and casual multi-head attention mechanism (CMAM) for speech enhancement, we determine that the proposed algorithm generally improves speech quality and intelligibility at all considered signal-to-noise ratios (SNRs). The suggested approach performs better than the other five competing algorithms in every evaluation metric. Full article

(This article belongs to the Special Issue Machine Learning in Music/Audio Signal Processing)

► Show Figures

Figure 1

19 pages, 5449 KB

Open AccessArticle

Optimal Deep Learning Architecture for Automated Segmentation of Cysts in OCT Images Using X-Let Transforms

by Reza Darooei, Milad Nazari, Rahele Kafieh and Hossein Rabbani

Diagnostics 2023, 13(12), 1994; https://doi.org/10.3390/diagnostics13121994 - 7 Jun 2023

Cited by 9 | Viewed by 2376

Abstract

The retina is a thin, light-sensitive membrane with a multilayered structure found in the back of the eyeball. There are many types of retinal disorders. The two most prevalent retinal illnesses are Age-Related Macular Degeneration (AMD) and Diabetic Macular Edema (DME). Optical Coherence [...] Read more.

The retina is a thin, light-sensitive membrane with a multilayered structure found in the back of the eyeball. There are many types of retinal disorders. The two most prevalent retinal illnesses are Age-Related Macular Degeneration (AMD) and Diabetic Macular Edema (DME). Optical Coherence Tomography (OCT) is a vital retinal imaging technology. X-lets (such as curvelet, DTCWT, contourlet, etc.) have several benefits in image processing and analysis. They can capture both local and non-local features of an image simultaneously. The aim of this paper is to propose an optimal deep learning architecture based on sparse basis functions for the automated segmentation of cystic areas in OCT images. Different X-let transforms were used to produce different network inputs, including curvelet, Dual-Tree Complex Wavelet Transform (DTCWT), circlet, and contourlet. Additionally, three different combinations of these transforms are suggested to achieve more accurate segmentation results. Various metrics, including Dice coefficient, sensitivity, false positive ratio, Jaccard index, and qualitative results, were evaluated to find the optimal networks and combinations of the X-let’s sub-bands. The proposed network was tested on both original and noisy datasets. The results show the following facts: (1) contourlet achieves the optimal results between different combinations; (2) the five-channel decomposition using high-pass sub-bands of contourlet transform achieves the best performance; and (3) the five-channel decomposition using high-pass sub-bands formations out-performs the state-of-the-art methods, especially in the noisy dataset. The proposed method has the potential to improve the accuracy and speed of the segmentation process in clinical settings, facilitating the diagnosis and treatment of retinal diseases. Full article

(This article belongs to the Section Biomedical Optics)

► Show Figures

Figure 1

14 pages, 4077 KB

Open AccessArticle

Deep versus Handcrafted Tensor Radiomics Features: Prediction of Survival in Head and Neck Cancer Using Machine Learning and Fusion Techniques

by Mohammad R. Salmanpour, Seyed Masoud Rezaeijo, Mahdi Hosseinzadeh and Arman Rahmim

Diagnostics 2023, 13(10), 1696; https://doi.org/10.3390/diagnostics13101696 - 11 May 2023

Cited by 70 | Viewed by 4770

Abstract

Background: Although handcrafted radiomics features (RF) are commonly extracted via radiomics software, employing deep features (DF) extracted from deep learning (DL) algorithms merits significant investigation. Moreover, a “tensor’’ radiomics paradigm where various flavours of a given feature are generated and explored can provide [...] Read more.

Background: Although handcrafted radiomics features (RF) are commonly extracted via radiomics software, employing deep features (DF) extracted from deep learning (DL) algorithms merits significant investigation. Moreover, a “tensor’’ radiomics paradigm where various flavours of a given feature are generated and explored can provide added value. We aimed to employ conventional and tensor DFs, and compare their outcome prediction performance to conventional and tensor RFs. Methods: 408 patients with head and neck cancer were selected from TCIA. PET images were first registered to CT, enhanced, normalized, and cropped. We employed 15 image-level fusion techniques (e.g., dual tree complex wavelet transform (DTCWT)) to combine PET and CT images. Subsequently, 215 RFs were extracted from each tumor in 17 images (or flavours) including CT only, PET only, and 15 fused PET-CT images through the standardized-SERA radiomics software. Furthermore, a 3 dimensional autoencoder was used to extract DFs. To predict the binary progression-free-survival-outcome, first, an end-to-end CNN algorithm was employed. Subsequently, we applied conventional and tensor DFs vs. RFs as extracted from each image to three sole classifiers, namely multilayer perceptron (MLP), random-forest, and logistic regression (LR), linked with dimension reduction algorithms. Results: DTCWT fusion linked with CNN resulted in accuracies of 75.6 ± 7.0% and 63.4 ± 6.7% in five-fold cross-validation and external-nested-testing, respectively. For the tensor RF-framework, polynomial transform algorithms + analysis of variance feature selector (ANOVA) + LR enabled 76.67 ± 3.3% and 70.6 ± 6.7% in the mentioned tests. For the tensor DF framework, PCA + ANOVA + MLP arrived at 87.0 ± 3.5% and 85.3 ± 5.2% in both tests. Conclusions: This study showed that tensor DF combined with proper machine learning approaches enhanced survival prediction performance compared to conventional DF, tensor and conventional RF, and end-to-end CNN frameworks. Full article

(This article belongs to the Special Issue Machine Learning in Radiomics: Opportunities and Challenges)

► Show Figures

Figure 1

10 pages, 1165 KB

Open AccessBrief Report

Invariant Pattern Recognition with Log-Polar Transform and Dual-Tree Complex Wavelet-Fourier Features

by Guangyi Chen and Adam Krzyzak

Sensors 2023, 23(8), 3842; https://doi.org/10.3390/s23083842 - 9 Apr 2023

Cited by 5 | Viewed by 2839

Abstract

In this paper, we propose a novel method for 2D pattern recognition by extracting features with the log-polar transform, the dual-tree complex wavelet transform (DTCWT), and the 2D fast Fourier transform (FFT2). Our new method is invariant to translation, rotation, and scaling of [...] Read more.

In this paper, we propose a novel method for 2D pattern recognition by extracting features with the log-polar transform, the dual-tree complex wavelet transform (DTCWT), and the 2D fast Fourier transform (FFT2). Our new method is invariant to translation, rotation, and scaling of the input 2D pattern images in a multiresolution way, which is very important for invariant pattern recognition. We know that very low-resolution sub-bands lose important features in the pattern images, and very high-resolution sub-bands contain significant amounts of noise. Therefore, intermediate-resolution sub-bands are good for invariant pattern recognition. Experiments on one printed Chinese character dataset and one 2D aircraft dataset show that our new method is better than two existing methods for a combination of rotation angles, scaling factors, and different noise levels in the input pattern images in most testing cases. Full article

(This article belongs to the Topic Applications in Image Analysis and Pattern Recognition)

► Show Figures

Figure 1

24 pages, 5712 KB

Open AccessArticle

Improved Procedure for Multi-Focus Image Quality Enhancement Using Image Fusion with Rules of Texture Energy Measures in the Hybrid Wavelet Domain

by Chinnem Rama Mohan, Siddavaram Kiran and Vasudeva

Appl. Sci. 2023, 13(4), 2138; https://doi.org/10.3390/app13042138 - 7 Feb 2023

Cited by 8 | Viewed by 2365

Abstract

Feature extraction is a collection of the necessary detailed information from the given source, which holds the information for further analysis. The quality of the fused image depends on many parameters, particularly its directional selectivity and shift-invariance. On the other hand, the traditional [...] Read more.

Feature extraction is a collection of the necessary detailed information from the given source, which holds the information for further analysis. The quality of the fused image depends on many parameters, particularly its directional selectivity and shift-invariance. On the other hand, the traditional wavelet-based transforms produce ringing distortions and artifacts due to poor directionality and shift invariance. The Dual-Tree Complex Wavelet Transforms (DTCWT) combined with Stationary Wavelet Transform (SWT) as a hybrid wavelet fusion algorithm overcomes the deficiencies of the traditional wavelet-based fusion algorithm and preserves the directional and shift invariance properties. The purpose of SWT is to decompose the given source image into approximate and detailed sub-bands. Further, approximate sub-bands of the given source are decomposed with DTCWT. In this extraction, low-frequency components are considered to implement Texture Energy Measures (TEM), and high-frequency components are considered to implement the absolute-maximum fusion rule. For the detailed sub-bands, the absolute-maximum fusion rule is implemented. The texture energy rules have significantly classified the image and improved the output image’s accuracy after fusion. Finally, inverse SWT is applied to generate an extended fused image. Experimental results are evaluated to show that the proposed approach outperforms approaches reported earlier. This paper proposes a fusion method based on SWT, DTCWT, and TEM to address the inherent defects of both the Parameter Adaptive-Dual Channel Pulse coupled neural network (PA-DCPCNN) and Multiscale Transform-Convolutional Sparse Representation (MST-CSR). Full article

(This article belongs to the Special Issue Multimedia Communications Using Machine Learning)

► Show Figures

Figure 1

20 pages, 5182 KB

Open AccessArticle

Dual-Tree Complex Wavelet Input Transform for Cyst Segmentation in OCT Images Based on a Deep Learning Framework

by Reza Darooei, Milad Nazari, Rahele Kafieh and Hossein Rabbani

Photonics 2023, 10(1), 11; https://doi.org/10.3390/photonics10010011 - 23 Dec 2022

Cited by 6 | Viewed by 4350

Abstract

Optical coherence tomography (OCT) represents a non-invasive, high-resolution cross-sectional imaging modality. Macular edema is the swelling of the macular region. Segmentation of fluid or cyst regions in OCT images is essential, to provide useful information for clinicians and prevent visual impairment. However, manual [...] Read more.

Optical coherence tomography (OCT) represents a non-invasive, high-resolution cross-sectional imaging modality. Macular edema is the swelling of the macular region. Segmentation of fluid or cyst regions in OCT images is essential, to provide useful information for clinicians and prevent visual impairment. However, manual segmentation of fluid regions is a time-consuming and subjective procedure. Traditional and off-the-shelf deep learning methods fail to extract the exact location of the boundaries under complicated conditions, such as with high noise levels and blurred edges. Therefore, developing a tailored automatic image segmentation method that exhibits good numerical and visual performance is essential for clinical application. The dual-tree complex wavelet transform (DTCWT) can extract rich information from different orientations of image boundaries and extract details that improve OCT fluid semantic segmentation results in difficult conditions. This paper presents a comparative study of using DTCWT subbands in the segmentation of fluids. To the best of our knowledge, no previous studies have focused on the various combinations of wavelet transforms and the role of each subband in OCT cyst segmentation. In this paper, we propose a semantic segmentation composite architecture based on a novel U-net and information from DTCWT subbands. We compare different combination schemes, to take advantage of hidden information in the subbands, and demonstrate the performance of the methods under original and noise-added conditions. Dice score, Jaccard index, and qualitative results are used to assess the performance of the subbands. The combination of subbands yielded high Dice and Jaccard values, outperforming the other methods, especially in the presence of a high level of noise. Full article

(This article belongs to the Special Issue Machine Learning in Photonics)

► Show Figures

Figure 1

Search Results (36)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (36)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI