Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (864)

Search Parameters:
Keywords = wavelet domain

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
40 pages, 47306 KB  
Review
Advances in EMG Signal Processing and Pattern Recognition: Techniques, Challenges, and Emerging Applications
by Lasitha Piyathilaka, Jung-Hoon Sul, Sanura Dunu Arachchige, Amal Jayawardena and Diluka Moratuwage
Electronics 2026, 15(3), 590; https://doi.org/10.3390/electronics15030590 - 29 Jan 2026
Abstract
Electromyography (EMG) has become essential in biomedical engineering, rehabilitation, and human–machine interfacing due to its ability to capture neuromuscular activation for control, monitoring, and diagnosis. Recent advances in sensing hardware, high-density and flexible electrodes, and embedded acquisition modules combined with modern signal processing [...] Read more.
Electromyography (EMG) has become essential in biomedical engineering, rehabilitation, and human–machine interfacing due to its ability to capture neuromuscular activation for control, monitoring, and diagnosis. Recent advances in sensing hardware, high-density and flexible electrodes, and embedded acquisition modules combined with modern signal processing and machine learning have significantly enhanced the robustness and applicability of EMG-based systems. This review provides an integrated overview of EMG generation, acquisition standards, and preprocessing techniques, including adaptive filtering, wavelet denoising, and empirical mode decomposition. Feature extraction methods across the time, frequency, time–frequency, and nonlinear domains are compared with respect to computational efficiency and suitability for real-time systems. The review synthesizes classical and contemporary pattern-recognition approaches, from statistical classifiers to deep architectures such as CNNs, RNNs, hybrid CNN–RNN models, transformer-based networks, and graph neural networks. Key challenges, including signal non-stationarity, electrode displacement, muscle fatigue, and poor cross-user or cross-session generalization, are examined alongside emerging strategies such as transfer learning, domain adaptation, and multimodal fusion with IMU or FMG signals. Finally, the paper surveys rapidly growing EMG applications in prosthetics, rehabilitation robotics, human–machine interfaces, clinical diagnostics, and sports analytics. The review highlights ongoing limitations and outlines future pathways toward robust, adaptive, and deployable EMG-driven intelligent systems. Full article
(This article belongs to the Special Issue Image and Signal Processing Techniques and Applications)
Show Figures

Figure 1

21 pages, 477 KB  
Article
Symbolic Manifolds and Transform Closure: A Geometric Framework for Operator-Invariant Structure
by Robert Castro
Mathematics 2026, 14(3), 461; https://doi.org/10.3390/math14030461 - 28 Jan 2026
Abstract
We introduce a geometric framework in which classical transforms are represented as coordinate charts on a symbolic manifold. The construction defines symbolic curvature (κ), strain (τ), compressibility (σ), and the ratio Γ = κ/τ, which together provide a diagnostic coordinate system for comparing [...] Read more.
We introduce a geometric framework in which classical transforms are represented as coordinate charts on a symbolic manifold. The construction defines symbolic curvature (κ), strain (τ), compressibility (σ), and the ratio Γ = κ/τ, which together provide a diagnostic coordinate system for comparing representational stability across chart transitions. Within this setting, transforms such as Fourier, Laplace, wavelet, Jordan, and polynomial projection can be treated as charts connected by transition maps that preserve Γ on specified domains. We also introduce a symmetric positive-definite metric tensor Gab to quantify displacement in the invariant coordinates and to formalize minimal-effort paths (geodesics) under modeling assumptions stated in the text. The resulting framework provides a reproducible screening method for evaluating transform stability, diagnosing closure failure, and comparing transform behavior under a shared set of invariants. Full article
25 pages, 11974 KB  
Article
Restoring Ambiguous Boundaries: An Efficient and Robust Framework for Underwater Camouflaged Object Detection
by Zihan Wei, Yucheng Zheng, Yaohua Shen and Xiaofei Yang
Sensors 2026, 26(3), 872; https://doi.org/10.3390/s26030872 - 28 Jan 2026
Abstract
The efficacy of Underwater Camouflaged Object Detection (UCOD) is fundamentally constrained by severe boundary ambiguity, where biological mimicry blends targets into complex backgrounds and aquatic optical degradation erodes edge details. We propose a lightweight boundary perception detector named CAR-YOLO (Camouflage Ambiguity Resolution YOLO). [...] Read more.
The efficacy of Underwater Camouflaged Object Detection (UCOD) is fundamentally constrained by severe boundary ambiguity, where biological mimicry blends targets into complex backgrounds and aquatic optical degradation erodes edge details. We propose a lightweight boundary perception detector named CAR-YOLO (Camouflage Ambiguity Resolution YOLO). Specifically, a frequency-domain dual-path mechanism (FRM-DWT/EG-IWT) leverages selective wavelet aggregation and dynamic injection to recover high-frequency edges. Subsequently, these high-frequency cues are synergized with low-frequency semantic information via the Low-level Adaptive Fusion (LAF) module. To further address noisy samples, an Uncertainty Calibration Head (UCH) refines supervision via prediction consistency. Finally, we constructed specialized datasets based on public data for training and evaluation, including UCOD10K and UWB-COT220. On UCOD10K, CAR-YOLO achieves 27.1% mAP50–95, surpassing several state-of-the-art (SOTA) methods while reducing parameters from 2.58 M to 2.43 M and GFLOPs from 6.3 to 5.9. On the challenging UWB-COT220 benchmark, the model attains 30.7% mAP50–95, marking a 7.7-point improvement over YOLOv11. Furthermore, cross-domain experiments on UODD demonstrate strong generalization. These results indicate that CAR-YOLO effectively mitigates boundary ambiguity, achieving an optimal balance between accuracy, robustness, and efficiency. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

27 pages, 4051 KB  
Article
Lossless Compression of Large Field-of-View Infrared Video Based on Transform Domain Hybrid Prediction
by Ya Liu, Rui Zhang, Yong Zhang and Yuwei Chen
Sensors 2026, 26(3), 868; https://doi.org/10.3390/s26030868 - 28 Jan 2026
Abstract
Large field-of-view (FOV) infrared imaging, widely utilized in applications including target detection and remote sensing, generates massive datasets that pose significant challenges for transmission and storage. To address this issue, we propose an efficient lossless compression method for large FOV infrared video. Our [...] Read more.
Large field-of-view (FOV) infrared imaging, widely utilized in applications including target detection and remote sensing, generates massive datasets that pose significant challenges for transmission and storage. To address this issue, we propose an efficient lossless compression method for large FOV infrared video. Our approach employs a hybrid prediction strategy within the transform domain. The video frames are first decomposed into low- and high-frequency components via the discrete wavelet transform. For the low-frequency subbands, an improved low-latency Multi-view High-Efficiency Video Coding (MV-HEVC) encoder is adopted, where the background reference frames are treated as one view to enable more accurate inter-frame prediction. For high-frequency components, pixel-wise clustered edge prediction is applied. Furthermore, the prediction residuals are reduced by optimal direction prediction, according to the principle of minimizing residual energy. Experimental results demonstrate that our method significantly outperforms mainstream video compression techniques. While maintaining compression performance comparable to MV-HEVC, the proposed method exhibits a 19.3-fold improvement in computational efficiency. Full article
(This article belongs to the Section Sensing and Imaging)
12 pages, 2668 KB  
Article
Spatial-Frequency Fusion Tiny-Transformer for Efficient Image Super-Resolution
by Qiaoyue Man
Appl. Sci. 2026, 16(3), 1284; https://doi.org/10.3390/app16031284 - 27 Jan 2026
Viewed by 42
Abstract
In image super-resolution tasks, methods based on Generative Adversarial Networks (GANs), Transformer models, and diffusion models demonstrate robust global modeling capabilities and outstanding performance. However, their computational costs remain prohibitively high, limiting deployment on resource-constrained devices. Meanwhile, frequency-domain approaches based on convolutional neural [...] Read more.
In image super-resolution tasks, methods based on Generative Adversarial Networks (GANs), Transformer models, and diffusion models demonstrate robust global modeling capabilities and outstanding performance. However, their computational costs remain prohibitively high, limiting deployment on resource-constrained devices. Meanwhile, frequency-domain approaches based on convolutional neural networks (CNNs) capture complementary structural information but lack long-range dependencies, resulting in suboptimal perceptual image quality. To overcome these limitations, we propose a micro-Transformer-based architecture. This framework enriches high-frequency image information through wavelet transform-based frequency-domain features, integrates spatio-temporal and frequency-domain cross-feature fusion, and incorporates a discriminator constraint to achieve image super-resolution. Extensive experiments demonstrate that this approach achieves competitive PSNR/SSIM performance while maintaining reasonable computational complexity. Its visual quality and efficiency outperform most existing SR methods. Full article
Show Figures

Figure 1

23 pages, 2628 KB  
Article
Scattering-Based Self-Supervised Learning for Label-Efficient Cardiac Image Segmentation
by Serdar Alasu and Muhammed Fatih Talu
Electronics 2026, 15(3), 506; https://doi.org/10.3390/electronics15030506 - 24 Jan 2026
Viewed by 215
Abstract
Deep learning models based on supervised learning rely heavily on large annotated datasets and particularly in the context of medical image segmentation, the requirement for pixel-level annotations makes the labeling process labor-intensive, time-consuming and expensive. To overcome these limitations, self-supervised learning (SSL) has [...] Read more.
Deep learning models based on supervised learning rely heavily on large annotated datasets and particularly in the context of medical image segmentation, the requirement for pixel-level annotations makes the labeling process labor-intensive, time-consuming and expensive. To overcome these limitations, self-supervised learning (SSL) has emerged as a promising alternative that learns generalizable representations from unlabeled data; however, existing SSL frameworks often employ highly parameterized encoders that are computationally expensive and may lack robustness in label-scarce settings. In this work, we propose a scattering-based SSL framework that integrates Wavelet Scattering Networks (WSNs) and Parametric Scattering Networks (PSNs) into a Bootstrap Your Own Latent (BYOL) pretraining pipeline. By replacing the initial stages of the BYOL encoder with fixed or learnable scattering-based front-ends, the proposed method reduces the number of learnable parameters while embedding translation-invariant and small deformation-stable representations into the SSL pipeline. The pretrained encoders are transferred to a U-Net and fine-tuned for cardiac image segmentation on two datasets with different imaging modalities, namely, cardiac cine MRI (ACDC) and cardiac CT (CHD), under varying amounts of labeled data. Experimental results show that scattering-based SSL pretraining consistently improves segmentation performance over random initialization and ImageNet pretraining in low-label regimes, with particularly pronounced gains when only a few labeled patients are available. Notably, the PSN variant achieves improvements of 4.66% and 2.11% in average Dice score over standard BYOL with only 5 and 10 labeled patients, respectively, on the ACDC dataset. These results demonstrate that integrating mathematically grounded scattering representations into SSL pipelines provides a robust and data-efficient initialization strategy for cardiac image segmentation, particularly under limited annotation and domain shift. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

23 pages, 5986 KB  
Article
Modulation and Perturbation in Frequency Domain for SAR Ship Detection
by Mengqin Fu, Wencong Zhang, Xiaochen Quan, Dahu Shi, Luowei Tan, Jia Zhang, Yinghui Xing and Shizhou Zhang
Remote Sens. 2026, 18(2), 338; https://doi.org/10.3390/rs18020338 - 20 Jan 2026
Viewed by 116
Abstract
Synthetic Aperture Radar (SAR) has unique advantages in ship monitoring at sea due to its all-weather imaging capability. However, its unique imaging mechanism presents two major challenges. First, speckle noise in the frequency domain reduces the contrast between the target and the background. [...] Read more.
Synthetic Aperture Radar (SAR) has unique advantages in ship monitoring at sea due to its all-weather imaging capability. However, its unique imaging mechanism presents two major challenges. First, speckle noise in the frequency domain reduces the contrast between the target and the background. Second, side-lobe scattering blurs the ship outline, especially in nearshore complex scenes, and strong scattering characteristics make it difficult to separate the target from the background. The above two challenges significantly limit the performance of tailored CNN-based detection models in optical images when applied directly to SAR images. To address these challenges, this paper proposes a modulation and perturbation mechanism in the frequency domain based on a lightweight CNN detector. Specifically, the wavelet transform is firstly used to extract high-frequency features in different directions, and feature expression is dynamically adjusted according to the global statistical information to realize the selective enhancement of the ship edge and detail information. In terms of frequency-domain perturbation, a perturbation mechanism guided by frequency-domain weight is introduced to effectively suppress background interference while maintaining key target characteristics, which improves the robustness of the model in complex scenes. Extensive experiments on four widely adopted benchmark datasets, namely LS-SSDD-v1.0, SSDD, SAR-Ship-Dataset, and AIR-SARShip-2.0, demonstrate that our FMP-Net significantly outperforms 18 existing state-of-the-art methods, especially in complex nearshore scenes and sea surface interference scenes. Full article
Show Figures

Figure 1

20 pages, 5606 KB  
Article
Heart Sound Classification for Early Detection of Cardiovascular Diseases Using XGBoost and Engineered Acoustic Features
by P. P. Satya Karthikeya, P. Rohith, B. Karthikeya, M. Karthik Reddy, Akhil V M, Andrea Tigrini, Agnese Sbrollini and Laura Burattini
Sensors 2026, 26(2), 630; https://doi.org/10.3390/s26020630 - 17 Jan 2026
Viewed by 226
Abstract
Heart sound-based detection of cardiovascular diseases is a critical task in clinical diagnostics, where early and accurate identification can significantly improve patient outcomes. In this study, we investigate the effectiveness of combining traditional acoustic features and transformer-based Wav2Vec embeddings with advanced machine learning [...] Read more.
Heart sound-based detection of cardiovascular diseases is a critical task in clinical diagnostics, where early and accurate identification can significantly improve patient outcomes. In this study, we investigate the effectiveness of combining traditional acoustic features and transformer-based Wav2Vec embeddings with advanced machine learning models for multi-class classification of five heart sound categories. Ten engineered acoustic features, i.e., Log Mel, MFCC, delta, delta-delta, chroma, discrete wavelet transform, zero-crossing rate, energy, spectral centroid, and temporal flatness, were extracted as regular features. Four model configurations were evaluated: a hybrid CNN + LSTM and XGBoost trained with either regular features or Wav2Vec embeddings. Models were assessed using a held-out test set with hyperparameter tuning and cross-validation. Results demonstrate that models trained on regular features consistently outperform Wav2Vec-based models, with XGBoost achieving the highest accuracy of 99%, surpassing the hybrid model at 98%. These findings highlight the importance of domain-specific feature engineering and the effectiveness of ensemble learning with XGBoost for robust and accurate heart sound classification, offering a promising approach for early detection and intervention in cardiovascular diseases. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

21 pages, 23946 KB  
Article
Infrared Image Denoising Algorithm Based on Wavelet Transform and Self-Attention Mechanism
by Hongmei Li, Yang Zhang, Luxia Yang and Hongrui Zhang
Sensors 2026, 26(2), 523; https://doi.org/10.3390/s26020523 - 13 Jan 2026
Viewed by 163
Abstract
Infrared images are often degraded by complex noise due to hardware and environmental factors, posing challenges for subsequent processing and target detection. To overcome the shortcomings of existing denoising methods in balancing noise removal and detail preservation, this paper proposes a Wavelet Transform [...] Read more.
Infrared images are often degraded by complex noise due to hardware and environmental factors, posing challenges for subsequent processing and target detection. To overcome the shortcomings of existing denoising methods in balancing noise removal and detail preservation, this paper proposes a Wavelet Transform Enhanced Infrared Denoising Model (WTEIDM). Firstly, a Wavelet Transform Self-Attention (WTSA) is designed, which combines the frequency-domain decomposition ability of the discrete wavelet transform (DWT) with the dynamic weighting mechanism of self-attention to achieve effective separation of noise and detail. Secondly, a Multi-Scale Gated Linear Unit (MSGLU) is devised to improve the ability to capture detail information and dynamically control features through dual-branch multi-scale depth-wise convolution and gating strategy. Finally, a Parallel Hybrid Attention Module (PHAM) is proposed to enhance cross-dimensional feature fusion effect through the parallel cross-interaction of spatial and channel attention. Extensive experiments are conducted on five infrared datasets under different noise levels (σ = 15, 25, and 50). The results demonstrate that the proposed WTEIDM outperforms several state-of-the-art denoising algorithms on both PSNR and SSIM metrics, confirming its superior generalization capability and robustness. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

20 pages, 10682 KB  
Article
FESW-UNet: A Dual-Domain Attention Network for Sorghum Aphid Segmentation
by Caijian Hua and Fangjun Ren
Sensors 2026, 26(2), 458; https://doi.org/10.3390/s26020458 - 9 Jan 2026
Viewed by 251
Abstract
Current management strategies for sorghum aphids heavily rely on indiscriminate chemical application, leading to severe environmental consequences and impacting food safety. While precision spraying offers a viable remediation for pesticide overuse, its effectiveness depends on accurately locating and classifying pests. To address the [...] Read more.
Current management strategies for sorghum aphids heavily rely on indiscriminate chemical application, leading to severe environmental consequences and impacting food safety. While precision spraying offers a viable remediation for pesticide overuse, its effectiveness depends on accurately locating and classifying pests. To address the critical challenge of segmenting small, swarming aphids in complex field environments, we propose FESW-UNet, a dual-domain attention network that integrates Fourier-enhanced attention, spatial attention, and wavelet-based downsampling into a UNet backbone. We introduce an efficient multi-scale attention (EMA) module between the encoder and decoder to enhance global context perception, enabling the model to capture more accurate relationships between global and local features in the field. In the feature extraction stage, we embed a simple attention module (SimAM) to target key infestation regions while suppressing background noise, thereby enhancing pixel-level discrimination. Furthermore, we replace conventional downsampling with Haar wavelet downsampling (HWD) to reduce resolution while preserving structural edge details. Finally, a Fourier-enhanced attention module (FEAM) is added to the skip-connection layers. By using complex-valued weights to regulate frequency-domain features, FEAM effectively fuses global low-frequency structures with local high-frequency details, thereby enhancing feature representation diversity. Experiments on the Aphid Cluster Segmentation dataset demonstrate that FESW-UNet outperforms other models, achieving an mIoU of 68.76%, mPA of 78.19%, and mF1 of 79.01%. The model also demonstrated strong adaptability on the AphidSeg-Sorghum dataset, achieving an mIoU of 81.22%, mPA of 87.97%, and mF1 of 88.60%. The proposed method offers an efficient and feasible technical solution for monitoring and controlling sorghum aphids through image segmentation, demonstrating broad application potential. Full article
(This article belongs to the Special Issue AI, IoT and Smart Sensors for Precision Agriculture: 2nd Edition)
Show Figures

Figure 1

26 pages, 4009 KB  
Article
A Hybrid Simulation–Physical Data-Driven Framework for Occupant Injury Prediction in Vehicle Underbody Structures
by Xinge Si, Changan Di, Peng Peng, Yongjian Zhang, Tao Lin and Cong Xu
Sensors 2026, 26(2), 380; https://doi.org/10.3390/s26020380 - 7 Jan 2026
Viewed by 192
Abstract
One major challenge in optimizing vehicle underbody structures for blast protection is the trade-off between the high cost of physical tests and the limited accuracy of simulations. We introduce a predictive framework that is co-driven by limited physical measurements and systematically augmented simulation [...] Read more.
One major challenge in optimizing vehicle underbody structures for blast protection is the trade-off between the high cost of physical tests and the limited accuracy of simulations. We introduce a predictive framework that is co-driven by limited physical measurements and systematically augmented simulation datasets. The main problem arises from the complex components of blast impact signals, which makes it difficult to augment the load signals for finite element simulations when only extremely small sample sets are available. Specifically, a small-scale data-augmentation model within the wavelet domain based on a conditional generative adversarial network (CGAN) was designed. Real-time perturbations, governed by cumulative distribution functions, were introduced to expand and diversify the data representations for enhanced dataset enrichment. A predictive model based on Gaussian process regression (GPR) that integrates physical experimental data with augmented data wavelet characteristics is employed to estimate injury indices, using wavelet scale energies reduced via principal component analysis (PCA) as inputs. Cross-validation shows that this hybrid model achieves higher accuracy than using simulations alone. Through the case study, the model demonstrates that increased hull angle and depth can effectively reduce occupant injury. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

20 pages, 20362 KB  
Article
Node-Incremental-Based Multisource Domain Adaptation for Fault Diagnosis of Rolling Bearings with Limited Data
by Di Deng, Wei Li, Jiang Liu and Yan Qin
Machines 2026, 14(1), 71; https://doi.org/10.3390/machines14010071 - 6 Jan 2026
Viewed by 212
Abstract
Bearing fault diagnosis is essential for ensuring the safe and reliable operation of rotating machinery. However, accurate and timely fault identification with limited data remains a significant challenge. This study proposes a novel node-incremental-based multisource domain adaptation (NiMDA) approach for bearing fault diagnosis. [...] Read more.
Bearing fault diagnosis is essential for ensuring the safe and reliable operation of rotating machinery. However, accurate and timely fault identification with limited data remains a significant challenge. This study proposes a novel node-incremental-based multisource domain adaptation (NiMDA) approach for bearing fault diagnosis. The method employs a cloud model to adaptively extract fault-sensitive information while accounting for uncertainties across multiple wavelet packet decomposition levels. Subsequently, node incremental domain adaptation (NiDA) is used to construct a base classifier utilizing limited labeled data from both target and source domains. This approach reduces discrepancies between marginal and conditional distributions across different domain feature spaces during the node-increment process, resulting in a compact domain-adaptation structure. Robust diagnostic performance is achieved through parallel ensemble learning of NiDAs across multiple source domains. The experimental results demonstrate that NiMDA significantly outperforms state-of-the-art bearing fault diagnosis methods in few-shot scenarios, achieving improvements of 30.52%, 42.31%, 10.31%, 26.08%, 25.59%, and 7.98% over WDCNN, MCNN-LSTM, Bayesian-RF, DM-RVFLN, Five-shot, and ESCN, respectively, while maintaining satisfactory diagnostic speed. Full article
Show Figures

Figure 1

13 pages, 2618 KB  
Article
Multi-Domain Perception Transformer for Generalized Forgery Image Detection
by Qiaoyue Man, Seok-Jeong Gee and Young-Im Cho
Appl. Sci. 2026, 16(1), 533; https://doi.org/10.3390/app16010533 - 5 Jan 2026
Viewed by 207
Abstract
With the rapid advancement of generative AI (AIGC) technology, synthetic images are increasingly approaching real pictures in terms of resolution and semantic consistency. Traditional detection methods face numerous challenges, such as insufficient cross-modal generalization capabilities and difficulty in identifying hidden generative traces. Existing [...] Read more.
With the rapid advancement of generative AI (AIGC) technology, synthetic images are increasingly approaching real pictures in terms of resolution and semantic consistency. Traditional detection methods face numerous challenges, such as insufficient cross-modal generalization capabilities and difficulty in identifying hidden generative traces. Existing solutions primarily design feature extractors for single generative models, struggling to address the complexity of multimodal forgeries. Therefore, we propose a multi-domain feature fusion Transformer network that integrates spatial, frequency, and wavelet transform features and introduce a cross-domain feature fusion module (CDAF) to detect subtle forgery traces in deepfake images. This model demonstrates superior detection performance on current forged images generated by generative adversarial networks (GANs) and diffusion models while exhibiting enhanced robustness. Full article
Show Figures

Figure 1

22 pages, 37315 KB  
Article
WG-FuseNet: Wavelet-Guided and Gated Fusion Network for Road Segmentation
by Yu Nie, Jiaqi Sun, Ming Zhu, Yuan Liu, Yuanfu Yuan, Shuhui Jiang, Yan Lu and Jiarong Wang
Sensors 2026, 26(1), 218; https://doi.org/10.3390/s26010218 - 29 Dec 2025
Viewed by 288
Abstract
In current road segmentation tasks, high-frequency details of roads (such as road edges and pavement textures) tend to become blurred or even lost during feature extraction due to progressive downsampling, leading to imprecise segmentation boundaries. Moreover, existing fusion methods predominantly rely on simple [...] Read more.
In current road segmentation tasks, high-frequency details of roads (such as road edges and pavement textures) tend to become blurred or even lost during feature extraction due to progressive downsampling, leading to imprecise segmentation boundaries. Moreover, existing fusion methods predominantly rely on simple concatenation or summation operations, which struggle to adaptively integrate the rich texture information from RGB modality with the geometric structural information from Depth modality, thereby limiting fusion efficiency. To address these issues, this paper proposes an innovative model. We design a Cross-scale Wavelet Enhancement Module (CWEM) to compensate for the shortcomings of traditional networks in frequency domain analysis, explicitly enhancing the representation capability of edge and texture features. Simultaneously, a Gated Cross-Modality Fusion module (GCMF) is constructed to achieve adaptive and efficient fusion between RGB and Depth features. Additionally, to tackle the high false detection rates and confusion between sidewalks and opposite lanes in existing methods, this paper optimizes the loss function to further improve the model’s discriminative ability in complex scenarios. Experiments on the public KITTI_Road dataset demonstrate that the proposed method achieves a segmentation accuracy of 97.31% while maintaining a real-time inference speed of 34 FPS, with particularly outstanding performance in road edge integrity and shadow area handling. Full article
(This article belongs to the Section Vehicular Sensing)
Show Figures

Figure 1

18 pages, 5414 KB  
Article
Experimental Study on Acoustic Emission Signals Under Different Processing States of Laser-Assisted Machining of SiC Ceramics
by Chen Cao, Yugang Zhao, Xiukun Hu and Xiao Cui
Micromachines 2026, 17(1), 42; https://doi.org/10.3390/mi17010042 - 29 Dec 2025
Viewed by 234
Abstract
In this paper, laser-assisted machining (LAM) of SiC ceramics was taken as the research object, and the different spectrum and energy spectrum characteristics and their changing trends of acoustic emission (AE) signals under processing states of brittleness, plasticity and thermal damage were analyzed. [...] Read more.
In this paper, laser-assisted machining (LAM) of SiC ceramics was taken as the research object, and the different spectrum and energy spectrum characteristics and their changing trends of acoustic emission (AE) signals under processing states of brittleness, plasticity and thermal damage were analyzed. The numerical characterization of ceramic softening degree was indirectly realized by the energy spectrum characteristics of low-frequency band energy ratio, marking a methodological breakthrough in transitioning from qualitative analysis to quantitative detection for identifying plastic processing state. First, the surface morphology of the machined surface based on the single-factor experiment of laser power was analyzed, and three different processing states and ranges of laser power were determined, namely brittle state (0–185 W), plastic state (185–225 W) and thermal damage state (>225 W). Then, the wavelet packet denoising and spectrum analysis of AE signals under different processing states were carried out to obtain the corresponding frequency of the maximum amplitude and the amplitude change trend of the characteristic frequency (515 kHz) in the high-frequency domain. Finally, the energy spectrum analysis of acoustic emission signals was carried out, and the method of indirect characterization of ceramic softening degree by low-frequency band energy ratio was proposed. This paper provides a numerical characterization method and theoretical guidance for the detection and identification of the plastic processing state of ceramic laser-assisted cutting. Full article
(This article belongs to the Section D:Materials and Processing)
Show Figures

Figure 1

Back to TopTop