MDPI - Publisher of Open Access Journals

34 pages, 4240 KB

Open AccessArticle

A Multimodal Data Fusion Algorithm for Urban Low-Altitude UAV Perception

by Bowen Xu, Peinan He, Xu Wang, Yixiao Zhang and Yuanjie Zhao

Drones 2026, 10(6), 457; https://doi.org/10.3390/drones10060457 - 11 Jun 2026

Viewed by 216

Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical [...] Read more.

Accurate Unmanned Aerial Vehicle (UAV) position estimation is the cornerstone of urban low-altitude safety management systems. Time Difference of Arrival (TDOA) and Remote Identification (Remote ID) are widely used surveillance technologies with complementary characteristics. TDOA provides high-rate updates but suffers from geometry-induced horizontal–vertical anisotropy and multipath effects, while Remote ID supplies absolute state information yet struggles with intermittent sampling and packet loss. Existing fusion schemes typically address these issues in isolation: sequential filtering manages asynchrony but assumes Gaussian noise, robust estimators suppress outliers at the cost of discarding valid data, and coupled-filter architectures allow vertical anomalies to contaminate horizontal estimates through the Kalman gain cross-coupling. No prior framework jointly handles structural TDOA altitude jumps, stochastic Remote ID timing jitter, and the geometric anisotropy between estimation subspaces within a single coherent pipeline. To bridge this gap, we propose a Hybrid Conditional Kalman Filter (HCKF) framework comprising three integrated modules. First, a kinematics-based temporal alignment module maps asynchronous measurements onto a uniform timeline and predicts missing samples, resolving cross-modal time mismatches. Second, a measurement quality evaluation mechanism detects TDOA altitude steps via robust two-layer stratification and scores Remote ID timing irregularity through a confidence mapping, converting these anomalies into dynamic covariance adjustments and weight caps without discarding observations. Third, a Subspace-Decoupled Fusion strategy exploits the physical insight that TDOA horizontal precision derives from hyperbolic intersection geometry, whereas its vertical estimates suffer from weak observability due to near-coplanar ground-station deployment. By applying entropy-guided weighting in the horizontal plane and a conditional Remote ID-dominant rule in the vertical axis, this design prevents cross-dimensional error propagation. The framework was validated using three real-world flight missions at distinct altitudes (255 m, 345 m, and 440 m) totaling 13.51 km of flight distance, with RTK serving as ground truth. HCKF reduces the Root Mean Square Error by over 40% relative to single-source baselines (95% bootstrap confidence interval: [35.2%, 48.7%]), and paired Wilcoxon signed-rank tests confirm statistically significant improvement (

p < 0.01

) over standard EKF, Covariance Intersection, and Iterative CI across all three tracks. Full article

(This article belongs to the Special Issue Detection, Identification and Tracking of UAVs and Drones: 2nd Edition)

► Show Figures

Figure 1

22 pages, 5140 KB

Open AccessArticle

Application of Deep Multi-Scale Representation Learning Based on Eye-Tracking and Facial Expression Data in Cognitive Decline Assessment

by Yanfeng Xue, Xianpeng Luo, Shuai Guo and Tao Song

Sensors 2026, 26(9), 2600; https://doi.org/10.3390/s26092600 - 23 Apr 2026

Viewed by 614

Abstract

Digital biomarkers derived from eye-tracking and facial expression hold significant potential for the non-invasive screening of cognitive decline (CD). However, existing approaches predominantly rely on single-task or feature engineering-based unimodal methods, which struggle to capture complex temporal behavioral patterns. While deep learning (DL) [...] Read more.

Digital biomarkers derived from eye-tracking and facial expression hold significant potential for the non-invasive screening of cognitive decline (CD). However, existing approaches predominantly rely on single-task or feature engineering-based unimodal methods, which struggle to capture complex temporal behavioral patterns. While deep learning (DL) excels at extracting hierarchical features and intricate temporal dynamics from behavioral sequences, its application in this specific multimodal sensing domain remains exploratory. Addressing this gap, this study designed an assessment system integrating five multi-dimensional cognitive paradigms and collected eye-tracking and facial expression data from 20 healthy controls (HC) and 20 individuals with CD. For these multimodal sequences, we propose a deep neural network capable of multi-scale representation learning. By utilizing subspace exploration and multi-scale convolutions, this architecture extracts deep representations directly from data and incorporates a decision fusion mechanism to enhance diagnostic robustness. Experimental results demonstrate that our method achieves a 90% classification accuracy, outperforming machine learning models. Furthermore, statistical analyses conducted in this study validated several features associated with CD and also explored some novel potential behavioral patterns. This study confirms the feasibility of a DL framework based on eye-tracking and facial expression signals for identifying CD, providing a reference for developing objective and efficient digital screening tools. Full article

(This article belongs to the Section Biomedical Sensors)

► Show Figures

Figure 1

20 pages, 1848 KB

Open AccessArticle

Benchmarking Multimodal Deep Fusion Strategies for Heterogeneous Neuroimaging and Cognitive Data Using a Controlled Sex Classification Task

by Chiara Camastra, Assunta Pelagi, Andrea Quattrone and Alessia Sarica

Brain Sci. 2026, 16(4), 405; https://doi.org/10.3390/brainsci16040405 - 10 Apr 2026

Viewed by 820

Abstract

Background/Objectives: Multimodal data fusion is increasingly applied in neuroinformatics to integrate heterogeneous sources of information. However, the optimal strategies for combining modalities with markedly different dimensionality, scale, and noise characteristics remain unclear. To our knowledge, this is among the first systematic and [...] Read more.

Background/Objectives: Multimodal data fusion is increasingly applied in neuroinformatics to integrate heterogeneous sources of information. However, the optimal strategies for combining modalities with markedly different dimensionality, scale, and noise characteristics remain unclear. To our knowledge, this is among the first systematic and controlled benchmarks explicitly disentangling the effects of fusion strategy and feature scaling within a unified deep learning framework. Methods: Using data from 747 healthy participants from the Human Connectome Project, we evaluated multiple fusion paradigms—including early fusion, attention-based fusion, subspace-based fusion, and graph-based fusion—within a unified and reproducible framework. Importantly, we assessed how different feature scaling techniques (Standard, Min–Max, and Robust scaling) interact with fusion strategies and influence model performance. Biological sex was used as a controlled benchmark task to focus on methodological insights rather than task-specific optimization. Results: Early feature-level fusion consistently achieved the highest classification performance across all evaluated configurations. In particular, direct concatenation of cognitive and neuroimaging features combined with Standard Scaling yielded the best results (AUC–ROC = 0.96 (0.95–0.96)), outperforming unimodal baselines as well as intermediate and late fusion strategies. Conclusions: This systematic benchmark demonstrates that multimodal deep learning performance in neuroscience is driven primarily by the interaction between fusion strategy and feature scaling rather than by architectural complexity alone. By explicitly disentangling the effects of fusion level and preprocessing within a unified framework, this study provides practical methodological guidance for the design, evaluation, and reproducible deployment of multimodal deep learning models in neuroscience. Full article

(This article belongs to the Section Computational Neuroscience, Neuroinformatics, and Neurocomputing)

► Show Figures

Figure 1

18 pages, 966 KB

Open AccessArticle

Anomaly Detection Based on Hybrid Kernelized Fuzzy Density

by Kaitian Luo, Shenhong Lei, Chaoqing Li and Yi Li

Symmetry 2026, 18(1), 192; https://doi.org/10.3390/sym18010192 - 20 Jan 2026

Viewed by 454

Abstract

Unsupervised anomaly detection has been extensively studied. However, most existing methods are designed for either numerical or nominal data, which struggle to detect anomalies effectively in real-world mixed-type datasets. Fuzzy information granulation is a key concept in granular computing, which offers a potent [...] Read more.

Unsupervised anomaly detection has been extensively studied. However, most existing methods are designed for either numerical or nominal data, which struggle to detect anomalies effectively in real-world mixed-type datasets. Fuzzy information granulation is a key concept in granular computing, which offers a potent framework for managing uncertainty in mixed-type data and provides a viable pathway for unsupervised anomaly detection. Nevertheless, conventional fuzzy information granulation-based detection methods often model only simple, linear fuzzy relations between samples. This limitation prevents them from capturing the complex, nonlinear structures inherent in the data, leading to a degradation in detection performance. To address these shortcomings, we propose a Hybrid Kernelized Fuzzy Density-based anomaly detector (HKFD). HKFD pioneers a hybrid kernelized fuzzy relation by integrating a hybrid distance metric with kernel methods. This new relation allows us to define a hybrid kernelized fuzzy density for each sample within every feature subspace, effectively capturing the local data dispersion. Crucially, we introduce an information-theoretic weighting mechanism. By calculating the fuzzy information entropy of each feature’s distribution, HKFD automatically assigns higher weights to more informative feature subspaces that contribute more to identifying anomalies. The final anomaly factor is then calculated by the weighted fusion of these densities. Comprehensive experiments on 20 datasets demonstrate that HKFD significantly outperforms state-of-the-art methods, achieving superior anomaly detection performance. Full article

(This article belongs to the Special Issue Symmetry/Asymmetry in Fuzzy Sets and Fuzzy Systems)

► Show Figures

Figure 1

20 pages, 1355 KB

Open AccessArticle

Multimodal Mutual Information Extraction and Source Detection with Application in Focal Seizure Localization

by Soosan Beheshti, Erfan Naghsh, Younes Sadat-Nejad and Yashar Naderahmadian

Electronics 2025, 14(24), 4897; https://doi.org/10.3390/electronics14244897 - 12 Dec 2025

Cited by 1 | Viewed by 809

Abstract

Current multimodal imaging–based source localization (SoL) methods often rely on synchronously recorded data, and many neural network–driven approaches require large training datasets, conditions rarely met in clinical neuroimaging. To address these limitations, we introduce MieSoL (Multimodal Mutual Information Extraction and Source Localization), a [...] Read more.

Current multimodal imaging–based source localization (SoL) methods often rely on synchronously recorded data, and many neural network–driven approaches require large training datasets, conditions rarely met in clinical neuroimaging. To address these limitations, we introduce MieSoL (Multimodal Mutual Information Extraction and Source Localization), a unified framework that fuses EEG and MRI, whether acquired synchronously or asynchronously, to achieve robust cross-modal information extraction and high-accuracy SoL. Targeting neuroimaging applications, MieSoL combines Magnetic Resonance Imaging (MRI) and Electroencephalography (EEG), leveraging their complementary strengths—MRI’s high spatial resolution and EEG’s superior temporal resolution. MieSoL addresses key limitations of existing SoL methods, including poor localization accuracy and an unreliable estimation of the true source number. The framework combines two existing components—Unified Left Eigenvectors (ULeV) and Efficient High-Resolution sLORETA (EHR-sLORETA)—but integrates them in a novel way: ULeV is adapted to extract a noise-resistant shared latent representation across modalities, enabling cross-modal denoising and an improved estimation of the true source number (TSN), while EHR-sLORETA subsequently performs anatomically constrained high-resolution inverse mapping on the purified subspace. While EHR-sLORETA already demonstrates superior localization precision relative to sLORETA, replacing conventional PCA/ICA preprocessing with ULeV provides substantial advantages, particularly when data are scarce or asynchronously recorded. Unlike PCA/ICA approaches, which perform denoising and source selection separately and are limited in capturing shared information, ULeV jointly processes EEG and MRI to perform denoising, dimension reduction, and mutual-information-based feature extraction in a unified step. This coupling directly addresses longstanding challenges in multimodal SoL, including inconsistent noise levels, temporal misalignment, and the inefficiency of traditional PCA-based preprocessing. Consequently, on synthetic datasets, MieSoL achieves 40% improvement in Average Correlation Coefficient (ACC) and 56% reduction in Average Error Estimation (AEE) compared to conventional techniques. Clinical validation involving 26 epilepsy patients further demonstrates the method’s robustness, with automated results aligning closely with expert epileptologist assessments. Overall, MieSoL offers a principled and interpretable multimodal fusion paradigm that enhances the fidelity of EEG source localization, holding significant promise for both clinical and cognitive neuroscience applications. Full article

► Show Figures

Figure 1

25 pages, 25914 KB

Open AccessArticle

Permeability Index Modeling with Multiscale Time Delay Characteristics Excavation in Blast Furnace Ironmaking Process

by Yonghong Xu, Chunjie Yang and Siwei Lou

Electronics 2025, 14(23), 4670; https://doi.org/10.3390/electronics14234670 - 27 Nov 2025

Viewed by 777

Abstract

The permeability index (PI) is a key comprehensive indicator that reflects the smoothness of internal gas flow in pig iron production via blast furnace. An accurate prediction for it is essential for forecasting abnormal furnace conditions and preventing potential faults. However, developing an [...] Read more.

The permeability index (PI) is a key comprehensive indicator that reflects the smoothness of internal gas flow in pig iron production via blast furnace. An accurate prediction for it is essential for forecasting abnormal furnace conditions and preventing potential faults. However, developing an early prediction model for PI has been neglected in existing research, and it faces massive challenges due to the strong nonlinearity, undesirable nonstationarity, and significant multiscale time delays inherent in the blast furnace data. To bridge this gap, a new modeling paradigm for PI is proposed to explore the inherent time delay characteristics among multiple variables. First, the data are progressively decomposed into multiple components using wavelet decomposition and spike separation. Then, a novel delay extraction method based on wavelet coherence analysis is developed to obtain accurate multiscale time delay knowledge. Furthermore, the integration of Orthonormal Subspace Analysis (OSA) and wavelet neural network (WNN) achieves comprehensive modeling across time and frequency domains, incorporating global and local features. A Gauss–Markov-based fusion framework is also utilized to reduce the output error variance, ultimately enabling the early prediction of PI. Mechanism analysis and a practical case study on blast furnace production verify the effectiveness of the proposed target-oriented prediction framework. Full article

(This article belongs to the Special Issue Advancements in Artificial Intelligence (AI) for Engineering Applications)

► Show Figures

Figure 1

20 pages, 2659 KB

Open AccessArticle

Twin-Space Decoupling and Interaction for Efficient Vision-Language Transfer

by Wei Liang, Junqiang Li, Zhengkai Guo, Zhiwei Peng, Xiaocui Li, Junfeng Yang, Chuang Li and Wei Long

Electronics 2025, 14(21), 4314; https://doi.org/10.3390/electronics14214314 - 3 Nov 2025

Viewed by 728

Abstract

Pre-trained visual language models have become excellent basic models for many downstream tasks in transfer learning. However, due to the serious gap between the data scale of downstream tasks and the large-scale data used by pre-trained models, migration to downstream tasks will face [...] Read more.

Pre-trained visual language models have become excellent basic models for many downstream tasks in transfer learning. However, due to the serious gap between the data scale of downstream tasks and the large-scale data used by pre-trained models, migration to downstream tasks will face the dilemma of discriminability and generalization. Therefore, it is necessary to learn task-specific knowledge while retaining general knowledge. How to accurately identify and distinguish these two types of representations remains a challenge. This paper proposes a dual-subspace driven cross-modal semantic interaction and dynamic feature fusion framework, which uses a decentralized covariance dual-subspace decomposition method to decouple visual and text features by constructing task subspaces and general knowledge subspaces, and performs refined modal interactions on the decoupled general features and task features through a cross-modal semantic interaction adapter module. Finally, a cross-level semantic fusion module based on a gating mechanism is used to achieve dynamic fusion of different semantics from shallow to deep. We verify the effectiveness of this method on three tasks: generalization to novel classes, novel target datasets, and domain generalization. Compared with a variety of advanced methods, the proposed method has achieved excellent performance in all evaluation tasks. Full article

(This article belongs to the Special Issue Digital Intelligence Technology and Applications, 2nd Edition)

► Show Figures

Graphical abstract

22 pages, 33466 KB

Open AccessArticle

Symmetry-Constrained Dual-Path Physics-Guided Mamba Network: Balancing Performance and Efficiency in Underwater Image Enhancement

by Ye Fang, Heting Sun, Yali Li, Shuai Yuan and Feng Zhao

Symmetry 2025, 17(10), 1742; https://doi.org/10.3390/sym17101742 - 16 Oct 2025

Cited by 1 | Viewed by 1271

Abstract

The field of underwater image enhancement (UIE) has advanced significantly, yet it continues to grapple with persistent challenges stemming from complex, spatially varying optical degradations such as light absorption, scattering, and color distortion. These factors often impede the efficient deployment of enhancement models. [...] Read more.

The field of underwater image enhancement (UIE) has advanced significantly, yet it continues to grapple with persistent challenges stemming from complex, spatially varying optical degradations such as light absorption, scattering, and color distortion. These factors often impede the efficient deployment of enhancement models. Conventional approaches frequently rely on uniform processing strategies that neither adapt effectively to diverse degradation patterns nor adequately incorporate physical principles, resulting in a trade-off between enhancement quality and computational efficiency. To overcome these limitations, we propose a Dual-Path Physics-Guided Mamba Network (DPPGM), a lightweight framework designed to synergize physical optics modeling with data-driven learning. Extensive experiments on three benchmark datasets (UIEB, LSUI, and U45) demonstrate that DPPGM outperforms 13 state-of-the-art methods, achieving an exceptional balance with only 1.48 M parameters and 25.39 G FLOPs. The key to this performance is a symmetry-constrained architecture: it incorporates a dual-path Mamba module for degradation-aware processing, physics-guided optimization based on the Jaffe–McGlamery model, and compact subspace fusion, ensuring that quality and efficiency are mutually reinforced rather than competing objectives. Full article

(This article belongs to the Section Computer)

► Show Figures

Figure 1

26 pages, 2902 KB

Open AccessArticle

Distributed Phased-Array Radar Mainlobe Interference Suppression and Cooperative Localization Based on CEEMDAN–WOBSS

by Xiang Liu, Huafeng He, Ruike Li, Yubin Wu, Xin Zhang and Yongquan You

Sensors 2025, 25(20), 6277; https://doi.org/10.3390/s25206277 - 10 Oct 2025

Viewed by 1306

Abstract

Mainlobe interference can severely degrade the performance of distributed phased-array radar systems in the presence of strong jamming or low-reflectivity targets. This paper introduces a signal–data dual-domain cooperative antijamming and localization (SDCAL) framework that integrates adaptive complete ensemble empirical mode decomposition with improved [...] Read more.

Mainlobe interference can severely degrade the performance of distributed phased-array radar systems in the presence of strong jamming or low-reflectivity targets. This paper introduces a signal–data dual-domain cooperative antijamming and localization (SDCAL) framework that integrates adaptive complete ensemble empirical mode decomposition with improved blind source separation and wavelet optimization (CEEMDAN-WOBSS) for signal-level denoising and separation. Following source separation, CFAR-based pulse compression is applied for precise range estimation, and multi-node data fusion is then used to achieve three-dimensional target localization. Under low signal-to-noise ratio (SNR) conditions, the adaptive CEEMDAN–WOBSS approach reconstructs the signal covariance matrix to preserve subspace rank, thereby accelerating convergence of the separation matrix. The subsequent pulse compression and CFAR detection steps provide reliable inter-node distance measurements for accurate fusion. The simulation results demonstrate that, compared to conventional blind-source-separation methods, the proposed framework markedly enhances interference suppression, detection probability, and localization accuracy—validating its effectiveness for robust collaborative sensing in challenging jamming scenarios. Full article

(This article belongs to the Special Issue Radar Target Detection, Imaging and Recognition)

► Show Figures

Figure 1

16 pages, 1106 KB

Open AccessArticle

Direct Position Determination of Wideband Source over Multipath Environment: Combining Taylor Expansion and Subspace Data Fusion in the Cross-Spectrum Domain

by Heng Chai, Xinjian Yin, Hao Hu and Xiaofei Zhang

Sensors 2025, 25(16), 4967; https://doi.org/10.3390/s25164967 - 11 Aug 2025

Cited by 1 | Viewed by 1072

Abstract

Position localization of wideband source over multipath environment is addressed in this paper. Traditional methods generally estimate intermediate parameters first and then use these parameters to construct equations for determining the source position. However, the localization accuracy of such methods deteriorates significantly in [...] Read more.

Position localization of wideband source over multipath environment is addressed in this paper. Traditional methods generally estimate intermediate parameters first and then use these parameters to construct equations for determining the source position. However, the localization accuracy of such methods deteriorates significantly in the presence of multipath effects. In this paper, a direct position determination method combining Taylor expansion and subspace data fusion in the cross-spectrum domain is proposed. The method constructs the data model based on the cross-spectrum of the received signals from arbitrary sensor pairs, effectively avoiding the loss of the available information. Subsequently, forward spatial smoothing is used to address the rank-deficiency problem caused by the multipath effect. Finally, a cost function using subspace data fusion is constructed, and the optimal value is derived via first-order Taylor expansion to compensate for the position estimation bias. The proposed method shows higher localization accuracy compared to state-of-the-art methods. The numerical and experimental results validate the superior localization performance of the proposed algorithm. Full article

(This article belongs to the Special Issue Recent Advances in Sensor Array Signal Processing and Its Applications in Future Communication and Radar)

► Show Figures

Figure 1

23 pages, 578 KB

Open AccessArticle

Distributed Partial Label Multi-Dimensional Classification via Label Space Decomposition

by Zhen Xu and Sicong Chen

Electronics 2025, 14(13), 2623; https://doi.org/10.3390/electronics14132623 - 28 Jun 2025

Cited by 4 | Viewed by 873

Abstract

Multi-dimensional classification (MDC), in which the training data are concurrently associated with numerous label variables across many dimensions, has garnered significant interest recently. Most of the current MDC methods are based on the framework of supervised learning, which induces a predictive model from [...] Read more.

Multi-dimensional classification (MDC), in which the training data are concurrently associated with numerous label variables across many dimensions, has garnered significant interest recently. Most of the current MDC methods are based on the framework of supervised learning, which induces a predictive model from a large amount of precisely labeled data. So, they are challenged to obtain satisfactory learning results in the situation where the training data are not annotated with precise labels but assigned with ambiguous labels. Besides, the current MDC algorithms only consider the scenario of centralized learning, where all training data are handled at a single node for the purpose of classifier induction. However, in some real applications, the training data are not consolidated at a single fusion center, but rather are dispersedly distributed among multiple nodes. In this study, we focus on the problem of decentralized classification involving partial multi-dimensional data that have partially accessible candidate labels, and develop a distributed method called dPL-MDC for learning with these partial labels. In this algorithm, we conduct one-vs.-one decomposition on the originally heterogeneous multi-dimensional output space, such that the problem of partial MDC can be transformed into the issue of distributed partial multi-label learning. Then, by using several shared anchor data to characterize the global distribution of label variables, we propose a novel distributed approach to learn the label confidence of the training data. Under the supervision of recovered credible labels, the classifier can be induced by exploiting the high-order label dependencies from a common low-dimensional subspace. Experiments performed on various datasets indicate that our proposed method is capable of achieving learning performance in distributed partial MDC. Full article

(This article belongs to the Special Issue Computational Intelligence and Machine Learning: Models and Applications: 2nd Edition)

► Show Figures

Figure 1

15 pages, 984 KB

Open AccessArticle

Tensioned Multi-View Ordered Kernel Subspace Clustering

by Liping Chen and Gongde Guo

Appl. Sci. 2025, 15(13), 7251; https://doi.org/10.3390/app15137251 - 27 Jun 2025

Cited by 1 | Viewed by 978

Abstract

Multi-view data improve the effectiveness of clustering tasks, but they often encounter complex noise and corruption. The missing view of the multi-view samples leads to serious degradation of the clustering model’s performance. Current multi-view clustering methods always try to compensate for the missing [...] Read more.

Multi-view data improve the effectiveness of clustering tasks, but they often encounter complex noise and corruption. The missing view of the multi-view samples leads to serious degradation of the clustering model’s performance. Current multi-view clustering methods always try to compensate for the missing information in the original domain, which is limited by the linear representation function. Even more, their clustering structures across views are not sufficiently considered, which leads to suboptimal results. To solve these problems, a tensioned multi-view subspace clustering algorithm is proposed based on sequential kernels to integrate complementary information in multi-source heterogeneous data. By superimposing the kernel matrix based on the sequential characteristics onto the third-order tensor, the robust low-rank representation for the missing is reconstructed by the matrix calculation of sequential kernel learning. Moreover, the tensor structure helps subspace learning to mine the high-order associations between different views. Tensioned Multi-view Ordered Kernel Subspace Clustering (TMOKSC) implements the ADMM framework. Compared with current representative multi-view clustering algorithms, the proposed TMOKSC algorithm is the best in many objective measures. In general, the robust sequential kernel represents the tensor fusion potential subspace structure. Full article

► Show Figures

Figure 1

17 pages, 8295 KB

Open AccessArticle

CGLCS-Net: Addressing Multi-Temporal and Multi-Angle Challenges in Remote Sensing Change Detection

by Ke Liu, Hang Xue, Caiyi Huang, Jiaqi Huo and Guoxuan Chen

Sensors 2025, 25(9), 2836; https://doi.org/10.3390/s25092836 - 30 Apr 2025

Cited by 2 | Viewed by 1189

Abstract

Currently, deep learning networks based on architectures such as CNN and Transformer have achieved significant advances in remote sensing image change detection, effectively addressing the issue of false changes due to spectral and radiometric discrepancies. However, when handling remote sensing image data from [...] Read more.

Currently, deep learning networks based on architectures such as CNN and Transformer have achieved significant advances in remote sensing image change detection, effectively addressing the issue of false changes due to spectral and radiometric discrepancies. However, when handling remote sensing image data from multiple sensors, different viewing angles, and extended periods, these models show limitations in modelling dynamic interactions and feature representations in change regions, restricting their ability to model the integrity and precision of irregular change areas. We propose the Context-Aware Global-Local Subspace Attention Change Detection Network (CGLCS-Net) to resolve these issues and introduce the Global-Local Context-Aware Selector (GLCAS) and the Subspace-based Self-Attention Fusion (SSAF) module. GLCAS dynamically selects receptive fields at different feature extraction stages through a joint pooling attention mechanism and depthwise separable convolution, enhancing global context and local feature extraction capabilities and improving feature representation for multi-scale and irregular change regions. The SSAF module establishes dynamic interactions between dual-temporal features via feature decomposition and self-attention mechanisms, focusing on semantic change areas to address challenges such as sensor viewpoint variations and the texture and spectral inconsistencies caused by long periods. Compared to ChangeFormer, CGLCS-Net achieved improvements in the IoU metric of 0.95%, 9.23%, and 13.16% on the three public datasets, i.e., LEVIR-CD, SYSU-CD, and S2Looking, respectively. Additionally, it reduced model parameters by 70.05%, floating-point operations by 7.5%, and inference time by 11.5%. These improvements enhance its applicability for continuous land use and land cover change monitoring. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

26 pages, 20717 KB

Open AccessArticle

HFDF-EffNetV2: A Lightweight, Noise-Robust Model for Fault Diagnosis in Rolling Bearings

by Donglei Zhang, Jiafang Pan, Tianping Huang, Junlin Niu and Faguo Huang

Appl. Sci. 2025, 15(9), 4902; https://doi.org/10.3390/app15094902 - 28 Apr 2025

Cited by 2 | Viewed by 1383

Abstract

In rolling bearing intelligent fault diagnosis (FD), lightweight models are constrained by issues such as noise interference and the scarcity of fault data, making it challenging to achieve real-time, high-accuracy diagnosis on resource-limited devices. To address these challenges, this study proposes a lightweight [...] Read more.

In rolling bearing intelligent fault diagnosis (FD), lightweight models are constrained by issues such as noise interference and the scarcity of fault data, making it challenging to achieve real-time, high-accuracy diagnosis on resource-limited devices. To address these challenges, this study proposes a lightweight model that combines the hierarchical fine-grained decision fusion (HFDF) strategy with an improved EfficientNetV2 architecture (HFDF-EffNetV2). The model optimizes depth and width multiplicity factors to enhance parameter utilization efficiency. It uses pyramidal convolution (PyConv) combined with Fused-MBConv (Fused-MBPyConv) to obtain multi-scale time-frequency information. Additionally, an enhanced MBConv, termed BSMB-Conv-MLCA, integrates subspace blueprint separable convolution (BSConv-S) with mixed local channel attention (MLCA) extract deep-grained fault features. The HFDF strategy outputs confidence in stages and updates weights to prevent the model from falling into local overfitting when handling confusable samples. Experimental results on Case 1 and Case 2 show that HFDF-EffNetV2 achieved 100% accuracy with diagnostic times of 18.67 millisecond (ms) and 17.56 ms, respectively, and 1.85 million (M) parameters. Under noisy conditions, average accuracies reached 98.19% and 85.68%, respectively. Additionally, the model performed well with small samples, yielding accuracies of 98.69% and 97.51%. These results highlight its superior robustness to noise and lightweight performance compared with other advanced models. Full article

(This article belongs to the Section Mechanical Engineering)

► Show Figures

Figure 1

19 pages, 10414 KB

Open AccessArticle

A Novel Low-Rank Embedded Latent Multi-View Subspace Clustering Approach

by Sen Wang, Lian Chen, Zhijian Liang and Qingyang Liu

Sensors 2025, 25(9), 2778; https://doi.org/10.3390/s25092778 - 28 Apr 2025

Cited by 2 | Viewed by 1713

Abstract

Noises and outliers often degrade the final prediction performance in practical data processing. Multi-view learning by integrating complementary information across heterogeneous modalities has become one of the core techniques in the field of machine learning. However, existing methods rely on explicit-view clustering and [...] Read more.

Noises and outliers often degrade the final prediction performance in practical data processing. Multi-view learning by integrating complementary information across heterogeneous modalities has become one of the core techniques in the field of machine learning. However, existing methods rely on explicit-view clustering and stringent alignment assumptions, which affect the effectiveness in addressing the challenges such as inconsistencies between views, noise interference, and misalignment across different views. To alleviate these issues, we present a latent multi-view representation learning model based on low-rank embedding by implicitly uncovering the latent consistency structure of data, which allows us to achieve robust and efficient multi-view feature fusion. In particular, we utilize low-rank constraints to construct a unified latent subspace representation and introduce an adaptive noise suppression mechanism that significantly enhances robustness against outliers and noise interference. Moreover, the Augmented Lagrangian Multiplier Alternating Direction Minimization (ALM-ADM) framework enables efficient optimization of the proposed method. Experimental results on multiple benchmark datasets demonstrate that the proposed approach outperforms existing state-of-the-art methods in both clustering performance and robustness. Full article

(This article belongs to the Special Issue Multi-Modal Data Sensing and Processing)

► Show Figures

Figure 1

Search Results (36)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (36)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI