Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (3,323)

Search Parameters:
Keywords = weight fusion

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 2428 KB  
Article
Reconfigurable Mobile Wireless Sensor Network Coordination for Simultaneous Multi-Target Tracking
by Naeimeh Najafizadeh Sari, Yeqi Sang, Goldie Nejat and Beno Benhabib
Robotics 2026, 15(7), 120; https://doi.org/10.3390/robotics15070120 (registering DOI) - 25 Jun 2026
Abstract
This paper presents a distributed coordination framework for simultaneous multi-target tracking using a mobile wireless sensor network (MWSN) based on discrete-event-system principles. The proposed framework employs a finite-state-machine architecture, where autonomous mobile sensors sequentially process detection and tracking events. Unlike passive tracking approaches [...] Read more.
This paper presents a distributed coordination framework for simultaneous multi-target tracking using a mobile wireless sensor network (MWSN) based on discrete-event-system principles. The proposed framework employs a finite-state-machine architecture, where autonomous mobile sensors sequentially process detection and tracking events. Unlike passive tracking approaches that react to target loss after it occurs, the proposed strategy implements predictive handover through Extended-Kalman-Filter-based uncertainty propagation. This enables sensors to anticipate target loss and to reposition auxiliary sensors in advance, acquiring targets along their predicted trajectories. A bidding-based allocation mechanism coordinates sensor assignments by evaluating four competing objectives: network preservation, spatial proximity to handover points, temporal mission feasibility, and estimation uncertainty. The proposed framework integrates four components: EKF-convergence-triggered proactive handover, multi-objective competitive bidding, distributed min–max conflict resolution, and fusion-driven proportional navigation. Unlike existing methods, auxiliary sensors navigate using confidence-weighted EKF estimates shared by neighboring sensors rather than their own measurements. An ablation study over ten Monte Carlo trials confirms that each component contributes independently, with EKF-based predictive triggering identified as the dominant performance driver. Full article
(This article belongs to the Section Sensors and Control in Robotics)
Show Figures

Graphical abstract

23 pages, 3991 KB  
Article
Enhancing Perception Through Context-Adaptive Visible and SWIR Image Fusion in Harsh Environments
by Alexandre Riffard, Mathieu Labussière, Pierre Duthon and Romuald Aufrère
Sensors 2026, 26(13), 4035; https://doi.org/10.3390/s26134035 (registering DOI) - 25 Jun 2026
Abstract
Robust perception in adverse weather conditions remains a significant challenge for autonomous vehicles. Short-wave infrared (SWIR) sensors offer specific physical properties that enable them to penetrate atmospheric disturbances like fog, rain, and snow. However, effectively combining this robustness with the textural and colour [...] Read more.
Robust perception in adverse weather conditions remains a significant challenge for autonomous vehicles. Short-wave infrared (SWIR) sensors offer specific physical properties that enable them to penetrate atmospheric disturbances like fog, rain, and snow. However, effectively combining this robustness with the textural and colour information of visible (VIS) cameras is difficult due to signal decorrelation and the limitations of static fusion schemes. To address this, we present VISWIR (Visible and SWIR Weighted Image Reconstruction), a pixel-level fusion method based on a multi-scale pyramid architecture. We introduce an automated strategy for scheduling parameters based on weather conditions using an optimisation framework. Rather than relying on static weights, our method applies offline parameter scheduling to adjust fusion hyperparameters based on the meteorological context. We focus on a multi-objective optimisation approach that maximises perceptual image quality via No-Reference Image Quality Assessment (NR-IQA) metrics. Validated in controlled environment scenarios with varying weather severities, our results confirm the potential of VISWIR as a robust, lightweight algorithmic baseline to enhance the perception capabilities of autonomous vehicles in adverse weather conditions. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

24 pages, 10388 KB  
Article
Adaptive Content and Style Fusion for Text-to-Image Generations
by Yi-Fang Lee, Chun-Chieh Lee, Chi-Hung Chuang, Chih-Lung Lin and Kuo-Chin Fan
Electronics 2026, 15(13), 2800; https://doi.org/10.3390/electronics15132800 (registering DOI) - 25 Jun 2026
Abstract
Text-to-image generation aims to produce images that match the semantic content of a text prompt. In style transfer tasks, the model must further integrate reference styles while preserving prompt semantics. However, balancing semantic consistency and style fidelity remains challenging. Existing methods commonly rely [...] Read more.
Text-to-image generation aims to produce images that match the semantic content of a text prompt. In style transfer tasks, the model must further integrate reference styles while preserving prompt semantics. However, balancing semantic consistency and style fidelity remains challenging. Existing methods commonly rely on fixed feature weights and lack adaptive control, which often leads to style over-injection and content distortion. To address these issues, we propose a novel framework that performs dynamic regulation at both the feature and temporal levels. At the feature level, we propose an Entropy-Aware Adaptive Fusion (EAAF) module. It incorporates a bidirectional distribution transformation mechanism to enhance the statistical correlation between content and style features. The module further uses information entropy as a dynamic control signal to adaptively adjust the strength of style injection, thereby achieving a balance between semantic consistency and style fidelity. At the temporal level, we design a Progressive Feature Reweighting (PFR) strategy. By applying stage-wise weighting to content and style features at different diffusion steps, this strategy effectively improves structural stability and color consistency. In addition, our framework is modular and can be integrated into existing diffusion-based style transfer models without additional fine-tuning or retraining. Experimental results demonstrate that applying our approach to current state-of-the-art models, such as StyleStudio and CSGO, significantly enhances their performance, particularly in maintaining strong prompt alignment while achieving high-fidelity style transfer. Full article
(This article belongs to the Special Issue Recent Advances in Object Detection and Computer Vision)
Show Figures

Figure 1

25 pages, 3468 KB  
Article
Confidence-Guided Fusion for Self-Supervised Monocular Depth Estimation in Endoscopy
by Shuang Li, Hongbo Wang, Zhaoxu Hu, Tian Chu, Yingping Li and Liang Zhao
Sensors 2026, 26(13), 4033; https://doi.org/10.3390/s26134033 (registering DOI) - 25 Jun 2026
Abstract
Accurate monocular depth estimation (MDE) is a foundational task in endoscopic surgery, critical for augmenting depth perception and aiding surgical navigation. While diffusion-based and discriminative depth estimators demonstrate complementary strengths, they also exhibit asymmetric errors: discriminative models yield precise geometric boundaries but struggle [...] Read more.
Accurate monocular depth estimation (MDE) is a foundational task in endoscopic surgery, critical for augmenting depth perception and aiding surgical navigation. While diffusion-based and discriminative depth estimators demonstrate complementary strengths, they also exhibit asymmetric errors: discriminative models yield precise geometric boundaries but struggle in homogeneous or saturated areas, whereas diffusion models recover fine textures at the cost of occasional structural incoherence. To systematically exploit this complementarity, we present CoDepth, a novel framework that leverages confidence-guided fusion to harmonize the outputs of these heterogeneous estimators. Its core components include a complementary map extractor that identifies structured disparity disagreements, a cross-attention module for context-aware feature integration, and a probabilistic confidence network that generates spatially adaptive fusion weights. Extensive evaluations on the SCARED dataset show that CoDepth achieves improved overall performance relative to strong single-model baselines, with the most consistent gains observed in Abs Rel and δ-based accuracy, while changes in some other error metrics are more modest. Furthermore, CoDepth exhibits encouraging cross-domain generalization. When a model trained on SCARED is directly evaluated on SERV-CT, Hamlyn, and C3VD without fine-tuning, it achieves competitive performance and improves several key metrics across datasets. The framework also demonstrates enhanced robustness against common synthetic corruptions like low-light conditions, Gaussian noise, and impulse noise, underscoring its practical utility in complex clinical settings. These results suggest that confidence-guided complementary fusion provides a practical integration-level paradigm for combining heterogeneous endoscopic depth estimators. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

19 pages, 6612 KB  
Article
Reproducible Industrial CT–to–Porosity Metrics with nnU-Net—A Weak Versus Strong Inference Benchmark on Cementitious Slices
by Youxi Wang, Chaowei Sun and Le Zhang
Buildings 2026, 16(13), 2518; https://doi.org/10.3390/buildings16132518 (registering DOI) - 25 Jun 2026
Abstract
Porosity-related quantities from industrial X-ray CT depend on segmentation and inference choices. When inference defaults are omitted from the report, void or phase fractions can shift by amounts comparable to slice-to-slice variability. The contribution is metrological rather than architectural: we document a reproducible [...] Read more.
Porosity-related quantities from industrial X-ray CT depend on segmentation and inference choices. When inference defaults are omitted from the report, void or phase fractions can shift by amounts comparable to slice-to-slice variability. The contribution is metrological rather than architectural: we document a reproducible nnU-Net 2D workflow on Dataset601 CTVoid from semantic labels to slice-wise void fraction, optional two-dimensional connected-component pore summaries, isotropic three-dimensional stacking at 0.058 mm spacing, and spatial axis diagnostics, with region of interest and voxel spacing stated explicitly. The main results pair a weak export policy, defined as a single forward pass per slice without multi-scale fusion or test-time augmentation, with a strong policy that enables multi-scale fusion and flip-based augmentation on the same slice exports and identical weights, on one hundred consecutive slices from one cementitious industrial stack of 1028 × 1028 pixels. In parallel we report trainer validation on eight named Dataset601 validation cases and mirroring-based test-time augmentation off versus on re-inference on those same cases; case identifiers and the cross-validation split appear in the main text. These quantities answer different questions and must not be substituted for one another or for independent full-stack ground truth. Porosity-related scalars from industrial X-ray CT depend on how segmentation and inference are configured; when defaults are omitted, void fractions can shift by amounts comparable to slice-to-slice variability. For fixed nnU-Net weights on one cementitious industrial slice stack (1028 × 1028 pixels), we benchmark weak inference (single forward pass, no multi-scale fusion or test-time augmentation) against a strong export policy (multi-scale fusion and flip-based augmentation) on 100 paired slices, and report parallel trainer validation and TTA-off versus TTA-on re-inference on eight Dataset601 hold-out cases. For the industrial dataset, mean void-class IoU between modes is 0.716 (SD 0.043), while strong inference is ~2.6× slower and predicts lower mean void area (2.37% vs. 3.04%). The full weak export gives a 3D void ratio of 2.44% and integrated void volume of 5175 mm3. On validation patches, mean void Dice/IoU against the reference are 0.835/0.728, while weak–strong void IoU reaches 0.924 under the nnU-Net-native TTA contrast—quantities that must not be interchanged across domains or definitions. The present benchmark does not include a systematic polymer dosage series, and the study does not equate semantic void with open porosity but provides a reproducible disclosure template relevant to porous and polymer-modified cementitious CT reporting. Full article
(This article belongs to the Section Building Materials, and Repair & Renovation)
Show Figures

Figure 1

31 pages, 2776 KB  
Article
A Multimodal Biomedical Transformer Fusion Network for Disease-Level Rare-Disease-Inheritance Classification Using Ontology-Enriched Text, Metadata, and Gene Associations
by Mahmood A. Mahmood and Khalaf Alsalem
Biomedicines 2026, 14(7), 1439; https://doi.org/10.3390/biomedicines14071439 (registering DOI) - 25 Jun 2026
Abstract
Background/Objectives: Inheritance classification in rare diseases remains challenging because curated knowledge is incomplete, heterogeneous, and imbalanced across inheritance categories. Disease-level inheritance modeling can support knowledge organization, annotation review, and hypothesis generation in rare-disease resources. This paper introduces RareFusion-Net, a multimodal benchmark framework for [...] Read more.
Background/Objectives: Inheritance classification in rare diseases remains challenging because curated knowledge is incomplete, heterogeneous, and imbalanced across inheritance categories. Disease-level inheritance modeling can support knowledge organization, annotation review, and hypothesis generation in rare-disease resources. This paper introduces RareFusion-Net, a multimodal benchmark framework for disease-level inheritance classification, and evaluates whether integrating ontology-enriched disease text, structured epidemiological metadata, and gene-association information improves prediction in curated rare-disease knowledge bases. RareFusion-Net is intended for knowledge modeling, not individual patient diagnosis. Methods: We developed RareFusionBalanced, a gated multimodal fusion model that combines biomedical disease descriptions, structured metadata, and gene-related information using auxiliary supervision. Ontology-enriched disease text was treated as the dominant semantic modality, while tabular and gene modalities were incorporated as complementary evidence when available. Robustness was improved using balanced regularization, selective transformer fine-tuning, dropout, weight decay, label smoothing, early stopping, and prediction aggregation across random seeds. Evaluation included accuracy, macro-F1, micro-F1, macro-AUC, mean average precision, calibration metrics, class-wise analysis, statistical testing, and ablation experiments. Results: RareFusionBalanced achieved 0.7382 test accuracy, 0.6284 macro-F1, 0.7382 micro-F1, 0.9183 macro-AUC, and 0.6686 mean average precision. Calibration was favorable, with an expected calibration error of 0.0395 and a Brier-OVR of 0.0528. The multimodal model slightly outperformed TextOnly-TransformerBalanced, but improvement over the best TF-IDF baseline was not statistically significant. Ablation showed ontology-enriched text as the strongest modality, with gene associations adding complementary value. Conclusions: RareFusion-Net provides a practical benchmark for ontology-aware rare-disease inheritance modeling. Results suggest selective multimodal benefit while highlighting minority-class difficulty, limited statistical superiority, need for external validation, and improved biological interpretability. Full article
Show Figures

Figure 1

24 pages, 8059 KB  
Article
Information-Theoretic Channel Selection and Spatiotemporal Deep Learning for Early Fault Detection in Microsatellite Thermal Control Systems
by Weijian Pang, Jun Zhou, Jingwen Xu and Xinian Zhi
Entropy 2026, 28(7), 725; https://doi.org/10.3390/e28070725 (registering DOI) - 24 Jun 2026
Abstract
Early fault detection in microsatellite thermal control systems (TCS) faces fundamental challenges: high-dimensional redundant telemetry channels, overlapping multi-scale periodicities that obscure anomaly signatures, and severely limited daily data downlink (1–2 passes per day) that restricts the temporal window for diagnosis. Existing data-driven approaches [...] Read more.
Early fault detection in microsatellite thermal control systems (TCS) faces fundamental challenges: high-dimensional redundant telemetry channels, overlapping multi-scale periodicities that obscure anomaly signatures, and severely limited daily data downlink (1–2 passes per day) that restricts the temporal window for diagnosis. Existing data-driven approaches either rely on supervised learning, requiring labeled fault data that are scarce in practice, or employ univariate analysis that fails to capture inter-sensor spatial correlations. To address these limitations, this paper introduces a hybrid framework integrating information-theoretic feature selection and spatiotemporal deep learning. The Generalized Maximum Information Coefficient (GMIC) quantifies nonlinear dependencies between temperature channels for key channel selection, reducing dimensionality by 82% while preserving diagnostic information. A dual-level Seasonal Trend Decomposition (STL) method disentangles orbital-periodic dynamics from diurnal cycles, effectively isolating distinct thermal characteristics at multiple timescales. Each decomposed component is modeled using Convolutional Neural Network–Long Short-Term Memory (CNN-LSTM) networks to capture spatiotemporal dependencies for accurate temperature prediction. An adaptive threshold-based weighted error fusion mechanism enables early fault detection within a single day of telemetry data. Experimental validation on real satellite telemetry data demonstrates that the proposed framework achieves high-precision fault detection across multiple fault types using a minimal set of temperature channels, significantly outperforming existing benchmarks in both prediction accuracy and detection reliability. Full article
(This article belongs to the Section Signal and Data Analysis)
15 pages, 710 KB  
Article
Soft-Gating Mixture Robust Kalman Filter for SINS/DVL Integrated Navigation Under DVL Outlier Interference
by Li Luo, Luyao Zhang, Congyi Yang and Tao Liu
J. Mar. Sci. Eng. 2026, 14(13), 1165; https://doi.org/10.3390/jmse14131165 (registering DOI) - 24 Jun 2026
Abstract
Aiming at the problem that complex underwater environments induce outliers in Doppler Velocity Log (DVL) measurements, which degrade the navigation accuracy of the Strapdown Inertial Navigation System (SINS)/DVL integrated system, this paper proposes a soft-gating Gaussian–Student’s t mixture robust Kalman filter (MRKF). Firstly, [...] Read more.
Aiming at the problem that complex underwater environments induce outliers in Doppler Velocity Log (DVL) measurements, which degrade the navigation accuracy of the Strapdown Inertial Navigation System (SINS)/DVL integrated system, this paper proposes a soft-gating Gaussian–Student’s t mixture robust Kalman filter (MRKF). Firstly, the measurement noise is modeled as a mixture of Gaussian and Student’s t distributions to adapt to normal stationary noise and abrupt outliers, respectively. Secondly, a logistic soft-gating weight is constructed based on the innovation Mahalanobis distance to adaptively balance the output contributions of the standard Kalman Filter (KF) and the variational Bayesian Student’s t filter. Finally, moment matching is adopted to realize the weighted fusion of two-branch posterior distributions, and an equivalent Gaussian posterior estimation is obtained. Simulation results under the considered SINS/DVL integrated navigation scenarios show that the proposed MRKF maintains estimation accuracy close to the standard KF under nominal Gaussian measurement noise. In the designed DVL outlier-injection scenario, the proposed MRKF achieves a position RMSE of 53.39m, compared with 878.75m, 58.84m, and 56.49m for the nominal KF, Huber KF (HKF), and Student’s-t variational Bayesian KF (STVBKF), respectively. These results indicate that the proposed MRKF can improve robustness against DVL outliers while maintaining competitive estimation accuracy under the simulated conditions. Full article
Show Figures

Graphical abstract

23 pages, 16049 KB  
Article
Deep Learning Image Steganography Based on Dual-Path Fusion in Frequency and Spatial Domains
by Xiang Meng, Yuexin Li, Wanjia Li, Yiliang Guo, Yanhua Dong and Hongyu Sun
Electronics 2026, 15(13), 2777; https://doi.org/10.3390/electronics15132777 (registering DOI) - 24 Jun 2026
Abstract
Contemporary deep learning-based image steganography techniques for embedding images within images are hindered by inadequate utilization of frequency-domain features and limited steganographic security, restricting their effectiveness in practical privacy protection contexts. To mitigate these limitations, we introduce a frequency–spatial dual-path fusion-based deep steganography [...] Read more.
Contemporary deep learning-based image steganography techniques for embedding images within images are hindered by inadequate utilization of frequency-domain features and limited steganographic security, restricting their effectiveness in practical privacy protection contexts. To mitigate these limitations, we introduce a frequency–spatial dual-path fusion-based deep steganography approach, termed FS-Stego. This method incorporates a frequency–spatial dual-path architecture within the generator network. Specifically, the frequency-domain processing module facilitates feature embedding in the complex domain, while the spatial-domain processing module maintains the image’s structural integrity, thereby enabling the co-optimization of multi-dimensional features. Second, an adaptive fusion module is developed to dynamically adjust the weights of the two paths, while residual connections and attention mechanisms are utilized to mitigate feature loss. Third, a multi-objective loss function is implemented to simultaneously optimize the quality of the stego images and the reconstruction accuracy of the secret images. The proposed method utilizes three open-source datasets as cover images and the LFW dataset as the secret images. Experimental results demonstrate that, compared to existing deep steganographic techniques, the stego and recovered images achieve superior peak signal-to-noise ratios (PSNR) and structural similarity (SSIM). Regarding model efficiency, the number of parameters is reduced to below 0.98 million, significantly enhancing practical performance. The proposed method ensures high-quality image recovery while maintaining steganographic security, thereby offering an effective solution for privacy protection. Full article
Show Figures

Figure 1

17 pages, 1431 KB  
Article
Adaptive Multi-Sensor Fusion for Robust Outdoor Localization and Path Tracking Under Weak GNSS Conditions
by Yanyan Dai, Subin Park and Kidong Lee
Electronics 2026, 15(13), 2768; https://doi.org/10.3390/electronics15132768 (registering DOI) - 23 Jun 2026
Viewed by 144
Abstract
Reliable outdoor localization is essential for autonomous mobile robots, where the Global Navigation Satellite System (GNSS) is widely used to provide global positioning information. However, GNSS signals are often degraded in real-world environments due to occlusions, multipath effects, and environmental interference, leading to [...] Read more.
Reliable outdoor localization is essential for autonomous mobile robots, where the Global Navigation Satellite System (GNSS) is widely used to provide global positioning information. However, GNSS signals are often degraded in real-world environments due to occlusions, multipath effects, and environmental interference, leading to unstable localization and degraded navigation performance. This paper proposes an adaptive multi-sensor fusion framework for robust outdoor localization and path tracking under weak GNSS conditions. The proposed system integrates GNSS, LiDAR, wheel odometry, and inertial measurement unit (IMU) measurements within an Extended Kalman Filter (EKF) framework. To address the limitations of GNSS, an adaptive weighting mechanism is introduced to dynamically adjust the influence of GNSS observations based on signal quality indicators. Furthermore, a GNSS quality-aware mode-switching strategy is developed, enabling seamless transition between GNSS-dominant localization and multi-sensor fusion-based localization. In the fusion mode, LiDAR, odometry, and IMU jointly provide robust pose estimation, while GNSS acts as a weak global constraint. The IMU further enhances heading estimation, improving orientation stability and path tracking performance. The estimated pose is then used for trajectory tracking using a path-following controller. Experimental results conducted in outdoor environments demonstrate that the proposed framework significantly improves localization robustness and path tracking performance under degraded GNSS conditions. Compared with raw GNSS localization, the proposed method reduces the mean localization error by 47.2% and decreases the root mean square localization error by 55.5%, while maintaining smoother and more continuous trajectory estimation in weak GNSS environments. Full article
(This article belongs to the Special Issue Nonlinear Analysis and Control of Electronic Systems)
Show Figures

Figure 1

18 pages, 8474 KB  
Article
Dual-Pathway Wavelet-Attention Framework for Image-Only AI-Generated Image Quality Assessment
by Yang Li, Yu Zheng and Dong Sui
Mathematics 2026, 14(13), 2249; https://doi.org/10.3390/math14132249 (registering DOI) - 23 Jun 2026
Viewed by 91
Abstract
AI-generated images (AIGIs) often contain perceptual defects that differ from the distortions commonly studied in conventional no-reference image quality assessment (NR-IQA). This work investigates image-only AIGC image quality assessment, where no prompt text is used and the quality score must be inferred from [...] Read more.
AI-generated images (AIGIs) often contain perceptual defects that differ from the distortions commonly studied in conventional no-reference image quality assessment (NR-IQA). This work investigates image-only AIGC image quality assessment, where no prompt text is used and the quality score must be inferred from visual evidence such as artifacts, structure, and semantic plausibility. We propose a dual-pathway wavelet-attention framework built on a Swin Transformer V2-Base backbone. The artifact pathway employs a Noise Perceptive Attention Module (NPAM) with fixed Haar wavelet decomposition to describe generation-related sub-band degradation cues, whereas the image-perception pathway models semantic, structural, and contextual quality evidence using multi-scale attention, global–local spatial-channel attention, and pyramid pooling. The two pathways are integrated through adaptive fusion and a spatially weighted regression head with an auxiliary global prediction. Experiments on AGIQA-1K, AGIQA-3K, and AIGCIQA2023 demonstrate competitive in-domain performance, including SRCC values of 0.8418 on AGIQA-3K and 0.8445 on the quality dimension of AIGCIQA2023. The evaluation further covers individual module ablations, score-fusion variants, seed stability, qualitative error analysis, and cross-database transfer, revealing both the contribution of the proposed components and the remaining difficulty of source-disjoint generalization. Full article
(This article belongs to the Section E1: Mathematics and Computer Science)
29 pages, 4629 KB  
Article
Asymmetric Spectral Filtering and Behavior-Guided Graph Convolution for Multimodal Recommendation
by Ganglong Duan, Yi Yao, Zhiqiang Ji, Tianqiao Gong and Jun Yan
Electronics 2026, 15(13), 2764; https://doi.org/10.3390/electronics15132764 (registering DOI) - 23 Jun 2026
Viewed by 73
Abstract
Multimodal recommender systems are challenged by heterogeneous modality noise and coarse-grained feature fusion. Specifically, existing frequency-domain methods typically apply symmetric filtering across modalities, ignoring their distinct spectral characteristics. Consequently, symmetric filtering cannot simultaneously satisfy the denoising requirements of visual features and the semantic [...] Read more.
Multimodal recommender systems are challenged by heterogeneous modality noise and coarse-grained feature fusion. Specifically, existing frequency-domain methods typically apply symmetric filtering across modalities, ignoring their distinct spectral characteristics. Consequently, symmetric filtering cannot simultaneously satisfy the denoising requirements of visual features and the semantic preservation requirements of textual features, leading to suboptimal multimodal representations. Meanwhile, current fusion strategies mainly operate at the instance level with static modality weights, lacking flexibility to dynamically adjust feature channels for user-specific collaborative contexts. To address these issues, this paper proposes MFA-GCN, a multimodal recommendation framework that combines asymmetric spectral filtering, multiview graph enhancement, and behavior-guided channel attention. For visual modalities, a multiscale frequency-domain module integrating 1D convolution and self-attention is adopted to suppress high-frequency disturbances while preserving informative structures. For textual modalities, a lightweight complex-domain scaling strategy is introduced to adjust spectral energy while maintaining semantic consistency. In addition, auxiliary user–user and item–item graphs are constructed to supplement sparse user–item interactions and provide richer collaborative signals. A behavior-guided channel attention mechanism is further used to dynamically refine multimodal representations. Experiments on three public Amazon datasets demonstrate that MFA-GCN consistently outperforms several representative baselines. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

18 pages, 2423 KB  
Article
Flexible Light Field Reconstruction: Enabling Arbitrary Sampling and Angular Resolution
by Xia Liu, Junzhen Ye, Zhangmin Wu and Qiang Fu
Electronics 2026, 15(13), 2763; https://doi.org/10.3390/electronics15132763 (registering DOI) - 23 Jun 2026
Viewed by 56
Abstract
Compared with hardware-dependent methods, light field (LF) reconstruction algorithms enable a more economical and convenient acquisition of densely sampled LF (DSLF). Existing learning-based LF reconstruction methods suffer from limited flexibility, as they rely on fixed sampling patterns and predefined angular resolutions. In this [...] Read more.
Compared with hardware-dependent methods, light field (LF) reconstruction algorithms enable a more economical and convenient acquisition of densely sampled LF (DSLF). Existing learning-based LF reconstruction methods suffer from limited flexibility, as they rely on fixed sampling patterns and predefined angular resolutions. In this paper, we propose a flexible deep learning framework, which can reconstruct DSLF with arbitrary angular resolution from randomly distributed sparse input views of an arbitrary quantity. The proposed framework consists of two core stages, namely the SAI Synthesis and the LF Refinement. The SAI Synthesis adopts Plane Sweep Volume (PSV) to cope with randomly sampled input views, and leverages the Multi-Scale Attention (MSA) module to compute per-view weights for adaptive feature fusion and support arbitrary numbers of input views. The LF Refinement stage integrates intermediate results and fully exploits LF parallax structures to further improve reconstruction quality. Experimental results demonstrate that our method achieves superior flexibility and reconstruction quality, and outperforms most state-of-the-art LF reconstruction methods. Full article
(This article belongs to the Special Issue Computer Vision and Image Processing in Machine Learning)
23 pages, 5400 KB  
Article
A Gearbox Fault Diagnosis Method for Small-Sample Conditions Based on Physics-Informed and Multi-Scale Graph Learning
by Peng Chen, Yazhou Zhang and Jintao Xu
Processes 2026, 14(13), 2035; https://doi.org/10.3390/pr14132035 (registering DOI) - 23 Jun 2026
Viewed by 125
Abstract
Existing intelligent fault diagnosis methods ignore the influence of sensors at different positions on the model fault diagnosis performance. Furthermore, the lack of interpretability leads to insufficient reliability of the model fault diagnosis results. Therefore, a physics-informed multi-sensor information fusion method for gearbox [...] Read more.
Existing intelligent fault diagnosis methods ignore the influence of sensors at different positions on the model fault diagnosis performance. Furthermore, the lack of interpretability leads to insufficient reliability of the model fault diagnosis results. Therefore, a physics-informed multi-sensor information fusion method for gearbox fault diagnosis is proposed. The method consists of a physics-informed shallow feature extraction module, a hierarchical multi-scale graph learning module, and an adaptive feature fusion module. The shallow feature extraction module is composed of Laplacian convolution. Multi-scale Laplacian convolution kernels are used to capture multi-frequency and multi-scale feature information, enriching fault representations. The hierarchical multi-scale graph learning module adopts graph convolutional neural networks to conduct deep multi-sensor fault feature extraction for generating high-level features. The adaptive feature fusion module realizes the weighting of important sensor data and the suppression of redundant information through attention scores. This method is validated on two gearbox datasets. The results show that when applied to the SEU dataset, the proposed method achieves a diagnosis accuracy 5.8% higher than that of the state-of-the-art method (MIFNet) under small-sample conditions. In noisy environments, the proposed method achieves an average diagnostic accuracy 1.8% higher than that of the state-of-the-art method (LiConvFormer). This indicates that the proposed method exhibits superior fault diagnosis performance and can effectively handle fault diagnosis tasks under small-sample conditions and in noisy environments. Full article
(This article belongs to the Special Issue Fault Diagnosis Technology in Machinery Manufacturing)
Show Figures

Figure 1

22 pages, 2359 KB  
Article
Occlusion-Aware Face Recognition via Adaptive Local Feature Fusion and Identity-Guided Contrastive Learning
by Kexin Zhu and Guoqing Ma
Sensors 2026, 26(13), 3977; https://doi.org/10.3390/s26133977 (registering DOI) - 23 Jun 2026
Viewed by 211
Abstract
Partial occlusion can substantially impair the accuracy and stability of face recognition systems. Although existing methods perform well on unobstructed face images, their performance often degrades under partial occlusion, because occluded regions may obscure discriminative identity cues and introduce noise into feature representations. [...] Read more.
Partial occlusion can substantially impair the accuracy and stability of face recognition systems. Although existing methods perform well on unobstructed face images, their performance often degrades under partial occlusion, because occluded regions may obscure discriminative identity cues and introduce noise into feature representations. To address this issue, this paper proposes an occlusion-aware face recognition framework that integrates lightweight feature extraction, local-region reliability modeling, adaptive feature fusion, and joint loss optimization. Specifically, the face is divided into three sub-regions according to common occlusion patterns, and an MLP-based module is used to estimate the reliability of each region. The estimated reliability weights are then used to adaptively fuse local features, thereby emphasizing visible and discriminative regions. In addition, a joint loss combining ArcFace and InfoNCE is constructed to enhance inter-class separability and intra-class feature consistency. Experimental results under masks, hats, sunglasses, and random occlusion conditions show that the proposed method achieves a recognition accuracy of 92.3%. Compared with ArcFace, CurricularFace, and AdaFace, the proposed method improves accuracy by 9.9%, 6.5%, and 4.1%, respectively. In addition, the FAR is reduced by 5.8%, 4.9%, and 3.7%, respectively, while the FRR is reduced by 2.2%, 6.5%, and 1.3%, respectively. These results demonstrate that the proposed framework effectively enhances the robustness of face recognition under partial occlusion. Full article
(This article belongs to the Section Intelligent Sensors)
Show Figures

Figure 1

Back to TopTop