Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (971)

Search Parameters:
Keywords = hierarchical feature learning

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 2511 KB  
Article
Adversarial and Hierarchical Distribution Alignment Network for Nonintrusive Load Monitoring
by Haozhe Xiong, Daojun Tan, Yuxuan Hu, Xuan Cai and Pan Hu
Electronics 2026, 15(3), 655; https://doi.org/10.3390/electronics15030655 - 2 Feb 2026
Abstract
Nonintrusive Load Monitoring (NILM) models often suffer from significant performance degradation when deployed across different households and datasets, primarily because of distribution discrepancies. To address this challenge, this study proposes an adversarial hierarchical distribution alignment unsupervised domain adaptation network for nonintrusive load disaggregation. [...] Read more.
Nonintrusive Load Monitoring (NILM) models often suffer from significant performance degradation when deployed across different households and datasets, primarily because of distribution discrepancies. To address this challenge, this study proposes an adversarial hierarchical distribution alignment unsupervised domain adaptation network for nonintrusive load disaggregation. The network aims to reduce the distribution divergence between the source and target domains in both the feature and label spaces, enabling effective adaptation to transfer learning scenarios in which the source domain has limited labeled data and the target domain has abundant unlabeled data. The proposed method integrates adversarial training with a hierarchical distribution alignment strategy that uses Correlation Alignment (CORAL) to align global marginal distributions. It employs Multi-Kernel Maximum Mean Discrepancy (MK-MMD) to constrain the conditional distributions of individual appliances, thereby enhancing cross-domain generalization. Extensive experiments on three public datasets demonstrate that, in both in-domain and cross-domain settings, the proposed method consistently reduces Mean Absolute Error (MAE) and Signal Aggregation Error (SAE), outperforming baseline approaches in cross-domain generalization. Full article
24 pages, 3790 KB  
Article
An Edge-Deployable Lightweight Intrusion Detection System for Industrial Control
by Zhenxiong Zhang, Lei Zhang, Jialong Xu, Zhengze Chen and Peng Wang
Electronics 2026, 15(3), 644; https://doi.org/10.3390/electronics15030644 - 2 Feb 2026
Abstract
Industrial Control Systems (ICSs), critical to infrastructure, face escalating cyber threats under Industry 4.0, yet existing intrusion detection methods are hindered by attack sample scarcity, spatiotemporal heterogeneity of industrial protocols, and resource constraints of embedded devices. This paper proposes a four-stage closed-loop intrusion [...] Read more.
Industrial Control Systems (ICSs), critical to infrastructure, face escalating cyber threats under Industry 4.0, yet existing intrusion detection methods are hindered by attack sample scarcity, spatiotemporal heterogeneity of industrial protocols, and resource constraints of embedded devices. This paper proposes a four-stage closed-loop intrusion detection framework for ICSs, with its core innovations integrating the following key components: First, a protocol-conditioned Conditional Generative Adversarial Network (CTGAN) is designed to synthesize realistic attack traffic by enforcing industrial protocol constraints and validating syntax through dual-path discriminators, ensuring generated traffic adheres to protocol specifications. Second, a three-tiered sliding window encoder transforms raw network flows into structured RGB images, capturing protocol syntax, device states, and temporal autocorrelation to enable multiresolution spatiotemporal analysis. Third, an Efficient Multiscale Attention Visual State Space Model (EMA-VSSM) is developed by integrating gate-enhanced state-space layers with multiscale attention mechanisms and contrastive learning, enhancing threat detection through improved long-range dependency modeling and spatial–temporal correlation capture. Finally, a lightweight EMA-VSSM student model, developed via hierarchical distillation, achieves a model compression rate of 64.8% and an inference efficiency enhancement of approximately 30% relative to the original model. Experimental results on a real-world ICS dataset demonstrate that this lightweight model attains an accuracy of 98.20% with a False Negative Rate (FNR) of 0.0316, outperforming state-of-the-art baseline methods such as XGBoost and Swin Transformer. By effectively balancing protocol compliance, multi-resolution feature extraction, and computational efficiency, this framework enables real-time deployment on resource-constrained ICS controllers. Full article
Show Figures

Figure 1

34 pages, 2320 KB  
Article
Research on a Computing First Network Based on Deep Reinforcement Learning
by Qianwen Xu, Jingchao Wang, Shuangyin Ren, Zhongbo Li and Wei Gao
Electronics 2026, 15(3), 638; https://doi.org/10.3390/electronics15030638 - 2 Feb 2026
Abstract
The joint optimization of computing resources and network routing constitutes a central challenge in Computing First Networks (CFNs). However, existing research has predominantly focused on computation offloading decisions, whereas the cooperative optimization of computing power and network routing remains underexplored. Therefore, this study [...] Read more.
The joint optimization of computing resources and network routing constitutes a central challenge in Computing First Networks (CFNs). However, existing research has predominantly focused on computation offloading decisions, whereas the cooperative optimization of computing power and network routing remains underexplored. Therefore, this study investigates the joint routing optimization problem within the CFN framework. We first propose a computing resource scheduling architecture for CFN, termed SICRSA, which integrates Software-Defined Networking (SDN) and Information-Centric Networking (ICN). Building upon this architecture, we further introduce an ICN-based hierarchical naming scheme for computing services, design a computing service request packet format that extends the IP header, and detail the corresponding service request identification process and workflow. Furthermore, we propose Computing-Aware Routing via Graph and Long-term Dependency Learning (CRGLD), a Graph Neural Network (GNN), and Long Short-Term Memory (LSTM)-based routing optimization algorithm, within the SICRSA framework to address the computing-aware routing (CAR) problem. The algorithm incorporates a decision-making framework grounded in spatiotemporal feature learning, thereby enabling the joint and coordinated selection of computing nodes and transmission paths. Simulation experiments conducted on real-world network topologies demonstrate that CRGLD enhances both the quality of service and the intelligence of routing decisions in dynamic network environments. Moreover, CRGLD exhibits strong generalization capability when confronted with unfamiliar topologies and topological changes, effectively mitigating the poor generalization performance typical of traditional Deep Reinforcement Learning (DRL)-based routing models in dynamic settings. Full article
Show Figures

Figure 1

24 pages, 2559 KB  
Article
A Symmetric Encoder–Decoder Network with Enhanced Group–Shuffle Modules for Robust Lung Nodule Detection in CT Scans
by Mohammad A. Thanoon, Siti Raihanah Abdani, Ahmad Asrul Ibrahim, Asraf Mohamed Moubark, Nor Azwan Mohamed Kamari, Muhammad Ammirrul Atiqi Mohd Zainuri, Mohd Hairi Mohd Zaman and Mohd Asyraf Zulkifley
Biomimetics 2026, 11(2), 92; https://doi.org/10.3390/biomimetics11020092 (registering DOI) - 1 Feb 2026
Abstract
Lung cancer is considered to be a significant cause of death in the world, and the timely identification of nodules in the lungs in CT scans is very important to enhance the prognosis of patients. Although the state of the art of nodule [...] Read more.
Lung cancer is considered to be a significant cause of death in the world, and the timely identification of nodules in the lungs in CT scans is very important to enhance the prognosis of patients. Although the state of the art of nodule delineation using deep learning-based segmentation models was achieved, major problems, including high feature diversity, low spatial discrimination, and overfitting of the models, require stronger feature-processing approaches. This research explores an enhanced symmetric encoder–decoder segmentation network known as the Improved Group–Shuffle Module (IGSM) to overcome these shortcomings. The most important feature of the proposed method is the IGSM, which hierarchically divides feature maps into a few groups, then transforms them independently, and then randomly switches channels between groups to increase inter-group interaction of features and diversity. This IGSM method is inspired by human brain functions, which are processed in specialized cortex areas, which are mimicked in this work through small-group feature processing. Channel shuffling is designed based on inter-modular communication in the human brain through coherent information sharing among the small groups of cortices. Through this mechanism, the model is much better at capturing discriminative spatial and contextual patterns, especially on complex and subtle nodule structures. The IGSM configurations have been optimized, specifically, the placement of the modules, grouping size, and shuffle permutation strategies. The proposed model’s performance is then compared with the benchmarked models, like U-Net and DeepLab, with various performance indicators such as mean Intersection over Union (mIoU), Dice Score, Accuracy, Sensitivity, and Specificity. The simulation results proved the superiority of the IGSM-enhanced model with the mIoU of 0.7735, the Dice Score of 0.9665, and the Accuracy of 0.9873. The addition of the group and shuffle module not only enhances the discrimination between the nodules and their background, but it also improves the ability to generalize over a variety of nodules’ morphology, thus producing a reliable tool for automated detection of lung cancer. Full article
Show Figures

Figure 1

18 pages, 10981 KB  
Article
Ensemble Entropy with Adaptive Deep Fusion for Short-Term Power Load Forecasting
by Yiling Wang, Yan Niu, Xuejun Li, Xianglong Dai, Xiaopeng Wang, Yong Jiang, Chenghu He and Li Zhou
Entropy 2026, 28(2), 158; https://doi.org/10.3390/e28020158 - 31 Jan 2026
Viewed by 43
Abstract
Accurate power load forecasting is crucial for ensuring the safety and economic operation of power systems. However, the complex, non-stationary, and heterogeneous nature of power load data presents significant challenges for traditional prediction methods, particularly in capturing instantaneous dynamics and effectively fusing multi-feature [...] Read more.
Accurate power load forecasting is crucial for ensuring the safety and economic operation of power systems. However, the complex, non-stationary, and heterogeneous nature of power load data presents significant challenges for traditional prediction methods, particularly in capturing instantaneous dynamics and effectively fusing multi-feature information. This paper proposes a novel framework—Ensemble Entropy with Adaptive Deep Fusion (EEADF)—for short-term multi-feature power load forecasting. The framework introduces an ensemble instantaneous entropy extraction module to compute and fuse multiple entropy types (approximate, sample, and permutation entropies) in real-time within sliding windows, creating a sensitive representation of system states. A task-adaptive hierarchical fusion mechanism is employed to balance computational efficiency and model expressivity. For time-series forecasting tasks with relatively structured patterns, feature concatenation fusion is used that directly combines LSTM sequence features with multimodal entropy features. For complex multimodal understanding tasks requiring nuanced cross-modal interactions, multi-head self-attention fusion is implemented that dynamically weights feature importance based on contextual relevance. A dual-branch deep learning model is constructed that processes both raw sequences (via LSTM) and extracted entropy features (via MLP) in parallel. Extensive experiments on a carefully designed simulated multimodal dataset demonstrate the framework’s robustness in recognizing diverse dynamic patterns, achieving MSE of 0.0125, MAE of 0.0794, and R² of 0.9932. Validation on the real-world ETDataset for power load forecasting confirms that the proposed method significantly outperforms baseline models (LSTM, TCN, transformer, and informer) and traditional entropy methods across standard evaluation metrics (MSE, MAE, RMSE, MAPE, and R²). Ablation studies further verify the critical roles of both the entropy features and the fusion mechanism. Full article
(This article belongs to the Section Multidisciplinary Applications)
23 pages, 2605 KB  
Article
Depression Detection on Social Media Using Multi-Task Learning with BERT and Hierarchical Attention: A DSM-5-Guided Approach
by Haichao Jin and Lin Zhang
Electronics 2026, 15(3), 598; https://doi.org/10.3390/electronics15030598 - 29 Jan 2026
Viewed by 157
Abstract
Depression represents a major global health challenge, yet traditional clinical diagnosis faces limitations, including high costs, limited coverage, and low patient willingness. Social media platforms provide new opportunities for early depression screening through user-generated content. However, existing methods often lack systematic integration of [...] Read more.
Depression represents a major global health challenge, yet traditional clinical diagnosis faces limitations, including high costs, limited coverage, and low patient willingness. Social media platforms provide new opportunities for early depression screening through user-generated content. However, existing methods often lack systematic integration of clinical knowledge and fail to leverage multi-modal information comprehensively. We propose a DSM-5-guided methodology that systematically maps clinical diagnostic criteria to computable social media features across three modalities: textual semantics (BERT-based deep semantic extraction), behavioral patterns (temporal activity analysis), and topic distributions (LDA-based cognitive bias identification). We design a hierarchical architecture integrating BERT, Bi-LSTM, hierarchical attention, and multi-task learning to capture both character-level and post-level importance while jointly optimizing depression classification, symptom recognition, and severity assessment. Experiments on the WU3D dataset (32,570 users, 2.19 million posts) demonstrate that our model achieves 91.8% F1-score, significantly outperforming baseline methods (BERT: 85.6%, TextCNN: 78.6%, and SVM: 72.1%) and large language models (GPT-4 few-shot: 86.9%). Ablation studies confirm that each component contributes meaningfully with synergistic effects. The model provides interpretable predictions through attention visualization and outputs fine-grained symptom assessments aligned with DSM-5 criteria. With low computational cost (~50 ms inference time), local deployability, and superior privacy protection, our approach offers significant practical value for large-scale mental health screening applications. This work demonstrates that domain-specialized methods with explicit clinical knowledge integration remain highly competitive in the era of general-purpose large language models. Full article
Show Figures

Figure 1

31 pages, 22732 KB  
Article
Binocular Rivalry and Fusion-Inspired Hierarchical Complementary Ensemble for No-Reference Stereoscopic Image Quality Assessment
by Yiling Tang, Shunliang Jiang, Shaoping Xu, Jian Xiao and Haiwen Yu
Sensors 2026, 26(3), 883; https://doi.org/10.3390/s26030883 - 29 Jan 2026
Viewed by 88
Abstract
No-reference stereoscopic image quality assessment (NR-SIQA) remains a fundamental challenge due to the complex biological mechanisms of binocular rivalry and fusion, particularly under asymmetric distortions. In this paper, we propose a novel framework termed Multi-Stage Complementary Ensemble (MSCE). The core innovation lies in [...] Read more.
No-reference stereoscopic image quality assessment (NR-SIQA) remains a fundamental challenge due to the complex biological mechanisms of binocular rivalry and fusion, particularly under asymmetric distortions. In this paper, we propose a novel framework termed Multi-Stage Complementary Ensemble (MSCE). The core innovation lies in the Adaptive Selective Propagation (ASP) strategy embedded within a hierarchical Transformer architecture to dynamically regulates the fusion of binocular features. Specifically, by simulating the human visual system’s transition from binocular rivalry to fusion, the ASP strategy applies nonlinear gain control to selectively reinforce features from the governing view based on binocular discrepancies. Furthermore, the proposed Hierarchical Complementary Fusion (HCF) module effectively captures and integrates low-level texture integrity, mid-level structural degradation, and high-level semantic consistency, leveraging ensemble learning principles, within a unified quality-aware manifold. Experimental results on four benchmark datasets demonstrate that the MSCE framework achieves state-of-the-art performance, particularly in terms of prediction consistency under complex asymmetric distortions. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

28 pages, 29386 KB  
Article
Dual-Scale Pixel Aggregation Transformer for Change Detection in Multitemporal Remote Sensing Images
by Kai Zhang, Ziqing Wan, Xue Zhao, Feng Zhang, Ke Liu and Jiande Sun
Remote Sens. 2026, 18(3), 422; https://doi.org/10.3390/rs18030422 - 28 Jan 2026
Viewed by 208
Abstract
Transformers have recently been applied to change detection (CD) of multitemporal remote sensing images because of their ability to model global information. However, the rigid patch partitioning in vanilla self-attention destroys spatial structures and consistency in observed scenes, leading to limited CD performance. [...] Read more.
Transformers have recently been applied to change detection (CD) of multitemporal remote sensing images because of their ability to model global information. However, the rigid patch partitioning in vanilla self-attention destroys spatial structures and consistency in observed scenes, leading to limited CD performance. In this paper, we propose a novel dual-scale pixel aggregation transformer (DSPA-Former) to mitigate this issue. The core of DSPA-Former lies in a dynamic superpixel tokenization strategy and bidirectional dual-scale interaction within the learned feature space, which preserves semantic integrity while capturing long-range dependencies. Specifically, we design a hierarchical decoder that integrates multiscale features through specialized mechanisms for pixel superpixel dialogue, guided feature enhancement, and adaptive multiscale fusion. By modeling the homogeneous properties of spatial information via superpixel segmentation, DSPA-Former effectively maintains structural consistency and sharpens change boundaries. Comprehensive experiments on the LEVIR-CD, WHU-CD, and CLCD datasets demonstrate that DSPA-Former achieves superior performance compared to state-of-the-art methods, particularly in preserving the structural integrity of complex change regions. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Figure 1

26 pages, 1315 KB  
Article
SFD-ADNet: Spatial–Frequency Dual-Domain Adaptive Deformation for Point Cloud Data Augmentation
by Jiacheng Bao, Lingjun Kong and Wenju Wang
J. Imaging 2026, 12(2), 58; https://doi.org/10.3390/jimaging12020058 - 26 Jan 2026
Viewed by 168
Abstract
Existing 3D point cloud enhancement methods typically rely on artificially designed geometric transformations or local blending strategies, which are prone to introducing illogical deformations, struggle to preserve global structure, and exhibit insufficient adaptability to diverse degradation patterns. To address these limitations, this paper [...] Read more.
Existing 3D point cloud enhancement methods typically rely on artificially designed geometric transformations or local blending strategies, which are prone to introducing illogical deformations, struggle to preserve global structure, and exhibit insufficient adaptability to diverse degradation patterns. To address these limitations, this paper proposes SFD-ADNet—an adaptive deformation framework based on a dual spatial–frequency domain. It achieves 3D point cloud augmentation by explicitly learning deformation parameters rather than applying predefined perturbations. By jointly modeling spatial structural dependencies and spectral features, SFD-ADNet generates augmented samples that are both structurally aware and task-relevant. In the spatial domain, a hierarchical sequence encoder coupled with a bidirectional Mamba-based deformation predictor captures long-range geometric dependencies and local structural variations, enabling adaptive position-aware deformation control. In the frequency domain, a multi-scale dual-channel mechanism based on adaptive Chebyshev polynomials separates low-frequency structural components from high-frequency details, allowing the model to suppress noise-sensitive distortions while preserving the global geometric skeleton. The two deformation predictions dynamically fuse to balance structural fidelity and sample diversity. Extensive experiments conducted on ModelNet40-C and ScanObjectNN-C involved synthetic CAD models and real-world scanned point clouds under diverse perturbation conditions. SFD-ADNet, as a universal augmentation module, reduces the mCE metrics of PointNet++ and different backbone networks by over 20%. Experiments demonstrate that SFD-ADNet achieves state-of-the-art robustness while preserving critical geometric structures. Furthermore, models enhanced by SFD-ADNet demonstrate consistently improved robustness against diverse point cloud attacks, validating the efficacy of adaptive space-frequency deformation in robust point cloud learning. Full article
(This article belongs to the Special Issue 3D Image Processing: Progress and Challenges)
Show Figures

Figure 1

23 pages, 3554 KB  
Article
Hybrid Mechanism–Data-Driven Modeling for Crystal Quality Prediction in Czochralski Process
by Duqiao Zhao, Junchao Ren, Xiaoyan Du, Yixin Wang and Dong Ding
Crystals 2026, 16(2), 86; https://doi.org/10.3390/cryst16020086 - 25 Jan 2026
Viewed by 160
Abstract
The V/G criterion is a critical indicator for monitoring dynamic changes during Czochralski silicon single crystal (Cz-SSC) growth. However, the inability to measure it in real time forces reliance on offline feedback for process regulation, leading to imprecise control and compromised crystal quality. [...] Read more.
The V/G criterion is a critical indicator for monitoring dynamic changes during Czochralski silicon single crystal (Cz-SSC) growth. However, the inability to measure it in real time forces reliance on offline feedback for process regulation, leading to imprecise control and compromised crystal quality. To overcome this limitation, this paper proposes a novel soft sensor modeling framework that integrates both mechanism-based knowledge and data-driven learning for the real-time prediction of the crystal quality parameter, specifically the V/G value (the ratio of growth rate to axial temperature gradient). The proposed approach constructs a hybrid prediction model by combining a data-driven sub-model with a physics-informed mechanism sub-model. The data-driven component is developed using an attention-based dynamic stacked enhanced autoencoder (AD-SEAE) network, where the SEAE structure introduces layer-wise reconstruction operations to mitigate information loss during hierarchical feature extraction. Furthermore, an attention mechanism is incorporated to dynamically weigh historical and current samples, thereby enhancing the temporal representation of process dynamics. In addition, a robust ensemble approach is achieved by fusing the outputs of two subsidiary models using an adaptive weighting strategy based on prediction accuracy, thereby enabling more reliable V/G predictions under varying operational conditions. Experimental validation using actual industrial Cz-SSC production data demonstrates that the proposed method achieves high-prediction accuracy and effectively supports real-time process optimization and quality monitoring. Full article
(This article belongs to the Section Industrial Crystallization)
Show Figures

Figure 1

20 pages, 1369 KB  
Article
Symmetry-Aware Interpretable Anomaly Alarm Optimization Method for Power Monitoring Systems Based on Hierarchical Attention Deep Reinforcement Learning
by Zepeng Hou, Qiang Fu, Weixun Li, Yao Wang, Zhengkun Dong, Xianlin Ye, Xiaoyu Chen and Fangyu Zhang
Symmetry 2026, 18(2), 216; https://doi.org/10.3390/sym18020216 - 23 Jan 2026
Viewed by 277
Abstract
With the rapid advancement of smart grids driven by renewable energy integration and the extensive deployment of supervisory control and data acquisition (SCADA) and phasor measurement units (PMUs), addressing the escalating alarm flooding via intelligent analysis of large-scale alarm data is pivotal to [...] Read more.
With the rapid advancement of smart grids driven by renewable energy integration and the extensive deployment of supervisory control and data acquisition (SCADA) and phasor measurement units (PMUs), addressing the escalating alarm flooding via intelligent analysis of large-scale alarm data is pivotal to safeguarding the safe and stable operation of power grids. To tackle these challenges, this study introduces a pioneering alarm optimization framework based on symmetry-driven crowdsourced active learning and interpretable deep reinforcement learning (DRL). Firstly, an anomaly alarm annotation method integrating differentiated crowdsourcing and active learning is proposed to mitigate the inherent asymmetry in data distribution. Secondly, a symmetrically structured DRL-based hierarchical attention deep Q-network is designed with a dual-path encoder to balance the processing of multi-scale alarm features. Finally, a SHAP-driven interpretability framework is established, providing global and local attribution to enhance decision transparency. Experimental results on a real-world power alarm dataset demonstrate that the proposed method achieves a Fleiss’ Kappa of 0.82 in annotation consistency and an F1-Score of 0.95 in detection performance, significantly outperforming state-of-the-art baselines. Additionally, the false positive rate is reduced to 0.04, verifying the framework’s effectiveness in suppressing alarm flooding while maintaining high recall. Full article
(This article belongs to the Special Issue Symmetry and Asymmetry in Data Analysis)
Show Figures

Figure 1

45 pages, 1326 KB  
Article
Cross-Domain Deep Reinforcement Learning for Real-Time Resource Allocation in Transportation Hubs: From Airport Gates to Seaport Berths
by Zihao Zhang, Qingwei Zhong, Weijun Pan, Yi Ai and Qian Wang
Aerospace 2026, 13(1), 108; https://doi.org/10.3390/aerospace13010108 - 22 Jan 2026
Viewed by 122
Abstract
Efficient resource allocation is critical for transportation hub operations, yet current scheduling systems require substantial domain-specific customization when deployed across different facilities. This paper presents a domain-adaptive deep reinforcement learning (DADRL) framework that learns transferable optimization policies for dynamic resource allocation across structurally [...] Read more.
Efficient resource allocation is critical for transportation hub operations, yet current scheduling systems require substantial domain-specific customization when deployed across different facilities. This paper presents a domain-adaptive deep reinforcement learning (DADRL) framework that learns transferable optimization policies for dynamic resource allocation across structurally similar transportation scheduling problems. The framework integrates dual-level heterogeneous graph attention networks for separating constraint topology from domain-specific features, hypergraph-based constraint modeling for capturing high-order dependencies, and hierarchical policy decomposition that reduces computational complexity from O(mnT) to O(m+n+T). Evaluated on realistic simulators modeling airport gate assignment (Singapore Changi: 50 gates, 300–400 daily flights) and seaport berth allocation (Singapore Port: 40 berths, 80–120 daily vessels), DADRL achieves 87.3% resource utilization in airport operations and 86.3% in port operations, outperforming commercial solvers under strict real-time constraints (Gurobi-MIP with 300 s time limit: 85.1%) while operating 270 times faster (1.1 s versus 298 s per instance). Given unlimited time, Gurobi achieves provably optimal solutions, but DADRL reaches 98.7% of this optimum in 1.1 s, making it suitable for time-critical operational scenarios where exact solvers are computationally infeasible. Critically, policies trained exclusively on airport scenarios retain 92.4% performance when applied to ports without retraining, requiring only 800 adaptation steps compared to 13,200 for domain-specific training. The framework maintains 86.2% performance under operational disruptions and scales to problems three times larger than training instances with only 7% degradation. These results demonstrate that learned optimization principles can generalize across transportation scheduling problems sharing common constraint structures, enabling rapid deployment of AI-based scheduling systems across multi-modal transportation networks with minimal customization and reduced implementation costs. Full article
(This article belongs to the Special Issue Emerging Trends in Air Traffic Flow and Airport Operations Control)
Show Figures

Figure 1

25 pages, 4607 KB  
Article
CHARMS: A CNN-Transformer Hybrid with Attention Regularization for MRI Super-Resolution
by Xia Li, Haicheng Sun and Tie-Qiang Li
Sensors 2026, 26(2), 738; https://doi.org/10.3390/s26020738 - 22 Jan 2026
Viewed by 69
Abstract
Magnetic resonance imaging (MRI) super-resolution (SR) enables high-resolution reconstruction from low-resolution acquisitions, reducing scan time and easing hardware demands. However, most deep learning-based SR models are large and computationally heavy, limiting deployment in clinical workstations, real-time pipelines, and resource-restricted platforms such as low-field [...] Read more.
Magnetic resonance imaging (MRI) super-resolution (SR) enables high-resolution reconstruction from low-resolution acquisitions, reducing scan time and easing hardware demands. However, most deep learning-based SR models are large and computationally heavy, limiting deployment in clinical workstations, real-time pipelines, and resource-restricted platforms such as low-field and portable MRI. We introduce CHARMS, a lightweight convolutional–Transformer hybrid with attention regularization optimized for MRI SR. CHARMS employs a Reverse Residual Attention Fusion backbone for hierarchical local feature extraction, Pixel–Channel and Enhanced Spatial Attention for fine-grained feature calibration, and a Multi-Depthwise Dilated Transformer Attention block for efficient long-range dependency modeling. Novel attention regularization suppresses redundant activations, stabilizes training, and enhances generalization across contrasts and field strengths. Across IXI, Human Connectome Project Young Adult, and paired 3T/7T datasets, CHARMS (~1.9M parameters; ~30 GFLOPs for 256 × 256) surpasses leading lightweight and hybrid baselines (EDSR, PAN, W2AMSN-S, and FMEN) by 0.1–0.6 dB PSNR and up to 1% SSIM at ×2/×4 upscaling, while reducing inference time ~40%. Cross-field fine-tuning yields 7T-like reconstructions from 3T inputs with ~6 dB PSNR and 0.12 SSIM gains over native 3T. With near-real-time performance (~11 ms/slice, ~1.6–1.9 s per 3D volume on RTX 4090), CHARMS offers a compelling fidelity–efficiency balance for clinical workflows, accelerated protocols, and portable MRI. Full article
(This article belongs to the Special Issue Sensing Technologies in Digital Radiology and Image Analysis)
Show Figures

Figure 1

20 pages, 7566 KB  
Article
Temporal Probability-Guided Graph Topology Learning for Robust 3D Human Mesh Reconstruction
by Hongsheng Wang, Jie Yang, Feng Lin and Fei Wu
Mathematics 2026, 14(2), 367; https://doi.org/10.3390/math14020367 - 21 Jan 2026
Viewed by 138
Abstract
Reconstructing 3D human motion from monocular video presents challenges when frames contain occlusions or blur, as conventional approaches depend on features extracted within limited temporal windows, resulting in structural distortions. In this paper, we introduce a novel framework that combines temporal probability guidance [...] Read more.
Reconstructing 3D human motion from monocular video presents challenges when frames contain occlusions or blur, as conventional approaches depend on features extracted within limited temporal windows, resulting in structural distortions. In this paper, we introduce a novel framework that combines temporal probability guidance with graph topology learning to achieve robust 3D human mesh reconstruction from incomplete observations. Our method leverages topology-aware probability distributions spanning entire motion sequences to recover missing anatomical regions. The Graph Topological Modeling (GTM) component captures structural relationships among body parts by learning the inherent connectivity patterns in human anatomy. Building upon GTM, our Temporal-alignable Probability Distribution (TPDist) mechanism predicts missing features through probabilistic inference, establishing temporal coherence across frames. Additionally, we propose a Hierarchical Human Loss (HHLoss) that hierarchically regularizes probability distribution errors for inter-frame features while accounting for topological variations. Experimental validation demonstrates that our approach outperforms state-of-the-art methods on the 3DPW benchmark, particularly excelling in scenarios involving occlusions and motion blur. Full article
Show Figures

Figure 1

26 pages, 2177 KB  
Article
A Semantic Similarity Model for Geographic Terminologies Using Ontological Features and BP Neural Networks
by Zugang Chen, Xinyu Chen, Yin Ma, Jing Li, Linhan Yang, Guoqing Li, Hengliang Guo, Shuai Chen and Tian Liang
Appl. Sci. 2026, 16(2), 1105; https://doi.org/10.3390/app16021105 - 21 Jan 2026
Viewed by 95
Abstract
Accurate measurement of semantic similarity between geographic terms is a fundamental challenge in geographic information science, directly influencing tasks such as knowledge retrieval, ontology-based reasoning, and semantic search in geographic information systems (GIS). Traditional ontology-based approaches primarily rely on a narrow set of [...] Read more.
Accurate measurement of semantic similarity between geographic terms is a fundamental challenge in geographic information science, directly influencing tasks such as knowledge retrieval, ontology-based reasoning, and semantic search in geographic information systems (GIS). Traditional ontology-based approaches primarily rely on a narrow set of features (e.g., semantic distance or depth), which inadequately capture the multidimensional and context-dependent nature of geographic semantics. To address this limitation, this study proposes an ontology-driven semantic similarity model that integrates a backpropagation (BP) neural network with multiple ontological features—hierarchical depth, node distance, concept density, and relational overlap. The BP network serves as a nonlinear optimization mechanism that adaptively learns the contributions of each feature through cross-validation, balancing interpretability and precision. Experimental evaluations on the Geo-Terminology Relatedness Dataset (GTRD) demonstrate that the proposed model outperforms traditional baselines, including the Thesaurus–Lexical Relatedness Measure (TLRM), Word2Vec, and SBERT (Sentence-BERT), with Spearman correlation improvements of 4.2%, 74.8% and 80.1%, respectively. Additionally, comparisons with Linear Regression and Random Forest models, as well as bootstrap analysis and error analysis, confirm the robustness and generalization of the BP-based approach. These results confirm that coupling structured ontological knowledge with data-driven learning enhances robustness and generalization in semantic similarity computation, providing a unified framework for geographic knowledge reasoning, terminology harmonization, and ontology-based information retrieval. Full article
Show Figures

Figure 1

Back to TopTop