Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,380)

Search Parameters:
Keywords = hierarchical attention

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
31 pages, 3068 KB  
Article
CEH-DETR: A State Space-Based Framework for Efficient Multi-Scale Ship Detection
by Xiaolin Zhang, Ru Wang and Shengzheng Wang
J. Mar. Sci. Eng. 2026, 14(3), 279; https://doi.org/10.3390/jmse14030279 - 29 Jan 2026
Abstract
Ship detection in optical images is critical for maritime supervision but faces challenges from scale variations and complex backgrounds. Existing detectors often struggle to balance global context modeling with computational efficiency. To address this, we propose Contextual Efficient Hierarchical DETR (CEH-DETR), an efficient [...] Read more.
Ship detection in optical images is critical for maritime supervision but faces challenges from scale variations and complex backgrounds. Existing detectors often struggle to balance global context modeling with computational efficiency. To address this, we propose Contextual Efficient Hierarchical DETR (CEH-DETR), an efficient framework for multi-scale ship detection. First, we introduce the Cross-stage Parallel State Space Hidden Mixer (CPSHM) backbone, integrating State Space Models with CNNs to capture global dependencies with linear complexity. Second, the Efficient Adaptive Feature Integration (EAFI) module reduces attention complexity to linear using Token Statistics-based Attention. Third, the Hierarchical Attention-guided Feature Pyramid Network (HAFPN) effectively fuses multi-scale features while preserving spatial details. Experiments on the ABOships dataset demonstrate that CEH-DETR achieves a superior balance between accuracy and efficiency. Relative to the baseline RT-DETR, our approach achieves a parameter reduction of 25.6% while increasing mAP@50 by 2.0 percentage points and boosting inference speed to 133.7 FPS (+112.1%), making it highly suitable for real-time maritime surveillance. Full article
23 pages, 2605 KB  
Article
Depression Detection on Social Media Using Multi-Task Learning with BERT and Hierarchical Attention: A DSM-5-Guided Approach
by Haichao Jin and Lin Zhang
Electronics 2026, 15(3), 598; https://doi.org/10.3390/electronics15030598 - 29 Jan 2026
Abstract
Depression represents a major global health challenge, yet traditional clinical diagnosis faces limitations, including high costs, limited coverage, and low patient willingness. Social media platforms provide new opportunities for early depression screening through user-generated content. However, existing methods often lack systematic integration of [...] Read more.
Depression represents a major global health challenge, yet traditional clinical diagnosis faces limitations, including high costs, limited coverage, and low patient willingness. Social media platforms provide new opportunities for early depression screening through user-generated content. However, existing methods often lack systematic integration of clinical knowledge and fail to leverage multi-modal information comprehensively. We propose a DSM-5-guided methodology that systematically maps clinical diagnostic criteria to computable social media features across three modalities: textual semantics (BERT-based deep semantic extraction), behavioral patterns (temporal activity analysis), and topic distributions (LDA-based cognitive bias identification). We design a hierarchical architecture integrating BERT, Bi-LSTM, hierarchical attention, and multi-task learning to capture both character-level and post-level importance while jointly optimizing depression classification, symptom recognition, and severity assessment. Experiments on the WU3D dataset (32,570 users, 2.19 million posts) demonstrate that our model achieves 91.8% F1-score, significantly outperforming baseline methods (BERT: 85.6%, TextCNN: 78.6%, and SVM: 72.1%) and large language models (GPT-4 few-shot: 86.9%). Ablation studies confirm that each component contributes meaningfully with synergistic effects. The model provides interpretable predictions through attention visualization and outputs fine-grained symptom assessments aligned with DSM-5 criteria. With low computational cost (~50 ms inference time), local deployability, and superior privacy protection, our approach offers significant practical value for large-scale mental health screening applications. This work demonstrates that domain-specialized methods with explicit clinical knowledge integration remain highly competitive in the era of general-purpose large language models. Full article
Show Figures

Figure 1

21 pages, 3941 KB  
Article
Explainable Prediction of Crowdfunding Success Using Hierarchical Attention Network
by SeungHun Lee, Muneeb A. Khan and Hyun-chul Kim
Electronics 2026, 15(3), 570; https://doi.org/10.3390/electronics15030570 - 28 Jan 2026
Abstract
Crowdfunding has emerged as an alternative funding source among entrepreneurs, businesses, and industries. In recent years, research on machine learning-based project classification models has been conducted with the aim of predicting the success of crowdfunding campaigns, both for entrepreneurs and investors. However, most [...] Read more.
Crowdfunding has emerged as an alternative funding source among entrepreneurs, businesses, and industries. In recent years, research on machine learning-based project classification models has been conducted with the aim of predicting the success of crowdfunding campaigns, both for entrepreneurs and investors. However, most of the research has focused on classification approaches using non-content information such as project metadata, creators’ behavior, and social history, but there have been few attempts to use text content data per se, particularly in order to provide explanations and evidence for how the prediction decisions were made. To address this point, we propose to use a deep learning-based approach called Hierarchical Attention Network (HAN) to predict the success of crowdfunding campaigns and provide explanation and justification of the prediction decisions using attention weights. We collect publicly available data of crowdfunding campaigns and build our success prediction model with an accuracy of 86.38% and 87.29%, using an Updates section and backers’ comments in a Comments section, respectively. We also explore the feasibility of early success prediction during the funding period (up to 2 months), with as much as 80.99% accuracy in 1 to 2 months. Finally, we examine word and sentence attention weight scores to clarify key factors in predicting crowdfunding success. Full article
(This article belongs to the Special Issue Novel Approaches for Deep Learning in Cybersecurity)
24 pages, 622 KB  
Review
Current Status and Future Prospects of Research on Sepsis-Related Acute Kidney Injury
by Yurou Wang, Le Zong, Manli Zhu, Jie Li, Jiayi Xu, Hunian Li and Yan Li
Int. J. Mol. Sci. 2026, 27(3), 1315; https://doi.org/10.3390/ijms27031315 - 28 Jan 2026
Abstract
Sepsis is defined as life-threatening organ dysfunction caused by a dysregulated host response to infection. The kidney is among the organs most susceptible to sepsis-induced injury, and acute kidney injury frequently develops in this context, thereby markedly increasing mortality in affected patients. With [...] Read more.
Sepsis is defined as life-threatening organ dysfunction caused by a dysregulated host response to infection. The kidney is among the organs most susceptible to sepsis-induced injury, and acute kidney injury frequently develops in this context, thereby markedly increasing mortality in affected patients. With continued advances in research, a more comprehensive understanding has been achieved regarding the clinical risk factors, pathophysiological mechanisms, therapeutic responses, and renal recovery processes associated with sepsis-associated acute kidney injury (SA-AKI). These advances have strengthened the capacity for prevention, early detection, and effective management of SA-AKI. Despite this progress, substantial gaps remain in the overall understanding of SA-AKI pathogenesis, including the complex interplay among pathophysiological mechanisms and the extensive cross-regulation of multiple signaling pathways. Consequently, SA-AKI remains a major clinical challenge and imposes a substantial global healthcare burden. There is therefore an urgent need for further research to elucidate the underlying mechanisms of SA-AKI and to identify more effective therapeutic strategies. Unlike previous reviews that primarily focused on individual mechanisms or isolated therapeutic targets, the present review synthesizes the most recent evidence on SA-AKI. Particular emphasis is placed on its pathogenic processes, associated molecular mechanisms and signaling pathways, and emerging therapeutic targets. Special attention is given to the hierarchical relationships among distinct mechanisms during disease progression and their implications for clinical translation. This review aims to inform clinical practice and to identify future research directions, thereby providing valuable insights for both researchers and clinicians in this field. Full article
(This article belongs to the Section Molecular Pathology, Diagnostics, and Therapeutics)
Show Figures

Figure 1

28 pages, 29386 KB  
Article
Dual-Scale Pixel Aggregation Transformer for Change Detection in Multitemporal Remote Sensing Images
by Kai Zhang, Ziqing Wan, Xue Zhao, Feng Zhang, Ke Liu and Jiande Sun
Remote Sens. 2026, 18(3), 422; https://doi.org/10.3390/rs18030422 - 28 Jan 2026
Abstract
Transformers have recently been applied to change detection (CD) of multitemporal remote sensing images because of their ability to model global information. However, the rigid patch partitioning in vanilla self-attention destroys spatial structures and consistency in observed scenes, leading to limited CD performance. [...] Read more.
Transformers have recently been applied to change detection (CD) of multitemporal remote sensing images because of their ability to model global information. However, the rigid patch partitioning in vanilla self-attention destroys spatial structures and consistency in observed scenes, leading to limited CD performance. In this paper, we propose a novel dual-scale pixel aggregation transformer (DSPA-Former) to mitigate this issue. The core of DSPA-Former lies in a dynamic superpixel tokenization strategy and bidirectional dual-scale interaction within the learned feature space, which preserves semantic integrity while capturing long-range dependencies. Specifically, we design a hierarchical decoder that integrates multiscale features through specialized mechanisms for pixel superpixel dialogue, guided feature enhancement, and adaptive multiscale fusion. By modeling the homogeneous properties of spatial information via superpixel segmentation, DSPA-Former effectively maintains structural consistency and sharpens change boundaries. Comprehensive experiments on the LEVIR-CD, WHU-CD, and CLCD datasets demonstrate that DSPA-Former achieves superior performance compared to state-of-the-art methods, particularly in preserving the structural integrity of complex change regions. Full article
(This article belongs to the Section Remote Sensing Image Processing)
Show Figures

Figure 1

28 pages, 909 KB  
Article
Job Demands and Resources During Digital Transformation in Public Administration: A Qualitative Study
by Victoria Sump, Tanja Wirth, Volker Harth and Stefanie Mache
Behav. Sci. 2026, 16(2), 187; https://doi.org/10.3390/bs16020187 - 27 Jan 2026
Abstract
Digital transformation poses significant challenges to employee well-being, particularly in public administration, where hierarchical structures, increasing digitalization pressures, and high mental health-related absenteeism underscore the need to understand individual and job demands and resources. This study explores these aspects from the perspectives of [...] Read more.
Digital transformation poses significant challenges to employee well-being, particularly in public administration, where hierarchical structures, increasing digitalization pressures, and high mental health-related absenteeism underscore the need to understand individual and job demands and resources. This study explores these aspects from the perspectives of employees and supervisors in public administration. Between September 2023 and February 2024, semi-structured interviews were conducted with eight employees and eleven supervisors from public administration organizations in Northern Germany and analyzed using deductive–inductive qualitative content analysis based on the Job Demands-Resources model. Identified individual resources included technical affinity, error tolerance, and willingness to learn, while key job resources involved early and transparent communication, attentive leadership, technical support, and counseling services, with most job resources linked to leadership behavior and work organization. Reported job demands comprised insufficient participation, inadequate planning, and lengthy procedures, whereas personal demands included fears and concerns about upcoming changes and negative attitudes toward transformation. The variation in perceived demands and resources highlights the individuality of the employees’ experiences. The findings provide initial insights into factors influencing psychological well-being at work during digital transformation, emphasizing the importance of participatory communication, employee involvement, leadership awareness of stressors, and competence development. Future research should employ longitudinal and interventional designs to improve causal understanding and generalizability. Full article
Show Figures

Figure 1

27 pages, 3922 KB  
Article
Hierarchical Multiscale Fusion with Coordinate Attention for Lithologic Mapping from Remote Sensing
by Fuyuan Xie and Yongguo Yang
Remote Sens. 2026, 18(3), 413; https://doi.org/10.3390/rs18030413 - 26 Jan 2026
Viewed by 110
Abstract
Accurate lithologic maps derived from satellite imagery underpin structural interpretation, mineral exploration, and geohazard assessment. However, automated mapping in complex terranes remains challenging because spectrally similar units, narrow anisotropic bodies, and ambiguous contacts can degrade boundary fidelity. In this study, we propose SegNeXt-HFCA, [...] Read more.
Accurate lithologic maps derived from satellite imagery underpin structural interpretation, mineral exploration, and geohazard assessment. However, automated mapping in complex terranes remains challenging because spectrally similar units, narrow anisotropic bodies, and ambiguous contacts can degrade boundary fidelity. In this study, we propose SegNeXt-HFCA, a hierarchical multiscale fusion network with coordinate attention for lithologic segmentation from a Sentinel-2/DEM feature stack. The model builds on SegNeXt and introduces a hierarchical multiscale encoder with coordinate attention to jointly capture fine textures and scene-level structure. It further adopts a class-frequency-aware hybrid loss that combines boundary-weighted online hard-example mining cross-entropy with Lovász-Softmax to better handle long-tailed classes and ambiguous contacts. In addition, we employ a robust training and inference scheme, including entropy-guided patch sampling, exponential moving average of parameters, test-time augmentation, and a DenseCRF-based post-refinement. Two study areas in the Beishan orogen, northwestern China (Huitongshan and Xingxingxia), are used to evaluate the method with a unified 10-channel Sentinel-2/DEM feature stack. Compared with U-NetFormer, PSPNet, DeepLabV3+, DANet, LGMSFNet, SegFormer, BiSeNetV2, and the SegNeXt backbone, SegNeXt-HFCA improves mean intersection-over-union (mIoU) by about 3.8% in Huitongshan and 2.6% in Xingxingxia, respectively, and increases mean pixel accuracy by approximately 3–4%. Qualitative analyses show that the proposed framework better preserves thin-unit continuity, clarifies lithologic contacts, and reduces salt-and-pepper noise, yielding geologically more plausible maps. These results demonstrate that hierarchical multiscale fusion with coordinate attention, together with class- and boundary-aware optimization, provides a practical route to robust lithologic mapping in structurally complex regions. Full article
(This article belongs to the Section Remote Sensing for Geospatial Science)
Show Figures

Figure 1

18 pages, 1264 KB  
Article
Comprehensive Methodology for the Design of Fuel Cell Vehicles: A Layered Approach
by Swantje C. Konradt and Hermann S. Rottengruber
Energies 2026, 19(3), 629; https://doi.org/10.3390/en19030629 - 26 Jan 2026
Viewed by 124
Abstract
This paper presents a hierarchical model architecture for the analysis and optimization of Fuel Cell Electric Vehicles (FCEVs). The model encompasses the levels of cell, stack, and complete vehicle, which are interconnected through clearly defined transfer parameters. At the cell level, electrochemical and [...] Read more.
This paper presents a hierarchical model architecture for the analysis and optimization of Fuel Cell Electric Vehicles (FCEVs). The model encompasses the levels of cell, stack, and complete vehicle, which are interconnected through clearly defined transfer parameters. At the cell level, electrochemical and thermodynamic processes are mapped, the results of which are aggregated at the stack level into characteristic maps such as current–voltage curves and efficiency profiles. These maps serve as interfaces to the vehicle level, where the electric powertrain—comprising the fuel cell, energy storage, electric motor, and auxiliary consumers—is integrated. Special attention is given to the trade-off between the lifetime and dynamics of the fuel cell, which is methodically captured through variable parameter vectors. The transfer parameters enable consistent and scalable modelling that considers both detailed cell and stack information as well as vehicle-side requirements. On this basis, various vehicle configurations can be evaluated and optimized with regard to efficiency, lifetime, and drivability. Full article
(This article belongs to the Special Issue Advances in Fuel Cells: Materials, Technologies, and Applications)
Show Figures

Figure 1

23 pages, 3554 KB  
Article
Hybrid Mechanism–Data-Driven Modeling for Crystal Quality Prediction in Czochralski Process
by Duqiao Zhao, Junchao Ren, Xiaoyan Du, Yixin Wang and Dong Ding
Crystals 2026, 16(2), 86; https://doi.org/10.3390/cryst16020086 - 25 Jan 2026
Viewed by 123
Abstract
The V/G criterion is a critical indicator for monitoring dynamic changes during Czochralski silicon single crystal (Cz-SSC) growth. However, the inability to measure it in real time forces reliance on offline feedback for process regulation, leading to imprecise control and compromised crystal quality. [...] Read more.
The V/G criterion is a critical indicator for monitoring dynamic changes during Czochralski silicon single crystal (Cz-SSC) growth. However, the inability to measure it in real time forces reliance on offline feedback for process regulation, leading to imprecise control and compromised crystal quality. To overcome this limitation, this paper proposes a novel soft sensor modeling framework that integrates both mechanism-based knowledge and data-driven learning for the real-time prediction of the crystal quality parameter, specifically the V/G value (the ratio of growth rate to axial temperature gradient). The proposed approach constructs a hybrid prediction model by combining a data-driven sub-model with a physics-informed mechanism sub-model. The data-driven component is developed using an attention-based dynamic stacked enhanced autoencoder (AD-SEAE) network, where the SEAE structure introduces layer-wise reconstruction operations to mitigate information loss during hierarchical feature extraction. Furthermore, an attention mechanism is incorporated to dynamically weigh historical and current samples, thereby enhancing the temporal representation of process dynamics. In addition, a robust ensemble approach is achieved by fusing the outputs of two subsidiary models using an adaptive weighting strategy based on prediction accuracy, thereby enabling more reliable V/G predictions under varying operational conditions. Experimental validation using actual industrial Cz-SSC production data demonstrates that the proposed method achieves high-prediction accuracy and effectively supports real-time process optimization and quality monitoring. Full article
(This article belongs to the Section Industrial Crystallization)
Show Figures

Figure 1

16 pages, 1697 KB  
Article
MSHI-Mamba: A Multi-Stage Hierarchical Interaction Model for 3D Point Clouds Based on Mamba
by Zhiguo Zhou, Qian Wang and Xuehua Zhou
Appl. Sci. 2026, 16(3), 1189; https://doi.org/10.3390/app16031189 - 23 Jan 2026
Viewed by 111
Abstract
Mamba, based on the state space model (SSM), offers an efficient alternative to the quadratic complexity of attention, showing promise for long-sequence data processing and global modeling in 3D object detection. However, applying it to this domain presents specific challenges: traditional serialization methods [...] Read more.
Mamba, based on the state space model (SSM), offers an efficient alternative to the quadratic complexity of attention, showing promise for long-sequence data processing and global modeling in 3D object detection. However, applying it to this domain presents specific challenges: traditional serialization methods can compromise the spatial structure of 3D data, and the standard single-layer SSM design may limit cross-layer feature extraction. To address these issues, this paper proposes MSHI-Mamba, a Mamba-based multi-stage hierarchical interaction architecture for 3D backbone networks. We introduce a cross-layer complementary cross-attention module (C3AM) to mitigate feature redundancy in cross-layer encoding, as well as a bi-shift scanning strategy (BSS) that uses hybrid space-filling curves with shift scanning to better preserve spatial continuity and expand the receptive field during serialization. We also develop a voxel densifying downsampling module (VD-DS) to enhance local spatial information and foreground feature density. Experimental results obtained on the KITTI and nuScenes datasets demonstrate that our approach achieves competitive performance, with a 4.2% improvement in the mAP on KITTI, validating the effectiveness of the proposed components. Full article
Show Figures

Figure 1

27 pages, 1996 KB  
Article
Salient Object Detection for Optical Remote Sensing Images Based on Gated Differential Unit
by Mingsi Sun, Ting Lan, Wei Wang and Pingping Liu
Remote Sens. 2026, 18(3), 389; https://doi.org/10.3390/rs18030389 - 23 Jan 2026
Viewed by 127
Abstract
Salient object detection in optical remote sensing images has attracted extensive research interest in recent years. However, CNN-based methods are generally limited by local receptive fields, while ViT-based methods suffer from common defects in noise suppression, channel selection, foreground-background distinction, and detail enhancement. [...] Read more.
Salient object detection in optical remote sensing images has attracted extensive research interest in recent years. However, CNN-based methods are generally limited by local receptive fields, while ViT-based methods suffer from common defects in noise suppression, channel selection, foreground-background distinction, and detail enhancement. To address these issues and integrate long-distance contextual dependencies, we introduce GDUFormer, an ORSI-SOD detection method based on the ViT backbone and Gated Differential Units (GDU). Specifically, the GDU consists of two key components—Full-Dimensional Gated Attention (FGA) and Hierarchical Differential Dynamic Convolution (HDDC). FGA consists of two branches aimed at filtering effective features from the information flow. The first branch focuses on aggregating spatial local information under multiple receptive fields and filters the local feature maps via a grouping mechanism. The second branch imitates the Vision Mamba to acquire high-level reasoning and abstraction capabilities, enabling weak channel filtering. HDDC primarily utilizes distance decay and hierarchical intensity difference capture mechanisms to generate dynamic kernel spatial weights, thereby facilitating the convolution kernel to fully mix long-range contextual dependencies. Among these, the intensity difference capture mechanism can adaptively divide hierarchies and allocate parameters according to kernel size, thus realizing varying levels of difference capture in the kernel space. Extensive quantitative and qualitative experiments demonstrate the effectiveness and rationality of GDUFormer and its internal components. Full article
Show Figures

Figure 1

20 pages, 1369 KB  
Article
Symmetry-Aware Interpretable Anomaly Alarm Optimization Method for Power Monitoring Systems Based on Hierarchical Attention Deep Reinforcement Learning
by Zepeng Hou, Qiang Fu, Weixun Li, Yao Wang, Zhengkun Dong, Xianlin Ye, Xiaoyu Chen and Fangyu Zhang
Symmetry 2026, 18(2), 216; https://doi.org/10.3390/sym18020216 - 23 Jan 2026
Viewed by 241
Abstract
With the rapid advancement of smart grids driven by renewable energy integration and the extensive deployment of supervisory control and data acquisition (SCADA) and phasor measurement units (PMUs), addressing the escalating alarm flooding via intelligent analysis of large-scale alarm data is pivotal to [...] Read more.
With the rapid advancement of smart grids driven by renewable energy integration and the extensive deployment of supervisory control and data acquisition (SCADA) and phasor measurement units (PMUs), addressing the escalating alarm flooding via intelligent analysis of large-scale alarm data is pivotal to safeguarding the safe and stable operation of power grids. To tackle these challenges, this study introduces a pioneering alarm optimization framework based on symmetry-driven crowdsourced active learning and interpretable deep reinforcement learning (DRL). Firstly, an anomaly alarm annotation method integrating differentiated crowdsourcing and active learning is proposed to mitigate the inherent asymmetry in data distribution. Secondly, a symmetrically structured DRL-based hierarchical attention deep Q-network is designed with a dual-path encoder to balance the processing of multi-scale alarm features. Finally, a SHAP-driven interpretability framework is established, providing global and local attribution to enhance decision transparency. Experimental results on a real-world power alarm dataset demonstrate that the proposed method achieves a Fleiss’ Kappa of 0.82 in annotation consistency and an F1-Score of 0.95 in detection performance, significantly outperforming state-of-the-art baselines. Additionally, the false positive rate is reduced to 0.04, verifying the framework’s effectiveness in suppressing alarm flooding while maintaining high recall. Full article
(This article belongs to the Special Issue Symmetry and Asymmetry in Data Analysis)
Show Figures

Figure 1

24 pages, 10940 KB  
Article
A Few-Shot Object Detection Framework for Remote Sensing Images Based on Adaptive Decision Boundary and Multi-Scale Feature Enhancement
by Lijiale Yang, Bangjie Li, Dongdong Guan and Deliang Xiang
Remote Sens. 2026, 18(3), 388; https://doi.org/10.3390/rs18030388 - 23 Jan 2026
Viewed by 181
Abstract
Given the high cost of acquiring large-scale annotated datasets, few-shot object detection (FSOD) has emerged as an increasingly important research direction. However, existing FSOD methods face two critical challenges in remote sensing images (RSIs): (1) features of small targets within remote sensing images [...] Read more.
Given the high cost of acquiring large-scale annotated datasets, few-shot object detection (FSOD) has emerged as an increasingly important research direction. However, existing FSOD methods face two critical challenges in remote sensing images (RSIs): (1) features of small targets within remote sensing images are incompletely represented due to extremely small-scale and cluttered backgrounds, which weakens discriminability and leads to significant detection degradation; (2) unified classification boundaries fail to handle the distinct confidence distributions between well-sampled base classes and sparsely sampled novel classes, leading to ineffective knowledge transfer. To address these issues, we propose TS-FSOD, a Transfer-Stable FSOD framework with two key innovations. First, the proposed detector integrates a Feature Enhancement Module (FEM) leveraging hierarchical attention mechanisms to alleviate small target feature attenuation, and an Adaptive Fusion Unit (AFU) utilizing spatial-channel selection to strengthen target feature representations while mitigating background interference. Second, Dynamic Temperature-scaling Learnable Classifier (DTLC) employs separate learnable temperature parameters for base and novel classes, combined with difficulty-aware weighting and dynamic adjustment, to adaptively calibrate decision boundaries for stable knowledge transfer. Experiments on DIOR and NWPU VHR-10 datasets show that TS-FSOD achieves competitive or superior performance compared to state-of-the-art methods, with improvements up to 4.30% mAP, particularly excelling in 3-shot and 5-shot scenarios. Full article
Show Figures

Figure 1

20 pages, 17064 KB  
Article
PriorSAM-DBNet: A SAM-Prior-Enhanced Dual-Branch Network for Efficient Semantic Segmentation of High-Resolution Remote Sensing Images
by Qiwei Zhang, Yisong Wang, Ning Li, Quanwen Jiang and Yong He
Sensors 2026, 26(2), 749; https://doi.org/10.3390/s26020749 - 22 Jan 2026
Viewed by 109
Abstract
Semantic segmentation of high-resolution remote sensing imagery is a critical technology for the intelligent interpretation of sensor data, supporting automated environmental monitoring and urban sensing systems. However, processing data from dense urban scenarios remains challenging due to sensor signal occlusions (e.g., shadows) and [...] Read more.
Semantic segmentation of high-resolution remote sensing imagery is a critical technology for the intelligent interpretation of sensor data, supporting automated environmental monitoring and urban sensing systems. However, processing data from dense urban scenarios remains challenging due to sensor signal occlusions (e.g., shadows) and the complexity of parsing multi-scale targets from optical sensors. Existing approaches often exhibit a trade-off between the accuracy of global semantic modeling and the precision of complex boundary recognition. While the Segment Anything Model (SAM) offers powerful zero-shot structural priors, its direct application to remote sensing is hindered by domain gaps and the lack of inherent semantic categorization. To address these limitations, we propose a dual-branch cooperative network, PriorSAM-DBNet. The main branch employs a Densely Connected Swin (DC-Swin) Transformer to capture cross-scale global features via a hierarchical shifted window attention mechanism. The auxiliary branch leverages SAM’s zero-shot capability to exploit structural universality, generating object-boundary masks as robust signal priors while bypassing semantic domain shifts. Crucially, we introduce a parameter-efficient Scaled Subsampling Projection (SSP) module that employs a weight-sharing mechanism to align cross-modal features, freezing the massive SAM backbone to ensure computational viability for practical sensor applications. Furthermore, a novel Attentive Cross-Modal Fusion (ACMF) module is designed to dynamically resolve semantic ambiguities by calibrating the global context with local structural priors. Extensive experiments on the ISPRS Vaihingen, Potsdam, and LoveDA-Urban datasets demonstrate that PriorSAM-DBNet outperforms state-of-the-art approaches. By fine-tuning only 0.91 million parameters in the auxiliary branch, our method achieves mIoU scores of 82.50%, 85.59%, and 53.36%, respectively. The proposed framework offers a scalable, high-precision solution for remote sensing semantic segmentation, particularly effective for disaster emergency response where rapid feature recognition from sensor streams is paramount. Full article
Show Figures

Figure 1

45 pages, 1326 KB  
Article
Cross-Domain Deep Reinforcement Learning for Real-Time Resource Allocation in Transportation Hubs: From Airport Gates to Seaport Berths
by Zihao Zhang, Qingwei Zhong, Weijun Pan, Yi Ai and Qian Wang
Aerospace 2026, 13(1), 108; https://doi.org/10.3390/aerospace13010108 - 22 Jan 2026
Viewed by 67
Abstract
Efficient resource allocation is critical for transportation hub operations, yet current scheduling systems require substantial domain-specific customization when deployed across different facilities. This paper presents a domain-adaptive deep reinforcement learning (DADRL) framework that learns transferable optimization policies for dynamic resource allocation across structurally [...] Read more.
Efficient resource allocation is critical for transportation hub operations, yet current scheduling systems require substantial domain-specific customization when deployed across different facilities. This paper presents a domain-adaptive deep reinforcement learning (DADRL) framework that learns transferable optimization policies for dynamic resource allocation across structurally similar transportation scheduling problems. The framework integrates dual-level heterogeneous graph attention networks for separating constraint topology from domain-specific features, hypergraph-based constraint modeling for capturing high-order dependencies, and hierarchical policy decomposition that reduces computational complexity from O(mnT) to O(m+n+T). Evaluated on realistic simulators modeling airport gate assignment (Singapore Changi: 50 gates, 300–400 daily flights) and seaport berth allocation (Singapore Port: 40 berths, 80–120 daily vessels), DADRL achieves 87.3% resource utilization in airport operations and 86.3% in port operations, outperforming commercial solvers under strict real-time constraints (Gurobi-MIP with 300 s time limit: 85.1%) while operating 270 times faster (1.1 s versus 298 s per instance). Given unlimited time, Gurobi achieves provably optimal solutions, but DADRL reaches 98.7% of this optimum in 1.1 s, making it suitable for time-critical operational scenarios where exact solvers are computationally infeasible. Critically, policies trained exclusively on airport scenarios retain 92.4% performance when applied to ports without retraining, requiring only 800 adaptation steps compared to 13,200 for domain-specific training. The framework maintains 86.2% performance under operational disruptions and scales to problems three times larger than training instances with only 7% degradation. These results demonstrate that learned optimization principles can generalize across transportation scheduling problems sharing common constraint structures, enabling rapid deployment of AI-based scheduling systems across multi-modal transportation networks with minimal customization and reduced implementation costs. Full article
(This article belongs to the Special Issue Emerging Trends in Air Traffic Flow and Airport Operations Control)
Show Figures

Figure 1

Back to TopTop