MDPI - Publisher of Open Access Journals

23 pages, 865 KB

Open AccessArticle

A Novel Genetic Algorithm for the Dual-Resource Flexible Job Shop Scheduling Problem with Partial Resource Allocation

by Diogo Marta, Bernardo Firme, Miguel S. E. Martins, João M. C. Sousa and Susana M. Vieira

Automation 2026, 7(3), 99; https://doi.org/10.3390/automation7030099 (registering DOI) - 20 Jun 2026

This paper proposes a genetic algorithm (GA) for the Dual-Resource Flexible Job Shop Scheduling Problem with Partial Resource Allocation (DRFJSSP-PRA), a particular variant of a dual-resource constrained scheduling problem that has not been fully explored due to its intricate nature. The DRFJSSP-PRA poses [...] Read more.

This paper proposes a genetic algorithm (GA) for the Dual-Resource Flexible Job Shop Scheduling Problem with Partial Resource Allocation (DRFJSSP-PRA), a particular variant of a dual-resource constrained scheduling problem that has not been fully explored due to its intricate nature. The DRFJSSP-PRA poses a challenging scheduling problem, having several applications in many industries, including food, chemistry and pharmaceutics. The proposed algorithm is applied to real-world scheduling instances in pharmaceutical quality control. The objective function considered is the total completion time. The GA is compared with three state-of-the-art algorithms. For small- and medium-size instances, the proposed algorithm achieves optimal or near optimal results for the majority of the instances tested. For large-sized instances, the proposed GA outperforms all the other algorithms, in all of the tested instances. Thus, the experimental results show that the proposed GA achieves competitive results for any type of instance. The proposed algorithm also has the ability to optimize production processes through scheduling, leading to potential cost savings, increased efficiency, and improved competitiveness. Full article

(This article belongs to the Section Intelligent Control and Machine Learning)

► Show Figures

Figure 1

16 pages, 4252 KB

Open AccessArticle

Acquired HIV–1 Drug Resistance and Molecular Transmission Networks in Zhongwei, Ningxia, China

by Youping Duan, Subinuer Mutalifu, Ziyang Luo, Yufeng Li, Xiaohong Zhu, Jianxin Pei, Dongzhi Yang and Zhonglan Wu

Viruses 2026, 18(6), 685; https://doi.org/10.3390/v18060685 (registering DOI) - 18 Jun 2026

Viewed by 55

Abstract

Objective: This retrospective cross–sectional study aimed to characterize HIV–1 genotypes, assess drug resistance, and analyze molecular transmission networks in Zhongwei City to inform prevention strategies. Methods: Plasma samples were collected from antiretroviral therapy (ART)–treated patients (2007–2024) with [...] Read more.

Objective: This retrospective cross–sectional study aimed to characterize HIV–1 genotypes, assess drug resistance, and analyze molecular transmission networks in Zhongwei City to inform prevention strategies. Methods: Plasma samples were collected from antiretroviral therapy (ART)–treated patients (2007–2024) with viral load ≥200 copies/mL. HIV–1 pol was amplified by nested PCR; successful sequences were genotyped by maximum likelihood (ML) (IQ–TREE, TVM+F+I+G4, 1000 bootstrap). Drug resistance (DR) was interpreted using Stanford HIV Drug Resistance Database (HIVDB) v9.0; detected mutations represent acquired drug resistance (ADR). Pairwise genetic distances (GD) (TN93 model) were calculated; transmission networks were constructed in Cytoscape 3.10.3. Results: 75 sequences were obtained. Males (84.00%), and heterosexual transmission (64.00%) predominated. CRF07_BC (46.67%) and CRF01_AE (38.67%) were the major subtypes; the overall ADR rate was 40.00%, mainly NNRTIs–associated (30.67% of all participants, including 16.00% single–class NNRTIs and 14.67% dual–class NRTIs–NNRTIs). Network inclusion rate was 40.00% of the 75 sequences; CRF07_BC showed higher betweenness centrality (p = 0.028), while CRF01_AE and CRF85_BC showed higher closeness centrality (p < 0.001). Occupation significantly affected network enrollment (p ≤ 0.05). Conclusion: HIV–1 subtypes are diverse with high ADR. CRF07_BC may act as a transmission bridge, whereas CRF01_AE and CRF85_BC exhibit faster potential spread. Baseline DR testing and network–guided interventions are recommended. Full article

(This article belongs to the Section Human Virology and Viral Diseases)

27 pages, 15972 KB

Open AccessArticle

A Dual-Branch Detector Based on the Multi-Granularity Dynamic Selection Mechanism for Remote Sensing Incremental Detection

by Shixi Li, Weiji Wang, Yousheng Xu, Wei Yao and Shengzhou Xu

Remote Sens. 2026, 18(12), 2032; https://doi.org/10.3390/rs18122032 - 18 Jun 2026

Viewed by 64

Abstract

In practical remote sensing object detection tasks, the application of deep learning approaches often takes the form of incremental learning: when the application includes new target types that were not encountered during training, a pre-trained model must acquire new knowledge without suffering catastrophic [...] Read more.

In practical remote sensing object detection tasks, the application of deep learning approaches often takes the form of incremental learning: when the application includes new target types that were not encountered during training, a pre-trained model must acquire new knowledge without suffering catastrophic forgetting. Among the various techniques proposed, knowledge distillation (KD)-based regularization has proven to be one of the most effective methods. Current KD-based approaches primarily focus on addressing inter-task confusion and optimizing feature selection during distillation processes. In this paper, we propose a dual-branch detector-independent learning framework and a multi-granularity dynamic selection strategy. The former decouples detection tasks for old and new classes to mitigate inter-class confusion, while the latter is a novel, exquisitely designed distillation mechanism that ensures precise transfer of critical old-class information. Moreover, we apply a DIST loss that aligns both inter-class and intra-class relations, further enhancing the fidelity of old-class knowledge transfer. Experiments on the DIOR and DOTA datasets demonstrate that our method significantly outperforms state-of-the-art incremental-learning approaches for remote-sensing object detection and exhibits good robustness under different remote-sensing scenarios. Full article

(This article belongs to the Special Issue Object Detection in Remote Sensing Images Based on Artificial Intelligence)

► Show Figures

Figure 1

25 pages, 5791 KB

Open AccessArticle

MSS-MambaNet: A Mamba Framework for Building Extraction from Multi-Phase Disaster Imagery

by Xin Liang, Huijiao Qiao, Yanda Chen and Jin Zhang

Sensors 2026, 26(12), 3868; https://doi.org/10.3390/s26123868 (registering DOI) - 17 Jun 2026

Viewed by 332

Abstract

Building extraction from disaster scenes is critical for emergency response and post-disaster assessment. Unlike conventional static remote sensing imagery, multi-phase disaster imagery contains scenes spanning early, middle, and late disaster stages, where building morphology, class distribution, and boundary characteristics exhibit significant cross-phase heterogeneity. [...] Read more.

Building extraction from disaster scenes is critical for emergency response and post-disaster assessment. Unlike conventional static remote sensing imagery, multi-phase disaster imagery contains scenes spanning early, middle, and late disaster stages, where building morphology, class distribution, and boundary characteristics exhibit significant cross-phase heterogeneity. Such phase-dependent variations substantially increase the difficulty of stable semantic segmentation, particularly under complex damage conditions. To address these challenges, we propose MSS-MambaNet for building extraction from multi-phase disaster imagery. A multi-scale architecture is designed to overcome the limitations of single-scale scanning in Mamba, enabling more effective perception of diverse building morphologies. To enhance feature discrimination, a Dual-Domain Cross-Gated Fusion (DDCGF) module is introduced through complementary interactions between spatial and frequency-domain representations. In addition, a Pixel-Aware Dynamic Weighting (PADW) strategy is developed to adaptively emphasize imbalanced foreground pixels and ambiguous boundary regions, thereby improving segmentation consistency under complex disaster conditions. Extensive experiments demonstrate that MSS-MambaNet consistently outperforms state-of-the-art methods, achieving an average mIoU of 92.78% and mF1 of 96.25% with only 12.37 M parameters. These results indicate that the proposed method effectively handles the heterogeneity of multi-phase data, providing a stable and efficient solution for building extraction from multi-phase disaster imagery. Full article

(This article belongs to the Special Issue Sensor Based Computer Vision for Safety and Health Monitoring in Disaster Risk Management)

► Show Figures

Figure 1

25 pages, 28692 KB

Open AccessArticle

Semi-Supervised Degradation-Aware Learning for All-in-One Weather-Degraded Image Restoration

by Lei Cai, Fang Ruan, Wei Lu, Qi Lin, Huijie Zheng, Wenjie Xiang and Tao Zhu

Electronics 2026, 15(12), 2686; https://doi.org/10.3390/electronics15122686 - 17 Jun 2026

Viewed by 71

Abstract

All-in-one weather-degraded image restoration aims to restore clean images from diverse weather-degraded observations (such as rain, haze, and snow) using a unified model. However, this topic remains challenging due to its ill-posed nature and the scarcity of large-scale paired training data. This article [...] Read more.

All-in-one weather-degraded image restoration aims to restore clean images from diverse weather-degraded observations (such as rain, haze, and snow) using a unified model. However, this topic remains challenging due to its ill-posed nature and the scarcity of large-scale paired training data. This article develops a novel semi-supervised learning framework, termed Semi-Supervised Degradation-Aware Learning (S²DAL), to adjust the feature space to align with the unified parameter space for all-in-one adverse weather removal. Specifically, the proposed S²DAL consists of two backbone networks: a Degradation-guided Histogram Transformer (DHformer) for weather-degraded image restoration and a Degradation-guided Convolutional Neural Network (DCNN) for degradation generation. A key component, the Degradation-guided Histogram Transformer (DHT) block, is designed to effectively capture intrinsic image features while suppressing diverse degradation interference through channel shuffling modulation, dynamic-range histogram self-attention, and dual-scale gated feed forward. Furthermore, a Monte Carlo-based Expectation-Maximization (EM) algorithm is introduced to jointly optimize latent variables and network parameters under both labeled and unlabeled data. Extensive quantitative and qualitative results on synthetic and real-world datasets consistently demonstrate that the proposed S²DAL achieves superior restoration performance compared to multiple state-of-the-art fully supervised and semi-supervised approaches. Full article

(This article belongs to the Topic Computer Vision and Image Processing, 3rd Edition)

► Show Figures

Figure 1

21 pages, 2782 KB

Open AccessArticle

LDST-ChangeNet: Lightweight Remote Sensing Change Detection Model Based on Dual Spatio-Temporal Attention and Multi-Scale Decoding

by Shuang Li, Shoubin Wang, Pengcheng Gao, Guili Peng and Zhen Huang

Remote Sens. 2026, 18(12), 2020; https://doi.org/10.3390/rs18122020 - 17 Jun 2026

Viewed by 159

Abstract

Remote sensing image change detection is widely used in urban expansion analysis, land-use monitoring, and disaster assessment. Nevertheless, it still faces significant challenges due to pseudo-change interference in high-resolution imagery, the large-scale variation in small changed objects, and the need for lightweight models [...] Read more.

Remote sensing image change detection is widely used in urban expansion analysis, land-use monitoring, and disaster assessment. Nevertheless, it still faces significant challenges due to pseudo-change interference in high-resolution imagery, the large-scale variation in small changed objects, and the need for lightweight models in real-world engineering applications. To address these issues, this paper proposes LDST-ChangeNet, a lightweight dual spatiotemporal attention network for change detection. The network adopts a Siamese EfficientNet-B1 as its dual-branch encoder and employs a differential bi-temporal feature fusion strategy (Diff) to explicitly model temporal discrepancies, enabling efficient feature extraction while significantly reducing model complexity. A Position Attention Module (PAM) is introduced at the encoder bottleneck to suppress pseudo changes caused by non-structural factors. Meanwhile, a lightweight Pyramid Pooling Module (PPM-lite) is incorporated at the entrance of the deepest decoder features to enhance multi-scale contextual representation. Furthermore, a Boundary Attention Module (BAM) is applied in the decoder output stage to improve boundary delineation and small-object change detection. Experimental results on the LEVIR-CD and WHU-CD datasets show that LDST-ChangeNet outperforms other state-of-the-art methods, achieving F1-scores of 90.67% and 91.08%, respectively. The model maintains a lightweight design, requiring only 11.72 M parameters and 10.03 GFLOPs on LEVIR-CD, and 11.77 M parameters and 9.12 GFLOPs on WHU-CD. Full article

► Show Figures

Figure 1

16 pages, 3093 KB

Open AccessArticle

LapDINO: A DINOv3 and Laplacian Pyramid-Based Approach for Outdoor Terrain Segmentation

by Shiquan Ling, Xingchen Qin, Wenkang Xu, Mingmin Fu, Hao Huang, Shijie Ma and Zhenyu Liu

Sensors 2026, 26(12), 3843; https://doi.org/10.3390/s26123843 - 17 Jun 2026

Viewed by 119

Abstract

As autonomous driving technology expands from structured urban roads to unstructured outdoor environments, precise understanding of complex terrain has become a critical requirement for ensuring safe vehicle navigation. However, outdoor environments are characterized by high dynamics, drastic illumination variations, ambiguous category boundaries, and [...] Read more.

As autonomous driving technology expands from structured urban roads to unstructured outdoor environments, precise understanding of complex terrain has become a critical requirement for ensuring safe vehicle navigation. However, outdoor environments are characterized by high dynamics, drastic illumination variations, ambiguous category boundaries, and prohibitive annotation costs, making traditional supervised learning methods that rely on large amounts of pixel-level annotations difficult to generalize. In this paper, we propose a novel dual-path bidirectional interactive encoder, termed LapDINO, that effectively combines the strong semantic generalization capability of the self-supervised foundation model DINOv3 with the multi-scale frequency analysis capacity of the Laplacian pyramid. Specifically, we leverage DINOv3 to extract global semantic features as a “semantic map”, while simultaneously obtaining multi-scale high-frequency details through Laplacian pyramid decomposition as “structural contours”. Building upon this, we design a bidirectional cross-attention fusion mechanism that enables dynamic interaction and mutual refinement between semantic information and geometric details. Furthermore, we introduce a multi-branch attention enhancement module that extracts pyramid features from three complementary perspectives. To address domain shift, we design lightweight visual adapters that enable efficient fine-tuning of the frozen DINOv3 backbone. Finally, we construct two off-road terrain segmentation datasets, VOTD and VOCD, to facilitate research in this domain. Experimental results demonstrate that the proposed method achieves state-of-the-art performance, striking an optimal balance between accuracy and computational efficiency, thereby providing a robust and efficient engineering solution for terrain perception in off-road environments. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

31 pages, 1493 KB

Open AccessArticle

Asset-Aware and Resilient Trust Management Framework for Industrial IoT Edge Networks

by Yufei Wang, Huanhuan Gu and Qian Ye

Sensors 2026, 26(12), 3808; https://doi.org/10.3390/s26123808 - 15 Jun 2026

Viewed by 200

Abstract

Trust evaluation in Industrial Internet of Things (IIoT) edge networks must account for both device behavior and the operational importance of industrial assets. Existing models often apply uniform scoring rules, which may limit their response to semantic attacks and whitewashing behavior while increasing [...] Read more.

Trust evaluation in Industrial Internet of Things (IIoT) edge networks must account for both device behavior and the operational importance of industrial assets. Existing models often apply uniform scoring rules, which may limit their response to semantic attacks and whitewashing behavior while increasing the processing burden on edge devices. This paper presents an Asset-Aware Resilient Trust (ART) framework. ART separates dynamic behavioral credibility from physical asset criticality through a dual-plane architecture. Cross-layer evidence is collected from communication, identity, physical, and semantic interactions. A Fuzzy Triggered-Entropy Weight Method (Fuzzy T-EWM) recalculates evidence weights only when the observed fluctuation exceeds a preset threshold. Trust scores are updated using a Fast-Drop Slow-Rise rule, together with a tolerance margin for routine network jitter. The simulation results show that ART detects stealthy False Data Injection attacks, limits trust recovery after whitewashing behavior, and reduces accumulated computational overhead by 76.4% compared with the Standard EWM baseline. The credibility-weighted aggregation mechanism also limits collusive recommendation manipulation during cold-start evaluation. These results support differentiated trust regulation for IIoT edge networks. Full article

(This article belongs to the Special Issue Sensor-Based IoT Systems: Edge Intelligence, Trust Management, and Applications)

15 pages, 2984 KB

Open AccessArticle

GG-YOLO: A Lightweight Dual-Path Attention Detector with Dynamic Sampling for Dense Wheat Spike Detection

by Guohong Gao, Fucheng Zhou, Lijun Xu, Jiaxin Zhang and Xueyong Li

Agronomy 2026, 16(12), 1156; https://doi.org/10.3390/agronomy16121156 - 12 Jun 2026

Viewed by 193

Abstract

Accurate wheat spike detection is essential for crop phenotyping and yield estimation, but real-world field conditions—such as dense spike overlap, environmental domain shifts, and degradation-induced failures like motion blur—pose significant challenges. Achieving robust perception under these circumstances while maintaining a strict accuracy-efficiency trade-off [...] Read more.

Accurate wheat spike detection is essential for crop phenotyping and yield estimation, but real-world field conditions—such as dense spike overlap, environmental domain shifts, and degradation-induced failures like motion blur—pose significant challenges. Achieving robust perception under these circumstances while maintaining a strict accuracy-efficiency trade-off for edge devices remains a pressing research problem. To overcome these limitations, we propose GG-YOLO, a unified lightweight detection framework specifically tailored for complex agricultural environments. Rather than a simple recombination of existing lightweight modules, GG-YOLO integrates three original structural adaptations: First, a Dual-path Attentive Ghost Mechanism (DAGM) introduces gradient-guided attention modulation to enhance feature discrimination and explicitly resolve feature confusion in dense, overlapping regions. Second, a C3Ghost module combines multi-branch aggregation with linear feature generation, mitigating parameter redundancy in the prediction head by approximately 31% compared to the standard YOLOv8s without sacrificing semantic capacity. Third, DSample, a dynamic upsampling operator featuring an original dual-mode adaptive mechanism, robustly recovers fine-grained spatial details during multi-scale feature pyramid fusion. Extensive cross-dataset experiments on the GlobalWheat2020 and HNKJXYwheat datasets validate the model’s exceptional resilience to domain shifts and varying growth stages. GG-YOLO achieves a precision of 94.35%, a recall of 91.93%, and a state-of-the-art mAP@50 of 96.47%. Furthermore, the model contains only 7.89 M parameters and requires 20.4 GFLOPs, reaching an inference speed of 165 FPS on a desktop GPU and a validated real-time speed of 64 FPS on an NVIDIA Jetson edge computing platform. These results demonstrate that GG-YOLO establishes a superior accuracy-efficiency frontier, making it highly reliable for real-time field deployment in precision agriculture. Full article

(This article belongs to the Section Precision and Digital Agriculture)

► Show Figures

Figure 1

26 pages, 2009 KB

Open AccessArticle

A Dual-Stage Multimodal Alignment Approach for Robust Breast Cancer Diagnosis via Visual–Textual Computing

by Ramazan Ozgur Dogan

Appl. Sci. 2026, 16(12), 5934; https://doi.org/10.3390/app16125934 - 11 Jun 2026

Viewed by 169

Abstract

Manual classification of breast cancer is resource-intensive, slow, and subject to inter-observer variability, motivating automated deep learning solutions. Most current methods rely on unimodal imaging data and struggle with domain generalization (DG) across varied clinical environments. We propose a Dual-Stage Multimodal Alignment approach [...] Read more.

Manual classification of breast cancer is resource-intensive, slow, and subject to inter-observer variability, motivating automated deep learning solutions. Most current methods rely on unimodal imaging data and struggle with domain generalization (DG) across varied clinical environments. We propose a Dual-Stage Multimodal Alignment approach that integrates breast ultrasound (US) imagery with clinical text reports to improve diagnostic stability. The method proceeds in two stages: (1) Local Correlation Alignment (LCA), which aligns fine-grained visual features with textual embeddings to capture localized lesion attributes, and (2) Global Attention Alignment (GAA), which applies multi-head self-attention to the joint visual–textual sequence to encourage domain-invariant representations. We evaluate the approach on a harmonized, leakage-free repository of 6880 images aggregated from six public US datasets (BUS-CoT, BrEaST, BUS-BRA, BUS-UCLM, BLUI, BUSI) under three protocols: independent benchmarking on BUS-CoT, pooled cross-dataset evaluation, and zero-shot domain generalization on unseen unimodal target domains. On the BUS-CoT benchmark, the 198M-parameter model reaches 0.8177 accuracy and 0.8852 AUC, on par with the 7-billion-parameter Qwen2.5-VL-7B with chain-of-thought reasoning (0.8064 accuracy, 0.8354 AUC) while using roughly 1/35 the parameter count. In the pooled setting, it is competitive with single-domain state-of-the-art methods on individual subsets (e.g., 0.9576 AUC on BUSI, 0.8741 accuracy on BUS-BRA). Under zero-shot transfer without clinical text, per-domain AUC ranges from 0.7360 to 0.8060 across four unseen targets, providing a lower bound under cross-scanner shift. These results indicate that task-specific multimodal alignment can rival large vision-language models in breast US diagnosis at a fraction of the parameter count. Full article

► Show Figures

Figure 1

23 pages, 15033 KB

Open AccessArticle

Lightweight Representation of Motion-Magnified Facial Dynamics for Micro Expression Sensing

by Seungho Lee and Sangkon Lee

Sensors 2026, 26(12), 3727; https://doi.org/10.3390/s26123727 - 11 Jun 2026

Viewed by 248

Abstract

Reliable monitoring of spontaneous affect is essential in biomedical sensing, where involuntary facial signals serve as objective indicators of physiological states. Micro expression recognition (MER) is particularly challenging due to the sub-second, low amplitude nature of these signals. Many existing MER methods rely [...] Read more.

Reliable monitoring of spontaneous affect is essential in biomedical sensing, where involuntary facial signals serve as objective indicators of physiological states. Micro expression recognition (MER) is particularly challenging due to the sub-second, low amplitude nature of these signals. Many existing MER methods rely on apex (peak) frame detection, making them sensitive to temporal localization errors and difficult to deploy in unconstrained settings. To address this, we propose an apex-free framework that analyzes facial dynamics by structuring motion-magnified features along a newly introduced magnification intensity axis. By applying Eulerian motion magnification across multiple discrete levels and collapsing the sequences into single accumulation images, we generate a multi-level representation of subtle facial dynamics without requiring frame-level annotations. A lightweight shared temporal mixer (STM) is employed to analyze the dynamic evolution of motion across the magnification intensity axis. Subsequently, a dual-branch convolutional neural network (CNN), processing low- and high-amplification regimes respectively, integrates a convolutional block attention module (CBAM) to capture subtle facial motion while effectively filtering out irrelevant noise. Our model is highly efficient, requiring only 0.94 M parameters and 262 MFLOPs, which is significantly lower than the computational demands of standard backbones such as ResNet18 or VGG16. To ensure the model generalizes to new individuals, we evaluated it by testing on subjects whose data was entirely excluded from the training process. Under this rigorous setup, the proposed method achieves approximately 80% and 70% accuracy on the CASME II and SMIC datasets respectively, showing performance comparable to, or in some cases, slightly above current state-of-the-art methods. Considering both the competitive accuracy and high computational efficiency, the proposed framework holds significant potential for practical integration into real-time affect monitoring systems, particularly within biomedical applications. Full article

(This article belongs to the Special Issue Sensing Signals for Biomedical Monitoring—2nd Edition)

► Show Figures

Figure 1

26 pages, 2291 KB

Open AccessArticle

VI-MSFFN: A Visible-Infrared Multi-Scale Feature Fusion Network for Cross-Modal Detection in Remote Sensing

by Yurong Yue, Weiwei Qin, Hao Chi, Baiwei An, Dingyi Wu, Wenxin Guo and Jingyi Xiong

Remote Sens. 2026, 18(12), 1938; https://doi.org/10.3390/rs18121938 - 11 Jun 2026

Viewed by 125

Abstract

To address the issues of insufficient single-modality robustness and limited multi-scale object detection accuracy in remote sensing image detection (RSID) in complex environments, this paper proposes a multimodal RSID network named VI-MSFFN. The model adopts a symmetric parallel dual-branch architecture to achieve independent [...] Read more.

To address the issues of insufficient single-modality robustness and limited multi-scale object detection accuracy in remote sensing image detection (RSID) in complex environments, this paper proposes a multimodal RSID network named VI-MSFFN. The model adopts a symmetric parallel dual-branch architecture to achieve independent extraction and collaborative modeling of visible and infrared modal features. A cross-modal multi-scale sparse cross-attention fusion module is proposed and applied to the P4 and P5 feature layers, and a high-low-level feature collaborative cross-modal fusion strategy was constructed to achieve efficient and robust cross-modal feature fusion while enhancing multi-scale object modeling capability and suppressing feature redundancy and noise. Additionally, a progressive feature interaction and fusion architecture was designed to combine spatial and frequency domain information to strengthen deep object representation. The experimental results on the VEDAI and Drone Vehicle datasets demonstrate that VI-MSFFN achieves state-of-the-art (SOTA) performance in detection accuracy, robustness, and generalization ability. The proposed method effectively solves the detection challenges of RSID and has significant application value in the field of multi-modal RSID. Full article

► Show Figures

Figure 1

26 pages, 2476 KB

Open AccessArticle

Symmetry-Aware Physics-Guided Graph Network for Slope Displacement Prediction from GNSS Data

by Yanbo Yu, Long Zhang, Jinhong Lu, Rong He, Han Liao and Yongkang Zhang

Symmetry 2026, 18(6), 986; https://doi.org/10.3390/sym18060986 - 8 Jun 2026

Viewed by 185

Abstract

Accurate prediction of slope displacement from high-frequency GNSS monitoring data is critical for early warning of landslides and tailings dam failures. However, existing deep learning approaches often neglect the spatial coordination imposed by geological structures and fail to decouple abrupt deformation signals from [...] Read more.

Accurate prediction of slope displacement from high-frequency GNSS monitoring data is critical for early warning of landslides and tailings dam failures. However, existing deep learning approaches often neglect the spatial coordination imposed by geological structures and fail to decouple abrupt deformation signals from background noise, leading to non-physical oscillations and inconsistent long-term predictions. To address these limitations, this paper proposes a Symmetry-Aware Physics-Guided Spatio-Temporal Graph Network (PG-STGN). First, a geological hierarchy-aware graph is constructed by integrating geometric proximity with prior knowledge of exploration levels, where the resulting adjacency matrix is symmetric by design and reflects the physical symmetry of deformation interactions among monitoring points at the same elevation. A hierarchical masking mechanism restricts feature aggregation to physically connected neighborhoods while preserving this symmetry. Second, an improved dual-path temporal convolutional network (iTCN) decouples high-frequency abrupt variations from low-frequency evolutionary trends, enabling both sensitive detection of sudden deformation and stable tracking of long-term creep. Third, a physics-consistent loss function combining first-order temporal differencing and graph Laplacian regularization enforces kinematic smoothness and spatial coordination; the Laplacian itself is derived from the symmetric adjacency matrix, ensuring symmetric regularization across the monitoring network. Evaluated on a real-world slope GNSS dataset from a large-scale mining project, PG-STGN reduces mean squared error (MSE) by approximately 23.7% and achieves a global R² of 0.924, outperforming state-of-the-art spatio-temporal models. Ablation studies confirm that the symmetric physics-guided graph, dual-path decoupling, and consistency loss are each essential for suppressing spurious correlations and maintaining physically plausible predictions. The proposed framework provides a robust, interpretable, and symmetry-constrained solution for automated slope monitoring under complex geological conditions. Full article

(This article belongs to the Special Issue Symmetry in Data Analysis and Optimization)

► Show Figures

Figure 1

27 pages, 2287 KB

Open AccessArticle

Dual-Branch Graph Learning with Frequency Gating for Industrial Sensor Anomaly and Cyberattack Detection

by Tong Zhao, Wei Yang and Yu Yao

Sensors 2026, 26(11), 3607; https://doi.org/10.3390/s26113607 - 5 Jun 2026

Viewed by 240

Abstract

Industrial sensor systems are increasingly vulnerable to both physical anomalies and cyberattacks, while their collected time series typically present complex periodic and non-stationary characteristics, along with dynamic spatial dependencies among sensors. To address these issues, this paper proposes a dual-branch graph learning framework [...] Read more.

Industrial sensor systems are increasingly vulnerable to both physical anomalies and cyberattacks, while their collected time series typically present complex periodic and non-stationary characteristics, along with dynamic spatial dependencies among sensors. To address these issues, this paper proposes a dual-branch graph learning framework with frequency gating for simultaneous industrial sensor anomaly and cyberattack detection. The model first divides the input time series into multiple patches and decomposes each patch into periodic and non-stationary components via frequency analysis. Two graph isomorphism network branches, namely periodic GIN (P-GIN) and non-stationary GIN (NS-GIN), are designed to model the spatial dependencies of the two components separately, where the graph structure is adaptively learned using a Gaussian kernel-based mechanism. Furthermore, a frequency gating module is introduced in the non-stationary branch to enhance the representation of abnormal and attack-related features. Hierarchical temporal encoding is performed via intra-patch attention and inter-patch attention to capture both local and long-range temporal dependencies. Extensive experimental results on real-world industrial sensor datasets demonstrate that the proposed method achieves superior performance compared with state-of-the-art methods in both anomaly detection and cyberattack detection tasks. Full article

(This article belongs to the Topic Artificial Intelligence and Machine Learning in Cyber–Physical Systems)

► Show Figures

Figure 1

20 pages, 2546 KB

Open AccessArticle

MCC-Net: Efficient Dual-Attention Network for Infrared Small-Target Detection

by Xiaotian Zhou, Xin Wang, Yan Tian, Kai Jiang, Min Guo, Xuezheng Lian, Lu Ding, Quanyu Zhang and Yaqi Xue

Remote Sens. 2026, 18(11), 1858; https://doi.org/10.3390/rs18111858 - 5 Jun 2026

Viewed by 287

Abstract

Recent years have witnessed the emergence of numerous U-shaped deep learning segmentation methods for infrared small-target detection (IRSTD). However, increasingly complex models still suffer from false and missed detections in challenging scenarios with cluttered backgrounds and weak targets while incurring escalating computational costs. [...] Read more.

Recent years have witnessed the emergence of numerous U-shaped deep learning segmentation methods for infrared small-target detection (IRSTD). However, increasingly complex models still suffer from false and missed detections in challenging scenarios with cluttered backgrounds and weak targets while incurring escalating computational costs. To address these limitations, this paper proposes MCC-Net, a novel and efficient IRSTD framework that achieves superior detection performance with significantly reduced computational complexity. First, we integrate Magnitude-Aware Linear Attention (MALA) and Conditionally Parameterized Convolutions (CondConv) to replace conventional attention mechanisms in skip connections and standard convolutions, respectively, endowing the model with spatial contextual modeling and enhanced feature extraction capabilities at minimal computational overhead. Second, we design an innovative Conditional Cross-Channel Fusion (CondCCF) module that establishes a complementary spatial-channel dual-attention mechanism with MALA, enabling efficient multi-scale feature fusion. Extensive comparative and ablation experiments conducted on three public benchmarks—SIRST-v1, NUDT-SIRST, and IRSTD-1K—demonstrate that MCC-Net achieves state-of-the-art mIoU scores of 77.98%, 95.43%, and 70.46%, respectively, surpassing state-of-the-art methods by 1.07%, 1.95%, and 0.95%. MCC-Net also outperforms existing approaches across multiple evaluation metrics while maintaining substantially lower computational complexity. Full article

(This article belongs to the Special Issue New Insights in Remote Sensing Image Interpretation with Deep Learning)

► Show Figures

Figure 1

Search Results (1,302)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (1,302)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI