Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (320)

Search Parameters:
Keywords = shot dynamics

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
21 pages, 3170 KB  
Article
Reliable Communication in Distributed Photovoltaic Sensor Networks: A Large Language Model-Driven Approach
by Wu Dong, Xu Liu, Qing Liu, Guanghui Zhang, Ji Shi, Xun Zhao, Zhongming Lei and Wei Wang
Sensors 2026, 26(3), 838; https://doi.org/10.3390/s26030838 - 27 Jan 2026
Abstract
Distributed photovoltaic (DPV) systems present a cost-effective and sustainable industrial energy solution, yet their reliable monitoring faces significant technological constraints. This paper proposes a hierarchical optimization framework that integrates hysteresis-based traffic shaping at the network layer with Large Language Model (LLM)-driven diagnostics at [...] Read more.
Distributed photovoltaic (DPV) systems present a cost-effective and sustainable industrial energy solution, yet their reliable monitoring faces significant technological constraints. This paper proposes a hierarchical optimization framework that integrates hysteresis-based traffic shaping at the network layer with Large Language Model (LLM)-driven diagnostics at the application layer. The proposed dynamic algorithm minimizes latency and downtime by prioritizing critical fault data. Priority-based scheduling ensures this critical data is transmitted preferentially over routine sensor readings. At the application layer, the system utilizes physics-informed prompt engineering to perform zero-shot root cause analysis, circumventing the training data requirements of traditional classifiers. Under a 10 Mbps gateway bandwidth, our method achieves a 46.08% to 49.87% reduction in P50 latency compared to traditional approaches. Moreover, the LLM-powered diagnostic system provides detailed assessments, enabling precise fault diagnosis for DPV systems. Full article
(This article belongs to the Special Issue Sensors for Enabling Wireless Spectrum Access)
Show Figures

Figure 1

17 pages, 8796 KB  
Article
Subgrade Distress Detection in GPR Radargrams Using an Improved YOLOv11 Model
by Mingzhou Bai, Qun Ma, Hongyu Liu and Zilun Zhang
Sustainability 2026, 18(3), 1273; https://doi.org/10.3390/su18031273 - 27 Jan 2026
Abstract
This study compares three detectors—Single Shot MultiBox Detector (SSD), Faster Region-based Convolutional Neural Network (Faster R-CNN), and Only Look Once v11 (YOLOv11)—for detecting subgrade distress in GPR radargrams. SSD converges fastest but shows weaker detection performance, while Faster R-CNN achieves higher localization accuracy [...] Read more.
This study compares three detectors—Single Shot MultiBox Detector (SSD), Faster Region-based Convolutional Neural Network (Faster R-CNN), and Only Look Once v11 (YOLOv11)—for detecting subgrade distress in GPR radargrams. SSD converges fastest but shows weaker detection performance, while Faster R-CNN achieves higher localization accuracy at the cost of slower convergence. YOLOv11 offers the best overall performance. To push YOLOv11 further, we introduce three enhancements: a Multi-Scale Edge Enhancement Module (MEEM), a Multi-Feature Multi-Scale Attention (MFMSA) mechanism, and a hybrid configuration that combines both. On a representative dataset, YOLOv11_MEEM yields a 0.2 percentage-point increase in precision with a 0.2 percentage-point decrease in recall and a 0.3 percentage-point gain in mean Average Precision@0.5:0.95, indicating improved generalization and efficiency. YOLOv11_MFMSA achieves precision comparable to MEEM but suffers a substantial recall drop and slower inference. The hybrid YOLOv11_MEEM+MFMSA underperforms on key metrics due to gradient conflicts. MEEM reduces electromagnetic interference through dynamic edge enhancement, preserving real-time performance and robust generalization. Overall, MEEM-enhanced YOLOv11 is suitable for real-time subgrade distress detection in GPR radargrams. The research findings can offer technical support for the intelligent detection of subgrade engineering while also promoting the resilient development and sustainable operation and maintenance of urban infrastructure. Full article
Show Figures

Figure 1

23 pages, 5234 KB  
Article
Training Agents for Strategic Curling Through a Unified Reinforcement Learning Framework
by Yuseong Son, Jaeyoung Park and Byunghwan Jeon
Mathematics 2026, 14(3), 403; https://doi.org/10.3390/math14030403 - 23 Jan 2026
Viewed by 90
Abstract
Curling presents a challenging continuous-control problem in which shot outcomes depend on long-horizon interactions between complex physical dynamics, strategic intent, and opponent responses. Despite recent progress in applying reinforcement learning (RL) to games and sports, curling lacks a unified environment that jointly supports [...] Read more.
Curling presents a challenging continuous-control problem in which shot outcomes depend on long-horizon interactions between complex physical dynamics, strategic intent, and opponent responses. Despite recent progress in applying reinforcement learning (RL) to games and sports, curling lacks a unified environment that jointly supports stable, rule-consistent simulation, structured state abstraction, and scalable agent training. To address this gap, we introduce a comprehensive learning framework for curling AI, consisting of a full-sized simulation environment, a task-aligned Markov decision process (MDP) formulation, and a two-phase training strategy designed for stable long-horizon optimization. First, we propose a novel MDP formulation that incorporates stone configuration, game context, and dynamic scoring factors, enabling an RL agent to reason simultaneously about physical feasibility and strategic desirability. Second, we present a two-phase curriculum learning procedure that significantly improves sample efficiency: Phase 1 trains the agent to master delivery mechanics by rewarding accurate placement around the tee line, while Phase 2 transitions to strategic learning with score-based rewards that encourage offensive and defensive planning. This staged training stabilizes policy learning and reduces the difficulty of direct exploration in the full curling action space. We integrate this MDP and training procedure into a unified Curling RL Framework, built upon a custom simulator designed for stability, reproducibility, and efficient RL training and a self-play mechanism tailored for strategic decision-making. Agent policies are optimized using Soft Actor–Critic (SAC), an entropy-regularized off-policy algorithm designed for continuous control. As a case study, we compare the learned agent’s shot patterns with elite match records from the men’s division of the Le Gruyère AOP European Curling Championships 2023, using 6512 extracted shot images. Experimental results demonstrate that the proposed framework learns diverse, human-like curling shots and outperforms ablated variants across both learning curves and head-to-head evaluations. Beyond curling, our framework provides a principled template for developing RL agents in physics-driven, strategy-intensive sports environments. Full article
(This article belongs to the Special Issue Applications of Intelligent Game and Reinforcement Learning)
23 pages, 1277 KB  
Article
A Few-Shot Optical Classification Approach for Meteorological Lightning Monitoring: Leveraging Frame Difference and Triplet Network
by Mengmeng Xiao, Yulong Yan, Qilin Zhang, Yan Liu, Xingke Pan, Bingzhe Dai and Chunxu Duan
Remote Sens. 2026, 18(3), 386; https://doi.org/10.3390/rs18030386 - 23 Jan 2026
Viewed by 74
Abstract
To address the challenges of scarce labeled samples, strong instantaneity, and variable morphology in lightning optical classification—issues that traditional methods struggle to handle efficiently and often require extensive manual intervention—we propose a frame difference triplet network (FD-TripletNet) tailored for few-shot lightning recognition. The [...] Read more.
To address the challenges of scarce labeled samples, strong instantaneity, and variable morphology in lightning optical classification—issues that traditional methods struggle to handle efficiently and often require extensive manual intervention—we propose a frame difference triplet network (FD-TripletNet) tailored for few-shot lightning recognition. The lightning optical dataset used in this study was collected from two observation stations over six months, comprising 459 video samples that include lightning events with diverse morphologies (e.g., branched, spherical) and non-lightning events prone to misclassification (e.g., strong light interference, moving objects). Considering the critical feature of lightning—abrupt single-frame changes—we introduce adjacent frame difference matrices as model input to explicitly capture transient brightness variations, reducing noise from static backgrounds. To enhance discriminative ability in few-shot scenarios, the model leverages Triplet Loss to compact intra-class features and separate inter-class features, combined with a dynamic sample matching strategy to focus on challenging cases. The experimental results show that FD-TripletNet achieves a classification accuracy of 94.8% on the dataset, outperforming traditional methods and baseline deep learning models. It effectively reduces the False Negative Rate (FNR) to 3.2% and False Positive Rate (FPR) to 7.4%, successfully distinguishing between lightning and non-lightning events, thus providing an efficient solution for real-time lightning monitoring in meteorological applications. Full article
Show Figures

Figure 1

24 pages, 10940 KB  
Article
A Few-Shot Object Detection Framework for Remote Sensing Images Based on Adaptive Decision Boundary and Multi-Scale Feature Enhancement
by Lijiale Yang, Bangjie Li, Dongdong Guan and Deliang Xiang
Remote Sens. 2026, 18(3), 388; https://doi.org/10.3390/rs18030388 - 23 Jan 2026
Viewed by 119
Abstract
Given the high cost of acquiring large-scale annotated datasets, few-shot object detection (FSOD) has emerged as an increasingly important research direction. However, existing FSOD methods face two critical challenges in remote sensing images (RSIs): (1) features of small targets within remote sensing images [...] Read more.
Given the high cost of acquiring large-scale annotated datasets, few-shot object detection (FSOD) has emerged as an increasingly important research direction. However, existing FSOD methods face two critical challenges in remote sensing images (RSIs): (1) features of small targets within remote sensing images are incompletely represented due to extremely small-scale and cluttered backgrounds, which weakens discriminability and leads to significant detection degradation; (2) unified classification boundaries fail to handle the distinct confidence distributions between well-sampled base classes and sparsely sampled novel classes, leading to ineffective knowledge transfer. To address these issues, we propose TS-FSOD, a Transfer-Stable FSOD framework with two key innovations. First, the proposed detector integrates a Feature Enhancement Module (FEM) leveraging hierarchical attention mechanisms to alleviate small target feature attenuation, and an Adaptive Fusion Unit (AFU) utilizing spatial-channel selection to strengthen target feature representations while mitigating background interference. Second, Dynamic Temperature-scaling Learnable Classifier (DTLC) employs separate learnable temperature parameters for base and novel classes, combined with difficulty-aware weighting and dynamic adjustment, to adaptively calibrate decision boundaries for stable knowledge transfer. Experiments on DIOR and NWPU VHR-10 datasets show that TS-FSOD achieves competitive or superior performance compared to state-of-the-art methods, with improvements up to 4.30% mAP, particularly excelling in 3-shot and 5-shot scenarios. Full article
Show Figures

Figure 1

20 pages, 17064 KB  
Article
PriorSAM-DBNet: A SAM-Prior-Enhanced Dual-Branch Network for Efficient Semantic Segmentation of High-Resolution Remote Sensing Images
by Qiwei Zhang, Yisong Wang, Ning Li, Quanwen Jiang and Yong He
Sensors 2026, 26(2), 749; https://doi.org/10.3390/s26020749 - 22 Jan 2026
Viewed by 85
Abstract
Semantic segmentation of high-resolution remote sensing imagery is a critical technology for the intelligent interpretation of sensor data, supporting automated environmental monitoring and urban sensing systems. However, processing data from dense urban scenarios remains challenging due to sensor signal occlusions (e.g., shadows) and [...] Read more.
Semantic segmentation of high-resolution remote sensing imagery is a critical technology for the intelligent interpretation of sensor data, supporting automated environmental monitoring and urban sensing systems. However, processing data from dense urban scenarios remains challenging due to sensor signal occlusions (e.g., shadows) and the complexity of parsing multi-scale targets from optical sensors. Existing approaches often exhibit a trade-off between the accuracy of global semantic modeling and the precision of complex boundary recognition. While the Segment Anything Model (SAM) offers powerful zero-shot structural priors, its direct application to remote sensing is hindered by domain gaps and the lack of inherent semantic categorization. To address these limitations, we propose a dual-branch cooperative network, PriorSAM-DBNet. The main branch employs a Densely Connected Swin (DC-Swin) Transformer to capture cross-scale global features via a hierarchical shifted window attention mechanism. The auxiliary branch leverages SAM’s zero-shot capability to exploit structural universality, generating object-boundary masks as robust signal priors while bypassing semantic domain shifts. Crucially, we introduce a parameter-efficient Scaled Subsampling Projection (SSP) module that employs a weight-sharing mechanism to align cross-modal features, freezing the massive SAM backbone to ensure computational viability for practical sensor applications. Furthermore, a novel Attentive Cross-Modal Fusion (ACMF) module is designed to dynamically resolve semantic ambiguities by calibrating the global context with local structural priors. Extensive experiments on the ISPRS Vaihingen, Potsdam, and LoveDA-Urban datasets demonstrate that PriorSAM-DBNet outperforms state-of-the-art approaches. By fine-tuning only 0.91 million parameters in the auxiliary branch, our method achieves mIoU scores of 82.50%, 85.59%, and 53.36%, respectively. The proposed framework offers a scalable, high-precision solution for remote sensing semantic segmentation, particularly effective for disaster emergency response where rapid feature recognition from sensor streams is paramount. Full article
Show Figures

Figure 1

9 pages, 221 KB  
Article
Comparison of a Single-Shot Antibiotic Protocol Compared to a Conventional 5-Day Antibiotic Protocol in Equine Diagnostic Laparotomy Regarding Pre- and Postoperative Colonization with Multi-Drug-Resistant Indicator Pathogens
by Sabita Diana Stöckle, Dania Annika Kannapin, Roswitha Merle, Antina Lübke-Becker and Heidrun Gehlen
Antibiotics 2026, 15(1), 106; https://doi.org/10.3390/antibiotics15010106 - 21 Jan 2026
Viewed by 85
Abstract
Objective: The emergence and spread of multi-drug-resistant (MDR) bacteria pose a growing threat in veterinary medicine, particularly in equine hospitals. This study investigated the colonization and infection dynamics of horses undergoing emergency laparotomy with two distinct antibiotic protocols (single-shot versus 5-day protocol) during [...] Read more.
Objective: The emergence and spread of multi-drug-resistant (MDR) bacteria pose a growing threat in veterinary medicine, particularly in equine hospitals. This study investigated the colonization and infection dynamics of horses undergoing emergency laparotomy with two distinct antibiotic protocols (single-shot versus 5-day protocol) during hospitalization. Methods: Nasal swabs and fecal samples were collected from 67 horses undergoing emergency laparotomy at clinic admission as well as on postoperative days 3 and 10. These were screened for multi-drug-resistant indicator pathogens. As multi-drug-resistant indicator pathogens, methicillin-resistant Staphylococcus aureus (MRSA), extended-spectrum β-lactamase (ESBL)-producing Enterobacterales (ESBL-E), and bacteria belonging to the Acinetobacter baumannii complex were defined. Results: Preoperatively, 6.2% of horses tested positive for MRSA and 13% for ESBL-E. An increase in colonization was observed on day 3 postoperatively, with 62.1% of nasal swabs and 86.4% of fecal samples testing positive for MDR organisms. On day 10, 53.4% of nasal swabs and 62.5% of fecal samples tested positive for indicator pathogens. Surgical site infection developed in five horses, two of which tested positive for MRSA in both nasal and wound samples during hospitalization, supporting the potential role of nasal carriage as a source of infection. Furthermore, all horses tested positive for ESBL-E during at least one time-point during hospitalization, and Enterobacterales (MDR in two surgical site infections (SSI)) were involved in all surgical site infections. No significant differences were observed between the two antibiotic treatment groups regarding colonization rates with indicator pathogens during hospitalization. However, the results indicate that hospitalization itself contributes to increased colonization with resistant bacteria. A clear limitation of the study is the restricted number of sampled horses and the lack of environmental contamination data. Non-sampled hospitalized horses with and without antibiotic treatment may have acted as reservoirs for MDR bacteria. Conclusion: The findings emphasize the need for routine environmental monitoring and strict adherence to hygiene protocols in equine clinics to reduce the risk of nosocomial transmission. Ongoing surveillance and infection control strategies are essential to mitigate the spread of MDR pathogens in veterinary settings. Full article
(This article belongs to the Special Issue Antibiotic Resistance in Bacterial Isolates of Animal Origin)
19 pages, 3563 KB  
Article
Numerical and Experimental Study of Laser Surface Modification Using a High-Power Fiber CW Laser
by Evaggelos Kaselouris, Alexandros Gosta, Efstathios Kamposos, Dionysios Rouchotas, George Vernardos, Helen Papadaki, Alexandros Skoulakis, Yannis Orphanos, Makis Bakarezos, Ioannis Fitilis, Nektarios A. Papadogiannis, Michael Tatarakis and Vasilis Dimitriou
Materials 2026, 19(2), 343; https://doi.org/10.3390/ma19020343 - 15 Jan 2026
Viewed by 235
Abstract
This work presents a combined numerical and experimental investigation into the laser machining of aluminum alloy Al 1050 H14 using a high-power Continuous Wave (CW) fiber laser. Advanced three-dimensional, coupled thermal–structural Finite Element Method (FEM) simulations are developed to model key laser–material interaction [...] Read more.
This work presents a combined numerical and experimental investigation into the laser machining of aluminum alloy Al 1050 H14 using a high-power Continuous Wave (CW) fiber laser. Advanced three-dimensional, coupled thermal–structural Finite Element Method (FEM) simulations are developed to model key laser–material interaction processes, including laser-induced plastic deformation, laser etching, and engraving. Cases for both static single-shot and dynamic linear scanning laser beams are investigated. The developed numerical models incorporate a Gaussian heat source and the Johnson–Cook constitutive model to capture elastoplastic, damage, and thermal effects. The simulation results, which provide detailed insights into temperature gradients, displacement fields, and stress–strain evolution, are rigorously validated against experimental data. The experiments are conducted on an integrated setup comprising a 2 kW TRUMPF CW fiber laser hosted on a 3-axis CNC milling machine, with diagnostics including thermal imaging, thermocouples, white-light interferometry, and strain gauges. The strong agreement between simulations and measurements confirms the predictive capability of the developed FEM framework. Overall, this research establishes a reliable computational approach for optimizing laser parameters, such as power, dwell time, and scanning speed, to achieve precise control in metal surface treatment and modification applications. Full article
(This article belongs to the Special Issue Fabrication of Advanced Materials)
Show Figures

Graphical abstract

34 pages, 10017 KB  
Article
U-H-Mamba: An Uncertainty-Aware Hierarchical State-Space Model for Lithium-Ion Battery Remaining Useful Life Prediction Using Hybrid Laboratory and Real-World Datasets
by Zhihong Wen, Xiangpeng Liu, Wenshu Niu, Hui Zhang and Yuhua Cheng
Energies 2026, 19(2), 414; https://doi.org/10.3390/en19020414 - 14 Jan 2026
Viewed by 228
Abstract
Accurate prognosis of the remaining useful life (RUL) for lithium-ion batteries is critical for mitigating range anxiety and ensuring the operational safety of electric vehicles. However, existing data-driven methods often struggle to maintain robustness when transferring from controlled laboratory conditions to complex, sensor-limited, [...] Read more.
Accurate prognosis of the remaining useful life (RUL) for lithium-ion batteries is critical for mitigating range anxiety and ensuring the operational safety of electric vehicles. However, existing data-driven methods often struggle to maintain robustness when transferring from controlled laboratory conditions to complex, sensor-limited, real-world environments. To bridge this gap, this study presents U-H-Mamba, a novel uncertainty-aware hierarchical framework trained on a massive hybrid repository comprising over 146,000 charge–discharge cycles from both laboratory benchmarks and operational electric vehicle datasets. The proposed architecture employs a two-level design to decouple degradation dynamics, where a Multi-scale Temporal Convolutional Network functions as the base encoder to extract fine-grained electrochemical fingerprints, including derived virtual impedance proxies, from high-frequency intra-cycle measurements. Subsequently, an enhanced Pressure-Aware Multi-Head Mamba decoder models the long-range inter-cycle degradation trajectories with linear computational complexity. To guarantee reliability in safety-critical applications, a hybrid uncertainty quantification mechanism integrating Monte Carlo Dropout with Inductive Conformal Prediction is implemented to generate calibrated confidence intervals. Extensive empirical evaluations demonstrate the framework’s superior performance, achieving a RMSE of 3.2 cycles on the NASA dataset and 5.4 cycles on the highly variable NDANEV dataset, thereby outperforming state-of-the-art baselines by 20–40%. Furthermore, SHAP-based interpretability analysis confirms that the model correctly identifies physics-informed pressure dynamics as critical degradation drivers, validating its zero-shot generalization capabilities. With high accuracy and linear scalability, the U-H-Mamba model offers a viable and physically interpretable solution for cloud-based prognostics in large-scale electric vehicle fleets. Full article
(This article belongs to the Section F5: Artificial Intelligence and Smart Energy)
Show Figures

Figure 1

28 pages, 30101 KB  
Article
Machine Learning-Driven Soil Fungi Identification Using Automated Imaging Techniques
by Karol Struniawski, Ryszard Kozera, Aleksandra Konopka, Lidia Sas-Paszt and Agnieszka Marasek-Ciolakowska
Appl. Sci. 2026, 16(2), 855; https://doi.org/10.3390/app16020855 - 14 Jan 2026
Viewed by 125
Abstract
Soilborne fungi (Fusarium, Trichoderma, Verticillium, Purpureocillium) critically impact agricultural productivity, disease dynamics, and soil health, requiring rapid identification for precision agriculture. Current diagnostics require labor-intensive microscopy or expensive molecular assays (up to 10 days), while existing ML studies [...] Read more.
Soilborne fungi (Fusarium, Trichoderma, Verticillium, Purpureocillium) critically impact agricultural productivity, disease dynamics, and soil health, requiring rapid identification for precision agriculture. Current diagnostics require labor-intensive microscopy or expensive molecular assays (up to 10 days), while existing ML studies suffer from small datasets (<500 images), expert selection bias, and lack of public availability. A fully automated identification system integrating robotic microscopy (Keyence VHX-700) with deep learning was developed. The Soil Fungi Microscopic Images Dataset (SFMID) comprises 20,151 images (11,511 no-water, 8640 water-based)—the largest publicly available soil fungi dataset. Four CNN architectures (InceptionResNetV2, ResNet152V2, DenseNet121, DenseNet201) were evaluated with transfer learning and three-shot majority voting. Grad-CAM analysis validated biological relevance. ResNet152V2 conv2 achieved optimal SFMID-NW performance (precision: 0.6711; AUC: 0.8031), with real-time inference (20 ms, 48–49 images/second). Statistical validation (McNemar’s test: χ2=27.34,p<0.001) confirmed that three-shot classification significantly outperforms single-image prediction. Confusion analysis identified Fusarium–Trichoderma (no-water) and Fusarium–Verticillium (water-based) challenges, indicating morphological ambiguities. The publicly available SFMID provides a scalable foundation for AI-enhanced agricultural diagnostics. Full article
(This article belongs to the Special Issue Latest Research on Computer Vision and Image Processing)
Show Figures

Figure 1

24 pages, 2570 KB  
Article
SCT-Diff: Seamless Contextual Tracking via Diffusion Trajectory
by Guohao Nie, Xingmei Wang, Debin Zhang and He Wang
J. Imaging 2026, 12(1), 38; https://doi.org/10.3390/jimaging12010038 - 9 Jan 2026
Viewed by 180
Abstract
Existing detection-based trackers exploit temporal contexts by updating appearance models or modeling target motion. However, the sequential one-shot integration of temporal priors risks amplifying error accumulation, as frame-level template matching restricts comprehensive spatiotemporal analysis. To address this, we propose SCT-Diff, a video-level framework [...] Read more.
Existing detection-based trackers exploit temporal contexts by updating appearance models or modeling target motion. However, the sequential one-shot integration of temporal priors risks amplifying error accumulation, as frame-level template matching restricts comprehensive spatiotemporal analysis. To address this, we propose SCT-Diff, a video-level framework that holistically estimates target trajectories. Specifically, SCT-Diff processes video clips globally via a diffusion model to incorporate bidirectional spatiotemporal awareness, where reverse diffusion steps progressively refine noisy trajectory proposals into optimal predictions. Crucially, SCT-Diff enables iterative correction of historical trajectory hypotheses by observing future contexts within a sliding time window. This closed-loop feedback from future frames preserves temporal consistency and breaks the error propagation chain under complex appearance variations. For joint modeling of appearance and motion dynamics, we formulate trajectories as unified discrete token sequences. The designed Mamba-based expert decoder bridges visual features with language-formulated trajectories, enabling lightweight yet coherent sequence modeling. Extensive experiments demonstrate SCT-Diff’s superior efficiency and performance, achieving 75.4% AO on GOT-10k while maintaining real-time computational efficiency. Full article
(This article belongs to the Special Issue Object Detection in Video Surveillance Systems)
Show Figures

Figure 1

24 pages, 1916 KB  
Article
ServiceGraph-FM: A Graph-Based Model with Temporal Relational Diffusion for Root-Cause Analysis in Large-Scale Payment Service Systems
by Zhuoqi Zeng and Mengjie Zhou
Mathematics 2026, 14(2), 236; https://doi.org/10.3390/math14020236 - 8 Jan 2026
Viewed by 193
Abstract
Root-cause analysis (RCA) in large-scale microservice-based payment systems is challenging due to complex failure propagation along service dependencies, limited availability of labeled incident data, and heterogeneous service topologies across deployments. We propose ServiceGraph-FM, a pretrained graph-based model for RCA, where “foundation” denotes a [...] Read more.
Root-cause analysis (RCA) in large-scale microservice-based payment systems is challenging due to complex failure propagation along service dependencies, limited availability of labeled incident data, and heterogeneous service topologies across deployments. We propose ServiceGraph-FM, a pretrained graph-based model for RCA, where “foundation” denotes a self-supervised graph encoder pretrained on large-scale production cluster traces and then adapted to downstream diagnosis. ServiceGraph-FM introduces three components: (1) masked graph autoencoding pretraining to learn transferable service-dependency embeddings for cross-topology generalization; (2) a temporal relational diffusion module that models anomaly propagation as graph diffusion on dynamic service graphs (i.e., Laplacian-governed information flow with learnable edge propagation strengths); and (3) a causal attention mechanism that leverages multi-hop path signals to better separate likely causes from correlated downstream effects. Experiments on the Alibaba Cluster Trace and synthetic PayPal-style topologies show that ServiceGraph-FM outperforms state-of-the-art baselines, improving Top-1 accuracy by 23.7% and Top-3 accuracy by 18.4% on average, and reducing mean time to detection by 31.2%. In zero-shot deployment on unseen architectures, the pretrained model retains 78.3% of its fully fine-tuned performance, indicating strong transferability for practical incident management. Full article
(This article belongs to the Section E1: Mathematics and Computer Science)
Show Figures

Figure 1

17 pages, 466 KB  
Article
Breaking the Speed–Accuracy Trade-Off: A Novel Embedding-Based Framework with Coarse Screening-Refined Verification for Zero-Shot Named Entity Recognition
by Meng Yang, Shuo Wang, Hexin Yang and Ning Chen
Computers 2026, 15(1), 36; https://doi.org/10.3390/computers15010036 - 7 Jan 2026
Viewed by 192
Abstract
Although fine-tuning pretrained language models has brought remarkable progress to zero-shot named entity recognition (NER), current generative approaches still suffer from inherent limitations. Their autoregressive decoding mechanism requires token-by-token generation, resulting in low inference efficiency, while the massive parameter scale leads to high [...] Read more.
Although fine-tuning pretrained language models has brought remarkable progress to zero-shot named entity recognition (NER), current generative approaches still suffer from inherent limitations. Their autoregressive decoding mechanism requires token-by-token generation, resulting in low inference efficiency, while the massive parameter scale leads to high computational and deployment costs. In contrast, span-based methods avoid autoregressive decoding but often face large candidate spaces and severe noise redundancy, which hinder efficient entity localization in long-text scenarios. To overcome these challenges, we propose an efficient Embedding-based NER framework that achieves an optimal balance between performance and efficiency. Specifically, the framework first introduces a lightweight dynamic feature matching module for coarse-grained entity localization, enabling rapid filtering of potential entity regions. Then, a hierarchical progressive entity filtering mechanism is applied for fine-grained recognition and noise suppression. Experimental results demonstrate that the proposed model, which is trained on a single RTX 5090 GPU for only 24 h, attains approximately 90% of the performance of the SOTA GNER-T5 11B model while using only one-seventh of its parameters. Moreover, by eliminating the redundancy of autoregressive decoding, the proposed framework achieves a 17× faster inference speed compared to GNER-T5 11B and significantly surpasses traditional span-based approaches in efficiency. Full article
Show Figures

Figure 1

21 pages, 9995 KB  
Article
HCNet: Multi-Exposure High-Dynamic-Range Reconstruction Network for Coded Aperture Snapshot Spectral Imaging
by Hang Shi, Jingxia Chen, Yahui Li, Pengwei Zhang and Jinshou Tian
Sensors 2026, 26(1), 337; https://doi.org/10.3390/s26010337 - 5 Jan 2026
Viewed by 375
Abstract
Coded Aperture Snapshot Spectral Imaging (CASSI) is a rapid hyperspectral imaging technique with broad application prospects. Due to limitations in three-dimensional compressed data acquisition modes and hardware constraints, the compressed measurements output by actual CASSI systems have a finite dynamic range, leading to [...] Read more.
Coded Aperture Snapshot Spectral Imaging (CASSI) is a rapid hyperspectral imaging technique with broad application prospects. Due to limitations in three-dimensional compressed data acquisition modes and hardware constraints, the compressed measurements output by actual CASSI systems have a finite dynamic range, leading to degraded hyperspectral reconstruction quality. To address this issue, a high-quality hyperspectral reconstruction method based on multi-exposure fusion is proposed. A multi-exposure data acquisition strategy is established to capture low-, medium-, and high-exposure low-dynamic-range (LDR) measurements. A multi-exposure fusion-based high-dynamic-range (HDR) CASSI measurement reconstruction network (HCNet) is designed to reconstruct physically consistent HDR measurement images. Unlike traditional HDR networks for visual enhancement, HCNet employs a multiscale feature fusion architecture and combines local–global convolutional joint attention with residual enhancement mechanisms to efficiently fuse complementary information from multiple exposures. This makes it more suitable for CASSI systems, ensuring high-fidelity reconstruction of hyperspectral data in both spatial and spectral dimensions. A multi-exposure fusion CASSI mathematical model is constructed, and a CASSI experimental system is established. Simulation and real-world experimental results demonstrate that the proposed method significantly improves hyperspectral image reconstruction quality compared to traditional single-exposure strategies, exhibiting high robustness against multi-exposure interval jitters and shot noise in practical systems. Leveraging the higher-dynamic-range target information acquired through multiple exposures, especially in HDR scenes, the method enables reconstruction with enhanced contrast in both bright and dark details and also demonstrates higher spectral correlation, validating the enhancement of CASSI reconstruction and effective measurement capability in HDR scenarios. Full article
(This article belongs to the Section Optical Sensors)
Show Figures

Figure 1

23 pages, 725 KB  
Article
From Sound to Risk: Streaming Audio Flags for Real-World Hazard Inference Based on AI
by Ilyas Potamitis
J. Sens. Actuator Netw. 2026, 15(1), 6; https://doi.org/10.3390/jsan15010006 - 1 Jan 2026
Viewed by 767
Abstract
Seconds count differently for people in danger. We present a real-time streaming pipeline for audio-based detection of hazardous life events affecting life and property. The system operates online rather than as a retrospective analysis tool. Its objective is to reduce the latency between [...] Read more.
Seconds count differently for people in danger. We present a real-time streaming pipeline for audio-based detection of hazardous life events affecting life and property. The system operates online rather than as a retrospective analysis tool. Its objective is to reduce the latency between the occurrence of a crime, conflict, or accident and the corresponding response by authorities. The key idea is to map reality as perceived by audio into a written story and question the text via a large language model. The method integrates streaming, zero-shot algorithms in an online decoding mode that convert sound into short, interpretable tokens, which are processed by a lightweight language model. CLAP text–audio prompting identifies agitation, panic, and distress cues, combined with conversational dynamics derived from speaker diarization. Lexical information is obtained through streaming automatic speech recognition, while general audio events are detected by a streaming version of Audio Spectrogram Transformer tagger. Prosodic features are incorporated using pitch- and energy-based rules derived from robust F0 tracking and periodicity measures. The system uses a large language model configured for online decoding and outputs binary (YES/NO) life-threatening risk decisions every two seconds, along with a brief justification and a final session-level verdict. The system emphasizes interpretability and accountability. We evaluate it on a subset of the X-Violence dataset, comprising only real-world videos. We release code, prompts, decision policies, evaluation splits, and example logs to enable the community to replicate, critique, and extend our blueprint. Full article
(This article belongs to the Topic Trends and Prospects in Security, Encryption and Encoding)
Show Figures

Figure 1

Back to TopTop