MDPI - Publisher of Open Access Journals

20 pages, 6122 KB

Open AccessArticle

Automated Detection and Classification of Lunar Linear Tectonic Features Using a Deep Learning Method

by Xiaoyang Liu, Yang Luo, Jianhui Wang, Denggao Qiu, Jianguo Yan, Wensong Zhang and Yaowen Luo

Remote Sens. 2026, 18(9), 1330; https://doi.org/10.3390/rs18091330 (registering DOI) - 26 Apr 2026

Abstract

On the lunar surface, wrinkle ridges, grabens, and lobate scarps represent key tectonic landforms that reflect the evolution of the Moon’s stress field and its tectonic processes. However, these linear structures often exhibit weak textures, low contrast, and large scale variations, making manual [...] Read more.

On the lunar surface, wrinkle ridges, grabens, and lobate scarps represent key tectonic landforms that reflect the evolution of the Moon’s stress field and its tectonic processes. However, these linear structures often exhibit weak textures, low contrast, and large scale variations, making manual interpretation inefficient and subjective. To address this issue, this study introduces an improved YOLOv8 model, termed HL-YOLOv8, for the automated detection of lunar linear features. The model incorporates a multiscale lightweight channel attention (C2f_MLCA) module into the backbone network to enhance the extraction of fine-grained and weak-texture features and integrates a multihead self-attention (C2f_MHSA) module in the feature fusion stage to improve the modelling of long-range spatial dependencies. In addition, the combination of a dual focal loss and a diversified data augmentation strategy effectively mitigates the detection difficulties caused by class imbalance and weak-feature samples. The experimental results obtained using the global LROC-WAC image dataset demonstrate that HL-YOLOv8 significantly outperforms the baseline YOLOv8 and other comparative models in terms of precision, recall, and mAP@0.5. Specifically, the proposed model achieved an average precision of 73.5%, an average recall of 73.1%, and an average mAP@0.5 of 74.6% on the evaluation dataset, showing particularly strong performance in detecting elongated grabens and boundary-blurred lobate scarps. The global distribution maps derived from the model predictions indicate that HL-YOLOv8 can be applied to comprehensively reconstruct the spatial patterns of the three types of linear structures and identify potential new features in high-latitude and geologically complex regions, demonstrating excellent generalizability and robustness. This study provides an efficient and reliable framework for the automated identification and global mapping of lunar linear features and offers a transferable methodological reference for the tectonic interpretation of terrestrial planets. Full article

28 pages, 3444 KB

Open AccessArticle

A Lightweight Method for Power Quality Disturbance Recognition Based on Optimized VMD and CNN–Transformer

by Dongya Xiao, Jiaming Liu, Haining Liu and Yang Zhao

Electronics 2026, 15(9), 1832; https://doi.org/10.3390/electronics15091832 (registering DOI) - 26 Apr 2026

Abstract

Aiming at the issues of low recognition accuracy and high model computational complexity for power quality disturbances (PQDs) in strong-noise environments, this paper proposes a novel lightweight PQD-recognition method that integrates a hybrid architecture of variational mode decomposition (VMD), convolutional neural network (CNN), [...] Read more.

Aiming at the issues of low recognition accuracy and high model computational complexity for power quality disturbances (PQDs) in strong-noise environments, this paper proposes a novel lightweight PQD-recognition method that integrates a hybrid architecture of variational mode decomposition (VMD), convolutional neural network (CNN), and transformer. Firstly, a hybrid optimization algorithm named the monkey–genetic hybrid optimization algorithm (MGHOA) is proposed to optimize VMD parameters for denoising disturbance signals, thereby enhancing recognition accuracy in noisy environments. Secondly, to fully extract disturbance signal features and reduce the computational complexity of the model, a lightweight CNN–transformer model is designed. Depthwise separable convolution (DSC) is employed to extract local features and the multi-head attention mechanism of transformer is utilized to mine the long-distance dependence and global features, thereby enhancing the feature representation. Thirdly, a multitask joint-learning method is proposed to collaboratively optimize classification accuracy and temporal localization tasks, enhancing the discrimination of similar disturbances. Additionally, a dual-pooling global feature fusion strategy is designed to further enhance the model’s ability to discriminate complex disturbances. Comparative experiments on 16 typical PQD types demonstrate that the proposed method achieves excellent performance in recognition accuracy, model robustness, and computational efficiency. The integration of the MGHOA–VMD module improves recognition accuracy by 1.08%, while the multitask joint-learning method contributes an additional 0.55% improvement. When achieving recognition accuracy comparable to complex models, the training time of the proposed method is 36.51% of that required by DeepCNN and merely 5.90% of that required by bidirectional long short-term memory (BiLSTM), with a 31.22% reduction in parameter scale. This work provides a novel solution for intelligent power quality disturbance recognition. Full article

(This article belongs to the Section Power Electronics)

► Show Figures

Figure 1

36 pages, 9428 KB

Open AccessArticle

Smart Diagnostics: Hierarchical Deep Learning of Acoustic Emission Signals for Early Crack Detection in Zirconia Dental Structures

by Kuson Tuntiwong, Rangsinee Wangman, Kanchana Kanchanatawewat, Boonjira Anucul, Hiranya Sritart, Pattarapong Phasukkit and Supan Tangjitkusolmun

Sensors 2026, 26(9), 2682; https://doi.org/10.3390/s26092682 (registering DOI) - 26 Apr 2026

Abstract

Monolithic zirconia restorations are frequently affected by the unnoticed growth of subcritical cracks, a failure process that is not captured by traditional imaging methods like radiographs and ultrasounds in sophisticated dental architectures. To address this evaluative inadequacy, this research introduces a hierarchical deep [...] Read more.

Monolithic zirconia restorations are frequently affected by the unnoticed growth of subcritical cracks, a failure process that is not captured by traditional imaging methods like radiographs and ultrasounds in sophisticated dental architectures. To address this evaluative inadequacy, this research introduces a hierarchical deep learning framework for microcrack detection and spatial localization. We promote a hierarchical deep learning system that integrates Acoustic Emission (AE) detection alongside signal processing. Raw AE signals utilized during dynamic loading are enhanced via Kalman filtering and Continuous Wavelet Transform (CWT) to construct high-fidelity time–frequency scalograms. The diagnostic pipeline operates in two stages: first, a hybrid CNN–BiGRU network with temporal attention fulfills zirconia component-level classification; second, a ResNet-18 backbone integrated with Bidirectional LSTM and Multi-Head Attention precisely localizes defects across five anatomical crown regions. This hierarchical design effectively captures the non-stationary, transient nature of fracture-induced stress waves. The framework achieved an F1-score of 99.00% and an AUC of 0.994, significantly outperforming conventional convolutional networks. By enabling predictive maintenance through early, non-invasive damage localization, this study demonstrates a promising laboratory framework for AE-based crack detection in zirconia dental structures and prosthetics and toward enhanced clinical reliability in digital dentistry. Full article

(This article belongs to the Special Issue Acoustic and Ultrasonic Sensing Technology in Non-Destructive Testing—2nd Edition)

► Show Figures

Graphical abstract

21 pages, 10729 KB

Open AccessArticle

Detecting Dairy Cattle Protective Behaviors via a Multi-Stage Attention SlowFast Network

by Bo Zhang, Jia Li, Feilong Kang, Yongan Zhang, Yu Xia, Yanqiu Liu and Jian Zhao

Animals 2026, 16(9), 1321; https://doi.org/10.3390/ani16091321 (registering DOI) - 26 Apr 2026

Abstract

Protective behavior in dairy cattle is one of the important potential indicators of their health and welfare status, and the precise detection of this behavior is of great significance for improving pasture management. However, existing methods face challenges, including capturing rapid motions, excessive [...] Read more.

Protective behavior in dairy cattle is one of the important potential indicators of their health and welfare status, and the precise detection of this behavior is of great significance for improving pasture management. However, existing methods face challenges, including capturing rapid motions, excessive background interference, and sample imbalance in complex agricultural environments. In response to these challenges, we proposed a Multi-Stage Attention SlowFast (MSA-SlowFast) model based on the improved SlowFast network to explore the model’s ability to distinguish between normal and protective behavior of dairy cattle. It achieves performance improvement through three core modules: the Multi-Path Balanced Head (MPBHead) for alleviating category imbalance, the Spatio-Temporal Convolutional Block Attention Module (ST-CBAM) for enhancing key feature extraction, and the 7 (BAF) for promoting multi-path feature complementarity. Additionally, we proposed novel timing-aware oversampling methods and dynamic loss adjustment mechanisms to further improve the detection performance of minority-class protective behaviors. Finally, a spatio-temporal-oriented dairy cattle protective behaviors dataset is constructed. Experimental results demonstrate that the proposed MSA-SlowFast model achieves 79.41% mAP, surpassing the standard SlowFast (70.58%) and Slow-only (68.21%). Further validation shows that the model exhibits high detection confidence in four specific actions labeled as protective behavior: 0.97 for tail swaying, 0.90 for head shaking, 0.92 for ear flapping, and 0.90 for leg kicking. These preliminary results show that the method proposed in this study has certain feasibility and reference value for the detection of protective behavior of dairy cattle under our constructed dataset. Full article

(This article belongs to the Section Animal System and Management)

► Show Figures

Figure 1

19 pages, 8343 KB

Open AccessArticle

TAHRNet: An Improved HRNet-Based Semantic Segmentation Model for Mangrove Remote Sensing Imagery

by Haonan Lin, Dongyang Fu, Chuhong Wang, Jinjun Huang, Hanrui Wu, Yu Huang and Litian Xiong

Forests 2026, 17(5), 525; https://doi.org/10.3390/f17050525 (registering DOI) - 25 Apr 2026

Abstract

Mangrove represent vital coastal ecosystems that contribute to shoreline stabilization, ecological balance, and environmental management. Nevertheless, the precise delineation of mangrove regions using remote sensing data is often impeded by spectral similarities with intertidal mudflats and aquatic features, alongside the irregular spatial patterns [...] Read more.

Mangrove represent vital coastal ecosystems that contribute to shoreline stabilization, ecological balance, and environmental management. Nevertheless, the precise delineation of mangrove regions using remote sensing data is often impeded by spectral similarities with intertidal mudflats and aquatic features, alongside the irregular spatial patterns and intricate margins of mangrove stands. This research utilizes high-resolution Gaofen-6 (GF-6) satellite observations as the foundational data to develop Triplet Axial High-Resolution Network (TAHRNet), a semantic segmentation architecture derived from the High-Resolution Network with Object-Contextual Representations (HRNet-OCR) framework for mangrove identification. The model integrates a Triplet Attention module to facilitate cross-dimensional feature dependencies and an improved Multi-Head Sequential Axial Attention mechanism to capture long-range spatial context while maintaining structural consistency. Based on evaluations using the test dataset, TAHRNet yielded a Mean Intersection over Union (MIoU) of 92.01% and a Overall Accuracy of 96.38%. Relative to U-Net and SegFormer, the proposed approach showed MIoU improvements of 5.25% and 1.88%, with corresponding Accuracy gains of 2.68% and 0.94%. Further application to coastal mapping in Zhanjiang produced results that align with manual visual interpretation. These findings suggest that TAHRNet is a viable tool for mangrove extraction and can provide technical support for coastal monitoring and ecological analysis. Full article

(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

29 pages, 4546 KB

Open AccessArticle

Beyond Scale Variability: Dynamic Cross-Scale Modeling and Efficient Sparse Heads for Wind Turbine Blade Defect Detection

by Xingxing Fan, Manxiang Gao, Yong Wang, Haining Tang, Fengyong Sun and Changpo Song

Processes 2026, 14(9), 1367; https://doi.org/10.3390/pr14091367 - 24 Apr 2026

Abstract

Images of wind turbine blades captured by drones often feature complex backgrounds, and small targets such as minor defects or images have low resolution, leading to reduced recognition rates. To address environments with complex feature backgrounds, this paper proposes the PPS-MSDeim model. Based [...] Read more.

Images of wind turbine blades captured by drones often feature complex backgrounds, and small targets such as minor defects or images have low resolution, leading to reduced recognition rates. To address environments with complex feature backgrounds, this paper proposes the PPS-MSDeim model. Based on the lightweight end-to-end detection framework DEIM-N, it introduces three core innovations to tackle the challenge of detecting small, irregular defects on wind turbine blades against complex backgrounds. First, we design an inverted multi-scale deep separable convolutional module (MDSC). After compressing channels via a bottleneck layer, it concurrently processes 3 × 3, 5 × 5, and 7 × 7 inverted deep separable convolutions. By first fusing channel information and then extracting multi-receiver-field spatial features, this approach enhances the ability to characterize morphologically variable defects while reducing computational overhead. The MDSC is then embedded into the backbone network HGNetv2. Second, we construct a Multi-Scale Feature Aggregation and Diffusion Pyramid Network (MFADPN). Through a Multi-Scale Feature Aggregation Module (MSFAM), it directly fuses features from layers P2 to P5, achieving deep integration of high-level semantics and low-level details. Combining dilated convolutions with expansion ratios of 1, 3, and 5 captures multi-level context, and a Sobel edge branch is introduced to enhance defect contours; subsequently, a feature diffusion operation is performed to distribute the enhanced features back to each level, shortening information paths and preventing signal decay; simultaneously, a high-resolution detection head is added to P2 and the P5 head is removed to improve sensitivity for small object detection. Finally, we propose the PPSformer module to replace the original Transformer encoding layer. It uses patch embedding to convert images into sequences and introduces a multi-head probabilistic sparse self-attention mechanism that focuses only on key-value pairs during attention computation. This design efficiently captures irregularly varying feature information and globally detects data anomalies induced by external defects. This study uses real engineering data sets, and the results show that PPS-MSDeim, based on DEIM, increased mAP@0.5 by 6.7%, reaching 95.1%. mAP@0.5–0.95 increased by 12.0%, reaching 70.1%. This indicates that the proposed method has a significant advantage in detecting defects in wind turbine blades. Full article

(This article belongs to the Special Issue Application of Adaptive Control, Fault Detection and Deep Learning in Electrical Engineering)

21 pages, 1850 KB

Open AccessArticle

A Spatio-Temporal Hybrid Multi-Head Attention Model for AIS-Based Ship Trajectory Prediction

by Yuhui Liu, Xiongguan Bao, Shuangming Li, Chenhui Gu and Qihua Fang

Future Transp. 2026, 6(3), 94; https://doi.org/10.3390/futuretransp6030094 - 24 Apr 2026

Abstract

To improve ship AIS trajectory prediction under pronounced spatiotemporal coupling and dynamic maneuvering conditions, this study proposes a Spatio-Temporal-Hybrid-Multi-head Attention model (STHA) integrating multiscale convolution, bidirectional long short-term memory, and multi-head attention. Historical AIS data from the Zhoushan waters in 2024 were preprocessed [...] Read more.

To improve ship AIS trajectory prediction under pronounced spatiotemporal coupling and dynamic maneuvering conditions, this study proposes a Spatio-Temporal-Hybrid-Multi-head Attention model (STHA) integrating multiscale convolution, bidirectional long short-term memory, and multi-head attention. Historical AIS data from the Zhoushan waters in 2024 were preprocessed through screening, cleaning, outlier removal, resampling, and cubic spline interpolation to construct trajectory samples. Comparative experiments were conducted against BP, BiLSTM, and BiGRU using MAPE, RMSE, and R² as evaluation metrics. The results show that STHA achieves the best overall predictive performance, more accurately follows trajectory variations across different vessel types, and exhibits better robustness in scenarios involving turning and speed changes. These findings indicate that the proposed model is effective for high-precision ship trajectory prediction and can provide useful support for subsequent collision risk assessment and navigation safety assistance. Full article

(This article belongs to the Special Issue Next-Generation AI and Foundation Models for Transportation Systems)

31 pages, 22857 KB

Open AccessArticle

Congestion-Aware Adaptive Routing Based on Graph Attention Networks and Dynamic Cost Optimization

by Jun Liu, Xinwei Li and Lingyun Zhou

Symmetry 2026, 18(5), 719; https://doi.org/10.3390/sym18050719 - 24 Apr 2026

Abstract

To mitigate local congestion and address the adaptability limitations of traditional static routing under dynamic traffic, this paper proposes an end-to-end routing method based on a Graph Attention Network (GAT), termed Congestion-Aware Graph Attention Routing (CA-GAR). To alleviate the issue of local optima [...] Read more.

To mitigate local congestion and address the adaptability limitations of traditional static routing under dynamic traffic, this paper proposes an end-to-end routing method based on a Graph Attention Network (GAT), termed Congestion-Aware Graph Attention Routing (CA-GAR). To alleviate the issue of local optima in traditional heuristic iterative optimization, we design a dynamic link cost optimization algorithm with multi-start parallel exploration. This algorithm employs a ”penalty–reselection–reward” closed-loop feedback mechanism, performing global searches from multiple random initial states to generate a high-quality, empirically near-optimal cost matrix as supervised labels. Building on this, CA-GAR leverages a multi-head attention mechanism to adaptively aggregate high-order topological features of nodes and edges, and incorporates a staged hierarchical hyperparameter optimization strategy to map real-time network states to link costs. Simulation results demonstrate that CA-GAR outperforms traditional static routing under light, medium, and heavy loads. Under high-load burst conditions, the method exhibits effective congestion avoidance capability, reducing end-to-end delay by approximately 50% and lowering the packet loss rate to as low as 2%. Compared with QLRA, CA-GAR shows promising performance in multi-path traffic splitting and possesses robust fast rerouting capabilities during node failures, thereby achieving intelligent traffic distribution and global load balancing. Full article

(This article belongs to the Special Issue Symmetry in Computational Intelligence and Data Science)

25 pages, 2134 KB

Open AccessArticle

High-Precision Airfoil Flow-Field Prediction Based on Spatial Multilayer Perceptron with Error-Gradient-Guided Data Sampling

by Yu Li, Di Peng and Feng Gu

Aerospace 2026, 13(5), 401; https://doi.org/10.3390/aerospace13050401 - 23 Apr 2026

Viewed by 74

Abstract

Airfoil flow-field prediction is important for aerodynamic design, but wind-tunnel testing and computational fluid dynamics (CFD) remain costly and time-consuming. Deep learning enables fast inference, yet many existing models still rely on fixed grid representations, which may lead to insufficient learning in high-gradient [...] Read more.

Airfoil flow-field prediction is important for aerodynamic design, but wind-tunnel testing and computational fluid dynamics (CFD) remain costly and time-consuming. Deep learning enables fast inference, yet many existing models still rely on fixed grid representations, which may lead to insufficient learning in high-gradient regions and larger local errors. This study proposes Spatial Multilayer Perceptron (Spatial MLP) together with an Error-Gradient-Guided Data Sampling (EGDS) strategy for airfoil flow-field prediction. Spatial MLP adopts a coordinate-based point-wise prediction framework. A spatial decoder is introduced as an auxiliary branch to enhance global flow consistency during pretraining, while channel-wise multi-head attention is incorporated to improve cross-variable feature coupling. EGDS prioritizes physically informative points according to relative prediction error and gradient magnitude, while retaining random samples to preserve data diversity. Experiments on an independent test set show that Spatial MLP reduces the mean relative error (averaged over the velocity components u, v, and pressure p) by 15.2% relative to the MLP baseline. With EGDS, the overall mean relative error is further reduced by 34.5% relative to the MLP baseline. These results demonstrate that combining global consistency constraints with targeted sampling effectively improves both global prediction accuracy and local reconstruction quality in high-gradient flow regions. Full article

(This article belongs to the Section Aeronautics)

26 pages, 8883 KB

Open AccessArticle

Strip Steel Defect Detection Algorithm Integrating Dynamic Convolution and Attention

by Changchun Shao, Zhijie Chen and Jianjun Meng

Electronics 2026, 15(9), 1796; https://doi.org/10.3390/electronics15091796 - 23 Apr 2026

Viewed by 80

Abstract

To address the issues of low accuracy, high false positives, and missed detections in hot-rolled strip steel surface defect inspection, this paper proposes an improved detection model named DFEM-NET based on YOLOv8n. First, an efficient feature extraction module (DSC2f) based on Dynamic Snake [...] Read more.

To address the issues of low accuracy, high false positives, and missed detections in hot-rolled strip steel surface defect inspection, this paper proposes an improved detection model named DFEM-NET based on YOLOv8n. First, an efficient feature extraction module (DSC2f) based on Dynamic Snake Convolution is designed to enhance the model’s capability in capturing features of irregular and elongated defects. Second, a Feature Pyramid Shared Convolution module (FPSC) is constructed to expand the model’s receptive field and effectively suppress interference from complex backgrounds. Third, an Enhanced Feature Correction (EFC) strategy is adopted during the feature fusion stage to help the model better learn the detailed features of small defect targets. Finally, a Multi-Scale Attention Aggregation module (MSAA) is introduced before the detection head, enabling the network to focus on critical feature information and thereby comprehensively improve detection accuracy for target defects. Experimental results demonstrate that, compared to the baseline model YOLOv8n, DFEM-NET achieves a detection accuracy (mAP@0.5) of 83.5%, representing an increase of 4.8%; a recall rate of 76.4%, an increase of 3.3%; and a precision of 84.7%, an increase of 3.1%, without a significant increase in model complexity. Furthermore, generalization experiments conducted on the GC10-DET dataset confirm that the proposed algorithm exhibits exceptional generalization capability. Full article

(This article belongs to the Special Issue Machine and Deep Learning Applications to Computer Vision and Recognition)

31 pages, 7259 KB

Open AccessArticle

Enhancing IoT Network Security: A BPSO-Optimized Attention-GRU Deep Learning Framework for Intrusion Detection

by Abdallah Elayan and Michel Kadoch

Computers 2026, 15(5), 266; https://doi.org/10.3390/computers15050266 - 23 Apr 2026

Viewed by 72

Abstract

The exponential expansion of computer networks, alongside the rapid development of the Internet of Things (IoT), has significantly increased the volume and complexity of transmitted data, emphasizing the need for robust network security measures to secure sensitive data and prevent unauthorized access or [...] Read more.

The exponential expansion of computer networks, alongside the rapid development of the Internet of Things (IoT), has significantly increased the volume and complexity of transmitted data, emphasizing the need for robust network security measures to secure sensitive data and prevent unauthorized access or breaches. Intrusion Detection Systems (IDSs) have emerged as a vital tool for protecting networks and IoT environments from threats. Various IDSs have been proposed in the literature; however, the lack of optimal feature learning, computational efficiency, and reliance on obsolete datasets poses significant challenges, limiting their effectiveness against evolving cyber threats. Moreover, traditional IDSs struggle to efficiently manage the high-dimensional and imbalanced nature of IoT network traffic data. To address these challenges, this research proposes a hybrid deep learning (DL)-based IDS integrating Binary Particle Swarm Optimization (BPSO), MultiHead Attention mechanisms (MHA), and a deep Gated Recurrent Unit (GRU) architecture, improving detection effectiveness while reducing computational overhead. Our proposed approach also utilizes a Target Sampling strategy to balance class distributions, enhancing the model’s ability to accurately identify minority attacks. The BPSO algorithm is employed to identify the most influential features from the high-dimensional network traffic datasets, enhancing model interpretability and supporting more efficient learning. This optimized feature subset is then fed into a GRU-based DL architecture augmented with MHA, which performs sequence processing and attention-based learning for intrusion detection. The performance of the proposed model is evaluated utilizing the BoT-IoT and the CIC-IDS2017 benchmark datasets, ensuring a comprehensive assessment of anomaly detection capabilities. Extensive experimental results demonstrate the superior performance of the proposed model, achieving a recall of 98.42% and 99.76%, with F1-score of 98.94% and 99.76% for binary classification and a recall of 99.79% and 98.69%, with F1-score of 99.89% and 98.04% for multiclass classification on the BoT-IoT and CIC-IDS2017 datasets, respectively, highlighting the effectiveness of our model in enhancing threat detection for computer networks and IoT environments in comparison to recent state-of-the-art IDSs. Full article

(This article belongs to the Special Issue AI-Powered IoT (AIoT) Systems: Advancements in Security, Sustainability, and Intelligence)

19 pages, 2502 KB

Open AccessArticle

Automatic Sleep Staging with Long-Term Temporal Modeling Using Single-Channel EEG

by Qiyu Yang, Dejun Zhang and Yi Huang

Appl. Sci. 2026, 16(9), 4092; https://doi.org/10.3390/app16094092 - 22 Apr 2026

Viewed by 208

Abstract

With the increasing demand for sleep health monitoring, automatic sleep staging using single-channel electroencephalogram (EEG) signals has become increasingly prominent due to its clinical practicality. Existing methods have achieved notable progress, but they often fail to adequately capture long-term temporal dependencies and struggle [...] Read more.

With the increasing demand for sleep health monitoring, automatic sleep staging using single-channel electroencephalogram (EEG) signals has become increasingly prominent due to its clinical practicality. Existing methods have achieved notable progress, but they often fail to adequately capture long-term temporal dependencies and struggle to characterize transition phases. We propose SleepLT, an automated sleep staging framework that integrates multi-scale wavelet decomposition (MWD) and multi-head latent Fourier attention (MLFA). The MLFA module incorporates Fourier analysis into self-attention mechanisms and employs a partially weight-sharing bottleneck to optimize Key/Value generation, effectively capturing sleep rhythms. Extensive experiments on SleepEDF-78 and SHHS datasets demonstrate strong and consistent performance, with Macro F1 improvements of 2.1–3.2% over the compared baselines. Visualizations confirm that SleepLT enhances inter-class discriminability between sleep stages, robustly detects salient waveforms, and effectively captures transitions through long-sequence modeling. These results indicate that SleepLT is effective for automatic sleep staging from single-channel EEG, particularly in improving the recognition of ambiguous transitional stages such as N1 and REM. Full article

(This article belongs to the Special Issue Applied Multimodal AI: Methods and Applications Across Domains)

► Show Figures

Figure 1

25 pages, 2360 KB

Open AccessArticle

ACF-YOLO: Feature Enhancement and Multi-Scale Alignment for Sustainable Crop Small Object Detection

by Chuanxiang Li, Yihang Li, Wenzhong Yang and Danny Chen

Sustainability 2026, 18(9), 4168; https://doi.org/10.3390/su18094168 - 22 Apr 2026

Viewed by 150

Abstract

Sustainable precision agriculture is crucial for optimizing resource utilization, reducing chemical inputs, and ensuring global food security. High-precision automatic recognition and monitoring of key crop organs (e.g., wheat heads and flower clusters) serve as the technological foundation for sustainable agricultural management decisions. However, [...] Read more.

Sustainable precision agriculture is crucial for optimizing resource utilization, reducing chemical inputs, and ensuring global food security. High-precision automatic recognition and monitoring of key crop organs (e.g., wheat heads and flower clusters) serve as the technological foundation for sustainable agricultural management decisions. However, visual perception in natural field environments is highly susceptible to external conditions. To address the challenges of severe background interference and feature dilution in crop small object detection within complex agricultural scenarios, this paper proposes an enhanced detection network, ACF-YOLO, based on YOLO11. First, an Aggregated Multi-scale Local-Global Attention (AMLGA) module is designed to enhance the feature representation of weak targets by fusing local details with global semantics. Second, a Context-Guided Fusion Module (CGFM) and a Soft-Neighbor Interpolation (SNI) strategy are introduced. Their synergy alleviates feature aliasing effects and ensures the precise alignment of deep semantic information with shallow spatial details. Furthermore, the Inner-MPDIoU loss function is employed to optimize the bounding box regression accuracy for non-rigid targets by incorporating geometric constraints and auxiliary scale factors. To verify the detection capability of the proposed method, we constructed a UAV Wheat Head Dataset (UWHD) and conducted extensive experiments on the UWHD, GWHD2021, and RFRB datasets. The experimental results demonstrate that ACF-YOLO outperforms other comparative methods, confirming its stable detection performance and contributing to the sustainable development of agriculture. Full article

(This article belongs to the Section Sustainable Agriculture)

18 pages, 39608 KB

Open AccessArticle

Denoising Domain Adversarial Network Based on Attention Mechanism for Motor Fault Diagnosis in Real Industrial Environment

by Linjie Jin, Zhengqing Liu, Dawei Gu, Baisong Pan, Qiucheng Wang and Mohammad Fard

Machines 2026, 14(5), 462; https://doi.org/10.3390/machines14050462 - 22 Apr 2026

Viewed by 182

Abstract

Acoustic signal-based fault diagnosis offers a promising non-contact approach for rotating machinery. However, its practical application is usually affected by environmental noise. This paper presented a Denoising Attention Domain Adversarial Network (DDAN) for the robust fault diagnosis of wheel hub motors under severe [...] Read more.

Acoustic signal-based fault diagnosis offers a promising non-contact approach for rotating machinery. However, its practical application is usually affected by environmental noise. This paper presented a Denoising Attention Domain Adversarial Network (DDAN) for the robust fault diagnosis of wheel hub motors under severe noise interference. The proposed framework consists of the following two core modules: a DenseNet-based denoising module that adaptively suppresses background noise while retaining critical fault features, and a Stacked Autoencoder Domain Adversarial Network (SADAN) that integrates channel attention, spatial attention, and multi-head self-attention (MHSA) for refined feature extraction and classification. Such a hierarchical attention mechanism facilitates effective local noise suppression and global dependency capture. Validation on a hub motor fault dataset and publicly available online dataset demonstrates that compared to existing methods, DDAN achieves superior diagnostic accuracy across various noise levels and signal-to-noise ratios, improving SNR from -15.97 dB to 1.24 dB, achieving 82.71% accuracy under low SNR condition, and reaching 84.93% and 83.75% accuracy in cross-domain generalization tests. Furthermore, the comparison of the diagnostic accuracy of audio signals from different acoustic acquisition devices further verifies the practicality and potential of the system in low-cost industrial deployment. Full article

(This article belongs to the Section Electrical Machines and Drives)

► Show Figures

Figure 1

32 pages, 1004 KB

Open AccessFeature PaperArticle

Macro–Market Fusion with Cross-Attention for Equity Return Prediction

by Janit Rajkarnikar, Sibin Joshi and Zhaoxian Zhou

Mathematics 2026, 14(8), 1361; https://doi.org/10.3390/math14081361 - 18 Apr 2026

Viewed by 153

Abstract

Macroeconomic conditions are widely believed to influence the direction of equity markets, yet most forecasting models either ignore macroeconomic information or incorporate it through a small set of ad hoc predictors. We propose XAttnFusion, a macro–market fusion architecture that jointly learns from high-frequency [...] Read more.

Macroeconomic conditions are widely believed to influence the direction of equity markets, yet most forecasting models either ignore macroeconomic information or incorporate it through a small set of ad hoc predictors. We propose XAttnFusion, a macro–market fusion architecture that jointly learns from high-frequency market data and lower-frequency macroeconomic time series for equity return prediction. The model comprises three branches: a 1D convolutional network that encodes 40-day market windows (price, volume, and technical indicators), a temporal convolutional network that encodes 24-month macro sequences, and a feedforward branch for volume-at-price structure features. These representations are integrated through multi-head cross-attention, in which the current market state queries the macro sequence to produce a fused representation for directional forecasting. We evaluate XAttnFusion on daily SPY returns from 2012 to 2024 using purged cross-validation with a 5-day embargo to prevent information leakage. To address potential look-ahead bias from macroeconomic publication lags, all macro inputs are lagged by two months. The model achieves a mean out-of-sample AUROC of

0.63 \pm 0.05

, representing a 27% improvement over random and an 8.1% improvement over the best concatenation baseline. In a fair comparison where each model is independently hyperparameter-tuned, cross-attention fusion improves AUROC by 0.047 over concatenation (

p = 0.031

, Wilcoxon signed-rank test). The model also generalizes to QQQ and IWM, where cross-attention consistently outperforms concatenation fusion. Crucially, the model’s discriminative ability is state-dependent, indicating that the value of macro–market fusion is itself conditioned on market structure. Permutation-based feature importance shows that macro and market branches contribute on a comparable scale (approximately 48% and 36%, respectively), so the gains come from jointly fusing two comparably weighted sources rather than from a single dominant input. Our results show that explicitly modeling macro–market interactions with interpretable attention improves predictive accuracy over naive fusion strategies and provides insight into the time-varying relevance of macroeconomic information in financial forecasting and equity market prediction. Full article

(This article belongs to the Section E5: Financial Mathematics)

► Show Figures

Figure 1

Search Results (1,543)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (1,543)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI