Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (22,506)

Search Parameters:
Keywords = convolution neural networks

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 4382 KB  
Article
Spatio-Temporal Joint Network for Coupler Anomaly Detection Under Complex Working Conditions Utilizing Multi-Source Sensors
by Zhirong Zhao, Zhentian Jiang, Qian Xiao, Long Zhang and Jinbo Wang
Sensors 2026, 26(9), 2661; https://doi.org/10.3390/s26092661 (registering DOI) - 24 Apr 2026
Abstract
Owing to the intricate mechanical coupling characteristics and the considerable difficulty in extracting synergistic spatio-temporal features from high-dimensional sensor data under fluctuating alternating loads, this study proposes a robust anomaly detection framework that combines Normalized Mutual Information (NMI) and Spatio-Temporal Graph Neural Networks [...] Read more.
Owing to the intricate mechanical coupling characteristics and the considerable difficulty in extracting synergistic spatio-temporal features from high-dimensional sensor data under fluctuating alternating loads, this study proposes a robust anomaly detection framework that combines Normalized Mutual Information (NMI) and Spatio-Temporal Graph Neural Networks (STGNN). First, NMI is utilized to quantify the nonlinear physical coupling intensity among multi-source sensors, thereby filtering out weakly correlated noise and reconstructing the spatial topological structure of the coupler system. Subsequently, a deep learning architecture incorporating Graph Convolutional Networks (GCN), Gated Recurrent Units (GRU), and one-dimensional convolutional residual connections is developed to capture the dynamic evolutionary characteristics of equipment states across both spatial interactions and temporal sequences. Finally, based on the model’s health-state predictions, a moving average algorithm is introduced to smooth the residual sequences, and an anomaly early-warning baseline is established in conjunction with the 3σ criterion. Experimental validation conducted using field service data from heavy-haul trains demonstrates that, compared to conventional serial CNN and Long Short-Term Memory (LSTM) models, the proposed method exhibits superior fitting performance and robustness against noise, effectively reducing the false alarm rate within normal working intervals. In a real-world case study, the method successfully identified variations in spatial linkage features induced by local damage and triggered timely alerts. Notably, the spatial alarm nodes were highly consistent with the fatigue crack initiation sites identified through on-site magnetic particle inspection. This study provides a viable data-driven analytical framework for the condition monitoring and anomaly identification of critical load-bearing components in heavy-haul trains. Full article
(This article belongs to the Special Issue Deep Learning Based Intelligent Fault Diagnosis)
32 pages, 18066 KB  
Article
Grapevine Winter Pruning Point Localization Using YOLO-Based Instance Segmentation
by Magdalena Kapłan and Kamil Buczyński
Agriculture 2026, 16(9), 943; https://doi.org/10.3390/agriculture16090943 - 24 Apr 2026
Abstract
Winter pruning is a key management practice in viticulture that directly affects vine architecture, yield balance, and grape quality. At the same time, it is a highly labor-intensive operation, and the selective identification of appropriate cutting locations remains one of the main challenges [...] Read more.
Winter pruning is a key management practice in viticulture that directly affects vine architecture, yield balance, and grape quality. At the same time, it is a highly labor-intensive operation, and the selective identification of appropriate cutting locations remains one of the main challenges limiting the automation of pruning in vineyards. Advances in machine vision provide new opportunities to support the development of robotic pruning systems. The objective of this study was to develop and evaluate a vision-based method for estimating grapevine pruning points and cutting lines using instance segmentation outputs generated by YOLO models. A dataset of 1500 RGB images of dormant grapevines was collected under field conditions in the Nobilis vineyard located in southeastern Poland. Two annotation strategies were implemented to define pruning regions. YOLO-based instance segmentation models were trained and evaluated for detecting cutting-related structures. Based on the predicted segmentation masks, a geometry-based method termed PCAcutSeg-V was developed to estimate class-dependent cutting points and cutting lines using principal component analysis applied to object contours. The results indicate that YOLOv8 and YOLO11 architectures achieved the highest segmentation performance among the evaluated models. The simplified annotation strategy provided more stable geometric inputs for the PCAcutSeg-V method, enabling more reliable estimation of cutting points and cutting lines compared with the extended annotation approach. When combined with the PCAcutSeg-V method, the proposed perception–geometry pipeline achieved high effectiveness in pruning decision estimation. The method was further implemented in a real-time processing pipeline using an RGB camera and an edge computing platform, where it maintained performance consistent with the results obtained from offline image analysis. These findings demonstrate that combining deep learning-based instance segmentation with deterministic geometric reasoning enables accurate and interpretable estimation of grapevine pruning locations and provides a promising foundation for future autonomous pruning systems. Full article
Show Figures

Figure 1

28 pages, 2658 KB  
Article
Analysis of Robustness and Interpretability of Multinomial Naïve Bayes and Tiny Text CNN Models for SMS Spam Detection Under Adversarial Attacks
by Murad A. Rassam and Redhwan Shaddad
Information 2026, 17(5), 408; https://doi.org/10.3390/info17050408 - 24 Apr 2026
Abstract
The growing complexity of unwanted messages, especially SMS spam, presents a serious challenge to the security of digital communication and user experience. While conventional spam detection models are useful on clean datasets, they are vulnerable to targeted attacks that aim to evade detection. [...] Read more.
The growing complexity of unwanted messages, especially SMS spam, presents a serious challenge to the security of digital communication and user experience. While conventional spam detection models are useful on clean datasets, they are vulnerable to targeted attacks that aim to evade detection. This study is motivated by the urgent need to evaluate the resilience of machine learning models against evolving threats in real-world applications. We specifically investigate the robustness and interpretability of a Multinomial Naive Bayes (MNB) model, representative of traditional machine learning, and a Tiny Text convolutional neural network (Tiny Text CNN), representative of deep learning models, for SMS spam detection. Using the UCI dataset under simulated adversarial text attacks, both models were tested against filler-word insertion and character-level perturbation attacks. Results show that while the Tiny Text CNN maintained higher overall robustness (accuracy: 0.9821 clean vs. 0.9758 under character attacks), both models experienced notable degradation in recall, with MNB being more susceptible to filler-word attacks. Interpretability analyses using LIME and gradient-based saliency maps indicated that adversarial perturbations alter feature importance, diminishing the influence of spam-indicative tokens. The findings underscore the trade-offs between model complexity and adversarial resilience, offering insights for developing more secure and interpretable spam detection systems. Full article
Show Figures

Figure 1

66 pages, 1148 KB  
Review
Explainability and Trust in Deep Learning for Cancer Imaging: Systematic Barriers, Clinical Misalignment, and a Translational Roadmap
by Surekha Borra, Nilanjan Dey, Simon Fong, R. Simon Sherratt and Fuqian Shi
Cancers 2026, 18(9), 1361; https://doi.org/10.3390/cancers18091361 - 24 Apr 2026
Abstract
Deep learning (DL) has transformed cancer imaging by enabling automated tumour detection, classification, and risk prediction. Despite impressive diagnostic performance, limited explainability and poor uncertainty calibration continue to restrict clinical integration. This review is guided by five research questions that examine the challenges, [...] Read more.
Deep learning (DL) has transformed cancer imaging by enabling automated tumour detection, classification, and risk prediction. Despite impressive diagnostic performance, limited explainability and poor uncertainty calibration continue to restrict clinical integration. This review is guided by five research questions that examine the challenges, impact, and translational implications of explainable artificial intelligence (XAI) in oncology imaging. We identify key barriers to trust, including dataset bias, shortcut learning, opacity of convolutional neural networks, and workflow misalignment. Evidence suggests that explainable models can increase clinician confidence, reduce false positives, and improve collaborative decision-making when explanations are faithful, semantically meaningful, and uncertainty aware. We evaluate architectural strategies that embed interpretability such as concept-bottleneck models, prototype-based learning, and attention regularization along with post hoc techniques. Beyond performance metrics, we examine how interpretable AI aligns with clinical reasoning processes and analyse regulatory, ethical, and medico-legal considerations influencing deployment. The findings indicate that explainability alone is insufficient, durable trust requires epistemic alignment, prospective validation, lifecycle governance, and equity-focused evaluation. By reframing explainability as a structural design principle rather than a supplementary feature, this review outlines a pathway toward accountable and clinically dependable AI systems in oncology. Full article
(This article belongs to the Section Cancer Informatics and Big Data)
Show Figures

Figure 1

18 pages, 1839 KB  
Article
A GNN-Based Log Anomaly Detection Framework with Prompt Learning for Edge Computing
by Xianlang Hu, Guangsheng Feng, Xinling Huang, Xiangying Kong and Hongwu Lv
Computers 2026, 15(5), 273; https://doi.org/10.3390/computers15050273 - 24 Apr 2026
Abstract
System logs have been critical for analyzing the operational status and abnormal behavior of highly distributed and heterogeneous edge computing nodes. In edge environments, logs exhibit cross-event and cross-field structural interactions, making it difficult to uncover potential anomaly patterns from isolated events. Moreover, [...] Read more.
System logs have been critical for analyzing the operational status and abnormal behavior of highly distributed and heterogeneous edge computing nodes. In edge environments, logs exhibit cross-event and cross-field structural interactions, making it difficult to uncover potential anomaly patterns from isolated events. Moreover, sparse annotations and varying log formats limit the effectiveness of existing methods. To address these challenges, we propose a graph neural network (GNN) anomaly detection framework with prompt learning. It leverages few-shot prompt learning to automatically extract key fields and constructs a weighted directed graph that jointly models semantic embeddings and temporal dependencies, fully representing the structural interactions and semantic associations across events and fields. Furthermore, the framework performs graph-level anomaly detection by jointly optimizing graph representation learning and classification objective within an enhanced one-class directed graph convolutional network, enabling effective identification of global structural anomaly patterns in log graphs. Experimental results demonstrate that the proposed method achieves an average F1-score of 93.3%, surpassing the current state-of-the-art (SOTA) methods by 6.93%. Full article
(This article belongs to the Special Issue Mobile Fog and Edge Computing)
25 pages, 3097 KB  
Article
Healthcare AI as Critical Digital Health Infrastructure: A Public Health Preparedness Framework for Systemic Risk
by Nikolay Lipskiy and Stephen V. Flowerday
Future Internet 2026, 18(5), 232; https://doi.org/10.3390/fi18050232 - 24 Apr 2026
Abstract
Healthcare artificial intelligence (AI) is moving from the laboratory into the infrastructure of care. As these systems become embedded in imaging, electronic health records, triage, and clinical decision support, their failures can affect not only individual encounters but also institutions and patient populations. [...] Read more.
Healthcare artificial intelligence (AI) is moving from the laboratory into the infrastructure of care. As these systems become embedded in imaging, electronic health records, triage, and clinical decision support, their failures can affect not only individual encounters but also institutions and patient populations. Yet governance still centers on model development, local validation, and one-time compliance, with limited attention to cross-site failure after deployment. This article examines how public health preparedness can help close that gap. It presents a conceptual analysis grounded in two cases: a pneumonia-screening convolutional neural network that learned institutional confounders rather than portable clinical signals, and a widely deployed sepsis prediction model whose external performance and alert burden fell short of developer claims. Together, these cases reveal five governance features of systemic healthcare AI risk: population-level exposure, cascade effects across shared infrastructures, unequal vulnerability, delayed recognition, and coordination needs beyond any single institution. In response, we propose a tripartite framework combining stronger pre-deployment assurance, post-deployment surveillance with escalation thresholds, and tertiary response through investigation, rollback, remediation, and cross-site learning. The argument is not that AI failures are epidemics, but that high-impact clinical AI systems now function as critical digital health infrastructure requiring preparedness alongside lifecycle oversight. Full article
(This article belongs to the Section Techno-Social Smart Systems)
Show Figures

Figure 1

16 pages, 4919 KB  
Article
EA-UNET: An Enhanced and Efficient Model for Left-Turn Lane
by Haowei Wang, Haixin Liu, Fei Wang, Xingbin Chen, Baogang Li and Jiang Liu
Sensors 2026, 26(9), 2642; https://doi.org/10.3390/s26092642 - 24 Apr 2026
Abstract
Left-turn lanes are critical elements of urban intersections. Accurate and efficient lane detection is essential for the safe navigation of autonomous vehicles. To address the limitations of existing semantic segmentation algorithms—specifically, inadequate detection accuracy, high computational cost, and vulnerability to environmental disturbances—we propose [...] Read more.
Left-turn lanes are critical elements of urban intersections. Accurate and efficient lane detection is essential for the safe navigation of autonomous vehicles. To address the limitations of existing semantic segmentation algorithms—specifically, inadequate detection accuracy, high computational cost, and vulnerability to environmental disturbances—we propose a lightweight deep convolutional neural network named EA-UNet. First, we replace the standard U-Net encoder with EfficientNet-B0 to enhance feature extraction efficiency. Second, we introduce a novel contextual coordination module, termed MP-ASPP, which integrates a Convolutional Block Attention Module (CBAM) to further refine attention mechanisms. Finally, a comprehensive real-world dataset was constructed by collecting videos and images of left-turn waiting areas during real-vehicle testing. Experimental results demonstrate that EA-UNet significantly outperforms the baseline U-Net and other state-of-the-art models, achieving accurate and efficient segmentation of left-turn lanes even in complex scenes. Full article
(This article belongs to the Section Vehicular Sensing)
28 pages, 33073 KB  
Article
Pedestrian Localization Using Smartphone LiDAR in Indoor Environments
by Jaehun Kim and Kwangjae Sung
Electronics 2026, 15(9), 1810; https://doi.org/10.3390/electronics15091810 - 24 Apr 2026
Abstract
Many place recognition approaches, which identify previously visited places or locations by matching current sensory data, such as 2D RGB images and 3D point clouds, have been proposed to achieve accurate and robust localization and loop closure detection in global positioning system (GPS)-denied [...] Read more.
Many place recognition approaches, which identify previously visited places or locations by matching current sensory data, such as 2D RGB images and 3D point clouds, have been proposed to achieve accurate and robust localization and loop closure detection in global positioning system (GPS)-denied environments. Since visual place recognition (VPR) methods that rely on images captured by camera sensors are highly sensitive to variations in appearance, including changes in lighting, surface color, and shadows, they can lead to poor place recognition accuracy. In contrast, light detection and ranging (LiDAR)-based place recognition (LPR) approaches based on 3D point cloud data that captures the shape and geometric structure of the environment are robust to changes in place appearance and can therefore provide more reliable place recognition results than VPR methods. This work presents an indoor LPR method called PointNetVLAD-based indoor pedestrian localization (PIPL). PIPL is a deep network model that uses PointNetVLAD to learn to extract global descriptors from 3D LiDAR point cloud data. PIPL can recognize places previously visited by a pedestrian using point clouds captured by a low-cost LiDAR sensor on a smartphone in small-scale indoor environments, while PointNetVLAD performs place recognition for vehicles using high-cost LiDAR, GPS, and inertial measurement unit (IMU) sensors in large-scale outdoor areas. For place recognition on 3D point cloud reference maps generated from LiDAR scans, PointNetVLAD exploits the universal transverse mercator (UTM) coordinate system based on GPS and IMU measurements, whereas PIPL uses a virtual coordinate system designed in this study due to the unavailability of GPS indoors. In experiments conducted in campus buildings, PIPL shows significant advantages over NetVLAD (known as a convolutional neural network (CNN)-based VPR method). Particularly in indoor environments with repetitive scenes where geometric structures are preserved and image-based appearance features are sparse or unclear, PIPL achieved 39% higher top-1 accuracy and 10% higher top-3 accuracy compared to NetVLAD. Furthermore, PIPL achieved place recognition accuracy comparable to NetVLAD even with a small number of points in a 3D point cloud and outperformed NetVLAD even with a smaller model training dataset. The experimental results also indicate that PIPL requires over 76% less place retrieval time than NetVLAD while maintaining robust place classification performance. Full article
(This article belongs to the Special Issue Advanced Indoor Localization Technologies: From Theory to Application)
20 pages, 4990 KB  
Article
Curvature Radius Measurement Based on Interferogram Analysis and Deep Learning Model
by Yan-Yi Li, Chuen-Lin Tien, Hsi-Fu Shih, Han-Yen Tu and Chih-Cheng Chen
Photonics 2026, 13(5), 416; https://doi.org/10.3390/photonics13050416 - 24 Apr 2026
Abstract
Accurate estimation of curvature radius from interference fringes is critical in optical metrology and precision manufacturing. Conventional interferogram analytical approaches often require manual intervention and are sensitive to fringe variations related to noise and environmental vibrations. To address these limitations, we combine an [...] Read more.
Accurate estimation of curvature radius from interference fringes is critical in optical metrology and precision manufacturing. Conventional interferogram analytical approaches often require manual intervention and are sensitive to fringe variations related to noise and environmental vibrations. To address these limitations, we combine an improved Twyman–Green interferometer with different artificial intelligence (AI) deep learning models and utilize a self-developed MATLAB analysis program to propose a non-destructive and rapid measurement system for optical coating substrates. The proposed AI-assisted Twyman–Green interferometric system differs fundamentally from conventional wavefront sensing techniques in both principle and implementation. This paper utilizes the Twyman–Green interferometer to generate interference fringe datasets on B270 glass and sapphire substrates, and employs convolutional neural network (CNN), ResNet-18, and VGG-16 models for training and evaluation. The proposed method integrates image enhancement, fringe pattern clustering, and analysis and validation based on fast Fourier transform (FFT). Experimental results show that ResNet-18 outperforms other models, with a mean absolute percentage error of 5.44% on sapphire substrates and 3.40% on B270 glass substrates. These findings highlight the effectiveness and robustness of deep learning models, especially residual networks, in automatic ROC prediction for optical measurement applications. Full article
Show Figures

Figure 1

26 pages, 4696 KB  
Article
Exploring Variable Influences on the Compressive Strength of Alkali-Activated Concrete Using Ensemble Tree, Deep Learning Methods and SHAP-Based Interpretation
by Musa Adamu, Mahmud M. Jibril, Abdurra’uf M. Gora, Yasser E. Ibrahim and Hani Alanazi
Eng 2026, 7(5), 192; https://doi.org/10.3390/eng7050192 - 24 Apr 2026
Abstract
Growing concerns about global climate change and its negative consequences for communities have put immense pressure on the building industry, which is one of the primary sources of greenhouse gas emissions. Due to the environmental issues associated with the manufacture of sustainable construction [...] Read more.
Growing concerns about global climate change and its negative consequences for communities have put immense pressure on the building industry, which is one of the primary sources of greenhouse gas emissions. Due to the environmental issues associated with the manufacture of sustainable construction materials, alkali-activated concrete (AAC) has emerged as a competitive alternative to cement. To predict the compressive strength (CS) of AAC, four machine learning (ML) models, namely, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost), were employed in this study using 193 data points. The input variables include Precursor “P” (kg/m3), Blast Furnace Slag “BFS ratio”, Sodium hydroxide “Na” (kg/m3), silicate modulus “Ms”, water content “W” (kg/m3), fine aggregate “FA” (kg/m3), coarse aggregate “A” (kg/m3), and curing time “CT” (day), with CS (MPa) as the output variable. The dataset was checked for stationarity and then normalized to decrease data redundancy and increase integrity. Furthermore, three model combinations were developed based on the relationship between the input and target variables. The XGB-M3 model outperformed all other models with a high degree of accuracy, according to the study’s findings. Specifically, the Pearson correlation coefficient (PCC) was 0.9577, and the mean absolute percentage error (MAPE) was 14.95% during the calibration phase. SHAP, an explainable AI approach that provides interpretable insights into complex AI systems by assigning feature importance to model predictions, was employed. Results suggest the higher predictions from the XGB-M3 and RF-M3 models were largely driven by curing time (CT). Full article
(This article belongs to the Special Issue Artificial Intelligence for Engineering Applications, 2nd Edition)
27 pages, 7794 KB  
Article
Demagnetization Severity Detection in Permanent Magnet Synchronous Motors Based on Temperature Signal and Convolutional Neural Network
by Zhiqiang Wang, Shihao Yan, Haodong Sun, Xin Gu, Zhichen Lin and Kefei Zhu
Sensors 2026, 26(9), 2631; https://doi.org/10.3390/s26092631 - 24 Apr 2026
Abstract
To address the difficulty of detecting demagnetization severity in permanent magnet synchronous motors (PMSMs), this paper proposes a demagnetization severity detection method based on temperature signal and Convolutional Neural Network (CNN). First, the differences between local demagnetization and eccentricity fault in stator current [...] Read more.
To address the difficulty of detecting demagnetization severity in permanent magnet synchronous motors (PMSMs), this paper proposes a demagnetization severity detection method based on temperature signal and Convolutional Neural Network (CNN). First, the differences between local demagnetization and eccentricity fault in stator current harmonics are analyzed from an electromagnetic perspective, and fast Fourier transform (FFT) is used for frequency-domain analysis of the stator current to identify local demagnetization faults. On this basis, an electromagnetic–thermal coupling model is established by considering motor losses and heat dissipation boundary conditions to obtain the winding temperatures under different demagnetization severities and operating conditions. Furthermore, the temperature time series, together with speed and load torque, is constructed into a three-dimensional state space, and the proposed Conditionally Modulated Multi-Scale Convolutional Neural Network (CMSCNN) is introduced for feature learning to achieve demagnetization severity detection. Experimental results show that the proposed method achieves an average detection accuracy of 98.06% on the simulation test set and outperforms the baseline CNN model. On measured data collected from the faulty prototype, the average detection accuracy reaches 93.34%, verifying the effectiveness of the proposed method for demagnetization severity detection. Full article
(This article belongs to the Special Issue Sensors for Fault Diagnosis of Electric Machines)
Show Figures

Figure 1

26 pages, 11449 KB  
Article
Signal Intelligence: Vibration-Driven Deep Learning for Anomaly Detection of Rotary-Wing UAVs
by Alican Yilmaz, Erkan Caner Ozkat and Fatih Gul
Drones 2026, 10(5), 321; https://doi.org/10.3390/drones10050321 - 24 Apr 2026
Abstract
Unmanned aerial vehicles (UAVs) operating in safety-critical missions require effective anomaly detection methods to identify propulsion-system faults before they cause catastrophic failures. However, current vibration-based diagnostic models typically rely on datasets representing only discrete, isolated fault states, and do not capture the continuous [...] Read more.
Unmanned aerial vehicles (UAVs) operating in safety-critical missions require effective anomaly detection methods to identify propulsion-system faults before they cause catastrophic failures. However, current vibration-based diagnostic models typically rely on datasets representing only discrete, isolated fault states, and do not capture the continuous structural degradation that occurs during real flight operations. To address this gap, this study proposes a severity-ordered vibration data augmentation framework for anomaly detection in rotary-wing UAV propulsion systems. Controlled experiments were conducted under healthy, tape-induced imbalance, scratch, and cut propeller conditions using stepped throttle excitation from 10% to 100% in 10% increments, with 40 s per level. A severity-ordered arrangement strategy based on throttle level and a robust peak-to-peak severity metric generated approximately 7.5 h of augmented vibration data per axis, representing a continuous degradation trajectory. Three-axis continuous wavelet transform (CWT) scalograms of size 48×96×3 were used to train an unsupervised anomaly detection framework. Comparative experiments with Isolation Forest, One-Class SVM, and LSTM–AE demonstrated that the proposed Convolutional Neural Network (CNN)–Bidirectional Gated Recurrent Unit (BiGRU)–State-Space Model (SSM)–Autoencoder (AE) architecture achieved the best performance, reaching 0.9959 precision, 0.4428 recall, 0.6131 F1-score, and 0.9284 Area Under the Receiver Operating Characteristic Curve (AUROC). The ablation study further showed that incorporating temporal modeling and state-space dynamics improves detection robustness compared with CNN–AE and CNN–BiGRU–AE baselines. These results show that combining severity-ordered augmentation with deep temporal learning improves progressive propulsion anomaly detection in UAV vibration monitoring. This work introduces a methodology that connects rotor dynamics principles with deep learning, providing a continuous degradation manifold that improves early-stage detection and condition monitoring of UAV propulsion systems. Full article
Show Figures

Figure 1

26 pages, 1490 KB  
Systematic Review
Object Detection in Optical Remote Sensing Images: A Systematic Review of Methods, Benchmarks, and Operational Applications
by Neus Fontanet Garcia and Piero Boccardo
Remote Sens. 2026, 18(9), 1289; https://doi.org/10.3390/rs18091289 - 23 Apr 2026
Abstract
Object detection in optical remote sensing imagery has emerged as a crucial task in computer vision, with applications ranging between environmental monitoring to disaster management, precision agriculture, and urban planning. This review systematically examines current methodologies, categorising them into four principal approaches: (1) [...] Read more.
Object detection in optical remote sensing imagery has emerged as a crucial task in computer vision, with applications ranging between environmental monitoring to disaster management, precision agriculture, and urban planning. This review systematically examines current methodologies, categorising them into four principal approaches: (1) template matching-based methods, which leverage predefined patterns for object identification; (2) knowledge-based methods, which incorporate geometric and contextual information to enhance detection accuracy; (3) object-based image analysis (OBIA), which segments images into meaningful objects using spectral and spatial properties; (4) machine learning-based methods, particularly deep convolutional neural networks (CNNs), which have revolutionised the field through automatic feature learning. Each methodology’s performance characteristics, computational requirements, and suitability for different remote sensing applications are analysed. Our systematic review, following PRISMA guidelines, analysed 189 studies published from 2010 to 2025, of which 73 provided quantitative results on standard benchmarks. The three most critical challenges identified are as follows: (1) annotation bottleneck, as dense bounding box labelling of remote sensing imagery remains highly labour-intensive for deep learning approaches, (2) extreme scale variation spanning 2–3 orders of magnitude within single scenes, and (3) domain adaptation failures when models encounter new geographic regions or sensor characteristics. This review identifies critical research gaps and proposes prioritised future directions, emphasising foundation models for zero-shot detection, efficient architectures for resource-constrained deployment, and standardised benchmarks with size-specific metrics. The analysis provides practitioners with evidence-based decision frameworks for method selection and researchers with a roadmap for advancing object detection in remote sensing applications. Full article
23 pages, 2091 KB  
Article
A Photovoltaic Power Prediction Method Based on Wavelet Convolutional Neural Networks and Improved Transformer
by Yibo Zhou, Zihang Liu, Zhen Cheng, Hanglin Mi, Zhaoyang Qin and Kangyangyong Cao
Energies 2026, 19(9), 2040; https://doi.org/10.3390/en19092040 - 23 Apr 2026
Abstract
The output power of photovoltaic (PV) systems is influenced by various environmental factors, exhibiting strong nonlinearity and non-stationarity, which poses significant challenges for accurate forecasting. To address these issues, this paper proposes a short-term PV power forecasting method based on wavelet convolutional neural [...] Read more.
The output power of photovoltaic (PV) systems is influenced by various environmental factors, exhibiting strong nonlinearity and non-stationarity, which poses significant challenges for accurate forecasting. To address these issues, this paper proposes a short-term PV power forecasting method based on wavelet convolutional neural networks and an improved Transformer. First, the Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) is employed to decompose the original PV power sequence into several intrinsic mode functions (IMFs). Fuzzy entropy is then utilized to evaluate the complexity of each component, and subsequences with similar entropy values are reconstructed to reduce the non-stationarity of the original series. Subsequently, Pearson correlation coefficients and the maximal information coefficient (MIC) are applied to capture both linear and nonlinear relationships between each reconstructed component and meteorological features, enabling the selection of strongly correlated variables. On this basis, a wavelet convolutional network (WTConv) is introduced to perform multi-scale decomposition and frequency-band feature extraction on the reconstructed components by integrating wavelet transform with convolution operations, effectively expanding the receptive field and extracting deep-seated features of the sequences. Finally, an improved iTransformer model is adopted for time-series modeling, leveraging its inverted encoding structure and self-attention mechanism to fully capture long-term dependencies among multivariate variables. The proposed model is validated using actual power data from a PV plant in Ningxia, China, across four seasons. Comprehensive experiments, including ablation studies, comparative analyses, loss function convergence evaluation, and Diebold–Mariano significance tests, are conducted to thoroughly assess the model’s effectiveness and superiority. Experimental results demonstrate that the proposed model achieves excellent prediction accuracy and stability in spring, summer, autumn, and winter, showing strong potential for engineering applications. Full article
(This article belongs to the Section A2: Solar Energy and Photovoltaic Systems)
Show Figures

Figure 1

15 pages, 4945 KB  
Article
Evaluation of Deep Learning Models for Image-Based Classification of Timber Logs by Market Value
by Matevž Triplat, Žiga Lukančič and Vasja Kavčič
Forests 2026, 17(5), 518; https://doi.org/10.3390/f17050518 (registering DOI) - 23 Apr 2026
Abstract
The identification of standing tree species, timber logs, and on-site assessment of their quality and value using images holds significant potential for forestry applications, including inventory management, traceability under EU regulations like the Deforestation Regulation, and market valuation amid growing demands for sustainable [...] Read more.
The identification of standing tree species, timber logs, and on-site assessment of their quality and value using images holds significant potential for forestry applications, including inventory management, traceability under EU regulations like the Deforestation Regulation, and market valuation amid growing demands for sustainable practices. This study addresses this by classifying images of timber logs by tree species and market value using the Orange data mining software, which leverages pre-trained convolutional neural networks (Inception v3 and SqueezeNet) to generate embeddings from a dataset of 5549 images collected at a real timber auction in Slovenia, followed by logistic regression image classification. Results show high accuracy for tree species classification (up to 92.6%), but substantially lower accuracy for market value classification (40%–55%), reflecting the greater complexity of value determination from visual features. These findings underscore the promise of deep learning for species identification while indicating the need for further methodological advancements to enhance value classification reliability, which offers the practical impact for operational forestry and bioeconomy value chains. Full article
(This article belongs to the Special Issue Sustainable Forest Operations: Technology, Management, and Challenges)
Show Figures

Figure 1

Back to TopTop