Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (4,096)

Search Parameters:
Keywords = recurrent neural network

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
34 pages, 22401 KB  
Article
Sensor-Driven Short-Term Forecasting on the Metropolitan LA Traffic Dataset: A Comparative Study for Multi-Step Prediction
by Bowen Dong, Xinyu Zhang, Weiyan Zhu, Lingmin Hou, Chaoya Yan, Yifan Feng and Lixing Lin
Sensors 2026, 26(12), 3917; https://doi.org/10.3390/s26123917 (registering DOI) - 20 Jun 2026
Abstract
Short-term traffic forecasting is a critical component of intelligent transportation systems. While deep learning architectures for this task have proliferated rapidly, the sensor-level data characteristics—zero-value prevalence, distributional heterogeneity, and cross-sensor correlation structure—that drive architecture-specific failure modes remain insufficiently understood, and their implications for [...] Read more.
Short-term traffic forecasting is a critical component of intelligent transportation systems. While deep learning architectures for this task have proliferated rapidly, the sensor-level data characteristics—zero-value prevalence, distributional heterogeneity, and cross-sensor correlation structure—that drive architecture-specific failure modes remain insufficiently understood, and their implications for evidence-based model selection in real deployments have not been systematically addressed. This study addresses that question through a sensor-network diagnostic framework applied to the METR-LA dataset (Metropolitan Los Angeles; 207 inductive loop detectors, 5-min resolution). The framework integrates systematic characterization of sensor data properties, a controlled benchmark of four representative architectures—Transformer, Spatio-Temporal Graph Convolutional Network (STGCN), Diffusion Convolutional Recurrent Neural Network (DCRNN), and Gated Temporal Convolutional Network (Gated TCN)—under a unified 12→3 prediction setting, and a novel per-sensor regression analysis that quantitatively links zero-value ratios to model-specific prediction errors across all 207 sensors. Building on these findings, this study further proposes Graph-Enhanced Transformer (GETFormer), a lightweight hybrid architecture that augments the Transformer with a single-hop Graph Convolutional Network (GCN) layer and a gated residual fusion module. The diagnostic findings and condition-dependent model-selection guidelines provide an empirically grounded foundation for principled hybrid architecture development in urban traffic sensing. Full article
21 pages, 497 KB  
Article
Unsupervised Anomaly Detection Framework for Multimodal Data in Industrial Control Systems
by Yunsung Kim, Gyeongdeok An, Kihyun Kim and Jaecheol Ha
Sensors 2026, 26(12), 3914; https://doi.org/10.3390/s26123914 (registering DOI) - 20 Jun 2026
Abstract
Industrial control systems (ICSs) are cyber–physical environments in which physical process data and network communication data are generated simultaneously. Existing studies have mainly focused on either sensor-based or network-based anomaly detection, making it difficult to capture diverse attack indicators and motivating the use [...] Read more.
Industrial control systems (ICSs) are cyber–physical environments in which physical process data and network communication data are generated simultaneously. Existing studies have mainly focused on either sensor-based or network-based anomaly detection, making it difficult to capture diverse attack indicators and motivating the use of multimodal methods that can leverage complementary information from both modalities. In this paper, we propose an unsupervised multimodal anomaly detection framework for ICSs that jointly uses sensor and network modalities. For each modality, autoencoder-based single-modality models are trained in an unsupervised manner, and their anomaly scores and latent feature vectors are extracted. These outputs are temporally aligned to construct a time-aligned multimodal table, which is then used to implement and compare two fusion strategies: anomaly score fusion and latent feature fusion. In latent feature fusion, aligned modality-specific latent features are combined with canonical correlation analysis (CCA)-derived cross-modal correlation features. The experimental results showed that latent feature fusion achieved stable performance across multiple sensor–network encoder combinations. In particular, the gated recurrent unit–convolutional neural network (GRU–CNN) combination achieved the best F1-score of 0.9166 and ROC-AUC of 0.9795. In addition, the complementarity analysis showed that latent feature fusion recovered some missed detections by integrating complementary sensor and network evidence. These results demonstrate that latent feature fusion is an effective multimodal strategy for ICS anomaly detection. Full article
(This article belongs to the Collection Cryptography and Security in IoT and Sensor Networks)
Show Figures

Figure 1

26 pages, 3229 KB  
Review
Artificial Intelligence Algorithms in Tunnel Construction Risk Management: A Review of Research Trends, Application Scenarios and Bottlenecks
by Junqian Zhang, Jianling Huang, Xiaodong Hu, Qing’e Wang, Huihua Chen and Zhenxu Guo
Buildings 2026, 16(12), 2446; https://doi.org/10.3390/buildings16122446 (registering DOI) - 20 Jun 2026
Abstract
As tunnel engineering continues to advance toward deeper, longer, and more complex projects, the risks encountered during the construction phase have evolved into a combination of various disaster types and the accumulation of multiple contributing factors. Traditional empirical and semi-empirical risk management methods [...] Read more.
As tunnel engineering continues to advance toward deeper, longer, and more complex projects, the risks encountered during the construction phase have evolved into a combination of various disaster types and the accumulation of multiple contributing factors. Traditional empirical and semi-empirical risk management methods are increasingly revealing shortcomings in terms of timeliness, accuracy, and the ability to process multi-source data. In recent years, driven by advancements in computing power and sensor technology, artificial intelligence algorithms (AI algorithms) such as machine learning and deep learning have been rapidly adopted in tunnel construction risk management. This paper retrieved relevant literature from the Web of Science database covering the period from 2010 to 2025. After rigorous screening, 96 highly relevant papers were selected for bibliometric analysis. This paper systematically reviews research progress from two perspectives: algorithmic models and engineering applications. The review indicates that, in terms of algorithmic models, traditional machine learning, convolutional neural network, recurrent neural network, generative adversarial network, Transformer, and graph neural network constitute a multi-level technical framework encompassing feature representation, risk perception, and intelligent decision-making. In terms of applications, AI algorithms have been widely integrated into typical scenarios such as geological hazard identification and prediction, surrounding rock stability and deformation prediction, rock burst assessment and early warning, lining defect detection and structural safety assessment, construction-induced ground settlement prediction, and tunnel gas and fire hazard prediction, significantly enhancing risk identification and early warning capabilities. However, several challenges remain, including the scarcity of high-quality datasets, the prevalence of noisy, incomplete, and heterogeneous monitoring data, insufficient coupling between model interpretability and engineering mechanisms, limited cross-project transferability, and the lack of integrated management systems for multi-hazard lifecycle control. Based on this, this paper proposes future research directions in areas such as data infrastructure development, integration of mechanism constraints, and multi-hazard collaborative modeling, aiming to provide guidance for the further development of intelligent risk management in tunnel construction. Full article
(This article belongs to the Section Construction Management, and Computers & Digitization)
Show Figures

Figure 1

32 pages, 3092 KB  
Review
A Review on Deep State Space Models for Sequential Healthcare Data Prediction
by Wenjie Li, Yongming Xie and Yinglong Dai
Mathematics 2026, 14(12), 2210; https://doi.org/10.3390/math14122210 (registering DOI) - 19 Jun 2026
Abstract
Sequential data prediction is a crucial area in healthcare. Healthcare data have the characteristics of non-stationarity, long-range dependence (LRD), and irregular sampling. Modeling these complex temporal features is highly challenging. Recurrent Neural Networks (RNNs) and their variants are limited in learning long-range dependencies [...] Read more.
Sequential data prediction is a crucial area in healthcare. Healthcare data have the characteristics of non-stationarity, long-range dependence (LRD), and irregular sampling. Modeling these complex temporal features is highly challenging. Recurrent Neural Networks (RNNs) and their variants are limited in learning long-range dependencies (LRDs) due to the inherent issues of vanishing and exploding gradients. Transformers alleviate this limitation by using the self-attention mechanism. Its quadratic computational complexity and memory bottleneck limit its scalability in long-range healthcare data. In this context, Structured State Space Models (SSMs) have emerged as a promising alternative. Compared with conventional RNNs, they can alleviate the difficulty of modeling LRDs more efficiently, and many modern SSM variants achieve linear time sequence modeling while reducing the computational burden associated with Transformers. In this review, we provide a formal definition of Healthcare Process Modeling, compare the core theoretical frameworks of RNNs, Transformers, and SSMs, trace the architectural evolution of SSM architectures, and provide a comprehensive review of healthcare applications and open challenges, including LSSL, S4, S5, Mamba, and their related variants. Existing studies suggest that structured SSMs are promising for selected long-sequence healthcare prediction tasks, particularly when computational efficiency and long-context retention are important. With these advantages, they may help alleviate the computational burden in certain healthcare tasks and provide a basis for further exploring the practical application of data-driven healthcare systems in clinical practice. Full article
24 pages, 15691 KB  
Article
A Joint Fault Diagnosis and Severity Prediction Framework for Rolling Bearings Using PPCA-EMD and 1DCNN-BiGRU
by Wangshen Hao, Chunhui Zhu, Dongliang Zou, Chenyang Li, Shenglin Song and Shilong Zhang
Machines 2026, 14(6), 701; https://doi.org/10.3390/machines14060701 (registering DOI) - 18 Jun 2026
Abstract
Rolling bearing fault diagnosis remains challenging due to environmental noise, insufficient information sharing between diagnosis and prediction tasks, and poor model generalization ability. To address these issues, this paper proposes a fault diagnosis and severity prediction method integrating probabilistic principal component analysis (PPCA) [...] Read more.
Rolling bearing fault diagnosis remains challenging due to environmental noise, insufficient information sharing between diagnosis and prediction tasks, and poor model generalization ability. To address these issues, this paper proposes a fault diagnosis and severity prediction method integrating probabilistic principal component analysis (PPCA) and empirical mode decomposition (EMD) with a one-dimensional convolutional neural network (1DCNN) and bidirectional gated recurrent unit (BiGRU). The proposed model consists of two parallel branches for fault diagnosis and fault severity prediction. A self-attention mechanism is integrated into both branches to enhance feature extraction via adaptive feature weighting. In addition, parameter sharing and weighted loss functions are adopted to improve the training efficiency and collaborative learning between the two tasks. PPCA and EMD are employed for signal denoising and reconstruction while preserving fault-related features. Experiments on public datasets and industrial production-line data show that the proposed method improves the fault classification accuracy from 92.43% to 99.71% under different load conditions, while achieving 98.99% accuracy in fault severity prediction. Noise interference tests further demonstrate the effectiveness of the model. A production-line case study further illustrates the feasibility of applying the proposed method to real monitoring signals. These results confirm the effectiveness and practical potential of the proposed method for rolling bearing fault diagnosis and health assessment. Full article
Show Figures

Figure 1

33 pages, 2087 KB  
Article
DEP-TFDualNet: A Dual-Domain Attention Framework with Temporal–Frequency Fusion for Depression Recognition Using Three-Channel Frontal EEG
by Haijun Lin, Jiayi Liu and Dongxu Jiang
Sensors 2026, 26(12), 3861; https://doi.org/10.3390/s26123861 - 17 Jun 2026
Viewed by 57
Abstract
Early depression screening is important for timely intervention, and electroencephalography (EEG) offers an objective and potentially portable sensing modality for computer-aided assessment. However, recognition from fixed three-channel frontal EEG remains difficult because of limited spatial information and incomplete modeling of temporal–frequency characteristics and [...] Read more.
Early depression screening is important for timely intervention, and electroencephalography (EEG) offers an objective and potentially portable sensing modality for computer-aided assessment. However, recognition from fixed three-channel frontal EEG remains difficult because of limited spatial information and incomplete modeling of temporal–frequency characteristics and temporal dependencies. This study proposes DEP-TFDualNet for acquisition-constrained frontal resting-state EEG. The framework integrates multi-scale convolution, dual-domain channel attention, temporal modeling derived from the independent recurrent neural network (IndRNN) architecture, and decision-stage fusion of deep representations with low-order statistical descriptors through a Kolmogorov–Arnold Network (KAN)-based nonlinear projection layer. Experiments were conducted on the publicly available three-channel frontal EEG subset of the MODMA dataset. After additional quality control, 48 subjects were retained (22 patients with major depressive disorder, 26 healthy controls). Under subject-wise stratified five-fold cross-validation, DEP-TFDualNet achieved 85.42% accuracy, 85.26% macro-F1, 81.82% sensitivity, 88.46% specificity, an AUC of 0.82, and a Brier score of 0.121. It achieved the best threshold-based subject-level performance and the lowest Brier score among the evaluated models. These results provide preliminary evidence that simplified frontal EEG sensing may support depression recognition in acquisition-constrained settings, although larger and external validation is still required. Full article
34 pages, 1521 KB  
Review
Learning Rare Events: Deep Learning Approaches to Extreme Price Prediction
by Mark Sinclair, Andrew J. Shepley and Farshid Hajati
Forecasting 2026, 8(3), 52; https://doi.org/10.3390/forecast8030052 - 17 Jun 2026
Viewed by 38
Abstract
Price spikes are rare but economically significant events observed across electricity, financial, commodity, and cryptocurrency markets. Their abrupt magnitude, heavy-tailed distributions, and severe class imbalance make them difficult to forecast using conventional time-series methods. This systematic literature review, conducted in accordance with the [...] Read more.
Price spikes are rare but economically significant events observed across electricity, financial, commodity, and cryptocurrency markets. Their abrupt magnitude, heavy-tailed distributions, and severe class imbalance make them difficult to forecast using conventional time-series methods. This systematic literature review, conducted in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) framework, synthesises recent deep learning approaches to forward-looking price-spike prediction and classification. Searches of Scopus, Web of Science, and IEEE Xplore identified studies published between 2020 and 2026. Following screening and full-text eligibility assessment of approximately 300 studies, only 20 met the inclusion criteria and were included in the final synthesis, comprising 19 peer-reviewed papers and one doctoral thesis. The review develops a structured taxonomy spanning spike definitions, task formulations, model architectures, input design, and evaluation practices. A central finding is that predictive performance is driven more by problem formulation, label construction, and evaluation design than by model architecture. While architectures have diversified to include recurrent networks, transformers, graph neural networks, and hybrid frameworks, improvements are often attributable to differences in how the prediction problem is defined rather than the models themselves. Key limitations stem from inconsistent spike definitions and insufficient treatment of class imbalance, leading to a misalignment between modelling objectives and evaluation practices, further exacerbated by the absence of standardised benchmarks. These issues hinder comparability and can lead to overstated model performance by masking poor detection of rare but economically critical spike events. The review therefore identifies clear directions for future research, including standardised spike labelling, adoption of rare-event-appropriate evaluation frameworks, and problem formulations that explicitly target extreme-event prediction. Full article
Show Figures

Figure 1

23 pages, 2110 KB  
Article
A Lightweight LCGRU–Wave-SkipConvNet Framework for Speech–Noise Separation in Urban Acoustic Environments and Performing-Arts Spaces Toward Sustainable and Equitable Acoustic Communication
by Baoli Zhang, Yanping Lu, Dandan Wang and Hongyan Liu
Sustainability 2026, 18(12), 6242; https://doi.org/10.3390/su18126242 - 17 Jun 2026
Viewed by 112
Abstract
Urban acoustic environments and performing-arts spaces strongly influence speech communication quality, acoustic comfort, and public wellbeing, particularly in noise-exposed shared environments such as transport hubs, campuses, healthcare spaces, public service facilities, music-education settings, and rehearsal or performance-related spaces. To address speech–noise separation in [...] Read more.
Urban acoustic environments and performing-arts spaces strongly influence speech communication quality, acoustic comfort, and public wellbeing, particularly in noise-exposed shared environments such as transport hubs, campuses, healthcare spaces, public service facilities, music-education settings, and rehearsal or performance-related spaces. To address speech–noise separation in low signal-to-noise ratio and acoustically complex scenarios, this study proposes a lightweight two-stage deep learning framework termed LCGRU–Wave-SkipConvNet. In the preprocessing stage, a Lightweight Convolutional Gated Recurrent Unit (LCGRU) model is employed to achieve preliminary separation of target speech and background noise by capturing both spatial and temporal acoustic features. In the post-processing stage, a Wave-SkipConvNet model is introduced to further suppress residual noise and enhance speech quality. Experimental results demonstrate that the proposed framework achieves superior performance under different signal-to-noise ratios, sound-source angles, and target angle errors. For example, in the preprocessing stage, the LCGRU model achieved a perceptual evaluation of speech quality (PESQ) score of 2.64 at source angles between 0° and 30°, outperforming the convolutional neural network-long short-term memory (CNN-LSTM) model by 1.17. In the post-processing stage, the Wave-SkipConvNet model achieved higher short-time objective intelligibility (STOI) and segmental signal-to-noise ratio (segSNR) values than the comparison models under different SNR conditions. The proposed framework provides an effective and deployment-oriented AI solution for improving speech accessibility and acoustic comfort in urban acoustic environments and performing-arts spaces. Beyond speech enhancement, it offers practical potential for supporting healthier, more inclusive, and more equitable acoustic environments in noise-sensitive public and educational spaces. It should be noted that this study focuses on the objective acoustic environment and signal-level speech enhancement, rather than subjective soundscape perception, musical perception, or human perceptual evaluation. Full article
Show Figures

Figure 1

18 pages, 12615 KB  
Article
Deep-Learning-Based Baseline Evaluation of Public WiFi CSI Datasets for Contactless RF-Based Human Activity Recognition
by Tayyaba Parveen, Rehan Khan, Umer Saeed and Insoo Koo
Sensors 2026, 26(12), 3821; https://doi.org/10.3390/s26123821 - 16 Jun 2026
Viewed by 161
Abstract
WiFi channel state information (CSI) has become a compelling sensing modality for contactless human activity recognition. However, differences in datasets, preprocessing protocols and model configurations make consistent comparison and reproducibility challenging. This study presents a unified baseline evaluation of four widely adopted deep [...] Read more.
WiFi channel state information (CSI) has become a compelling sensing modality for contactless human activity recognition. However, differences in datasets, preprocessing protocols and model configurations make consistent comparison and reproducibility challenging. This study presents a unified baseline evaluation of four widely adopted deep learning architectures: multilayer perceptron (MLP), convolutional neural network (CNN), gated recurrent unit (GRU) and a hybrid CNN–GRU model across multiple publicly available CSI datasets encompassing a range of sensing tasks. We harmonize the datasets, implement a standardized preprocessing and training pipeline to reduce experimental inconsistencies and support controlled within-dataset comparisons of model behavior. Evaluations include single-person activity recognition, fall-risk estimation, multiperson occupancy classification and localization-aware activity recognition, representing progressively higher temporal and spatial complexity. Our results show dataset-dependent trends: CNNs provide an efficient accuracy–complexity trade-off in several structured activity scenarios, whereas GRUs are advantageous when temporal dynamics are more prominent, although with greater training and inference costs. In contrast, MLPs generally underperform due to limited capacity to capture spatial and temporal dependencies. Confusion matrix analysis reveals that dynamic behaviors and low-motion states remain challenging to distinguish, underscoring the importance of temporal modeling. By releasing the complete experimental pipeline and benchmarking results, this work establishes a reproducible reference framework for the research community and highlights directions for future investigation, including cross-dataset generalization, hybrid model design and lightweight deployment strategies. Full article
(This article belongs to the Section State-of-the-Art Sensors Technologies)
Show Figures

Graphical abstract

17 pages, 4574 KB  
Article
Fault Diagnosis Method for Rotating Machinery Based on Threshold-Free Recurrence Distance Visualization Convolutional Neural Network
by Chao Song, Fuzhou Feng, Feng Liu, Ziyu Liu and Hao Hu
Sensors 2026, 26(12), 3815; https://doi.org/10.3390/s26123815 - 16 Jun 2026
Viewed by 215
Abstract
Recursive Plots (RPs) can fully utilize the information of signals on a time scale, but their application involves the issue of manual threshold selection, and different thresholds have a significant impact on the analysis results of recursive plots, which in turn affects the [...] Read more.
Recursive Plots (RPs) can fully utilize the information of signals on a time scale, but their application involves the issue of manual threshold selection, and different thresholds have a significant impact on the analysis results of recursive plots, which in turn affects the accuracy of subsequent fault diagnosis models. Some scholars have proposed the no-threshold recursive plot method to address the above issues, but this method is not comprehensive enough and has limitations. On the basis of RPs, this article proposes a Threshold-Free Recurrence Distance (TFRD), which is combined with a Convolutional Neural Network (CNN) to form a TFRD-CNN rotating machinery fault diagnosis model. The accuracy of the method is tested using bearing vibration data from Western Reserve University, and the effectiveness of the model is verified using a planetary gearbox gear fault dataset. At the same time, the TFRD-CNN method is compared with a Markov Transition Field (MTF), Gramian Angular Fields (GAF), and RP and URP combined with CNN methods. The results show that the TFRD-CNN method has significant advantages. Full article
(This article belongs to the Section Fault Diagnosis & Sensors)
Show Figures

Figure 1

21 pages, 2658 KB  
Article
CNN-Based Acoustic Gait Recognition: A Benchmarking Framework
by Ilaisaane Tilisa Fonua and Shahram Latifi
Electronics 2026, 15(12), 2658; https://doi.org/10.3390/electronics15122658 - 16 Jun 2026
Viewed by 289
Abstract
Acoustic gait recognition is an emerging passive biometric modality that identifies individuals by unique walking sound patterns. This work presents a reproducible benchmarking framework for convolutional neural network (CNN)-based acoustic gait recognition, providing a systematic evaluation methodology across varying identity pool sizes. Raw [...] Read more.
Acoustic gait recognition is an emerging passive biometric modality that identifies individuals by unique walking sound patterns. This work presents a reproducible benchmarking framework for convolutional neural network (CNN)-based acoustic gait recognition, providing a systematic evaluation methodology across varying identity pool sizes. Raw footstep recordings from the AFPILD dataset were converted into 128-bin mel-spectrograms and used to train a compact CNN across identity pool sizes from 10 to 40 subjects. To ensure statistical reliability, a three-times-repeated five-fold stratified cross-validation protocol was implemented. Experimental results demonstrate strong discriminative capability, with validation accuracy reaching 94.92% and Equal Error Rate (EER) of 1.31% for the 40-subject configuration. A multi-seed subset validation experiment across five independent random subject draws per pool size confirmed that the observed scaling trend is consistent across subset compositions rather than an artifact of a single subject selection. Additional analysis confirmed the framework’s resilience to moderate environmental noise and its superiority over classical Mel-Frequency Cepstral Coefficients paired with a Support Vector Machine (MFCC-SVM) and Convolutional Recurrent Neural Network (CRNN) baselines, supporting the feasibility of acoustic gait recognition as a passive biometric modality. Full article
Show Figures

Figure 1

19 pages, 2236 KB  
Article
GRU-Based Online PID Gain Scheduling Enhanced by High-Quality Dataset Construction
by Xinhao Zhao, Zhaopeng Dong, Tao Zhu and Jiayi Zhu
Appl. Sci. 2026, 16(12), 6032; https://doi.org/10.3390/app16126032 - 15 Jun 2026
Viewed by 85
Abstract
To address the limited adaptability of fixed-parameter PID controllers under dynamically varying reference signals and the strong dependence of data-driven PID methods on training-data quality, this paper proposes a GRU-based online PID gain-scheduling framework supported by high-quality dataset construction. Diverse reference excitations are [...] Read more.
To address the limited adaptability of fixed-parameter PID controllers under dynamically varying reference signals and the strong dependence of data-driven PID methods on training-data quality, this paper proposes a GRU-based online PID gain-scheduling framework supported by high-quality dataset construction. Diverse reference excitations are first designed, and sequential quadratic programming (SQP) is used as an expert label generator to produce trajectory-level PID gain labels. A region-of-interest (ROI)-based dynamic sample selection strategy is then introduced to retain informative transient samples and reduce the dominance of redundant steady-state data. The gated recurrent unit (GRU) network learns a temporal mapping from error-state sequences to PID gains and is deployed online with closed-loop safeguards, including filtered derivative information, gain denormalization, smoothing, and actuator constraints. In a representative nominal neural-controller benchmark, GRU-PID achieves a rise time of 0.59 s, a settling time of 0.97 s, ISE = 2.10, ITAE = 39.35, and TV = 394.48, showing a favourable balance between tracking accuracy and control-signal smoothness. Five-seed tests further indicate that GRU-PID provides stable nominal performance comparable to competitive neural schedulers, while simulation-based robustness evaluations suggest lower tracking errors than the tested neural baselines under measurement noise, step disturbance, actuator saturation, and combined uncertainty scenarios within the considered benchmark setting. Full article
Show Figures

Figure 1

33 pages, 6006 KB  
Article
Deep Learning-Enhanced Dielectric Sensing for Rapid Quality Assessment of ‘Starks Gold’ Sweet Cherries
by Erhan Kavuncuoglu, Kamil Sacilik, Mehmet Akif Buzpinar, Burak Ozbey, Necati Cetin and Fernando Auat Cheein
Agronomy 2026, 16(12), 1161; https://doi.org/10.3390/agronomy16121161 - 13 Jun 2026
Viewed by 237
Abstract
Soluble solids content (SSC) is one of the most important indicators of sweetness, ripeness, and market quality in sweet cherries. However, conventional SSC determination is destructive, labor-intensive, and unsuitable for rapid or large-scale quality assessment. Therefore, there is a need for fast, non-destructive, [...] Read more.
Soluble solids content (SSC) is one of the most important indicators of sweetness, ripeness, and market quality in sweet cherries. However, conventional SSC determination is destructive, labor-intensive, and unsuitable for rapid or large-scale quality assessment. Therefore, there is a need for fast, non-destructive, and data-driven sensing approaches that can estimate internal fruit quality without damaging the sample. This study aimed to develop a non-destructive approach for SSC prediction in sweet cherries by combining open-ended coaxial probe dielectric spectroscopy with deep learning models. An open-ended coaxial probe measurement system was designed and developed to determine the dielectric properties of sweet cherries and was coupled with an Agilent E4991A impedance analyzer operating over a frequency range of 5–3005 MHz. A total of 10,080 dielectric measurements and 2100 reference SSC measurements were collected over 26 experimental days. The dielectric constant (ε′), loss factor (ε″), and loss tangent (tan δ) were extracted and used to construct separate ε′, ε″, tan δ, and integrated combined datasets. Six deep learning architectures, namely convolutional neural network (CNN), long short-term memory (LSTM), bidirectional long short-term memory (BiLSTM), gated recurrent unit (GRU), CNN-LSTM, and convolutional long short-term memory (ConvLSTM), were trained and optimized using Bayesian optimization and early stopping. CNN achieved the best performance on the tan δ dataset (test R2 = 0.9099, RMSE = 0.8354 °Brix, MAE = 0.6599 °Brix), whereas GRU yielded the highest accuracy on the integrated combined dataset (test R2 = 0.8622, RMSE = 1.0331 °Brix, MAE = 0.7958 °Brix). ConvLSTM provided the most consistent performance across all four datasets (test R2 = 0.8081–0.8651), demonstrating strong predictive capability and practical computational efficiency. These findings confirm the potential of reduced-range dielectric spectroscopy combined with deep learning for rapid, non-destructive SSC assessment in sweet cherries. Full article
(This article belongs to the Special Issue Smart Farming: Advancing Techniques for High-Value Crops)
Show Figures

Figure 1

31 pages, 861 KB  
Systematic Review
Artificial Intelligence and Remote Sensing for Inland Surface Water Quality Monitoring: A Systematic Literature Review of Tools, Methods, Challenges, and Future Directions
by Cristiano Capellani Quaresma, Orandi Mina Falsarella, Duarcides Ferreira Mariosa, Diego de Melo Conti, Jorge L. Gallego, Júlio Cardoso Pereira and Isabella Maria Tressino Bruno
Water 2026, 18(12), 1459; https://doi.org/10.3390/w18121459 - 13 Jun 2026
Viewed by 229
Abstract
Monitoring inland surface water quality is essential for water security, ecosystem conservation, public health, and sustainable water resource management. Although in situ measurements remain indispensable, they are often limited by high costs, restricted spatial coverage, low temporal frequency, and discontinuous monitoring networks. This [...] Read more.
Monitoring inland surface water quality is essential for water security, ecosystem conservation, public health, and sustainable water resource management. Although in situ measurements remain indispensable, they are often limited by high costs, restricted spatial coverage, low temporal frequency, and discontinuous monitoring networks. This study presents a systematic literature review, guided by the PRISMA 2020 framework, of empirical studies published between 2021 and 2025 on the integration of artificial intelligence (AI) and remote sensing (RS) for inland surface water quality monitoring. Searches were conducted in the Web of Science database, resulting in a final corpus of 367 peer-reviewed articles. Preliminary bibliometric characterization and qualitative content analysis were performed to identify sensors, platforms, AI paradigms, algorithms, estimated parameters, validation strategies, limitations, challenges, trends, and research gaps. The results show rapid growth in the field, with Sentinel-2 and Landsat-8 as the most recurrent sensors and multispectral data as the dominant spectral source. Machine learning approaches, especially Random Forest, Artificial Neural Networks, XGBoost, and Support Vector Machine, predominated, while deep learning, multi-source integration, hybrid models, and Explainable AI emerged as relevant trends. AI–RS integration shows strong potential to complement conventional monitoring, but persistent challenges remain regarding in situ data dependence, limited external and temporal validation, model transferability, generalization, uncertainty reporting, validation robustness, and interpretability. Full article
49 pages, 3211 KB  
Article
Regime-Aware Stock Index Forecasting Under Latent Market States: A Hybrid Statistical Learning Framework with Cross-Market Validation
by Chunxia Tian, Roengchai Tansuchat and Songsak Sriboonchitta
Forecasting 2026, 8(3), 50; https://doi.org/10.3390/forecast8030050 - 12 Jun 2026
Viewed by 121
Abstract
This study proposes a hybrid forecasting framework that integrates Kalman Filtering (KF), Markov Switching (MS), and nonlinear recurrent learning for stock-index prediction. The KF component smooths short-term price noise, the MS model identifies latent return–volatility regimes, and the LSTM/GRU components learn nonlinear temporal [...] Read more.
This study proposes a hybrid forecasting framework that integrates Kalman Filtering (KF), Markov Switching (MS), and nonlinear recurrent learning for stock-index prediction. The KF component smooths short-term price noise, the MS model identifies latent return–volatility regimes, and the LSTM/GRU components learn nonlinear temporal patterns from regime-conditioned information. The framework is evaluated using the CSI 300, S&P 500, and Nikkei 225 indices through forecasting-accuracy measures, Bootstrap Diebold–Mariano tests with Modified Bayes Factor evidence, out-of-sample trading simulations, and robustness checks. The empirical results show that regime conditioning is the primary source of forecasting and economic improvement. KF–MS–LSTM performs best for the CSI 300 and Standard MS performs strongest for the S&P 500, while KF–MS–LSTM and KF–MS–GRU are more competitive for the Nikkei 225. In contrast, models without regime information, including pure LSTM/GRU and the standalone Transformer, generally exhibit weaker forecasting and trading performance. The findings suggest that latent market-state information is more important than neural-network complexity alone for robust financial forecasting, while the incremental value of Kalman filtering and recurrent learning remains market dependent. Overall, the results support regime-aware forecasting as an interpretable and economically meaningful approach for stock-index prediction under heterogeneous market environments. Full article
Back to TopTop