Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (157)

Search Parameters:
Keywords = CNN-GRU-Attention

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 1790 KiB  
Article
A Hybrid Deep Learning Model for Aromatic and Medicinal Plant Species Classification Using a Curated Leaf Image Dataset
by Shareena E. M., D. Abraham Chandy, Shemi P. M. and Alwin Poulose
AgriEngineering 2025, 7(8), 243; https://doi.org/10.3390/agriengineering7080243 (registering DOI) - 1 Aug 2025
Abstract
In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the [...] Read more.
In the era of smart agriculture, accurate identification of plant species is critical for effective crop management, biodiversity monitoring, and the sustainable use of medicinal resources. However, existing deep learning approaches often underperform when applied to fine-grained plant classification tasks due to the lack of domain-specific, high-quality datasets and the limited representational capacity of traditional architectures. This study addresses these challenges by introducing a novel, well-curated leaf image dataset consisting of 39 classes of medicinal and aromatic plants collected from the Aromatic and Medicinal Plant Research Station in Odakkali, Kerala, India. To overcome performance bottlenecks observed with a baseline Convolutional Neural Network (CNN) that achieved only 44.94% accuracy, we progressively enhanced model performance through a series of architectural innovations. These included the use of a pre-trained VGG16 network, data augmentation techniques, and fine-tuning of deeper convolutional layers, followed by the integration of Squeeze-and-Excitation (SE) attention blocks. Ultimately, we propose a hybrid deep learning architecture that combines VGG16 with Batch Normalization, Gated Recurrent Units (GRUs), Transformer modules, and Dilated Convolutions. This final model achieved a peak validation accuracy of 95.24%, significantly outperforming several baseline models, such as custom CNN (44.94%), VGG-19 (59.49%), VGG-16 before augmentation (71.52%), Xception (85.44%), Inception v3 (87.97%), VGG-16 after data augumentation (89.24%), VGG-16 after fine-tuning (90.51%), MobileNetV2 (93.67), and VGG16 with SE block (94.94%). These results demonstrate superior capability in capturing both local textures and global morphological features. The proposed solution not only advances the state of the art in plant classification but also contributes a valuable dataset to the research community. Its real-world applicability spans field-based plant identification, biodiversity conservation, and precision agriculture, offering a scalable tool for automated plant recognition in complex ecological and agricultural environments. Full article
(This article belongs to the Special Issue Implementation of Artificial Intelligence in Agriculture)
Show Figures

Figure 1

26 pages, 4203 KiB  
Article
Research on Industrial Process Fault Diagnosis Method Based on DMCA-BiGRUN
by Feng Yu, Changzhou Zhang and Jihan Li
Mathematics 2025, 13(15), 2331; https://doi.org/10.3390/math13152331 - 22 Jul 2025
Viewed by 173
Abstract
With the rising automation and complexity level of industrial systems, the efficiency and accuracy of fault diagnosis have become a critical challenge. The convolutional neural network (CNN) has shown some success in the fault diagnosis field. However, typical convolutional kernels are commonly fixed-sized, [...] Read more.
With the rising automation and complexity level of industrial systems, the efficiency and accuracy of fault diagnosis have become a critical challenge. The convolutional neural network (CNN) has shown some success in the fault diagnosis field. However, typical convolutional kernels are commonly fixed-sized, which makes it difficult to capture multi-scale features simultaneously. Additionally, the use of numerous fixed-size convolutional filters often results in redundant parameters. During the feature extraction process, the CNN often struggles to take inter-channel dependencies and spatial location information into consideration. There are also limitations in extracting various time-scale features. To address these issues, a fault diagnosis method on the basis of a dual-path mixed convolutional attention-BiGRU network (DMCA-BiGRUN) is proposed for industrial processes. Firstly, a dual-path mixed CNN (DMCNN) is designed to capture features at multiple scales while effectively reducing the parameter count. Secondly, a coordinate attention mechanism (CAM) is designed to help the network to concentrate on main features more effectively during feature extraction by combining the channel relationship and position information. Finally, a bidirectional gated recurrent unit (BiGRU) is introduced to process sequences in both directions, which can effectively learn the long-range temporal dependencies of sequence data. To verify the fault diagnosis performance of the proposed method, simulation experiments are implemented on the Tennessee Eastman (TE) and Continuous Stirred Tank Reactor (CSTR) datasets. Some deep learning methods are compared in the experiments, and the results confirm the feasibility and superiority of DMCA-BiGRUN. Full article
Show Figures

Figure 1

31 pages, 7723 KiB  
Article
A Hybrid CNN–GRU–LSTM Algorithm with SHAP-Based Interpretability for EEG-Based ADHD Diagnosis
by Makbal Baibulova, Murat Aitimov, Roza Burganova, Lazzat Abdykerimova, Umida Sabirova, Zhanat Seitakhmetova, Gulsiya Uvaliyeva, Maksym Orynbassar, Aislu Kassekeyeva and Murizah Kassim
Algorithms 2025, 18(8), 453; https://doi.org/10.3390/a18080453 - 22 Jul 2025
Viewed by 400
Abstract
This study proposes an interpretable hybrid deep learning framework for classifying attention deficit hyperactivity disorder (ADHD) using EEG signals recorded during cognitively demanding tasks. The core architecture integrates convolutional neural networks (CNNs), gated recurrent units (GRUs), and long short-term memory (LSTM) layers to [...] Read more.
This study proposes an interpretable hybrid deep learning framework for classifying attention deficit hyperactivity disorder (ADHD) using EEG signals recorded during cognitively demanding tasks. The core architecture integrates convolutional neural networks (CNNs), gated recurrent units (GRUs), and long short-term memory (LSTM) layers to jointly capture spatial and temporal dynamics. In addition to the final hybrid architecture, the CNN–GRU–LSTM model alone demonstrates excellent accuracy (99.63%) with minimal variance, making it a strong baseline for clinical applications. To evaluate the role of global attention mechanisms, transformer encoder models with two and three attention blocks, along with a spatiotemporal transformer employing 2D positional encoding, are benchmarked. A hybrid CNN–RNN–transformer model is introduced, combining convolutional, recurrent, and transformer-based modules into a unified architecture. To enhance interpretability, SHapley Additive exPlanations (SHAP) are employed to identify key EEG channels contributing to classification outcomes. Experimental evaluation using stratified five-fold cross-validation demonstrates that the proposed hybrid model achieves superior performance, with average accuracy exceeding 99.98%, F1-scores above 0.9999, and near-perfect AUC and Matthews correlation coefficients. In contrast, transformer-only models, despite high training accuracy, exhibit reduced generalization. SHAP-based analysis confirms the hybrid model’s clinical relevance. This work advances the development of transparent and reliable EEG-based tools for pediatric ADHD screening. Full article
Show Figures

Graphical abstract

32 pages, 8923 KiB  
Article
A Comparative Study of Unsupervised Deep Learning Methods for Anomaly Detection in Flight Data
by Sameer Kumar Jasra, Gianluca Valentino, Alan Muscat and Robert Camilleri
Aerospace 2025, 12(7), 645; https://doi.org/10.3390/aerospace12070645 - 21 Jul 2025
Viewed by 228
Abstract
This paper provides a comparative study of unsupervised Deep Learning (DL) methods for anomaly detection in Flight Data Monitoring (FDM). The paper applies Long Short-Term Memory (LSTM), Gated Recurrent Units (GRUs), Convolutional Neural Network (CNN), classic Transformer architecture, and LSTM combined with a [...] Read more.
This paper provides a comparative study of unsupervised Deep Learning (DL) methods for anomaly detection in Flight Data Monitoring (FDM). The paper applies Long Short-Term Memory (LSTM), Gated Recurrent Units (GRUs), Convolutional Neural Network (CNN), classic Transformer architecture, and LSTM combined with a self-attention mechanism to real-world flight data and compares the results to the current state-of-the-art flight data analysis techniques applied in the industry. The paper finds that LSTM, when integrated with a self-attention mechanism, offers notable benefits over other deep learning methods as it effectively handles lengthy time series like those present in flight data, establishes a generalized model applicable across various airports and facilitates the detection of trends across the entire fleet. The results were validated by industrial experts. The paper additionally investigates a range of methods for feeding flight data (lengthy time series) to a neural network. The innovation of this paper involves utilizing Transformer architecture and LSTM with self-attention mechanism for the first time in the realm of aviation data, exploring the optimal method for inputting flight data into a model and evaluating all deep learning techniques for anomaly detection against the ground truth determined by human experts. The paper puts forth a compelling case for shifting from the existing method, which relies on examining events through threshold exceedances, to a deep learning-based approach that offers a more proactive style of data analysis. This not only enhances the generalization of the FDM process but also has the potential to improve air transport safety and optimize aviation operations. Full article
(This article belongs to the Section Air Traffic and Transportation)
Show Figures

Figure 1

17 pages, 1738 KiB  
Article
Multimodal Fusion Multi-Task Learning Network Based on Federated Averaging for SDB Severity Diagnosis
by Songlu Lin, Renzheng Tang, Yuzhe Wang and Zhihong Wang
Appl. Sci. 2025, 15(14), 8077; https://doi.org/10.3390/app15148077 - 20 Jul 2025
Viewed by 487
Abstract
Accurate sleep staging and sleep-disordered breathing (SDB) severity prediction are critical for the early diagnosis and management of sleep disorders. However, real-world polysomnography (PSG) data often suffer from modality heterogeneity, label scarcity, and non-independent and identically distributed (non-IID) characteristics across institutions, posing significant [...] Read more.
Accurate sleep staging and sleep-disordered breathing (SDB) severity prediction are critical for the early diagnosis and management of sleep disorders. However, real-world polysomnography (PSG) data often suffer from modality heterogeneity, label scarcity, and non-independent and identically distributed (non-IID) characteristics across institutions, posing significant challenges for model generalization and clinical deployment. To address these issues, we propose a federated multi-task learning (FMTL) framework that simultaneously performs sleep staging and SDB severity classification from seven multimodal physiological signals, including EEG, ECG, respiration, etc. The proposed framework is built upon a hybrid deep neural architecture that integrates convolutional layers (CNN) for spatial representation, bidirectional GRUs for temporal modeling, and multi-head self-attention for long-range dependency learning. A shared feature extractor is combined with task-specific heads to enable joint diagnosis, while the FedAvg algorithm is employed to facilitate decentralized training across multiple institutions without sharing raw data, thereby preserving privacy and addressing non-IID challenges. We evaluate the proposed method across three public datasets (APPLES, SHHS, and HMC) treated as independent clients. For sleep staging, the model achieves accuracies of 85.3% (APPLES), 87.1% (SHHS_rest), and 79.3% (HMC), with Cohen’s Kappa scores exceeding 0.71. For SDB severity classification, it obtains macro-F1 scores of 77.6%, 76.4%, and 79.1% on APPLES, SHHS_rest, and HMC, respectively. These results demonstrate that our unified FMTL framework effectively leverages multimodal PSG signals and federated training to deliver accurate and scalable sleep disorder assessment, paving the way for the development of a privacy-preserving, generalizable, and clinically applicable digital sleep monitoring system. Full article
(This article belongs to the Special Issue Machine Learning in Biomedical Applications)
Show Figures

Figure 1

46 pages, 10548 KiB  
Review
A Review of Hybrid LSTM Models in Smart Cities
by Bum-Jun Kim and Il-Woo Nam
Processes 2025, 13(7), 2298; https://doi.org/10.3390/pr13072298 - 18 Jul 2025
Viewed by 573
Abstract
Rapid global urbanization poses complex challenges that demand advanced data-driven forecasting solutions for smart cities. Traditional statistical and standalone Long Short-Term Memory (LSTM) models often struggle to capture non-linear dynamics and long-term dependencies in urban time-series data. This review critically examines hybrid LSTM [...] Read more.
Rapid global urbanization poses complex challenges that demand advanced data-driven forecasting solutions for smart cities. Traditional statistical and standalone Long Short-Term Memory (LSTM) models often struggle to capture non-linear dynamics and long-term dependencies in urban time-series data. This review critically examines hybrid LSTM models that integrate LSTM with complementary algorithms, including CNN, GRU, ARIMA, and SVM. These hybrid architectures aim to enhance prediction accuracy, integrate diverse data sources, and improve computational efficiency. This study systematically reviews principles, trends, and real-world applications, quantitatively evaluating hybrid LSTM models using performance metrics such as mean absolute error (MAE), root mean square error (RMSE), and the coefficient of determination (R2), while identifying key study limitations. The case studies considered include traffic management, environmental monitoring, energy forecasting, public health, infrastructure assessment, and urban waste management. For example, hybrid models have achieved substantial accuracy improvements in traffic congestion forecasting, reducing their mean absolute error by up to 29%. Despite the inherent challenges related to structural complexity, interpretability, and data requirements, ongoing research on attention mechanisms, model compression, and explainable AI has significantly mitigated these limitations. Thus, hybrid LSTM models have emerged as vital analytical tools capable of robust spatiotemporal prediction, effectively supporting sustainable urban development and data-driven decision-making in evolving smart city environments. Full article
Show Figures

Figure 1

19 pages, 5415 KiB  
Article
Intelligent Optimized Diagnosis for Hydropower Units Based on CEEMDAN Combined with RCMFDE and ISMA-CNN-GRU-Attention
by Wenting Zhang, Huajun Meng, Ruoxi Wang and Ping Wang
Water 2025, 17(14), 2125; https://doi.org/10.3390/w17142125 - 17 Jul 2025
Viewed by 257
Abstract
This study suggests a hybrid approach that combines improved feature selection and intelligent diagnosis to increase the operational safety and intelligent diagnosis capabilities of hydropower units. In order to handle the vibration data, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) is [...] Read more.
This study suggests a hybrid approach that combines improved feature selection and intelligent diagnosis to increase the operational safety and intelligent diagnosis capabilities of hydropower units. In order to handle the vibration data, complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) is used initially. A novel comprehensive index is constructed by combining the Pearson correlation coefficient, mutual information (MI), and Kullback–Leibler divergence (KLD) to select intrinsic mode functions (IMFs). Next, feature extraction is performed on the selected IMFs using Refined Composite Multiscale Fluctuation Dispersion Entropy (RCMFDE). Then, time and frequency domain features are screened by calculating dispersion and combined with IMF features to build a hybrid feature vector. The vector is then fed into a CNN-GRU-Attention model for intelligent diagnosis. The improved slime mold algorithm (ISMA) is employed for the first time to optimize the hyperparameters of the CNN-GRU-Attention model. The experimental results show that the classification accuracy reaches 96.79% for raw signals and 93.33% for noisy signals, significantly outperforming traditional methods. This study incorporates entropy-based feature extraction, combines hyperparameter optimization with the classification model, and addresses the limitations of single feature selection methods for non-stationary and nonlinear signals. The proposed approach provides an excellent solution for intelligent optimized diagnosis of hydropower units. Full article
(This article belongs to the Special Issue Optimization-Simulation Modeling of Sustainable Water Resource)
Show Figures

Figure 1

16 pages, 2059 KiB  
Article
A CNN-SA-GRU Model with Focal Loss for Fault Diagnosis of Wind Turbine Gearboxes
by Liqiang Wang, Shixian Dai, Zijian Kang, Shuang Han, Guozhen Zhang and Yongqian Liu
Energies 2025, 18(14), 3696; https://doi.org/10.3390/en18143696 - 13 Jul 2025
Viewed by 284
Abstract
Gearbox failures are a major cause of unplanned downtime and increased maintenance costs, making accurate diagnosis crucial in ensuring wind turbine reliability and cost-efficiency. However, most existing diagnostic methods fail to fully extract the spatiotemporal features in SCADA data and neglect the impact [...] Read more.
Gearbox failures are a major cause of unplanned downtime and increased maintenance costs, making accurate diagnosis crucial in ensuring wind turbine reliability and cost-efficiency. However, most existing diagnostic methods fail to fully extract the spatiotemporal features in SCADA data and neglect the impact of class imbalance, thereby limiting diagnostic accuracy. To address these challenges, this paper proposes a fault diagnosis model for wind turbine gearboxes based on CNN-SA-GRU and Focal Loss. Specifically, a CNN-SA-GRU network is constructed to extract both spatial and temporal features, in which CNN is employed to extract local spatial features from SCADA data, Shuffle Attention is integrated to efficiently fuse channel and spatial information and enhance spatial representation, and GRU is utilized to capture long-term spatiotemporal dependencies. To mitigate the adverse effects of class imbalance, the conventional cross-entropy loss is replaced with Focal Loss, which assigns higher weights to hard-to-classify fault samples. Finally, the model is validated using real wind farm data. The results show that, compared with the cross-entropy loss, using Focal Loss improves the accuracy and F1 score by an average of 0.24% and 1.03%, respectively. Furthermore, the proposed model outperforms other baseline models with average gains of 0.703% in accuracy and 4.65% in F1 score. Full article
Show Figures

Figure 1

20 pages, 1179 KiB  
Article
Conv1D-GRU-Self Attention: An Efficient Deep Learning Framework for Detecting Intrusions in Wireless Sensor Networks
by Kenan Honore Robacky Mbongo, Kanwal Ahmed, Orken Mamyrbayev, Guanghui Wang, Fang Zuo, Ainur Akhmediyarova, Nurzhan Mukazhanov and Assem Ayapbergenova
Future Internet 2025, 17(7), 301; https://doi.org/10.3390/fi17070301 - 4 Jul 2025
Viewed by 410
Abstract
Wireless Sensor Networks (WSNs) consist of distributed sensor nodes that collect and transmit environmental data, often in resource-constrained and unsecured environments. These characteristics make WSNs highly vulnerable to various security threats. To address this, the objective of this research is to design and [...] Read more.
Wireless Sensor Networks (WSNs) consist of distributed sensor nodes that collect and transmit environmental data, often in resource-constrained and unsecured environments. These characteristics make WSNs highly vulnerable to various security threats. To address this, the objective of this research is to design and evaluate a deep learning-based Intrusion Detection System (IDS) that is both accurate and efficient for real-time threat detection in WSNs. This study proposes a hybrid IDS model combining one-dimensional Convolutional Neural Networks (Conv1Ds), Gated Recurrent Units (GRUs), and Self-Attention mechanisms. A Conv1D extracts spatial features from network traffic, GRU captures temporal dependencies, and Self-Attention emphasizes critical sequence components, collectively enhancing detection of subtle and complex intrusion patterns. The model was evaluated using the WSN-DS dataset and demonstrated superior performance compared to traditional machine learning and simpler deep learning models. It achieved an accuracy of 98.6%, precision of 98.63%, recall of 98.6%, F1-score of 98.6%, and an ROC-AUC of 0.9994, indicating strong predictive capability even with imbalanced data. In addition to centralized training, the model was tested under cooperative, node-based learning conditions, where each node independently detects anomalies and contributes to a collective decision-making framework. This distributed approach improves detection efficiency and robustness. The proposed IDS offers a scalable and resilient solution tailored to the unique challenges of WSN security. Full article
Show Figures

Figure 1

19 pages, 8756 KiB  
Article
Predicting Industrial Copper Hydrometallurgy Output with Deep Learning Approach Using Data Augmentation
by Bagdaulet Kenzhaliyev, Nurtugan Azatbekuly, Serik Aibagarov, Bibars Amangeldy, Aigul Koizhanova and David Magomedov
Minerals 2025, 15(7), 702; https://doi.org/10.3390/min15070702 - 30 Jun 2025
Viewed by 354
Abstract
Sustainable copper extraction presents significant challenges due to waste generation and environmental impacts, requiring advanced predictive methodologies to optimize production processes. This study addresses a gap in applying deep learning to forecast hydrometallurgical copper production by comparing six recurrent neural network architectures: Vanilla [...] Read more.
Sustainable copper extraction presents significant challenges due to waste generation and environmental impacts, requiring advanced predictive methodologies to optimize production processes. This study addresses a gap in applying deep learning to forecast hydrometallurgical copper production by comparing six recurrent neural network architectures: Vanilla LSTM, Stacked LSTM, Bidirectional LSTM, GRU, CNN-LSTM, and Attention LSTM. Using time-series data from a full-scale industrial operation, we implemented a data augmentation approach to overcome data scarcity limitations. The models were evaluated through rigorous metrics and multi-step forecasting tests. The results demonstrated remarkable performance from five architectures, with Bidirectional LSTM and Attention LSTM achieving the highest accuracy (RMSE < 0.004, R2 > 0.999, MAPE < 1%). These models successfully captured and reproduced complex cyclical patterns in copper mass production for up to 500 time steps ahead. The findings validate our data augmentation strategy for enabling models to learn complex known cyclical patterns from limited initial data and establish a promising foundation for implementing AI-driven predictive systems that can enhance process control, reduce waste, and advance sustainability in hydrometallurgical operations. However, these performance metrics reflect the models’ ability to reproduce patterns inherent in the augmented dataset derived from a single operational cycle; validation on entirely independent operational data is crucial for assessing true generalization and is a critical next step. Full article
(This article belongs to the Section Mineral Processing and Extractive Metallurgy)
Show Figures

Figure 1

16 pages, 808 KiB  
Article
Enhancing Stock Price Forecasting with CNN-BiGRU-Attention: A Case Study on INDY
by Madilyn Louisa, Gumgum Darmawan and Bertho Tantular
Mathematics 2025, 13(13), 2148; https://doi.org/10.3390/math13132148 - 30 Jun 2025
Viewed by 384
Abstract
The stock price of PT Indika Energy Tbk (INDY) reflects the dynamics of Indonesia’s energy sector, which is heavily influenced by global coal price fluctuations, national energy policies, and geopolitical conditions. This study aimed to develop an accurate forecasting model to predict the [...] Read more.
The stock price of PT Indika Energy Tbk (INDY) reflects the dynamics of Indonesia’s energy sector, which is heavily influenced by global coal price fluctuations, national energy policies, and geopolitical conditions. This study aimed to develop an accurate forecasting model to predict the movement of INDY stock prices using a hybrid machine learning approach called CNN-BiGRU-AM. The objective was to generate future forecasts of INDY stock prices based on historical data from 28 August 2019 to 24 February 2025. The method applied a hybrid model combining a Convolutional Neural Network (CNN), Bidirectional Gated Recurrent Unit (BiGRU), and an Attention Mechanism (AM) to address the nonlinear, volatile, and noisy characteristics of stock data. The results showed that the CNN-BiGRU-AM model achieved high accuracy with a Mean Absolute Percentage Error (MAPE) below 3%, indicating its effectiveness in capturing long-term patterns. The CNN helped extract local features and reduce noise, the BiGRU captured bidirectional temporal dependencies, and the Attention Mechanism allocated weights to the most relevant historical information. The model remained robust even when stock prices were sensitive to external factors such as global commodity trends and geopolitical events. This study contributes to providing more accurate forecasting solutions for companies, investors, and stakeholders in making strategic decisions. It also enriches the academic literature on the application of deep learning techniques in financial data analysis and stock market forecasting within a complex and dynamic environment. Full article
Show Figures

Figure 1

24 pages, 2843 KiB  
Article
Classification of Maize Images Enhanced with Slot Attention Mechanism in Deep Learning Architectures
by Zafer Cömert, Alper Talha Karadeniz, Erdal Basaran and Yuksel Celik
Electronics 2025, 14(13), 2635; https://doi.org/10.3390/electronics14132635 - 30 Jun 2025
Viewed by 289
Abstract
Maize is a vital global crop, serving as a fundamental component of global food security. To support sustainable maize production, the accurate classification of maize seeds—particularly distinguishing haploid from diploid types—is essential for enhancing breeding efficiency. Conventional methods relying on manual inspection or [...] Read more.
Maize is a vital global crop, serving as a fundamental component of global food security. To support sustainable maize production, the accurate classification of maize seeds—particularly distinguishing haploid from diploid types—is essential for enhancing breeding efficiency. Conventional methods relying on manual inspection or simple machine learning are prone to errors and unsuitable for large-scale data. To overcome these limitations, we propose Slot-Maize, a novel deep learning architecture that integrates Convolutional Neural Networks (CNN), Slot Attention, Gated Recurrent Units (GRU), and Long Short-Term Memory (LSTM) layers. The Slot-Maize model was evaluated using two datasets: the Maize Seed Dataset and the Maize Variety Dataset. The Slot Attention module improves feature representation by focusing on object-centric regions within seed images. The GRU captures short-term sequential patterns in extracted features, while the LSTM models long-range dependencies, enhancing temporal understanding. Furthermore, Grad-CAM was utilized as an explainable AI technique to enhance the interpretability of the model’s decisions. The model demonstrated an accuracy of 96.97% on the Maize Seed Dataset and 92.30% on the Maize Variety Dataset, outperforming existing methods in both cases. These results demonstrate the model’s robustness, generalizability, and potential to accelerate automated maize breeding workflows. In conclusion, the Slot-Maize model provides a robust and interpretable solution for automated maize seed classification, representing a significant advancement in agricultural technology. By combining accuracy with explainability, Slot-Maize provides a reliable tool for precision agriculture. Full article
(This article belongs to the Special Issue Data-Related Challenges in Machine Learning: Theory and Application)
Show Figures

Figure 1

20 pages, 4558 KiB  
Article
Remaining Useful Life Prediction of Rolling Bearings Based on an Improved U-Net and a Multi-Dimensional Hybrid Gated Attention Mechanism
by Hengdi Wang and Aodi Shi
Appl. Sci. 2025, 15(13), 7166; https://doi.org/10.3390/app15137166 - 25 Jun 2025
Viewed by 458
Abstract
In practical scenarios, rolling bearing vibration signals suffer from detail loss, and information loss occurs during feature dimensionality reduction and fusion, leading to inaccurate life prediction results. To address these issues, this paper first proposes a method for predicting the remaining useful life [...] Read more.
In practical scenarios, rolling bearing vibration signals suffer from detail loss, and information loss occurs during feature dimensionality reduction and fusion, leading to inaccurate life prediction results. To address these issues, this paper first proposes a method for predicting the remaining useful life (RUL) of bearings, which combines an improved U-Net for enhancing vibration signals and a multi-dimensional hybrid gated attention mechanism (MHGAM) for dynamic feature fusion. The enhanced U-Net effectively suppresses the loss of signal details, while the MHGAM adaptively constructs health indices through multi-dimensional weighting, significantly improving prediction accuracy. Initially, the improved U-Net is utilized for signal preprocessing. By comprehensively considering both channel and spatial dimensions, the MHGAM dynamically assigns fusion weights across different dimensions to construct a health index. Subsequently, the health index is used as input for the Bi-GRU network model to obtain the remaining life prediction results. Finally, comparative analyses between the proposed method and other RUL prediction methods are conducted using the IEEE PHM 2012 bearing dataset (Condition 1: rotational speed 1800 r/min with radial load 4000 N; Condition 2: rotational speed 1650 r/min with radial load 4200 N) and engineering test data (rotational speed 1800 r/min with radial load 4000 N). Experimental results from the IEEE PHM 2012 bearing dataset indicate that this method achieves a low mean root mean square error (RMSE = 0.0504) and mean absolute error (MAE = 0.0239). The engineering test verification results demonstrate that the mean values of RMSE and MAE for this method are 7.8% lower than those of the CNN-BiGRU benchmark model and 14.6% lower than those of the TCN-BiGRU model, respectively. In terms of comprehensive prediction performance scores, the average scores improve by 7.8% and 9.3 percentage points compared with the two benchmark models, respectively. Under various test conditions, the prediction results of this method exhibit commendable comprehensive performance, significantly enhancing the prediction accuracy of bearing remaining useful life. Full article
Show Figures

Figure 1

22 pages, 2209 KiB  
Article
Very Short-Term Load Forecasting Model for Large Power System Using GRU-Attention Algorithm
by Tae-Geun Kim, Sung-Guk Yoon and Kyung-Bin Song
Energies 2025, 18(13), 3229; https://doi.org/10.3390/en18133229 - 20 Jun 2025
Viewed by 406
Abstract
This paper presents a very short-term load forecasting (VSTLF) model tailored for large-scale power systems, employing a gated recurrent unit (GRU) network enhanced with an attention mechanism. To improve forecasting accuracy, a systematic input feature selection method based on Normalized Mutual Information (NMI) [...] Read more.
This paper presents a very short-term load forecasting (VSTLF) model tailored for large-scale power systems, employing a gated recurrent unit (GRU) network enhanced with an attention mechanism. To improve forecasting accuracy, a systematic input feature selection method based on Normalized Mutual Information (NMI) is introduced. Additionally, a novel input feature termed the load variationis proposed to explicitly capture real-time dynamic load patterns. Tailored data preprocessing techniques are applied, including load reconstitution to account for the impact of Behind-The-Meter (BTM) solar generation, and a weighted averaging method for constructing representative weather inputs. Extensive case studies using South Korea’s national power system data from 2021 to 2023 demonstrate that the proposed GRU-attention model significantly outperforms existing approaches and benchmark models. In particular, when expressing the accuracy of the proposed method in terms of the error rate, the Mean Absolute Percentage Error (MAPE) is 0.77%, which shows an improvement of 0.50 percentage points over the benchmark model using the Kalman filter algorithm and an improvement of 0.27 percentage points over the hybrid deep learning benchmark (CNN-BiLSTM). The simulation results clearly demonstrate the effectiveness of the NMI-based feature selection and the combination of load characteristics for very short-term load forecasting. Full article
(This article belongs to the Special Issue Energy, Electrical and Power Engineering: 4th Edition)
Show Figures

Figure 1

21 pages, 4725 KiB  
Article
A Novel Open Circuit Fault Diagnosis for a Modular Multilevel Converter with Modal Time-Frequency Diagram and FFT-CNN-BIGRU Attention
by Ziyuan Zhai, Ning Wang, Siran Lu, Bo Zhou and Lei Guo
Machines 2025, 13(6), 533; https://doi.org/10.3390/machines13060533 - 19 Jun 2025
Viewed by 245
Abstract
Fault diagnosis is one of the most important issues for a modular multilevel converter (MMC). However, conventional solutions are deficient in two aspects. Firstly, they lack the necessary feature information. Secondly, they are incapable of performing open-circuit fault diagnosis of the modular multilevel [...] Read more.
Fault diagnosis is one of the most important issues for a modular multilevel converter (MMC). However, conventional solutions are deficient in two aspects. Firstly, they lack the necessary feature information. Secondly, they are incapable of performing open-circuit fault diagnosis of the modular multilevel converter with the requisite degree of accuracy. To solve this problem, an intelligent diagnosis method is proposed to integrate the modal time–frequency diagram and FFT-CNN-BiGRU-Attention. By selecting the phase current and bridge arm voltage as the core fault parameters, the particle swarm algorithm is used to optimize the Variational Modal Decomposition parameters, and the fault signal is decomposed and reconstructed into sensitive feature components. The reconstructed signals are further transformed into modal time–frequency diagrams via continuous wavelet transform to fully retain the time–frequency domain features. In the model construction stage, the frequency–domain features are first extracted using the fast Fourier transform (FFT), and the local patterns are captured through a combination with a convolutional neural network; subsequently, the timing correlations are analyzed using bidirectional gated loop cells, and the Attention Mechanism is introduced to strengthen the key features. Simulations show that the proposed method achieves 98.63% accuracy in locating faulty insulated gate bipolar transistors (IGBTs) in the sub-module, with second-level real-time response capability. Compared with the recently published scheme, it maintains stable performance under complex working conditions such as noise interference and data imbalances, showing stronger robustness and practical value. This study provides a new idea for the intelligent operation and maintenance of power electronic devices, which can be extended to the fault diagnosis of other power equipment in the future. Full article
(This article belongs to the Section Electromechanical Energy Conversion Systems)
Show Figures

Figure 1

Back to TopTop