Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (15)

Search Parameters:
Keywords = nested long short-term memory

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
15 pages, 1516 KB  
Article
Bio-Inspired Multi-Granularity Model for Rice Pests and Diseases Named Entity Recognition in Chinese
by Zhan Tang, Xiaoyu Lu, Enli Liu, Yan Zhong and Xiaoli Peng
Biomimetics 2025, 10(10), 676; https://doi.org/10.3390/biomimetics10100676 - 8 Oct 2025
Viewed by 639
Abstract
Rice, as one of the world’s four major staple crops, is frequently threatened by pests and diseases during its growth. With the rapid expansion of agricultural information data, the effective management and utilization of such data have become crucial for the development of [...] Read more.
Rice, as one of the world’s four major staple crops, is frequently threatened by pests and diseases during its growth. With the rapid expansion of agricultural information data, the effective management and utilization of such data have become crucial for the development of agricultural informatization. Named entity recognition technology offers precise support for the early prevention and control of crop pests and diseases. However, entity recognition for rice pests and diseases faces challenges such as structural complexity and prevalent nesting issues. Inspired by biological visual mechanisms, we propose a deep learning model capable of extracting multi-granularity features. Text representations are encoded using BERT, and the model enhances its ability to capture nested boundary information through multi-granularity convolutional neural networks (CNNs). Finally, sequence modeling and labeling are performed using a bidirectional long short-term memory network (BiLSTM) combined with a conditional random field (CRF). Experimental results demonstrate that the proposed model effectively identifies entities related to rice diseases and pests, achieving an F1 score of 91.74% on a self-constructed dataset. Full article
Show Figures

Figure 1

18 pages, 864 KB  
Article
Enhanced Semantic BERT for Named Entity Recognition in Education
by Ping Huang, Huijuan Zhu, Ying Wang, Lili Dai and Lei Zheng
Electronics 2025, 14(19), 3951; https://doi.org/10.3390/electronics14193951 - 7 Oct 2025
Viewed by 763
Abstract
To address the technical challenges in the educational domain named entity recognition (NER), such as ambiguous entity boundaries and difficulties with nested entity identification, this study proposes an enhanced semantic BERT model (ES-BERT). The model innovatively adopts an education domain, vocabulary-assisted semantic enhancement [...] Read more.
To address the technical challenges in the educational domain named entity recognition (NER), such as ambiguous entity boundaries and difficulties with nested entity identification, this study proposes an enhanced semantic BERT model (ES-BERT). The model innovatively adopts an education domain, vocabulary-assisted semantic enhancement strategy that (1) applies the term frequency–inverse document frequency (TF-IDF) algorithm to weight domain-specific terms, and (2) fuses the weighted lexical information with character-level features, enabling BERT to generate enriched, domain-aware, character–word hybrid representations. A complete bidirectional long short-term memory-conditional random field (BiLSTM-CRF) recognition framework was established, and a novel focal loss-based joint training method was introduced to optimize the process. The experimental design employed a three-phase validation protocol, as follows: (1) In a comparative evaluation using 5-fold cross-validation on our proprietary computer-education dataset, the proposed ES-BERT model yielded a precision of 90.38%, which is higher than that of the baseline models; (2) Ablation studies confirmed the contribution of domain-vocabulary enhancement to performance improvement; (3) Cross-domain experiments on the 2016 knowledge base question answering datasets and resume benchmark datasets demonstrated outstanding precision of 98.41% and 96.75%, respectively, verifying the model’s transfer-learning capability. These comprehensive experimental results substantiate that ES-BERT not only effectively resolves domain-specific NER challenges in education but also exhibits remarkable cross-domain adaptability. Full article
Show Figures

Figure 1

22 pages, 1250 KB  
Article
Entity Span Suffix Classification for Nested Chinese Named Entity Recognition
by Jianfeng Deng, Ruitong Zhao, Wei Ye and Suhong Zheng
Information 2025, 16(10), 822; https://doi.org/10.3390/info16100822 - 23 Sep 2025
Viewed by 602
Abstract
Named entity recognition (NER) is one of the fundamental tasks in building knowledge graphs. For some domain-specific corpora, the text descriptions exhibit limited standardization, and some entity structures have entity nesting. The existing entity recognition methods have problems such as word matching noise [...] Read more.
Named entity recognition (NER) is one of the fundamental tasks in building knowledge graphs. For some domain-specific corpora, the text descriptions exhibit limited standardization, and some entity structures have entity nesting. The existing entity recognition methods have problems such as word matching noise interference and difficulty in distinguishing different entity labels for the same character in sequence label prediction. This paper proposes a span-based feature reuse stacked bidirectional long short term memory network (BiLSTM) nested named entity recognition (SFRSN) model, which transforms the entity recognition of sequence prediction into the problem of entity span suffix category classification. Firstly, character feature embedding is generated through bidirectional encoder representation of transformers (BERT). Secondly, a feature reuse stacked BiLSTM is proposed to obtain deep context features while alleviating the problem of deep network degradation. Thirdly, the span feature is obtained through the dilated convolution neural network (DCNN), and at the same time, a single-tail selection function is introduced to obtain the classification feature of the entity span suffix, with the aim of reducing the training parameters. Fourthly, a global feature gated attention mechanism is proposed, integrating span features and span suffix classification features to achieve span suffix classification. The experimental results on four Chinese-specific domain datasets demonstrate the effectiveness of our approach: SFRSN achieves micro-F1 scores of 83.34% on ontonotes, 73.27% on weibo, 96.90% on resume, and 86.77% on the supply chain management dataset. This represents a maximum improvement of 1.55%, 4.94%, 2.48%, and 3.47% over state-of-the-art baselines, respectively. The experimental results demonstrate the effectiveness of the model in addressing nested entities and entity label ambiguity issues. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Graphical abstract

23 pages, 960 KB  
Article
Multivariate Air Quality Forecasting with Residual Nested LSTM Neural Network Based on DSWT
by Wangjian Li, Yiwen Zhang and Yaoyao Liu
Sustainability 2025, 17(5), 2244; https://doi.org/10.3390/su17052244 - 5 Mar 2025
Cited by 2 | Viewed by 2508
Abstract
With the continuous deterioration of air quality and the increasingly serious environmental problem of air pollution, accurate air quality prediction is of great significance for environmental governance. Air quality index (AQI) prediction based on deep learning is currently a hot research topic. The [...] Read more.
With the continuous deterioration of air quality and the increasingly serious environmental problem of air pollution, accurate air quality prediction is of great significance for environmental governance. Air quality index (AQI) prediction based on deep learning is currently a hot research topic. The neural network model method currently used for prediction has difficulty effectively coping with the high volatility of AQI data and capturing the complex nonlinear relationships and long-term dependencies in the data. To address these issues, this paper proposes multivariate air quality forecasting with a residual nested LSTM neural network based on the discrete stationary wavelet transform (DSWT) model. Firstly, the DSWT data-decomposition technique decomposes each AQI data point into multiple sub-signals. Then, each sub-signal is sent to the NLSTM layer for processing to capture the temporal relationships between different pollutants. The processed results are then combined, using residual connections to mitigate issues of gradient vanishing and explosion during the model training process. The inverse mean squared error method is combined with the simple weighted average method, to serve as the weight-update approach. Back propagation is then applied, to dynamically adjust the weights based on the prediction accuracy of each sample, further enhancing the model’s prediction accuracy. The experiment was conducted on the air quality index dataset of 12 observation stations in and around Beijing. The results show that the proposed model outperforms several existing models and data-processing methods in multi-task AQI prediction. There were significant improvements in mean absolute error (MAE), root mean square error (RMSE), mean absolute percentage error (MAPE), and R square (R2). Full article
(This article belongs to the Section Air, Climate Change and Sustainability)
Show Figures

Figure 1

46 pages, 4014 KB  
Article
Robust Human Activity Recognition for Intelligent Transportation Systems Using Smartphone Sensors: A Position-Independent Approach
by John Benedict Lazaro Bernardo, Attaphongse Taparugssanagorn, Hiroyuki Miyazaki, Bipun Man Pati and Ukesh Thapa
Appl. Sci. 2024, 14(22), 10461; https://doi.org/10.3390/app142210461 - 13 Nov 2024
Cited by 3 | Viewed by 3626
Abstract
This study explores Human Activity Recognition (HAR) using smartphone sensors to address the challenges posed by position-dependent datasets. We propose a position-independent system that leverages data from accelerometers, gyroscopes, linear accelerometers, and gravity sensors collected from smartphones placed either on the chest or [...] Read more.
This study explores Human Activity Recognition (HAR) using smartphone sensors to address the challenges posed by position-dependent datasets. We propose a position-independent system that leverages data from accelerometers, gyroscopes, linear accelerometers, and gravity sensors collected from smartphones placed either on the chest or in the left/right leg pocket. The performance of traditional machine learning algorithms (Decision Trees (DT), K-Nearest Neighbors (KNN), Random Forest (RF), Support Vector Classifier (SVC), and XGBoost) is compared against deep learning models (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Temporal Convolutional Networks (TCN), and Transformer models) under two sensor configurations. Our findings highlight that the Temporal Convolutional Network (TCN) model consistently outperforms other models, particularly in the four-sensor non-overlapping configuration, achieving the highest accuracy of 97.70%. Deep learning models such as LSTM, GRU, and Transformer also demonstrate strong performance, showcasing their effectiveness in capturing temporal dependencies in HAR tasks. Traditional machine learning models, including RF and XGBoost, provide reasonable performance but do not match the accuracy of deep learning models. Additionally, incorporating data from linear accelerometers and gravity sensors led to slight improvements over using accelerometer and gyroscope data alone. This research enhances the recognition of passenger behaviors for intelligent transportation systems, contributing to more efficient congestion management and emergency response strategies. Full article
Show Figures

Figure 1

17 pages, 1305 KB  
Article
rlaNet: A Residual Convolution Nested Long-Short-Term Memory Model with an Attention Mechanism for Wind Turbine Fault Diagnosis
by Ruiwang Sun, Longfei Guan and Naizhe Diao
Mathematics 2024, 12(22), 3460; https://doi.org/10.3390/math12223460 - 6 Nov 2024
Cited by 3 | Viewed by 1334
Abstract
This paper proposes a new fault diagnosis model for wind power systems called residual convolution nested long short-term memory network with an attention mechanism (rlaNet). The method first preprocesses the SCADA data through feature engineering, uses the Hermite interpolation method to handle missing [...] Read more.
This paper proposes a new fault diagnosis model for wind power systems called residual convolution nested long short-term memory network with an attention mechanism (rlaNet). The method first preprocesses the SCADA data through feature engineering, uses the Hermite interpolation method to handle missing data, and uses the mutual information-based dimensionality reduction technique to improve data quality and eliminate redundant information. rlaNet combines residual networks and nested long short-term memory networks to replace traditional convolutional neural networks and standard long short-term memory architectures, thereby improving feature extraction and ensuring the abstractness and depth of the extracted features. In addition, the model emphasizes the weighted learning of spatiotemporal features in the input data, enhances the focus on key features, and improves training efficiency. Experimental results show that rlaNet achieves an accuracy of more than 90% in wind turbine fault diagnosis, showing good robustness. Furthermore, noise simulation experiments verify the model’s resistance to interference, providing a reliable solution for wind turbine fault diagnosis under complex operating conditions. Full article
Show Figures

Figure 1

19 pages, 3066 KB  
Article
Comparative Analysis of Snowmelt-Driven Streamflow Forecasting Using Machine Learning Techniques
by Ukesh Thapa, Bipun Man Pati, Samit Thapa, Dhiraj Pyakurel and Anup Shrestha
Water 2024, 16(15), 2095; https://doi.org/10.3390/w16152095 - 25 Jul 2024
Cited by 2 | Viewed by 3555
Abstract
The rapid advancement of machine learning techniques has led to their widespread application in various domains, including water resources. However, snowmelt modeling remains an area that has not been extensively explored. In this study, we propose a state-of-the-art (SOTA) deep learning sequential model, [...] Read more.
The rapid advancement of machine learning techniques has led to their widespread application in various domains, including water resources. However, snowmelt modeling remains an area that has not been extensively explored. In this study, we propose a state-of-the-art (SOTA) deep learning sequential model, leveraging a Temporal Convolutional Network (TCN), for snowmelt forecasting of the Hindu Kush Himalayan (HKH) region. To evaluate the performance of our proposed model, we conducted a comparative analysis with other popular models, including Support Vector Regression (SVR), Long Short-Term Memory (LSTM), and Transformer models. Furthermore, nested cross-validation (CV) was used with five outer folds and three inner folds, and hyperparameter tuning was performed on the inner folds. To evaluate the performance of the model, the Mean Absolute Error (MAE), Root-Mean-Square Error (RMSE), R square (R2), Kling–Gupta Efficiency (KGE), and Nash–Sutcliffe Efficiency (NSE) were computed for each outer fold. The average metrics revealed that the TCN outperformed the other models, with an average MAE of 0.011, RMSE of 0.023, R2 of 0.991, KGE of 0.992, and NSE of 0.991 for one-day forecasts of streamflow. The findings of this study demonstrate the effectiveness of the proposed deep learning model as compared to traditional machine learning approaches for snowmelt-driven streamflow forecasting. Moreover, the superior performance of this TCN highlights its potential as a promising deep learning model for similar hydrological applications. Full article
(This article belongs to the Special Issue Cold Region Hydrology and Hydraulics)
Show Figures

Figure 1

14 pages, 1764 KB  
Article
Probabilistic Models and Deep Learning Models Assessed to Estimate Design and Operational Ocean Wave Statistics to Reduce Coastal Hazards
by Mourani Sinha, Mrinmoyee Bhattacharya, M. Seemanth and Suchandra A. Bhowmick
Geosciences 2023, 13(12), 380; https://doi.org/10.3390/geosciences13120380 - 12 Dec 2023
Cited by 1 | Viewed by 2404
Abstract
Probabilistic models for long-term estimations and deep learning models for short-term predictions have been evaluated and analyzed for ocean wave parameters. Estimation of design and operational wave parameters for long-term return periods is essential for various coastal and ocean engineering applications. Three probability [...] Read more.
Probabilistic models for long-term estimations and deep learning models for short-term predictions have been evaluated and analyzed for ocean wave parameters. Estimation of design and operational wave parameters for long-term return periods is essential for various coastal and ocean engineering applications. Three probability distributions, namely generalized extreme value distribution (EV), generalized Pareto distribution (PD), and Weibull distribution (WD), have been considered in this work. The design wave parameter considered is the maximal wave height for a specified return period, and the operational wave parameters are the mean maximal wave height and the highest occurring maximal wave height. For precise location-based estimation, wave heights are considered from a nested wave model, which has been configured to have a 10 km spatial resolution. As per availability, buoy-observed data are utilized for validation purposes at the Agatti, Digha, Gopalpur, and Ratnagiri stations along the Indian coasts. At the stations mentioned above, the long short-term memory (LSTM)-based deep learning model is applied to provide short-term predictions with higher accuracy. The probabilistic approach for long-term estimation and the deep learning model for short-term prediction can be used in combination to forecast wave statistics along the coasts, reducing hazards. Full article
(This article belongs to the Section Natural Hazards)
Show Figures

Figure 1

20 pages, 1017 KB  
Article
RAdam-DA-NLSTM: A Nested LSTM-Based Time Series Prediction Method for Human–Computer Intelligent Systems
by Banteng Liu, Wei Chen, Zhangquan Wang, Seyedamin Pouriyeh and Meng Han
Electronics 2023, 12(14), 3084; https://doi.org/10.3390/electronics12143084 - 16 Jul 2023
Cited by 10 | Viewed by 2394
Abstract
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage [...] Read more.
At present, time series prediction methods are widely applied for Human–Computer Intelligent Systems in various fields such as Finance, Meteorology, and Medicine. To enhance the accuracy and stability of the prediction model, this paper proposes a time series prediction method called RAdam-Dual stage Attention mechanism-Nested Long Short-Term Memory (RAdam-DA-NLSTM). First, we design a Nested LSTM (NLSTM), which adopts a new internal LSTM unit structure as the memory cell of LSTM to guide memory forgetting and memory selection. Then, we design a self-encoder network based on the Dual stage Attention mechanism (DA-NLSTM), which uses the NLSTM encoder based on the input attention mechanism, and uses the NLSTM decoder based on the time attention mechanism. Additionally, we adopt the RAdam optimizer to solve the objective function, which dynamically selects Adam and SGD optimizers according to the variance dispersion and constructs the rectifier term to fully express the adaptive momentum. Finally, we use multiple datasets, such as PM2.5 data set, stock data set, traffic data set, and biological signals, to analyze and test this method, and the experimental results show that RAdam-DA-NLSTM has higher prediction accuracy and stability compared with other traditional methods. Full article
Show Figures

Figure 1

25 pages, 11183 KB  
Article
Digital Model of Deflection of a Cable-Stayed Bridge Driven by Deep Learning and Big Data Optimized via PCA-LGBM
by Zili Xia, Junxiao Guo, Zixiang Yue, Youliang Ding, Zhiwen Wang and Shouwang Sun
Sustainability 2023, 15(12), 9623; https://doi.org/10.3390/su15129623 - 15 Jun 2023
Cited by 3 | Viewed by 2009
Abstract
Based on big data, we can build a regression model between a temperature field and a temperature-induced deflection to provide a control group representing the service performance of bridges, which has a positive effect on the full life cycle maintenance of bridges. However, [...] Read more.
Based on big data, we can build a regression model between a temperature field and a temperature-induced deflection to provide a control group representing the service performance of bridges, which has a positive effect on the full life cycle maintenance of bridges. However, the spatial temperature information of a cable-stayed bridge is difficult to describe. To establish a regression model with high precision, the improved PCA-LGBM (principal component analysis and light gradient boosting machine) algorithm is proposed to extract the main temperature features that can reflect the spatial temperature information as accurately and efficiently as possible. Then, in this article, we searched for a suitable digital tool for modeling the regressive relationship between the temperature variables and the temperature-induced deflection of a cable-stayed bridge. The multiple linear regression model has relatively low precision. The precision of the backpropagation neural network (BPNN) model has been improved, but it is still unsatisfactory. The nested long short-term memory (NLSTM) model improves the nonlinear expression ability of the regression model and is more precise than BPNN models and the classical LSTM. The architecture of the NLSTM network is optimized for high precision and to avoid the waste of computational costs. Based on the four main temperature features extracted via the PCA-LGBM, the NLSTM network with double hidden layers and 256 hidden units in each hidden layer has much higher precision than the other regression models. For the NLSTM regression model of the temperature-induced deflection of a cable-stayed bridge, the mean absolute error is only 4.76 mm, and the mean square error is only 18.57 mm2. The control value of the NLSTM regression model is precise and thus provides the potential for early detection of bridge anomalies. This article can provide reference processes and a data extraction algorithm for deflection modeling of other cable-stayed bridges. Full article
(This article belongs to the Special Issue Artificial Intelligence (AI) in Structural Health Monitoring)
Show Figures

Figure 1

18 pages, 6347 KB  
Article
Probabilistic Forecasting of Residential Energy Consumption Based on SWT-QRTCN-ADSC-NLSTM Model
by Ning Jin, Linlin Song, Gabriel Jing Huang and Ke Yan
Information 2023, 14(4), 231; https://doi.org/10.3390/info14040231 - 8 Apr 2023
Cited by 3 | Viewed by 3115
Abstract
Residential electricity consumption forecasting plays a crucial role in the rational allocation of resources reducing energy waste and enhancing the grid-connected operation of power systems. Probabilistic forecasting can provide more comprehensive information for the decision-making and dispatching process by quantifying the uncertainty of [...] Read more.
Residential electricity consumption forecasting plays a crucial role in the rational allocation of resources reducing energy waste and enhancing the grid-connected operation of power systems. Probabilistic forecasting can provide more comprehensive information for the decision-making and dispatching process by quantifying the uncertainty of electricity load. In this study, we propose a method based on stationary wavelet transform (SWT), quantile regression (QR), Bidirectional nested long short-term memory (BiNLSTM), and Depthwise separable convolution (DSC) combined with attention mechanism for electricity consumption probability prediction methods. First, the data sequence is decomposed using SWT to reduce the complexity of the sequence; then, the combined neural network model with attention is used to obtain the prediction values under different quantile conditions. Finally, the probability density curve of electricity consumption is obtained by combining kernel density estimation (KDE). The model was tested using historical demand-side data from five UK households to achieve energy consumption predictions 5 min in advance. It is demonstrated that the model can achieve both reliable probabilistic prediction and accurate deterministic prediction. Full article
(This article belongs to the Special Issue Feature Papers in Information in 2023)
Show Figures

Figure 1

19 pages, 2198 KB  
Article
Modelling of Deep Learning-Based Downscaling for Wave Forecasting in Coastal Area
by Didit Adytia, Deni Saepudin, Dede Tarwidi, Sri Redjeki Pudjaprasetya, Semeidi Husrin, Ardhasena Sopaheluwakan and Gegar Prasetya
Water 2023, 15(1), 204; https://doi.org/10.3390/w15010204 - 3 Jan 2023
Cited by 19 | Viewed by 5707
Abstract
Wave prediction in a coastal area, especially with complex geometry, requires a numerical simulation with a high-resolution grid to capture wave propagation accurately. The resolution of the grid from global wave forecasting systems is usually too coarse to capture wave propagation in the [...] Read more.
Wave prediction in a coastal area, especially with complex geometry, requires a numerical simulation with a high-resolution grid to capture wave propagation accurately. The resolution of the grid from global wave forecasting systems is usually too coarse to capture wave propagation in the coastal area. This problem is usually resolved by performing dynamic downscaling that simulates the global wave condition into a smaller domain with a high-resolution grid, which requires a high computational cost. This paper proposes a deep learning-based downscaling method for predicting a significant wave height in the coastal area from global wave forecasting data. We obtain high-resolution wave data by performing a continuous wave simulation using the SWAN model via nested simulations. The dataset is then used as the training data for the deep learning model. Here, we use the Long Short-Term Memory (LSTM) and Bidirectional LSTM (BiLSTM) as the deep learning models. We choose two study areas, an open sea with a swell-dominated area and a rather close sea with a wind-wave-dominated area. We validate the results of the downscaling with a wave observation, which shows good results. Full article
(This article belongs to the Section Oceans and Coastal Zones)
Show Figures

Figure 1

21 pages, 4341 KB  
Article
STC-NLSTMNet: An Improved Human Activity Recognition Method Using Convolutional Neural Network with NLSTM from WiFi CSI
by Md Shafiqul Islam, Mir Kanon Ara Jannat, Mohammad Nahid Hossain, Woo-Su Kim, Soo-Wook Lee and Sung-Hyun Yang
Sensors 2023, 23(1), 356; https://doi.org/10.3390/s23010356 - 29 Dec 2022
Cited by 27 | Viewed by 5115
Abstract
Human activity recognition (HAR) has emerged as a significant area of research due to its numerous possible applications, including ambient assisted living, healthcare, abnormal behaviour detection, etc. Recently, HAR using WiFi channel state information (CSI) has become a predominant and unique approach in [...] Read more.
Human activity recognition (HAR) has emerged as a significant area of research due to its numerous possible applications, including ambient assisted living, healthcare, abnormal behaviour detection, etc. Recently, HAR using WiFi channel state information (CSI) has become a predominant and unique approach in indoor environments compared to others (i.e., sensor and vision) due to its privacy-preserving qualities, thereby eliminating the need to carry additional devices and providing flexibility of capture motions in both line-of-sight (LOS) and non-line-of-sight (NLOS) settings. Existing deep learning (DL)-based HAR approaches usually extract either temporal or spatial features and lack adequate means to integrate and utilize the two simultaneously, making it challenging to recognize different activities accurately. Motivated by this, we propose a novel DL-based model named spatio-temporal convolution with nested long short-term memory (STC-NLSTMNet), with the ability to extract spatial and temporal features concurrently and automatically recognize human activity with very high accuracy. The proposed STC-NLSTMNet model is mainly comprised of depthwise separable convolution (DS-Conv) blocks, feature attention module (FAM) and NLSTM. The DS-Conv blocks extract the spatial features from the CSI signal and add feature attention modules (FAM) to draw attention to the most essential features. These robust features are fed into NLSTM as inputs to explore the hidden intrinsic temporal features in CSI signals. The proposed STC-NLSTMNet model is evaluated using two publicly available datasets: Multi-environment and StanWiFi. The experimental results revealed that the STC-NLSTMNet model achieved activity recognition accuracies of 98.20% and 99.88% on Multi-environment and StanWiFi datasets, respectively. Its activity recognition performance is also compared with other existing approaches and our proposed STC-NLSTMNet model significantly improves the activity recognition accuracies by 4% and 1.88%, respectively, compared to the best existing method. Full article
Show Figures

Figure 1

18 pages, 2108 KB  
Article
Energy Use Forecasting with the Use of a Nested Structure Based on Fuzzy Cognitive Maps and Artificial Neural Networks
by Katarzyna Poczeta and Elpiniki I. Papageorgiou
Energies 2022, 15(20), 7542; https://doi.org/10.3390/en15207542 - 13 Oct 2022
Cited by 16 | Viewed by 2192
Abstract
The aim of this paper is to present a novel approach to energy use forecasting. We propose a nested fuzzy cognitive map in which each concept at a higher level can be decomposed into another fuzzy cognitive map, multilayer perceptron artificial neural network [...] Read more.
The aim of this paper is to present a novel approach to energy use forecasting. We propose a nested fuzzy cognitive map in which each concept at a higher level can be decomposed into another fuzzy cognitive map, multilayer perceptron artificial neural network or long short-term memory network. Historical data related to energy consumption are used to construct a nested fuzzy cognitive map in order to better understand energy use behavior. Through the experiments, the usefulness of the nested structure in energy demand prediction is demonstrated, by calculating three popular metrics: Mean Square Error, Mean Absolute Error and the correlation coefficient. A comparative analysis is performed, applying classic multilayer perceptron artificial neural networks, long short-term memory networks and fuzzy cognitive maps. The results confirmed that the proposed approach outperforms the classic methods in terms of prediction accuracy. Moreover, the advantage of the proposed approach is the ability to present complex time series in the form of a clear nested structure presenting the main concepts influencing energy consumption on the first level. The second level allows for more detailed problem analysis and lower forecast errors. Full article
Show Figures

Figure 1

13 pages, 1074 KB  
Article
A Study on Standardization of Security Evaluation Information for Chemical Processes Based on Deep Learning
by Lanfei Peng, Dong Gao and Yujie Bai
Processes 2021, 9(5), 832; https://doi.org/10.3390/pr9050832 - 10 May 2021
Cited by 14 | Viewed by 3105
Abstract
Hazard and operability analysis (HAZOP) is one of the most commonly used hazard analysis methods in the petrochemical industry. The large amount of unstructured data in HAZOP reports has generated an information explosion which has led to a pressing need for technologies that [...] Read more.
Hazard and operability analysis (HAZOP) is one of the most commonly used hazard analysis methods in the petrochemical industry. The large amount of unstructured data in HAZOP reports has generated an information explosion which has led to a pressing need for technologies that can simplify the use of this information. In order to solve the problem that massive data are difficult to reuse and share, in this study, we propose a new deep learning framework for Chinese HAZOP documents to perform a named entity recognition (NER) task, aiming at the characteristics of HAZOP documents, such as polysemy, multi-entity nesting, and long-distance text. Specifically, the preprocessed data are input into an embeddings from language models (ELMo) and a double convolutional neural network (DCNN) model to extract rich character features. Meanwhile, a bidirectional long short-term memory (BiLSTM) network is used to extract long-distance semantic information. Finally, the results are decoded by a conditional random field (CRF), and then output. Experiments were carried out using the HAZOP report of a coal seam indirect liquefaction project. The experimental results for the proposed model showed that the accuracy rate of the optimal results reached 90.83, the recall rate reached 92.46, and the F-value reached the highest 91.76%, which was significantly improved as compared with other models. Full article
(This article belongs to the Special Issue Advance in Machine Learning)
Show Figures

Figure 1

Back to TopTop