A Hybrid Stacked CNN and Residual Feedback GMDH-LSTM Deep Learning Model for Stroke Prediction Applied on Mobile AI Smart Hospital Platform

Artificial intelligence (AI) techniques for intelligent mobile computing in healthcare has opened up new opportunities in healthcare systems. Combining AI techniques with the existing Internet of Medical Things (IoMT) will enhance the quality of care that patients receive at home remotely and the successful establishment of smart living environments. Building a real AI for mobile AI in an integrated smart hospital environment is a challenging problem due to the complexities of receiving IoT medical sensors data, data analysis, and deep learning algorithm complexity programming for mobile AI engine implementation AI-based cloud computing complexities, especially when we tackle real-time environments of AI technologies. In this paper, we propose a new mobile AI smart hospital platform architecture for stroke prediction and emergencies. In addition, this research is focused on developing and testing different modules of integrated AI software based on XAI architecture, this is for the mobile health app as an independent expert system or as connected with a simulated environment of an AI-cloud-based solution. The novelty is in the integrated architecture and results obtained in our previous works and this extended research on hybrid GMDH and LSTM deep learning models for the proposed artificial intelligence and IoMT engine for mobile health edge computing technology. Its main goal is to predict heart–stroke disease. Current research is still missing a mobile AI system for heart/brain stroke prediction during patient emergency cases. This research work implements AI algorithms for stroke prediction and diagnosis. The hybrid AI in connected health is based on a stacked CNN and group handling method (GMDH) predictive analytics model, enhanced with an LSTM deep learning module for biomedical signals prediction. The techniques developed depend on the dataset of electromyography (EMG) signals, which provides a significant source of information for the identification of normal and abnormal motions in a stroke scenario. The resulting artificial intelligence mHealth app is an innovation beyond the state of the art and the proposed techniques achieve high accuracy as stacked CNN reaches almost 98% for stroke diagnosis. The GMDH neural network proves to be a good technique for monitoring the EMG signal of the same patient case with an average accuracy of 98.60% to an average of 96.68% of the signal prediction. Moreover, extending the GMDH model and a hybrid LSTM with dense layers deep learning model has improved significantly the prediction results that reach an average of 99%.


Introduction
The proposed architecture aims to develop, analyze and incorporate artificial intelligence and deep learning technology and extend our previous research on mobile AI telemedicine platforms [1] to harness the findings of research and development in the fields 1.
An innovative automated proposed biomedical deep learning cloud platform for stroke patients' emergencies and remote using stacked convolutional neural networks the proposed solution offers complete intelligent healthcare services inside homes, for elderlies, families, and emergency care services. The main goal is heart stroke prediction, monitoring, and diagnosis. The AI-connected health platform includes deep learning models to the cloud and a mHealth module to send alerts. 2.
The innovative artificial intelligence telemedicine platform for stroke prediction and emergency situations. That depends on statistical methods for EMG signal tracking and prediction such as group handling methods (GMDH) neural network [8,9] for patient stroke real-time prediction. The GMDH deep learning model is further enhanced with LSTM deep learning module [18,19].

3.
A new real-time CNN-stroke and heart and BAN-IOT: a deep learning model for signal deep feature extraction and classification within big data streaming environment The architecture of the proposed intelligent hospital for the connected health modules is presented in Figure 1. It consists of a wearable sensors module that sends signal streams for signal processing modules and mobile AI health for stroke prediction. It can be connected to the cloud, as seen in the following figure, in order to accomplish effectively the aims of the research. The smart hospital stroke system, which integrates recent advances in artificial intelligence and predictive computing with telemedicine applications, is a continuously growing field in telemedicine. A stroke can cause sudden death and is a matter of urgency. It is one of America's leading causes of death. Heart/brain stroke, for these reasons, is an emergency and must be treated promptly before any complications occur. Recent research shows that a smart hospital heart/stroke system is at the forefront of current research, especially in the field of chronic diseases and emergency conditions such as heart attacks. Today, however, an intelligent patient control screening device is lacking. In addition, such a system needs smart algorithms for patient stroke prediction and emergencies to warn better diagnostic decisions and fast patient care response in the process. In this paper, a modern intelligent hybrid architecture is proposed. The most important activities and actions in this innovative architecture for smart hospital-connected health approach are concentrated around the individual person/patient. posed ensemble CNNs' performance. The proposed algorithm could classify the data into sixteen classes. In addition to that, the sensitivity was 99.03%, specificity was 99.94%, positive predictive was 99.03%, and accuracy was 99.88%. Predictive analytics in healthcare decision-making [3,47,48] deals with information retrieval to predict an unknown event of interest, typically a future event. Using technology that learns from data to predict these unknown events could drive better decisions.

Materials and Methods
The architecture of the proposed intelligent hospital for the connected health modules is presented in Figure 1. It consists of a wearable sensors module that sends signal streams for signal processing modules and mobile AI health for stroke prediction. It can be connected to the cloud, as seen in the following figure, in order to accomplish effectively the aims of the research. The smart hospital stroke system, which integrates recent advances in artificial intelligence and predictive computing with telemedicine applications, is a continuously growing field in telemedicine. A stroke can cause sudden death and is a matter of urgency. It is one of America's leading causes of death. Heart/brain stroke, for these reasons, is an emergency and must be treated promptly before any complications occur. Recent research shows that a smart hospital heart/stroke system is at the forefront of current research, especially in the field of chronic diseases and emergency conditions such as heart attacks. Today, however, an intelligent patient control screening device is lacking. In addition, such a system needs smart algorithms for patient stroke prediction and emergencies to warn better diagnostic decisions and fast patient care response in the process. In this paper, a modern intelligent hybrid architecture is proposed. The most important activities and actions in this innovative architecture for smart hospital-connected health approach are concentrated around the individual person/patient.  The proposed Mobile AI Health Agent receives all necessary inputs from medical sensors, and sensors operating on EMG, as will be explained in this article, but it can be extended to other biomedical sensors for stroke and heart diseases [24]. On the other hand, after the sensors' data and information are processed and results are obtained, they are delivered to the individual under tracking.
The proposed architecture of Figure 1 is also integrated with set of DSS tools for explainable artificial intelligence (XAI)-based human-centric (HC) applications, as shown in Figure 2, where a medical decision support system (MDSS) is proposed. It is directed to cover concrete individualized needs of the patient under treatment, medication, or social care, as well as on-hand competence of concrete sub-division in a medical institution taking care of this personal case. Here, the block of comparatively static diagnostics contains a personalized toolset dedicated to primary diagnosis, predictive state classification, assignments, and individual recommendations (A), based on a set of DSS tools built on the basis of XAI.
after the sensors' data and information are processed and results are obtained, they ar delivered to the individual under tracking.
The proposed architecture of Figure 1 is also integrated with set of DSS tools for ex plainable artificial intelligence (XAI)-based human-centric (HC) applications, as shown in Figure 2, where a medical decision support system (MDSS) is proposed. It is directed to cover concrete individualized needs of the patient under treatment, medication, or socia care, as well as on-hand competence of concrete sub-division in a medical institution tak ing care of this personal case. Here, the block of comparatively static diagnostics contain a personalized toolset dedicated to primary diagnosis, predictive state classification, as signments, and individual recommendations (A), based on a set of DSS tools built on the basis of XAI. The next block of dynamic diagnostics online is dedicated to individualized observa tion of the individual under permanent healthcare service and is responsible for person alized surveillance, all prescribed procedures, and active recommendations (B) for both actors: the patient, and health professionals as well.
The third block is similar to the second, but it is equipped with additional tools fo offline modeling of proposed procedures in case any uncertainty appears. So, the outpu of this block (C) contains all recommended procedures of surveillance, results of the in vitro modeling, and recommendations concerning all possible emergency activities. Figure 3 illustrates the relation between the XA_ tools concept and it is integrated into our mobile AI smart hospital platform, where all digital data and evaluations ar ready to be analyzed and pre-processed using deep learning [4,11,[13][14][15] and feature ex traction methods [28] for both Artificial intelligence telemedicine for smart ai hospita heart/stroke health units and IoT-edge cloud AI biomedical sensors processing [23][24][25]28] On the other hand, after the sensors' data and information are processed and results ar obtained, they are delivered to the individual under tracking (A), this is where this pape presents most of the research article. The next block of XAI architecture dynamic diagnos tics online is dedicated to individualized observation of the individual under permanen health care service and is responsible for personalized surveillance, all prescribed proce dures, and active recommendations (B) for both actors: a patient, and health professional as well. The next block of dynamic diagnostics online is dedicated to individualized observation of the individual under permanent healthcare service and is responsible for personalized surveillance, all prescribed procedures, and active recommendations (B) for both actors: the patient, and health professionals as well.
The third block is similar to the second, but it is equipped with additional tools for offline modeling of proposed procedures in case any uncertainty appears. So, the output of this block (C) contains all recommended procedures of surveillance, results of the in vitro modeling, and recommendations concerning all possible emergency activities. Figure 3 illustrates the relation between the XA_ tools concept and it is integrated into our mobile AI smart hospital platform, where all digital data and evaluations are ready to be analyzed and pre-processed using deep learning [4,11,[13][14][15] and feature extraction methods [28] for both Artificial intelligence telemedicine for smart ai hospital heart/stroke health units and IoT-edge cloud AI biomedical sensors processing [23][24][25]28]. On the other hand, after the sensors' data and information are processed and results are obtained, they are delivered to the individual under tracking (A), this is where this paper presents most of the research article. The next block of XAI architecture dynamic diagnostics online is dedicated to individualized observation of the individual under permanent health care service and is responsible for personalized surveillance, all prescribed procedures, and active recommendations (B) for both actors: a patient, and health professionals as well.
The main and most important activities and actions in this human-centric approach are concentrated around the individual on TIER 1 (on the left side of Figure 3). Here, DSS receives all necessary inputs from ordinary body sensors, sensors operating on information based on individual's location (location-based sensors) and for the first time involved Soft sensors dealing with information about environment. TIER 2 consists of fuzzy logic modules, which are not yet implemented in this paper. It will be considered for future work. The main and most important activities and actions in this human-centric approach are concentrated around the individual on TIER 1 (on the left side of Figure 3). Here, DSS receives all necessary inputs from ordinary body sensors, sensors operating on information based on individual's location (location-based sensors) and for the first time involved Soft sensors dealing with information about environment. TIER 2 consists of fuzzy logic modules, which are not yet implemented in this paper. It will be considered for future work.

Mobile AI Smart Hospital Platform: Artificial Intelligence Materials and Methods for Stroke Prediction at Home Care Emergencies Scenarios
An innovative automated biomedical deep learning cloud platform for stroke patients' emergencies and remote monitoring is presented in this section. Figure 4 illustrates the possible implementation of the system for home care stroke emergencies. Personalized early risk detection and intervention solutions for prevention and treatments based on early risk detection are paramount for people facing increased health and social risks. As shown in Figure 3, the proposed solution offers complete intelligent healthcare services inside homes, for the elderly, families, and emergency care services. The main goal is for heart/brain stroke prediction, monitoring, and diagnosis. The platform [24] includes deep learning models and a mHealth module to send alerts. Two important techniques, stacked convolutional and pooling layers for biomedical sensors signal correlations are presented in this section. . XAI general architecture consists of TIER1, TIER2, and TIER3. TIER 1 has been implemented as AI software modules for the proposed mobile AI smart hospital system.

Mobile AI Smart Hospital Platform: Artificial Intelligence Materials and Methods for Stroke Prediction at Home Care Emergencies Scenarios
An innovative automated biomedical deep learning cloud platform for stroke patients' emergencies and remote monitoring is presented in this section. Figure 4 illustrates the possible implementation of the system for home care stroke emergencies. Personalized early risk detection and intervention solutions for prevention and treatments based on early risk detection are paramount for people facing increased health and social risks. As shown in Figure 3, the proposed solution offers complete intelligent healthcare services inside homes, for the elderly, families, and emergency care services. The main goal is for heart/brain stroke prediction, monitoring, and diagnosis. The platform [24] includes deep learning models and a mHealth module to send alerts. Two important techniques, stacked convolutional and pooling layers for biomedical sensors signal correlations are presented in this section.

Stacked Convolutional and Pooling Layers for Biomedical Sensors Signal Correlations
This section exploits the usage of convolutional layers, as shown in Figure 5 and their ability to extract several activation maps per signal, thus enabling us to deeply extract the correlated signal features.

Stacked Convolutional and Pooling Layers for Biomedical Sensors Signal Correlations
This section exploits the usage of convolutional layers, as shown in Figure 5 and their ability to extract several activation maps per signal, thus enabling us to deeply extract the correlated signal features.

Stacked Convolutional and Pooling Layers for Biomedical Sensors Signal Correlations
This section exploits the usage of convolutional layers, as shown in Figure 5 and their ability to extract several activation maps per signal, thus enabling us to deeply extract the correlated signal features.   Thus, the usage of stacked convolutional and pooling layers has been introduced that act as deep convolutional networks for extracting hidden signal features. A Softmax layer is then added to evaluate the model's ability to classify input signals. The extracted features can later be used as input to another learning model that will act directly on the constructed feature vectors. Figure 5 illustrates the architecture of the tested CNN deep learning model.
The model has been tested in a scenario with a significant number of patients, sending biomedical real-time feedback through interconnected IoT devices and biomedical sensors, such as EMG, ECG, BAN, and IMU signals. The CNN model architecture is given as follows: Convolutional Layer: It aims to extract deep features of the input signal through several activation filters. The network extracts deep features by applying 1-D filters through the input signal and then outputs a different shape of the input signal. Pooling Layer: It aims to lower the dimensionality of the big data streamed to the convolutional layer and its output, reduces computation time, and helps the network converge. Max pooling has been used along with batch normalization. Fully Connected Layer: It flattens the output from higher dimensions down through a fully connected network of neurons, and then reduces its dimension. Multiple fully connected layers are used to feed the Softmax experimental classifier. The proposed stacked CNN network time measurements are shown in Table 1.

Stroke Prediction Using GMDH-Type Neural Network Enhanced with LSTM Module
This section describes the use of group method data handling (GMDH) to predict the value of the signal time series. For this, a multi-layered parametric iteration GMDH algorithm with polynomial reference functions is implemented.
It is a sorting out of gradually complex models generated from Kolmogorov-Gabor polynomial (Figures 6 and 7).

Stroke Prediction Using GMDH-Type Neural Network Enhanced with LSTM Module
This section describes the use of group method data handling (GMDH) to predict the value of the signal time series. For this, a multi-layered parametric iteration GMDH algorithm with polynomial reference functions is implemented.
It is a sorting out of gradually complex models generated from Kolmogorov-Gabor polynomial (Figures 6 and 7).

Stroke Prediction Using GMDH-Type Neural Network Enhanced with LSTM Module
This section describes the use of group method data handling (GMDH) to predict the value of the signal time series. For this, a multi-layered parametric iteration GMDH algorithm with polynomial reference functions is implemented.
It is a sorting out of gradually complex models generated from Kolmogorov-Gabor polynomial (Figures 6 and 7).   We transform the input signal into a supervised problem, therefore making the GMDH able to predict its behavior. The graph below ( Figure 8) shows a sample signal that will be later transformed into a supervised problem to feed the neural network. We transform the input signal into a supervised problem, therefore making the GMDH able to predict its behavior. The graph below ( Figure 8) shows a sample signal that will be later transformed into a supervised problem to feed the neural network. The best model is chosen by the minimum of a specified external criterion characteristic, thus making it equivalent to an artificial neural network with polynomial activation functions for neurons.
The values of the previously shown signal are then transformed into a supervised problem as follows: We then feed the resulting feature matrices and their corresponding labels to the neural network to start the training process. The developed GMDH deep learning model [30] is built based on the reference function, then during the training process based on the selected external criterion the neurons are eliminated during the training process and the best model is created.
A selection criterion exists to perform the neurons dropout at each layer thus performing the select-and-drop training process. Selection criteria are: validation score, bias, validation score, and bias. As seen in the graph, specific neurons are selected based on the criterion and are dropped out in the next layer training process.
The training process of the neural network is stopped based on two criteria: error is not decreasing anymore, or the neural network has reached its maximum number of layers.
Least mean squared error (LMSE) is used as a loss function. The maximum number of layers for the network is defined externally before the training process.
The resulting feature matrices and their corresponding labels are then fed to the neural network to start the training process. Several transfer functions are available for the polynomial neural network; the transfer function is used as an activation function for regression problems using GMDH-type neural network. A GMDH can formulate an optimization of the structure based on the current transfer function, each transfer function is also adaptively created by another self-organizing process.

A Proposed Hybrid LSTM with Dense Layers Deep Learning Model for Stroke Prediction
LSTM is a special kind of RNN [15,22], which shows outstanding performance on a large variety of problems. It maintains state (memory) across very long sequences, basic architecture is shown in Figure 9, because LSTM is very sensitive to the data ranges we applied data normalization and scaling in the input and output. We used standard scaling for the input. It can be solved using linear activation in the output layer. The best model is chosen by the minimum of a specified external criterion characteristic, thus making it equivalent to an artificial neural network with polynomial activation functions for neurons.
The values of the previously shown signal are then transformed into a supervised problem as follows: We then feed the resulting feature matrices and their corresponding labels to the neural network to start the training process. The developed GMDH deep learning model [30] is built based on the reference function, then during the training process based on the selected external criterion the neurons are eliminated during the training process and the best model is created.
A selection criterion exists to perform the neurons dropout at each layer thus performing the select-and-drop training process. Selection criteria are: validation score, bias, validation score, and bias. As seen in the graph, specific neurons are selected based on the criterion and are dropped out in the next layer training process.
The training process of the neural network is stopped based on two criteria: error is not decreasing anymore, or the neural network has reached its maximum number of layers.
Least mean squared error (LMSE) is used as a loss function. The maximum number of layers for the network is defined externally before the training process.
The resulting feature matrices and their corresponding labels are then fed to the neural network to start the training process. Several transfer functions are available for the polynomial neural network; the transfer function is used as an activation function for regression problems using GMDH-type neural network. A GMDH can formulate an optimization of the structure based on the current transfer function, each transfer function is also adaptively created by another self-organizing process.

A Proposed Hybrid LSTM with Dense Layers Deep Learning Model for Stroke Prediction
LSTM is a special kind of RNN [15,22], which shows outstanding performance on a large variety of problems. It maintains state (memory) across very long sequences, basic architecture is shown in Figure 9, because LSTM is very sensitive to the data ranges we applied data normalization and scaling in the input and output. We used standard scaling for the input. It can be solved using linear activation in the output layer. Sensors 2023, 23, x FOR PEER REVIEW 11 of 35 Figure 9. Basic LSTM deep learning gates [15,22].
A novel Stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs a novel architecture consisting of multiple LSTM recurrent dense networks as shown in Figure 10. Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods such as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and real-time execution on IoMT devices. In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
The first part chooses whether the information coming from the previous timestamp is to be remembered or is irrelevant and can be forgotten. In the second part, the cell tries to learn new information from the input to this cell. At last, in the third part, the cell passes the updated information from the current timestamp to the next timestamp.
In this article, we have built a hybrid LSTM model concatenated with dense layers. The LSTM modules are based on the basic parts of LSTM gates, each of which consists of three parts, the first part is called forget gate, the second part is known as the input gate and the last one is the output gate [15,22]. The detailed hybrid model LSTM/dense deep learning model is shown in Figure 10. For more illustrations as shown below. Its input_output takes all the EMG 8 channels of the EMG physical action dataset [47,48]. Our input features shape is (1, 256) for each 8muscle signal.
In addition, output classes are one-shot encode vectors with shape (None, 20). A novel Stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs a novel architecture consisting of multiple LSTM recurrent dense networks as shown in Figure 10. Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods such as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and real-time execution on IoMT devices. In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
The first part chooses whether the information coming from the previous timestamp is to be remembered or is irrelevant and can be forgotten. In the second part, the cell tries to learn new information from the input to this cell. At last, in the third part, the cell passes the updated information from the current timestamp to the next timestamp.
In this article, we have built a hybrid LSTM model concatenated with dense layers. The LSTM modules are based on the basic parts of LSTM gates, each of which consists of three parts, the first part is called forget gate, the second part is known as the input gate and the last one is the output gate [15,22]. The detailed hybrid model LSTM/dense deep learning model is shown in Figure 10. For more illustrations as shown below. Its input_output takes all the EMG 8 channels of the EMG physical action dataset [47,48]. Our input features shape is (1, 256) for each 8-muscle signal. A novel Stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs a novel architecture consisting of multiple LSTM recurrent dense networks as shown in Figure 10. Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods such as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and real-time execution on IoMT devices. In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
The first part chooses whether the information coming from the previous timestamp is to be remembered or is irrelevant and can be forgotten. In the second part, the cell tries to learn new information from the input to this cell. At last, in the third part, the cell passes the updated information from the current timestamp to the next timestamp.
In this article, we have built a hybrid LSTM model concatenated with dense layers. The LSTM modules are based on the basic parts of LSTM gates, each of which consists of three parts, the first part is called forget gate, the second part is known as the input gate and the last one is the output gate [15,22]. The detailed hybrid model LSTM/dense deep learning model is shown in Figure 10. For more illustrations as shown below. Its input_output takes all the EMG 8 channels of the EMG physical action dataset [47,48]. Our input features shape is (1, 256) for each 8muscle signal.
In addition, output classes are one-shot encode vectors with shape (None, 20). In addition, output classes are one-shot encode vectors with shape (None, 20). A novel Stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs a novel architecture consisting of multiple LSTM recurrent dense networks as shown in Figure 10. Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods such as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and real-time execution on IoMT devices. In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
The first part chooses whether the information coming from the previous timestamp is to be remembered or is irrelevant and can be forgotten. In the second part, the cell tries to learn new information from the input to this cell. At last, in the third part, the cell passes the updated information from the current timestamp to the next timestamp.
In this article, we have built a hybrid LSTM model concatenated with dense layers. The LSTM modules are based on the basic parts of LSTM gates, each of which consists of three parts, the first part is called forget gate, the second part is known as the input gate and the last one is the output gate [15,22]. The detailed hybrid model LSTM/dense deep learning model is shown in Figure 10. For more illustrations as shown below. Its input_output takes all the EMG 8 channels of the EMG physical action dataset [47,48]. Our input features shape is (1, 256) for each 8muscle signal.
In addition, output classes are one-shot encode vectors with shape (None, 20).

Data Preprocessing
Two methods have been used to preprocess the data The first one is called feature scaling, these methods do not aim to extract new information from the data, it changes the scale of it only.
The user's biofeedback signal is extracted on a server that is monitoring human health conditions based on emerging wireless mobile technologies with wireless body sensors. Different datasets have been used for the experiments. The first contains EMG signals with two target classes: normal and aggressive. This task is considered a time series data classification problem. Table 2 shows the EMG physical action dataset includes 4 different patients, performing 10 different actions, each patient is classified as binary normal or abnormal.

Data Preprocessing
Two methods have been used to preprocess the data The first one is called feature scaling, these methods do not aim to extract new information from the data, it changes the scale of it only.
The user's biofeedback signal is extracted on a server that is monitoring human health conditions based on emerging wireless mobile technologies with wireless body sensors. Different datasets have been used for the experiments. The first contains EMG signals with two target classes: normal and aggressive. This task is considered a time series data classification problem. Table 2 shows the EMG physical action dataset includes 4 different patients, performing 10 different actions, each patient is classified as binary normal or abnormal. The second method produces new features from the data such as RMS. We tested the two methods with different models and this section needs more investigation.
-Feature Scaling: -Standard Scaling: Standardize features by removing the mean and scaling to unit variance. This is a 2D vis. For the effect standard scaling at 8 channels of Subject 1 at normal state while handshaking. -

Robust Scaling
Scale features using statistics that are robust to outliers. -

Min-Max Scaling
It essentially shrinks the range such that the range is now between 0 and 1 (or −1 to 1 if there are negative values). -Normalizer The normalizer scales each value by dividing each value by its magnitude in ndimensional space for n number of features ( Figure 11). This is a 3D vis. For the effect normalization at 3 channels of Subject 1 at normal state while handshaking. The second method produces new features from the data such as RMS. We tested the two methods with different models and this section needs more investigation.
-Feature Scaling: -Standard Scaling: Standardize features by removing the mean and scaling to unit variance. This is a 2D vis. For the effect standard scaling at 8 channels of Subject 1 at normal state while handshaking. -

Robust Scaling
Scale features using statistics that are robust to outliers. -

Min-Max Scaling
It essentially shrinks the range such that the range is now between 0 and 1 (or −1 to 1 if there are negative values). -Normalizer The normalizer scales each value by dividing each value by its magnitude in n-dimensional space for n number of features ( Figure 11). This is a 3D vis. For the effect normalization at 3 channels of Subject 1 at normal state while handshaking. Figure 11. This is a 3D vis. For the effect normalization at 3 channels of Subject 1 at normal state while handshaking. The aim of this figure is to picture the way the data are pre-processed by representing three dimensions out of eight.
Wavelet transforms are some of the more efficient techniques for processing nonstationary signals such as biomedical signals (e.g., EMG). Wavelet transforms the signal into its time-frequency domains. There are two types of wavelet analysis, discrete wavelet transform (DWT) and continuous wavelet transform (CWT). Figure 11. The subfigure on the left corresponds to feature scaling process while the subfigure on the right corresponds to standard scaling. Standardize features by removing the mean and scaling to unit variance in order to process the signals easier.
Both of them consume little time for signal processing. CWT is more consistent, but DWT has proven efficiency in analyzing nonstationary signals, although it yields a highdimensional feature vector. In our research, discrete wavelet transform (DWT) is used for analyzing the EMG signal and extracting significant features which are very useful in identification of healthy, myopathic, and neuropathic subjects.
Seven features of the EMG signal are taken into consideration in this research. Root mean square (RMS), mean absolute value (MAV), zero crossing (ZC), slope sign change (SSC), and standard deviation (SD). Each one of these features is used as input to the classification process which is the next phase after the feature extraction process. The EMG MAV feature window size and values sample are presented in Figure 12. Wavelet transforms are some of the more efficient techniques for processing nonstationary signals such as biomedical signals (e.g., EMG). Wavelet transforms the signal into its time-frequency domains. There are two types of wavelet analysis, discrete wavelet transform (DWT) and continuous wavelet transform (CWT). Figure 11. The subfigure on the left corresponds to feature scaling process while the subfigure on the right corresponds to standard scaling. Standardize features by removing the mean and scaling to unit variance in order to process the signals easier.
Both of them consume little time for signal processing. CWT is more consistent, but DWT has proven efficiency in analyzing nonstationary signals, although it yields a highdimensional feature vector. In our research, discrete wavelet transform (DWT) is used for analyzing the EMG signal and extracting significant features which are very useful in identification of healthy, myopathic, and neuropathic subjects.
Seven features of the EMG signal are taken into consideration in this research. Root mean square (RMS), mean absolute value (MAV), zero crossing (ZC), slope sign change (SSC), and standard deviation (SD). Each one of these features is used as input to the classification process which is the next phase after the feature extraction process. The EMG MAV feature window size and values sample are presented in Figure 12. A Daubechies wavelet function [34] of degree four (db4) was applied on each frame of the EMG signals in training and testing data so that the next step is to extract time and time-frequency features from the resulting processed signal ( Figure 13). This will be the main expert system engine for suggested initial diagnosis and emergency calls to the nearest hospitals for overall patient management and safety. It will depend on neural networks and case-based reasoning technologies.
The first step in our system is the sensor collects data. Mobile sensing process is shown in Figure 14, where mobile device evaluates data. Then, mobile device sends aggregated data to the telemedicine server. Then, telemedicine server evaluates data and informs physicians about our upcoming developments in artificial intelligence expert system. A Daubechies wavelet function [34] of degree four (db4) was applied on each frame of the EMG signals in training and testing data so that the next step is to extract time and time-frequency features from the resulting processed signal ( Figure 13). A Daubechies wavelet function [34] of degree four (db4) was applied on each frame of the EMG signals in training and testing data so that the next step is to extract time and time-frequency features from the resulting processed signal ( Figure 13). This will be the main expert system engine for suggested initial diagnosis and emergency calls to the nearest hospitals for overall patient management and safety. It will depend on neural networks and case-based reasoning technologies.
The first step in our system is the sensor collects data. Mobile sensing process is shown in Figure 14, where mobile device evaluates data. Then, mobile device sends aggregated data to the telemedicine server. Then, telemedicine server evaluates data and informs physicians about our upcoming developments in artificial intelligence expert system. This will be the main expert system engine for suggested initial diagnosis and emergency calls to the nearest hospitals for overall patient management and safety. It will depend on neural networks and case-based reasoning technologies.
The first step in our system is the sensor collects data. Mobile sensing process is shown in Figure 14, where mobile device evaluates data. Then, mobile device sends aggregated data to the telemedicine server. Then, telemedicine server evaluates data and informs physicians about our upcoming developments in artificial intelligence expert system. Sensors 2023, 23, x FOR PEER REVIEW 15 of 35 Figure 14. Mobile sensing, sensor collects data, mobile device, and telemedicine server.
In this research, analyzing data of stroke based on EMG sensors, as shown in Figure  15 of muscle readings to enable extracting best features. Then, significant features for efficient classification are selected since it determines the success of the pattern classification system. However, it is quite problematic to extract the best feature parameters from the EMG signals that can reflect the unique feature of the signal to the motion command perfectly. Hence, multiple feature sets are used as inputs to the EMG signal classification process. Some of the features are classified as time domain, frequency domain, time-frequency domain, and time-scale domain; these feature types are successfully employed for EMG signal classification. The next step is the signal classification phase. The data acquisition process ( Figure 15) consists of both real-time methodology as experiments conducted by German researchers at the Brandenburg University [14][15][16], and comparison to the offline dataset of the UCI and the Ain Shams University researchers [3] for different EMG signals channels samples, different colors, at different scaling.
The EMG signals are known for their uniqueness in every subject. An EMG sample consists of five channels:  In this research, analyzing data of stroke based on EMG sensors, as shown in Figure 15 of muscle readings to enable extracting best features. Then, significant features for efficient classification are selected since it determines the success of the pattern classification system. However, it is quite problematic to extract the best feature parameters from the EMG signals that can reflect the unique feature of the signal to the motion command perfectly. Hence, multiple feature sets are used as inputs to the EMG signal classification process. Some of the features are classified as time domain, frequency domain, time-frequency domain, and time-scale domain; these feature types are successfully employed for EMG signal classification. The next step is the signal classification phase. In this research, analyzing data of stroke based on EMG sensors, as shown in Figure  15 of muscle readings to enable extracting best features. Then, significant features for efficient classification are selected since it determines the success of the pattern classification system. However, it is quite problematic to extract the best feature parameters from the EMG signals that can reflect the unique feature of the signal to the motion command perfectly. Hence, multiple feature sets are used as inputs to the EMG signal classification process. Some of the features are classified as time domain, frequency domain, time-frequency domain, and time-scale domain; these feature types are successfully employed for EMG signal classification. The next step is the signal classification phase. The data acquisition process ( Figure 15) consists of both real-time methodology as experiments conducted by German researchers at the Brandenburg University [14][15][16], and comparison to the offline dataset of the UCI and the Ain Shams University researchers [3] for different EMG signals channels samples, different colors, at different scaling.
The EMG signals are known for their uniqueness in every subject. An EMG sample consists of five channels:  The data acquisition process ( Figure 15) consists of both real-time methodology as experiments conducted by German researchers at the Brandenburg University [14][15][16], and comparison to the offline dataset of the UCI and the Ain Shams University researchers [3] for different EMG signals channels samples, different colors, at different scaling.
The EMG signals are known for their uniqueness in every subject. An EMG sample consists of five channels: Several feature combinations have been tested for obtaining the optimal signal results, which total 128 features. The number of classes of motions is 20, which consists of 10 normal and 10 aggressive physical actions.
The following are normal: bowing, clapping, handshaking, hugging, jumping, running, seating, standing, walking, and waving. the following are aggressive: elbowing, front-kicking, hammering, heading, kneeing, pulling, punching, pushing, side-kicking, and slapping. Figure 16 shows a sample of normal and abnormal EMG signals [14][15][16]. Seven features have been selected to obtain optimal results in signal classification. A sample of extracted features with the final selected features is shown in Table 3. Several feature combinations have been tested for obtaining the optimal signal results, which total 128 features. The number of classes of motions is 20, which consists of 10 normal and 10 aggressive physical actions.
The following are normal: bowing, clapping, handshaking, hugging, jumping, running, seating, standing, walking, and waving. the following are aggressive: elbowing, front-kicking, hammering, heading, kneeing, pulling, punching, pushing, side-kicking, and slapping. Figure 16 shows a sample of normal and abnormal EMG signals [14][15][16]. Seven features have been selected to obtain optimal results in signal classification. A sample of extracted features with the final selected features is shown in Table 3.     The following datasets were used: Dataset Name: EMG lower limb dataset The EMG lower limb dataset includes different 24 patients, performing three different actions, each patient is classified as binary normal, and abnormal.

Abv. Name of Feature Definition
Dataset Name: mHealth dataset The mHealth dataset includes different 10 subjects performing different 12 actions, it also includes different measurements of subject kinematic information.
Both of the datasets are being used in the experiment for evaluating the model speed and accuracy in handling case-by-case. One patient sends two signal channels per time: the EMG signals reading and the ECG signals reading.
So, each patient has a multi-dimensional vector describing his input X = [EMG Signal Samples, ECG Signal Samples].

Results
Accuracy reached 85% with the following characteristics: A learning rate of L = 0.0001 Binary Cross Entropy Loss (BCE) function for loss measurement. The accuracy increased to 92% by using each supplied signal feature as a CNN input. The total test time was 5 s for 4 test subjects and there were a total of 24.576 test signal samples.
There are two approaches for distributing deep learning models on the cloud, as shown below.

1.
Model Parallelism: In this paradigm, there is only one model distributed on different machines or different GPUs. For example, different layers may be assigned to different machines. This paradigm is useful for big models.

2.
Data Parallelism: In this paradigm, the model is copied between more than one machine. Each model runs on a different subset of the data. Figure 17 shows a graphical explanation of the two paradigms. All computations are completed based on model parallelism. The proposed mobile AI smart hospital platform consists of two main parts, the first The following properties of the GMDH-type polynomial network have been used during the test: The proposed mobile AI smart hospital platform consists of two main parts, the first one uses the stack CNN as AI cloud-based and the other GMDH and LSTM modules are used for the mobile AI app. For the first one, its main objective is presented as a new real-time CNN-stroke and Stroke and BAN-IOT: a deep learning model for signal deep feature extraction and classification within a cloud streaming environment. First, the use of stacked CNN is for handling the big data streaming of several signals sent from wearable sensors and body area networks (BAN) that include a variety of signals that do not correlate with each other on a shallow feature level. Scenario: Table 4 shows a sample of the EMG 8 channels. (2) The evaluation method has tested different modules for the overall new smart hospital platform, as AI-based software implementation only.
Performance and results on cloud platform (GCP): Table 5 shows the performance and validation accuracy that has been achieved in our experiments, when implementing the stacked CNN, using two Nvidia GPUs. The use of AI-based cloud simulating the smart hospital platform. Tests have been conducted on 10 different physical action signals, the test samples have been divided almost evenly, 5k samples for training and~5k samples for testing the prediction. Table 6, below, shows the characteristics and scores for a selected subject, where 10 aggressive different actions are predicted using the GMDH-type neural network.
On average, among the selected subjects the model is able to predict accurately 96.02% of the signal in low time.
The table below shows the characteristics and scores for the previously selected subject, 10 normal different actions are predicted using the GMDH-type neural network.
As shown in Table 6, different experiments have been conducted by training the GMDH deep learning model on different input sizes of EMG channels. As illustrated different layers have different accuracies, according to input sizes of EMG 4 channels, EMG 8 channels, and EMG of mHealth dataset. On average, the selected subject's model is able to predict accurately 96.85% of the signal in low time. The following plots demonstrate two selected actions from the previously conducted test experiments, 1 normal action, and 1 aggressive action.
As shown in Table 7, different experiments have been conducted by training the GMDH deep learning model on different input sizes of EMG channels. As illustrated  The graphs from Figure 16 show the following: original signal in red, read from the EMG sensor; predicted signal in orange, using the GMDH-type neural network; overlapped signals, to show prediction visual accuracy, in both blue and green, and finally, the crosscorrelation between the two signals. It can be clearly seen how visually both signals (predicted and original) are almost equal.
The graphs from Figure 18 show the following: original signal in red, read from the EMG sensor; predicted signal in orange, using the GMDH-type neural network; overlapped signals, to show prediction visual accuracy, in both blue and green, and finally, the crosscorrelation between the two signals. It can be clearly seen how visually both signals (predicted and original) are almost equal. The following plots demonstrate two selected actions from the previously conducted test experiments, 1 normal action, and 1 aggressive action.
As shown in Table 7, different experiments have been conducted by training the GMDH deep learning model on different input sizes of EMG channels. As illustrated different layers have different accuracies, according to input sizes of EMG 4 channels, EMG 8 channels, and EMG of mHealth dataset. The graphs from Figure 16 show the following: original signal in red, read from the EMG sensor; predicted signal in orange, using the GMDH-type neural network; overlapped signals, to show prediction visual accuracy, in both blue and green, and finally, the cross-correlation between the two signals. It can be clearly seen how visually both signals (predicted and original) are almost equal.
The graphs from Figure 18 show the following: original signal in red, read from the EMG sensor; predicted signal in orange, using the GMDH-type neural network; overlapped signals, to show prediction visual accuracy, in both blue and green, and finally, the cross-correlation between the two signals. It can be clearly seen how visually both signals (predicted and original) are almost equal.  We show the same graph but for a normal action below ( Figure 19). We show the same graph but for a normal action below ( Figure 19). In addition, based on the previous plot it can be clearly seen how visually both signals (predicted and original) are almost equal.
The previous graph shows a prediction sample of an aggressive action (Figure 20), and the following graph shows a closer look at the signal prediction details (Figure 21).  In addition, based on the previous plot it can be clearly seen how visually both signals (predicted and original) are almost equal.
The previous graph shows a prediction sample of an aggressive action (Figure 20), and the following graph shows a closer look at the signal prediction details (Figure 21). We show the same graph but for a normal action below ( Figure 19). In addition, based on the previous plot it can be clearly seen how visually both signals (predicted and original) are almost equal.
The previous graph shows a prediction sample of an aggressive action (Figure 20), and the following graph shows a closer look at the signal prediction details (Figure 21).  It can be clearly seen how the signal-predicted values are close to the original values, most of the values differ by a small value that will not change the behavior of the signal when analyzed, thus preserving the information needed for further analysis and classifications.
Next, we will show each subject test scores and then show a generalized average score for the model with training statistics.
The aggressive action/normal action test scores for Subject 1/Subject 2 are shown in Tables 8-11. It can be clearly seen how the signal-predicted values are close to the original values, most of the values differ by a small value that will not change the behavior of the signal when analyzed, thus preserving the information needed for further analysis and classifications.
Next, we will show each subject test scores and then show a generalized average score for the model with training statistics.
The aggressive action/normal action test scores for Subject 1/Subject 2 are shown in Tables 8-11. Table 8. Aggressive action test scores for Subject 1, the overall average reaches 90.01%, after training of adjusting different layers with different EMG signals channels as used before in Table 4, and    Table 9. Normal action test scores for Subject 1, the overall average reaches 81.43%, after training of adjusting different layers with different EMG signals channels as used before in Table 4 and The confusion matrix of the GMDH model is shown in Figure 22 is generated. Different metrics generated by the neural network of results are shown in Figure 23. This is to show for classification of whether the action is aggressive or normal a combination of results between normal action test scores and aggressive action test scores in order to more clearly see the results from these two cases. Additionally, the classification report represented below helps us to better understand the metrics taken into consideration for this example (Table 12). clearly see the results from these two cases. Additionally, the classification report represented below helps us to better understand the metrics taken into consideration for this example (Table 12).  clearly see the results from these two cases. Additionally, the classification report represented below helps us to better understand the metrics taken into consideration for this example (Table 12).    In Figure 23, section (a) presents the accuracy evaluation and (b) presents f1 score F = 2· precision·recall precision + recall Accuracy = tp + tn tp + tn + f p + f n In Figure 24, section (a) the loss function is represented while in section (b) the learning rate is represented. We can see that the learning rate is constant and equal to 10 −3 .
precision recall + tp tn Accuracy tp tn fp fn + = + + + In Figure 24, section (a) the loss function is represented while in section (b) the learning rate is represented. We can see that the learning rate is constant and equal to 10 −3 .  In Figure 25, the precision is represented regarding epochs (a) and in section (b) the recall is presented also regarding epochs. In Figure 25, the precision is represented regarding epochs (a) and in section (b) the recall is presented also regarding epochs. = false negative In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed Mobile AI Engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs. The highest precision of parallel LSTM achieves 99.9% and the average reaches 93.65%. Figure 26a shows LSTM Model training visualization, while Figure 26b shows LSTM accuracy curves and Figure 26c shows the overall hybrid LSMT model results. The main advantage of the parallel LSTM model is that it is more stable than the GMDH deep learning model, as it has been trained on the 8 EMG channels directly. In addition, the parallel LSTM model is more suitable for the implementation of the mobile health app. However, the GMDH deep learning is good as a standalone predictive model In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed Mobile AI Engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs. The highest precision of parallel LSTM achieves 99.9% and the average reaches 93.65%. Figure 26a shows LSTM Model training visualization, while Figure 26b shows LSTM accuracy curves and Figure 26c shows the overall hybrid LSMT model results. The main advantage of the parallel LSTM model is that it is more stable than the GMDH deep learning model, as it has been trained on the 8 EMG channels directly. In addition, the parallel LSTM model is more suitable for the implementation of the mobile health app. However, the GMDH deep learning is good as a standalone predictive model for mathematical modeling EMG signal predictions. In contrast to many GMDH deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs. The highest precision of parallel LSTM achieves 99.9% and the average reaches 93.65%. The main advantage of the parallel LSTM model is that it is more stable than the GMDH deep learning model, as it has been trained on the 8 EMG channels directly. Additionally, the parallel LSTM model is more suitable for the implementation of the mobile health app. However, the GMDH deep learning is good as a standalone predictive model for mathematical modeling EMG signal predictions. The GMDH could predict most of the signals tested accurately with a high R 2 score, below is a table of the general prediction scores tested on both normal and aggressive action signals. It can be seen that the ability of the GMDH to predict more event-based signals wit more peaks and more aggressive spectrum, is higher than normal signals with lower peak and less aggressive spectrum.
The main disadvantages of the GMDH model are that: the peaks of the signal for th GMDH predictive model vary from channel to channel of the EMG signal. In addition the GMDH training algorithm takes a lot of memory for training and may crash, it crashe at using memory above 13 GB and it is not suitable for real industry applications of th mobile AI health app. GMDH is a self-organizing approach by which gradually compli cated models are generated based on the evaluation of their performances on a set o multi-input-single-output data. However, it is good for modeling the predictive analytic modeling of the stroke prediction system. It can be tested in the future for cloud compu It can be seen that the ability of the GMDH to predict more event-based signals with more peaks and more aggressive spectrum, is higher than normal signals with lower peaks and less aggressive spectrum.
The main disadvantages of the GMDH model are that: the peaks of the signal for the GMDH predictive model vary from channel to channel of the EMG signal. In addition, the GMDH training algorithm takes a lot of memory for training and may crash, it crashes at using memory above 13 GB and it is not suitable for real industry applications of the mobile AI health app. GMDH is a self-organizing approach by which gradually complicated models are generated based on the evaluation of their performances on a set of multi-inputsingle-output data. However, it is good for modeling the predictive analytics modeling of the stroke prediction system. It can be tested in the future for cloud computing or AI high-performance computing side. However, the hybrid parallel LSTM model is suitable for the mobile AI app implementation in python.

Discussion
The GMDH could predict most of the signals tested accurately with a high R 2 score, and below is a table of the general prediction scores tested on both normal and aggressive action signals (Table 13). It can be seen that the ability of the GMDH to predict more event-based signals with more peaks and more aggressive spectrum is higher than normal signals with lower peaks and less-aggressive spectrum. Below are two different plots for a normal signal and an aggressive signal (Figures 27 and 28).  It can be seen that the ability of the GMDH to predict more event-based signals with more peaks and more aggressive spectrum is higher than normal signals with lower peaks and less-aggressive spectrum. Below are two different plots for a normal signal and an aggressive signal (Figures 27 and 28).  Concerning the mobile GMDH algorithms analysis and time series data forecasting: As we know time series data are a sequence of data taken in multiple time stamps. There are two main goals of processing time series data. The first task is to try to classify the data into predefined subcategories. The second task is to predict the future of the input data using the current data. There are multiple algorithms to process and forecast time series data (parametric and nonparametric algorithms). These include:   It can be seen that the ability of the GMDH to predict more event-based signals with more peaks and more aggressive spectrum is higher than normal signals with lower peaks and less-aggressive spectrum. Below are two different plots for a normal signal and an aggressive signal (Figures 27 and 28).  Concerning the mobile GMDH algorithms analysis and time series data forecasting: As we know time series data are a sequence of data taken in multiple time stamps. There are two main goals of processing time series data. The first task is to try to classify the data into predefined subcategories. The second task is to predict the future of the input data using the current data. There are multiple algorithms to process and forecast time series data (parametric and nonparametric algorithms). These include:  Concerning the mobile GMDH algorithms analysis and time series data forecasting: As we know time series data are a sequence of data taken in multiple time stamps. There are two main goals of processing time series data. The first task is to try to classify the data into predefined subcategories. The second task is to predict the future of the input data using the current data. There are multiple algorithms to process and forecast time series data (parametric and nonparametric algorithms). These include: Given a training set, layers are incrementally growing by regression analysis, then pruned with help of a validation set. The number of layers and units can be learned in a problem-dependent fashion. When the architecture consists of a multilayer procedure, it becomes equivalent to the artificial neural network with a polynomial activation function of neurons.
GMDH only: We tried to follow the same conditions described in the previous report for training and testing but we faced some problems due to missing information in the report so we tried our best to reproduce the previous results which takes a lot of time and effort. We depended on the python implementation "GmdhPy".
There are multiple ways we can define our inputs and our outputs given that we have four people. Every person has 20 actions given by 8 channels. At the same time, we have extracted 39 features from this data as described in previous reports. This gives us a huge space for experiments. We did not test all the possibilities, we only focused on the most obvious ones. We framed the problem as univariate time series forecasting. So the input is one channel for one subject. For example, we took the fourth subject with his first channel (RBic) and tried to predict the future of this channel as shown in Figure 29. most obvious ones. We framed the problem as univariate time series forecasting. So the input is one channel for one subject. For example, we took the fourth subject with his first channel (RBic) and tried to predict the future of this channel as shown in Figure 29. There are multiple hyper-parameters that should be studied in the future such as how many readings from the past should we depend on and the hyperparameters of the GMDH.
We took 10 readings from the past and tries to predict the next one in the future. As our scope was to develop a fast and stable model for the deployment we considered these settings: Calculating the root mean square error in time series data forecasting is very critical. So, we considered the time shift that happened in the data generation. So, GMDH achieved a train score of 1103.42 RMSE and a test score of 968.80 RMS We see that the predicted signal has higher peaks than the original signal as shown in Figure 30. The problem with the GMDH-only setup is that the network did not keep the previous information from the previous time steps or residual connections. As described in this paper, when GMDH is combined with some information from the previous time steps it can give better performance with residual connections results as shown in Figure 31. There are multiple hyper-parameters that should be studied in the future such as how many readings from the past should we depend on and the hyperparameters of the GMDH.
We took 10 readings from the past and tries to predict the next one in the future. As our scope was to develop a fast and stable model for the deployment we considered these settings: ref_functions = ('linear_cov'), criterion_type = 'validate', criterion_minimum_width = 5, stop_train_epsilon_condition = 0.001, layer_err_criterion = 'top', l2 = 0.5, manual_best_neurons_selection = True, min_best_neurons_count = 30, Calculating the root mean square error in time series data forecasting is very critical. So, we considered the time shift that happened in the data generation. So, GMDH achieved a train score of 1103.42 RMSE and a test score of 968.80 RMS We see that the predicted signal has higher peaks than the original signal as shown in Figure 30. The problem with the GMDH-only setup is that the network did not keep the previous information from the previous time steps or residual connections. As described in this paper, when GMDH is combined with some information from the previous time steps it can give better performance with residual connections results as shown in Figure 31.
achieved a train score of 1103.42 RMSE and a test score of 968.80 RMS We see that the predicted signal has higher peaks than the original signal as shown in Figure 30. The problem with the GMDH-only setup is that the network did not keep the previous information from the previous time steps or residual connections. As described in this paper, when GMDH is combined with some information from the previous time steps it can give better performance with residual connections results as shown in Figure 31. Training and testing: The number of subjects samples is four subjects, three male and one female. Three of them are because Subject 2 has noisy data.
We found that there is a pattern between the action and the aggressiveness, for example, when you are running this state is classified as normal. The upper body returns normal signals but the lower body returns high peak signals. Although in the aggressive class, for example, the lower and upper body return relatively high peak signals, as illustrated in Figure 32. In Figure 33, the subject is in an aggressive move, he is doing an elbowing move. We found that all eight channels have the same pattern of highly distributed points in all the graphs with some concentration in the middle. The number of subjects samples is four subjects, three male and one female. Three of them are because Subject 2 has noisy data.
We found that there is a pattern between the action and the aggressiveness, for example, when you are running this state is classified as normal. The upper body returns normal signals but the lower body returns high peak signals. Although in the aggressive class, for example, the lower and upper body return relatively high peak signals, as illustrated in Figure 32. Training and testing: The number of subjects samples is four subjects, three male and one female. Three of them are because Subject 2 has noisy data.
We found that there is a pattern between the action and the aggressiveness, for example, when you are running this state is classified as normal. The upper body returns normal signals but the lower body returns high peak signals. Although in the aggressive class, for example, the lower and upper body return relatively high peak signals, as illustrated in Figure 32. In Figure 33, the subject is in an aggressive move, he is doing an elbowing move. We found that all eight channels have the same pattern of highly distributed points in all the graphs with some concentration in the middle.  In Figure 33, the subject is in an aggressive move, he is doing an elbowing move. We found that all eight channels have the same pattern of highly distributed points in all the graphs with some concentration in the middle. Long Short-Term Memory Network (LSTM): Maintain state (memory) across very long sequences. Temporal information was used because LSTM is very sensitive to the data ranges we applied. Data normalization and scaling were used in the input and output. We used standard scaling for the input. It can be solved using linear activation in the output layer.
A novel stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs 612 a novel architecture consisting of a group method of data handling and multiple LSTM recurrent neural networks. Results: Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and real-time execution on IoMT devices. Conclusion: In 616 contrast to many compute-intensive deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
Mobile open architecture. We adopted an open architecture to make it easy for any researcher. They will be able to add their own model to the mobile in an easy way. Most mobile apps are currently closed systems, meaning that the researcher cannot add his/her model without many changes. Any model can be added instead of our trained models.
Findings: The first idea about the data, it has a huge variance and the data are not zero, meaning there are two levels of classification. The first level is normal or aggressive. This level depends on the activities which the subjects are doing to differentiate between the two classes. In the second level, for every state from level 1 (normal or aggressive), the subject was doing 10 activities for normal and 10 activities for aggressive.
To conclude, the signal for both classes normal and aggressive was processed and there were observed differences that helped us extract characteristics such as integrated EMG, modified mean absolute value 1, root mean square, average amplitude change, zero crossing, Willison amplitude, and waveform length. Great results were obtained by using a deep learning model, the new GMDP deep learning model.
Bearing all these things in mind, a model that can detect these two sides of behavior may help people that suffered a stroke to communicate easier and interact more with the Long Short-Term Memory Network (LSTM): Maintain state (memory) across very long sequences. Temporal information was used because LSTM is very sensitive to the data ranges we applied. Data normalization and scaling were used in the input and output. We used standard scaling for the input. It can be solved using linear activation in the output layer.
A novel stroke prediction algorithm is proposed based on EMG signals prediction. The proposed solution employs 612 a novel architecture consisting of a group method of data handling and multiple LSTM recurrent neural networks. Results: Experimental evaluations show superior EMG prediction performance compared to previous works. Measurements with different deep learning methods as combining CNN with LSTM show that the proposed algorithm meets performance requirements for continuous and realtime execution on IoMT devices. Conclusion: In 616 contrast to many compute-intensive deep learning-based approaches, the proposed algorithm is lightweight for the proposed mobile AI engine, and therefore, brings continuous diagnosis and prediction with accurate GMDH-LSTM-based EMG signal prediction to IoMT simulated inputs.
Mobile open architecture. We adopted an open architecture to make it easy for any researcher. They will be able to add their own model to the mobile in an easy way. Most mobile apps are currently closed systems, meaning that the researcher cannot add his/her model without many changes. Any model can be added instead of our trained models.
Findings: The first idea about the data, it has a huge variance and the data are not zero, meaning there are two levels of classification. The first level is normal or aggressive. This level depends on the activities which the subjects are doing to differentiate between the two classes. In the second level, for every state from level 1 (normal or aggressive), the subject was doing 10 activities for normal and 10 activities for aggressive.
To conclude, the signal for both classes normal and aggressive was processed and there were observed differences that helped us extract characteristics such as integrated EMG, modified mean absolute value 1, root mean square, average amplitude change, zero crossing, Willison amplitude, and waveform length. Great results were obtained by using a deep learning model, the new GMDP deep learning model.
Bearing all these things in mind, a model that can detect these two sides of behavior may help people that suffered a stroke to communicate easier and interact more with the environment. Even though this study was intended to help patients of a hospital, there may be applications in the security domain field in order to predict aggressive behavior.
In addition, a new hybrid LSTM/dense deep learning architecture has been added with detailed experimental results for EMG stroke prediction and as compared to GMDH, it is better as a parallel model that takes as input all the EMG 8 channels with high results; however, the GMDH algorithm can be easily deployed as mobile AI app with high accuracies. More results need to be tested in future work for parallel inputs to the GMDH algorithm. However, both models achieve high stroke prediction accuracies. Additionally, a single LSTM module has been integrated into the GMDH algorithm for enhancing the mobile AI implementation for stroke prediction.

Conclusions and Future Work
Artificial intelligence (AI) technologies in smart health patients' safety and care for connected health and intelligent diagnostics and predictive health ai edge computing integrated within smart hospital environments, have opened up new opportunities in healthcare systems and complex disease predictions and early detection for issues such as heart and stroke diseases. Smart hospital technologies [21][22][23][24][25][26][27] are a steadily growing field in artificial intelligence (AI), biomedical big data analytics [44][45][46][47] Internet of Medical Things (IoMT). Heart diagnosis and stroke prediction have urgent patient cases that may cause problems such as cardiovascular diseases [28][29][30][31], heart attacks, and brain strokes. It may also cause sudden death. These are the leading cause of death in the Middle East, Europe, and the United States. For these reasons, heart and stroke diseases are considered emergency cases. In the recent research of artificial intelligence technologies in the healthcare domain, what we witness is high competition and new revolution [1][2][3]. However, today's AI research and development of technologies in the fields of heart diseases diagnosis [16][17][18][19][20] and stroke prediction research are still missing a real-time AI-based heart diagnosis and stroke prediction system to be developed as AI-based platform R&D to be used in the industry and the new era of smart hospital developments [21][22][23][24][25][26][27]. This research paper innovation introduces a new AI system design that consists of an integrated real-time IOT-AI smart heart/stroke platform to be in the future inside hospitals as a new IoMT-AI-based heart/stroke platform and as an independent mobile AI telemedicine system for stroke prediction. Artificial intelligent IOT hospital edge-connected health diagnostic and predictive systems integrated with telemedicine services for both elder patients with chronic and brain stroke cases aim to help heart/brain stroke patients to discover their disease once it occurs based on EEG/ECG/EMG signals. However, classifying real-time ECG/EMG signals [14,15] is a complex task, especially for patient muscle signal feedback problems. This proposal introduces an integrated artificial intelligence telemedicine platform including AI software for heart disease diagnosis, and AI software for brain stroke diagnosis and prediction. The highest precision of parallel LSTM achieves 99.9% and the average reaches 93.65%. AI/DL telemedicine services could be useful for the nearest hospital and patients' telemonitoring at-home care services. In this research paper, we have only presented some innovative research results for the full mobile AI system cycle, and some real implementations in simulated tests. Our solution is more innovative than previous research on stroke prediction using only single deep learning or some sample stroke cases such as during sleeping, as discussed previously in the paper in Section 2.
This research article also presents an overall state-of-the-art artificial intelligence mobile health system architecture for stroke that can be implemented by AI and IoT companies such as Dell technologies for real-life scientific implementations. The main focus is on predictive analytics and edge computing solutions in healthcare and emergency situations.
Predictive analytics deals with information retrieval to predict an unknown event of interest, typically a future event. Using technology that learns from data to predict these unknown events could drive better decisions. This research paper utilizes the concepts of deep learning (GMDH) for signal predictions for mobile edge computing future implementations of complete solutions in a smart health home living scenario.
This research paper has successfully presented several steps in the predictive analytics process: identification of the problem and a determination of the outcomes and objectives is a crucial first step. The first data of the model used EMG from both real-time and offline datasets.
Future work may include different ways to process the signal, and data processing time concerning the processing time of GMDH it has taken nearly 30 min for the training, but the networking connections are out of the scope of this article and we are honored to present them in the future work of next phase of app implementation. Additionally, other deep learning model architectures will be presented in order to achieve better precision in classification. In addition, for the XAI decision-based tools, extended parts of TIER 2 and TIER 3 of fuzzy logic and surveillance-based systems will be considered for future works. Additionally, future work may include different ways to process the signal, and other deep learning model architectures in order to achieve better precision in classification.