1. Introduction
By 2050, at least 3.6 million Americans are expected to be living with a missing limb, a prediction that is driven primarily by the aging population and rising rates of diabetes [
1]. In response to this growing demand, advancements in both passive and active prosthetic devices, orthoses, and assistive robotics are imperative as they aim to enhance users’ independence in performing activities of daily living (ADLs) and ultimately, improve their quality of life. However, controlling these devices remains a significant challenge for many users. The complexity of operating such devices often leads to frustration and, in some cases, abandonment by patients [
2], which reinforces the importance of developing more intuitive and user-friendly systems.
Brain–computer interfaces (BCIs) [
3] offer a promising solution by utilizing non-invasive methods to capture the brain’s electroencephalogram (EEG) signals to infer a user’s intended muscle and joint movements. However, EEG signals are inherently noisy, highly variable, and exhibit low signal-to-noise ratios, which makes them difficult to use directly for reliable device control.
To bridge this gap, robust intermediary models are essential. Motion capture (MoCap) [
4] technology plays a vital role by providing precise recordings of human joint angles and limb trajectories during motion, which can serve as ground truth data for training models. Furthermore, inverse kinematics (IK) [
5] techniques enable the computation of the joint configurations that are necessary to achieve desired end-effector (hand) positions for controlling assistive or robotic limbs based on intention signals. Together, MoCap and IK methods represent a comprehensive framework for mapping motion into robotic applications, yet integrating this information with real-time EEG decoding remains a significant research challenge.
While many EEG decoding studies rely on the use of discrete motion labels or electromyographic (EMG) signals as output parameters, a smaller subset has explored the use of MoCap data—such as joint angles and limb trajectories—as continuous targets for regression-based models. For instance, studies have demonstrated that low-frequency EEG features could be used to decode 3D upper-limb trajectories captured via MoCap [
6]. Others have also shown that hand kinematics derived from MoCap could be predicted from EEG signals using multivariate linear regression, although with only moderate accuracy [
7].
Furthermore, ongoing debates exist regarding whether interpretable, traditional machine learning models offer greater reliability for EEG decoding compared to complex deep learning architectures, which may suffer from overfitting and a lack of transparency [
8]. Recent advances in generative pre-trained transformer (GPT) [
9] technology provide a promising avenue by offering rapid and relatively accurate approximation models for a variety of applications, including biomedical contexts. One key advantage of GPT models is their ability to recognize complex data patterns, form correlations between disparate datasets, and effectively fine-tune noisy inputs to capture essential relationships [
10].
In this study, ChatGPT (based on the GPT-4 API) was selected for its unique ability to translate natural language prompts into executable code. Unlike other generative models or traditional AutoML systems, ChatGPT-4o provides a highly accessible interface that enables researchers to iteratively refine models, preprocess data, and scaffold reproducible workflows without having to perform low-level programming from scratch. In contrast to prior EEG–MoCap pipelines that rely on manual code development or AutoML frameworks, our study leverages the GPT-4 API to drive prompt-based code generation, annotation, and pipeline orchestration. This approach reduces boilerplate, embeds documentation directly into scripts, and creates a reproducible ‘prompt log’ of our development process, all while leaving core model training and inference to conventional TensorFlow Version 2.14.0 (Google LLC, Mountain View, CA, USA) [
11] and scikit-learn version 1.2.2 [
12] routines. Given the increasing integration of generative AI tools in research and industry, it is important to critically assess not only their capabilities, but also their limitations and potential biases. While ChatGPT excels at generating coherent text and code snippets, its ability to generate scientifically rigorous and reliable machine learning models remains an open question.
Considering these capabilities and limitations, this study systematically investigates whether ChatGPT can generate machine learning pipelines that are capable of correlating EEG signals with human arm joint angles derived from motion capture (MoCap) data. Specifically, we evaluate the accuracy, quality, and usability of the studied models in approximating the relationship between EEG recordings and basic arm movements by leveraging the GPT-4 API for prompt-driven code generation and data wrangling scripts, while the core regression and classification architectures that we use are CNN-LSTM and Random Forest, respectively. The CNN-LSTM architecture used for classification was implemented in TensorFlow and trained using a supervised sliding window approach. Details of the model architecture and training configuration, including the data segmentation and performance metrics, are presented in
Section 2.3. This approach is consistent with the CNN-LSTM framework presented in [
13], which outlines a TensorFlow-based training and deployment pipeline. In this workflow, ChatGPT-4o expedites the pipeline assembly and prompt engineering, but all model weights and hyperparameters are learned via conventional deep learning routines outside the ChatGPT-4o environment, which allows end users to deploy and run the system independently without the need for repeated access to or reliance on the GPT API at runtime.
This preliminary effort lays the groundwork for the accurate and efficient EEG-based control of active assistive devices. Specifically, it contributes to the field of advanced controls by demonstrating how modeling EEG-based human behavior data can enhance the accuracy of robotic and prosthetic control. By enabling more intuitive and reliable interfaces, the approach has the potential to reduce user frustration and decrease the likelihood of device abandonment due to poor or confusing control methods.
To rigorously evaluate this approach, this study is structured around three core objectives, each of which addresses a key aspect of system development and validation. The remainder of this paper is organized as follows:
Section 2 presents the Materials and Methods, which are structured around three core objectives. Objective 1 validates the use of ChatGPT for modeling simple numerical relationships. Objective 2 involves the collection of synchronized EEG and motion-labeled video data to classify five distinct arm motions. Objective 3 integrates marker-based motion capture (MoCap) data to enable joint-angle regression using CNN-LSTM models. This Section also details the technical workflows used, including EEG artifact filtering, MoCap signal parsing, and the full data synchronization and modeling pipeline that was scaffolded using GPT-generated scripts.
Section 3 presents the experimental results corresponding to each objective, while
Section 4 discusses the implications of these findings. These sections highlight the effectiveness of ChatGPT in facilitating the rapid development of EEG-based pipelines, while also addressing the limitations of the current approach and offering recommendations for future improvements.
2. Materials and Methods
This study aimed to develop and evaluate a machine learning model capable of accurately correlating EEG signals with human arm motion, and to thereby enable the non-invasive control of active assistive devices. To accomplish this, the methodology was structured around three progressive objectives.
Objective 1: ChatGPT’s ability to model numerical relationships was validated by solving and predicting simple input–output correlations. This initial step established a foundational level of computational reasoning, ensuring that the model could accurately relate input variables to output predictions;
Objective 2: Synchronized EEG signal collection and video-based motion labeling were introduced. The model was trained to filter, normalize, and classify EEG signals corresponding to distinct arm movements, which enabled the association of brain activity patterns with arm motion events;
Objective 3: Marker-based motion capture (MoCap) data were integrated to provide enriched motion parameters, including joint angles and velocities, which are critical for advanced motion modeling and inverse kinematics applications in robotic control.
All data were collected from one healthy adult volunteer (age 28, male). While this design provided tight control over electrode placement, contact quality, and marker consistency, it limited inter-subject generalizability. Future work will recruit a diverse cohort (varying age, gender, anthropometry) to validate transferability across participants.
2.1. Objective 1: Numerical Relationship Modeling
For Objective 1, we evaluated whether the ChatGPT-4o model could correlate numerical variables, such as x and y. The model was trained on input/output data generated through a known equation and was then asked to predict the output for novel data that were not used for model training. This allowed for the accurate estimation of output variables without explicitly requiring exact mathematical models. The model was first given input values of x and output values of y and asked to derive a model relating the two. Once it had built a model representing the relationship, it was asked to generate estimated values of y for a random input of x. The values predicted by the model were then directly compared to known output values that were not used in training, and these comparisons were used to compute the percentage accuracy of the results.
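To make this procedure concrete, the following minimal Python sketch illustrates the same fit-and-predict workflow outside ChatGPT; the generating function, test inputs, and accuracy formula are illustrative assumptions rather than the exact prompts or equations used in the study.

# Minimal sketch of the Objective 1 workflow: fit a candidate model to known
# (x, y) pairs, predict y for unseen x, and report a percentage accuracy.
# The generating function and test points are illustrative assumptions.
import numpy as np
from scipy.optimize import curve_fit

def generator(x):
    # Known equation used to create the training data (here, y = 2x).
    return 2.0 * x

def linear(x, a, b):
    # Candidate model whose parameters are fitted to the data.
    return a * x + b

x_train = np.array([1, 2, 3, 4, 5], dtype=float)
y_train = generator(x_train)                      # y = 2, 4, 6, 8, 10

params, _ = curve_fit(linear, x_train, y_train)   # expect a ~ 2, b ~ 0

x_test = np.array([6, 8, 10], dtype=float)        # novel inputs not used in fitting
y_pred = linear(x_test, *params)
y_true = generator(x_test)                        # known outputs held out for comparison

percent_accuracy = 100.0 * (1.0 - np.abs(y_pred - y_true) / np.abs(y_true))
print(f"fitted model: y = {params[0]:.3f}x + {params[1]:.3f}")
print("percentage accuracy per test point:", percent_accuracy)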
2.2. Objective 2: Synchronized EEG Signal Acquisition and Time-Stamped Video Recording
Upon successful completion of Objective 1, we advanced to Objective 2, which involved acquiring time-stamped video recordings of the participant’s arm movements. These motions were categorized into five discrete arm motion classes: forward shoulder flexion, backward shoulder extension, lateral shoulder abduction, arm swing during gait, and “no motion” or idle. These arm movements were recorded on video while EEG data were captured simultaneously, which enabled the synchronized analysis of motion and brain activity. The EEG setup involved two primary steps: contact quality and EEG quality configuration. These steps primarily consisted of applying a saline solution to the electrodes to ensure proper contact and adjusting the headset on the user to optimize comfort and positioning. This process was conducted using the EMOTIV Pro Lite application (Version 4.5.7.570, Emotiv Inc., San Francisco, CA, USA), in line with the comprehensive guidelines provided within the application [
14]. Once the electrodes were correctly positioned, their indicator lights turned green and the quality percentage reached 100%, as shown in the EMOTIV Pro Lite application.
A total of 18 trials were performed: eight using the high-end EEG headset (16-channel Emotiv EPOC+, Emotiv Inc., San Francisco, CA, USA) [
15] and the remaining 10 using the low-end headset (5-channel Emotiv Insight, Emotiv Inc., San Francisco, CA, USA) [
16]. The recordings were processed into files containing arm motions and their respective timestamps.
After initial data collection, the GPT model was trained to recognize EEG data, with a focus on identifying unreliable datasets, eliminating large signal peaks, and disregarding timestamps with extraneous points. After preprocessing, the model was trained using EEG signal segments paired with the annotated time intervals for each of the five distinct motion classes, which enabled it to learn precise correlations between patterns of brain activity and the corresponding arm movements. Upon training, the model’s accuracy was tested by comparing its predictions, based on novel EEG data, to the actual movements. This training and testing procedure is illustrated in
Figure 1.
2.3. Objective 3: MoCap Integration
2.3.1. Data Collection Procedure
After timestamping and EEG decode testing, the MoCap data collection method was introduced in Objective 3. It served as a more complex data source that can be used directly in IK tools. To maintain simplicity and consistency across trials, movements were categorized into five groups: random arm motion, which involved a mixture of forward, backward, and sideways motions; forward motion, consisting of forward shoulder flexion; backward motion, consisting of backward shoulder extension; sideways motion, consisting of lateral shoulder abduction; and common human motions, which included drinking water, giving a high five, arm swing during gait (walking), and reaching to grab an object. At this stage, a minor adjustment was made to the EEG setup: the 16-channel EPOC+ headset was removed to simplify the dataset; therefore, all the subsequent datasets were collected using only the 5-channel Insight EEG headset. The open-source application CyKIT 3.0 for Python 3.7.x (CymatiCorp, open-source GitHub project) [
17] was used to perform the data collection for all trials at a sampling rate of 128 Hz. Each trial was conducted over a 30 s interval to ensure standardized recording conditions across all categories. A total of 30 trials were conducted. The motion-capture data collection was conducted using Vicon Nexus (Version 1.8.4, Vicon Motion Systems Ltd., Centennial, CO, USA) [
18] at a sampling rate of 120 Hz.
The workflow began with the setup and calibration of the Vicon camera system, which included camera calibration and ambient setup. Markers were then placed on the subject using the upper body Plug-in-Gait marker set as shown in
Figure 2. Following marker placement, a static trial was performed to calibrate the subject, a step required only once per session. Once calibrated, the subject proceeded with the 30 s trials. All 30 trials were conducted using the Insight headset. The setup process took approximately one to one and a half hours, and the overall data collection time, including the actual trials, ranged from two to two and a half hours. Note that a single subject performed all trials in this study.
2.3.2. Data Processing Procedure
Upon data collection, the data were postprocessed to prepare them for model training. The postprocessing began with motion capture, where the Vicon Nexus version 1.8.4 software was utilized to conduct data review and export preparation. First, the trial data were loaded within the Data Management pane. Visual inspection of the captured markers and the subject’s movement in the 3D workspace ensured that all markers were tracked throughout the motion. Marker trajectories were examined to identify any inconsistencies or noisy data points. Analysis required confirmation that the capture volume was fully covered, and that the subject’s motion was adequately recorded. Marker trajectory reconstruction was conducted to generate 3D marker trajectories from the 2D camera views. After reconstruction, all markers were reviewed to ensure alignment with the subject’s anatomy and movement.
Using Vicon Nexus, marker trajectories were first auto-labeled and then manually corrected to align precisely with the Plug-in-Gait biomechanical model (
Figure 2) and ensure that each marker was accurately identified. These were visually inspected using the Vicon Nexus software [
18] and were exported via the software’s ASCII functionality.
The data that were exported included trial information such as the start time, date, and sampling rate, as well as joint data from the right wrist, elbow, and shoulder. By utilizing the ASCII export function, we ensured that the exported data were suitable for deriving equations necessary for inverse kinematics. At this stage, minor EEG timeline trimming and empty-channel removal were performed, while artifact handling was deferred to the next stage.
With both EEG and MoCap datasets exported and preprocessed, the focus shifted to aligning their sampling rates and timestamps to prepare for integrated analysis. This process verified that both systems collected the same number of data points per minute (sampling rate) and that the start and end times of the recordings matched, enabling seamless integration of motion and brain activity data. The workflow behind this process is depicted in
Figure 3.
As depicted in
Figure 3, eeg_parser.py first ingests each raw EEG CSV by reading its header line to extract metadata fields—most importantly the original sampling rate (default 128 Hz) and absolute recording start timestamp—which are stored in a FileMetadata dataclass for later use. The module then loads the signal rows into a pandas (Version 2.2.2) DataFrame (pandas development team, open-source, USA) [
19], retaining only the channels AF3, T7, Pz, T8, and AF4, and applies a conversion function to each integer pair to convert raw readings into accurately scaled voltage values using the Emotiv EPOC+ calibration formula. Each channel is passed through a zero-phase FIR low-pass filter (Blackman–Harris window, 55 Hz cutoff) via the reusable filt_lowpass() routine, which effectively removes high-frequency noise. Filtering the signal in both the forward and reverse directions means that any phase shifts introduced during the first pass are exactly counteracted on the return pass, which ensures that the temporal relationships between neural oscillations remain unchanged. The filter’s cutoff at 55 Hz effectively removes muscle and environmental artifacts above this threshold, yielding a cleaner signal that retains the physiologically relevant components necessary for accurate downsampling and subsequent multimodal synchronization.
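As a point of reference, the following sketch shows how such a zero-phase low-pass step could be implemented; only the Blackman–Harris window and 55 Hz cutoff come from the description above, while the tap count and the column-wise application are assumed design choices and not the exact filt_lowpass() code.

# Sketch of a zero-phase FIR low-pass step analogous to filt_lowpass().
import numpy as np
from scipy.signal import firwin, filtfilt

def filt_lowpass(channel: np.ndarray, fs: float = 128.0,
                 cutoff: float = 55.0, numtaps: int = 101) -> np.ndarray:
    """Filter one EEG channel forward and backward so no phase lag is added."""
    taps = firwin(numtaps, cutoff, window="blackmanharris", fs=fs)
    return filtfilt(taps, [1.0], channel)

# Example use on the retained channels of a DataFrame `df` (assumed layout):
# for ch in ["AF3", "T7", "Pz", "T8", "AF4"]:
#     df[ch] = filt_lowpass(df[ch].to_numpy())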
To achieve the target 120 Hz rate, polyphase resampling is performed with SciPy’s (Version 1.15.3) signal.resample_poly [
20], which calculates the required number of output samples from the ratio of new to original sampling frequencies and applies built-in anti-aliasing filtering before decimation; any extras are trimmed or edge-padded to ensure exactly n_samples_new points. Quality-indicator columns (prefixed CQ_) are linearly interpolated onto the new 120 Hz time base using NumPy’s (Version 2.3.0) [
21] interp function, but are then dropped from the final DataFrame, as they are not required for downstream analysis. Both second- and millisecond-resolved timestamp columns (TIME_STAMP_s, TIME_STAMP_ms) are recomputed by linear spacing between the original epoch boundaries. A uniform “Time” column at 120 Hz is inserted, the processed DataFrame is written back to CSV with an updated header declaring the new sampling rate, and the FileMetadata return value provides original and new sample counts, exact start/end times, and the list of processed channels for downstream synchronization.
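A condensed sketch of this resampling and interpolation stage is shown below; the channel and CQ_ column names mirror the description above, but the helper structure and variable names are illustrative assumptions rather than the actual eeg_parser.py code.

# Sketch of the 128 Hz -> 120 Hz conversion described above.
import numpy as np
import pandas as pd
from math import gcd
from scipy.signal import resample_poly

def resample_to_120hz(df: pd.DataFrame, fs_old: int = 128, fs_new: int = 120,
                      channels=("AF3", "T7", "Pz", "T8", "AF4")) -> pd.DataFrame:
    n_old = len(df)
    n_new = int(round(n_old * fs_new / fs_old))
    g = gcd(fs_new, fs_old)
    up, down = fs_new // g, fs_old // g            # 15 / 16 for 120 / 128 Hz

    out = {}
    for ch in channels:
        y = resample_poly(df[ch].to_numpy(), up, down)   # built-in anti-aliasing
        # Trim or edge-pad so every channel has exactly n_new samples.
        y = y[:n_new] if len(y) >= n_new else np.pad(y, (0, n_new - len(y)), mode="edge")
        out[ch] = y

    # Quality-indicator columns are linearly interpolated onto the new time base
    # (and dropped later in the pipeline, per the description above).
    t_old = np.arange(n_old) / fs_old
    t_new = np.arange(n_new) / fs_new
    for cq in (c for c in df.columns if c.startswith("CQ_")):
        out[cq] = np.interp(t_new, t_old, df[cq].to_numpy())

    resampled = pd.DataFrame(out)
    resampled["Time"] = t_new                      # uniform 120 Hz time column
    return resampled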
The mocap_parser.py module automates the preparation of the Vicon Nexus ASCII exports by first scanning the initial ten lines for the date and time entries that denote the recording end timestamp, and by checking the “Model Outputs” section to confirm the 120 Hz sampling rate. It locates the header row and parses all subsequent numeric rows into a DataFrame whose columns include the frame index, time in seconds, and three-axis joint-angle measurements (e.g., RShoulderAngles_X/Y/Z, RElbowAngles_X/Y/Z, RWristAngles_X/Y/Z). Non-numeric or incomplete rows are discarded, and the remaining values are cast to floating-point numbers. The recording start time is calculated by subtracting the total sample count divided by 120 Hz from the end timestamp, which yields precise start_time and end_time metadata. The cleaned trajectory data and the accompanying metadata—number of original samples, start/end times, and sampling rate—are returned for each trial, which ensures consistency when aligning them with EEG streams.
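The following abbreviated sketch reflects this parsing logic; the timestamp format, comma delimiter, and "Model Outputs" keyword are assumptions about the export layout, since the exact Vicon ASCII format is not reproduced here.

# Abbreviated sketch of the MoCap ASCII parsing logic described above.
import pandas as pd
from datetime import datetime, timedelta

ANGLE_COLS = ["RShoulderAngles_X", "RShoulderAngles_Y", "RShoulderAngles_Z",
              "RElbowAngles_X", "RElbowAngles_Y", "RElbowAngles_Z",
              "RWristAngles_X", "RWristAngles_Y", "RWristAngles_Z"]

def parse_mocap_ascii(path: str, fs: float = 120.0):
    with open(path) as f:
        lines = f.readlines()

    # 1) Recording end timestamp from the first ten lines (format assumed).
    end_time = None
    for line in lines[:10]:
        try:
            end_time = datetime.strptime(line.strip(), "%Y-%m-%d %H:%M:%S")
            break
        except ValueError:
            continue

    # 2) Parse numeric rows below the "Model Outputs" header; discard the rest.
    header_idx = next(i for i, l in enumerate(lines) if "Model Outputs" in l)
    rows = []
    for line in lines[header_idx + 1:]:
        parts = line.strip().split(",")
        if len(parts) != 2 + len(ANGLE_COLS):
            continue
        try:
            rows.append([float(p) for p in parts])
        except ValueError:
            continue
    df = pd.DataFrame(rows, columns=["Frame", "Time_s"] + ANGLE_COLS)

    # 3) Start time = end time minus (sample count / sampling rate).
    start_time = end_time - timedelta(seconds=len(df) / fs) if end_time else None
    meta = {"n_samples": len(df), "start_time": start_time,
            "end_time": end_time, "sampling_rate": fs}
    return df, meta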
The final step was to synchronize the two data sets. Following the logic depicted in
Figure 4, a process for data–time synchronization was created. In synchronize.py, trial identifiers (e.g., “T1”) are extracted from filenames using a regular expression to pair each MoCap file with its corresponding EEG file. For each matched pair, absolute timestamps are computed by adding the per-sample interval (index/sampling_rate) to the stored start_time to produce epoch-based time arrays for both modalities. The overlapping interval is determined by taking the later of the two start times and the earlier of the two end times; both DataFrames are filtered to this common window. A new relative time axis is generated by subtracting the common_start, and pandas.merge_asof() is employed on this axis to align nearest-neighbor samples within a millisecond tolerance. A continuous global time index at 120 Hz is inserted and the merged DataFrame is exported to CSV. The synchronization metadata—including common_start, common_end, duration, and sampling_rate—are logged to guarantee full reproducibility prior to modeling.
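A simplified sketch of this synchronization step is given below; it assumes both DataFrames are already at 120 Hz and that start times are available as epoch seconds, so it mirrors, but does not reproduce, the actual synchronize.py implementation.

# Simplified sketch of the trial pairing and time-alignment logic.
import re
import numpy as np
import pandas as pd

def trial_id(filename: str) -> str:
    """Extract a trial identifier such as "T1" from a filename."""
    return re.search(r"T\d+", filename).group(0)

def synchronize(eeg: pd.DataFrame, eeg_start: float,
                mocap: pd.DataFrame, mocap_start: float,
                fs: float = 120.0) -> pd.DataFrame:
    # Absolute (epoch-based) timestamps for each sample.
    eeg = eeg.assign(abs_t=eeg_start + np.arange(len(eeg)) / fs)
    mocap = mocap.assign(abs_t=mocap_start + np.arange(len(mocap)) / fs)

    # Overlapping window: later of the two starts, earlier of the two ends.
    common_start = max(eeg["abs_t"].iloc[0], mocap["abs_t"].iloc[0])
    common_end = min(eeg["abs_t"].iloc[-1], mocap["abs_t"].iloc[-1])
    eeg = eeg[(eeg["abs_t"] >= common_start) & (eeg["abs_t"] <= common_end)]
    mocap = mocap[(mocap["abs_t"] >= common_start) & (mocap["abs_t"] <= common_end)]

    # Relative time axis and nearest-neighbor alignment within 1 ms.
    eeg = eeg.assign(rel_t=eeg["abs_t"] - common_start).sort_values("rel_t")
    mocap = mocap.assign(rel_t=mocap["abs_t"] - common_start).sort_values("rel_t")
    merged = pd.merge_asof(mocap, eeg, on="rel_t",
                           direction="nearest", tolerance=1e-3)

    merged["Time"] = np.arange(len(merged)) / fs   # continuous global 120 Hz axis
    return merged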
2.3.3. Model Training Procedure
Once synchronized and formatted, we scaffolded the machine learning pipeline via ChatGPT by issuing structured, iterative prompts to generate core processing scripts (e.g., eeg_parser.py, mocap_parser.py, synchronize.py; see
Appendix A). For instance, the prompt “Generate a Python (Version 3.10) function to apply a zero-phase Butterworth band-pass filter (1–40 Hz) to multichannel EEG data, including metadata extraction” produced fully commented, ready-to-run code that was vetted and committed.
The generated eeg_parser.py module performed comprehensive inspection and preprocessing: EEG signals were filtered with the Butterworth routine and then normalized using a Z-score threshold of ±5 σ to flag outliers. Subsequent visual comparisons confirmed effective artifact reduction and the preservation of signal integrity.
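For illustration, a minimal version of this filtering and outlier-flagging stage could look like the following; the filter order and the handling of flagged samples are assumptions that were not specified in the prompt.

# Sketch of the preprocessing described above: a zero-phase Butterworth
# band-pass (1-40 Hz) followed by +/-5 sigma outlier flagging.
import numpy as np
from scipy.signal import butter, filtfilt

def bandpass_zero_phase(x: np.ndarray, fs: float = 128.0,
                        low: float = 1.0, high: float = 40.0,
                        order: int = 4) -> np.ndarray:
    b, a = butter(order, [low, high], btype="bandpass", fs=fs)
    return filtfilt(b, a, x)               # forward-backward pass: zero phase lag

def flag_outliers(x: np.ndarray, sigma: float = 5.0) -> np.ndarray:
    z = (x - x.mean()) / x.std()
    return np.abs(z) > sigma               # boolean mask of samples to treat as artifacts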
Segmentation was implemented by prompting “Write a segmentation routine that produces sliding windows of 120 samples with a stride of 16 samples,” which yielded a script that captured temporal dynamics for motion events. A further prompt—“Provide standard-scaling wrappers for EEG and MoCap streams (mean = 0, σ = 1) and format into a unified input tensor”—generated the normalization and formatting code.
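A short sketch of the windowing and scaling logic is given below; labeling each window with the joint angles at its final sample, and the array shapes shown, are illustrative assumptions rather than the exact generated script.

# Sliding-window segmentation (120-sample windows, stride 16) and standard scaling.
import numpy as np
from sklearn.preprocessing import StandardScaler

def make_windows(eeg: np.ndarray, angles: np.ndarray,
                 window: int = 120, stride: int = 16):
    """eeg: (n_samples, n_channels); angles: (n_samples, n_joint_angles)."""
    X, y = [], []
    for start in range(0, len(eeg) - window + 1, stride):
        X.append(eeg[start:start + window])          # (window, n_channels)
        y.append(angles[start + window - 1])         # target at the window's end
    return np.stack(X), np.stack(y)

eeg_scaler, angle_scaler = StandardScaler(), StandardScaler()
# Typical use (arrays assumed to come from the synchronized CSV):
# eeg_scaled = eeg_scaler.fit_transform(eeg_raw)        # mean = 0, sigma = 1
# angles_scaled = angle_scaler.fit_transform(angles_raw)
# X, y = make_windows(eeg_scaled, angles_scaled)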
Model architecture was drafted with the instruction “Create a TensorFlow Sequential model with a 1D convolutional layer (32 filters, kernel size = 3), followed by max pooling, an LSTM layer (64 units) [
22], 30% dropout, and dense layers for joint-angle regression; compile with the Adam optimizer, MSE loss, and a custom ±3° accuracy metric.” The returned Keras script [
23] served as the baseline, with hyperparameter refinements (filter dimensions, learning rate, early-stopping patience) obtained through follow-up prompts and validated in TensorFlow.
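A minimal Keras sketch consistent with this prompt is shown below; the dense-layer width, the input/output dimensions, and the exact formulation of the ±3° metric are assumptions, as the generated script itself is not reproduced here.

# Minimal Keras sketch of the prompted CNN-LSTM regression architecture.
import tensorflow as tf

WINDOW, N_CHANNELS, N_ANGLES = 120, 5, 9     # assumed shapes (window x channels -> angles)

def within_3_deg(y_true, y_pred):
    """Custom metric: fraction of predictions within +/-3 degrees of the target."""
    return tf.reduce_mean(tf.cast(tf.abs(y_true - y_pred) <= 3.0, tf.float32))

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(WINDOW, N_CHANNELS)),
    tf.keras.layers.Conv1D(32, kernel_size=3, activation="relu"),
    tf.keras.layers.MaxPooling1D(pool_size=2),
    tf.keras.layers.LSTM(64),
    tf.keras.layers.Dropout(0.3),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(N_ANGLES),         # joint-angle regression head
])
model.compile(optimizer="adam", loss="mse", metrics=[within_3_deg])
model.summary()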
Early stopping on validation loss was configured to restore the best weights upon convergence. The dataset was partitioned into training and testing sets with an 80/20 ratio, with 20% of the training set reserved for validation. Post-training evaluation comprised MAE, MSE, and R² metrics, accompanied by graphical comparisons of predicted versus actual joint angles.
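The corresponding split, early-stopping, and evaluation steps could be sketched as follows; the epoch count, batch size, and patience are chosen for illustration only, and the placeholder arrays stand in for the windows and predictions produced by the sketches above.

# Sketch of the 80/20 split, early stopping, and post-training metrics.
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

early_stop = tf.keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=10, restore_best_weights=True)

X = np.random.randn(500, 120, 5)                          # placeholder windows
y = np.random.randn(500, 9)                               # placeholder joint angles
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, shuffle=False)

# model.fit(X_train, y_train, validation_split=0.2,       # 20% of training for validation
#           epochs=200, batch_size=32, callbacks=[early_stop])
# y_pred = model.predict(X_test)

y_pred = y_test + np.random.randn(*y_test.shape) * 0.1    # placeholder predictions
print("MAE:", mean_absolute_error(y_test, y_pred))
print("MSE:", mean_squared_error(y_test, y_pred))
print("R2 :", r2_score(y_test, y_pred))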
Pipeline functionality was extended by requesting “Generate Random Forest classification code that takes predicted joint angles and labels motion as forward, backward, or sideways”, and the returned script was integrated unchanged. Classification performance was assessed via confusion matrices and classification reports.
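A sketch of this classification stage is shown below; the feature and label arrays are random placeholders standing in for the predicted joint angles and motion labels, so the structure rather than the numbers is what matters here.

# Direction classification from predicted joint angles with a Random Forest.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import confusion_matrix, classification_report

rng = np.random.default_rng(0)
angles_pred = rng.normal(size=(300, 9))                        # placeholder features
labels = rng.choice(["forward", "backward", "sideways"], 300)  # placeholder labels

X_tr, X_te, y_tr, y_te = train_test_split(angles_pred, labels,
                                          test_size=0.2, random_state=0)
clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr, y_tr)

print(confusion_matrix(y_te, clf.predict(X_te)))
print(classification_report(y_te, clf.predict(X_te)))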
The complete training script—including model instantiation, callback configuration, plotting routines, and command-line interface—was generated by ChatGPT and integrated as-is, which enabled rapid, reproducible development with full control being retained over model training and hyperparameter tuning within conventional deep-learning frameworks.
3. Results
The results from Objective 1 showed that, when given a simple linear dataset (x = 1, 2, 3, 4, 5; y = 2, 4, 6, 8, 10), the model generated the correct function y = 2x and produced an accuracy of 100%. The same accuracy was obtained with more complex numerical approximations, including a range of linear, polynomial, exponential, periodic, and damped functions. When given values of x = 1, 2, 3, 4, 5 and y = 2, 9, 28, 65, 128, the model correctly estimated a cubic relationship of the form y = ax³ + bx² + cx + d, with the fitted curve being y = x³ + x², achieving 100% accuracy compared to the expected values.
Continuing with Objective 2, when used with the EMOTIV EPOC+ headset, ChatGPT achieved a mean accuracy of 83.7%, with accuracy ranging from 54.0% to 100%, as depicted in
Table 1. In contrast, when paired with the EMOTIV Insight headset and utilizing the CyKIT 3.0 software, the model reached an approximate accuracy of 79.8%, with a range of 65.0–96.5%. In tests aimed at differentiating between walking and other basic arm motions, the ChatGPT-based model produced a mean accuracy of 74.7% and a maximum accuracy of 89.2%. Moreover, when detecting whether the subject was idle or in baseline motion, ChatGPT achieved a 98% success rate with the EPOC+ headset and a 92% success rate with the Insight headset. In contrast, non-AI-driven research using the 16-channel EEG headset to detect arm motion demonstrated a maximum accuracy of 72% [
24], while research using a 5-channel headset displayed a maximum average accuracy of 45% [
6].
Variance in the accuracy of the AI was caused by outlier sets in the data, which resulted from measurement errors in the software and EEG device. During motion trials with the EMOTIV Insight in particular, improper movement could cause large values to be recorded by the headset even after the movement had concluded, and the model incorrectly interpreted these as movements. In subsequent trials, the effects of these improper movements were minimized by prompting the model to ignore outliers, which in turn substantially improved the performance of the model. Both cases are included in
Table 1 and therefore contribute to the large range of percent accuracies.
As for Objective 3, the developed machine learning model demonstrated varying degrees of accuracy in predicting joint angles from EEG signals, as indicated by the evaluation metrics. Overall, the model achieved a mean absolute error (MAE) of 16.99 degrees, a mean squared error (MSE) of 640.30, and a coefficient of determination (R²) score of 0.41, which signifies a moderate predictive capability. Although the overall trend of the predicted joint angles generally aligns with the ground truth, as shown in Figure 5, specific deviations, such as spikes or outlier points, contribute disproportionately to the error metrics, notably increasing both the MAE and MSE. A limb-specific breakdown revealed significant variability in performance. The predictions for shoulder joint angles exhibited the highest error, with an MAE of 22.26 degrees, an MSE of 866.41, and a relatively low R² score of 0.24, which suggests that the model struggled to accurately capture shoulder movements from EEG inputs. Conversely, the model exhibited comparatively better performance for the elbow, with an MAE of 4.62 degrees, an MSE of 117.92, and a moderate R² score of 0.50, which indicates more reliable predictions for simpler elbow joint movements. The wrist joint predictions showed intermediate performance, with an MAE of 15.83 degrees, an MSE of 588.31, and an R² score of 0.55, demonstrating reasonable predictive reliability.
Visual analysis of the predicted versus actual joint angles in
Figure 5 was conducted to further interpret the model’s performance. Predictions for the elbow joint closely mirrored the actual angles with notable accuracy, which indicates strong predictive capability in simpler single-axis joint movements. Conversely, the shoulder joint angle predictions exhibited considerable discrepancies, failing to consistently capture complex multi-axis dynamics, which reinforced the quantitative metrics’ indication of poorer model performance. The wrist predictions, while generally tracking actual movements, showed variable accuracy across different axes, which highlights challenges in predicting movements involving multi-degree-of-freedom joints. These observations underscore potential limitations in the ability of EEG-based modeling to accurately predict intricate joint motions and emphasize the necessity for enhanced model refinement and targeted preprocessing techniques to better manage complex joint kinematics.
Further evaluation using a confusion matrix from a Random Forest classifier, as shown in
Figure 6, revealed strong classification capabilities, as it accurately identified the motion direction with high precision. Specifically, forward, backward, and sideways motions were classified with minimal misclassification, which demonstrated the effectiveness of the classifier layer in categorizing predicted joint angle movements. It is important to highlight the contrast between
Figure 5 and
Figure 6, where
Figure 5 illustrates the model’s regression of joint angles—shoulder, elbow, and wrist—directly in joint space.
Figure 6, by contrast, shows predictions of hand movement directions in Cartesian space that were derived from the same model outputs.
Additionally, the training and validation loss curves illustrated in
Figure 7 displayed a steady convergence with decreasing losses over consecutive epochs.
4. Discussion of the Results
The results show that the model is capable of detecting correlations between numerical data points and can provide best-fitting equations for estimating output values. They also highlight the potential of a ChatGPT-based model to optimize non-invasive, EEG-based control systems for active assistive devices. Because it was trained on synchronized video and EEG data, the model effectively detects movement-related patterns in noisy EEG signals, which enables the control of active assistive devices. With this approach, the EPOC+ and Insight headsets achieved average accuracies of 83.7% and 79.8%, respectively. In contrast, other research with equivalent hardware but without AI support under comparable conditions reported accuracies of 72% and 45%. These findings highlight the model’s ability to filter out irrelevant data while effectively identifying key features associated with specific motions. This capability is crucial for ensuring accurate data analysis and improving the model’s practical application in real-world scenarios. While higher-end EEG headsets consistently outperformed their lower-end counterparts in terms of quality and precision, the enhancements in accuracy achieved through AI integration were particularly significant for the lower-end headsets. The use of AI not only mitigated some of the limitations of lower-end devices but also provided a robust framework for handling more complex data structures. After robust EEG decoding performance was established, the next phase involved integrating high-resolution motion-capture data.
The ChatGPT-scaffolded CNN-LSTM pipeline demonstrated sufficient accuracy for predicting hand movement directions in Cartesian space—achieving over 85% classification accuracy (
Figure 6)—while the joint-angle regression performance remained moderate (MAEs: shoulder ≈ 22°, elbow ≈ 5°, wrist ≈ 16°). Even with imperfect joint estimates, these results indicate that end-effector trajectories can be reliably inferred for some assistive tasks. In practice, combining the Cartesian predictions with an inverse-kinematics solver enables the closed-loop control of a robotic arm without the need to retrain the underlying model.
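For illustration, a two-link planar solver shows how a predicted Cartesian hand target could be mapped back to joint angles; the link lengths and the planar simplification are assumptions made here for clarity and do not represent the arm model used in this study.

# Illustrative two-link planar inverse-kinematics step for a predicted hand target.
import numpy as np

def two_link_ik(x: float, y: float, l1: float = 0.30, l2: float = 0.25):
    """Return shoulder and elbow angles (radians) reaching (x, y); one of the
    two analytic solutions is chosen."""
    d2 = x**2 + y**2
    cos_elbow = np.clip((d2 - l1**2 - l2**2) / (2 * l1 * l2), -1.0, 1.0)
    elbow = np.arccos(cos_elbow)
    shoulder = np.arctan2(y, x) - np.arctan2(l2 * np.sin(elbow), l1 + l2 * np.cos(elbow))
    return shoulder, elbow

# Example: a predicted forward-reach target 40 cm ahead and 20 cm up.
print(two_link_ik(0.40, 0.20))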
To utilize the proposed model in controlling power prosthetic arms, EEG data of each individual user must be collected while the user invokes arm motion commands—whether the user is an amputee or has intact arms—to produce EEG brain data. These data will then be used to train the model on the user’s EEG data. Once the model is trained, the prosthetic’s control can be linked to the model output to provide the intended motion of the prosthetic limb. This can also be generalized to control robotic arms that are attached to power wheelchairs or mobile platforms.
Although the CNN-LSTM architecture effectively captures localized temporal dependencies and general motion trends, its ability to model long-range dependencies is limited by the inherent constraints of convolutional and recurrent layers. Specifically, vanishing-gradient effects in the RNN components may inhibit performance on more complex or subtle temporal patterns, which suggests that future work should explore architectures designed for extended sequence modeling.
Another key limitation stems from the dataset’s scope. Although it was well-synchronized and cleaned, the dataset included only upper-limb motions across a restricted range of movements and joints.
To address current limitations in capturing long-range temporal dependence inherent in EEG signals and optimize noise mitigation, future development should investigate transformer-based architectures. Compared to CNN-LSTM architectures that have limited capability to model extensive temporal relationships [
22], transformer architectures, particularly those based on self-attention mechanisms, can directly compute relationships between all positions in a sequence, allowing the effective modeling of long-range dependencies without the constraints typically encountered by recurrent networks [
25]. Furthermore, self-attention mechanisms dynamically weigh the relevance of different time points and signal features. This characteristic enhances robustness against noisy or inconsistent inputs, which are prevalent in EEG data due to artifacts and inherent biological variability [
26]. Finally, such models will benefit from a systematic comparison of prompt formulations and the integration of explainable-AI methods (e.g., SHAP values, attention maps), which will aid in auditing and refining the ChatGPT-generated code for alignment with domain best practices.
Expanding the dataset will also be critical. Including a broader range of motions and joints, particularly additional temporal features such as angular velocity and acceleration, in the MoCap dataset will improve generalization by providing insight into motion trends and facilitating better pattern recognition.
In parallel, improvements to the EEG dataset are essential for enhancing model performance. In our current design, EEG variables refer to the microvolt-level time-series signals recorded from each electrode, such as AF3, T7, Pz, T8, and AF4. When integrated into a transformer model, each channel can be treated as an independent temporal stream, which would allow the self-attention mechanism to assess and assign importance to different electrode activities across time. This approach is especially valuable for identifying distributed neural patterns and addressing the noisy, non-stationary nature of EEG data. In addition to raw signals, future work could incorporate derived features such as spectral band power in alpha, beta, or gamma ranges, event-related potentials, and inter-channel synchrony as enriched inputs for attention encoding. These additional features have the potential to further improve the model’s ability to decode complex motor commands with greater accuracy and robustness [
5,
6].
To ensure that the system is applicable across real-world scenarios, it will be important to include a wide range of participant profiles that vary in age, gender, and body characteristics. Combined with a more diverse set of human motion patterns, this inclusion will help the model adapt to individual differences in EEG signals and movement behavior. In preparation for real-time deployment, the full pipeline should be evaluated on embedded hardware to assess its feasibility for wearable or robotic systems. Automating the preprocessing workflow using GPT-powered tools could also support consistent performance and efficient deployment across different use cases.
Ultimately, this study demonstrates the potential of generative AI, particularly ChatGPT, in enabling non-invasive brain–computer interface systems. While the current results reflect moderate predictive accuracy, the findings highlight both the predictive potential of hybrid CNN-LSTM models and the value of ChatGPT-driven pipeline automation in accelerating the development of reliable EEG-based assistive technologies. To transition this system from research to clinical or industrial applications, several key challenges must be addressed. These include improving model accuracy through architectural advancements such as transformer integration, expanding and diversifying the dataset to enhance generalization, and reducing latency through optimized real-time processing on embedded platforms. The model’s robustness must also be verified via cross-subject studies, and the system’s usability should be enhanced by incorporating ergonomic, user-friendly design elements. Through architectural innovation, dataset expansion, systematic evaluation, and practical deployment considerations, this work advances the development of accessible, intelligent assistive technologies powered by generative AI.