1. Introduction
The emergence of non-invasive methods for analyzing and detecting diseases is one of the most significant prospects in medicine. At the same time, this poses challenges related to the correct use of technology, the positioning of the sensors, and the constant evolution of the equipment [
1,
2]. Technological advancements allow for a preliminary diagnosis through machines without any intervention from healthcare professionals. This research is included in the development of systems to support ambient assisted living technologies [
3,
4,
5,
6]. Cutting-edge approaches in the healthcare area have helped in solving various computer vision-based tasks by analyzing different features from various biosignals, including the facial features [
7,
8].
Mobile devices can be connected to different devices to head the creation of sophisticated hand-held systems for the monitoring of health states [
9,
10,
11]. They are handy because they are portable and small, allowing their correct positioning for different measurements [
9,
10,
11]. These devices are equipped with different sensors, but more sensors can be connected through over-the-air connections [
12,
13,
14,
15,
16,
17,
18]. These devices with increasing number of functionalities, and the number of available sensors, boost the options for creation of systems that could assist older adults [
15,
19,
20,
21,
22]. The use of this data captured in each individual and their subsequent calculation present the potential of these projects.
This research is included in a project related to Timed-Up and Go test, where the individuals were provided with a smartphone having an accelerometer and magnetometer sensors. To perform the experiments, we used two BITalino devices (
https://bitalino.com/en/) that are affordable do-it-your-self boards facilitating the collection and analysis of variety of biomedical signals with inexpensive sensors. First, a BITalino device was positioned on the chair with a force sensor to detect the duration for which the individuals stood. Then, another BITalino device, with ECG and EEG sensors, was placed in the individual, and the different sensors were prepared for the acquisition of data during the test. Regarding the data acquired by the mobile device, the sampling rate is around 10 ms. Then, for the data acquired by the BITalino device, the sampling rate is exactly 100 ms. The similar frequencies enable the comparison of the data more accurately.
The main purpose of this study was the implementation of neural networks to identify the different diseases present in the population considered in the study reported in [
23,
24]. It was related to the timed-up and go test’s execution with institutionalized people from the Covilhã and Fundão municipalities. Thus, the implementation of the methods started with identifying persons by institutions, age, diseases, and groups of diseases.
The implemented neural networks with the WEKA software [
25] reported that the individuals might be recognized by institutions, where only the individuals from Centro Comunitário das Lameiras were not correctly identified. Similar results were obtained by age, where only persons who were 74, 85, and 86 years old were not correctly recognized. Regarding the recognition of the diseases, they were not correctly identified, because the sample consisted of a small number of individuals. However, after the categorization of the illnesses, cardiac diseases started to be recognized as a group of diseases.
Other studies related to processing of ECG and EEG data are reviewed and summarized in [
26,
27]. There are two main tracks regarding feature extraction—based on statistical features from the time and frequency domain, and ones based on deep learning. Our approach is using classical feature extraction because of the limited dataset size that we have. The results are on par with other approaches with a similar number of participants. What is novel in our approach is the combination of both sensors embedded on an inexpensive board, proving that even such affordable devices can provide satisfactory results and serve as indication of emerging diseases.
The Introductory section ends with this paragraph, and the remaining sections of this paper are organized as follows:
Section 2 presents the description of the structure of the method implemented for the recognition of persons by institution, age, diseases, and groups of diseases. The results obtained are presented in
Section 3. This paper ends with the discussion of the results and the presentation of the different conclusions in
Section 4.
2. Methods
Machine learning methods were implemented with ECG and EEG data to identify the persons by the institution, age, diseases, and groups of disorders. The flow of the proposed method includes several stages, including data collection, feature extraction, machine learning, and statistical methods, as presented in
Figure 1. We only considered the persons whose ECG and EEG data were correctly acquired, and the data were not filtered, extracting only the different features.
2.1. Data Collection
Following the previous studies [
23,
24], the data, as presented in
Table 1, were acquired from 14 institutionalized individuals aged between 71 and 97 years old (83 +/− 7.4) with different diseases included in some categories, as presented in
Section 3.4. The different data were acquired from various institutions, such as Centro Comunitário das Lameiras, Lar Minas, Lar da Misericórdia, and Lar da Nossa Senhora de Fátima. The data were collected by a mobile application connected by Bluetooth to a BITalino device. Different constraints were verified during the data collection, but these records were reliable for the analysis and correlation of the different diseases found in the population. For this study, only the ECG and EEG data acquired by a BITalino device were considered for the processing of the different disorders. The data acquisition faced some challenges, as presented in [
28,
29].
The acquisition of the data has different sample rate between devices, where the sampling rate of the sensors available in the mobile device is variable, because the instruction Sensor.DELAY_FASTEST does not have the same frequency on all devices. Regarding the device used, i.e., XIAOMI MI 6, reported that the frequency is around 10 ms, i.e., 100 Hz. Next, the sampling rate of the BITalino device connected by Bluetooth is 100 Hz. After the data acquisition, the different features were extracted for further comparison, as explained in
Section 2.2. After the data acquisition, the data were processed offline as presented in
Section 2.3.
2.2. Feature Extraction
Different features were extracted with the framework [
4] from the ECG and EEG signals, including heart rate, heart rate variability, average QRS amplitude, average R-R interval, and the average R-S interval from the ECG data, and the frequency and variability from the EEG data. These data were combined for the identification of institution, age, disease, and a group of disorders of different individuals.
2.3. Machine Learning
The machine learning method implemented was a neural network, i.e., multiplayer perceptron, implemented with the WEKA software [
25] with the following details:
The WEKA software is a free and open-source application to test different machine learning methods. It includes a set of methods, but we chose the Multiplayer Perceptron method [
30,
31], which is a method that consists of the training and the prediction of different classes with different weights to the input and output neurons. It also supports different attribute transformation methods, including ones for handling nominal and numeric data, which is important for medical datasets, which frequently encounter mixed data types [
32,
33].
2.4. Statistical Analysis
For the validation of the implemented method, different parameters were calculated, such as true positive (TP), false positive (FP), true negative (TN), and false negative (FN). With these values, the accuracy, precision, recall, and F1 score values were calculated to measure the performance of the implemented method.
4. Discussion and Conclusions
Machine learning techniques are helpful for the recognition of different diseases involved in the studied population. The application of machine learning techniques made it possible to identify with some accuracy the different patterns related to the extracted features, such as heart rate, heart rate variability, average QRS amplitude, average R-R interval, and average R-S interval from ECG data, and the frequency and variability from the EEG data. A combination of these parameters allowed us to identify, with some accuracy, the presence of certain diseases.
However, this study revealed some limitations related to the data acquisition and different constraints, and some data were excluded for several reasons, including the failure of the sensors. A small number of valid records implies that the machine learning method might benefit from larger datasets and samples for them to be reliable.
The obtained results revealed that the individuals related to institutions were recognized except for individuals from Centro Comunitário das Lameiras. The identification results related to age were also accurate except for the results for persons aged 74, 85, and 86 years old. Regarding the recognition of diseases and considering that we had a small dataset for the analysis, the isolated disorders were not recognized. However, when the disorders were categorized, some persons with cardiovascular diseases were identified. Thus, the proposed method reported low accuracies for illnesses, but the accuracy was higher for the recognition of persons by age and institution.
In the future we intend to study a larger number of individuals to increase the size of the dataset acquired. Next, other types of diseases will be analyzed, comparing healthy people with those suffering from certain disorders.