Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model

Hong, Junhee; Seol, Youngjin; Lee, Seunghyun; Yoon, Janghyeok; Lee, Jiho; Park, Ki-Su; Ha, Ji-Wan

doi:10.3390/math12203208

Open AccessArticle

Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model

by

Junhee Hong

¹,

Youngjin Seol

¹,

Seunghyun Lee

¹,

Janghyeok Yoon

^1,*,

Jiho Lee

²

,

Ki-Su Park

³ and

Ji-Wan Ha

⁴

¹

Department of Industrial Engineering, Konkuk University, 120 Neungdong-ro, Gwangjin-gu, Seoul 05029, Republic of Korea

²

Neopons Inc., 465, Dongdaegu-ro, Dong-gu, Daegu 41260, Republic of Korea

³

Department of Neurosurgery, Kyungpook National University School of Medicine, 130 Dongdeok-ro, Jung-gu, Daegu 41944, Republic of Korea

⁴

Department of Speech Pathology, Daegu University, 201 Daegudae-ro, Jillyang-eup, Gyeongsan 38453, Republic of Korea

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(20), 3208; https://doi.org/10.3390/math12203208

Submission received: 6 September 2024 / Revised: 29 September 2024 / Accepted: 10 October 2024 / Published: 13 October 2024

(This article belongs to the Special Issue Computational Modelling and Analytical Framework for Medical Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Rapid elderly population growth has increased the number of patients with cognitive impairment (CI). Early detection and ongoing medical treatment can slow CI progression and significantly reduce the cost of managing patients. However, distinguishing CI from natural cognitive decline associated with aging is challenging. Previous studies conducted to identify patients with CI using lifelog data did not consider changes in lifelog data over time because each data point was learned individually. This study introduces a model that predicts patients with CI based on sleep lifelog data and analyzes significant sleep factors that influence cognitive decline. This study followed three steps: (1) collecting sleep lifelog data from elderly Korean people and reconstructing sleep lifelog data as time-series data; (2) building a model to classify CI using a time series of sleep lifelog data and a long short-term memory model; and (3) identifying sleep factors that influence the onset of CI using an explainable AI algorithm. The proposed CI classification model achieved a sensitivity of 0.89, a specificity of 0.80, and an area under the receiver operating characteristic curve of 0.92. This study will facilitate the noninvasive screening, diagnosis, and continuous monitoring of CI in the elderly.

Keywords:

cognitive impairment; sleep lifelog data; deep learning; health care

MSC:

68T01

1. Introduction

Cognitive impairment refers to a condition in which intellectual functions, such as memory, language, and judgment, are diminished while the individual remains fully conscious [1]. The severity of cognitive impairment can range from mild to severe, with severe cases potentially leading to memory loss and the progression of dementia. Mild cognitive impairment (MCI) is an intermediate stage between normal cognition (NC) and dementia (DE), where it is not severe enough to affect daily activities but is an early sign that may eventually lead to dementia [2]. The progression rate from MCI to dementia is significantly higher than that in the general population, with 5–10% of patients with MCI deteriorating into dementia annually within a short period of time [3]. Not everyone diagnosed with MCI will progress to dementia, so early medication or rehabilitation can slow cognitive deterioration or restore function to normal [4]. Therefore, the early identification of cognitive impairment can contribute to socioeconomic cost savings by preventing dementia. However, it is difficult to distinguish between cognitive decline due to normal aging and MCI because there are no clear criteria for comparing them.

Accordingly, patient-data-driven research has been actively conducted to identify cognitive impairment by efficiently utilizing accumulated patient medical data. For example, studies have been conducted to detect cognitive decline and monitor patient conditions using neuroimaging, speech, and language data. An algorithm was developed to classify patients with cognitive impairment by applying computer-based signal processing and pattern recognition techniques to electroencephalography (EEG) data [5]. Additionally, machine learning algorithms have been applied to acoustic parameters extracted from the recorded speech signals to distinguish between cognitively impaired patients and healthy individuals [6]. Despite their contribution to identifying cognitive impairment, patient-data-driven approaches are limited in obtaining continuous data and may utilize incomplete data if patients refuse to participate [7,8].

To address these issues, an approach based on lifelog data that enables the collection of real-time information through sensors installed in everyday life was proposed. Lee, Kang [9] trained an artificial neural network (ANN) from sleep and activity data collected from wristbands worn by participants to classify normal individuals and MCI patients. Minamisawa, Okada [10] analyzed the impact of daily activity patterns on dementia by collecting activity data using multiple sensors installed in residential environments. Kim, Jang [11] applied lifelog data collected from patients living in nursing homes to a multilayer perceptron (MLP) to detect abnormal behaviors in patients with dementia. Approaches that utilize lifelog data have the advantage of collecting large amounts of data in real-time, which makes it better to train a classification model. Additionally, the regular updating of lifelog data facilitates the maintenance of predictive models’ effectiveness [12].

Although various studies have demonstrated the feasibility of identifying cognitive impairment through the analysis of patient data, several limitations remain. First, studies that utilize lifelog data have not fully leveraged the time-varying nature of continuous data. The temporal sequence of events or daily functions in everyday life can serve as a biomarker for diagnosing conditions that precede dementia [13,14]. However, previous studies have only utilized data from a single time point, despite the potential of real-time lifelog data to facilitate the analysis of changes in daily life over time. Therefore, this study aims to utilize the time-series characteristics of data as input variables to detect early signs of cognitive decline and improve the validity of cognitive impairment identification. Second, most data-driven studies have only focused on presenting learning outcomes and enhancing the performance of classification models; however, they have often lacked an interpretation for these results or identification of the primary factors influencing cognitive impairment. In various fields, identifying key variables in complex systems is critical for understanding underlying mechanisms. For instance, Lloret-Climent and Nescolarde-Selva [15] applied such an approach in the analysis of tourist networks to uncover key factors influencing system behavior. The black-box nature of machine learning techniques complicates the traceability of the model’s predictive process, making them unsuitable for direct application in the actual diagnostic process in medicine. Therefore, it is essential not only to identify cognitive impairment but also to provide evidence for changes in behavioral patterns as cognitive function declines. To achieve this, it is crucial to analyze the primary factors influencing cognitive impairment using interpretable artificial intelligence (AI) algorithms, thereby enhancing the reliability of the prediction results.

To address the limitations of previous studies, this study aims to develop a long short-term memory (LSTM) model for learning from sleep lifelog data collected using wearable devices. LSTM models are particularly suitable for this type of data because they effectively capture temporal dependencies, making them ideal for identifying gradual changes in cognitive function. Given that cognitive impairment manifests gradually, the time-series nature of this lifelog data provides critical insights into daily patterns that cannot be captured by single point observations. Our approach seeks to exploit this continuity by reconstructing lifelog data into time series sequences, which allows the model to learn from behavioral trends over time. Furthermore, we examined the association between 32 sleep factors and cognitive impairment using the Shapley additive explanations (SHAP), an interpretable AI algorithm. This step is particularly important for providing insights into the most influential sleep factors, linking the study results to clinically significant outcomes. This study was conducted in three phases: (1) First, we collected sleep lifelog data from public institutions to construct a database of patients with cognitive impairment and converted the daily data into a time series dataset. (2) Next, we designed a classification model to identify cognitive impairment using sleep factors extracted from the lifelog data. (3) Finally, we identified the sleep factors influencing cognitive decline and interpreted the predicted results. This approach not only allows for the early identification of cognitive impairment but also provides interpretable evidence to support the model’s predictions, facilitating potential clinical application.

The contributions of this study are as follows: First, this study utilizes a dataset comprising sequences of three, four, and five consecutive days rather than isolated time points. By capturing longitudinal patterns in subjects’ daily lives, this approach allows for a more comprehensive analysis of behavioral changes over time. This approach enables the learning of patterns in subjects’ daily lives over time, minimizing reliance on fragmentary judgments and thereby enhancing the effectiveness of cognitive impairment identification compared with previous studies. This innovation in data handling significantly minimizes the reliance on fragmentary judgments and enhances the model’s robustness. Second, this study identifies the indicators that influence cognitive decline and offers empirical evidence for their significance. By leveraging interpretable AI methods, such as SHAP, we provide insights into how specific sleep-related factors contribute to cognitive impairment. Consequently, biomarkers of cognitive impairment can be used to interpret the experimental results and provide medical insights into the relationship between sleep and cognitive impairment. Third, from a practical standpoint, this study supports expert decision-making by rapidly predicting the presence or absence of cognitive impairment using easily accessible lifelog data, thereby reducing the time and cost associated with cognitive impairment diagnosis. The model enables rapid, non-invasive predictions, significantly reducing the time and cost associated with traditional cognitive impairment diagnosis methods, thereby supporting more efficient decision-making processes for healthcare professionals.

The remainder of this paper is organized as follows: Section 2 reviews background research relevant to this study. Section 3 details the research methodology and processes. Section 4 presents the results of the empirical study, and Section 5 concludes the paper and discusses future study directions.

2. Data-Driven Cognitive Impairment Analysis

While cognitive decline in older adults due to aging is considered normal, MCI can lead to a significant reduction in cognitive abilities, including memory loss, disorientation, and impaired visuospatial function. MCI can rapidly progress to severe dementia, ultimately rendering independent living impossible. Therefore, the early identification and prevention of cognitive impairment is crucial. However, the early detection of cognitive impairment is challenging because of the difficulty in distinguishing MCI from cognitive decline associated with normal aging, which often results in MCI being overlooked. In recent years, MCI has been systematically managed at the national level, and academic efforts to predict and analyze the presence of cognitive impairment have been increasing [16].

Since the development of the mental status questionnaire in 1960 [17], numerous screening tools have been continuously created to assess cognitive status and diagnose psychiatric disorders. The mini-mental state examination (MMSE) is the most widely used tool for assessing cognitive function [18], capable of measuring cognitive impairment and detecting dementia within a short period of time. The Korean mini-mental state examination (K-MMSE), an adaptation of the MMSE for Korean individuals, is commonly used in Korea [19]. However, these screening tools may be inadequate for identifying the early symptoms of cognitive impairment owing to their low sensitivity and specificity for milder symptoms. Additionally, these tools can only diagnose a patient’s current condition without addressing issues related to daily behavior. Consequently, predictive methods based on data accumulated directly from patients are needed to identify cognitive impairment more effectively.

Neuroimaging modalities, such as computed tomography, magnetic resonance imaging, and EEG, are increasingly utilized for the clinical diagnosis of cognitive impairment. Among these, EEG offers the distinct advantage of real-time measurements, facilitating studies aimed at diagnosing cognitive impairment through the analysis of EEG signal patterns. Baker and Akrofi [5] leveraged computer-based signal processing and pattern recognition techniques on EEG data to classify patients with Alzheimer’s disease (AD) and controls and further predict the progression of MCI to AD. In addition to EEG, extensive research has focused on predicting cognitive impairment by analyzing various types of patient data, including voice, language, and facial expressions. Jarrold and Peintner [20] gathered voice sample data from healthy controls and patients with four dementia subtypes, extracting acoustic features and linguistic text to differentiate between the subtypes. Yu and Quatieri [21] used speech features derived from remotely collected speech data to predict cognitive impairment. Key factors such as pseudo-syllable rate, pitch variation, and articulation adjustments based on formant correlation measures were used to validate the predictive accuracy of clinical assessments of cognitive impairment. Although patient medical data can effectively identify cognitive impairment, the methods mentioned in previous studies are limited in that they are not suitable for the continuous data collection necessary for learning prediction models without burdening older adults [7,8].

The close relationship among cognitive impairment, daily physical activity, and sleep has been extensively studied [22]. Recent advances in sensor technology and wearable devices have enabled the collection of various lifelog data, such as activity and sleep information, facilitating studies aimed at predicting cognitive impairment using these data. Specifically, Lee and Kang [9] presented a model to classify individuals as normal or having MCI by training an ANN for the early identification of cognitive impairment. They extracted key behavioral factors from activity and sleep data collected from wristbands worn by subjects and trained ANN to distinguish patients with MCI from healthy controls. Minamisawa and Okada [10] used a variety of lifelog data collected from patients in a nursing home to detect abnormal behaviors in patients with dementia. They analyzed the impact of daily activities and sleep patterns on dementia detection using data from doors, motion, location, and sleep sensors. Kim and Jang [11] developed a machine learning-based framework for detecting abnormal behavior in patients with dementia using lifelog data. They trained an MLP using patient behavioral data collected using low-cost sensors, providing a cost-effective solution that can be widely implemented in many nursing homes. Table 1 lists previous studies that analyzed cognitive impairment using medical and lifelog data collected from patients.

The utilization of lifelog data collected through wearable devices offers a substantial advancement over traditional methods in several key respects. First, wearable devices, such as wristbands and rings, facilitate continuous and long-term data collection in real-world environments, providing a more comprehensive view of an individual’s daily behavior and sleep patterns. This non-invasive nature of wearable devices reduces the burden on elderly patients and increases compliance, as the devices can collect data passively without requiring active participation from patients. Furthermore, wearable devices are suitable for prolonged monitoring, capturing subtle changes in behavior or sleep over time that may be missed in short-term clinical assessments. This continuous data collection is vital for tracking the gradual progression of cognitive impairment and provides more consistent insights compared with the episodic data gathered in clinical settings.

The studies listed in Table 1 employed various types of data collected from patients to identify cognitive impairment. However, their primary focus on model training and enhancing performance often obscures the understanding of how patient behavior influences cognitive impairment. To address this issue, it is essential to identify the key factors associated with cognitive impairment and provide a rationale for predictions. Consequently, this study utilized SHAP, an interpretable AI algorithm, to identify critical factors and elucidate the internal mechanisms of the prediction model. Additionally, the temporal sequence of daily functions can serve as a biomarker for identifying cognitive impairment in the context of diagnosing pre-dementia conditions, necessitating consideration of how temporal data change over time. Hence, this study utilizes the temporal characteristics of sleep states and biological signals collected via wearable devices as input variables for the prediction model. We developed LSTM models to identify cognitive impairment and demonstrated how the input features contribute to the prediction through SHAP analysis.

3. Methodology

3.1. Long Short-Term Memory

With technological advancements, the quality of wearable sensor devices has significantly improved, facilitating comprehensive data collection for effective health-condition monitoring. Wearable devices are defined as technological apparatuses designed to be worn on the human body or on clothing [23]. Lifelog data generated by smart wearable devices or applications comprises time-series health information recorded at specific intervals. Several previous studies have proposed machine learning-based approaches utilizing various lifelog data; however, most of these studies do not fully exploit the time-series nature of the data; instead, they use values at specific points in time as input variables. Considering that LSTM neural networks have demonstrated strong performance with time-series data, we employed an LSTM model to analyze lifelog data in a time-series format.

Figure 1 presents the cell structure of an LSTM network. Cell state (

C_{t}

) represents the component of the model responsible for carrying information forward, with updates managed through a series of gates that determine which information is retained or discarded. First, the Forget Gate (Equation (1)) is responsible for deciding whether to discard past information by processing previous outputs (

h_{t - 1}

) and current inputs (

x_{t}

) through a sigmoid layer that generates a value between 0 and 1 to be applied to the previous state (

C_{t - 1}

). The input gate (Equations (2) and (3)) is responsible for storing current information. It first determines which information to update in the previous state (

C_{t - 1}

) through a sigmoid layer and then generates a new candidate vector (

{\tilde{C}}_{t}

) in the tanh layer. After determining the information to be discarded or updated through these gates, the cell state at the current time (

C_{t}

) is updated accordingly (Equation (4)). Finally, the output gates (Equations (5) and (6)) determine the final output (

h_{t}

) by multiplying the output value (

O_{t}

) obtained from the sigmoid layer with the value of the cell state (

C_{t}

) processed through the tanh layer.

f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f})

(1)

i_{t} = σ (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i})

(2)

{\tilde{C}}_{t} = t a n h (W_{C} \cdot [h_{t - 1}, x_{t}] + b_{C})

(3)

C_{t} = f_{t} \otimes C_{t - 1} + i_{t} \otimes {\tilde{C}}_{t}

(4)

O_{t} = σ (W_{O} \cdot [h_{t - 1}, x_{t}] + b_{o})

(5)

h_{t} = O_{t} \otimes \tanh (C_{t})

(6)

In this manner, LSTM selectively manages input information through its internal mechanisms, allowing for the retention or discarding of cell-state information. By overcoming the long-term dependency challenge of recurrent neural networks through its long-term memory capabilities, LSTM is effective in capturing the unique characteristics of time-series data and maintaining temporal dependency. The sleep lifelog data utilized in this study were generated in real-time and exhibited time-varying characteristics. LSTMs demonstrate considerable advantages in predicting cognitive impairment by effectively capturing and modeling the time-series nature of such data.

3.2. Cognitive Impairment Prediction Model

This study analyzes data consisting of 32 variables representing participants’ sleep information. The diagnostic outcome variables include NC, MCI, and DE. The data are divided into two groups (1: cognitively impaired group, 0: normal functioning group) and classified into two binary categories based on the likelihood of cognitive impairment. In this study, the significantly smaller number of patients with dementia poses a risk of model’s performance degradation due to data imbalance in a multiclass classification approach. Additionally, the primary goal of this study is to identify the presence of cognitive impairment rather than to differentiate between its stage. Therefore, combining MCI and DE into a single group is considered the most appropriate strategy to ensure the model’s effectiveness.

The model’s performance in predicting cognitive impairment is evaluated using various quantitative metrics, including accuracy, precision, F1-score, sensitivity, and specificity. Specifically, accuracy reflects the proportion of correct predictions, precision represents the proportion of true positive predictions out of all positive predictions, and F1 score balances precision and recall providing a more comprehensive view of the model’s effectiveness [24]. This study also employs sensitivity and specificity metrics that are commonly used in clinical trials and diagnostic tests [25]. These metrics gauge the ability of the model to distinguish between individuals with and without certain health conditions. Specifically, sensitivity (Equation (7)) measures the proportion of true positive (TP) observations correctly identified by the model, whereas specificity (Equation (8)) measures the proportion of correctly identified true negative (TN) observations.

S e n s i t i v i t y = \sum \frac{T P}{T P + F N}

(7)

S p e c i f i c i t y = \sum \frac{T N}{T N + F P}

(8)

TP refers to instances in which the model correctly predicts a positive outcome when the actual condition is positive. TN denotes instances in which the model accurately predicts a negative outcome when the actual condition is negative. A false positive (FP) indicates a case in which the model incorrectly predicts an outcome as positive when the actual condition is negative. A false negative (FN) refers to instances in which the model incorrectly predicts an outcome as negative when the actual condition is positive.

This study utilizes the receiver operating characteristic (ROC) curve to evaluate the effectiveness and optimal cut-off values of the binary classification models. The ROC curve visually represents the relationship between the FP rate (1-specificity; x-axis) and the TP rate (sensitivity; y-axis), allowing for the assessment of model performance across different classification thresholds [26]. The area under the ROC curve (AUC) provides a numerical measure for performance evaluation, with values closer to 1 indicating better model performance. As shown in Figure 2, an ROC curve that approaches the upper left corner of the plot signifies higher accuracy, reflecting a higher TP rate and a lower FP rate.

3.3. Shapley’s Additive Explanations

Recently, machine learning-based models have been increasingly utilized in various domains. Consequently, the importance of interpretable machine learning has been emphasized because of the black-box nature of models, which means the ambiguity of the rationale behind both the prediction processes and the resulting outcomes. Interpretable machine learning encompasses technologies and methodologies that provide insights into the internal workings of a model by explaining its functionality [27]. This study interpreted the cognitive impairment prediction results of the highest-performing model using SHAP.

SHAP is a framework used to interpret the contribution of each input feature to the predicted outcome using its Shapley value, which represents the conditional expectation of the model [28]. It employs the additive feature attribution method (Equation (9)) to calculate the importance of each feature and assigns importance values to individual features based on the principles of cooperative game theory.

g (z^{'}) = ϕ_{0} + \sum_{i = 1}^{M} ϕ_{i} {z'}_{i}

(9)

The vector z′∈

{\{0,1\}}^{M}

represents the coalition vector indicating the presence or absence of the i-th feature (present = 1, absent = 0), where M denotes the total number of features,

ϕ_{i}

∈ ℝ is the importance value of the i-th feature, and

ϕ_{0}

represents the baseline value in the absence of the features. SHAP quantifies the importance of each feature by evaluating changes in the predictions of the model and describing how the current output value f(x) transitions from the baseline value E[f(z)]. When the model is nonlinear, or the features are not independent, the order of feature inclusion becomes significant. When the model is nonlinear or the features are not independent, the order in which the features are added to the prediction becomes significant. SHAP averages the

ϕ_{i}

values over all possible orderings of the features. Therefore, when defining

f_{x}

(S) =

E [f_{x} | x_{s}]

for a subset of features (S), the SHAP value (

ϕ_{i}

) is expressed as follows: (Equation (10))

ϕ_{i} = \sum_{S \subseteq {x_{1}, \dots, x_{m} \ {x_{i}}} \frac{|S|! (M - |S| - 1)!}{M!} (f_{x} (S ⋃ {x_{i}}) - f_{x} (S))

(10)

In this context,

f_{x} (S ⋃ \{x_{i}\})

and

f_{x} (S)

represent the model predictions with and without the i-th feature, respectively. The prediction of the original model is equal to the sum of the SHAP values for all features, with each SHAP value reflecting the importance of the corresponding features in relation to the model’s prediction. This study sought to interpret the relationship between sleep and cognitive impairment by utilizing Deep SHAP, which is specialized in deep learning models and can efficiently calculate the SHAP values in complex neural networks [28]. By assessing the contribution of each input variable, Deep SHAP provides valuable insights into both the direction and magnitude of the impact of individual sleep factors on cognitive impairment.

4. Experimental Results

Following the methodology outlined in the previous chapter, a three-step experimental procedure was implemented, as depicted in Figure 3. First, sleep lifelog data were collected from both healthy individuals and patients, from which sleep metrics were extracted. The data were preprocessed into sequences suitable for model training. Subsequently, LSTM models were developed to predict the presence of cognitive impairment. Finally, an interpretable AI algorithm was applied to identify sleep factors that influence cognitive impairment.

4.1. Data

This study employed the “Wearable Lifelog of Dementia High Risk Group” dataset, provided by AI-Hub (https://aihub.or.kr/) under the Korea Agency for Intelligence and Information Society. The dataset comprised sleep lifelog data from 300 individuals aged 55 years and older who wore ring-shaped wearable devices. This cohort included healthy individuals as well as patients diagnosed with MCI and dementia, selected based on comprehensive specialist diagnoses. Each entry in the dataset represents a single subject’s sleep data for one day, encompassing sleep information and the corresponding diagnostic labels, as outlined in Table 2. The dataset includes sleep lifelog information collected over periods ranging from 35 to 122 days, capturing sleep patterns such as duration, blood pressure, heart rate, and breathing. The dataset we received was already cleaned, with all missing data or outliers handled in-house. Consequently, no additional data cleaning was required for data quality control in this study. In total, 12,183 lifelog records from 174 subjects were analyzed in this study, and the final dataset statistics are summarized in Table 3. A variety of sleep features with potential implications for cognitive impairment was defined, and the final set of sleep features used for model training is presented in Table 4.

Cognitive impairment is often associated with sleep disturbances, manifesting as day–night reversals, difficulty initiating sleep, and frequent nocturnal awakenings [29,30]. Conversely, alterations in sleep patterns can also contribute to cognitive decline, with key physiological parameters indicative of sleep quality including respiratory rate, heart rate, and body movement [31]. Based on these considerations, 32 sleep factors were selected for model training. As presented in Table 4, sleep factors were divided into two primary categories. Sleep quality features are indicators that indirectly represent a subject’s sleep experience and include attributes such as sleep efficiency, sleep duration, and sleep latency. Statistical features consist of metrics such as one-minute averages, maximum values, and minimum values for factors such as respiration rate and heart rate. For example, the features ‘start’ and ‘end’ refer to indicators that denote the initiation and cessation of sleep within defined time intervals. Detailed descriptions and respective formats of each feature are presented in Table A1.

The target variable for diagnostic labeling was redefined into two categories: normal functioning (NC) and cognitively impaired (MCI, DE). To effectively capture sleep patterns from the lifelog data, the dataset was reorganized into a time-series format over a specified number of days. In this study, training sets were constructed by grouping continuous data into sequences of three, four, and five days for each subject.

4.2. Cognitive Impairment Prediction

The constructed training set was used to train the LSTM model. This study utilized all 12,183 data points collected from 174 subjects and transformed them into time-series data with sequences of three, four, and five days to train the LSTM model. To ensure a rigorous evaluation of the model, test data were constructed by isolating the final week of data from each subject within the entire dataset, whereas the remaining data were used for model training. To address class imbalance due to the larger proportion of the normal functioning group compared with the cognitively impaired group, we performed simple undersampling to balance the data distribution across classes in the training set. We employed a grid search to determine the optimal hyperparameters for each model based on the length of the time-series data. During the grid search process, we explored various combinations of LSTM units (64, 128, and 256), dense layer units (32, 64, and 128), and learning rates (0.001 to 0.01) to fine-tune the architecture. The final model comprised an LSTM layer with 128 units, followed by a dense layer with 64 units, and a final dense output layer for binary classification. The Adam optimizer with a learning rate of 0.001 was used to achieve optimal performance, given its adaptability in time-series prediction tasks. The selected hyperparameters provided the best balance between accuracy and validation loss. Additionally, we constructed support vector machine (SVM), logistic regression (LR), random forest (RF), and XGBoost models, which do not reflect time-series characteristics, to objectively compare the performance of the LSTM models presented in this study. We employed H₂O, a Python (H₂O version 3.46.0.1) AutoML library, to determine the optimal hyperparameters for each model based on the length of the time-series data. The final performances of all the trained models are summarized in Table 5.

In this study, model performance was evaluated based on sensitivity, specificity, and F1-score. When these values were comparable, we further compared models based on AUC. Sensitivity represents the model’s ability to correctly identify patients with cognitive impairment, while specificity reflects its accuracy in classifying healthy individuals. Overall, the LSTM models demonstrated superior performance compared with other machine learning models. Specifically, the LSTM model trained on the five-day sequence data achieved the highest performance, with a sensitivity of 0.89, a specificity of 0.80, and an AUC of 0.92. Additionally, it achieved an accuracy of 0.85 and an F1-score of 0.85, which are relatively high. As depicted in Figure 4, the ROC curve for the LSTM model is positioned in the upper left corner, reflecting the largest AUC relative to the other models, thus indicating its superior predictive capability for cognitive impairment.

In Table 6, the “real value” represents whether the subjects actually have cognitive impairment, while the “predicted value” denotes the outcomes generated by the model, where 0 indicates a normal state without cognitive impairment and 1 signifies the presence of cognitive impairment. The “prediction score” refers to the model’s output value, with the LSTM model employed in this study utilizing a threshold of 0.5. Prediction scores exceeding 0.5 are classified as indicative of the cognitively impaired group, whereas prediction scores below 0.5 are categorized as the normal functioning group.

Subsequently, we computed the precision@K score to analyze the practical assessment of the likelihood of developing cognitive impairment. The precision@K score measures the proportion of accurately predicted outcomes among the top k predictions when the model prediction scores are sorted in descending order. In other words, precision@K reflects the proportion of the top k subjects for which the diagnostic labels are correctly predicted and is defined as follows:

p r e c i s i o n @ K = \frac{# o f a c t u a l l y c o g n i t i v e i m p a i r m e n t p a t i e n t s}{T o p k p e o p l e p r e d i c t e d a s c o g n i t i v e i m p a i r m e n t p a t i e n t}

(11)

As illustrated in Figure 5, the precision@100 score for the LSTM model was 96%, indicating that 96 of the top 100 predicted patients had cognitive impairment. This result underscores the proficiency of the LSTM model in capturing the characteristics of time-series data and its effectiveness in predicting cognitive impairment. Although the other machine learning models did not utilize time-series data, the RF and XGBoost models performed relatively well in identifying patients with cognitive impairment. Consequently, it can be concluded that the LSTM model is highly suitable for identifying patients with cognitive impairment.

4.3. Identifying Influential Features Using Explainable AI

Finally, the SHAP algorithm was applied to the LSTM model trained with the five-day sequence data, which exhibited the highest predictive performance. To elucidate the model’s predictions and analyze the impact of sleep factors on cognitive impairment, two types of plots (summary and bar) were utilized. The summary plot displays the distribution of the Shapley values for each feature across the entire dataset, providing insights into the influence of each feature on the model’s predictions. The bar plot, on the other hand, presents the average absolute SHAP values, which reflect the overall importance of each feature in the prediction process. Figure 6 illustrates the summary plot for the LSTM model using five-day sequence data. In the summary plot, the x-axis represents the Shapley value, with a wider distribution to the right indicating a stronger positive influence on cognitive impairment, whereas a wider distribution to the left suggests a stronger negative influence. In this plot, red distributions indicate higher feature values, whereas blue distributions represent lower feature values. Figure 7 shows the bar plot of the LSTM model, where the length of the bars is the average of the absolute values of the SHAP values for each feature, indicating the global predictive impact.

The most significant factors, in order of importance, were average breaths per minute, heart rate variability, rapid eye movement (REM) sleep duration, deep sleep duration, and the percentage of tossing and turning. These findings indicated an association between sleep factors and cognitive impairment. Specifically, the LSTM model predicted that higher average breaths per minute and greater tossing rates were linked to an increased risk of cognitive impairment, suggesting that individuals with cognitive impairment tend to have poorer sleep quality. This is likely because sleep disorders lead to frequent tossing and turning, which disrupt deep sleep. Furthermore, a shorter duration of REM and deep sleep, along with an extended period of light sleep, was associated with an increased risk of cognitive impairment, revealing a significant relationship between sleep depth and cognitive impairment.

Existing medical research supports the notion that sleep disorders are closely linked to cognitive decline. Clinical studies have demonstrated a significant increase in the influx of red blood cells into the capillaries of the cerebral cortex during REM sleep [32,33,34]. Consequently, it can be inferred that REM sleep facilitates active substance exchange within the cerebral cortex, contributing to brain refreshment. This suggests that REM sleep is vital for maintaining cognitive health, as it supports memory consolidation, and neural maintenance. Thus, the lower percentage of REM sleep in individuals with cognitive impairment, as identified by our model, aligns with previous findings, further supporting the idea that REM sleep deprivation accelerates cognitive decline. Heart rate variability (HRV), another key feature identified by the model, is well-documented in the literature as being closely tied to sleep and overall health [35,36,37]. This study found that reduced HRV was associated with a higher model-predicted risk of cognitive impairment. Hence, the findings of this study underscore the importance of autonomic function and its potential to predict cognitive decline.

Moreover, this study’s findings concerning tossing and turning rates are also consistent with the established literature on sleep disturbances. Frequent night-time awakenings, increased tossing and turning, and extended periods of light sleep are commonly observed in individuals with cognitive impairment or dementia [38]. This study demonstrates that poor sleep quality, fragmented sleep, and disruptions in circadian rhythms are prevalent neuropsychiatric symptoms in individuals with cognitive impairment, significantly impairing their quality of life. Furthermore, it is well-established that sleep deprivation or disturbances can exacerbate the neurodegenerative process, as highlighted by studies linking increased amyloid-beta levels and elevated non-soluble tau protein to poor sleep quality [39]. These findings emphasize that sleep disruptions can accelerate the degeneration of neural pathways, contributing to the progression of cognitive decline.

This study revealed that individuals with sleep problems such as poor sleep quality, sleep disturbances, and extended periods of light sleep exhibit a significantly higher likelihood of being diagnosed with cognitive impairment or dementia. Indeed, sleep issues such as night-time awakenings or the reversal of the sleep–wake cycle are among the most prevalent neuropsychiatric symptoms observed in patients with cognitive impairment [40], which considerably diminish the quality of life of many older adults suffering from cognitive decline. Furthermore, disruptions in sleep mechanisms due to altered sleep patterns may accelerate the degeneration of neural pathways that regulate both somatic and psychiatric comorbidities [41]. Sleep deprivation resulting from sleep disturbances is associated with increased amyloid levels in the cerebrospinal fluid and elevated levels of non-soluble tau protein, both of which contribute to neurodegeneration. The findings of this study further underscore the significance of sleep characteristics in predicting cognitive impairment and highlight a strong correlation between sleep patterns and cognitive decline in older adults.

In this chapter, we employed SHAP to identify the key variables that significantly contribute to the prediction of cognitive impairment. The SHAP values generated by the model were used to quantitatively assess the contribution of these variables to the model predictions. Our analysis revealed that average breaths per minute, REM sleep duration, tossing and turning rates, and light sleep duration were among the most influential factors affecting the model’s predictions. These findings are consistent with previous research suggesting a strong link between sleep characteristics and cognitive decline, reinforcing the notion that sleep patterns are closely associated with cognitive impairment. Consequently, the prediction process of the model developed in this study was validated as rational, and it is anticipated that it can be effectively utilized by healthcare professionals to identify the early signs of cognitive decline and facilitate the early detection of cognitive impairment.

5. Conclusions

In this study, we employed time-series sleep lifelog data to predict cognitive impairment and utilized SHAP, an interpretable AI algorithm, to elucidate the model’s predictions. We collected sleep lifelog data from both healthy individuals and those with cognitive impairment, transformed them into a time series, and extracted relevant sleep factors indicative of the subjects’ sleep status. Subsequently, we trained seven machine learning models and determined that the LSTM model, utilizing five days of time-series data, achieved the highest performance. Finally, we applied SHAP to interpret the predictions of this model, providing insights into the factors influencing cognitive impairment.

The contributions of this study are summarized as follows: First, the cognitive impairment classification model developed in this study effectively leverages the temporal characteristics of sleep lifelog data, demonstrating superior performance in cognitive disorder classification compared with traditional machine learning models. This approach differentiates itself from existing research that utilizes structured measurement tools and traditional medical data and highlights the potential for incorporating lifelog data in the field of cognitive impairment. The proposed model, when used in conjunction with wearable devices, is expected to reduce both the cost and time associated with testing. Second, this study identified key indicators related to cognitive impairment and demonstrated their significance through empirical analysis. By applying interpretable AI techniques such as SHAP, this study revealed how specific sleep-related variables are linked to cognitive decline, offering valuable medical insights into the interaction between sleep patterns and cognitive health. Finally, on a practical level, this model shows potential for real-world applications in healthcare. When paired with wearable devices, it enables rapid and non-invasive predictions of cognitive impairment, substantially reducing the costs and time typically involved in cognitive impairment diagnosis, thus facilitating more efficient decision-making for medical professionals.

However, this study has several limitations. First, the sample size was limited to 174 participants, thereby constraining the dataset available for model training. Since this small sample size can increase the risk of overfitting, this study applied early stopping in the model training process. Future study should focus on expanding the dataset to enhance the model’s generalization ability and robustness in real-world applications. Additionally, increasing the volume of disease data is expected to address data imbalance and to distinguish effectively between different stages of cognitive impairment. Second, the study utilized only one to two months of lifelog data, which may be insufficient given the time required for progression from mild to severe cognitive impairment. Longer-term data collected over several years can help capture the gradual cognitive decline that occurs over time. In addition, incorporating multiple data sources such as neuroimaging, speech data, or genetic information could provide a more holistic view of cognitive impairment progression, leading to more accurate predictions. Finally, the proposed model was not tested in actual healthcare settings and has not been validated in real-world clinical practice. Future study should focus on validating the model in clinical settings to assess its practical effectiveness. Collaborating with healthcare providers could enable the integration of the model into clinical workflows, providing real-time feedback and facilitating the early detection of cognitive impairment. Additionally, alternative AI models such as gated recurrent units or attention mechanism should be explored to potentially enhance the model’s performance and adaptability in clinical environments.

Author Contributions

Conceptualization, J.H., Y.S., S.L., J.Y., J.L., K.-S.P., and J.-W.H.; Methodology, J.H., Y.S., S.L., and J.Y.; Software, J.H. and Y.S.; Validation, J.Y., K.-S.P., and J.-W.H.; Formal analysis, J.L.; Investigation, J.H., Y.S., S.L., J.Y., and J.L.; Resources, J.Y., K.-S.P., and J.-W.H.; Data curation, J.H.; Writing—original draft, J.H., Y.S., S.L., and J.Y.; Writing—review and editing, J.Y. and J.L.; Visualization, J.H. and Y.S.; Supervision, J.Y.; Project administration, J.H., J.Y., K.-S.P., and J.-W.H.; Funding acquisition, J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This paper was supported by Konkuk University in 2022.

Data Availability Statement

The data analyzed in this study are available in the AI-Hub (https://www.aihub.or.kr accessed on 1 May 2023) of the National Information Society Agency.

Conflicts of Interest

Author Jiho Lee was employed by the Neopons Inc. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. Detailed Description and Format of Each Feature

Table A1. Sleep feature list.

Name	Description	Format
sleep_awake	Waking time	seconds
sleep_deep	Deep sleep time	seconds
sleep_duration	Sleep time	seconds
sleep_efficiency	(sleep_total/sleep_duraion) × 100	1–100 or 0 if not available
sleep_light	Light sleep time	seconds
sleep_rem	REM sleep time	seconds
sleep_midpoint_time	Sleep midpoint time	time delta
sleep_midpoint_time_at_delta	Sleep midpoint time delta	time delta
sleep_onset_latency	Sleep incubation time	seconds
sleep_restless	Tossing and turning ratio	%
sleep_temperature_delta	Skin temperature deviation delta	celsius
sleep_temperature_deviation	Skin temperature deviation	celsius
sleep_total	Total sleep time	seconds
sleep_hypnogram_average	Average of sleep status logs per 5 min	‘1’ = deep sleep ‘2’ = light sleep ‘3’ = REM sleep ‘4’ = awake
start1-6	Whether the start of the sleep time is in one of the six time zones (0–4, 4–8, 8–12, 12–16, 16–20, 20–24 o’clock)	0 = no, 1 = yes
end1-6	Whether the end of sleep time is in one of the six time zones (0–4, 4–8, 8–12, 12–16, 16–20, 20–24 o’clock)	0 = no, 1 = yes
sleep_breath_average	Average breaths per minute	breaths per minute
sleep_hr_average	Average heart rate per minute	beats per minute
sleep_hr_min	Minimum value of heart rate per minute	beats per minute
sleep_hr_max	Maximum value of heart rate per minute	beats per minute
sleep_hr_median	Median value of heart rate per minute	beats per minute
rmssd_average	Average heart rate variability	milliseconds

References

Folstein, M.; Anthony, J.C.; Parhad, I.; Duffy, B.; Gruenberg, E.M. The meaning of cognitive impairment in the elderly. J. Am. Geriatr. Soc. 1985, 33, 228–235. [Google Scholar] [CrossRef] [PubMed]
Gauthier, S.; Reisberg, B.; Zaudig, M.; Petersen, R.C.; Ritchie, K.; Broich, K.; Belleville, S.; Brodaty, H.; Bennett, D.; Chertkow, H. Mild cognitive impairment. Lancet 2006, 367, 1262–1270. [Google Scholar] [CrossRef] [PubMed]
Sanford, A.M. Mild cognitive impairment. Clin. Geriatr. Med. 2017, 33, 325–337. [Google Scholar] [CrossRef] [PubMed]
Mitchell, A.J.; Shiri-Feshki, M. Rate of progression of mild cognitive impairment to dementia–meta-analysis of 41 robust inception cohort studies. Acta Psychiatr. Scand. 2009, 119, 252–265. [Google Scholar] [CrossRef] [PubMed]
Baker, M.; Akrofi, K.; Schiffer, R.; O’Boyle, M.W. EEG patterns in mild cognitive impairment (MCI) patients. Open Neuroimaging J. 2008, 2, 52–55. [Google Scholar] [CrossRef]
Tóth, L.; Hoffmann, I.; Gosztolya, G.; Vincze, V.; Szatlóczki, G.; Bánréti, Z.; Pákáski, M.; Kálmán, J. A speech recognition-based solution for the automatic detection of mild cognitive impairment from spontaneous speech. Curr. Alzheimer Res. 2018, 15, 130–138. [Google Scholar] [CrossRef]
Puce, A.; Hämäläinen, M.S. A review of issues related to data acquisition and analysis in EEG/MEG studies. Brain Sci. 2017, 7, 58. [Google Scholar] [CrossRef]
Jaiswal, K.; Sobhanayak, S.; Mohanta, B.K.; Jena, D. IoT-cloud based framework for patient’s data collection in smart healthcare system using raspberry-pi. In Proceedings of the 2017 International Conference on Electrical and Computing Technologies and Applications (ICECTA), Ras Al Khaimah, United Arab Emirates, 21–23 November 2017; pp. 1–4. [Google Scholar]
Lee, S.-H.; Kang, W.-S.; Moon, C. Lifelog-based classification of mild cognitive impairment using artificial neural networks. In Proceedings of the 2018 International Conference on Electronics, Information, and Communication (ICEIC), Honolulu, HI, USA, 24–27 January 2018; pp. 1–2. [Google Scholar]
Minamisawa, A.; Okada, S.; Inoue, K.; Noguchi, M. Dementia scale score classification based on daily activities using multiple sensors. IEEE Access 2022, 10, 38931–38943. [Google Scholar] [CrossRef]
Kim, K.; Jang, J.; Park, H.; Jeong, J.; Shin, D.; Shin, D. Detecting Abnormal Behaviors in Dementia Patients Using Lifelog Data: A Machine Learning Approach. Information 2023, 14, 433. [Google Scholar] [CrossRef]
Leevy, J.L.; Khoshgoftaar, T.M.; Bauder, R.A.; Seliya, N. Investigating the relationship between time and predictive model maintenance. J. Big Data 2020, 7, 36. [Google Scholar] [CrossRef]
Borson, S.; Frank, L.; Bayley, P.J.; Boustani, M.; Dean, M.; Lin, P.-J.; McCarten, J.R.; Morris, J.C.; Salmon, D.P.; Schmitt, F.A. Improving dementia care: The role of screening and detection of cognitive impairment. Alzheimers Dement. 2013, 9, 151–159. [Google Scholar] [CrossRef] [PubMed]
Verlinden, V.J.; van der Geest, J.N.; de Bruijn, R.F.; Hofman, A.; Koudstaal, P.J.; Ikram, M.A. Trajectories of decline in cognition and daily functioning in preclinical dementia. Alzheimers Dement. 2016, 12, 144–153. [Google Scholar] [CrossRef] [PubMed]
Lloret-Climent, M.; Nescolarde-Selva, J.A.; Alonso-Stenberg, K.; Montoyo, A.; Gutiérrez-Vázquez, Y. Applying Smarta to the analysis of tourist networks. Math. Methods Appl. Sci. 2022, 45, 3921–3932. [Google Scholar] [CrossRef]
Kasper, S.; Bancher, C.; Eckert, A.; Förstl, H.; Frölich, L.; Hort, J.; Korczyn, A.D.; Kressig, R.W.; Levin, O.; Palomo, M.S.M. Management of mild cognitive impairment (MCI): The need for national and international guidelines. World J. Biol. Psychiatry 2020, 21, 579–594. [Google Scholar] [CrossRef]
Kahn, R.L.; Goldfarb, A.I.; Pollack, M.; Peck, A. Brief objective measures for the determination of mental status in the aged. Am. J. Psychiatry 1960, 117, 326–328. [Google Scholar] [CrossRef]
Folstein, M.F.; Folstein, S.E.; McHugh, P.R. “Mini-mental state”: A practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 1975, 12, 189–198. [Google Scholar] [CrossRef]
Kang, Y.; NA, D.-L.; Hahn, S. A validity study on the Korean Mini-Mental State Examination (K-MMSE) in dementia patients. J. Korean Neurol. Assoc. 1997, 15, 300–308. [Google Scholar]
Jarrold, W.; Peintner, B.; Wilkins, D.; Vergryi, D.; Richey, C.; Gorno-Tempini, M.L.; Ogar, J. Aided diagnosis of dementia type through computer-based analysis of spontaneous speech. In Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, Baltimore, MD, USA, 27 June 2014; Resnik, P., Resnik, R., Mitchell, M., Eds.; The Association for Computational Linguistics: Stroudsburg, PA, USA, 2014; pp. 27–37. [Google Scholar]
Yu, B.; Quatieri, T.F.; Williamson, J.R.; Mundt, J.C. Cognitive impairment prediction in the elderly based on vocal biomarkers. In Proceedings of the Sixteenth Annual Conference of the International Speech Communication Association, Dresden, Germany, 6–10 September 2015. [Google Scholar]
Bubu, O.M.; Brannick, M.; Mortimer, J.; Umasabor-Bubu, O.; Sebastião, Y.V.; Wen, Y.; Schwartz, S.; Borenstein, A.R.; Wu, Y.; Morgan, D. Sleep, cognitive impairment, and Alzheimer’s disease: A systematic review and meta-analysis. Sleep 2017, 40, zsw032. [Google Scholar] [CrossRef]
Someya, T.; Bao, Z.; Malliaras, G.G. The rise of plastic bioelectronics. Nature 2016, 540, 379–385. [Google Scholar] [CrossRef]
Vujović, Ž. Classification model evaluation metrics. Int. J. Adv. Comput. Sci. Appl. 2021, 12, 599–606. [Google Scholar] [CrossRef]
Altman, D.G.; Bland, J.M. Diagnostic tests. 1: Sensitivity and specificity. BMJ: Br. Med. J. 1994, 308, 1552. [Google Scholar] [CrossRef] [PubMed]
Swets, J.A. Measuring the accuracy of diagnostic systems. Science 1988, 240, 1285–1293. [Google Scholar] [CrossRef] [PubMed]
Doshi-Velez, F.; Kim, B. Towards a rigorous science of interpretable machine learning. arXiv 2017, arXiv:1702.08608. [Google Scholar]
Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. Adv. Neural Inf. Process. Syst. 2017, 30, 2493. [Google Scholar]
Guarnieri, B.; Adorni, F.; Musicco, M.; Appollonio, I.; Bonanni, E.; Caffarra, P.; Caltagirone, C.; Cerroni, G.; Concari, L.; Cosentino, F. Prevalence of sleep disturbances in mild cognitive impairment and dementing disorders: A multicenter Italian clinical cross-sectional study on 431 patients. Dement. Geriatr. Cogn. Disord. 2012, 33, 50–58. [Google Scholar] [CrossRef]
Peter-Derex, L.; Yammine, P.; Bastuji, H.; Croisile, B. Sleep and Alzheimer’s disease. Sleep Med. Rev. 2015, 19, 29–38. [Google Scholar] [CrossRef]
Hahn, E.A.; Wang, H.-X.; Andel, R.; Fratiglioni, L. A change in sleep pattern may predict Alzheimer disease. Am. J. Geriatr. Psychiatry 2014, 22, 1262–1271. [Google Scholar] [CrossRef]
Natsubori, A.; Tsunematsu, T.; Karashima, A.; Imamura, H.; Kabe, N.; Trevisiol, A.; Hirrlinger, J.; Kodama, T.; Sanagi, T.; Masamoto, K. Intracellular ATP levels in mouse cortical excitatory neurons varies with sleep–wake states. Commun. Biol. 2020, 3, 491. [Google Scholar] [CrossRef]
Grant, D.A.; Franzini, C.; Wild, J.; Eede, K.J.; Walker, A.M. Autoregulation of the cerebral circulation during sleep in newborn lambs. J. Physiol. 2005, 564, 923–930. [Google Scholar] [CrossRef]
Bergel, A.; Deffieux, T.; Demené, C.; Tanter, M.; Cohen, I. Local hippocampal fast gamma rhythms precede brain-wide hyperemic patterns during spontaneous rodent REM sleep. Nat. Commun. 2018, 9, 5364. [Google Scholar] [CrossRef]
Snyder, F.; Hobson, J.A.; Morrison, D.F.; Goldfrank, F. Changes in respiration, heart rate, and systolic blood pressure in human sleep. J. Appl. Physiol. 1964, 19, 417–422. [Google Scholar] [CrossRef] [PubMed]
Coccagna, G.; Scaglione, C. Cardiocirculatory disorders and sleep. In Sleep: Physiology, Investigations, and Medicine; Springer: New York, NY, USA, 2003; pp. 589–597. [Google Scholar]
Bušek, P.; Vaňková, J.; Opavský, J.; Salinger, J.; Nevšímalová, S. Spectral analysis of heart rate variability in sleep. Physiol Res 2005, 54, 369–376. [Google Scholar] [CrossRef] [PubMed]
McCurry, S.M.; Logsdon, R.G.; Teri, L.; Vitiello, M.V. Sleep disturbances in caregivers of persons with dementia: Contributing factors and treatment implications. Sleep Med. Rev. 2007, 11, 143–153. [Google Scholar] [CrossRef] [PubMed]
Ferini-Strambi, L.; Liguori, C.; Lucey, B.P.; Mander, B.A.; Spira, A.P.; Videnovic, A.; Baumann, C.; Franco, O.; Fernandes, M.; Gnarra, O. Role of sleep in neurodegeneration: The consensus report of the 5th Think Tank World Sleep Forum. Neurol. Sci. 2024, 45, 749–767. [Google Scholar] [CrossRef]
Muangpaisan, W.; Intalapaporn, S.; Assantachai, P. Neuropsychiatric symptoms in the community-based patients with mild cognitive impairment and the influence of demographic factors. Int. J. Geriatr. Psychiatry A J. Psychiatry Late Life Allied Sci. 2008, 23, 699–703. [Google Scholar] [CrossRef]
Abbott, S.M.; Videnovic, A. Chronic sleep disturbance and neural injury: Links to neurodegenerative disease. Nat. Sci. Sleep 2016, 8, 55–61. [Google Scholar]

Figure 1. Long short-term memory (LSTM) cell structure.

Figure 2. Receiver operating characteristic (ROC) graph.

Figure 3. Overall experimental process.

Figure 4. ROC plot and AUC value (bottom right) for the LSTM model using five days.

Figure 5. Precision@K score of the models (K = 100).

Figure 6. Summary plot.

Figure 7. Bar plot.

Table 1. Prior studies on data-based approaches.

Authors	Year	Method	Dataset
Baker, Akrofi et al. [5]	2008	K-means clustering algorithm	Electroencephalography data
Jarrold, Peintner et al. [20]	2014	Multilayer perceptron	Speech data
Yu, Quatieri et al. [21]	2015	Support vector machine	Speech data
Lee, Kang et al. [9]	2018	Artificial neural networks	Activity and sleep data
Minamisawa, Okada et al. [10]	2022	Multilayer perceptron	Activity and sleep data
Kim, Jang et al. [11]	2023	Multilayer perceptron	Abnormal behavior data

Table 2. Examples of lifelog dataset.

Sleep_Bedtime_Start	Sleep_Bedtime_End	Sleep_Awake	Sleep_Deep	…	Sleep_Total	DIAG_NM
18 October 2020 18:38:28	19 October 2020 05:10:28	8700	10,110	…	29,220	MCI
19 October 2020 21:39:52	20 October 2020 05:37:52	6570	7440	…	22,110	MCI
20 October 2020 20:51:28	21 October 2020 05:45:28	10,530	4620	…	21,510	MCI
⋯
30 October 2020 01:14:20	30 October 2020 07:29:20	8520	3600	…	13,980	NC
30 October 2020 22:22:52	31 October 2020 07:47:52	9210	6450	…	24,690	NC
⋯
17 October 2020 20:20:28	18 October 2020 06:35:28	6810	390	…	30,090	DE

Table 3. Data statistics.

Feature	Value
Number of total people	174
Number of people with normal cognition	111
Number of people with mild cognitive impairment	51
Number of people with dementia	12
Number of sleep data	12,183

Table 4. Input sleep features.

Category	Features
Sleep quality features	sleep_awake, sleep_deep, sleep_duration, sleep_efficiency, sleep_light, sleep_rem, sleep_midpoint_time, sleep_midpoint_at_delta, sleep_onset_latency, sleep_restless, skin_temperature_delta, skin_temperature_deviation, sleep_total, sleep_hypnogram_average, start1, start2, start3, strat4, start5, start6, end1, end2, end3, end4, end5, end6
Statistical features	sleep_breath_average, sleep_hr_average, sleep_hr_min, sleep_hr_max, sleep_hr_median, rmssd_average

Table 5. Performances of the trained models.

Method	Sensitivity	Specificity	Area under the ROC Curve (AUC)	Accuracy	Precision	F1-Score
LSTM (three days)	0.87	0.74	0.88	0.81	0.77	0.82
LSTM (four days)	0.89	0.77	0.91	0.83	0.80	0.84
LSTM (five days)	0.89	0.80	0.92	0.85	0.82	0.85
XGBoost	0.68	0.76	0.81	0.72	0.74	0.71
Random forest (RF)	0.67	0.77	0.81	0.72	0.75	0.71
Logistic regression (LR)	0.59	0.60	0.63	0.60	0.60	0.60
Support vector machine (SVM)	0.62	0.59	0.64	0.61	0.60	0.61

Table 6. Examples of prediction results of the LSTM model using five days.

Diagnosis	Real Value	Predicted Value	Prediction Score
MCI	1	1	0.999999
MCI	1	1	0.999999
DE	1	1	0.999993
⋯
MCI	1	1	0.758631
NC	0	1	0.756962
⋯
NC	0	0	0.000031
NC	0	0	0.000017

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Hong, J.; Seol, Y.; Lee, S.; Yoon, J.; Lee, J.; Park, K.-S.; Ha, J.-W. Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model. Mathematics 2024, 12, 3208. https://doi.org/10.3390/math12203208

AMA Style

Hong J, Seol Y, Lee S, Yoon J, Lee J, Park K-S, Ha J-W. Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model. Mathematics. 2024; 12(20):3208. https://doi.org/10.3390/math12203208

Chicago/Turabian Style

Hong, Junhee, Youngjin Seol, Seunghyun Lee, Janghyeok Yoon, Jiho Lee, Ki-Su Park, and Ji-Wan Ha. 2024. "Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model" Mathematics 12, no. 20: 3208. https://doi.org/10.3390/math12203208

APA Style

Hong, J., Seol, Y., Lee, S., Yoon, J., Lee, J., Park, K.-S., & Ha, J.-W. (2024). Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model. Mathematics, 12(20), 3208. https://doi.org/10.3390/math12203208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of Cognitive Impairment Using Sleep Lifelog Data and LSTM Model

Abstract

1. Introduction

2. Data-Driven Cognitive Impairment Analysis

3. Methodology

3.1. Long Short-Term Memory

3.2. Cognitive Impairment Prediction Model

3.3. Shapley’s Additive Explanations

4. Experimental Results

4.1. Data

4.2. Cognitive Impairment Prediction

4.3. Identifying Influential Features Using Explainable AI

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Detailed Description and Format of Each Feature

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI