1. Introduction
In recent years, advancements in wearable technology have transformed the healthcare landscape, enhancing health monitoring and revolutionising patient–provider interactions, especially in remote care. Wearable devices, such as smartwatches and fitness trackers, continuously collect physiological data (heart rate, activity levels, and sleep partners, among others) that provide real-time insights into the health status. Integrating these technologies into patient care represents a shift from traditional episodic assessments to real-time monitoring, providing a comprehensive understanding of an individual’s health status [
1]. This approach represents a shift from traditional static assessments to real-time, dynamic monitoring, providing a comprehensive understanding of how risk factors evolve over time [
1].
Wearable technology also facilitates the early detection of health issues by identifying subtle changes in baseline metrics, such as a gradual increase in heart rate [
2]. Additionally, these devices empower patients to take a proactive approach to their health by providing immediate feedback and encouraging healthier lifestyles [
3]. The data generated can inform treatment plans and optimise healthcare strategies, allowing for tailored interventions and improved patient outcomes [
1,
4].
Research on wearable devices has highlighted their benefits and limitations. For example, a systematic review and meta-analysis [
5] explored the use of activity trackers in older adults, emphasising the need for further research in clinical practice and home-based monitoring. Moreover, older adults with cancer often face a unique set of challenges that can contribute to feelings of vulnerability and anxiety, frailty, and other age-related changes [
6]. Measuring and analysing self-reported and objective data are essential steps for cultivating a comprehensive understanding of these challenges. Self-reported information offers valuable insights into patients’ subjective experiences, including their perceived vulnerability, anxiety, and the emotional impact of their cancer journey.
Incorporating real-world data from wearable devices into clinical data presents a promising opportunity to enhance cancer patient care. While cancer patients undergo extensive evaluations and treatment regimens documented in their electronic health records (EHRs), these records often lack contextual information about their daily life, activity levels, and symptoms that significantly impact their cancer journey [
7].
Integrating data from wearables, mobile applications, and patient-reported outcomes can provide a more comprehensive view of the patient’s experience. Older adults with cancer face unique challenges, including vulnerability and anxiety, making it essential to analyse both self-reported and objective data. Self-reported insights into perceived vulnerability and emotional impact are particularly valuable.
Several studies have explored wearable devices for managing cancer survivors’ health. For instance, Rossi et al. (2018) [
8] examined the acceptability and validity of a physical activity tracker for endometrial cancer survivors, a group with high obesity rates. Another study focused on colorectal and endometrial patients, using wearables to change physical activity behaviour [
9]. A recent review highlighted the integration of wearables in prostate cancer research, assessing daily activity and adherence to exercise interventions [
10]. While these studies underscore the importance of wearables for monitoring activity and exercise adherence, they often lack a temporal analysis of these factors.
The rise of wearable health monitoring devices generates a wealth of continuous, real-world patient data, but extracting meaningful insights requires effective analysis. Traditional methods often struggle with the complexity of these datasets, while advanced analytics techniques, particularly those using artificial intelligence (AI), can uncover patterns that manual reviews might miss [
11]. Traditional prediction methods typically focus on demographic and clinical characteristics [
4], often overlooking the temporal aspect of disease progression [
12]. There is growing interest in applying AI in healthcare, but many AI techniques lack explainability, affecting healthcare professionals’ trust. Explainable AI (XAI) and interactive models seek to build trust between doctors and AI systems [
13].
Interactive process mining [
14] offers methods to extract knowledge from data stored in event logs generated by health information systems and real-world data sources like sensors. This approach can model individual behaviours and stratify patients based on clinical and real-world data [
15]. By combining continuous data with advanced analytics, process mining facilitates the dynamic assessment of disease processes and individual risk profiles.
Few studies have integrated real-world data from wearable devices with self-reported data to develop personalised risk models for cancer. For example, one study [
16] introduced a cancer-specific predictive model identifying features linked to poor prognosis in patients with non-small cell lung cancer, breast cancer, and non-Hodgkin lymphoma using AI, clinical data, questionnaire responses, and wearable data. Another review discussed the benefits of activity monitoring devices in categorising physical activity levels and health outcomes for lung cancer patients [
17]. Additionally, another study [
18] applied machine learning algorithms to analyse self-reported physical activity and heart rate data, assessing cardiovascular disease-related metrics.
Despite these contributions, further research is necessary to explore the incorporation of time-varying perspectives into risk models, leveraging data from wearable devices and the analytical potential of AI techniques. Integrating self-reported outcomes and objective data from wearable devices could significantly enhance the development of personalised risk models that account for underlying time-varying risk factors.
This study aims to integrate wearable technology and self-reported measures to develop personalised risk models for older adult cancer survivors, focusing on time-varying risk factors. Using process mining techniques and collaborating with healthcare professionals, this study analyses data and develops dynamic risk models for vulnerability and anxiety among older adult cancer survivors. This study is related to challenges C8, C4 and C1 of the Process Mining in Healthcare Manifesto [
19]. Unlike static assessment methods, this approach allows us to account for time-varying risk factors and measure their impact on self-reported vulnerability and anxiety. We analysed data collected during the LifeChamps study and collaborated with healthcare professionals (HCPs) to co-design and develop two dynamic risk models. Our results reveal how activity and sleep patterns influence self-reported vulnerability and anxiety among the study population.
The subsequent sections of this paper explore the various layers of our investigation. We start with the background of our study, relating to older adult cancer survivors. After this, we describe the materials, methods, and dataset used to build the results. Then, this work presents the results achieved in the inquiry. Following this, we delve into discussing the results and their limitations. Finally, we present our conclusions.
2. Background: Older Adult Cancer Survivors
Older cancer survivors face unique challenges stemming from their age and specific characteristics, including multimorbidity, frailty, and the psychosocial impacts of cancer. These factors can complicate treatment and overall care, while age-related changes in physiology and cognitive function may hinder their ability to cope with cancer treatment and recovery. The 2021 report from the International Society of Geriatric Oncology (SIOG) identified key priorities for advancing care for older cancer patients globally [
20]. This collaborative effort highlighted the need for shared decision-making between healthcare providers and older patients and a multidimensional evaluation of their physical, cognitive, psychological, social, and functional status. Collecting and analysing real-world data from passive monitoring technologies can give HCPs a holistic view of older adult cancer patients’ functional status. Additionally, continuous data can support advanced analytics and predictive modelling, empowering healthcare teams to make informed, data-driven decisions [
21].
Older adults with cancer face unique challenges that contribute to feelings of vulnerability and anxiety. They often have decreased physiological reserves and an increased risk of treatment-related toxicity due to multimorbidity, frailty, and age-related changes [
6]. This heightened vulnerability can lead to concerns about functional decline and recovery from cancer treatments. Additionally, the cancer diagnosis itself can be profoundly stressful, with older patients worrying about its impact on their quality of life and independence and the potential burden on their loved ones.
These feelings of vulnerability and anxiety significantly affect the well-being and treatment outcomes of older cancer survivors [
22]. Measuring these issues through self-reported data is crucial, as it provides insights into patients’ perceived vulnerability and emotional impact. Validated assessment tools, like patient-reported outcome measures (PROMs), can effectively capture these subjective experiences [
23].
A notable relationship exists between anxiety disorders and sleep quality; older adults with anxiety often experience poor sleep, making sleep efficiency—defined as the percentage of time spent sleeping compared to the time in bed—an essential focus. A sleep efficiency below 80% is a significant mortality risk, while a sleep efficiency of 90% or above offers substantial health benefits [
24]. Additionally, anxiety and physical activity levels are interconnected, where anxiety can lead to sedentary behaviour and low physical activity can exacerbate anxiety symptoms [
25]. Understanding the relationship between vulnerability and physical activity is vital for improving health outcomes in older adults. Regular physical activity enhances physical function and quality of life during and after cancer treatment [
26].
Further research is essential to integrate a time-varying perspective into risk models by utilising data from wearable devices. This approach requires researchers to consider how health metrics change over time rather than relying on static snapshots of a patient’s health. Wearable devices provide continuous data on various health parameters, such as activity levels and sleep patterns, allowing for a more nuanced understanding of health evolution, which is crucial for accurate risk assessment. For example, fluctuations in physical activity or sleep quality can significantly affect a cancer survivor’s vulnerability to anxiety and depression. Researchers can develop personalised risk models by combining self-reported outcomes—like feelings of vulnerability and anxiety—with objective data from wearables. These models would identify underlying time-varying risk factors impacting older adult cancer survivors.
4. Results
This section presents the results designed and developed in the context of the LifeChamps study following a co-creation process with clinical experts in the cancer field and using data from real-life pilots.
We conducted 12 working sessions with the 36 HCPs participating across four pilot sites, focusing on changes in participants’ quality of life through self-perceived parameters. HCPs chose anxiety/depression and vulnerability as the dimensions for analysis. Below is a summary of the three co-creation sessions:
Session 1. Clinical experts discussed their interests in analysing anxiety/depression and vulnerability within the LifeChamps context. It was decided to combine self-reported patient data with objective data from activity trackers and smart scales. The session also introduced analysis tools, allowing experts to familiarise themselves with preliminary results. Process mining experts gained clinical insights to develop initial findings using the PHQ-4 and VES-13 questionnaires.
Session 2. Preliminary results were presented, and data curation was conducted with clinical experts. Feedback indicated a desire to include RR computations in the analysis. Experts requested the addition of various dynamic risk variables, including weight, body composition, activity levels, sleep metrics, cancer type, and age. The team also discussed appropriate cut-off points for these measures.
Sessions 3. The final sessions showcased refined results and trained clinical experts on the tool’s functionalities. The analysis incorporated the Body Mass Index (BMI), sleep efficiency, and daily activity (steps). The main goal was to empower experts to analyse results and derive relevant clinical insights autonomously.
We adhered to the methodology outlined in
Section 3 to develop the dynamic risk models by applying process mining techniques to the log created from the data sources detailed in
Table 1. Concretely, patients were asked to fill out questionnaires once per month during the three months of the study, plus one at the beginning.
Process mining uses recorded action logs to extract and visualise workflows, making processes more understandable. In this diagram, data are structured as event logs, where each event corresponds to a case (a single patient), an activity (e.g., daily steps), and a specific point in time. A set of events for one patient forms a case, while multiple cases create a log.
On the left side of
Figure 1, the log is represented as a table. Each row is an event, showing the patient ID (ID_ANON), the recorded activity (daily steps), the value (number of steps), and the date when the activity was recorded. Events from the same patient are grouped under the same ID. Then, this log is analysed using the PALIA algorithm after the recorded daily steps are discretised into three categories: sedentary (fewer than 5000 steps), medium (between 5000 and 10,000 steps), and active (more than 10,000 steps). The workflow on the right side visualises the extracted patterns. Nodes represent different activity levels (sedentary, medium, active, and start/end). The heat map colouring indicates the median time spent in each activity. Green nodes reflect shorter durations, while red nodes indicate longer durations. Arrows mean the transition between one activity and the following one. The colour intensity of the arrows reflects the frequency of these transitions. For example, a more intense red arrow indicates a higher frequency of patients moving between those states. Then, this workflow summarises the sequence of activity levels, the duration spent in each state, and the frequency of transitions between them. For example, it can be observed that patients frequently transition between medium and sedentary. Also, a significant amount of time being spent in the sedentary state is indicated by its reddish node colour, while transitions to and from the active state being less frequent is indicated by lighter-coloured arrows.
The heat map legend at the bottom assists in the interpretation of the colour intensities of nodes (median duration) and transitions (frequency). This workflow provides insights into patients’ activity patterns, helping to identify trends and areas for intervention.
4.1. Vulnerability Dynamic Risk Model
This dynamic risk model combines self-reported vulnerability measures using the VES-13 with objective information from the activity tracker. We considered that there was “No vulnerability Risk (0)” when none of the VES-13 questionnaires were positive (scores < 3) and “Vulnerability Risk (1)” when at least one VES-13 questionnaire was positive (score ≥ 3). It is worth mentioning that three VES-13 questionnaires were collected for all participants during the LifeChamps study.
Working with clinicians in the different sessions, we analysed the activity level reported by the activity tracker with the number of steps per day to discover groups of activity behaviour as the time-varying risk variable. As explained in
Section 3, we considered a sedentary day as one where fewer than 5000 steps were registered, medium activity as between 5000 and 10,000 steps being registered, and an active day as one where more than 10,000 steps were registered.
Applying clustering techniques (see
Section 3), we discovered groups of patients with similar activity behaviour, maximising differences with the rest of the groups to determine patterns. In this case, we used the threshold distance that maximises the differences between the traces reporting vulnerability and those that do not. Based on the activity level per day (sedentary, medium or active) and the transitions among days, five groups with different behaviours were discovered (see
Figure 2):
Medium to active: Patients with medium activity but a few active days.
Medium to sedentary: Patients with medium activity but with some sedentary days.
Active: Patients with an active pattern.
Long sedentary: Patients with a long sedentary pattern.
Sedentary: Patients with sedentary activity but with a few days in the medium or active categories.
These behaviour groups are represented in
Figure 2. The different workflows summarise the sequence of activity levels, the duration spent in each state, and the frequency of transitions between them for each behaviour (with the time-varying risk variable being the number of steps per day). Moreover,
Table 5 includes the distribution of patients in each group.
As outlined, the model represents patients’ activity based on their daily step count across three states: sedentary, medium, and active. Each state is depicted as a node, with arrows indicating transitions between states. Additionally, nodes labelled @Start and @End are included to signify the beginning and end points of the process, respectively. The model was enhanced with a heat map following the criteria and colour scheme collaboratively decided upon with the HCPs, ensuring the selected options effectively met their needs and preferences. These options are explained in each figure caption.
Then, we computed the RR, measuring the probability of reporting or not reporting vulnerability in the exposed groups. The computation results are included in
Table 6. In each case, the exposed group corresponds with a concrete behaviour (non-exposed the rest), and the event would be reporting or not reporting vulnerability. We computed the RR for both the vulnerable and non-vulnerable groups, as we wanted to investigate the relative danger of sleep and activity patterns in reporting vulnerability and the relative protection of other sleep and activity patterns. For that reason, we computed RR for both groups. Groups with an RR exceeding 1 with a 95% confidence interval are highlighted in bold.
The results in
Table 6 indicate that patients with a prolonged sedentary activity pattern have a 2.5 times higher risk of reporting vulnerability than those with other activity patterns, with a 95% confidence interval. This suggests a positive association between sedentary behaviour and the reported vulnerability outcome.
Continuing the analysis, we examined the potential association between cancer type (exposure) and vulnerability as the health outcome. We calculated the RR for the five cancer types included in the LifeChamps intervention, namely melanoma, prostate-curative, prostate-metastatic, breast-curative, and breast-metastatic, focusing on their relationship with reported vulnerability (see
Table 7).
The Dynamic Risk Model for Vulnerability revealed that prostate-metastatic patients had more than three times the risk of reporting vulnerability compared to patients with other cancer types. Conversely, prostate-curative patients had 1.22 times the probability of not reporting vulnerability compared to patients with other cancer types. The reported vulnerability has a positive association with the prostate-metastatic cancer type. Conversely, the prostate-curative cancer type appears to be a protective factor against reported vulnerability in the study.
We also worked with health experts to discover and analyse other behaviours and affectations thanks to the data collected, such as body composition and heart rate variability. However, no other interesting and positive affectations were discovered for the Vulnerability model.
4.2. Anxiety/Depression Dynamic Risk Model
The Anxiety/Depression Dynamic Risk Model combined self-reported anxiety data from the PHQ-4 questionnaire with objective measures from the activity tracker. We considered that there was “No anxiety and depression risk” when none of the PHQ-4 questionnaires had a positive score (scores < 3). Conversely, we classified participants as having “Anxiety or depression risk (1)” when at least one PHQ-4 questionnaire had a positive score (score ≥ 3).
By incorporating self-reported and sensor-based data, this model aimed to provide a more comprehensive assessment of the participant’s mental health status and associated risk factors. The objective activity data from the tracker complemented the subjective questionnaire responses, enabling a deeper understanding of the relationship between physical activity and emotional well-being.
As explained in
Section 3, we considered sleep to be inadequate or insufficient when the sleep efficiency was below 0.8, sleep to be normal or sufficient when the efficiency was between 0.8 and 0.89, and sleep to be beneficial or more than enough when it was 0.9 or over. Applying clustering techniques, we discovered three distinct groups of patients with similar sleep behaviour and some outliers. This analysis was based on the sleep efficiency per day and the transitions between sleep patterns across different days, considering the duration of the pilot study. In this case, we used the threshold distance that maximised the differences between the traces reporting anxiety/depression and those that did not. The three identified sleep pattern clusters can be characterised as follows:
Insufficient Sleep: Patients in this group exhibited a consistently insufficient sleep duration throughout the pilot.
High Sleep Dynamism/Insufficient with Dynamism: Patients in this group demonstrated a highly dynamic sleep pattern, with substantial variations in sleep efficiency across different days. Moreover, patients presented insufficient sleep on many days.
Balanced Sleep: Patients in this group maintained a relatively balanced sleep efficiency pattern across the different days of the pilot.
Figure 3 shows these behaviour groups for sleep efficiency. The different workflows represent the analysis of daily sleep efficiency using process mining techniques. The nodes and transitions provide insights into the patterns and transitions between sleep efficiency categories over time. Therefore, they summarise the sleep quality levels, the duration spent in each state/level, and the frequency of transition between them for each behaviour (the time-varying risk variable as the daily sleep efficiency). For example, the workflow for the insufficient with dynamism group highlights frequent fluctuations between insufficient and sufficient sleep efficiency. It reveals that individuals tend to spend more time in an insufficient state, suggesting a potential area for intervention to improve sleep quality. Furthermore,
Table 8 includes the distribution of patients in each group. In addition to these three primary sleep behaviour clusters, the analysis also identified some outliers (48 patients) that did not fit neatly into the defined patterns.
This model illustrates patients’ sleep behaviour based on their daily sleep efficiency, categorised into three states: insufficient, regular or normal, and beneficial (more than sufficient). Each state is represented as a node, with arrows indicating transitions between them. Additionally, nodes labelled @Start and @End are included to denote the beginning and end points of the process.
The analysis revealed that certain factors, such as anxiety disorder, could be more likely to impact sleep efficiency. To further explore these relationships, we computed the RR of the identified sleep patterns concerning the reported anxiety and depression. The RR measure allowed us to quantify the probability of reporting anxiety and depression among the different sleep behaviour groups. The results of this computation are presented in
Table 9.
The key finding of our analysis revealed that patients with highly dynamic sleep patterns—characterised by significant fluctuations in sleep efficiency across days—had a twofold increased risk of reporting symptoms of anxiety and depression compared to those in other sleep pattern groups. This finding indicates a strong association between unstable sleep behaviour and a heightened likelihood of experiencing anxiety and depression. By identifying these risk factors, we can develop targeted interventions to improve sleep quality and mental health outcomes for LifeChamps participants.
As detailed in
Section 3, there is also a notable connection between anxiety levels and physical activity. To further explore this relationship, we calculated the relative risks (RR) associated with different activity patterns (see
Figure 2) and their correlation with reported anxiety and depression. The results of this analysis are presented in
Table 10.
In the context of the Anxiety/Depression Dynamic Risk Model, our analysis uncovered a significant relationship between activity patterns and reported anxiety levels. Consistent with the findings from the Vulnerability Dynamic Risk Model (see
Section 4.1), patients who demonstrated prolonged sedentary behaviour were more than twice as likely to report anxiety symptoms compared to those in other activity groups. Notably, the analysis did not identify any specific activity patterns that served as protective factors against anxiety. This indicates that sedentary behaviour may be a considerable risk factor for elevated anxiety levels among LifeChamps participants.
Furthermore, no association was found between cancer type (exposure) and anxiety/depression outcomes when calculating relative risks. Based on the data collected, we collaborated with health experts to explore additional behaviours and factors, such as body composition and heart rate variability. However, our analysis did not reveal any significant positive influences related to the Anxiety/Depression model.
5. Discussion
Wearable devices have emerged as powerful tools in healthcare, enabling individuals to monitor their health and allowing providers to track this information proactively. The Fitbit Charge 4 and Withings Smart Scale Body+ exemplify how wearables enhance patient monitoring in clinical settings. Beyond data collection, the integration of AI significantly enhances the potential of these devices, facilitating in-depth analysis and supporting healthcare experts in making informed decisions.
Our approach does not aim to simply forecast disease occurrence in a strictly predictive sense but rather to model and analyse dynamic risk behaviours over time. This result is achieved by integrating data from wearable devices and self-reported questionnaires, providing objective and subjective insights into patient behaviour. We employed process mining techniques explicitly focusing on the interactive process mining (IPM) paradigm to achieve this.
Interactive process mining is a data-driven methodology that combines elements of machine learning like grammar inference techniques [
42] or hidden Markov models and statistical techniques such as multistate models [
43] with domain expertise to generate explainable models [
14,
44]. These models are co-created with HCPs, allowing them to interact with and refine the results based on their clinical knowledge. IPM emphasises transparency and interpretability, enabling clinicians to better understand the factors influencing patient behaviour. It is particularly valuable for mental health conditions, allowing for actionable insights to inform personalised treatment plans.
The innovation of this approach lies in its ability to provide a dynamic and interactive framework for analysing patient behaviours over time. By leveraging wearable data and self-reported measures, IPM models enable healthcare providers to uncover patterns in patient activity, sleep, and emotional well-being, which are critical for identifying and managing risks. Additionally, the explainable nature of these models ensures that HCPs can trust and effectively use the insights generated to improve patient outcomes.
This methodology significantly differs from traditional forecasting models, focusing on understanding and explaining patient behaviour in real time rather than solely predicting future states. By integrating clinical expertise into the modelling process, IPM fosters a collaborative and iterative approach to risk assessment, enhancing its applicability in real-world healthcare settings.
While some studies have utilised real-world data from wearable sensors, research on time-varying perspectives for determining risk factors in health outcomes remains limited. For example, ref. [
18] applied machine learning algorithms for cardiovascular disease risk classification, showcasing AI’s potential in predictive modelling. Similarly, ref. [
45] explores AI technologies in cancer prediction, noting that most research has prioritised improving diagnosis and prognosis through traditional medical histories. Numerous studies also examine AI’s role in cancer diagnosis and therapy [
46,
47].
Our work explores behavioural patterns derived from objective data collected via wearable sensors, examining their relationship with self-reported health outcomes to identify risky behaviours. By integrating real-world wearable data with subjective patient reports, we utilise process mining techniques to develop dynamic risk models for vulnerability and anxiety/depression in older cancer survivors. These models account for evolving, time-varying risk factors, offering a nuanced understanding of mental health over time.
Conducted within the LifeChamps study, our research involved data from 121 patients diagnosed with breast, prostate, or melanoma cancer across Greece, Sweden, Spain, and the UK. We engaged 36 HCPs in a co-creation framework, ensuring our outcomes directly address the real-world needs of patients and clinicians. This approach enhances the relevance of our findings and sets a precedent for future research in integrating wearable technology, AI, and patient-reported outcomes. Our findings include two dynamic risk models—one for vulnerability and another for anxiety/depression—analysed by computing relevant variables’ RR. This analysis investigates the impact of time-varying behaviours on these outcomes, providing actionable insights to enhance patient care and inform future interventions.
We developed two dynamic risk models that integrate self-reported data on vulnerability and anxiety/depression, measured through the PHQ-4 and VES-13 questionnaires via a smartphone app and objective data from wearable sensors. Specifically, we analysed daily activity levels (steps) and sleep efficiency using a Fitbit Charge 4, categorising activity as sedentary, moderate, or active and sleep efficiency as insufficient, regular, or beneficial. Using process mining techniques and clustering algorithms, we identified distinct patient groups based on their activity and sleep patterns, incorporating a time-varying risk perspective. For activity, we observed four behavioural patterns: active, sedentary, primarily sedentary with occasional moderate activity, and medium activity with some sedentary days. Regarding sleep efficiency, three patterns emerged: insufficient sleep, balanced sleep across categories, and a highly dynamic sleep pattern with frequent changes. The analysis revealed that prolonged sedentary behaviour significantly increases the risk of reported vulnerability, while dynamic sleep patterns are strongly associated with higher risks of reported anxiety among older adult cancer survivors. The study also found that older adult cancer survivors with metastatic prostate cancer are at a higher risk of reporting vulnerability compared to those with other cancer types, highlighting the need for targeted interventions for this group.
Identifying these time-varying risk factors can inform targeted interventions to improve vulnerability and anxiety outcomes for LifeChamps participants. Notably, no specific activity patterns emerged as being protective against anxiety, indicating that sedentary behaviour may significantly elevate anxiety levels among participants. Furthermore, no correlation was found between cancer type and anxiety/depression outcomes. Although other measures, such as body composition, did not yield significant associations, the findings suggest promising avenues for further research, especially with more extended studies and patients.
A key aspect of our study was the interactive and iterative approach employed in developing the dynamic risk models. Using co-design and co-creation methods, we aligned the models with clinical experts’ analytical needs, enhancing their understanding of the data and fostering meaningful dialogue between AI and clinicians. This collaboration transforms raw data into actionable insights that directly inform patient care. The iterative process further familiarises clinical experts with the analysis techniques, enhancing their confidence in utilising AI methodologies. This empowerment improves their ability to interpret results and nurtures a culture of innovation in clinical settings. Continuous engagement with data allows clinicians to refine their inquiries, leading to more targeted investigations and improved patient outcomes.
Integrating real-world and clinical data in oncology is essential for capturing patient perspectives and daily activities, allowing healthcare providers to tailor interventions more effectively. AI plays a critical role in this context by analysing large datasets from various sources, including wearables, to generate insights that guide clinical decision-making. Utilising process mining techniques enables a thorough examination of patient outcomes, helping to identify risk factors based on individual profiles. Incorporating wearables, such as the Fitbit Charge 4 and Withings Smart Scale Body+, into clinical monitoring significantly enhances patient engagement and equips HCPs with valuable tools for personalised care. As technology advances, the potential for wearables to transform clinical practice will likely increase, underscoring the need for ongoing research in this area. However, using wearable devices for clinical purposes also raises several important considerations that researchers and clinicians must address. These include ensuring data accuracy and reliability, fostering user compliance and engagement, addressing population bias, providing adequate contextual information, navigating data privacy and security concerns, and addressing regulatory challenges. While the wearable devices implemented in the LifeChamps study can enhance data collection in clinical trials, it is crucial for researchers to be aware of their inherent limitations and to use them appropriately based on specific clinical objectives. To maximise the value of wearable technology in clinical research, it is essential to tackle these challenges through thoughtful study design, comprehensive participant education, and robust data management practices. By addressing these factors, the integration of wearable devices can significantly contribute to improved patient outcomes and enrich the overall quality of care.
The data collected for this study are intended to demonstrate a relationship between the physical behaviour of the patient with data coming from multiple complementary sources: wearable devices (Fitbit Charge 4), in-home sensors, and validated questionnaires (PHQ-4 and VES-13). The wearables provide objective data on physical activity, sleep efficiency, and physiological variables. At the same time, the questionnaires capture subjective dimensions such as anxiety and vulnerability, offering a comprehensive view of dynamic risks. Additionally, the longitudinal approach (three months) allowed us to analyse temporal changes in behaviours and risks, and the diversity of the sample (121 patients from four countries with different cancer types and treatment stages) ensures the heterogeneity needed to validate the models. One limitation of this study is the exclusive focus on physical activity level and sleep quality as risk factors, while other potential predictors, such as medical history, psychological characteristics, or biomarkers, were not included. This choice was driven by considerations of future scalability and sustainability, as these indicators can be measured affordably and reliably using widely available technologies. Additionally, legal and ethical constraints across multiple clinical sites made the real-time collection of more sensitive data, such as medical histories or biomarkers, logistically unfeasible. Importantly, this study aimed to explore whether meaningful insights could be derived using only patient-reported data in a sustainable manner, aligning with the goals of the LifeChamps project.
This study is exploratory and aims to provide a proof of concept for the proposed methodology of modelling and analysing dynamic risk behaviours in older cancer survivors. While our sample size of 121 participants is limited, it was carefully selected to ensure diversity in cancer types (breast, prostate, and melanoma), treatment stages, and representation across multiple countries. The primary goal of this study is not to make generalisable statistical claims but to demonstrate the feasibility and potential of integrating wearable data, self-reported measures, and process mining techniques to uncover clinically actionable insights. Thus, there was no need to perform a simple size calculation [
48].
We acknowledge that the sample size restricts the generalisability of our findings to broader populations. However, this exploratory approach provides an important foundation for future research involving larger, more representative cohorts. By focusing on proof of concept, we aim to validate our methodology and highlight its potential for application in real-world healthcare settings, paving the way for more comprehensive studies in the future.
In reflecting on the ethical concerns associated with medical data processing, it is crucial to recognise the multifaceted nature of these issues. Our study’s adherence to stringent ethical guidelines, including those set by the European Union, the Declaration of Helsinki, and other international standards, underscores our commitment to protecting patient data. The ethical approval obtained from multiple review boards and the implementation of a Data Protection Impact Assessment (DPIA) highlight our proactive approach to identifying and mitigating data protection risks. Furthermore, the establishment of a Joint Controller/Data Sharing Agreement among all involved parties ensures a collaborative effort in maintaining data confidentiality and integrity. These measures not only address potential ethical concerns but also reinforce the importance of transparency, patient autonomy, and trust in medical research.
6. Conclusions
Our analysis yielded valuable insights into how specific activity and sleep behaviours affect self-reported vulnerability and anxiety and how wearable sensors can provide data on these behaviours, analysed through AI techniques.
Despite this study’s contributions, several limitations warrant consideration. First, while patient-reported outcome measures (PROMs) are typically viewed as quantitative, they rely on subjective patient ratings regarding emotional states. This subjectivity can introduce variability, influenced by factors such as individual emotional conditions and the perceived effectiveness of interventions.
Another potential limitation of this study is the possibility of measurement errors associated with the wearable sensors used, particularly in tracking physical activity levels in older adult individuals. This issue arises from this study’s sustainability-driven approach, which prioritised the use of affordable and widely accessible devices. However, as this study was a feasibility study rather than a clinical trial, the level of accuracy provided by these devices was considered sufficient for this study’s objectives. Additionally, the interactive methodology (IPM) employed opens the possibility of applying advanced process mining data cleaning techniques, such as those described in [
49], to collaboratively detect and address data inaccuracies in partnership with HCPs, thus enhancing the reliability of future applications.
Another limitation is that the RR values presented in
Table 6,
Table 7,
Table 9 and
Table 10 are based on small subsamples, sometimes comprising only 20–30 participants. This may result in wide confidence intervals and restrict the robustness of the conclusions. However, as this study is based on feasibility data, the objective was not to derive definitive clinical evidence but to explore the statistical behaviour of the population. The RR calculations provide preliminary insights that, when combined with the visual graphical models, allow HCPs to investigate patterns and generate ideas for personalised treatments. This exploratory approach highlights the potential for future research while acknowledging the limitations of the current findings.
This study involved 121 unique patients, allowing for diverse representation across demographics and cancer types and limiting generalisability. While the sample size meets current recommendations for feasibility studies [
48,
50], a larger cohort could strengthen the findings. Additionally, the LifeChamps study included four pilot sites, each with distinct healthcare contexts. Although all adhered to the same protocol, these differences may have affected participant perceptions and responses, introducing potential bias while enriching the insights from a transnational perspective.
While sufficient for initial observations, the three-month duration of the study may not capture the full range of fluctuating behaviours like activity and sleep, suggesting that more extended observation periods are needed for comprehensive analysis.
In summary, this study provides important insights into the dynamics of patient-reported outcomes and the role of wearable devices in monitoring variables affecting vulnerability and anxiety. However, the limitations related to subjectivity, sample size, and duration necessitate careful interpretation within their specific context. The preliminary results align with the existing literature, paving the way for future research.
Future studies should address these limitations by extending the observation period to capture fluctuations in activity and sleep behaviours throughout the cancer treatment journey. Incorporating additional data sources, such as electronic health records and physiological metrics or environmental factors, could provide a more holistic understanding of the factors influencing health outcomes, especially in older adult cancer survivors. Data collected during this study by the activity tracker included physiological data such as the heart rate variability or the breath rate. We worked with health experts to analyse these data, and although no interesting affectations were discovered in our case, more research is needed in this field.
This research underscores the potential for future studies to deepen our understanding of how activity and sleep behaviours impact self-reported vulnerability and anxiety, mainly through wearables and AI techniques. Longitudinal studies across various chronic conditions, including diabetes and heart disease, could reveal commonalities and differences in behavioural impacts on mental health. By building on this foundational work, future research can enhance the findings’ generalisability and practical relevance. It will facilitate targeted intervention design and foster integrated care approaches, ultimately improving treatments and outcomes for diverse populations managing chronic diseases. The methodology tested in this study was applied to a specific dataset, focusing on analysing concrete processes within the given population. As such, the findings are not intended to be generalisable at this stage. However, as more data become available and become statistically representative, the same methodology can be used to create models that represent broader populations. Additionally, the methodology is adaptable and reproducible across different contexts. By applying the study design to other populations, differences in behaviour and outcomes can be analysed, and deviations in processes can be identified using advanced process mining techniques. This adaptability, enabled by the visual behaviour analysis of the interactive process mining approach, is an advantage that supports future research in diverse settings.