Next Article in Journal
Detection of Carcinoma-Associated Fibroblasts Derived from Mesothelial Cells via Mesothelial-to-Mesenchymal Transition in Primary Ovarian Carcinomas
Next Article in Special Issue
From Real-World Data to Causally Interpretable Models: A Bayesian Network to Predict Cardiovascular Diseases in Adolescents and Young Adults with Breast Cancer
Previous Article in Journal
Performance of Contrast-Enhanced Mammography (CEM) for Monitoring Neoadjuvant Chemotherapy Response among Different Breast Cancer Subtypes
Previous Article in Special Issue
Lung Cancer and Air Quality in a Large Urban County in the United States
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Use Cases Requiring Privacy-Preserving Record Linkage in Paediatric Oncology

1
Center for Health and Bioresources, AIT Austrian Institute of Technology, 8020 Graz, Austria
2
Institute of Neural Engineering, Graz University of Technology, 8010 Graz, Austria
3
St. Anna Kinderkrebsforschungs GmbH, 1090 Wien, Austria
4
Institut Curie, 75005 Paris, France
*
Author to whom correspondence should be addressed.
Current address: Department of Research and Development, Unicancer, Immuno-Oncology Group, 94270 Le Kremlin-Bicêtre, France.
Cancers 2024, 16(15), 2696; https://doi.org/10.3390/cancers16152696
Submission received: 29 June 2024 / Revised: 24 July 2024 / Accepted: 24 July 2024 / Published: 29 July 2024
(This article belongs to the Special Issue Advances in Cancer Data and Statistics)

Abstract

:

Simple Summary

Large datasets concerning childhood cancers are rare. Therefore, it is important to fully exploit all available data, which are distributed over several resources, including biomaterials, images, clinical trials, and registries. With privacy-preserving record linkage (PPRL), datasets can be merged, without disclosing the patients’ identities. Although PPRL is already implemented or described in various settings, use case descriptions are fragmented and incomplete. The present paper gives an overview of current and future use cases of PPRL in childhood cancer. We screened the literature, projects, and trial protocols, analysed a hypothetical patient journey, and discussed use cases with experts. All the identified use cases were structured along six key dimensions. We conclude that PPRL is a key concept in childhood cancer. Therefore, PPRL strategies should already be considered when starting research projects, to avoid distributed data silos, to maximise the knowledge derived from collected data, and, ultimately, to improve outcomes for children with cancer.

Abstract

Large datasets in paediatric oncology are inherently rare. Therefore, it is paramount to fully exploit all available data, which are distributed over several resources, including biomaterials, images, clinical trials, and registries. With privacy-preserving record linkage (PPRL), personalised or pseudonymised datasets can be merged, without disclosing the patients’ identities. Although PPRL is implemented in various settings, use case descriptions are currently fragmented and incomplete. The present paper provides a comprehensive overview of current and future use cases for PPRL in paediatric oncology. We analysed the literature, projects, and trial protocols, identified use cases along a hypothetical patient journey, and discussed use cases with paediatric oncology experts. To structure PPRL use cases, we defined six key dimensions: distributed personalised records, pseudonymisation, distributed pseudonymised records, record linkage, linked data, and data analysis. Selected use cases were described (a) per dimension and (b) on a multi-dimensional level. While focusing on paediatric oncology, most aspects are also applicable to other (particularly rare) diseases. We conclude that PPRL is a key concept in paediatric oncology. Therefore, PPRL strategies should already be considered when starting research projects, to avoid distributed data silos, to maximise the knowledge derived from collected data, and, ultimately, to improve outcomes for children with cancer.

Graphical Abstract

1. Introduction

Over half a century, consecutive clinical trials have improved survival rates across the spectrum of paediatric oncology (PO). Nevertheless, more than 6000 children die from cancer in Europe each year [1]. Since large datasets in PO are inherently rare, it is paramount to fully exploit all available data. Current PO research data are distributed over several resources holding biomaterials and their respective analysis, various image modalities, and clinical trial and registry data. In addition to clearly defined research goals in prospective settings, secondary use of data opens up important research and knowledge generation opportunities. However, secondary use is associated with various hurdles, including technical issues, legal barriers due to increasingly complex regulations, and ethical discussions about explicit informed consent for each analysis as compared to broad or even opt-out consent. Due to these and other reasons, most data are currently locked up in data silos.
The following levels of privacy can be distinguished:
  • Personalised data, containing personal unique identifiers (IDs) or quasi-identifiers (QIDs), such as name, data of birth, and others, as defined by [2].
  • Pseudonymised data, containing patient codes (“pseudonyms”) that can be associated with related QIDs, if necessary.
  • Anonymised data, which cannot be associated with related QIDs anymore.
  • Aggregated data, e.g., statistical results as published in journals.
According to the General Data Protection Regulation (GDPR) [3], whenever possible, research data need to be collected in a pseudonymised way. As such, in general, different resources must generate different pseudonyms for one and the same patient. To overcome barriers due to distributed (pseudonymised) data, the European Joint Programme on Rare Diseases (EJP RD) is currently creating an effective rare diseases research ecosystem (https://www.ejprarediseases.org/ (accessed on 23 July 2024)). The EJP RD is based on the FAIR (Findable, Accessible, Interoperable, and Re-usable) principles [4], which are key to exploit existing data and support big data and artificial intelligence applications, as recently demanded by the European Society for Paediatric Oncology (SIOPE) [5,6].
Privacy-preserving record linkage (PPRL) concerns the linking of personalised or pseudonymised datasets, without disclosing the patients’ identities. PPRL is a key concept for PO research [7]. During pseudonymisation, PPRL can avoid unintended double registration, which would lead to empty or fragmented pseudonymised records and—if followed by double randomisation—unbalanced cohorts.
PPRL is required for (a) merging patients from contexts with identical variables and/or (b) merging different data types from multiple sources holding the same patients (Figure 1).
Most PPRL publications focus on the technical implementation of PPRL algorithms. In general, PPRL algorithms can follow phonetic coding-based (e.g., [8,9]), hashing-based (e.g., [10,11]), reference-value-based (e.g., [12,13]), embedding-based (e.g., [14,15,16]), differential privacy-based (e.g., [17]), or secure multiparty computation-based techniques, which can either be applied on QIDs or on the clinical data themselves (e.g., [18,19,20,21,22]). Some solutions only support linkage in case of perfectly matching records. More comprehensive solutions can also link slightly differing records, e.g., in case of typing errors or missing data. Since both false positive as well as false negative linkage may have severe consequences, the optimal threshold of accordance must be chosen depending on the use case.
To compare PPRL solutions, Vatsalan et al. [23] and Gkoulalas-Divanis et al. [24] published PPRL overview papers, each with one out of many dimensions relating to application areas. A comprehensive overview of various aspects of PPRL was published by Christen et al. [25]. Their introduction summarises several aspects of why data should be linked at all and which sources of data might be linked, and they describe selected use cases. In particular, use cases focusing on health service research, national health insurance data in Germany, official statistics, and longitudinal studies are considered herein. In the conclusions of three recently published whitepapers [26,27,28], the European Union Agency for Cybersecurity (ENISA) highlights the importance of pseudonymisation and PPRL in distributed research environments as well as the need for cooperation in terms of implementation, development of application scenarios, regulatory support, and dissemination of best practice models. The use cases described especially in [26] represent a valuable resource for the present paper. PPRL is also expected to play a major role in the upcoming European Health Data Space (EHDS), which is supposed to provide a framework for structured and systematic secondary use of health data in the EU on a large scale [29].
For PO in Europe, there are mainly two PPRL services currently available: (1) SPIDER (https://eu-rd-platform.jrc.ec.europa.eu/spider/ (accessed on 23 July 2024)) focusses on PPRL between primary sources holding QIDs with perfect matches only (no typing errors, etc.). (2) The European Patient Identity (EUPID) Services [30] were designed to also link pseudonymised resources. EUPID further supports PPRL in case of typing errors based on phonetic hashing algorithms. The EJP RD virtual platform currently supports EUPID.
PPRL has been implemented in various PO settings. However, the related use cases have been described implicitly, based on the requirements of the respective projects. Use cases specified in the literature are fragmentary and spread over various publications. Therefore, the present paper provides a comprehensive overview of current and future use cases for PPRL in PO.

2. Materials and Methods

We followed three strategies to compile a comprehensive overview of PPRL use cases in PO.
First, we analysed the literature, existing PO projects, and trial protocols concerning how and why they implemented PPRL. For the literature search, we applied the following search terms in PubMed:
((((“record” OR “dataset” OR “registr*”) AND (“link*” OR “merg*” OR “combin*”)) AND (“privacy” OR “GDPR” OR “data protection” OR “pseudonym*” OR “anonym*” OR “de-identif*” OR “deidentif*” OR “leak*”)) OR “PPRL”) AND ((“paediatr*” OR “pediatr*” OR “child*” OR “infant”) AND (“cancer” OR “oncolog*” OR “tumor” OR “tumour”)).
Titles and abstracts and, in a second step, full-text publications were manually filtered to include only papers containing information concerning use cases of PPRL in PO, and the relevant use cases were extracted. The search terms revealed 85 articles, of which 23 articles containing use cases of PPRL in PO were considered.
Secondly, inspired by a recent paper by Ly et al. [31], we identified use cases along a hypothetical patient journey, as shown in Figure 2.
Finally, we discussed the identified use cases with stakeholders from (a) SIOPE and (b) the EJP RD to complement the list and to firm up the single use cases.
The identified use cases were then analysed and categorised along key dimensions.

3. Results

3.1. Overview of Dimensions in Privacy-Preserving Record Linkage

During our research, we found that a one-dimensional list of use cases is not feasible due to the variety of different PPRL applications in PO. Therefore, we categorised use cases along key dimensions, as illustrated in Figure 3.
Section 3.2 describes single dimensions of Figure 3 (“horizontal” slices), i.e., distributed personalised/pseudonymised data sources, record linkage/linked data, and data analysis, including PO examples.
Section 3.3 summarises multi-dimensional use cases (“vertical” paths through Figure 3), based on the EJP RD use case specification format: “As a <stakeholder>, I would like to <research focus> so that I can <objective> and visualise the result as a <output format>” (Terms between “<” and “>” were replaced by use-case-specific contents).

3.2. Single-Dimensional Use Cases

3.2.1. Distributed Personalised/Pseudonymised Data Sources

This chapter summarises the rationale for why PO data are distributed not only in routine care but also in research-related data environments.

Multiple Hospitals

Patients might be treated in more than one hospital over time, leading to distributed personalised records, thus increasing the risk of double registration.
PO example—Specialised and local hospitals: Patients with complex or rare diagnoses move from local to specialised hospitals. Vice versa, less complex cases are transferred from specialised to local hospitals.
PO example—Second opinion: Due to the severity of paediatric cancer, parents often seek a second opinion from another hospital.

Different Types of Data

Pseudonymised research data of different types are often distributed over different sources.
PO example—Data sources in neuroblastoma research: Neuroblastoma trial data are collected in several electronic data capture systems. Additionally, there are national and international registries (e.g., the French National Registry of Childhood Cancers [29]). Bio-sample data are stored in biobanks, some of which can be queried via BBMRI-ERIC [32]. Sequencing results can be stored in the RD-Connect Genome-Phenome Analysis Platform (GPAP) [33]. DICOM images are collected on a pseudonymised image management server.

Suspected Diagnosis

Eligibility criteria for clinical trials often require a confirmed diagnosis, although confirming a suspected diagnosis may already provide valuable research data (e.g., tumour samples in a biobank).
PO example—SIOPEN BIOPORTAL: SIOPE Neuroblastoma (SIOPEN) is currently establishing a “SIOPEN BIOPORTAL” (NCT05192980) (https://clinicaltrials.gov/study/NCT05192980 (accessed on 23 July 2024)), which supports registration and data collection based on suspected diagnoses with a specific BIOPORTAL pseudonym. This pseudonym can be used to send biological samples to reference laboratories. After confirmation of the diagnosis, the patient is registered to the corresponding trial (e.g., low/high risk) and a second trial-specific pseudonym is generated.

Multiple Subsequent Trials

Throughout their lifetime, patients can be included in one or more trials.
PO example—Poor response: If a high-risk neuroblastoma patient enrolled in an upfront treatment trial such as HR NBL2 (NCT04221035) (https://clinicaltrials.gov/study/NCT04221035 (accessed on 23 July 2024)) did not respond to the induction chemotherapy, the patient was transferred to an even more intensified treatment scheme, e.g., VERITAS (NCT03165292) (https://clinicaltrials.gov/study/NCT03165292 (accessed on 23 July 2024)).
PO example—Relapse: In case of a relapse, patients previously treated in a primary tumour study were frequently enrolled to relapse studies, such as BEACON (NCT02308527) (https://clinicaltrials.gov/study/NCT02308527 (accessed on 23 July 2024)). Subsequently, patients could further be enrolled to national or international precision medicine programs or phase 1 or 2 Innovative Therapies for Children with Cancer (ITCC) trials like ESMART (NCT02813135) (https://clinicaltrials.gov/study/NCT02813135 (accessed on 23 July 2024)).

Multiple Concurrent Trials

There are trials that allow for or even foresee the participation in other trials at the same time.
PO example—OMS trial: The Opsoclonus Myoclonus Syndrome (OMS) trial (https://clinicaltrials.gov/study/NCT01868269 (accessed on 23 July 2024)) collects data concerning OMS, a rare syndrome in children that appears frequently in patients with neuroblastoma. Neuroblastoma patients may additionally be treated according to neuroblastoma trial protocols, where oncological data are collected. Therefore, the OMS infrastructure links OMS-specific trial data with oncological data from different neuroblastoma trials [34].

Transition from a Trial to Long-Term Follow-Up

Long-term follow-up data are essential but often not collected in trial databases or only for short periods.
PO example—Long-term follow-up: Since PO patients are frequently treated in more than one trial, long term follow-up data should be preserved for all trials, even for decades, without re-entering data in each context. The SIOPEN BIOPORTAL envisages reporting the long-term follow-up data within a neuroblastoma registry, which includes core, patient, tumour, and epidemiological data, with the aim to operate for longer periods.

Transition from Routine Care to Research (and Back)

For any trial patient, specific data are collected in routine care and others in research infrastructures, requiring transition from one to the other.
PO example—Continuity of care: With improving treatment options, continuity of care for PO survivors beyond cancer treatment is becoming increasingly important. SIOPE is working on guidelines dealing with the transition from paediatric patients to adolescent or adult survivors [35].
PO example—Late effects: The majority of the nearly 500,000 PO survivors in Europe experience late effects, with increasing incidence over the lifetime and impacts on their daily life [36]. Identification of late effects, even decades after the treatment, and development of long-term follow-up guidelines require transitions from research to routine care and back.
PO example—End-of-life treatment: Henson et al. used linked data to analyse prescriptions dispended in the community during end-of-life care of cancer patients [37].

3.2.2. Record Linkage/Linked Data

Temporarily Linked Data

Based on related pseudonyms of distributed contexts, data are temporarily linked in a central service, specific calculations are applied to the linked data, and the result is returned to the user.
PO example—Counting shared patients: Via the EJP RD virtual platform, users will be able to count patients that are registered in two different contexts, such as the HR-NBL1 trial and the International Neuroblastoma Risk Group (INRG) databases.

Linked Data Stores and Registries

Linked data can be stored in specific data stores, either to support subsequent tasks or to serve as a long-term data source (e.g., a linked registry). Therefore, regular updates must be supported, including the addition of new data, completion of previously missing data, or correction of data.
PO example—INRG database: Based on agreements between SIOPEN and other research organisations (e.g., US Children’s Oncology Group, COG), core PO trial data are submitted to the INRG-DB [38] after publication. The HR-NBL1 trial [39] was amended several times. Over the years, different randomisations were amended, and the corresponding results were successively published. Some patients were randomised more than once (e.g., pre-induction and pre-maintenance). Data of such patients were sent to the INRG-DB once per randomisation, and PPRL ensured that such patients were not registered twice.
PO example—The international diffuse intrinsic pontine glioma registry [40] provides an infrastructure for acquisition of biological specimens, imaging, and correlative clinical and genomics data to facilitate basic and translational research studies in diffuse intrinsic pontine glioma from 55 collaborating institutions in the United States, Canada, Australia, and New Zealand, including clinical, demographic, radiologic, and pathologic data.

3.2.3. Data Analysis

Counts, Statistical Measures, and Statistical Plots

Counting linked data without counting any patient twice and/or including filters (e.g., age < 3 years) requires PPRL. Statistical analyses can be applied on linked data, providing specific measures (e.g., mean value) or statistical plots (e.g., Kaplan–Meier curve).
PO example—Long-term toxicity: Calculate the rate of toxicity in patients who have been treated with a specific drug (e.g., cisplatin) and had a specific symptom thereafter (e.g., ototoxicity) in late-effect research scenarios. Or evaluate the effect of cancer treatment on paternity through use of assisted reproduction technology [41].
PO example—Large-scale statistics: Various studies applied PPRL to perform large-scale statistical analyses on cancer data, e.g., to analyse long-term survival after paediatric cancer [42] or after cancer in general (including PO) [43], to apply benchmarking on paediatric cancer survival [44], to develop a national resource of patient-level genomics laboratory records including PO patients [45], or to accelerate research on pontine glioma [40]. Walker et al. applied PPRL to analyse the prevalence of PO and other conditions increasing the risk of severe COVID-19 disease [46]. Finally, PPRL was applied to determine the incidence of melanoma [47] and of childhood cancer in general [48].
PO example—Effect of specific parameters on PO incidence: Analyse the effect of specific parameters from one dataset on the PO incidence as documented in another source (see, e.g., [49,50,51,52,53,54]).
PO example—Healthcare management: Use linked datasets for cost effectiveness analyses [55], data quality assessment [43], benchmarking [56], or capacity building [57].

Tables with Subsets of Linked Data

Subsets of linked data are provided to the user.
PO example—List of follow-ups of a patient: Provision of a table listing all follow-ups of a specific patient, documented in different data sources, including date of follow-up, originating data source, and the documented status of the patient.

Single Case Visualisation

Distributed data of a single patient are comprehensively visualised, either in a personalised (healthcare) or pseudonymised (research) context.
PO example—Tumour boards: Tumour boards enhance cancer treatment strategies. In the realm of PO, these boards are frequently conducted in a virtual setting, facilitating the collaboration of external experts with local physicians (see, e.g., [58]). Consequently, it is advisable to share only pseudonymised data that have emerged from both routine care (e.g., images) and research (e.g., randomisation arms).

Patient Apps and Digital Companions

Patient apps and digital companions represent a powerful way to provide outcomes of PPRL not only to researchers but also to patients.
PO example—Survivorship passport: The survivorship passport links core treatment data with follow-up recommendations based on guidelines [59]. It will closely be linked to electronic medical records and to national cancer and survivorship registries, supporting regular updates and generation of care plans. The implementation of the survivorship passport is currently being piloted in six European countries [60].

Artificial Intelligence (AI)

To tap the full potential of AI, PPRL of distributed data sources is essential.
PO example—Image-defined risk factors: Image-defined risk factors (tumour size, anatomical regions, etc.) are key for risk-adjusted treatment of cancer. However, raw images either remain in the local centre’s Picture Archiving and Communication System (PACS) or they are separately stored on specific image management servers. In the PRIMAGE project [61], DICOM-formatted raw image data from a PO image management server were integrated with clinical trial data and stored on the PRIMAGE platform. AI was applied to predict outcomes from the images.

3.3. Multi-Dimensional Use Cases

Table 1 presents a selection of identified high-priority use cases following specific “vertical” paths along the dimensions illustrated in Figure 2, represented in the format developed in the EJP RD project.

4. Discussion

PO stands to benefit significantly from PPRL due to the scarcity and distribution of data, multimodal treatments, and complex patient journeys. This paper provides a comprehensive overview of potential use cases, emphasising that the list is not exhaustive and new dimensions may emerge in the future. While focusing on PO, most aspects discussed are also applicable to other conditions, rare diseases in particular.
Most identified use cases concern the analysis of clinical outcomes (long-term toxicities, event-free survival, overall survival) and the effect of certain parameters on the incidence/prevalence of PO. However, we identified a wide range of use cases, which require PPRL in PO care and/or PO research.
Many identified use cases concerned neuroblastoma research, where a European IT infrastructure supporting record linkage on clinical trial data, biological data from different samples, and imaging data has already been introduced in the FP5 project SIOPEN-R-NET. During the FP7 project ENCCA (2011–2015), the EUPID Services have been developed, which provide a more privacy-preserving approach for linking records. Since then, PPRL services have continuously supported collaborative European neuroblastoma research by providing large, linked datasets, which contribute to significantly improved survival rates as achieved in the past decade.
PPRL is a complex task, which is associated with various risks in terms of usability, costs, privacy preservation, potential misuse, the risk of breaking confidentiality, the risk of mismatched identities, etc. Different technical approaches for realising PPRL have different advantages and disadvantages in all these dimensions. Special care should be taken if PPRL is applied based on QIDs that can change over time (e.g., names in case of marriages, gender, etc.), especially if PPRL is to be supported in long-term settings, such as, e.g., the survivorship passport [59]. The present paper, however, concentrates on use cases, omitting technical details, privacy and security aspects, and consent considerations.
The use cases in this paper were identified in a three-step approach (screening of the literature, projects, and trials, hypothetical patient journey, and discussion with experts). On the one hand, the literature search was conducted by one researcher only. Therefore, use cases might have been overlooked that would have been identified if screening had been performed by more researchers. However, due to the multistep approach, such use cases were most likely identified in another step. On the other hand, even articles that did not cover PPRL and PO were considered, if use cases that are also applicable for PPRL in PO were described.
Selecting the right PPRL solution before starting data collection is crucial, especially for pseudonymised record linkage. The lack of standards for and interoperability between PPRL services is acknowledged, and efforts within EJP RD are underway to address this. References to implementation projects are provided for some use cases, with varying stages of development. The evolution from project design to pseudonymisation and linked data analyses is noted. As applications progress, PPRL is recognised as a key concept of a broader privacy-preserving research landscape. Integration with other approaches like privacy-preserving artificial intelligence and federated/distributed learning is anticipated. For PO in Europe, currently, mainly SPIDER and the EUPID Services are implemented, which should be considered when setting up new projects. Choosing the appropriate privacy-preserving strategy will be a key consideration in future research across diverse fields, not limited to PO.

5. Conclusions

Research in PO has been at the forefront of PPRL implementations, making significant strides in realising tangible outcomes from linked records. As PPRL gains prominence across various application domains, both existing and new use cases are anticipated to evolve. The long-term nature of underlying diseases underlines the importance of adopting a robust PPRL strategy early on. By doing so, the benefits of linked data can be harnessed sooner, ensuring that datasets collected without a PPRL concept do not miss out on future applications and fall short of maximising patient benefits.

Author Contributions

Conceptualisation, D.H., G.S. (Gudrun Schleiermacher), R.L. and G.S. (Guenter Schreier); methodology, D.H. and K.K.; validation, K.K., E.S., B.J., M.F., V.D. and P.D.-M.; formal analysis, D.H., E.S., B.J., V.D. and P.D.-M.; investigation, D.H., K.K., E.S., M.B., B.J., M.F., V.D., P.D.-M., G.S. (Gudrun Schleiermacher), R.L. and G.S. (Guenter Schreier); writing—original draft preparation, D.H.; writing—review and editing, D.H., K.K., E.S., M.B., B.J., M.F., V.D., P.D.-M., G.S. (Gudrun Schleiermacher), R.L. and G.S. (Guenter Schreier); visualisation, D.H.; supervision, G.S. (Gudrun Schleiermacher), R.L. and G.S. (Guenter Schreier); project administration, D.H.; funding acquisition, G.S. (Gudrun Schleiermacher), R.L. and G.S. (Guenter Schreier). All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the European Joint Programme on Rare Diseases (EJP RD, grant number 825575).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

D.H., K.K., E.S., M.B., B.J., M.F. and G.S. (Guenter Schreier) are employed at the AIT Austrian Institute of Technology, the provider of the EUPID Services. The remaining authors declare no conflict of interest.

Abbreviations

AIArtificial Intelligence
COGUS Children’s Oncology Group
EHDSEuropean Health Data Space
EJP-RDEuropean Joint Programme on Rare Diseases
ENISAEuropean Union Agency for Cybersecurity
EUPIDEuropean Patient Identity
FAIRFindable, Accessible, Interoperable, and Re-usable
GDPRGeneral Data Protection Regulation
GPAPRD-Connect Genome Phenome Analysis Platform
INRGInternational Neuroblastoma Risk Group
ITCCInnovative Therapies for Children with Cancer
OMSOpsoclonus Myoclonus Syndrome
PACSPicture Archiving and Communication System
POPaediatric Oncology
PPRLPrivacy-Preserving Record Linkage
PSNPseudonym
QIDQuasi-Identifier
SIOPEEuropean Society for Paediatric Oncology
SIOPENSIOPE Neuroblastoma

References

  1. World Health Organization. International Agency for Research on Cancer Cancer Today; World Health Organization: Geneva, Switzerland, 2020. [Google Scholar]
  2. Office for Civil Rights. Guidance Regarding Methods for De-identification of Protected Health Information in Accordance with the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule; Office for Civil Rights: Washington, DC, USA, 2012.
  3. The European Parliament and of the Council. Regulation (eu) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the Protection of Natural Persons with Regard to the Processing of Personal Data and on the Free Movement of Such Data, and Repealing Directive 95/46/EC (General Data Protection Regulation); The European Parliament and of the Council: Brussels, Belgium, 2016. [Google Scholar]
  4. Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef] [PubMed]
  5. SIOP Europe; CCI Europe; PANCARE. A Europe for Children & Adolescents with Cancer: Don’t Leave Them Behind—Position Paper of the European Paediatric Cancer Community; SIOP Europe: Woluwe-Saint-Lambert, Ethiopia, 2022. [Google Scholar]
  6. SIOP Europe; CCI Europe; PANCARE. International Childhood Cancer Awareness Day (ICCD2020): Artificial Intelligence and Big Data: Towards the Next Frontier in Paediatric Cancer Research & Innovation in Europe; SIOP Europe: Woluwe-Saint-Lambert, Ethiopia, 2020. [Google Scholar]
  7. Vassal, G.; Lazarov, D.; Rizzari, C.; Szczepański, T.; Ladenstein, R.; Kearns, P.R. The impact of the EU General Data Protection Regulation on childhood cancer research in Europe. Lancet Oncol. 2022, 23, 974–975. [Google Scholar] [CrossRef] [PubMed]
  8. Karakasidis, A.; Verykios, V. Secure Blocking + Secure Matching = Secure Record Linkage. J. Comput. Sci. Eng. 2011, 5, 223–235. [Google Scholar] [CrossRef]
  9. Etienne, B.; Cheatham, M.; Grzebala, P. An Analysis of Blocking Methods for Private Record Linkage. In Proceedings of the AAAI Fall Symposia Symposium Series, Arlington, VA, USA, 17–19 November 2016; pp. 244–248. [Google Scholar]
  10. Dusserre, L.; Quantin, C.; Bouzelat, H. A one way public key cryptosystem for the linkage of nominal files in epidemiological studies. Medinfo 1995, 8 Pt 1, 644–647. [Google Scholar] [PubMed]
  11. Quantin, C.; Bouzelat, H.; Allaert, F.A.; Benhamiche, A.M.; Faivre, J.; Dusserre, L. How to ensure data security of an epidemiological follow-up: Quality assessment of an anonymous record linkage procedure. Int. J. Med. Inform. 1998, 49, 117–122. [Google Scholar] [CrossRef]
  12. Pang, C.; Gu, L.; Hansen, D.; Maeder, A. Privacy-Preserving Fuzzy Matching Using a Public Reference Table. In Intelligent Patient Management; McClean, S., Millard, P., El-Darzi, E., Nugent, C., Eds.; Springer Berlin Heidelberg: Berlin/Heidelberg, Germany, 2009; pp. 71–89. [Google Scholar]
  13. Vatsalan, D. Scalable and Approximate Privacy-Preserving Record Linkage; Australian National University: Canberra, Australia, 2014. [Google Scholar]
  14. Scannapieco, M.; Figotin, I.; Bertino, E.; Elmagarmid, A.K. Privacy preserving schema and data matching. In Proceedings of the ACM SIGMOD Conference, Beijing, China, 11–14 June 2007. [Google Scholar]
  15. Yakout, M.; Atallah, M.J.; Elmagarmid, A. Efficient Private Record Linkage. In Proceedings of the 2009 IEEE 25th International Conference on Data Engineering, Shanghai, China, 29 March–2 April 2009; pp. 1283–1286. [Google Scholar]
  16. Bonomi, L.; Xiong, L.; Chen, R.; Fung, B.C.M. Frequent grams based embedding for privacy preserving record linkage. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, Maui, HI, USA, 29 October–2 November 2012; pp. 1597–1601. [Google Scholar]
  17. He, X.; Machanavajjhala, A.; Flynn, C.; Srivastava, D. Composing Differential Privacy and Secure Computation: A Case Study on Scaling Private Record Linkage. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, Dallas, TX, USA, 30 October–3 November 2017; pp. 1389–1406. [Google Scholar]
  18. Inan, A.; Kantarcioglu, M.; Bertino, E.; Scannapieco, M. A Hybrid Approach to Private Record Linkage. In Proceedings of the 2008 IEEE 24th International Conference on Data Engineering, Cancun, Mexico, 7–12 April 2008; pp. 496–505. [Google Scholar]
  19. Kuzu, M.; Kantarcioglu, M.; Inan, A.; Bertino, E.; Durham, E.; Malin, B. Efficient Privacy-Aware Record Integration. In Proceedings of the 16th International Conference on Extending Database Technology, Genoa, Italy, 18–22 March 2013; pp. 167–178. [Google Scholar]
  20. Stammler, S.; Kussel, T.; Schoppmann, P.; Stampe, F.; Tremper, G.; Katzenbeisser, S.; Hamacher, K.; Lablans, M. Mainzelliste SecureEpiLinker (MainSEL): Privacy-preserving record linkage using secure multi-party computation. Bioinformatics 2022, 38, 1657–1668. [Google Scholar] [CrossRef]
  21. Kussel, T.; Brenner, T.; Tremper, G.; Schepers, J.; Lablans, M.; Hamacher, K. Record linkage based patient intersection cardinality for rare disease studies using Mainzelliste and secure multi-party computation. J. Transl. Med. 2022, 20, 458. [Google Scholar] [CrossRef]
  22. Dalal, H.M.; Doherty, P.; McDonagh, S.T.; Paul, K.; Taylor, R.S. Virtual and in-person cardiac rehabilitation. BMJ 2021, 373, n1270. [Google Scholar] [CrossRef]
  23. Vatsalan, D.; Christen, P.; Verykios, V. A taxonomy of privacy-preserving record linkage techniques. Inf. Syst. 2013, 38, 946–969. [Google Scholar] [CrossRef]
  24. Gkoulalas-Divanis, A.; Vatsalan, D.; Karapiperis, D.; Kantarcioglu, M. Modern Privacy-Preserving Record Linkage Techniques: An Overview. IEEE Trans. Inf. Forensics Secur. 2021, 16, 4966–4987. [Google Scholar] [CrossRef]
  25. Christen, P.; Ranbaduge, T.; Schnell, R. Linking Sensitive Data; Springer: New York, NY, USA, 2020. [Google Scholar]
  26. European Union Agency for Cybersecurity. Data Pseudonymisation: Advanced Techniques & Use Cases—Technical Analysis of Cybersecurity Measures in Data Protection and Privacy; European Union Agency for Cybersecurity: Attiki, Greece, 2021. [Google Scholar]
  27. European Union Agency for Cybersecurity. Deploying Pseudonymisation Techniques—The case of the Health Sector; European Union Agency for Cybersecurity: Attiki, Greece, 2022. [Google Scholar]
  28. European Union Agency for Cybersecurity. Pseudonymisation Techniques and Best Practices—Recommendations on Shaping Technology According to Data Protection and Privacy Provisions; European Union Agency for Cybersecurity: Attiki, Greece, 2019. [Google Scholar]
  29. Gatta, G.; Botta, L.; Rossi, S.; Aareleid, T.; Bielska-Lasota, M.; Clavel, J.; Dimitrova, N.; Jakab, Z.; Kaatsch, P.; Lacour, B.; et al. Childhood cancer survival in Europe 1999–2007: Results of EUROCARE-5—A population-based study. Lancet Oncol. 2014, 15, 35–47. [Google Scholar] [CrossRef] [PubMed]
  30. Nitzlnader, M.; Schreier, G. Patient identity management for secondary use of biomedical research data in a distributed computing environment. Stud. Health Technol. Inform. 2014, 198, 211–218. [Google Scholar] [PubMed]
  31. Ly, S.; Runacres, F.; Poon, P. Journey mapping as a novel approach to healthcare: A qualitative mixed methods study in palliative care. BMC Health Serv. Res. 2021, 21, 915. [Google Scholar] [CrossRef] [PubMed]
  32. Reihs, R.; Proynova, R.; Maqsood, S.; Ataian, M.; Lablans, M.; Quinlan, P.R.; Lawrence, E.; Bowman, E.; van Enckevort, E.; Bučík, D.F.; et al. BBMRI-ERIC Negotiator: Implementing Efficient Access to Biobanks. Biopreserv. Biobank. 2021, 19, 414–421. [Google Scholar] [CrossRef] [PubMed]
  33. Laurie, S.; Piscia, D.; Matalonga, L.; Corvó, A.; Fernández-Callejo, M.; Garcia-Linares, C.; Hernandez-Ferrer, C.; Luengo, C.; Martínez, I.; Papakonstantinou, A.; et al. The RD-Connect Genome-Phenome Analysis Platform: Accelerating diagnosis, research, and gene discovery for rare diseases. Hum. Mutat. 2022, 43, 717–733. [Google Scholar] [CrossRef] [PubMed]
  34. Hayn, D.; Falgenhauer, M.; Kropf, M.; Nitzlnader, M.; Welte, S.; Ebner, H.; Ladenstein, R.; Schleiermacher, G.; Hero, B.; Schreier, G. IT Infrastructure for Merging Data from Different Clinical Trials and Across Independent Research Networks. Stud. Health Technol. Inform. 2016, 228, 287–291. [Google Scholar] [PubMed]
  35. Mulder, R.L.; van der Pal, H.J.H.; Levitt, G.A.; Skinner, R.; Kremer, L.C.M.; Brown, M.C.; Bárdi, E.; Windsor, R.; Michel, G.; Frey, E. Transition guidelines: An important step in the future care for childhood cancer survivors. A comprehensive definition as groundwork. Eur. J. Cancer 2016, 54, 64–68. [Google Scholar] [CrossRef] [PubMed]
  36. Hjorth, L.; Haupt, R.; Skinner, R.; Grabow, D.; Byrne, J.; Karner, S.; Levitt, G.; Michel, G.; van der Pal, H.; Bárdi, E.; et al. Survivorship after childhood cancer: PanCare: A European Network to promote optimal long-term care. Eur. J. Cancer 2015, 51, 1203–1211. [Google Scholar] [CrossRef] [PubMed]
  37. Henson, K.E.; Brock, R.; Shand, B.; Coupland, V.H.; Elliss-Brookes, L.; Lyratzopoulos, G.; Godfrey, P.; Haigh, A.; Hunter, K.; McCabe, M.G.; et al. Cohort profile: Prescriptions dispensed in the community linked to the national cancer registry in England. BMJ Open 2018, 8, e020980. [Google Scholar] [CrossRef]
  38. Liang, W.H.; Federico, S.M.; London, W.B.; Naranjo, A.; Irwin, M.S.; Volchenboum, S.L.; Cohn, S.L. Tailoring Therapy for Children With Neuroblastoma on the Basis of Risk Group Classification: Past, Present, and Future. JCO Clin. Cancer Inform. 2020, 4, 895–905. [Google Scholar] [CrossRef]
  39. Ladenstein, R.; Pötschger, U.; Valteau-Couanet, D.; Luksch, R.; Castel, V.; Ash, S.; Laureys, G.; Brock, P.; Michon, J.M.; Owens, C.; et al. Investigation of the Role of Dinutuximab Beta-Based Immunotherapy in the SIOPEN High-Risk Neuroblastoma 1 Trial (HR-NBL1). Cancers 2020, 12, 309. [Google Scholar] [CrossRef] [PubMed]
  40. Baugh, J.; Bartels, U.; Leach, J.; Jones, B.; Chaney, B.; Warren, K.E.; Kirkendall, J.; Doughman, R.; Hawkins, C.; Miles, L.; et al. The international diffuse intrinsic pontine glioma registry: An infrastructure to accelerate collaborative research for an orphan disease. J. Neurooncol. 2017, 132, 323–331. [Google Scholar] [CrossRef] [PubMed]
  41. Kitlinski, M.; Giwercman, A.; Elenkov, A. Paternity through use of assisted reproduction technology in male adult and childhood cancer survivors: A nationwide register study. Hum. Reprod. 2023, 38, 973–981. [Google Scholar] [CrossRef] [PubMed]
  42. Youlden, D.R.; Baade, P.D.; Hallahan, A.R.; Valery, P.C.; Green, A.C.; Aitken, J.F. Conditional survival estimates for childhood cancer in Australia, 2002–2011: A population-based study. Cancer Epidemiol. 2015, 39, 394–400. [Google Scholar] [CrossRef] [PubMed]
  43. Oberaigner, W.; Siebert, U. Are survival rates for Tyrol published in the Eurocare studies biased? Acta Oncol. 2009, 48, 984–991. [Google Scholar] [CrossRef] [PubMed]
  44. Botta, L.; Gatta, G.; Didonè, F.; Lopez Cortes, A.; Pritchard-Jones, K.; Group, B.P.W. International benchmarking of childhood cancer survival by stage at diagnosis: The BENCHISTA project protocol. PLoS ONE 2022, 17, e0276997. [Google Scholar] [CrossRef]
  45. Loong, L.; Huntley, C.; McRonald, F.; Santaniello, F.; Pethick, J.; Torr, B.; Allen, S.; Tulloch, O.; Goel, S.; Shand, B.; et al. Germline mismatch repair (MMR) gene analyses from English NHS regional molecular genomics laboratories 1996–2020: Development of a national resource of patient-level genomics laboratory records. J. Med. Genet. 2023, 60, 669–678. [Google Scholar] [CrossRef] [PubMed]
  46. Walker, J.L.; Grint, D.J.; Strongman, H.; Eggo, R.M.; Peppa, M.; Minassian, C.; Mansfield, K.E.; Rentsch, C.T.; Douglas, I.J.; Mathur, R.; et al. UK prevalence of underlying conditions which increase the risk of severe COVID-19 disease: A point prevalence study using electronic health records. BMC Public Health 2021, 21, 484. [Google Scholar] [CrossRef]
  47. Paulson, K.G.; Gupta, D.; Kim, T.S.; Veatch, J.R.; Byrd, D.R.; Bhatia, S.; Wojcik, K.; Chapuis, A.G.; Thompson, J.A.; Madeleine, M.M.; et al. Age-Specific Incidence of Melanoma in the United States. JAMA Dermatol. 2020, 156, 57–64. [Google Scholar] [CrossRef]
  48. Sayeed, S.; Barnes, I.; Ali, R. Childhood cancer incidence by ethnic group in England, 2001–2007: A descriptive epidemiological study. BMC Cancer 2017, 17, 570. [Google Scholar] [CrossRef]
  49. Hammer, G.P.; Seidenbusch, M.C.; Schneider, K.; Regulla, D.; Zeeb, H.; Spix, C.; Blettner, M. Cancer incidence rate after diagnostic X-ray exposure in 1976–2003 among patients of a university children’s hospital. Rofo 2010, 182, 404–414. [Google Scholar] [CrossRef] [PubMed]
  50. Hargreave, M.; Mørch, L.S.; Andersen, K.K.; Winther, J.F.; Schmiegelow, K.; Kjaer, S.K. Maternal use of hormonal contraception and risk of childhood leukaemia: A nationwide, population-based cohort study. Lancet Oncol. 2018, 19, 1307–1314. [Google Scholar] [CrossRef] [PubMed]
  51. Waitman, L.R.; Song, X.; Walpitage, D.L.; Connolly, D.C.; Patel, L.P.; Liu, M.; Schroeder, M.C.; VanWormer, J.J.; Mosa, A.S.; Anye, E.T.; et al. Enhancing PCORnet Clinical Research Network data completeness by integrating multistate insurance claims with electronic health records in a cloud environment aligned with CMS security and privacy requirements. J. Am. Med. Inform. Assoc 2022, 29, 660–670. [Google Scholar] [CrossRef]
  52. Smoll, N.R.; Brady, Z.; Scurrah, K.J.; Lee, C.; Berrington de González, A.; Mathews, J.D. Computed tomography scan radiation and brain cancer incidence. Neuro Oncol. 2023, 25, 1368–1376. [Google Scholar] [CrossRef] [PubMed]
  53. Specht, I.O.; Huybrechts, I.; Frederiksen, P.; Steliarova-Foucher, E.; Chajes, V.; Heitmann, B.L. The influence of prenatal exposure to trans-fatty acids for development of childhood haematopoietic neoplasms (EnTrance): A natural societal experiment and a case-control study. Nutr. J. 2018, 17, 13. [Google Scholar] [CrossRef] [PubMed]
  54. Duke, J.; Wood, F.; Semmens, J.; Edgar, D.W.; Rea, S. Trends in hospital admissions for sunburn in Western Australia, 1988 to 2008. Asia Pac. J. Public Health 2013, 25, 102–109. [Google Scholar] [CrossRef] [PubMed]
  55. Hylin, H.; Thrane, H.; Pedersen, K.; Kristiansen, I.S.; Burger, E.A. The healthcare costs of treating human papillomavirus-related cancers in Norway. BMC Cancer 2019, 19, 426. [Google Scholar] [CrossRef] [PubMed]
  56. Mathoulin-Pélissier, S.; Pritchard-Jones, K. Evidence-based data and rare cancers: The need for a new methodological approach in research and investigation. Eur. J. Surg. Oncol. 2019, 45, 22–30. [Google Scholar] [CrossRef] [PubMed]
  57. Mushtaq, N.; Qureshi, B.M.; Javed, G.; Sheikh, N.A.; Bakhshi, S.K.; Laghari, A.A.; Enam, S.A.; Anwar, S.S.M.; Hilal, K.; Kabir, A.; et al. Capacity building for pediatric neuro-oncology in Pakistan- a project by my child matters program of Foundation S. Front. Oncol. 2024, 14, 1325167. [Google Scholar] [CrossRef]
  58. Coebergh, J.W.; van den Hurk, C.; Louwman, M.; Comber, H.; Rosso, S.; Zanetti, R.; Sacchetto, L.; Storm, H.; van Veen, E.B.; Siesling, S.; et al. EUROCOURSE recipe for cancer surveillance by visible population-based cancer RegisTrees in Europe: From roots to fruits. Eur. J. Cancer 2015, 51, 1050–1063. [Google Scholar] [CrossRef]
  59. Haupt, R.; Essiaf, S.; Dellacasa, C.; Ronckers, C.M.; Caruso, S.; Sugden, E.; Zadravec Zaletel, L.; Muraca, M.; Morsellino, V.; Kienesberger, A.; et al. The ‘Survivorship Passport’ for childhood cancer survivors. Eur. J. Cancer 2018, 102, 69–81. [Google Scholar] [CrossRef] [PubMed]
  60. Chronaki, C.; Charalambous, E.; Cangioli, G.; Schreier, G.; van den Oever, S.; van der Pal, H.; Kremer, L.; Uyttebroeck, A.; Van den Bosch, B.; Trinkunas, J.; et al. Factors Influencing Implementation of the Survivorship Passport: The IT Perspective. Stud. Health Technol. Inform. 2022, 293, 161–168. [Google Scholar] [PubMed]
  61. Martí-Bonmatí, L.; Alberich-Bayarri, Á.; Ladenstein, R.; Blanquer, I.; Segrelles, J.D.; Cerdá-Alberich, L.; Gkontra, P.; Hero, B.; García-Aznar, J.M.; Keim, D.; et al. PRIMAGE project: Predictive in silico multiscale analytics to support childhood cancer personalised evaluation empowered by imaging biomarkers. Eur. Radiol. Exp. 2020, 4, 22. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (a) Merging patients from contexts with identical variables mainly focuses on the identification of duplicate patient records in different contexts. (b) Merging different data types from multiple sources holding the same patients focuses on the identification of related patient records in different contexts.
Figure 1. (a) Merging patients from contexts with identical variables mainly focuses on the identification of duplicate patient records in different contexts. (b) Merging different data types from multiple sources holding the same patients focuses on the identification of related patient records in different contexts.
Cancers 16 02696 g001
Figure 2. Hypothetical patient journey and related data sources. In this example, a patient is diagnosed with a primary tumour that worsens during treatment in trial 1. Treatment in a second trial 2 is successful. The cancer survivor suffers from a relapse a certain time after healing, which can again be treated successfully. In this patient journey, we end up with nine different pseudonyms (PSNs) for one patient. Green: personalised data; yellow: pseudonymised data; blue: linked data.
Figure 2. Hypothetical patient journey and related data sources. In this example, a patient is diagnosed with a primary tumour that worsens during treatment in trial 1. Treatment in a second trial 2 is successful. The cancer survivor suffers from a relapse a certain time after healing, which can again be treated successfully. In this patient journey, we end up with nine different pseudonyms (PSNs) for one patient. Green: personalised data; yellow: pseudonymised data; blue: linked data.
Cancers 16 02696 g002
Figure 3. Overview of data residing in personalised routine care and different pseudonymised contexts, illustrating how data can be linked and how the linked data can be analysed to obtain new knowledge. On the right-hand side, the relation to the respective six key dimensions of record linkage is shown. Grey boxes reference chapters describing single-dimensional use cases.
Figure 3. Overview of data residing in personalised routine care and different pseudonymised contexts, illustrating how data can be linked and how the linked data can be analysed to obtain new knowledge. On the right-hand side, the relation to the respective six key dimensions of record linkage is shown. Grey boxes reference chapters describing single-dimensional use cases.
Cancers 16 02696 g003
Table 1. Subset of possible use cases per stakeholder requiring privacy-preserving record linkage identified as potentially useful in paediatric oncology (PO). The use cases are displayed in the EJP RD use a case specification format, i.e., “As a <stakeholder>, I would like to <research focus> so that I can <objective> and visualise the result as a <output format>”.
Table 1. Subset of possible use cases per stakeholder requiring privacy-preserving record linkage identified as potentially useful in paediatric oncology (PO). The use cases are displayed in the EJP RD use a case specification format, i.e., “As a <stakeholder>, I would like to <research focus> so that I can <objective> and visualise the result as a <output format>”.
Stakeholder
“As a...”
Research Focus
“...I Would Like to...”
Objective
“...so That I Can...”
Output Format
“...and Visualise the Results as...”
clinician
(corresponds to
PO patient
OR
parent of a PO patient)
know if there have been patients in the past in any source who were similar to my patient, and their outcomechoose the optimal therapy for my patienta table with key patient characteristics, data sources, and outcomes
discuss my patient’s case in a tumour board based on all the distributed data of my patientcomprehensive overview of my patient’s case including tables, images, etc.
compare the predicted survival rate of my patient, depending on the selection of treatment A or treatment Btwo survival probabilities as derived from an AI algorithm (preferably including explanations)
paediatric cancer survivorknow which sources hold any data or samples of minehave an overview of my data and samplesa table of data sources and types of data/samples they hold of mine
receive life-long suggestions concerning screenings, etc., based on all my own data and recent study resultsimprove my life expectancy, health status, and quality of lifesuggestions and summaries provided by a survivorship app
make sure that my data are used for further research in the most valuable waycontribute to the improvement of paediatric cancer treatment
researcher in the field of POcompare the outcome of treatment A with that of treatment B (taken from source A) in patients fulfilling a certain eligibility criterion (taken from source B)see if the type of treatment is correlated with the outcomea Kaplan–Meier-curve with two groups, for treatments A and B
compare the outcome of study A with that of study B in patients fulfilling certain criteriasee if one of the studies’ outcomes is superiora Kaplan–Meier curve with two groups, for studies A and B
know all follow-up results of one of the patients in my study, no matter within which context the follow-up was performeduse the most recent follow-up data for the analysis of my studya table of all follow-ups including dates, sources, and results
researcher in the field of biologyknow which biobanks have further samples on which I need to perform an additional experimentcontact these biobanks and ask for additional materiala table with the sample type and biobank including contact information
know specific results, as stored in genome–phenome analysis platforms, which were achieved with the probes from my biobankgain further insights into the properties of my samplesa table with platforms and results per probe in my biobank
know which sources contain patients with a specific biomarkercontact these sources to perform further research on that biomarkera table with the number of patients per source
future principal investigator of a specific research activityknow how many cases fulfilling specific eligibility criteria are present in at least two data sourcesestimate the number of cases for my studya number of overall cases
know which data sources contain cases fulfilling specific eligibility criteria (age, diagnosis, biomarker, treatment, etc.)get in contact with these sources to consider them in my studya table with cases per source
correlate variables in patients from various sources, fulfilling my eligibility criteria, with one anotheroptimise stratification in my studycorrelation between the variables
member of the SIOPE boardcompare the survival rate of all neuroblastoma patients in all sources in 2001–2010 to the rate in 2011–2020evaluate the improvements in neuroblastoma research in Europea boxplot
compare the rate of severe adverse events per member state during immunotherapy with a certain biological in all sourcesidentify differences across Europe and improve potential shortfallsa landscape, color-coded with the rate of severe adverse events
healthcare politicianknow whether a specific type of PO treatment is cost-effectiveoptimise PO treatment based on outcomes and coststable listing costs and outcomes of two different PO treatment options
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Hayn, D.; Kreiner, K.; Sandner, E.; Baumgartner, M.; Jammerbund, B.; Falgenhauer, M.; Düster, V.; Devi-Marulkar, P.; Schleiermacher, G.; Ladenstein, R.; et al. Use Cases Requiring Privacy-Preserving Record Linkage in Paediatric Oncology. Cancers 2024, 16, 2696. https://doi.org/10.3390/cancers16152696

AMA Style

Hayn D, Kreiner K, Sandner E, Baumgartner M, Jammerbund B, Falgenhauer M, Düster V, Devi-Marulkar P, Schleiermacher G, Ladenstein R, et al. Use Cases Requiring Privacy-Preserving Record Linkage in Paediatric Oncology. Cancers. 2024; 16(15):2696. https://doi.org/10.3390/cancers16152696

Chicago/Turabian Style

Hayn, Dieter, Karl Kreiner, Emanuel Sandner, Martin Baumgartner, Bernhard Jammerbund, Markus Falgenhauer, Vanessa Düster, Priyanka Devi-Marulkar, Gudrun Schleiermacher, Ruth Ladenstein, and et al. 2024. "Use Cases Requiring Privacy-Preserving Record Linkage in Paediatric Oncology" Cancers 16, no. 15: 2696. https://doi.org/10.3390/cancers16152696

APA Style

Hayn, D., Kreiner, K., Sandner, E., Baumgartner, M., Jammerbund, B., Falgenhauer, M., Düster, V., Devi-Marulkar, P., Schleiermacher, G., Ladenstein, R., & Schreier, G. (2024). Use Cases Requiring Privacy-Preserving Record Linkage in Paediatric Oncology. Cancers, 16(15), 2696. https://doi.org/10.3390/cancers16152696

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop