LizAI XT—AI-Accelerated Management Platform for Complex Healthcare Data at Scale, Beyond EMR/EHR and Dashboards
Abstract
1. Introduction
2. Materials and Methods
2.1. Database Preparation
2.2. Clinical Data Mega-Structure by LizAI XT
2.3. Accuracy Assessment of Structuring Clinical Variables
Standard Error of Accuracy
3. Results
3.1. Preparation of Clinically Relevant Database for LizAI XT Performance Evaluation
3.2. Clinical Data Mega-Structure by LizAI XT—A Case of Prostate Cancer
4. LizAI XT Performance Evaluation
4.1. Overall LizAI XT Performance Accuracy
4.2. Analysis of Outliers in Accuracy and Their Impacts on the Overall LizAI XT Performance
4.3. Speed of LizAI XT in Data Mega-Structure
5. Discussion and Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| EHR | Electronic Health Record |
| EMR | Electronic Medical Record |
| NLP | Natural Language Processing |
| LLM | Large Language Model |
| COPD | Chronic Obstructive Pulmonary Disease |
| HIPAA | Health Insurance Portability and Accountability Act |
| GDPR | General Data Protection Regulation |
| FHIR | Fast Healthcare Interoperability Resources |
| HL7 | Health Level Seven |
| DICOM | Digital Imaging and Communications in Medicine |
| LOINC | Logical Observation Identifiers Names and Codes |
| SNOMED-CT | Systematized Nomenclature of Medicine Clinical Terms |
| ICD-10 | International Classification of Diseases, 10th Revision |
| GPU | Graphics Processing Unit |
| AWS | Amazon Web Services |
| CDC | Centers for Disease Control and Prevention |
| NIH | National Institutes of Health |
References
- OpenAI. ChatGPT. 2025. Available online: https://chatgpt.com/ (accessed on 15 January 2025).
- Cloud, G. Vertex AI: Machine Learning on Google Cloud. 2023. Available online: https://cloud.google.com/vertex-ai (accessed on 19 February 2023).
- Services, A.W. AWS HealthLake: Transforming Healthcare Data with AI. 2023. Available online: https://aws.amazon.com/healthlake/ (accessed on 19 February 2023).
- Microsoft. Copilot. 2025. Available online: https://copilot.microsoft.com/ (accessed on 15 January 2025).
- Liu, X.; Faes, L.; Kale, A.U.; Wagner, S.K.; Fu, D.J.; Bruynseels, A.; Mahendiran, T.; Moraes, G.; Shamdas, M.; Kern, C.; et al. A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: A systematic review and meta-analysis. Lancet Digit. Health 2019, 1, e271–e297. [Google Scholar] [CrossRef] [PubMed]
- Kalra, S.; Tizhoosh, H.R.; Shah, S.; Choi, C.; Damaskinos, S.; Safarpoor, A.; Shafiei, S.; Babaie, M.; Diamandis, P.; Campbell, C.J.; et al. Pan-cancer diagnostic consensus through searching archival histopathology images using artificial intelligence. npj Digit. Med. 2020, 3, 31. [Google Scholar] [CrossRef] [PubMed]
- Rao, A.; Pang, M.; Kim, J.; Kamineni, M.; Li, W.; Prasad, A.K.; Landman, A.; Dreyer, K.J.; Succi, M.D. Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study. J. Med. Internet Res. 2023, 25, e48659. [Google Scholar] [CrossRef] [PubMed]
- University of North Carolina School of Medicine; IBM Watson Health. IBM’s Watson AI Recommends Same Treatment as Doctors in 99% of Cancer Cases. Futurism (29 November 2016) News Article. Dom Galeon (Reporter). Available online: https://futurism.com/ibms-watson-ai-recommends-same-treatment-as-doctors-in-99-of-cancer-cases (accessed on 15 January 2025).
- Weiskopf, N.G.; Weng, C. Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research. J. Am. Med. Inform. Assoc. 2013, 20, 144–151. [Google Scholar] [CrossRef]
- The Role of Data in AI. 2025. Available online: https://www.gpai.ai/projects/data-governance/role-of-data-in-ai.pdf (accessed on 15 January 2025).
- Zha, D.; Bhat, Z.P.; Lai, K.H.; Yang, F.; Hu, X. Data-centric ai: Perspectives and challenges. In Proceedings of the 2023 SIAM International Conference on Data Mining (SDM); SIAM: Philadelphia, PA, USA, 2023; pp. 945–948. [Google Scholar]
- Pirmani, A.; Oldenhof, M.; Peeters, L.M.; De Brouwer, E.; Moreau, Y. Accessible ecosystem for clinical research (federated learning for everyone): Development and usability study. JMIR Form. Res. 2024, 8, e55496. [Google Scholar] [CrossRef]
- Sande, D.v.d.; Genderen, M.E.v.; Huiskens, J.; Veen, R.E.R.; Meijerink, Y.; Gommers, D.; Bommel, J.v. Generating insights in uncharted territories: Real-time learning from data in critically ill patients—An implementer report. BMJ Health Care Inform. 2021, 28, e100447. [Google Scholar] [CrossRef]
- Sampson, R.; Shapiro, S.D.; He, W.; Denmark, S.; Kirchoff, K.; Hutson, K.; Paranal, R.; Forney, L.; McGhee, K.; Harvey, J. An integrated approach to improve clinical trial efficiency: Linking a clinical trial management system into the research integrated network of systems. J. Clin. Transl. Sci. 2022, 6, e63. [Google Scholar] [CrossRef]
- Kern, L.M.; Ringel, J.B.; Rajan, M.; Casalino, L.P.; Pesko, M.F.; Pinheiro, L.C.; Colantonio, L.D.; Safford, M.M. Ambulatory care fragmentation and total health care costs. Med. Care 2024, 62, 277–284. [Google Scholar] [CrossRef]
- Sainsbury, D.; Butterworth, S.; Fell, M.R.; Humphries, K.; Mehendale, F.V.; Richard, B. Towards breaking down cleft data silos to improve clinical research and patient outcomes. BMJ 2022, 378, o1799. [Google Scholar] [CrossRef]
- U.S. Department of Health and Human Services. Health Insurance Portability and Accountability Act of 1996 (HIPAA), Public Law 104–191. 1996. Available online: https://aspe.hhs.gov/reports/health-insurance-portability-accountability-act-1996 (accessed on 6 March 2025).
- General Data Protection Regulation. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46. Off. J. Eur. Union 2016, L 119, 1–88. [Google Scholar]
- Burnum, J.F. The misinformation era: The fall of the medical record. Ann. Intern. Med. 1989, 110, 482–484. [Google Scholar] [CrossRef]
- World Economic Forum. 4 Ways Data Is Improving Healthcare. 2025. Available online: https://www.weforum.org/ (accessed on 17 January 2025).
- Data Protection Commission. Data Protection Commission Launches Inquiry into Google AI Model. 2025. Available online: https://www.dataprotection.ie/en/news-media/press-releases/data-protection-commission-launches-inquiry-google-ai-model (accessed on 19 February 2025).
- Walonoski, J.; Kramer, M.; Nichols, J.; Quina, A.; Moesel, C.; Hall, D.; Duffett, C.; Blais, R.; Swain, A.; Clive, J.; et al. Synthea: An approach, method, and software mechanism for generating synthetic patients and the synthetic electronic health care record. J. Am. Med. Inform. Assoc. 2018, 25, 230–238. [Google Scholar] [CrossRef]
- Centers for Disease Control and Prevention. Centers for Disease Control and Prevention Website. 2025. Available online: https://www.cdc.gov/ (accessed on 6 March 2025).
- National Institutes of Health. National Institutes of Health Website. 2025. Available online: https://www.nih.gov/ (accessed on 6 March 2025).
- Regenstrief Institute. Logical Observation Identifiers Names and Codes (LOINC). 2025. Available online: https://loinc.org/ (accessed on 6 March 2025).
- SNOMED International. Systematized Nomenclature of Medicine—Clinical Terms (SNOMED CT). 2025. Available online: https://www.snomed.org/ (accessed on 6 March 2025).
- National Library of Medicine. RxNorm: Standardized Nomenclature for Clinical Drugs. 2025. Available online: https://www.nlm.nih.gov/research/umls/rxnorm/ (accessed on 6 March 2025).
- World Health Organization. International Classification of Diseases, 10th Revision (ICD-10). 2025. Available online: https://www.who.int/classifications/icd/en/ (accessed on 6 March 2025).
- Cloud, A. Qwen2.5: A Large Language Model. 2024. Available online: https://qwenlm.github.io/ (accessed on 7 March 2024).
- Tabei, Y.; Kiryu, H.; Kin, T.; Asai, K. A fast structural multiple alignment method for long RNA sequences. BMC Bioinform. 2008, 9, 33. [Google Scholar] [CrossRef] [PubMed]
- Kim, D.; Langmead, B.; Salzberg, S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Chem. Biol. 2015, 12, 357–360. [Google Scholar] [CrossRef] [PubMed]
- Shatsky, M.; Nussinov, R.; Wolfson, H.J. Optimization of multiple-sequence alignment based on multiple-structure alignment. Proteins Struct. Funct. Bioinform. 2005, 60, 751–767. [Google Scholar] [CrossRef]
- Kehr, B.; Trappe, K.; Holtgrewe, M.; Reinert, K. Genome alignment with graph data structures: A comparison. BMC Bioinform. 2014, 15, 99. [Google Scholar] [CrossRef]
- Chen, Y.; Hong, J.; Cui, W.; Zaneveld, J.; Wang, W.; Gibbs, R.; Xiao, Y.; Chen, R. CGAP-Align: A High Performance DNA Short Read Alignment Tool. PLoS ONE 2013, 8, e61033. [Google Scholar] [CrossRef]
- Menke, M.; Berger, B.; Cowen, L. Matt: Local Flexibility Aids Protein Multiple Structure Alignment. PLoS Comput. Biol. 2008, 4, e1000100. [Google Scholar] [CrossRef]
- Gosset, W.S. The probable error of a mean. Biometrika 1908, 6, 1–25. [Google Scholar] [CrossRef]
- Mann, H.B.; Whitney, D.R. On a Test of Whether One of Two Random Variables is Stochastically Larger than the Other. Ann. Math. Stat. 1947, 18, 50–60. [Google Scholar] [CrossRef]
- Efron, B. Bootstrap Methods: Another Look at the Jackknife. Ann. Stat. 1979, 7, 1–26. [Google Scholar] [CrossRef]
- Cohen, J. Statistical Power Analysis for the Behavioral Sciences, 2nd ed.; Routledge: New York, NY, USA, 1988. [Google Scholar]
- Reddy, S.; Rogers, W.; Makinen, V.P.; Coiera, E.; Brown, P.; Wenzel, M.; Weicken, E.; Ansari, S.; Mathur, P.; Casey, A.; et al. Evaluation framework to guide implementation of AI systems into healthcare settings. BMJ Health Care Inform. 2021, 28, e100444. [Google Scholar] [CrossRef] [PubMed]
- Gilvaz, A.; Abraham, G.M.; Radhakrishnan, S.; Hasmath, Z. From Admission to Discharge, How Artificial Intelligence Could Optimize Patient Care: A Brief Review. Am. J. Hosp. Med. 2019, 3, 2019.016. [Google Scholar] [CrossRef]
- Khan, M.; Zhao, X. Drawbacks of Artificial Intelligence and Their Potential Solutions in the Healthcare Sector. Biomed. Mater. Devices 2023, 1, 731–738. [Google Scholar] [CrossRef]
- Ogaga, M.; Zhao, X. The Rise of Artificial Intelligence and Machine Learning in HealthCare Industry. Int. J. Res. Innov. Appl. Sci. 2023, VIII, 250–253. [Google Scholar] [CrossRef]
- Ramakrishnaiah, Y.; Macesic, N.; Webb, G.; Peleg, A.; Tyagi, S. EHR-ML: A generalisable pipeline for reproducible clinical outcomes using electronic health records. medRxiv 2024. [Google Scholar] [CrossRef]
- Alanazi, A. Clinicians’ views on using artificial intelligence in healthcare: Opportunities, challenges, and beyond. Cureus 2023, 15, e45255. [Google Scholar] [CrossRef]
- Deliberato, R.O.; Celi, L.A.; Stone, D.J. Clinical note creation, binning, and artificial intelligence. JMIR Med. Inform. 2017, 5, e7627. [Google Scholar] [CrossRef]
- Ronquillo, C.E.; Mitchell, J.; Alhuwail, D.; Peltonen, L.M.; Topaz, M.; Block, L.J. The Untapped Potential of Nursing and Allied Health Data for Improved Representation of Social Determinants of Health and Intersectionality in Artificial Intelligence Applications: A Rapid Review. Yearb. Med. Inform. 2022, 31, 094–099. [Google Scholar] [CrossRef]







| Disease | Short Description | Number of Patients | Number of Medical Files | Number of Clinical Variables |
|---|---|---|---|---|
| Colorectal Cancer | Cancer affecting the colon or rectum. | 1000 | 5317 | 105 |
| Prostate Cancer | A common male cancer in the prostate gland. | 1000 | 22,781 | 50 |
| Contraceptives | Medications or devices used for birth control. | 1000 | 5718 | 67 |
| Female Reproduction | Conditions related to women’s reproductive health. | 1000 | 5102 | 25 |
| Gout | Arthritic condition caused by uric acid crystal buildup in joints. | 1000 | 1492 | 41 |
| Attention Deficit Disorder (ADD) | Neurodevelopmental disorder affecting focus and impulse control. | 1000 | 5549 | 41 |
| Epilepsy | Neurological disorder causing recurrent seizures. | 1000 | 6279 | 36 |
| COPD | Progressive lung disease causing breathing difficulties. | 1000 | 5327 | 75 |
| Asthma | Chronic condition causing airway inflammation and difficulty breathing. | 1000 | 8360 | 67 |
| Allergic Rhinitis | Inflammation of nasal passages due to allergens. | 1000 | 5397 | 42 |
| Bronchitis | Inflammation of bronchial tubes, leading to coughing and mucus production. | 1000 | 11,991 | 51 |
| Dermatitis | Inflammation of the skin causing redness and itching. | 1000 | 5229 | 42 |
| Atopy | Genetic tendency to develop allergic conditions. | 1000 | 4996 | 25 |
| Food Allergies | Immune response triggered by certain foods. | 1000 | 5480 | 35 |
| Appendicitis | Inflammation of the appendix, often requiring surgery. | 1000 | 5322 | 43 |
| Ear Infections | Infections of the middle ear, causing pain and fluid buildup. | 1000 | 8371 | 36 |
| Total | 16,000 | 112,711 | 781 |
| Variables Categories | Descriptions | Examples |
|---|---|---|
| Immunizations | Administered vaccines of any kinds. | DTaP, Influenza Vaccine |
| Codes (medical) | Medical encounter and/or procedure identifiers. | Encounter for Check-Up, Death Certification |
| Names (medical) | Titles of medical encounters. | Chemotherapy Encounter, Routine Colonoscopy Encounter |
| Medications (treatments) | Prescribed drugs or treatments. | Oxaliplatin Injection, Leucovorin Injection |
| Symptoms | Reported health complaints. | Abdominal Pain, Fatigue |
| Conditions | Diagnosed diseases and/or disorders. | Anemia (Disorder), Malignant Tumor of Colon |
| Observations | Recorded health measurements. | Hemoglobin Level, Pain Severity Score |
| Care plans | Structured treatment or health plans. | Cancer Care Plan, Healthy Diet |
| Procedures | Medical interventions or diagnostics. | Colonoscopy, Biopsy of Colon |
| Devices (methods) | Medical equipment for patient use. | Oxygen Concentrator (Physical Object), Wheelchair Accessory (Physical Object) |
| Anonymized ID | iPSA | ISUP Score in Biopsy Specimen | Date of Biopsy | Imaging for Primary Staging | ADT Duration | Other Systemic Therapy Primary Treatment | Radiation Prostate | Number of Pelvic Lymph Nodes in Imaging | Type of Local Salvage Treatment |
|---|---|---|---|---|---|---|---|---|---|
| P_844 | 44 ng/mL | 8 (3 + 5) | 2017-07-31 | PSMA-PET/CT or PET/MR | None | Enzalutamide | None | 5 | Radiotherapy of the thoracic segment of the spinal column |
| P_33 | 30 ng/mL | 10 (5 + 5) | 2015-12-11 | MRI of the pelvis–prostate | None | None | Yes | None | HDR |
| P_272 | 8.8 ng/mL | 4 | 2020-05-16 | MRI of the pelvis–prostate | 9 months | Enzalutamide | Yes | None | None |
| P_229 | 42 ng/mL | 5 | unknown | PET/CT scan | 5 months | Enzalutamide | Yes | None | Conventional fractionation IMRT combined with HDR |
| P_478 | 47 ng/mL | 7 (3 + 4) | unknown | PET/CT imaging | None | None | Yes | 2 | SBRT plus HDR |
| P_32 | 18 ng/mL | 3 | 2019-09-27 | PET/CT | 9 months | Enzalutamide | Yes | None | None |
| P_441 | 38 ng/mL | 3 | unknown | MRI of the pelvis–prostate | None | None | Yes | None | SBRT plus HDR for 2 months |
| P_221 | 8.8 ng/mL | None | 02.11.2022 | MRI of the pelvis–prostate | 4 years | None | Yes | None | Brachytherapy |
| P_844 | 44 ng/mL | 8 (3 + 5) | 2017-07-31 | PSMA-PET/CT or PET/MR imaging | None | Enzalutamide | None | 5 | Radiotherapy of the thoracic segment of the spinal column |
| P_33 | 30 ng/mL | 10 (5 + 5) | 2015-12-11 | MRI of the pelvis–prostate | None | None | Yes | None | HDR |
| P_227 | 28 ng/mL | None | 2017-02-15 | PET/CT scan | None | None | Yes | 2 | IMRT (Intensity-Modulated Radiation Therapy) |
| P_424 | 41 ng/mL | None | 2002-06-22 | MRI of the pelvis–prostate | None | Enzalutamide | None | None | STRING: Brachytherapy |
| P_673 | 8.8 ng/mL | None | 2003-09-09 | PSMA-PET/CT or PET/MR | None | None | Yes | None | Brachytherapy monotherapy |
| P_272 | 8.8 ng/mL | 4 | 2020-05-16 | MRI of the pelvis–prostate | 9 months | Enzalutamide | Yes | None | None |
| P_229 | 42 ng/mL | 5 | unknown | PET/CT scan | 5 months | Enzalutamide | Yes | None | Conventional fractionation IMRT combined with HDR |
| P_478 | 47 ng/mL | 7 (3 + 4) | unknown | PET/CT imaging | None | None | Yes | 2 | SBRT plus HDR |
| P_32 | 18 ng/mL | 3 | 2019-09-27 | PET/CT | 9 months | Enzalutamide | Yes | None | None |
| P_441 | 38 ng/mL | 3 | unknown | MRI of the pelvis–prostate | None | None | Yes | None | SBRT plus HDR for 2 months |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Nguyen, T.T.; Elmaleh, D.R. LizAI XT—AI-Accelerated Management Platform for Complex Healthcare Data at Scale, Beyond EMR/EHR and Dashboards. Big Data Cogn. Comput. 2025, 9, 275. https://doi.org/10.3390/bdcc9110275
Nguyen TT, Elmaleh DR. LizAI XT—AI-Accelerated Management Platform for Complex Healthcare Data at Scale, Beyond EMR/EHR and Dashboards. Big Data and Cognitive Computing. 2025; 9(11):275. https://doi.org/10.3390/bdcc9110275
Chicago/Turabian StyleNguyen, Trung Tin, and David Raphael Elmaleh. 2025. "LizAI XT—AI-Accelerated Management Platform for Complex Healthcare Data at Scale, Beyond EMR/EHR and Dashboards" Big Data and Cognitive Computing 9, no. 11: 275. https://doi.org/10.3390/bdcc9110275
APA StyleNguyen, T. T., & Elmaleh, D. R. (2025). LizAI XT—AI-Accelerated Management Platform for Complex Healthcare Data at Scale, Beyond EMR/EHR and Dashboards. Big Data and Cognitive Computing, 9(11), 275. https://doi.org/10.3390/bdcc9110275

