From Sensing to Sense-Making: A Framework for On-Person Intelligence with Wearable Biosensors and Edge LLMs
Highlights
- A key bottleneck in real-world wearable sensing is the transformation of noisy physiological signals into actionable decisions; a cognitive co-pilot architecture is proposed linking sensing, probabilistic state estimation, LLM-based contextual reasoning, and attention-aware intervention.
- Local, uncertainty-aware, edge-deployed reasoning is a necessary architectural condition for trustworthy decision support in high-stakes environments, and several research gaps exist to achieve this goal.
- Effective wearable-AI systems will require integrated sociotechnical design combining sensor validation, uncertainty-calibrated inference, grounded LLM reasoning, and cognitive-engineering-driven interface policies.
- Beyond model accuracy, progress should be evaluated in downstream human outcomes (trust calibration, workload, decision quality, long-term reliance), reframing success as improved human–system integration.
Abstract
1. Introduction
2. Edge Computing as an Enabler
2.1. Layer 1: Multimodal Sensing
2.2. Layer 2: Probabilistic State Estimates
- Fusion as Estimation. Feature concatenation can fail silently when one channel is corrupted. Fusion should be treated as a state estimation problem: combine multiple noisy observations into a posterior belief over latent states. Classical filters (e.g., Kalman variants, particle filters) and modern neural Bayesian approximations can serve this role; the critical design point is to produce a distribution (or at least calibrated confidence) in addition to a discrete label [49,50].
- Calibration and Abstention. A classifier that always outputs a label is dangerous in high-stakes contexts. Layer 2 should support abstention (i.e., insufficient confidence) and graceful degradation (i.e., automated fallback to simpler models when signals are poor) [51,52]. Providing users with uncertainty information can support calibrated trust in a system and organization.
- Continual Improvement. Federated learning provides a principled mechanism to update models across multiple devices while keeping training data local, aggregating parameter updates [53,54]. In practice, federated pipelines must still handle adversarial updates, heterogeneity, and auditability; but as an architectural concept, they align naturally with occupational privacy constraints.
2.3. Layer 3: From States to Insights
2.4. Layer 4: From Insight to Action
2.5. A Formal Reference Architecture
3. On-Person Cognitive Co-Pilots: A Framework
4. Research Agenda and Future Directions
4.1. Transitioning Research from Lab to Field
4.2. Illustrative Edge Deployment Scenario
4.3. Illustrative End-to-End Walkthroughs
4.4. Practical Limitations and Deployment Constraints
5. Governance, Privacy, and Autonomy
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| AWQ | Activation-Aware Weight Quantization |
| CNNs | Convolutional Neural Networks |
| EDA | Electrodermal Activity |
| EEG | Electroencephalography |
| EMG | Electromyography |
| GRUs | Gated Recurrent Units |
| GPTQ | Generative Pre-trained Transformer Quantization |
| HR | Heart Rate |
| HRV | Heart Rate Variability |
| ICE | Isolated, Confined, and Extreme (environments) |
| IMU | Inertial Measurement Unit |
| LLM | Large Language Model |
| LSTMs | Long Short-Term Memory networks |
| NIST | National Institute of Standards and Technology |
| PPG | Photoplethysmography |
| RAG | Retrieval-Augmented Generation |
| RMF | Risk Management Framework |
| SOPs | Standard Operating Procedures |
| SpO2 | Peripheral Capillary Oxygen Saturation |
| TEEs | Trusted Execution Environments |
| TinyML | Tiny Machine Learning |
References
- Macias Alonso, A.K.; Hirt, J.; Woelfle, T.; Janiaud, P.; Hemkens, L.G. Definitions of Digital Biomarkers: A Systematic Mapping of the Biomedical Literature. BMJ Health Care Inform. 2024, 31, e100914. [Google Scholar] [CrossRef]
- Neef, D. Digital Exhaust: What Everyone Should Know about Big Data, Digitization and Digitally Driven Innovation; Pearson Education: London, UK, 2015; ISBN 978-0-13-383796-4. [Google Scholar]
- Schütz, N.; Knobel, S.E.J.; Botros, A.; Single, M.; Pais, B.; Santschi, V.; Gatica-Perez, D.; Buluschek, P.; Urwyler, P.; Gerber, S.M.; et al. A Systems Approach towards Remote Health-Monitoring in Older Adults: Introducing a Zero-Interaction Digital Exhaust. NPJ Digit. Med. 2022, 5, 116. [Google Scholar] [CrossRef]
- Lane, N.D.; Miluzzo, E.; Lu, H.; Peebles, D.; Choudhury, T.; Campbell, A.T. A Survey of Mobile Phone Sensing. IEEE Commun. Mag. 2010, 48, 140–150. [Google Scholar] [CrossRef]
- Parasuraman, R.; Sheridan, T.B.; Wickens, C.D. A Model for Types and Levels of Human Interaction with Automation. IEEE Trans. Syst. Man. Cybern. Part A Syst. Hum. 2000, 30, 286–297. [Google Scholar] [CrossRef]
- Brunyé, T.T.; Yau, K.; Okano, K.; Elliott, G.; Olenich, S.; Giles, G.E.; Navarro, E.; Elkin-Frankston, S.; Young, A.L.; Miller, E.L. Toward Predicting Human Performance Outcomes from Wearable Technologies: A Computational Modeling Approach. Front. Physiol. 2021, 12, 738973. [Google Scholar] [CrossRef]
- Trivedi, A.R.; Tayebati, S.; Kumawat, H.; Darabi, N.; Kumar, D.; Kosta, A.K.; Venkatesha, Y.; Jayasuriya, D.; Jayasinghe, N.; Panda, P.; et al. Intelligent Sensing-to-Action for Robust Autonomy at the Edge: Opportunities and Challenges. In Proceedings of the 2025 Design, Automation & Test in Europe Conference (DATE), Verona, Italy, 20–22 April 2026; pp. 1–10. [Google Scholar]
- Raymond, D.A.; Kumar, P.; Goureshettiwar, P. Intergration of Wearable Biosensors and Data Analytics for Remote Health Monitoring. In Proceedings of the 2024 Second International Conference on Intelligent Cyber Physical Systems and Internet of Things (ICoICI), Coimbatore, India, 28–30 August 2024; pp. 707–713. [Google Scholar]
- Endsley, M.R. Toward a Theory of Situation Awareness in Dynamic Systems. In Situational Awareness; Routledge: Abingdon, UK, 2011. [Google Scholar]
- Norman, D. Cognitive Engineering. In User Centered System Design; CRC Press: Boca Raton, FL, USA, 1986. [Google Scholar]
- Wickens, C.D. Multiple Resources and Mental Workload. Hum. Factors 2008, 50, 449–455. [Google Scholar] [CrossRef] [PubMed]
- Lee, J.D.; Kirlik, A.; Dainoff, M.J. The Oxford Handbook of Cognitive Engineering; OUP: New York, NY, USA, 2013; ISBN 978-0-19-975718-3. [Google Scholar]
- Miller, T. Explanation in Artificial Intelligence: Insights from the Social Sciences. Artif. Intell. 2019, 267, 1–38. [Google Scholar] [CrossRef]
- Amershi, S.; Weld, D.; Vorvoreanu, M.; Fourney, A.; Nushi, B.; Collisson, P.; Suh, J.; Iqbal, S.; Bennett, P.N.; Inkpen, K.; et al. Guidelines for Human-AI Interaction. In CHI ’19: Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2019; pp. 1–13. [Google Scholar]
- Ma, S.; Chen, Q.; Wang, X.; Zheng, C.; Peng, Z.; Yin, M.; Ma, X. Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making. In CHI ’25: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems; Association for Computing Machinery: New York, NY, USA, 2025; pp. 1–23. [Google Scholar]
- Cai, G.; Tian, R.; Yang, L.; Jia, Y.; Li, L.; Wang, J. Efficient Inference for Edge Large Language Models: A Survey. Tsinghua Sci. Technol. 2026, 31, 1365–1380. [Google Scholar] [CrossRef]
- Husom, E.J.; Goknil, A.; Astekin, M.; Shar, L.K.; KÃ¥sen, A.; Sen, S.; Mithassel, B.A.; Soylu, A. Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency. ACM Trans. Internet Things 2025, 6, 28. [Google Scholar] [CrossRef]
- Xianyong, S.; Mishra, D.K.; Özcan, Ö.Ö.; Karahan, M.; Panpan, T.; Zhongshan, X.; Vijayaraj Kumar, P. Wearable Smart Sensors Integration with AI and Machine Learning for Tracking Human Health. Biosens. Bioelectron. X 2025, 27, 100711. [Google Scholar] [CrossRef]
- Satyanarayanan, M. The Emergence of Edge Computing. Computer 2017, 50, 30–39. [Google Scholar] [CrossRef]
- Shi, W.; Cao, J.; Zhang, Q.; Li, Y.; Xu, L. Edge Computing: Vision and Challenges. IEEE Internet Things J. 2016, 3, 637–646. [Google Scholar] [CrossRef]
- Chan, K.S.; Johnsen, F.T. Cybersecurity in Tactical Edge Networks. IEEE Secur. Priv. 2025, 23, 10–20. [Google Scholar] [CrossRef]
- He, P.; Wang, Y.; Zheng, G.; Zhou, H. Design and Implementation of an Edge Computing-Based Underground IoT Monitoring System. Mining 2025, 5, 54. [Google Scholar] [CrossRef]
- Haliassos, A.; Kasvis, D.; Karathanos, S. The Challenges of Data Privacy and Cybersecurity in Cloud Computing and Artificial Intelligence (AI) Applications for EQA Organizations. EJIFCC 2025, 36, 599–604. [Google Scholar]
- Warden, P.; Situnayake, D. TinyML: Machine Learning with TensorFlow Lite on Arduino and Ultra-Low-Power Microcontrollers; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2019; ISBN 978-1-4920-5201-2. [Google Scholar]
- Touvron, H.; Lavril, T.; Izacard, G.; Martinet, X.; Lachaux, M.-A.; Lacroix, T.; Rozière, B.; Goyal, N.; Hambro, E.; Azhar, F.; et al. LLaMA: Open and Efficient Foundation Language Models. arXiv 2023, arXiv:2302.13971. [Google Scholar] [CrossRef]
- Dettmers, T.; Lewis, M.; Belkada, Y.; Zettlemoyer, L. GPT3.Int8(): 8-Bit Matrix Multiplication for Transformers at Scale. Adv. Neural Inf. Process. Syst. 2022, 35, 30318–30332. [Google Scholar]
- Clifford, G.D.; Behar, J.; Li, Q.; Rezek, I. Signal Quality Indices and Data Fusion for Determining Clinical Acceptability of Electrocardiograms. Physiol. Meas. 2012, 33, 1419. [Google Scholar] [CrossRef]
- Taskasaplidis, G.; Fotiadis, D.A.; Bamidis, P.D. Review of Stress Detection Methods Using Wearable Sensors. IEEE Access 2024, 12, 38219–38246. [Google Scholar] [CrossRef]
- Allen, J. Photoplethysmography and Its Application in Clinical Physiological Measurement. Physiol. Meas. 2007, 28, R1. [Google Scholar] [CrossRef]
- Zhang, Y.; Song, S.; Vullings, R.; Biswas, D.; Simões-Capela, N.; van Helleputte, N.; van Hoof, C.; Groenendaal, W. Motion Artifact Reduction for Wrist-Worn Photoplethysmograph Sensors Based on Different Wavelengths. Sensors 2019, 19, 673. [Google Scholar] [CrossRef]
- Boucsein, W. Electrodermal Activity; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; ISBN 978-1-4614-1126-0. [Google Scholar]
- Heikenfeld, J.; Jajack, A.; Rogers, J.; Gutruf, P.; Tian, L.; Pan, T.; Li, R.; Khine, M.; Kim, J.; Wang, J.; et al. Wearable Sensors: Modalities, Challenges, and Prospects. Lab. A Chip 2018, 18, 217–248. [Google Scholar] [CrossRef] [PubMed]
- Urigüen, J.A.; Garcia-Zapirain, B. EEG Artifact Removal—State-of-the-Art and Guidelines. J. Neural Eng. 2015, 12, 031001. [Google Scholar] [CrossRef]
- Lara, O.D.; Labrador, M.A. A Survey on Human Activity Recognition Using Wearable Sensors. IEEE Commun. Surv. Tutor. 2013, 15, 1192–1209. [Google Scholar] [CrossRef]
- Bulling, A.; Blanke, U.; Schiele, B. A Tutorial on Human Activity Recognition Using Body-Worn Inertial Sensors. ACM Comput. Surv. 2014, 46, 33. [Google Scholar] [CrossRef]
- Chung, S.; Lim, J.; Noh, K.J.; Kim, G.; Jeong, H. Sensor Data Acquisition and Multimodal Sensor Fusion for Human Activity Recognition Using Deep Learning. Sensors 2019, 19, 1716. [Google Scholar] [CrossRef]
- Mahato, K.; Saha, T.; Ding, S.; Sandhu, S.S.; Chang, A.-Y.; Wang, J. Hybrid Multimodal Wearable Sensors for Comprehensive Health Monitoring. Nat. Electron. 2024, 7, 735–750. [Google Scholar] [CrossRef]
- Stojchevska, M.; Steenwinckel, B.; Van Der Donckt, J.; De Brouwer, M.; Goris, A.; De Turck, F.; Van Hoecke, S.; Ongenae, F. Assessing the Added Value of Context during Stress Detection from Wearable Data. BMC Med. Inf. Decis. Mak. 2022, 22, 268. [Google Scholar] [CrossRef] [PubMed]
- Stoppa, M.; Chiolerio, A. Wearable Electronics and Smart Textiles: A Critical Review. Sensors 2014, 14, 11957–11992. [Google Scholar] [CrossRef]
- Yang, K.; McErlain-Naylor, S.A.; Isaia, B.; Callaway, A.; Beeby, S. E-Textiles for Sports and Fitness Sensing: Current State, Challenges, and Future Opportunities. Sensors 2024, 24, 1058. [Google Scholar] [CrossRef]
- Azeem, M.; Shahid, M.; Masin, I.; Petru, M. Design and Development of Textile-Based Wearable Sensors for Real-Time Biomedical Monitoring; a Review. J. Text. Inst. 2025, 116, 80–95. [Google Scholar] [CrossRef]
- Davis, N.; Heikenfeld, J.; Milla, C.; Javey, A. The Challenges and Promise of Sweat Sensing. Nat. Biotechnol. 2024, 42, 860–871. [Google Scholar] [CrossRef]
- Demirci Uzun, S. Machine Learning-Based Prediction and Interpretation of Electrochemical Biosensor Responses: A Comprehensive Framework. Microchem. J. 2025, 218, 115656. [Google Scholar] [CrossRef]
- Giorgi, G.; Tonello, S. Wearable Biosensor Standardization: How to Make Them Smarter. Standards 2022, 2, 366–384. [Google Scholar] [CrossRef]
- Subbaswamy, A.; Saria, S. From Development to Deployment: Dataset Shift, Causality, and Shift-Stable Models in Health AI. Biostatistics 2020, 21, 345–352. [Google Scholar] [CrossRef]
- Clifford, G.D.; Liu, C.; Moody, B.; Lehman, L.H.; Silva, I.; Li, Q.; Johnson, A.E.; Mark, R.G. AF Classification from a Short Single Lead ECG Recording: The PhysioNet/Computing in Cardiology Challenge 2017. In Proceedings of the 2017 Computing in Cardiology (CinC), Rennes, France, 24–27 September 2017; pp. 1–4. [Google Scholar]
- Ordóñez, F.J.; Roggen, D. Deep Convolutional and LSTM Recurrent Neural Networks for Multimodal Wearable Activity Recognition. Sensors 2016, 16, 115. [Google Scholar] [CrossRef] [PubMed]
- Hammerla, N.Y.; Halloran, S.; Ploetz, T. Deep, Convolutional, and Recurrent Models for Human Activity Recognition Using Wearables. arXiv 2016, arXiv:1604.08880. [Google Scholar] [CrossRef]
- Kalman, R.E. A New Approach to Linear Filtering and Prediction Problems. J. Basic. Eng. 1960, 82, 35–45. [Google Scholar] [CrossRef]
- Lampinen, J.; Vehtari, A. Bayesian Approach for Neural Networks—Review and Case Studies. Neural Netw. 2001, 14, 257–274. [Google Scholar] [CrossRef] [PubMed]
- Guo, C.; Pleiss, G.; Sun, Y.; Weinberger, K.Q. On Calibration of Modern Neural Networks. In Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 1321–1330. [Google Scholar]
- Begoli, E.; Bhattacharya, T.; Kusnezov, D. The Need for Uncertainty Quantification in Machine-Assisted Medical Decision Making. Nat. Mach. Intell. 2019, 1, 20–23. [Google Scholar] [CrossRef]
- McMahan, B.; Moore, E.; Ramage, D.; Hampson, S.; Arcas, B.A. y Communication-Efficient Learning of Deep Networks from Decentralized Data. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA, 20–22 April 2017; pp. 1273–1282. [Google Scholar]
- Kairouz, P.; McMahan, H.B. Advances and Open Problems in Federated Learning. Found. Trends Mach. Learn. 2021, 14, 1–210. [Google Scholar] [CrossRef]
- Subbaswamy, A.; Adams, R.; Saria, S. Evaluating Model Robustness and Stability to Dataset Shift. In Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, Virtual, 13–15 April 2021. [Google Scholar]
- Sun, Y.; Li, S.; Sun, W.; Jin, T.; Guo, M.; Fu, M.; Liu, D.; Huang, X. A Lightweight Uncertainty Modeling Approach for Wearable Sensor Signals Based on Sample Overlap Estimation. Measurement 2026, 262, 120051. [Google Scholar] [CrossRef]
- Saria, S. Individualized Sepsis Treatment Using Reinforcement Learning. Nat. Med. 2018, 24, 1641–1642. [Google Scholar] [CrossRef]
- Rudin, C.; Chen, C.; Chen, Z.; Huang, H.; Semenova, L.; Zhong, C. Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges. Stat. Surv. 2022, 16, 1–85. [Google Scholar] [CrossRef]
- Bommasani, R.; Hudson, D.A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M.S.; Bohg, J.; Bosselut, A.; Brunskill, E.; et al. On the Opportunities and Risks of Foundation Models. arXiv 2022, arXiv:2108.07258. [Google Scholar]
- Patil, A.; Jadon, A. Advancing Reasoning in Large Language Models: Promising Methods and Approaches. arXiv 2025, arXiv:2502.03671. [Google Scholar] [CrossRef]
- Zollner, D.; Vasiliev, R.; Castellano, B.; Molnar, G.; Janssen, P. A Technical Examination to Explore Conditional Multimodal Contextual Synthesis in Large Language Models. Res. Sq. 2024. [Google Scholar] [CrossRef]
- Barberini, S.; Everleigh, D.; Aldershaw, C.; Highcastle, A.; Bartholomew, F. Architecting Contextual Gradient Synthesis for Knowledge Representation in Large Language Models. TechRxiv 2024. [Google Scholar] [CrossRef]
- Huang, W.; Abbeel, P.; Pathak, D.; Mordatch, I. Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents. In Proceedings of the 39th International Conference on Machine Learning, Baltimore, MD, USA, 17–23 July 2022; pp. 9118–9147. [Google Scholar]
- Yang, Q.; Farseev, A.; Ongpin, M.; Huang, A.; Chu-Farseeva, Y.-Y.; You, D.-M.; Lepikhin, K.; Nikolenko, S. Fusing Predictive and Large Language Models for Actionable Recommendations in Creative Marketing. ACM Trans. Inf. Syst. 2025, 43, 116. [Google Scholar] [CrossRef]
- ElShawi, R.; Sherif, Y.; Al-Mallah, M.; Sakr, S. Interpretability in Healthcare: A Comparative Study of Local Machine Learning Interpretability Techniques. Comput. Intell. 2021, 37, 1633–1650. [Google Scholar] [CrossRef]
- Zhang, L.; Chen, Z. Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems. Energy Build. 2024, 313, 114278. [Google Scholar] [CrossRef]
- Lyu, D.; Wang, X.; Chen, Y.; Wang, F. Language Model and Its Interpretability in Biomedicine: A Scoping Review. iScience 2024, 27, 109334. [Google Scholar] [CrossRef]
- Parisineni, S.R.A.; Pal, M. Enhancing Trust and Interpretability of Complex Machine Learning Models Using Local Interpretable Model Agnostic Shap Explanations. Int. J. Data Sci. Anal. 2024, 18, 457–466. [Google Scholar] [CrossRef]
- Bai, Z.; Wang, P.; Xiao, T.; He, T.; Han, Z.; Zhang, Z.; Shou, M.Z. Hallucination of Multimodal Large Language Models: A Survey. arXiv 2025, arXiv:2404.18930. [Google Scholar]
- Huang, L.; Yu, W.; Ma, W.; Zhong, W.; Feng, Z.; Wang, H.; Chen, Q.; Peng, W.; Feng, X.; Qin, B.; et al. A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions. ACM Trans. Inf. Syst. 2025, 43, 42. [Google Scholar] [CrossRef]
- Wen, B.; Xu, C.; Han, B.; Wolfe, R.; Wang, L.L.; Howe, B. Mitigating Overconfidence in Large Language Models: A Behavioral Lens on Confidence Estimation and Calibration. In Proceedings of the NeurIPS 2024 Workshop on Behavioral Machine Learning, Vancouver, BC, Canada, 10 October 2024. [Google Scholar]
- Lewis, P.; Perez, E.; Piktus, A.; Petroni, F.; Karpukhin, V.; Goyal, N.; Küttler, H.; Lewis, M.; Yih, W.; Rocktäschel, T.; et al. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Proceedings of the Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2020; Volume 33, pp. 9459–9474. [Google Scholar]
- Gao, Y.; Xiong, Y.; Gao, X.; Jia, K.; Pan, J.; Bi, Y.; Dai, Y.; Sun, J.; Wang, M.; Wang, H. Retrieval-Augmented Generation for Large Language Models: A Survey. arXiv 2024, arXiv:2312.10997. [Google Scholar] [CrossRef]
- Ke, Y.H.; Jin, L.; Elangovan, K.; Abdullah, H.R.; Liu, N.; Sia, A.T.H.; Soh, C.R.; Tung, J.Y.M.; Ong, J.C.L.; Kuo, C.-F.; et al. Retrieval Augmented Generation for 10 Large Language Models and Its Generalizability in Assessing Medical Fitness. npj Digit. Med. 2025, 8, 187. [Google Scholar] [CrossRef]
- Banbury, C.R.; Reddi, V.J.; Lam, M.; Fu, W.; Fazel, A.; Holleman, J.; Huang, X.; Hurtado, R.; Kanter, D.; Lokhmotov, A.; et al. Benchmarking TinyML Systems: Challenges and Direction. arXiv 2021, arXiv:2003.04821. [Google Scholar] [CrossRef]
- Team, G.; Riviere, M.; Pathak, S.; Sessa, P.G.; Hardin, C.; Bhupatiraju, S.; Hussenot, L.; Mesnard, T.; Shahriari, B.; Ramé, A.; et al. Gemma 2: Improving Open Language Models at a Practical Size. arXiv 2024, arXiv:2408.00118. [Google Scholar] [CrossRef]
- Zheng, Y.; Chen, Y.; Qian, B.; Shi, X.; Shu, Y.; Chen, J. A Review on Edge Large Language Models: Design, Execution, and Applications. ACM Comput. Surv. 2025, 57, 209. [Google Scholar] [CrossRef]
- Wang, X.; Tang, Z.; Guo, J.; Meng, T.; Wang, C.; Wang, T.; Jia, W. Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models. ACM Comput. Surv. 2025, 57, 228. [Google Scholar] [CrossRef]
- Wang, R.; Gao, Z.; Zhang, L.; Yue, S.; Gao, Z. Empowering Large Language Models to Edge Intelligence: A Survey of Edge Efficient LLMs and Techniques. Comput. Sci. Rev. 2025, 57, 100755. [Google Scholar] [CrossRef]
- Sparrenberg, L.; Deußer, T.; Berger, A.; Sifa, R. Small and Fast LLMs on Commodity Hardware: Post-Training Quantization in Llama. Cpp. In Proceedings of the 2025 IEEE 12th International Conference on Data Science and Advanced Analytics (DSAA), Birmingham, UK, 9–12 October 2025; pp. 1–10. [Google Scholar]
- Egashira, K.; Vero, M.; Staab, R.; He, J.; Vechev, M. Exploiting LLM Quantization. Adv. Neural Inf. Process. Syst. 2024, 37, 41709–41732. [Google Scholar] [CrossRef]
- Cvach, M. Monitor Alarm Fatigue: An Integrative Review. Biomed. Instrum. Technol. 2012, 46, 268–277. [Google Scholar] [CrossRef]
- Michels, E.A.M.; Gilbert, S.; Koval, I.; Wekenborg, M.K. Alarm Fatigue in Healthcare: A Scoping Review of Definitions, Influencing Factors, and Mitigation Strategies. BMC Nurs. 2025, 24, 664. [Google Scholar] [CrossRef] [PubMed]
- Bliss, J.P.; Dunn, M.C. Behavioural Implications of Alarm Mistrust as a Function of Task Workload. Ergonomics 2000, 43, 1283–1300. [Google Scholar] [CrossRef] [PubMed]
- Sweller, J. Cognitive Load during Problem Solving: Effects on Learning. Cogn. Sci. 1988, 12, 257–285. [Google Scholar] [CrossRef]
- Wickens, C.D. Multiple Resources and Performance Prediction. Theor. Issues Ergon. Sci. 2002, 3, 159–177. [Google Scholar] [CrossRef]
- Endsley, M.R.; Jones, D.G. Designing for Situation Awareness: An Approach to User-Centered Design, 3rd ed.; CRC Press: Boca Raton, FL, USA, 2025; ISBN 978-1-003-38823-4. [Google Scholar]
- Parasuraman, R.; Riley, V. Humans and Automation: Use, Misuse, Disuse, Abuse. Hum. Factors 1997, 39, 230–253. [Google Scholar] [CrossRef]
- Mosier, K.L.; Skitka, L.J. Automation Use and Automation Bias. Proc. Hum. Factors Ergon. Soc. Annu. Meet. 1999, 43, 344–348. [Google Scholar] [CrossRef]
- Parasuraman, R.; Manzey, D.H. Complacency and Bias in Human Use of Automation: An Attentional Integration. Hum. Factors 2010, 52, 381–410. [Google Scholar] [CrossRef]
- Romeo, G.; Conti, D. Exploring Automation Bias in Human–AI Collaboration: A Review and Implications for Explainable AI. AI Soc. 2026, 41, 259–278. [Google Scholar] [CrossRef]
- Dzindolet, M.T.; Peterson, S.A.; Pomranky, R.A.; Pierce, L.G.; Beck, H.P. The Role of Trust in Automation Reliance. Int. J. Hum. -Comput. Stud. 2003, 58, 697–718. [Google Scholar] [CrossRef]
- Lee, J.D.; See, K.A. Trust in Automation: Designing for Appropriate Reliance. Hum. Factors 2004, 46, 50–80. [Google Scholar] [CrossRef]
- Tatasciore, M.; Loft, S. Calibrating Reliance on Automated Advice: Transparency and Trust Calibration Feedback. Int. J. Hum. –Comput. Interact. 2025, 41, 14723–14733. [Google Scholar] [CrossRef]
- Bandodkar, A.J.; Wang, J. Non-Invasive Wearable Electrochemical Sensors: A Review. Trends Biotechnol. 2014, 32, 363–371. [Google Scholar] [CrossRef] [PubMed]
- Chong, Y.-W.; Ismail, W.; Ko, K.; Lee, C.-Y. Energy Harvesting for Wearable Devices: A Review. IEEE Sens. J. 2019, 19, 9047–9062. [Google Scholar] [CrossRef]
- Ali, A.; Shaukat, H.; Bibi, S.; Altabey, W.A.; Noori, M.; Kouritem, S.A. Recent Progress in Energy Harvesting Systems for Wearable Technology. Energy Strategy Rev. 2023, 49, 101124. [Google Scholar] [CrossRef]
- Perez, A.J.; Zeadally, S. Recent Advances in Wearable Sensing Technologies. Sensors 2021, 21, 6828. [Google Scholar] [CrossRef]
- Mahato, K. Introduction to Biosensors for Personalized Health. In Biosensors for Personalized Healthcare; Mahato, K., Chandra, P., Eds.; Springer Nature: Singapore, 2024; pp. 1–25. ISBN 978-981-97-5473-1. [Google Scholar]
- Qureshi, R.; Irfan, M.; Ali, H.; Khan, A.; Nittala, A.S.; Ali, S.; Shah, A.; Gondal, T.M.; Sadak, F.; Shah, Z.; et al. Artificial Intelligence and Biosensors in Healthcare and Its Clinical Relevance: A Review. IEEE Access 2023, 11, 61600–61620. [Google Scholar] [CrossRef]
- Prabhune, S.; Berndt, D.J. Deploying Large Language Models with Retrieval Augmented Generation. arXiv 2024, arXiv:2411.11895. [Google Scholar]
- Frantar, E.; Ashkboos, S.; Hoefler, T.; Alistarh, D. GPTQ: Accurate Post-Training Quantization for Generative Pre-Trained Transformers. arXiv 2023, arXiv:2210.17323. [Google Scholar]
- Lin, J.; Tang, J.; Tang, H.; Yang, S.; Chen, W.-M.; Wang, W.-C.; Xiao, G.; Dang, X.; Gan, C.; Han, S. AWQ: Activation-Aware Weight Quantization for On-Device LLM Compression and Acceleration. Proc. Mach. Learn. Syst. 2024, 6, 87–100. [Google Scholar] [CrossRef]
- Antikainen, E.; Njoum, H.; Kudelka, J.; Branco, D.; Rehman, R.Z.U.; Macrae, V.; Davies, K.; Hildesheim, H.; Emmert, K.; Reilmann, R.; et al. Assessing Fatigue and Sleep in Chronic Diseases Using Physiological Signals from Wearables: A Pilot Study. Front. Physiol. 2022, 13, 968185. [Google Scholar] [CrossRef]
- Brunyé, T.T.; Goring, S.A.; Cantelon, J.A.; Eddy, M.D.; Elkin-Frankston, S.; Elmore, W.R.; Giles, G.E.; Hancock, C.L.; Masud, S.B.; McIntyre, J.; et al. Trait-Level Predictors of Human Performance Outcomes in Personnel Engaged in Stressful Laboratory and Field Tasks. Front. Psychol. 2024, 15, 1449200. [Google Scholar] [CrossRef] [PubMed]
- Tiwari, A.; Cassani, R.; Kshirsagar, S.; Tobon, D.P.; Zhu, Y.; Falk, T.H. Modulation Spectral Signal Representation for Quality Measurement and Enhancement of Wearable Device Data: A Technical Note. Sensors 2022, 22, 4579. [Google Scholar] [CrossRef]
- Tomonaga, S.; Mizutani, H.; Doya, K. Training Recurrent Neural Networks with Inherent Missing Data for Wearable Device Applications. Proc. AAAI Conf. Artif. Intell. 2025, 39, 29512–29513. [Google Scholar] [CrossRef]
- Gong, T.; Jang, S.Y.; Acer, U.G.; Kawsar, F.; Min, C. Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables. IEEE Trans. Mob. Comput. 2025, 24, 9319–9333. [Google Scholar] [CrossRef]
- Iyengar, R.; Dong, Q.; Nguyen, C.; Pillai, P.; Satyanarayanan, M. Offload Shaping for Wearable Cognitive Assistance. Electronics 2024, 13, 4083. [Google Scholar] [CrossRef]
- González-Potes, A.; Martínez-Castro, D.; Paredes, C.M.; Ochoa-Brust, A.; Mena, L.J.; Martínez-Peláez, R.; Félix, V.G.; Félix-Cuadras, R.A. Hybrid AI and LLM-Enabled Agent-Based Real-Time Decision Support Architecture for Industrial Batch Processes: A Clean-in-Place Case Study. AI 2026, 7, 51. [Google Scholar] [CrossRef]
- Roy, K. Process-Grounded Knowledge-Infused Learning and Decision Making. Ph.D. Thesis, University of South Carolina, Columbia, SC, USA, 2025. [Google Scholar]
- Griffin, C.; Thomson, L.; Shlegeris, B.; Abate, A. Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols. arXiv 2024, arXiv:2409.07985. [Google Scholar] [CrossRef]
- Malloy, T.; Gonzalez, C. Applying Generative Artificial Intelligence to Cognitive Models of Decision Making. Front. Psychol. 2024, 15, 1387948. [Google Scholar] [CrossRef] [PubMed]
- Mai, M.V.; Ozkaynak, M.; Marquard, J.L.; Simonson, R.J.; Holden, R.J.; Barton, H.J.; Catchpole, K.; Jamil, M.; Brauer, S.; Kandaswamy, S. SHAPE-AI: Development and Expert Validation of a Survey for Human—AI Performance Evaluation in Healthcare. medRxiv 2026. [Google Scholar] [CrossRef]
- Abbas, Q.; Jeong, W.; Lee, S.W. Explainable AI in Clinical Decision Support Systems: A Meta-Analysis of Methods, Applications, and Usability Challenges. Healthcare 2025, 13, 2154. [Google Scholar] [CrossRef]
- Brunyé, T.T.; Mitroff, S.R.; Elmore, J.G. Artificial Intelligence and Computer-Aided Diagnosis in Diagnostic Decisions: 5 Questions for Medical Informatics and Human-Computer Interface Research. J. Am. Med. Inf. Assoc. 2026, 33, 543–550. [Google Scholar] [CrossRef]
- Barocas, S.; Nissenbaum, H. Big Data’s End Run around Procedural Privacy Protections. Commun. ACM 2014, 57, 31–33. [Google Scholar] [CrossRef]
- Capulli, E.; Druda, Y.; Palmese, F.; Butt, A.H.; Domenicali, M.; Macchiarelli, A.G.; Silvani, A.; Bedogni, G.; Ingravallo, F. Ethical and Legal Implications of Health Monitoring Wearable Devices: A Scoping Review. Soc. Sci. Med. 2025, 370, 117685. [Google Scholar] [CrossRef]
- Radanliev, P. Privacy, Ethics, Transparency, and Accountability in AI Systems for Wearable Devices. Front. Digit. Health 2025, 7, 1431246. [Google Scholar] [CrossRef]
- Tabassi, E. Artificial Intelligence Risk Management Framework (AI RMF 1.0); NIST: Gaithersburg, MD, USA, 2023. [Google Scholar]
- Raji, I.D.; Smart, A.; White, R.N.; Mitchell, M.; Gebru, T.; Hutchinson, B.; Smith-Loud, J.; Theron, D.; Barnes, P. Closing the AI Accountability Gap: Defining an End-to-End Framework for Internal Algorithmic Auditing. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency; Association for Computing Machinery: New York, NY, USA, 2020; pp. 33–44. [Google Scholar]
- Sabt, M.; Achemlal, M.; Bouabdallah, A. Trusted Execution Environment: What It Is, and What It Is Not. In Proceedings of the 2015 IEEE Trustcom/BigDataSE/ISPA, Washington, DC, USA, 20–22 August 2015; Volume 1, pp. 57–64. [Google Scholar]
- Zuboff, S. The Age of Surveillance Capitalism. In Social Theory Re-Wired; Routledge: Abingdon, UK, 2023. [Google Scholar]
- Selbst, A.D.; Boyd, D.; Friedler, S.A.; Venkatasubramanian, S.; Vertesi, J. Fairness and Abstraction in Sociotechnical Systems. In Proceedings of the Conference on Fairness, Accountability, and Transparency; Association for Computing Machinery: New York, NY, USA, 2019; pp. 59–68. [Google Scholar]
- Mehrotra, S.; Degachi, C.; Vereschak, O.; Jonker, C.M.; Tielman, M.L. A Systematic Review on Fostering Appropriate Trust in Human-AI Interaction: Trends, Opportunities and Challenges. ACM J. Responsible Comput. 2024, 1, 26. [Google Scholar] [CrossRef]
- Li, Y.; Wu, B.; Huang, Y.; Luan, S. Developing Trustworthy Artificial Intelligence: Insights from Research on Interpersonal, Human-Automation, and Human-AI Trust. Front. Psychol. 2024, 15, 1382693. [Google Scholar] [CrossRef]
- Montealegre-López, N. Exploring the Role of Trust in AI-Driven Decision-Making: A Systematic Literature Review. Manag. Rev. Q. 2025. [Google Scholar] [CrossRef]


| Layer | Functional Role | Input Variables | Output Formalization |
|---|---|---|---|
| Acquisition | Quality-aware data collection | Paired output where quality is an explicit input | |
| Estimation | Probabilistic state mapping | ||
| Reasoning | Grounded interpretation | : Schema-constrained actions or interpretations | |
| Policy | Attention-control and gatekeeping | , user state, task demands | : Intervention decision (e.g., alert, defer, abstain) |
| Metric | Layer 1 (Sensing) | Layer 2 (State Estimation) | Layer 3 (Reasoning) | Layer 4 (Intervention Policy) |
|---|---|---|---|---|
| Primary role | Signal acquisition, preprocessing, signal-quality flags | Probabilistic state estimation, uncertainty, drift detection, personalization within bounds | Contextual synthesis, explanation, option generation, retrieval-grounded recommendation | Decide whether, when, and how to cue and advise user. |
| Most realistic device location today | Watch, chest-worn node, textile node, or other wearable sensor | Smartphone, body-worn computer, vehicle-mounted edge processor | Smartphone, body-worn computer, or other local edge processor | Likely same device as Layer 2 or 3. |
| Near-term implementation status | Clearly implementable today | Realistically implementable today on companion hardware | Implementable today only when bounded, compressed, and event-triggered | Implementable today as lightweight gating logic. |
| Temporal profile | Continuous or high-frequency sampling | Recurrent low-latency updates | Intermittent and event-triggered | Near-immediate once upstream outputs are available. |
| Typical latency target | Milliseconds to sub-second preprocessing windows | Sub-second to few-second state updates | Second-level acceptable in many use cases if not safety-critical; should be faster when urgency is high | Low-latency cue selection once recommendation state is available. |
| Main memory/compute pressure | Very limited memory; lightweight DSP/feature extraction only | Compact predictive model, short-term history, calibration/drift checks | Highest memory burden; retrieval store, compressed model, schema enforcement, tool use | Minimal additional compute beyond policy evaluation. |
| Main power pressure | Highest sensitivity to battery drain and duty cycle | Moderate; repeated on-device inference must remain stable over prolonged use | High per-call energy cost but reduced by intermittent use | Low, provided cueing logic remains simple. |
| What should usually not run here | Full contextual reasoning, persistent retrieval, large-model inference | Heavy free-form generation if tighter real-time inference is required | Continuous always-on sampling or raw-signal preprocessing | Complex interpretation without uncertainty and drift inputs. |
| Preferred implementation style | Lightweight signal processing, feature extraction, compression, local quality estimation | Compact probabilistic model, bounded personalization, abstention and drift logic | Compressed local LLM, small reasoning model, rules + retrieval hybrid, strict schemas | Rule-based or utility-based gating policy. |
| Fallback under constraint | Lower sampling rate, fewer channels, more buffering, reduced transmission | Lower update frequency, simpler model, wider uncertainty, increased abstention | Template-based logic, rule-based heuristics, deferred explanation, abstention | Silence/defer cue, request recalibration, or log only. |
| Conceptual vs implementable distinction | Largely implementable with current wearables | Largely implementable on current companion devices | Partly implementable now; strongest claims remain conditional on bounded local reasoning and compression | Implementable now if framed as a lightweight attention policy. |
| Example Outcome | Layer 1 Inputs | Layer 2 Outputs | Layer 3 Outputs | Layer 4 Action |
|---|---|---|---|---|
| Alert | PPG, skin temperature, IMU, task context, acceptable quality | Elevated fatigue and thermal strain posterior, moderate-to-high confidence, low drift | Retrieved local heat guidance, structured recommendation to rest and reassess | Brief haptic or audio cue at feasible task boundary |
| Defer | Physiological arousal and task-demand indicators | Elevated stress and workload posterior, moderate confidence | Monitor and defer recommendation due to task phase | No immediate alert, delay cue until lower-demand period |
| Abstain | Poor contact, motion artifact, missingness, unusual context | Wide uncertainty, unstable posterior, drift flag | No strong recommendation, request recalibration, sensor check | Abstain, log event, or fall back to simpler monitoring |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Brunyé, T.T.; Petrimoulx, M.V.; Cantelon, J.A. From Sensing to Sense-Making: A Framework for On-Person Intelligence with Wearable Biosensors and Edge LLMs. Sensors 2026, 26, 2034. https://doi.org/10.3390/s26072034
Brunyé TT, Petrimoulx MV, Cantelon JA. From Sensing to Sense-Making: A Framework for On-Person Intelligence with Wearable Biosensors and Edge LLMs. Sensors. 2026; 26(7):2034. https://doi.org/10.3390/s26072034
Chicago/Turabian StyleBrunyé, Tad T., Mitchell V. Petrimoulx, and Julie A. Cantelon. 2026. "From Sensing to Sense-Making: A Framework for On-Person Intelligence with Wearable Biosensors and Edge LLMs" Sensors 26, no. 7: 2034. https://doi.org/10.3390/s26072034
APA StyleBrunyé, T. T., Petrimoulx, M. V., & Cantelon, J. A. (2026). From Sensing to Sense-Making: A Framework for On-Person Intelligence with Wearable Biosensors and Edge LLMs. Sensors, 26(7), 2034. https://doi.org/10.3390/s26072034

