Previous Article in Journal
Metadata Suffices: Optimizer-Aware Fake Account Detection with Minimal Multimodal Input
Previous Article in Special Issue
Robust Clinical Querying with Local LLMs: Lexical Challenges in NL2SQL and Retrieval-Augmented QA on EHRs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering

1
School of Information Technology and Computer Science, Nile University, Giza 12588, Egypt
2
Graduate School of Information Science, University of Hyogo, Kobe 650-0047, Japan
3
Advanced Medical Engineering Research Institute, University of Hyogo, Himeji 670-0836, Japan
*
Authors to whom correspondence should be addressed.
Big Data Cogn. Comput. 2025, 9(12), 299; https://doi.org/10.3390/bdcc9120299 (registering DOI)
Submission received: 27 October 2025 / Revised: 18 November 2025 / Accepted: 21 November 2025 / Published: 23 November 2025

Abstract

High-performing medical Large Language Models (LLMs) typically require extensive fine-tuning with substantial computational resources, limiting accessibility for resource-constrained healthcare institutions. This study introduces a confidence-driven multi-model framework that leverages model diversity to enhance medical question answering without fine-tuning. Our framework employs a two-stage architecture: a confidence detection module assesses the primary model’s certainty, and an adaptive routing mechanism directs low-confidence queries to Helper models with complementary knowledge for collaborative reasoning. We evaluate our approach using Qwen3-30B-A3B-Instruct, Phi-4 14B, and Gemma 2 12B across three medical benchmarks; MedQA, MedMCQA, and PubMedQA. Results demonstrate that our framework achieves competitive performance, with particularly strong results in PubMedQA (0.95) and MedMCQA (0.78). Ablation studies confirm that confidence-aware routing combined with multi-model collaboration substantially outperforms single-model approaches and uniform reasoning strategies. This work establishes that strategic model collaboration offers a practical, computationally efficient pathway to improve medical AI systems, with significant implications for democratizing access to advanced medical AI in resource-limited settings.
Keywords: medical question answering; large language models; multi-model collaboration; confidence-driven routing; zero-shot learning, uncertainty calibration, model routing, cost–accuracy trade-off medical question answering; large language models; multi-model collaboration; confidence-driven routing; zero-shot learning, uncertainty calibration, model routing, cost–accuracy trade-off

Share and Cite

MDPI and ACS Style

Elshaer, Z.; Rashed, E.A. CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering. Big Data Cogn. Comput. 2025, 9, 299. https://doi.org/10.3390/bdcc9120299

AMA Style

Elshaer Z, Rashed EA. CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering. Big Data and Cognitive Computing. 2025; 9(12):299. https://doi.org/10.3390/bdcc9120299

Chicago/Turabian Style

Elshaer, Ziad, and Essam A. Rashed. 2025. "CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering" Big Data and Cognitive Computing 9, no. 12: 299. https://doi.org/10.3390/bdcc9120299

APA Style

Elshaer, Z., & Rashed, E. A. (2025). CURE: Confidence-Driven Unified Reasoning Ensemble Framework for Medical Question Answering. Big Data and Cognitive Computing, 9(12), 299. https://doi.org/10.3390/bdcc9120299

Article Metrics

Back to TopTop