Next Article in Journal
Sine-Wave Filter Design Method for High-Speed PMSMs in High-Frequency (250 Hz) Drives
Previous Article in Journal
Beyond Resilience: Antifragility in Critical Infrastructure Cybersecurity
Previous Article in Special Issue
Virtual Testbed for Cyber-Physical System Security Research and Education: Design, Evaluation, and Impact
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Private-RAG: A Privacy-Preserving Retrieval-Augmented Generation Method for Large Model Inference

1
College of Computer Science and Technology, Xinjiang Normal University, Urumqi 830054, China
2
Xinjiang Engineering Research Center for Smart Education and Application, Urumqi 830054, China
*
Author to whom correspondence should be addressed.
Electronics 2026, 15(12), 2567; https://doi.org/10.3390/electronics15122567 (registering DOI)
Submission received: 26 April 2026 / Revised: 1 June 2026 / Accepted: 8 June 2026 / Published: 10 June 2026
(This article belongs to the Special Issue Trends in Information Systems and Security)

Abstract

Retrieval-augmented generation improves the factual consistency, knowledge timeliness, and scenario adaptability of large model inference services by incorporating external knowledge. However, it also introduces structural privacy risks, including private-knowledge leakage, prompt injection, and progressive information extraction in multi-turn interactions. To address these issues, this paper proposes Private-RAG, a privacy-preserving retrieval-augmented generation method for large model inference. The method constructs a composite threat model and a quantitative evaluation framework for the RAG pipeline, and further develops a layered collaborative defense mechanism consisting of controlled retrieval, sensitivity-aware context minimization, structured prompt isolation, and multi-criterion output gating. In addition, a risk feedback-driven budget accounting method is introduced to enable dynamic risk control in multi-turn interaction scenarios. Experimental results show that Private-RAG effectively reduces private-knowledge leakage, improves robustness against prompt injection, and suppresses cumulative privacy exposure while maintaining question-answering utility and a controllable deployment latency (e.g., 1165 ms), demonstrating superior privacy protection and inference robustness.
Keywords: large models; inference services; retrieval-augmented generation; layered collaborative defense; dynamic risk control large models; inference services; retrieval-augmented generation; layered collaborative defense; dynamic risk control

Share and Cite

MDPI and ACS Style

Yang, Q.; Li, Y.; Ma, X. Private-RAG: A Privacy-Preserving Retrieval-Augmented Generation Method for Large Model Inference. Electronics 2026, 15, 2567. https://doi.org/10.3390/electronics15122567

AMA Style

Yang Q, Li Y, Ma X. Private-RAG: A Privacy-Preserving Retrieval-Augmented Generation Method for Large Model Inference. Electronics. 2026; 15(12):2567. https://doi.org/10.3390/electronics15122567

Chicago/Turabian Style

Yang, Qianren, Yong Li, and Xiang Ma. 2026. "Private-RAG: A Privacy-Preserving Retrieval-Augmented Generation Method for Large Model Inference" Electronics 15, no. 12: 2567. https://doi.org/10.3390/electronics15122567

APA Style

Yang, Q., Li, Y., & Ma, X. (2026). Private-RAG: A Privacy-Preserving Retrieval-Augmented Generation Method for Large Model Inference. Electronics, 15(12), 2567. https://doi.org/10.3390/electronics15122567

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop