This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
LogPPO: A Log-Based Anomaly Detector Aided with Proximal Policy Optimization Algorithms
by
Zhihao Wang
Zhihao Wang ,
Jiachen Dong
Jiachen Dong and
Chuanchuan Yang
Chuanchuan Yang *
State Key Laboratory of Photonics and Communications, School of Electronics, Peking University, Beijing 100871, China
*
Author to whom correspondence should be addressed.
Smart Cities 2026, 9(1), 5; https://doi.org/10.3390/smartcities9010005 (registering DOI)
Submission received: 16 November 2025
/
Revised: 18 December 2025
/
Accepted: 24 December 2025
/
Published: 26 December 2025
Abstract
Cloud-based platforms form the backbone of smart city ecosystems, powering essential services such as transportation, energy management, and public safety. However, their operational complexity generates vast volumes of system logs, making manual anomaly detection infeasible and raising reliability concerns. This study addresses the challenge of data scarcity in log anomaly detection by leveraging Large Language Models (LLMs) to enhance domain-specific classification tasks. We empirically validate that domain-adapted classifiers preserve strong natural language understanding, and introduce a Proximal Policy Optimization (PPO)-based approach to align semantic patterns between LLM outputs and classifier preferences. Experiments were conducted using three Transformer-based baselines under few-shot conditions across four public datasets. Results indicate that integrating natural language analyses improves anomaly detection F1-Scores by 5–86% over the baselines, while iterative PPO refinement boosts classifier’s “confidence” in label prediction. This research pioneers a novel framework for few-shot log anomaly detection, establishing an innovative paradigm in resource-constrained diagnostic systems in smart city infrastructures.
Share and Cite
MDPI and ACS Style
Wang, Z.; Dong, J.; Yang, C.
LogPPO: A Log-Based Anomaly Detector Aided with Proximal Policy Optimization Algorithms. Smart Cities 2026, 9, 5.
https://doi.org/10.3390/smartcities9010005
AMA Style
Wang Z, Dong J, Yang C.
LogPPO: A Log-Based Anomaly Detector Aided with Proximal Policy Optimization Algorithms. Smart Cities. 2026; 9(1):5.
https://doi.org/10.3390/smartcities9010005
Chicago/Turabian Style
Wang, Zhihao, Jiachen Dong, and Chuanchuan Yang.
2026. "LogPPO: A Log-Based Anomaly Detector Aided with Proximal Policy Optimization Algorithms" Smart Cities 9, no. 1: 5.
https://doi.org/10.3390/smartcities9010005
APA Style
Wang, Z., Dong, J., & Yang, C.
(2026). LogPPO: A Log-Based Anomaly Detector Aided with Proximal Policy Optimization Algorithms. Smart Cities, 9(1), 5.
https://doi.org/10.3390/smartcities9010005
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.