This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games
by
Junhyuk Kim
Junhyuk Kim 1
,
Jisun Park
Jisun Park 2
and
Kyungeun Cho
Kyungeun Cho 3,*
1
Department of Computer and Artificial Intelligence, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
2
NUI/NUX Platform Research Center, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
3
Department of Computer Science and Artificial Intelligence, College of Advanced Convergence Engineering, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(3), 419; https://doi.org/10.3390/math14030419 (registering DOI)
Submission received: 25 December 2025
/
Revised: 18 January 2026
/
Accepted: 21 January 2026
/
Published: 25 January 2026
Abstract
High sample complexity presents a major challenge in applying multi-agent reinforcement learning (MARL) to dynamic, high-dimensional sports such as basketball. To address this problem, we proposed the knowledge-embedded modular framework (KEMF), which partitions the environment into offense, defense, and loose-ball modules. Each module employs specialized policies and a knowledge-based observation layer enriched with basketball-specific metrics such as shooting success and defensive accuracy. These metrics are also incorporated into a dynamic and dense reward scheme that offers more direct and situation-specific feedback than sparse win/loss signals. We integrated these components into a multi-agent proximal policy optimization (MAPPO) algorithm to enhance training speed and improve sample efficiency. Evaluations using the commercial basketball game Freestyle indicate that KEMF outperformed previous methods in terms of the average points, winning rate, and overall training efficiency. An ablation study confirmed the synergistic effects of modularity, knowledge-embedded observations, and dense rewards. Moreover, a real-world deployment in 1457 live matches demonstrated the robustness of the framework, with trained agents achieving a 52.43% win rate against experienced human players. These results underscore the promise of the KEMF to enable efficient, adaptive, and strategically coherent MARL solutions in complex sporting environments.
Share and Cite
MDPI and ACS Style
Kim, J.; Park, J.; Cho, K.
Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics 2026, 14, 419.
https://doi.org/10.3390/math14030419
AMA Style
Kim J, Park J, Cho K.
Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics. 2026; 14(3):419.
https://doi.org/10.3390/math14030419
Chicago/Turabian Style
Kim, Junhyuk, Jisun Park, and Kyungeun Cho.
2026. "Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games" Mathematics 14, no. 3: 419.
https://doi.org/10.3390/math14030419
APA Style
Kim, J., Park, J., & Cho, K.
(2026). Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics, 14(3), 419.
https://doi.org/10.3390/math14030419
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article metric data becomes available approximately 24 hours after publication online.