Previous Article in Journal
Fused Unbalanced Gromov–Wasserstein-Based Network Distributional Resilience Analysis for Critical Infrastructure Assessment
Previous Article in Special Issue
Training Agents for Strategic Curling Through a Unified Reinforcement Learning Framework
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games

1
Department of Computer and Artificial Intelligence, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
2
NUI/NUX Platform Research Center, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
3
Department of Computer Science and Artificial Intelligence, College of Advanced Convergence Engineering, Dongguk University-Seoul, 30 Pildongro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(3), 419; https://doi.org/10.3390/math14030419 (registering DOI)
Submission received: 25 December 2025 / Revised: 18 January 2026 / Accepted: 21 January 2026 / Published: 25 January 2026
(This article belongs to the Special Issue Applications of Intelligent Game and Reinforcement Learning)

Abstract

High sample complexity presents a major challenge in applying multi-agent reinforcement learning (MARL) to dynamic, high-dimensional sports such as basketball. To address this problem, we proposed the knowledge-embedded modular framework (KEMF), which partitions the environment into offense, defense, and loose-ball modules. Each module employs specialized policies and a knowledge-based observation layer enriched with basketball-specific metrics such as shooting success and defensive accuracy. These metrics are also incorporated into a dynamic and dense reward scheme that offers more direct and situation-specific feedback than sparse win/loss signals. We integrated these components into a multi-agent proximal policy optimization (MAPPO) algorithm to enhance training speed and improve sample efficiency. Evaluations using the commercial basketball game Freestyle indicate that KEMF outperformed previous methods in terms of the average points, winning rate, and overall training efficiency. An ablation study confirmed the synergistic effects of modularity, knowledge-embedded observations, and dense rewards. Moreover, a real-world deployment in 1457 live matches demonstrated the robustness of the framework, with trained agents achieving a 52.43% win rate against experienced human players. These results underscore the promise of the KEMF to enable efficient, adaptive, and strategically coherent MARL solutions in complex sporting environments.
Keywords: multi-agent reinforcement learning; game artificial intelligence; team sports game multi-agent reinforcement learning; game artificial intelligence; team sports game

Share and Cite

MDPI and ACS Style

Kim, J.; Park, J.; Cho, K. Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics 2026, 14, 419. https://doi.org/10.3390/math14030419

AMA Style

Kim J, Park J, Cho K. Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics. 2026; 14(3):419. https://doi.org/10.3390/math14030419

Chicago/Turabian Style

Kim, Junhyuk, Jisun Park, and Kyungeun Cho. 2026. "Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games" Mathematics 14, no. 3: 419. https://doi.org/10.3390/math14030419

APA Style

Kim, J., Park, J., & Cho, K. (2026). Enhancing Multi-Agent Reinforcement Learning via Knowledge-Embedded Modular Framework for Online Basketball Games. Mathematics, 14(3), 419. https://doi.org/10.3390/math14030419

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop