Next Article in Journal
Rheological and Stability Assessment of Alternative Polymer Modifiers for Coal Dust Combustion
Previous Article in Journal
KLSBench: Evaluating LLM Capabilities on Korean Literary Sinitic Texts in Historical Context
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Constructing Non-Markovian Decision Process via History Aggregator

School of Computer Science, Peking University, Beijing 100871, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(2), 955; https://doi.org/10.3390/app16020955
Submission received: 22 December 2025 / Revised: 10 January 2026 / Accepted: 13 January 2026 / Published: 16 January 2026
(This article belongs to the Special Issue Advances in Intelligent Decision-Making Systems)

Abstract

In the domain of algorithmic decision-making, non-Markovian dynamics manifest as a significant impediment, especially for paradigms such as Reinforcement Learning (RL), thereby exerting far-reaching consequences on the advancement and effectiveness of the associated systems. Nevertheless, the existing benchmarks are deficient in comprehensively assessing the capacity of decision algorithms to handle non-Markovian dynamics. To address this deficiency, we have devised a generalized methodology grounded in category theory. Notably, we established the category of Markov Decision Processes (MDP) and the category of non-Markovian Decision Processes (NMDP), and proved the equivalence relationship between them. This theoretical foundation provides a novel perspective for understanding and addressing non-Markovian dynamics. We further introduced non-Markovianity into decision-making problem settings via the History Aggregator for State (HAS). With HAS, we can precisely control the state dependency structure of decision-making problems in the time series. Our analysis demonstrates the effectiveness of our method in representing a broad range of non-Markovian dynamics. This approach facilitates a more rigorous and flexible evaluation of decision algorithms by testing them in problem settings where non-Markovian dynamics are explicitly constructed.
Keywords: non-Markovian decision process; category theory; reinforcement learning non-Markovian decision process; category theory; reinforcement learning

Share and Cite

MDPI and ACS Style

Wang, Y.; Li, L.; Li, W. Constructing Non-Markovian Decision Process via History Aggregator. Appl. Sci. 2026, 16, 955. https://doi.org/10.3390/app16020955

AMA Style

Wang Y, Li L, Li W. Constructing Non-Markovian Decision Process via History Aggregator. Applied Sciences. 2026; 16(2):955. https://doi.org/10.3390/app16020955

Chicago/Turabian Style

Wang, Yongyi, Lingfeng Li, and Wenxin Li. 2026. "Constructing Non-Markovian Decision Process via History Aggregator" Applied Sciences 16, no. 2: 955. https://doi.org/10.3390/app16020955

APA Style

Wang, Y., Li, L., & Li, W. (2026). Constructing Non-Markovian Decision Process via History Aggregator. Applied Sciences, 16(2), 955. https://doi.org/10.3390/app16020955

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop