Tag: function approximation

Understanding SARSA: Finite-Sample Analysis and Its Impact on Reinforcement Learning

In reinforcement learning, SARSA (State-Action-Reward-State-Action) is one of the most widely studied algorithms. It is an on-policy method designed to learn optimal policies in Markov decision processes (MDPs). The recent research conducted by Shaofeng Zou, Tengyu Xu, and…

© 2024 Christophe Garon
