Markov Decision Process
Reinforcement learning (RL) is a paradigm of machine learning where an agent learns to make decisions by performing actions in an environment to achieve a goal. The concept of Markov Decision Processes (MDPs) is central to understanding how reinforcement learning works. An MDP provides a mathematical framework for modeling decision-making