Reinforcement learning is a type of machine learning in which an agent learns to behave in an unknown environment by performing actions and seeing the ensuing results. The agent’s objective is to learn to act in ways that maximizes expected long-term rewards.
When it comes to reinforcement learning, there is no expected outcome—the agent makes the best decision based on its knowledge, for which it is either rewarded or penalized. As such, it is bound to learn from past experiences.