What is reinforcement learning

a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex and uncertain environment.

Difference between Reinforcement Learning and Supervised Learning

Sequential data as input(not i.i.d) 序列化输入
The learner is not told which actions to take, but instead must discover which actions yield the most reward by trying them.
Trial-and-error exploration(balance between exploration and exploitation)
There is no supervisor, only a reward signal, which is also delayed.

Features of Reinforcement Learning

Trial-and-error exploration
Delayed reward
Time matters (sequential data, non i.i.d data)
Agent’s actions affect the subsequent data it receives (agent’s action changes the environment)