Back

强化学习纲要(IntroToRL)笔记

What is reinforcement learning

a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives while interacting with a complex and uncertain environment.

Difference between Reinforcement Learning and Supervised Learning

  • Sequential data as input(not i.i.d) 序列化输入
  • The learner is not told which actions to take, but instead must discover which actions yield the most reward by trying them.
  • Trial-and-error exploration(balance between exploration and exploitation)
  • There is no supervisor, only a reward signal, which is also delayed.

Features of Reinforcement Learning

  • Trial-and-error exploration
  • Delayed reward
  • Time matters (sequential data, non i.i.d data)
  • Agent’s actions affect the subsequent data it receives (agent’s action changes the environment)
Licensed under CC BY-NC-SA 4.0
comments powered by Disqus
Built with Hugo
Theme Stack designed by Jimmy