To content
Fakultät für Informatik

Reinforcement Learning

Contents:

  • The reinforcement learning problem
  • Multi-armed bandits
  • Markov Decision processes
  • Dynamic programming
  • Monte Carlo Methods
  • Temporal-difference learning
  • On- and off-policy methods
  • Elligibility traces
  • Policy gradients