This course will cover the following topics:-
(1) Background and definitions
(2) The reward hypothesis
(3) Markov decision process
(4) Reinforcement learning objective
(5) The policy gradient theorem
(6) Reinforcement baseline
(7) State value
(8) Actor critic methods