Assignment #8

Background for all three papers

Provide a short discussion of each of the assigned papers (listed under Course Materials). Below are some questions to think about.

RL in Robotics
Read only Sections 2.2.2 and 2.3. This reading is background on policy-gradient methods; focus on the finite-difference and REINFORCE methods.
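
For orientation, here is a minimal sketch, not taken from the reading, that contrasts the two gradient estimators on a toy problem. Everything in it is an illustrative assumption: a one-step, two-armed bandit where arm 1 pays 1 and arm 0 pays 0, a policy with a single parameter theta that picks arm 1 with probability sigmoid(theta), and hypothetical function names. The finite-difference estimate perturbs theta and compares sampled returns; the REINFORCE estimate averages the score function times the reward.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def rollout(theta, rng):
    """One-step episode: sample an action from the policy, return (action, reward)."""
    p = sigmoid(theta)
    action = 1 if rng.random() < p else 0
    reward = float(action)  # assumed toy reward: arm 1 pays 1, arm 0 pays 0
    return action, reward


def estimate_return(theta, rng, n):
    """Monte Carlo estimate of the expected return J(theta)."""
    return sum(rollout(theta, rng)[1] for _ in range(n)) / n


def finite_difference_gradient(theta, rng, n=50000, eps=0.5):
    """Central difference: (J(theta + eps) - J(theta - eps)) / (2 * eps)."""
    j_plus = estimate_return(theta + eps, rng, n)
    j_minus = estimate_return(theta - eps, rng, n)
    return (j_plus - j_minus) / (2.0 * eps)


def reinforce_gradient(theta, rng, n=50000):
    """Likelihood-ratio estimate: mean of grad log pi(action) * reward."""
    p = sigmoid(theta)
    total = 0.0
    for _ in range(n):
        action, reward = rollout(theta, rng)
        # For this Bernoulli policy, d/dtheta log pi(action) = action - p.
        total += (action - p) * reward
    return total / n


def exact_gradient(theta):
    """Analytic gradient for this toy problem: J = p, so dJ/dtheta = p * (1 - p)."""
    p = sigmoid(theta)
    return p * (1.0 - p)
```

Calling both estimators at the same theta with a seeded `random.Random` and comparing against `exact_gradient` shows the trade-off: the finite-difference estimate needs rollouts at perturbed parameters and is biased by the step size, while REINFORCE uses rollouts from the current policy only but depends on the policy being differentiable.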

Solving deep memory POMDPs with recurrent policy gradients

Deep recurrent Q-learning for partially observable MDPs

Upload a single PDF file through Stellar by Mar 7 at 10 am.