Homework 6

Homework for Lecture 6: Temporal Differnce 📝

Instructions:

- Show ALL Work, Neatly and in Order.
- No credit for Answers Without Work.
- Submit a single PDF file including all solutions.
- DO NOT submit individual files or images.
- For coding questions, submit ONE .py file with comments.

Note

For this homework, you only need numpy, gymnasium & pygame.

Coding Exercise 1: SARSA

For the GridWorldEnv environment, code the SARSA algorithm using the provided hyperparameters.

Coding Exercise 2: Q-Learning

For the GridWorldEnv environment, code the Q-learning algorithm using the provided hyperparameters.

Coding Exercise 3: Double Q-Learning

For the GridWorldEnv environment, code the Double Q-learning algorithm using the provided hyperparameters.