
2. (6pts) Consider the following environment represented as a directed graph, in which circles represent states, double

Posted: Thu Jul 14, 2022 2:18 pm
by answerhappygod
[Two image attachments (33.66 KiB and 19.79 KiB); the directed graph they show is not reproduced in this text.]
2. (6pts) Consider the following environment represented as a directed graph, in which circles represent states, double circles represent the goal, and arrows represent actions. Assume you start at state 0. Assume all actions are deterministic. Transitioning to state 1 produces a reward of 0, while transitioning to state 2 produces a reward of 1.
- 2-a What are the optimal state-values and state-action-values for this environment?
- 2-b What is the optimal policy for this environment?
- 2-c Assume we introduce a discount factor of 0.95 into our value functions. Determine the new values of the state-value and state-action-value functions as well as the new optimal policy. Describe the effect of the discount factor on the optimal policy.
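Since the graph itself is only in the attachments, the sketch below does not answer the question. It only illustrates how parts 2-a to 2-c could be checked numerically with value iteration on a deterministic graph. Everything about the layout is an assumption made for illustration: the edges in `toy_transitions`, the use of state 3 as the goal, and the reward of 0 for entering the goal are hypothetical. Only the rewards for entering state 1 (0) and state 2 (1) come from the problem statement. The function name `value_iteration` and its parameters are also just illustrative choices.

```python
def value_iteration(transitions, reward_on_entry, goal, gamma=1.0,
                    tol=1e-9, max_iters=10_000):
    """Value iteration for a deterministic graph.

    transitions: dict mapping each state to the list of states reachable
                 by one action (one successor per action).
    reward_on_entry: dict mapping a state to the reward for entering it
                     (missing states default to 0).
    Returns (v, q, policy), where q is keyed by (state, successor).
    """
    states = set(transitions) | {s2 for succ in transitions.values() for s2 in succ}
    v = {s: 0.0 for s in states}

    for _ in range(max_iters):
        delta = 0.0
        for s in states:
            successors = transitions.get(s, [])
            if s == goal or not successors:
                continue  # terminal states keep value 0
            best = max(reward_on_entry.get(s2, 0.0) + gamma * v[s2]
                       for s2 in successors)
            delta = max(delta, abs(best - v[s]))
            v[s] = best
        if delta < tol:
            break

    # State-action values: one entry per (state, successor) edge.
    q = {(s, s2): reward_on_entry.get(s2, 0.0) + gamma * v[s2]
         for s, succ in transitions.items() if s != goal
         for s2 in succ}
    # Greedy policy; ties are broken by the order of the successor list.
    policy = {s: max(succ, key=lambda s2: q[(s, s2)])
              for s, succ in transitions.items() if succ and s != goal}
    return v, q, policy


# HYPOTHETICAL layout, not the graph from the attachments.
toy_transitions = {0: [1, 2], 1: [2], 2: [3], 3: []}
toy_rewards = {1: 0.0, 2: 1.0}  # from the problem text; entering 3 assumed 0
toy_goal = 3

v_undisc, q_undisc, pi_undisc = value_iteration(toy_transitions, toy_rewards,
                                                toy_goal, gamma=1.0)
v_disc, q_disc, pi_disc = value_iteration(toy_transitions, toy_rewards,
                                          toy_goal, gamma=0.95)
```

In this toy layout the two actions from state 0 tie when there is no discount, because both routes collect the reward of 1 exactly once. With a discount factor of 0.95, the route that reaches state 2 immediately becomes strictly better, which is the kind of effect part 2-c asks about. With a discount factor of 1 the loop is only guaranteed to settle on graphs without reward-producing cycles, so `max_iters` acts as a safety cap.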