juanjose49 / omscs-cs7641-machine-learning-assignment-4

I am open-sourcing the boilerplate code necessary for Assignment 4 so we can focus on the analysis instead.
GNU Lesser General Public License v3.0
51 stars 79 forks

Policy equality #6

Closed icarusalways closed 8 years ago

icarusalways commented 8 years ago

Added methods for parsing command line arguments so that testing can be done without recompilation. Valid command options are:

- vi=true (run value iteration)
- pi=true (run policy iteration)
- ql=true (run QLearning)
- converge=true (run until consecutive policies are equal to one another)
- cc=5 (integer giving the number of consecutive policies that need to be equal to consider the run converged)
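For readers who want a picture of what this kind of parsing could look like, here is a minimal sketch. It is not the code from this pull request: the class name `RunOptions`, its fields, and the defaults (all flags off, `cc` defaulting to 2) are assumptions made for illustration. Only the option keys come from the list above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical holder for the key=value options described above.
final class RunOptions {
    final boolean vi;        // run value iteration
    final boolean pi;        // run policy iteration
    final boolean ql;        // run QLearning
    final boolean converge;  // run until consecutive policies are equal
    final int cc;            // number of consecutive equal policies required

    private RunOptions(Map<String, String> kv) {
        // Defaults are assumptions; the PR does not state them.
        this.vi = Boolean.parseBoolean(kv.getOrDefault("vi", "false"));
        this.pi = Boolean.parseBoolean(kv.getOrDefault("pi", "false"));
        this.ql = Boolean.parseBoolean(kv.getOrDefault("ql", "false"));
        this.converge = Boolean.parseBoolean(kv.getOrDefault("converge", "false"));
        this.cc = Integer.parseInt(kv.getOrDefault("cc", "2"));
    }

    // Splits each argument on the first '=' and stores key -> value.
    static RunOptions parse(String[] args) {
        Map<String, String> kv = new HashMap<>();
        for (String arg : args) {
            int eq = arg.indexOf('=');
            if (eq <= 0) {
                throw new IllegalArgumentException("Expected key=value, got: " + arg);
            }
            String key = arg.substring(0, eq).trim().toLowerCase();
            String value = arg.substring(eq + 1).trim();
            kv.put(key, value);
        }
        return new RunOptions(kv);
    }
}
```

With this sketch, `RunOptions.parse(new String[] {"vi=true", "converge=true", "cc=5"})` would enable value iteration and the convergence check with a window of five policies, leaving the other flags off.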

Created a method for checking whether two policies are equal to one another, which is used to test convergence. Two policies are equal if they select the same action in every valid state.
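The equality test and the convergence window can be sketched as follows. This is an illustration only, not the method added in this pull request: it assumes a policy can be represented as a plain map from a state name to an action name, and the names `PolicyEquality`, `samePolicy`, and `converged` are hypothetical. The actual code presumably works with the planning library's own policy and state types.

```java
import java.util.List;
import java.util.Map;

// Hypothetical helper; policies are modeled as state -> action maps (an assumption).
final class PolicyEquality {

    // Two policies are equal if they pick the same action in every valid state.
    // A state missing from either policy counts as a mismatch.
    static boolean samePolicy(Map<String, String> a,
                              Map<String, String> b,
                              Iterable<String> validStates) {
        for (String state : validStates) {
            String actionA = a.get(state);
            if (actionA == null || !actionA.equals(b.get(state))) {
                return false;
            }
        }
        return true;
    }

    // True when the last cc policies in history are all equal to one another.
    // Treating cc < 2 as "never converged" is an assumption of this sketch.
    static boolean converged(List<Map<String, String>> history,
                             List<String> validStates,
                             int cc) {
        if (cc < 2 || history.size() < cc) {
            return false;
        }
        // Compare each policy in the window with its predecessor.
        for (int i = history.size() - cc + 1; i < history.size(); i++) {
            if (!samePolicy(history.get(i - 1), history.get(i), validStates)) {
                return false;
            }
        }
        return true;
    }
}
```

Comparing only over the valid states keeps unreachable or terminal states, where the chosen action is arbitrary, from blocking convergence.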

Notes: For value iteration, convergence seems to be working properly. For policy iteration, the policies converge quickly but are not optimal. I haven't wrapped my head around this yet.