Closed alvarofpp closed 8 months ago
Following the package standard for DRL, we need the class to have the following methods:
start
step
[Observation, Reward, Done]
reset
Observation
close
Suggestions:
EnvironmentBase
urnai.environments
Notes:
abc.ABC
Savable
Following the package standard for DRL, we need the class to have the following methods:
start
: Starts the environment.step
: Executes an action in the environment, and returns a tuple of[Observation, Reward, Done]
.reset
: Resets the environment and returns anObservation
.close
: Closes the environment.Suggestions:
EnvironmentBase
.urnai.environments
.Notes:
abc.ABC
.Savable
class.