I was playing with the Pendulum-v0 from OpenAIGym.jl, which has a continuous action space. It turns out that Reinfoce.jl's assertion for checking the bounds of action chosen by policy (iterators.jl, line 40) does not support the continuous action space of Pendulum-v0 which is of type LearnBase.IntervalSet{Array{Float64,1}}. EDIT: It works, I didn't pass the data in the required form :(
I was playing with the
Pendulum-v0
fromOpenAIGym.jl
, which has a continuous action space. It turns out thatReinfoce.jl
's assertion for checking the bounds of action chosen by policy (iterators.jl, line 40) does not support the continuous action space ofPendulum-v0
which is of typeLearnBase.IntervalSet{Array{Float64,1}}
. EDIT: It works, I didn't pass the data in the required form :(