opencog / rocca

Rational OpenCog Controlled Agent (ROCCA). Use OpenCog to control a rational agent in OpenAI Gym and Malmo environments.
GNU Affero General Public License v3.0
30 stars 18 forks source link

[WIP] Learning to solve the Cartpole-v1 environment #19

Closed ntoxeg closed 2 years ago

ntoxeg commented 3 years ago

This is my attempt at making ROCCA able to solve (i. e. achieve average total reward of at least 195 over 100 trials) the Cartpole-v1 Gym environment.

Current Challenges

A more general challenge is to figure out how the system should discover things like “I should check if the angle of the pole is smaller than some number”, so generally come up with predicates and calculations to be applied on observations.

ngeiswei commented 3 years ago

That looks very cool, @ntoxeg, note that I likely won't be able to look into it (as well as merging your PR #18) before a couple weeks. Then I should have more time for it (we'll probably want to schedule a call or something).

ntoxeg commented 3 years ago

I understand @ngeiswei, I will keep working on it and we can certainly schedule a call when you have more time to discuss this.

ntoxeg commented 3 years ago

Rebased against master.

Some summary of current changes:

ntoxeg commented 3 years ago

Ah, I also should mention that I’ve been working on using the atomspace-rocks to have snapshots of the AtomSpace saved under snapshots directory - I don’t use that for anything yet but it could prove useful in the future for observing how the knowledge base evolves over time.

ngeiswei commented 3 years ago

Note that multithreading support is very prototypical at this point. It shouldn't crash reasoning but it's not gonna bring much performance gain.

ntoxeg commented 2 years ago

Closing because of #32, no point in holding this open indefinitely.