UoA-CARES / cares_reinforcement_learning

CARES Reinforcement Learning Package
11 stars 2 forks source link

Update to Data Ratio #210

Open beardyFace opened 1 month ago

beardyFace commented 1 month ago

G loop parameter move into internal algorithm parameter - not all algorithms update both the actor and critic per G update cycle. See CrossQ as an example.