Closed wcarvalho closed 2 years ago
Notes on versions:
e7e99762369c2ab2871d1c4bc6b6ab776eddf48c
TypeError: __init__() got an unexpected keyword argument 'checkpoint_time_delta_minutes'
@wcarvalho I don't see any obvious problems. There are a couple of things you can try to narrow down the problem:
This should tell you whether the problem is related to your R2D2 modifications and if it is isolated to R2D2 or not.
Please reopen if this is still an issue or if you have more findings.
First, I want to acknowledge what a great library this is, and I'm sure it's really hard to get everything working for the general public.
I've been plagued by a memory leak for the past 3 months and I have no idea why it's happening. I've been able to reduce it to basically vanilla ACME R2D2 code plus a custom haiku module. As a note, this was happening with my own custom acme agent that was based on DQN. Here is minimal code that reproduces the leak.
Changes:
Some notes on the leak:
10_000
I've attached a graph created using `memory_profiler`. I think the red line is the replay buffer. I think yellow (which keeps growing) is the learner.
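For anyone trying to pin down which allocation sites keep growing, here is a minimal sketch using the standard-library `tracemalloc` module instead of `memory_profiler`. It is not taken from the reproduction code: `fake_learner_step`, `leaky_store`, and `top_growth` are hypothetical names, and the "leak" is simulated by a list that is never cleared. Note that `tracemalloc` only sees memory allocated through Python's allocators, so growth inside native buffers would not show up here.

```python
import tracemalloc


def top_growth(before, after, limit=5):
    """Return the allocation sites whose size grew most between two snapshots."""
    # compare_to sorts by absolute size difference, largest first.
    stats = after.compare_to(before, "lineno")
    return [s for s in stats if s.size_diff > 0][:limit]


# Hypothetical stand-in for a learner that accidentally retains data each step.
leaky_store = []


def fake_learner_step(step):
    leaky_store.append(bytearray(1_000))  # never released: simulated leak


tracemalloc.start()
before = tracemalloc.take_snapshot()
for step in range(1_000):
    fake_learner_step(step)
after = tracemalloc.take_snapshot()
tracemalloc.stop()

growth = top_growth(before, after)
for stat in growth:
    # Each entry reports file:line plus the size and count deltas.
    print(stat)
```

Taking snapshots every N learner steps in the real training loop and comparing them the same way should show whether the growth comes from Python-level objects or from somewhere `tracemalloc` cannot see.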