Open steventrouble opened 1 year ago
On it. I ran pip freeze
on the stability cluster and diffed it with LambdaLabs. Looks like there's quite a few differences there, so it won't be a super easy fix. I don't see any that look like they'd obviously cause an issue.
I'm assuming there's only one package that's mismatched, so I'll do a quick binary search on the packages to see which fixes the failure.
Running on a new LambdaLabs instance (A100 x1, 40GB) returns an error in
selfplay_worker.py
:The issue seems likely to be a version mismatch between gym and some other library we didn't specify the version of in the requirements.txt. Long term, we'll want to update everything to use gymnasium and the latest versions of ALE, but for now we need to figure out which package is causing this conflict and freeze it in the requirements file.
Full error below.