Open henrycharlesworth opened 7 months ago
Adding @tmynn to this thread as he has put the integration together.
@SGevorg, @henrycharlesworth, seems this line points to the real error:
TypeError: Timeout.__init__() missing 1 required positional argument: 'lock_file'
@henrycharlesworth, are you using the latest version of Aim?
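For anyone reading the TypeError quoted above: this is the message Python produces when an exception class whose constructor requires a `lock_file` argument is instantiated with no arguments. One possible way that can happen (an assumption, not something confirmed in this thread) is an error-forwarding layer that re-creates an exception from its type alone. The `Timeout` class and the `rebuild_without_args` helper below are hypothetical stand-ins for illustration, not the real library code.

```python
# Hypothetical stand-in for a lock-timeout exception whose constructor
# requires the lock file path. This is NOT the real library class.
class Timeout(TimeoutError):
    def __init__(self, lock_file):
        super().__init__(lock_file)
        self.lock_file = lock_file


def rebuild_without_args(exc):
    """Re-create an exception from its type alone (hypothetical helper).

    Returns the TypeError raised by the failed construction, so the
    original timeout ends up masked by a less informative error.
    """
    try:
        return type(exc)()
    except TypeError as err:
        return err


original = Timeout("/tmp/example.lock")
result = rebuild_without_args(original)
print(type(result).__name__, "-", result)
```

If this reading is right, the TypeError is a symptom and the underlying problem is a lock that could not be acquired in time, so the lock timeout is the thing to investigate.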
I think so - using 3.17.5. I tried a number of earlier versions and this didn't seem to help.
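To make version reports like the one above easy to reproduce, here is a minimal sketch that prints the installed versions of the relevant packages using only the standard library. The package list is an assumption: `filelock` is included on the guess that the `Timeout` class in the error comes from it, and `installed_version` is a helper name invented for this example.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version of a distribution, or 'not installed'."""
    try:
        return version(package)
    except PackageNotFoundError:
        return "not installed"


# Assumed list of packages relevant to this issue.
for name in ("aim", "filelock", "fairseq"):
    print(f"{name}: {installed_version(name)}")
```

Run this in the same environment as the fairseq experiment, and again in the environment that runs the Aim server if the two differ.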
Is there any solution for this? I get this error when I try to retrieve an existing run by its hash.
❓Question
The fairseq library has built-in support for Aim, but I am struggling to get it working. I'm not sure whether I'm doing something wrong or whether the fairseq support is out of date, but the fairseq repo is fairly inactive, so I thought I would ask here.
I am working locally and run `aim server`, and see: "Server is mounted on 0.0.0.0:53800". I then run my fairseq experiment, adding to my config.yaml file:
then run my experiment. It seems to be working initially - aim detects the experiment and the log starts with:
but then I get an error:
Does anyone have any idea what might be causing this, or whether there's something wrong with the approach I'm taking? I've tried a variety of different Aim versions (going back to the versions from when fairseq was more actively developed) and I still get errors.