aimhubio / aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
https://aimstack.io
Apache License 2.0

Remote aim server leftover checkpoints #2548

Open vanhumbeecka opened 1 year ago

vanhumbeecka commented 1 year ago

🐛 Bug

I have the following setup:

However, the 'latest checkpoint' is always stored locally instead of on the server. In the repository I launch my code from, a folder named "aim:" (yes, including the colon) is created. You can see the result in the screenshot.

It seems to be some leftover checkpoints from aim? I'm not sure. Inspecting the output in aim shows no signs of issues. Everything seems to be in order.

Screenshot 2023-02-25 at 12 58 07

To reproduce

See above

Expected behavior

I expect nothing to be logged locally, with everything stored on the remote Aim server.

Environment

alberttorosyan commented 1 year ago

Hey @vanhumbeecka! Thanks for submitting the issue. In fact, Aim does not support storing checkpoints just yet (as there's no artifact support). On the other hand, the Lightning trainer implementation has some complicated logic for selecting the save_dir. You can check it here.

@tmynn, @mahnerak I recall you had some ideas on how this can be worked around? Please share your thoughts.
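
One possible explanation for the "aim:" folder, offered as an assumption rather than something confirmed in this thread: if the checkpoint directory is built by joining path segments onto the logger's save_dir, and that save_dir holds the remote repo URL, then the URL scheme becomes the first local directory. The sketch below only illustrates that path arithmetic. `resolve_ckpt_dir` is a hypothetical stand-in, not Lightning's or Aim's actual code, and the host, port, experiment name, and run hash are made up.

```python
import posixpath
from pathlib import PurePosixPath


def resolve_ckpt_dir(save_dir: str, name: str, version: str) -> str:
    """Hypothetical stand-in for a resolver that derives the checkpoint
    directory from the logger's save_dir (an assumption, not real library code)."""
    return posixpath.join(save_dir, name, version, "checkpoints")


def first_local_component(path: str) -> str:
    """Return the top-level directory the filesystem would see for this path."""
    return PurePosixPath(path).parts[0]


# Hypothetical remote repo URL; the real host and port are not given in the issue.
remote_repo = "aim://example-host:53800"

ckpt_dir = resolve_ckpt_dir(remote_repo, "my-experiment", "run-hash")
top_level = first_local_component(ckpt_dir)

# The URL scheme, colon included, is treated as an ordinary relative directory name.
print(ckpt_dir)
print(top_level)
```

If that is what happens here, passing an explicit local `dirpath` to Lightning's `ModelCheckpoint` callback may be worth trying, since the checkpoint location would then no longer be derived from the logger. This thread does not confirm that it resolves the problem.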