Closed apls777 closed 4 years ago
Thanks for digging into this.
We can investigate the compressed checkpoint and will fix it soon.
Thanks for letting us know. Checkpoints updated.
Great! Thank you, Xianzhi!
@apls777 I'm still seeing a broken checkpoint wrt spinenet-190 in #913, any chance you could confirm this still works?
@raj-shah I can confirm that it worked a year ago when they re-uploaded this checkpoint. I guess they changed the codebase slightly but didn't update some checkpoints, so I would suggest you try to check out a 1-year-old version of this repo and try this checkpoint again.
@apls777 thanks for the tip! I suspected the same and have already tried all branches (up to and including r2.1
) but sadly no luck!
@raj-shah Try r1.15
, it looks like r1.x
and r2.x
branches are being updated independently.
@apls777 I seem to have missed r2.2.0
, works perfectly now. Thanks for looking into it!
The easiest way to check it is to use the
inspect_checkpoint.py
script:It's printing the variables and at some point shows the following error:
The same happens for the SpineNet-143 checkpoint:
Moreover, the data files in the SpineNet-143 and SpineNet-190 checkpoints have exactly the same size:
It looks like the variables that didn't fit into the 512 MB of the data file were removed.
Could you, please, reupload those checkpoints to the GS bucket?
@xianzhidu @pengchongjin