Closed YuffieHuang closed 2 years ago
Hi @YuffieHuang, I found a checkpoint, but I'm not very sure if it belongs to this figure. You can test it and let me know if it works. Here you find the link.
@vgaraujov The sad thing happens that there seems a mismatch between the shared checkpoint and the current network structure. The shared checkpoint contains a weight named "infonce.Wk.weight". However, weights such as "Wk.0.weight", "Wk.0.bias", "Wk.1.weight", and "Wk.1.bias" are missing, which are required by the current structure of CPCv1.
I just checked my local repository (fortunately, I still have it), and I found another checkpoint. I think that one is correct—the last one undoubtedly belongs to a previous version of my implementation. Please check the same link. Let me know the result.
Yes! It works! Thank you so much!
Hi @vgaraujov. I'm sorry to bother you again. The training really takes a lot of time, especially since I can only use 1 GPU now. I'm wondering if you can share any checkpoints that you trained on the BookCorpus dataset? In that case, I am able to continue training the model and fine-tune it with my own dataset. Thank you so much!