Open dkfjdkfjdkfj opened 4 years ago
I'm having the same issue and can't seem to resolve it.
Any updates?
I used the transformers language model training to do this (see this page).
How does this help?
So this issue has actually been resolved elsewhere: https://github.com/kaushaltrivedi/fast-bert/issues/179
It comes down to the version of the transformers library being used. If you downgrade to 2.3.0, it works fine.
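For anyone who wants to confirm which version they are actually running before downgrading, here is a minimal sketch using only the Python standard library. The helper names (`installed_version`, `matches_pin`) are my own and not part of fast-bert or transformers; the only thing taken from this thread is the 2.3.0 pin.

```python
from importlib import metadata

# Version reported to work in this thread.
REQUIRED = "2.3.0"


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def matches_pin(installed, required=REQUIRED):
    """True only when the installed version is exactly the pinned one."""
    return installed is not None and installed == required


if __name__ == "__main__":
    found = installed_version("transformers")
    if matches_pin(found):
        print(f"transformers {found} matches the pin")
    else:
        print(f"transformers {found} found; try: pip install transformers=={REQUIRED}")
```

If the check fails, `pip install transformers==2.3.0` in the same environment should apply the downgrade described above.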
LM fine-tuning failed for me with this error: RuntimeError: CUDA error: device-side assert triggered. I don't know how to solve it, even after many attempts.
I would appreciate it if a sample for fine-tuning could be provided.
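Not a fine-tuning sample, but a possible debugging aid for the device-side assert above. In my experience this error often (not always) means an index is out of range on the GPU, for example a token id that is not smaller than the model's vocabulary size, or a label that is not smaller than the number of classes. Whether that is the cause here is an assumption. The sketch below is plain Python; `out_of_range`, `vocab_size`, and `token_ids` are hypothetical names and values, so substitute the ones from your own tokenizer and model.

```python
def out_of_range(ids, size):
    """Return the ids that cannot index a table with `size` rows."""
    return [i for i in ids if not 0 <= i < size]


# Hypothetical values: replace with your model's vocabulary size
# and one batch of token ids from your data loader.
vocab_size = 30522
token_ids = [101, 2023, 30522, 102]

bad = out_of_range(token_ids, vocab_size)
print(bad)  # prints [30522]: valid ids run from 0 to vocab_size - 1
```

Setting the environment variable CUDA_LAUNCH_BLOCKING=1, or running a single batch on CPU, usually produces a more precise traceback than the generic assert message.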