Closed druskacik closed 8 months ago
Hi, did you see https://github.com/JonasGeiping/cramming/issues/34#issuecomment-1745372529? I think this might be a related problem.
Okay, that solved the issue. I also had to change the head_dim
value because of the matrix multiplication error. Thanks.
Ok, I'm glad!
For anyone reading this in the future, I'm also definitely accepting PR's to fix this problem, I just haven't had time to do it myself.
I'd like to fine-tune this model for token classification task. As suggested in #35 , instantiating from
AutoModelForTokenClassification
should work. However, I see an error.Versions: