Open dbanka opened 9 months ago
I am also interested in doing that, I have tried changing the code using LongQLoRA which uses part most of the code of this repository. However, I am getting a strange issue of illegal memory access in cuda. Not really sure why is that the case.
Is it possible to increase the context length of phi-2 using LongLora? If yes, what changes need to be done to support it?