Open harpomaxx opened 1 year ago
Honestly, it comes down to updating to transformers 4.30, adding one more dependency package, and about 8 changes in the code, if I recall correctly. Plus, it works with multiple GPUs.
Unfortunately, I lost the changes in my running copy when I updated it for the API changes, but I think most of the work is already done in my fork.
Contributions are welcome
@merrymercy is this issue still open for contribution?
@02shanks absolutely!!!!
@surak as this is my first code contribution, could you please guide me through the process? Where should I start?
Well, the usual:
Nothing special, really!
@surak @merrymercy I have just created the PR. Can you please review it?
Loading a Vicuna 13B model with 4-bit quantization is possible through the transformers library via the load_in_4bit option. How difficult would it be for FastChat to support it?
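For context, here is a minimal sketch of what 4-bit loading looks like on the transformers side, outside of FastChat. It is not FastChat's implementation and not the code from the fork mentioned in this thread. It assumes transformers 4.30 or newer with the bitsandbytes and accelerate packages installed and a CUDA GPU available. The model id `lmsys/vicuna-13b-v1.3`, the helper names `quantization_settings` and `load_vicuna_4bit`, and the NF4/double-quantization settings are illustrative assumptions, not something stated in the issue.

```python
def quantization_settings():
    """Return 4-bit settings as a plain dict (hypothetical helper).

    Kept free of heavy imports so the settings can be inspected
    without torch or transformers installed.
    """
    return {
        "load_in_4bit": True,               # the option named in the question
        "bnb_4bit_quant_type": "nf4",       # assumption: NF4 rather than plain FP4
        "bnb_4bit_use_double_quant": True,  # assumption: also quantize the quantization constants
    }


def load_vicuna_4bit(model_path="lmsys/vicuna-13b-v1.3"):
    """Load a causal LM in 4-bit precision (hypothetical helper).

    model_path is an assumed Hugging Face model id; substitute your own.
    """
    # Imported lazily: these need torch, transformers, bitsandbytes and accelerate.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

    config = BitsAndBytesConfig(
        bnb_4bit_compute_dtype=torch.float16,  # run matmuls in fp16 on top of 4-bit weights
        **quantization_settings(),
    )
    tokenizer = AutoTokenizer.from_pretrained(model_path, use_fast=False)
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        quantization_config=config,
        device_map="auto",  # lets accelerate place layers across the available GPUs
    )
    return model, tokenizer
```

Calling `load_vicuna_4bit()` downloads the weights on first use, so it is only worth running on a machine with enough disk space and GPU memory. The `device_map="auto"` argument is what would make a setup like this span several GPUs.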