Closed sebastian-weisshaar closed 1 year ago
Difference in GPU memory usage between 4bit and 8bit. Note that 4bit implementation also finishes quicker than 8bit.
4bit WandB: https://wandb.ai/jina-ai/jerboa/runs/92zjw2hv?workspace=user-jinaai 8bit WandB: https://wandb.ai/jina-ai/jerboa/runs/0dp6yn49?workspace=user-jinaai
Implements 4bit aspect from QLora paper for training Updated dependencies for poetry DOES NOT IMPLEMENT 4bit for generation