Open ArthurMinovsky opened 1 year ago
LLaMA2 13B can train in quantization 4 bit and sequence length 1024