johnsmith0031 / alpaca_lora_4bit

MIT License
533 stars 84 forks

Zero initializer for biases #126

Closed alex4321 closed 1 year ago

alex4321 commented 1 year ago

As I mentioned in https://github.com/johnsmith0031/alpaca_lora_4bit/issues/124, I used this library to load Vicuna models, and at some point I started getting Inf/NaN results during inference with two of these models.

After diving into the issue I realized the model has no bias weights, so the default initializer (as you correctly mentioned, @johnsmith0031) was not overridden by the loaded weights.

So I replaced the default initializer with zeros.
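The change can be sketched roughly like this. Note that `ZeroBiasLinear` is a hypothetical stand-in, not the repo's actual quantized-linear class; the point is only the switch from an uninitialized bias buffer to a zero-initialized one:

```python
import torch
import torch.nn as nn


class ZeroBiasLinear(nn.Module):
    """Hypothetical sketch of the fix: the bias parameter is allocated
    with torch.zeros instead of torch.empty, so a checkpoint that
    carries no bias tensor still behaves like a bias-free layer."""

    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.weight = nn.Parameter(torch.zeros(out_features, in_features))
        # Before the fix: torch.empty(out_features) -- uninitialized
        # memory that stays as garbage when the checkpoint never
        # overwrites it with real bias weights.
        self.bias = nn.Parameter(torch.zeros(out_features))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.nn.functional.linear(x, self.weight, self.bias)


layer = ZeroBiasLinear(4, 2)
out = layer(torch.ones(1, 4))
print(torch.isfinite(out).all().item())  # True: no garbage bias, no Inf/NaN
```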

(P.S. It still remains a mystery to me why the behavior differs so much between platforms: the bias should be initialized with whatever garbage happens to be in memory, and I expected that garbage to have a similar chance of going Inf after conversion from float32 to float16 on every platform, but so be it.)
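The overflow mechanism itself is easy to demonstrate in isolation. This is a standalone NumPy illustration (the bit pattern is arbitrary, chosen only to mimic plausible memory garbage), not code from the library: any float32 value above float16's maximum of 65504 collapses to Inf on conversion, while a zero-initialized bias survives the cast unchanged.

```python
import numpy as np

# Arbitrary bit pattern standing in for uninitialized memory:
# 0x7F000000 reinterpreted as float32 is 2**127, about 1.7e38.
garbage_f32 = np.array([0x7F000000], dtype=np.uint32).view(np.float32)
as_f16 = garbage_f32.astype(np.float16)
print(np.isinf(as_f16[0]))  # True: float16 tops out at 65504

# A zero-initialized bias is unaffected by the same conversion.
zero_bias = np.zeros(4, dtype=np.float32).astype(np.float16)
print(np.all(zero_bias == 0))  # True
```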

johnsmith0031 commented 1 year ago

Thanks for debugging and fixing!