johnsmith0031 / alpaca_lora_4bit


Fix NaN or Inf after initializing Vicuna models (due to lack of bias weights) #125

Closed · alex4321 closed 1 year ago

alex4321 commented 1 year ago

As I mentioned in https://github.com/johnsmith0031/alpaca_lora_4bit/issues/124, I used this library to load Vicuna models, and at some point I started getting Inf/NaN results during inference with these two models:

After digging into the issue, I realized the model has no bias weights, so the default initializer (as you correctly mentioned, @johnsmith0031) was not overridden when the weights were loaded.

So I replaced the default initializer with zeros.
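
For reference, a minimal sketch of the idea (the class and attribute names here are hypothetical, not the library's actual code): if a layer allocates its bias as uninitialized memory and the checkpoint has no bias tensor to overwrite it, the garbage values survive into inference; zero-filling the bias instead is a safe no-op default.

```python
import torch
import torch.nn as nn

class QuantLinearSketch(nn.Module):
    """Hypothetical sketch of a quantized linear layer's bias handling."""

    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        # Before the fix: torch.empty() leaves whatever garbage happens to be
        # in memory, and it stays there if the checkpoint has no bias weights
        # to load over it.
        # self.bias = nn.Parameter(torch.empty(out_features))

        # After the fix: zeros act as "no bias" and are numerically harmless.
        self.bias = nn.Parameter(torch.zeros(out_features))
```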

(P.S. It still remains a mystery to me why the behaviour differs between platforms. The uninitialized bias should be filled with whatever garbage happens to be in memory, and I'd expect that garbage to have a similar chance of overflowing to inf after converting from float32 to float16 on any platform, but so be it.)
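
The overflow itself is easy to reproduce: float16 tops out around 65504, so any leftover float32 value beyond that becomes inf when cast down (a generic demonstration, not code from this repo):

```python
import torch

# float16 can only represent magnitudes up to ~65504;
# larger float32 values overflow to inf on conversion.
garbage = torch.tensor([1.0, 7e4, 1e30], dtype=torch.float32)
print(garbage.half())  # tensor([1., inf, inf], dtype=torch.float16)
```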

alex4321 commented 1 year ago

UPD. Wrong branch