Closed sebastian-weisshaar closed 1 year ago
If debug==True a low footprint version of LlamaForCausalLM gets initialised without the trained weights. Memory footprint is 10x smaller.
debug==True
If
debug==True
a low footprint version of LlamaForCausalLM gets initialised without the trained weights. Memory footprint is 10x smaller.