huggingface / transformers-bloom-inference

Fast Inference Solutions for BLOOM
Apache License 2.0

Question regarding float16 and bfloat16 #87

Closed · allanj closed this issue 1 year ago

allanj commented 1 year ago

https://github.com/huggingface/transformers-bloom-inference/blob/7bea3526d8270b4aeeefecc57d7d7d638e2bbe0e/bloom-inference-scripts/bloom-ds-inference.py#L121-L137

In this code, the dtype passed to with deepspeed.OnDevice(dtype=dtype, device="meta"): is float16, while in

model = AutoModelForCausalLM.from_config(config, torch_dtype=torch.bfloat16)

we use bfloat16.

Why is there this inconsistency?
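For reference, the pattern in question looks roughly like this (a simplified sketch of the linked lines, not the script's exact code; the model name and surrounding setup are placeholders):

```python
import torch
import deepspeed
from transformers import AutoConfig, AutoModelForCausalLM

model_name = "bigscience/bloom"  # placeholder; the script takes this from CLI args
dtype = torch.float16            # the dtype handed to OnDevice

config = AutoConfig.from_pretrained(model_name)

# Parameters are created on the "meta" device, i.e. as shape/dtype metadata only,
# with no real storage allocated behind them.
with deepspeed.OnDevice(dtype=dtype, device="meta"):
    model = AutoModelForCausalLM.from_config(config, torch_dtype=torch.bfloat16)
```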

mayank31398 commented 1 year ago

@allanj It doesn't really matter. When the meta device is specified, only empty tensors are allocated (no real weight data), and both dtypes are 16-bit anyway.
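To illustrate the point (a minimal sketch, not code from the repo): tensors on the meta device carry only shape and dtype metadata with no backing storage, so the 16-bit dtype recorded at this stage affects neither memory use nor numerics. The dtype that matters is the one used when DeepSpeed later materializes and loads the real checkpoint weights, e.g. via deepspeed.init_inference(model, dtype=...).

```python
import torch

# A tensor on the "meta" device has no backing data, only shape and dtype metadata.
w = torch.empty(4096, 4096, dtype=torch.bfloat16, device="meta")
print(w.is_meta)         # True
print(w.element_size())  # 2 bytes per element, same for float16 and bfloat16

# Reading its values (e.g. w.tolist()) would fail: there is no data to read.
# The actual weight dtype is fixed only when the model is materialized later,
# e.g. deepspeed.init_inference(model, dtype=torch.float16, ...).
```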