OpenAccess-AI-Collective / axolotl

Go ahead and axolotl questions
https://openaccess-ai-collective.github.io/axolotl/
Apache License 2.0
6.83k stars 749 forks source link

run examples/llama-2/qlora-fsdp.yml failed #1426

Open XiepengLi opened 3 months ago

XiepengLi commented 3 months ago

Please check that this issue hasn't been reported before.

Expected Behavior

should start training

Current behaviour

site-packages/torch/nn/modules/module.py", line 1688, in __getattr__
    raise AttributeError(f"'{type(self).__name__}' object has no attribute '{name}'")
AttributeError: 'LlamaForCausalLM' object has no attribute 'hf_quantizer'

Steps to reproduce

accelerate launch -m axolotl.cli.train examples/llama-2/qlora-fsdp.yml

Config yaml

No response

Possible solution

No response

Which Operating Systems are you using?

Python Version

3.12

axolotl branch-commit

main

Acknowledgements

NanoCode012 commented 3 months ago

Hey, does this issue still occur on latest main with upto date dependency?

XiepengLi commented 3 months ago
ValueError: Must flatten tensors with uniform dtype but got torch.float32 and torch.float16
accupham commented 3 months ago
ValueError: Must flatten tensors with uniform dtype but got torch.float32 and torch.float16

Have same issue too.