microsoft / BitNet

Official inference framework for 1-bit LLMs
MIT License

How to generate a dummy model for Qwen models #52

Open summerHearts opened 4 weeks ago

summerHearts commented 4 weeks ago

Traceback (most recent call last):
  File "/BitNet/utils/generate-dummy-bitnet-model.py", line 1048, in <module>
    main()
  File "BitNet/utils/generate-dummy-bitnet-model.py", line 971, in main
    model_class = Model.from_model_architecture(hparams["architectures"][0])
  File "BitNet/utils/generate-dummy-bitnet-model.py", line 312, in from_model_architecture
    raise NotImplementedError(f'Architecture {arch!r} not supported!') from None
NotImplementedError: Architecture 'Qwen2ForCausalLM' not supported!

How should I handle this? Thanks.

masembebrian commented 3 weeks ago

Check Compatibility: Ensure that the model architecture you're trying to use is supported in BitNet. Review the model class or BitNet documentation for supported architectures.
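One way to check up front is to read the `architectures` field from the checkpoint's `config.json`, since the traceback shows that `hparams["architectures"][0]` is the value passed to `Model.from_model_architecture`. The following is a minimal sketch, not part of BitNet: the helper names and the `SUPPORTED` set are hypothetical, and the set holds a placeholder value that you would replace with the architectures the script actually handles.

```python
import json
from pathlib import Path

# Placeholder value (assumption): fill this in from the script's own source.
SUPPORTED = {"ExampleForCausalLM"}


def read_architecture(model_dir):
    """Return the first entry of "architectures" in <model_dir>/config.json."""
    config_path = Path(model_dir) / "config.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))
    return config["architectures"][0]


def is_supported(model_dir, supported=SUPPORTED):
    """Report whether the checkpoint's architecture is in the given set."""
    return read_architecture(model_dir) in supported
```

Calling `read_architecture` on the Qwen checkpoint directory should return the same name that appears in the error message, which confirms what the script is rejecting before you run it.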

Update or Modify: If Qwen2ForCausalLM is not supported, you might need to modify the code to use a supported architecture or contribute by adding support if needed.
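As an illustration of what "adding support" could involve, here is a self-contained sketch of a name-to-class registry that raises the same `NotImplementedError` seen in the traceback. Only the `from_model_architecture` name and the error message come from the traceback; the `register` decorator, the `_model_classes` dictionary, and the `Qwen2Model` class are assumptions made for this example and may not match how the script is organized.

```python
class Model:
    # Maps architecture names (as found in config.json) to converter classes.
    # Assumption: the real script may store this mapping differently.
    _model_classes = {}

    @classmethod
    def register(cls, *names):
        """Class decorator that records a converter under one or more names."""
        def decorator(model_class):
            for name in names:
                cls._model_classes[name] = model_class
            return model_class
        return decorator

    @classmethod
    def from_model_architecture(cls, arch):
        """Look up the converter class, failing like the traceback above."""
        try:
            return cls._model_classes[arch]
        except KeyError:
            raise NotImplementedError(f"Architecture {arch!r} not supported!") from None


@Model.register("Qwen2ForCausalLM")
class Qwen2Model(Model):
    # Hypothetical placeholder: a working converter would also have to handle
    # the architecture's tensor names and hyperparameters.
    pass
```

Registering the name only gets past the lookup. Whether the generated dummy model is usable still depends on the converter handling that architecture's tensors correctly.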

masembebrian commented 3 weeks ago

Please assign this issue to me.