convert huggingface model to megatron. "Only llama v2 available using huggingface"

epfLLM / Megatron-LLM

distributed trainer for LLMs

Other

529 stars 76 forks source link

convert huggingface model to megatron. "Only llama v2 available using huggingface" #49

Closed uygnef closed 1 year ago

uygnef commented 1 year ago

Hi @AleHD

When I run this script:

python weights2megatron/weights2megatron.py llama --size=7

--out=/path/to/megatron/weights/ --cache-dir=/path/to/llama-7b/

It says:

assert version == 2, "Only llama v2 available using huggingface" Does this script not support Llama v1?

AleHD commented 1 year ago

At the moment you need to specify the cache directory to the weights given officialy by meta. We are working on using decapoda-research llama implementation to retrieve the weights automatically with huggingface.