pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch
https://pytorch.org/executorch/

Exporting Llama3's tokenizer #3555

Open vifi2021 opened 4 months ago

vifi2021 commented 4 months ago

Hello,

I am following https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md#option-c-download-and-export-llama3-8b-model to get Llama3-8B-Instruct running on an S21 Ultra, but it seems that examples.models.llama2.tokenizer.tokenizer cannot process Llama3's tokenizer.model.

Has anyone run into this issue?
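One likely cause, assuming the published tokenizer formats: Llama 2's tokenizer.model is a binary SentencePiece protobuf, while Llama 3 ships a tiktoken-style BPE ranks file (one base64-encoded token plus its integer rank per line), so a SentencePiece-based converter can't parse it. A rough sketch to tell the two apart (the function name is hypothetical, not part of ExecuTorch):

```python
import base64


def looks_like_tiktoken_ranks(path):
    """Heuristic check: Llama 3's tokenizer.model is a text file whose lines
    look like '<base64 token> <rank>'; Llama 2's is a binary SentencePiece
    protobuf, which will fail this check."""
    try:
        with open(path, "rb") as f:
            fields = f.readline().split()
        if len(fields) != 2:
            return False
        base64.b64decode(fields[0], validate=True)  # token bytes
        int(fields[1])                              # integer rank
        return True
    except ValueError:  # bad base64 or non-integer rank
        return False
```

If this returns True for your tokenizer.model, it is the Llama 3 format and the llama2 converter is the wrong tool for it.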

iseeyuan commented 4 months ago

@larryliu0820 , could you help look at this?

larryliu0820 commented 4 months ago

Hello, you don't need to process tokenizer.model; you can feed it directly into this step: https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md#step-4-run-on-your-computer-to-validate

vifi2021 commented 4 months ago

Thanks for the fast response, but I am not running it on my computer; I am running it on an Android phone. In https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md#step-5-run-benchmark-on-android-phone, step 2.2 requires pushing the model and tokenizer to the phone:

adb push <model.pte> /data/local/tmp/llama/
adb push <tokenizer.bin> /data/local/tmp/llama/

Looks like we still need tokenizer.bin?

jonatananselmo commented 4 months ago

Just do:

adb push <tokenizer.model> /data/local/tmp/llama/

and use that path wherever you need to specify the tokenizer.

JacobSzwejbka commented 4 months ago

@kirklandsign Since there's an Android question, can you take a look?

mergennachin commented 4 months ago

@vifi2021

In Step 4, we have this "For Llama3, you can pass the original tokenizer.model (without converting to .bin file)."

It also applies to subsequent steps.
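Putting the thread's answer together, the on-device steps would look roughly like the sketch below, with the original Llama 3 tokenizer.model pushed in place of a converted tokenizer.bin. The file names and runner flags here are illustrative assumptions based on the example runner's readme, not verbatim from this thread:

```shell
# Push the exported model and the *original* Llama 3 tokenizer.model
# (no .bin conversion step).
adb push llama3.pte /data/local/tmp/llama/
adb push tokenizer.model /data/local/tmp/llama/

# Run the example runner on-device, pointing the tokenizer path at
# tokenizer.model directly.
adb shell "/data/local/tmp/llama/llama_main \
    --model_path /data/local/tmp/llama/llama3.pte \
    --tokenizer_path /data/local/tmp/llama/tokenizer.model \
    --prompt 'Hello'"
```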

@kirklandsign

In our readme file, for Step 5, steps 2.2 and 2.3, let's make that clear.