prd-tuong-nguyen closed this 3 weeks ago
Will attempt to repro and see what is going on
Looks like this could be an issue with auto-detecting RoPE scaling.
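As a rough illustration of how a RoPE scaling auto-detection step could trip on a new model, here is a minimal sketch. It assumes the model's config carries a `rope_scaling` entry with a `type` field, as Hugging Face configs commonly do. The function name, the supported set, and the sample configs are hypothetical and not taken from LoRAX.

```python
import json

# Hypothetical set of scaling types a loader knows how to handle.
# This is an assumption for illustration, not LoRAX's actual list.
SUPPORTED_ROPE_TYPES = {"linear", "dynamic"}


def check_rope_scaling(config, supported=SUPPORTED_ROPE_TYPES):
    """Return (ok, message) describing the config's rope_scaling entry."""
    rope_scaling = config.get("rope_scaling")
    if rope_scaling is None:
        # No scaling configured, so plain RoPE would be used.
        return True, "no rope_scaling entry; default RoPE applies"
    scaling_type = rope_scaling.get("type")
    if scaling_type in supported:
        return True, f"rope_scaling type '{scaling_type}' is supported"
    return False, f"rope_scaling type '{scaling_type}' is not supported"


# Illustrative configs only; these values are made up.
plain_config = json.loads('{"rope_scaling": null}')
linear_config = json.loads('{"rope_scaling": {"type": "linear", "factor": 2.0}}')
unknown_config = json.loads('{"rope_scaling": {"type": "some-new-type"}}')

for name, cfg in [("plain", plain_config), ("linear", linear_config), ("unknown", unknown_config)]:
    ok, message = check_rope_scaling(cfg)
    print(f"{name}: ok={ok}, {message}")
```

A check like this fails loudly on an unrecognized scaling type instead of silently falling back, which makes this class of startup error easier to diagnose.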
Thanks guys, I hope this gets fixed soon.
@tgaddair hello bro, do you have any update on this?
@tgaddair Hey bro, LoRAX seems to be able to start with microsoft/Phi-3-mini-4k-instruct,
but it also gives these warnings (I think these warnings are really important):
2024-06-05T05:20:37.878887Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|endoftext|>' was expected to have ID '32000' but was given ID 'None'
2024-06-05T05:20:37.878921Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|assistant|>' was expected to have ID '32001' but was given ID 'None'
2024-06-05T05:20:37.878925Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder1|>' was expected to have ID '32002' but was given ID 'None'
2024-06-05T05:20:37.878927Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder2|>' was expected to have ID '32003' but was given ID 'None'
2024-06-05T05:20:37.878930Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder3|>' was expected to have ID '32004' but was given ID 'None'
2024-06-05T05:20:37.878933Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder4|>' was expected to have ID '32005' but was given ID 'None'
2024-06-05T05:20:37.878943Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|system|>' was expected to have ID '32006' but was given ID 'None'
2024-06-05T05:20:37.878946Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|end|>' was expected to have ID '32007' but was given ID 'None'
2024-06-05T05:20:37.878948Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder5|>' was expected to have ID '32008' but was given ID 'None'
2024-06-05T05:20:37.878951Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|placeholder6|>' was expected to have ID '32009' but was given ID 'None'
2024-06-05T05:20:37.878954Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|user|>' was expected to have ID '32010' but was given ID 'None'
2024-06-05T05:20:37.880592Z WARN lorax_router: router/src/main.rs:447: `--revision` is not set
2024-06-05T05:20:37.880608Z WARN lorax_router: router/src/main.rs:448: We strongly advise to set it to a known supported commit.
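To make the warnings above easier to scan, here is a small sketch that pulls the token name and expected ID out of such log lines. The regular expression is an assumption based only on the message format quoted above, and `parse_token_warnings` is a hypothetical helper name.

```python
import re

# Pattern assumed from the tokenizer warning lines quoted in this thread.
WARNING_RE = re.compile(
    r"Token '(?P<token>[^']+)' was expected to have ID '(?P<expected>\d+)' "
    r"but was given ID '(?P<actual>None|\d+)'"
)


def parse_token_warnings(log_text):
    """Return (token, expected_id, actual_id) tuples from tokenizer warnings.

    actual_id is None when the log reports the ID as 'None'.
    Lines that do not match the pattern are skipped.
    """
    rows = []
    for line in log_text.splitlines():
        match = WARNING_RE.search(line)
        if match is None:
            continue
        actual = match.group("actual")
        rows.append((
            match.group("token"),
            int(match.group("expected")),
            None if actual == "None" else int(actual),
        ))
    return rows


# Three lines copied from the log above; the last one is not a tokenizer warning.
sample_log = """\
2024-06-05T05:20:37.878887Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|endoftext|>' was expected to have ID '32000' but was given ID 'None'
2024-06-05T05:20:37.878946Z WARN tokenizers::tokenizer::serialization: /usr/local/cargo/registry/src/index.crates.io-6f17d22bba15001f/tokenizers-0.19.1/src/tokenizer/serialization.rs:159: Warning: Token '<|end|>' was expected to have ID '32007' but was given ID 'None'
2024-06-05T05:20:37.880592Z WARN lorax_router: router/src/main.rs:447: `--revision` is not set
"""

for token, expected_id, actual_id in parse_token_warnings(sample_log):
    print(f"{token}: expected {expected_id}, got {actual_id}")
```

Run over the full log, this would list every special token that the tokenizer expected at a fixed ID but did not find.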
Sorry @prd-tuong-nguyen for the delay. I'll try and take a look at this today!
Hey @prd-tuong-nguyen, I put together #499, which addressed the issue on my side. There should be a new main
image for you to test out shortly!
@tgaddair cool bro, I will check the latest image
@tgaddair The model seems to have started successfully, but I still see the warnings mentioned above.
When I run it with another framework, it shows something like:
Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained.
System Info
I get this error when starting LoRAX with the model
microsoft/Phi-3-mini-128k-instruct
Information
Tasks
Reproduction
Run LoRAX with Docker, passing the base model as
microsoft/Phi-3-mini-128k-instruct
Expected behavior
The server starts successfully with the Phi-3 model.