Closed mazhai closed 11 months ago
错误信息里说了:
ValueError: The combination of base model (size: 32000) and tokenizer (size: 49954) is not a valid configuration. Please check our project wiki for further information.
Valid configurations (base model / tokenizer):
- Continue pre-training original LLaMA: 32000 / 32000
- Pre-training Chinese LLaMA based on original LLaMA: 32000 / 49953
- Continue pre-training Chinese LLaMA: 49953 / 49953
- Continue pre-training Chinese Alpaca: 49954 / 49954
请换用chinese-llama-tokenizer
@airaria 谢谢,使用错tokenizer了
提交前必须检查以下项目
问题类型
模型训练与精调
基础模型
LLaMA-7B
操作系统
Linux
详细描述问题
run_pt.sh
chinese_alpaca_plus_lora_7b adapter_config.json
依赖情况(代码类问题务必提供)
peft 0.3.0 torch 2.0.1+cu118 torchaudio 2.0.2+cu118 torchvision 0.15.2+cu118 transformers 4.30.0
系统是: Ubuntu 22.04.2 LTS
运行日志或截图