-
When the training .py script is launched with deepspeed, loading the checkpoint from baichuan-inc/Baichuan2-7B-Intermediate-Checkpoints via AutoModelForCausalLM.from_pretrained is extremely slow; even the 7B model takes tens of minutes.
But loading it in a Jupyter notebook is fast, finishing in tens of seconds.
Alternatively, using deepspeed to load 7B-bas…
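One likely explanation is that each deepspeed rank is a separate process, so every rank deserializes the full checkpoint simultaneously, while the notebook loads it only once. A hedged sketch of the usual mitigation in transformers (the `low_cpu_mem_usage` and `torch_dtype` flags are standard transformers API; the checkpoint name mirrors the report, adjust to your local path):

```python
import torch
from transformers import AutoModelForCausalLM

# Sketch, not a verified fix: loading in half precision with
# low_cpu_mem_usage=True avoids a redundant fp32 random init and
# materializes the weights only once per process.
model = AutoModelForCausalLM.from_pretrained(
    "baichuan-inc/Baichuan2-7B-Intermediate-Checkpoints",
    torch_dtype=torch.float16,   # skip fp32 materialization of 7B params
    low_cpu_mem_usage=True,      # build on the meta device, then fill weights
    trust_remote_code=True,      # Baichuan2 ships custom modeling code
)
```

If several ranks still hammer the disk at once, staggering the loads (e.g. rank 0 first, then a `torch.distributed.barrier()`) is another common workaround.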
-
In my experiments, when I set the input to
```
{'input_ids': tensor([[43707, 4007, 1833, 10329, 1836, 7293, 3799, 65, 22792, 2169,
        4007, 1833, 2218, 2079, 65, 92413, 11721, 7293, 2835, 1754,
…
```
-
The error is raised when baichuan2flm.py is used to convert the Baichuan2-7B-Chat model:
/fastllm/build/tools/baichuan2flm.py", line 12, in
model.to("cpu")
python3.10/site-packages/accelerate/big_modeling.py", line 415, in wrapper
…
-
When I run the Baichuan2-7B-Base model, I hit an error saying "position_ids" is not defined. I checked the code and could not find any declaration of position_ids.
```
pt-p26l37d2-worker-0 …
```
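One hedged workaround, assuming the model follows the usual Transformer convention, is to derive `position_ids` from the attention mask and pass them to the forward call explicitly. A minimal pure-Python sketch of that computation (the helper name is hypothetical):

```python
def build_position_ids(attention_mask):
    """Derive position ids from a batch of 0/1 attention masks.

    Non-padded tokens get consecutive positions 0, 1, 2, ...;
    padded positions are clamped to 0, mirroring the common
    cumsum(mask) - 1 recipe used with Hugging Face models.
    """
    position_ids = []
    for mask_row in attention_mask:
        running = 0
        row = []
        for m in mask_row:
            running += m
            row.append(max(running - 1, 0))
        position_ids.append(row)
    return position_ids

# Left-padded example: two pad tokens, then three real tokens.
print(build_position_ids([[0, 0, 1, 1, 1]]))  # [[0, 0, 0, 1, 2]]
```

In practice you would build the same tensor with `attention_mask.cumsum(-1) - 1` clamped at 0 and feed it as `model(..., position_ids=position_ids)`.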
-
When I use a local model, the following error occurs:
ValueError: Tokenizer class BaichuanTokenizer does not exist or is not currently imported.
What should I do?
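Assuming the local directory contains the checkpoint's own tokenization_baichuan.py, this error usually means the custom tokenizer class was not allowed to load: BaichuanTokenizer is defined in the checkpoint repo, not in the transformers library, so `trust_remote_code=True` is required. A sketch (the local path is hypothetical):

```python
from transformers import AutoTokenizer

# BaichuanTokenizer lives in the checkpoint's own tokenization_baichuan.py,
# so AutoTokenizer must be permitted to import that custom code.
tokenizer = AutoTokenizer.from_pretrained(
    "/path/to/local/Baichuan2-7B-Chat",  # hypothetical local path
    use_fast=False,
    trust_remote_code=True,
)
```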
-
model_path = "/home/test/models/LLM/baichuan2-7b/pytorch"
# Load and optimize the INT4 model with IPEX
low_bit = "sym_int4"
model_int4 = BigdlForCausalLM.from_pretrained(model_path, load_in_low_b…
-
The model is Baichuan2-7B-Chat-4bits, fine-tuned with LoRA on two RTX 3090s; without LoRA it runs out of memory (OOM).
The training data is the official belle_chat_ramdon_10k.json.
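The OOM difference is mostly optimizer state: full fine-tuning keeps gradients and Adam moments for all ~7B parameters, while LoRA trains only small low-rank adapters. A rough back-of-the-envelope sketch (the parameter count, rank, and layer shapes are illustrative assumptions, not measured values):

```python
# Rough arithmetic sketch; 7B size, rank, and shapes are assumptions.
base_params = 7e9                       # full model parameter count
bytes_per_trained_param = 4 + 4 + 4 + 4  # fp32 weight + grad + two Adam moments

full_ft_state_gb = base_params * bytes_per_trained_param / 1e9

# LoRA: rank-8 adapters on, say, 32 layers x 4 projections of 4096x4096.
r, d, matrices = 8, 4096, 32 * 4
lora_params = matrices * r * (d + d)     # each adapter adds A (r x d) + B (d x r)
lora_state_gb = lora_params * bytes_per_trained_param / 1e9

print(f"full fine-tune trainable state ≈ {full_ft_state_gb:.0f} GB")
print(f"LoRA trainable params ≈ {lora_params / 1e6:.1f}M, state ≈ {lora_state_gb:.2f} GB")
```

Even split across two 24 GB 3090s, the full fine-tune state cannot fit, while the LoRA adapter state is negligible next to the frozen 4-bit weights.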
-
**Routine checks**
[//]: # (Remove the space inside the brackets and fill in x)
+ [x] I have confirmed that there is no similar existing issue
+ [x] I have confirmed that I have upgraded to the latest version
+ [x] I have read the project README in full, especially the FAQ section
+ [x] I understand and am willing to follow up on this issue, assisting with testing and providing feedback
+ [x] I understand and accept the above, and I understand that the maintainers have limited time; **issues that do not follow the rules may…
-
I am trying to deploy a Baichuan2-7B model on a machine with 2 Tesla V100 GPUs. Unfortunately, each V100 has only 16 GB of memory.
I have applied INT8 weight-only quantization, so the size of the engine I…
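A quick memory estimate shows why INT8 weight-only quantization is attractive here. This sketch counts weights only (the 7B parameter count is an assumption; quantization scales, activations, and KV cache are ignored):

```python
# Back-of-the-envelope weight memory check; 7B parameter count assumed.
params = 7e9
fp16_weights_gb = params * 2 / 1e9   # 2 bytes per param in fp16
int8_weights_gb = params * 1 / 1e9   # 1 byte per param after weight-only INT8

# With tensor parallelism across the 2 V100s, weights split roughly in half:
per_gpu_int8_gb = int8_weights_gb / 2
print(fp16_weights_gb, int8_weights_gb, per_gpu_int8_gb)  # 14.0 7.0 3.5
```

So fp16 weights alone (~14 GB) nearly exhaust a single 16 GB V100, while INT8 weights split two ways leave most of each card free for activations and KV cache.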
-
I just downloaded the project and ran the officially provided script:
hostfile=""
deepspeed --hostfile=$hostfile fine-tune.py \
--report_to "none" \
--data_path "data/belle_chat_ramdon_10k.json" \
--model_name_or_path "baichuan-inc/Ba…