-
**Describe the bug**
I was looking for the BLOOMZ models on JumpStart and noticed that the task name for them is, oddly, `textgeneration1`. Is that on purpose?
**To reproduce**
Code snippet:
```
…
```
-
Hello,
I have converted the BLOOMZ model successfully, but inference doesn't work.
```
./main -m ./models/ggml-model-bloomz-f16.bin -t 8 -n 128
main: seed = 1679167152
bloom_model_load: load…
```
-
Hello,
I have successfully converted the BLOOMZ 176B model to fp16.
However, quantization doesn't work and throws an error:
```
./quantize ./models/ggml-model-bloomz-f16.bin ./models/ggml-m…
```
-
Hello, I'm trying to use BLOOMZ for reward model training, and I get this error:
```
Traceback (most recent call last):
File "/users5/xydu/ChatGPT/DeepSpeed-Chat/training/step2_reward_model_finetuning/tr…
```
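For context, the usual way to put a reward-model head on a BLOOM-style backbone is `transformers`' `BloomForSequenceClassification` with a single regression label. The tiny config below is an assumption chosen so the sketch runs without downloading a checkpoint; a real run would load a pretrained BLOOMZ checkpoint instead.

```python
import torch
from transformers import BloomConfig, BloomForSequenceClassification

# Tiny, randomly initialized config so the sketch runs offline; in practice
# you would call BloomForSequenceClassification.from_pretrained(<bloomz
# checkpoint>, num_labels=1) -- the checkpoint name is deliberately omitted.
config = BloomConfig(vocab_size=256, hidden_size=64, n_layer=2, n_head=4,
                     num_labels=1)
model = BloomForSequenceClassification(config)

input_ids = torch.randint(0, config.vocab_size, (1, 10))
with torch.no_grad():
    out = model(input_ids)
print(tuple(out.logits.shape))  # (1, 1): one scalar reward per sequence
```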
-
Hi,
Are there any configurations for models other than LLaMA? When I first tried to run the finetune script for the `bloomz-7b1` model, I got this error:
`ValueError: Target modules ['q_proj', 'v_pro…
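The likely cause of this error (assuming a LoRA finetune script written for LLaMA) is that LLaMA's attention projections are named `q_proj`/`v_proj`, while BLOOM fuses them into a single `query_key_value` linear layer, so those target modules don't exist in a BLOOMZ model. A quick way to verify the layer names with a tiny randomly initialized model:

```python
from transformers import BloomConfig, BloomModel

# Tiny config, just to inspect the module names of the Bloom architecture.
model = BloomModel(BloomConfig(vocab_size=64, hidden_size=32, n_layer=1, n_head=2))

linear_names = {name.split(".")[-1] for name, module in model.named_modules()
                if module.__class__.__name__ == "Linear"}
print(sorted(linear_names))
# A LoRA config for BLOOM would therefore use target_modules=["query_key_value"]
# (an assumption based on these layer names, not taken from the original script).
```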
-
I'm facing the above error in both stage 1 and stage 2 when using BLOOMZ 3B and 560M.
I tried adding `model.to(device)` and `model.to('cuda')` to main.py, but neither worked.
The error only appears w…
-
CUDA_VISIBLE_DEVICES=0 python /home/ubuntu/TextToSQL/DB-GPT-Hub/src/dbgpt-hub-sql/dbgpt_hub_sql/train/sft_train.py\
--model_name_or_path /home/ubuntu/.cache/modelscope/hub/qwen/Qwen2___5-Coder-7B…
-
BLOOMZ and mT0 are related models, and mT0-13B performs better than BLOOMZ-176B in some cases.
After GPTQ 4-bit quantization, mT0-13B could be a killer model for ordinary user devices.
Hope the…
-
### 🐛 Describe the bug
INFO colossalai - colossalai - INFO: Tokenizing inputs... This may take some time...
Episode [1/100]: 0%| …
-
### Branch/Tag/Commit
main
### Docker Image Version
nvcr.io/nvidia/pytorch:22.09-py3
### GPU name
V100-32G
### CUDA Driver
11.0
### Reproduced Steps
Step 1: pull images w…