-
### System Info
```shell
optimum 1.21.4
optimum-habana 1.14.0.dev0
transformers 4.45.2
+------------------------------------------------------------------…
-
Hi @PanQiWei
I'd be most grateful if you could give me a bit of help.
I have been trying to quantize BLOOMZ 175B but haven't been able to get it working. BLOOMZ has 70 layers, and is a total of 360GB.…
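For rough sizing before attempting the quantization, the weight footprint can be estimated from the parameter count alone. This is a back-of-envelope sketch: the ~176B parameter count is BLOOM's published size, the per-weight bit costs are idealized, and real formats add scales, zero points, and activation memory on top.

```python
# Back-of-envelope weight-storage estimate for a ~176B-parameter model.
# Idealized per-weight costs; real quantized formats carry extra metadata.
PARAMS = 176e9  # BLOOM/BLOOMZ parameter count

def est_gb(bits_per_weight: float) -> float:
    """Approximate weight storage in GiB at a given bit width."""
    return PARAMS * bits_per_weight / 8 / 2**30

print(f"fp16: {est_gb(16):6.1f} GiB")  # ~328 GiB, matching the ~360GB on-disk size
print(f"int8: {est_gb(8):6.1f} GiB")
print(f"int4: {est_gb(4):6.1f} GiB")   # ~82 GiB
```

This is why even a 4-bit quant of this model still needs on the order of 80-90 GiB just for weights, before any working memory.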
-
Thank you again for your excellent work. I have trained an mT0 model on my own dataset, and it performs well. Now I am attempting to train a bloomz model, but I'm encountering an issue where the trai…
-
# Model Parameter Support Hub
Hello everyone, the PaddleNLP team has compiled detailed parameter information for each model here for easy reference.
## Model Parameters
### Base Models
| Model | 0.5B | 1~2B | 3~4B | 6~8B | 13~14B | 30~32B | 50~60B | 65~72B | 110B | >110B |
|:---------:|:--…
-
The article at https://huggingface.co/blog/generative-ai-models-on-intel-cpu mentions that smoothquant was applied on bloomz 7b1 model also. But in https://huggingface.co/mit-han-lab/smoothquant-scale…
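Independent of which scales that repository ships, the core SmoothQuant transformation is simple to state: outlier activation channels are divided by a per-channel scale that is multiplied into the corresponding weight rows, leaving the layer's output unchanged. The sketch below shows the idea with synthetic data (the `alpha = 0.5` migration strength is the paper's default; the array shapes are illustrative):

```python
import numpy as np

# SmoothQuant's per-channel smoothing (a sketch, not the repo's code):
# (X / s) @ (diag(s) @ W) == X @ W, but the scaled activations X / s
# have their outlier channels flattened, making them easier to quantize.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
X[:, 3] *= 50.0                      # channel 3 carries activation outliers
W = rng.normal(size=(8, 6))

alpha = 0.5                          # migration strength (paper default)
s = np.abs(X).max(axis=0) ** alpha / np.abs(W).max(axis=1) ** (1 - alpha)

X_s = X / s                          # smoothed activations
W_s = s[:, None] * W                 # scale folded into the weights

assert np.allclose(X_s @ W_s, X @ W) # mathematically equivalent layer
print(np.abs(X).max(axis=0).round(1))   # outlier channel stands out
print(np.abs(X_s).max(axis=0).round(1)) # per-channel maxima are balanced
```

The published scale files are just the `s` vectors computed from calibration data for each linear layer, so the question of which checkpoints they cover is about which calibration runs were released.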
-
Operating system: Ubuntu 22.04.2, Python 3.10.6, CTranslate2 3.16.
When exporting the bigscience/bloomz model using:
ct2-transformers-converter --force --model bigscience/bloomz --output_dir …
-
Hi, thank you very much for this great library! I am really excited about it!
I tried to fine-tune bloomz 7B with 4-bit LoRA on Alpaca by running: python qlora.py --model_name_path my_bloomz_path.…
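Whatever the script flags end up being, the LoRA update being trained here reduces to a small amount of linear algebra, shown below without any library (the dimensions and `alpha`/`r` values are illustrative; `qlora.py` additionally quantizes the frozen base weights to 4 bits):

```python
import numpy as np

# The LoRA update itself: the frozen weight W is augmented with a
# low-rank delta B @ A, scaled by alpha / r. Only A and B are trained.
rng = np.random.default_rng(0)
d_in, d_out, r, alpha = 64, 64, 8, 16   # illustrative sizes

W = rng.normal(size=(d_out, d_in))      # frozen base weight
A = rng.normal(size=(r, d_in)) * 0.01   # trainable, small random init
B = np.zeros((d_out, r))                # trainable, zero init

def lora_forward(x):
    return x @ W.T + (alpha / r) * (x @ A.T @ B.T)

x = rng.normal(size=(1, d_in))
# With B initialized to zero, LoRA starts as an exact no-op:
assert np.allclose(lora_forward(x), x @ W.T)
```

The zero init on `B` is deliberate: training starts from the base model's behavior, and only the gradient updates to `A` and `B` move it away.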
-
### 🚀 The feature, motivation and pitch
I have fine-tuned multiple LoRA models to act as expert layers within an MoE architecture. How can I leverage vLLM to accelerate this? Currently, vLLM acce…
-
I have a q4_0 quant of bloomz-176B (created with bloomz.cpp). As with bloomz.cpp, inference on this model is broken.
For example, this command:
llm infer -a bloom -m bloomz_q4_0.bin -p "A short stor…
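For reference when debugging output like this, q4_0-style quantization is symmetric block quantization: weights are grouped into blocks of 32 sharing one scale. The sketch below shows the scheme's round-trip error; the exact ggml on-disk layout (nibble packing, fp16 scales, the precise scale formula) is simplified here and should be checked against the ggml source:

```python
import numpy as np

# Symmetric 4-bit block quantization in the spirit of ggml's q4_0
# (a simplified sketch; the real format packs two 4-bit values per byte).
BLOCK = 32  # weights per block, one shared scale each

def q4_quantize(x):
    x = x.reshape(-1, BLOCK)
    d = np.abs(x).max(axis=1, keepdims=True) / 7.0  # per-block scale
    d[d == 0] = 1.0
    q = np.clip(np.round(x / d), -8, 7).astype(np.int8)
    return q, d

def q4_dequantize(q, d):
    return (q * d).reshape(-1)

rng = np.random.default_rng(0)
w = rng.normal(size=(256,)).astype(np.float32)
q, d = q4_quantize(w)
w_hat = q4_dequantize(q, d)
print("max abs error:", np.abs(w - w_hat).max())  # bounded by half a scale step
```

Garbled generations like the one above usually come from a mismatch between how the file was quantized and how the loader dequantizes it (block size, scale type, or nibble order), not from the 4-bit rounding error itself.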
-
Hello!
Thanks a lot for your work!
I want to fine-tune bloomz-mt with your Megatron-DeepSpeed, but I cannot find a universal-format checkpoint of bloomz-mt or bloomz. I only found the bloom universal c…