-
### System Info
peft=0.3.0
accelerate=0.20.3
transformers=4.30.2
platform=debian.amd64
python=3.7.12
### Who can help?
_No response_
### Information
- [x] The official example scrip…
-
### System Info
- docker image: `pytorch/pytorch:2.1.1-cuda12.1-cudnn8-devel`
- pip list
```
Package Version
----------------------- ------------
accelerate 0.25.0…
```
-
@younesbelkada
(Thanks again for developing these great libraries and responding on GitHub!)
Related issue: https://github.com/huggingface/accelerate/issues/1412
With the bleeding edge `transf…
-
**Feature Description**
The Bloom series of models, including Bloomz_560m, bloomz_1b1, bloomz_3b, and bloomz_7b1_mt, all outperform the MOSS model on average in evaluations, and the latter two are close to ChatGLM.
Many thanks to the team~~
Reference article: https://www.geekpark.net/news/318946
-
### Feature request
I would like to ask whether the prompt tuning feature is available for a 4-bit quantized LLaMA-2 model.
### Motivation
4-bit quantization makes the models easy to use and hardwa…
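A minimal sketch of what this could look like with recent PEFT and bitsandbytes releases; the checkpoint name, quantization settings, and prompt-tuning hyperparameters below are illustrative assumptions, not confirmed by the maintainers:
```python
import torch
from peft import (PromptTuningConfig, PromptTuningInit, TaskType,
                  get_peft_model, prepare_model_for_kbit_training)
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_name = "meta-llama/Llama-2-7b-hf"  # assumed checkpoint

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, quantization_config=bnb_config, device_map="auto"
)
model = prepare_model_for_kbit_training(model)  # cast norms, enable input grads

peft_config = PromptTuningConfig(
    task_type=TaskType.CAUSAL_LM,
    prompt_tuning_init=PromptTuningInit.TEXT,
    prompt_tuning_init_text="Classify the sentiment of this review:",  # assumed task
    num_virtual_tokens=16,
    tokenizer_name_or_path=model_name,
)
model = get_peft_model(model, peft_config)
model.print_trainable_parameters()  # only the virtual-token embeddings train
```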
-
I am trying to fine-tune the bloomz-1b7 model for translation using PEFT LoRA.
The fine-tuned model without LoRA is twice as fast as the one with LoRA.
I use the TextGenerationPipeline to generate …
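One likely explanation (an assumption, since the snippet above is truncated): an unmerged LoRA adapter adds extra matrix multiplications on every forward pass, so generation is slower than with a plain fine-tuned model. A minimal sketch of merging the adapter back into the base weights to recover base-model speed; the adapter path is hypothetical:
```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

base_name = "bigscience/bloomz-1b7"
base = AutoModelForCausalLM.from_pretrained(base_name)
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")  # hypothetical adapter dir
model = model.merge_and_unload()  # fold the low-rank update into the base weights

tokenizer = AutoTokenizer.from_pretrained(base_name)
generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
print(generator("Translate to Spanish: Good morning.", max_new_tokens=40)[0]["generated_text"])
```
After `merge_and_unload()` the result is a plain `transformers` model, so the pipeline should run at the same speed as a fully fine-tuned checkpoint.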
-
### System Info
```Shell
- `Accelerate` version: 0.22.0
- Platform: Linux-4.15.0-142-generic-x86_64-with-glibc2.27
- Python version: 3.10.12
- Numpy version: 1.25.2
- PyTorch version (GPU?): 2.0.…
```
-
Currently, BLOOMZ behaves well only for the first output in few-shot mode, then outputs `` and forgets everything. This is visible in the English-to-Spanish translation example.
We need to use …
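A minimal sketch of one possible workaround, assuming the intended fix is to let generation stop at the EOS token BLOOMZ emits after its first answer and to issue each query as a separate `generate()` call; the model size and prompt are illustrative:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloomz-560m"  # illustrative size
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

few_shot = "English: I like apples.\nSpanish: Me gustan las manzanas.\n"

def translate(sentence: str) -> str:
    prompt = few_shot + f"English: {sentence}\nSpanish:"
    inputs = tokenizer(prompt, return_tensors="pt")
    # generation halts at the first EOS instead of being forced past it
    out = model.generate(**inputs, max_new_tokens=40,
                         eos_token_id=tokenizer.eos_token_id)
    return tokenizer.decode(out[0][inputs.input_ids.shape[1]:],
                            skip_special_tokens=True).strip()

print(translate("Where is the train station?"))
```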
-
Bloom 7B model example:
**# INT8 weight-only + INT8 KV cache**
`python3 hf_bloom_convert.py -i ./bloomz-7b -o ./bloom_7b_kv --calibrate-kv-cache -t float16`
**# Build model with both INT8 weigh…
-
### Describe the bug
When training an RM (reward) model based on bloomz-560m, I observed that only 1 GPU is being used during training;
![image](https://github.com/shibing624/MedicalGPT/assets/26675984/2cd7eb8d-01bd-4d03-9438-4e78bf49e7a2)
### To Reproduce
The training script is as follows:
…