-
Hi! I noticed that when querying the same prompt twice with the HF APIs (guanaco-33b), the response is cached and returned immediately (a virtual 400 tps), whereas new requests run at around 20 to 30 tp…
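For what it's worth, the Inference API caches identical inputs by default, which would explain the near-instant second response. A minimal sketch of forcing a fresh generation via the documented `use_cache` option (the model id and token below are placeholders):

```python
import requests

# Placeholder endpoint and token; substitute the actual guanaco-33b repo id.
API_URL = "https://api-inference.huggingface.co/models/timdettmers/guanaco-33b"
headers = {"Authorization": "Bearer hf_xxx"}

payload = {
    "inputs": "The same prompt as before",
    # The API serves repeated inputs from a cache layer; disabling it
    # forces a real generation on every call.
    "options": {"use_cache": False},
}
response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```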
-
```
ERROR: Cannot install -r requirements.txt (line 30) and huggingface-hub==0.13.4 because these package versions have conflicting dependencies.
The conflict is caused by:
The user requested hugg…
```
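One way out, sketched below under the assumption that the pin on `huggingface-hub` is the movable one, is to relax the exact version so pip's resolver can reconcile it with whatever line 30 of requirements.txt requires (the upper bound is illustrative):

```
# requirements.txt (sketch): replace the hard pin with a range
huggingface-hub>=0.13.4,<0.15
```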
-
[TheBloke](https://huggingface.co/TheBloke)'s (Tom Jobbins's) _Wizard-Vicuna-Uncensored_ models are performing very well for their size on the [Open LLM Leaderboard](https://huggingface.co/spaces/Hugg…
-
When attempting to split the model on multiple GPUs, I get the following error:
```
> python test_chatbot.py -d /home/john/Projects/Python/text-models/text-generation-webui/models/TheBloke_guanaco…
```
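In case it helps while debugging, here is a minimal sketch of sharding a model across several GPUs with transformers/accelerate; the path and per-device memory caps are illustrative, and the script above may well use a different loader:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "models/TheBloke_guanaco-33B"  # illustrative local path

# device_map="auto" lets accelerate place layers across all visible GPUs;
# max_memory caps each device so the remaining layers spill onto the next.
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    device_map="auto",
    max_memory={0: "20GiB", 1: "20GiB"},
)
tokenizer = AutoTokenizer.from_pretrained(model_path)
```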
-
When running "guanaco_7B_demo_colab.ipynb" with load_in_4bit=True, I hit a ValueError: "Cannot merge LORA layers when the model is loaded in 8-bit mode".
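For context, peft refuses to merge adapters into a quantized base model. A minimal sketch of the usual workaround, reloading the base in half precision before merging (the repo ids are assumptions, not taken from the notebook):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model WITHOUT load_in_4bit/load_in_8bit so the LoRA
# deltas can be folded into the base weights.
base = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b", torch_dtype=torch.float16
)
model = PeftModel.from_pretrained(base, "timdettmers/guanaco-7b")
merged = model.merge_and_unload()  # plain model with the adapter baked in
```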
-
```
[nano@archlinux Chinese-Vicuna]$ python interaction.py
===================================BUG REPORT===================================
Welcome to bitsandbytes. For bug reports, please submit your …
```
-
Hi, I'm not very familiar with multi-GPU training.
I have a machine with 8 A100s; what should I do to run full-parameter SFT on a llama2-7B model?
How do I use the trl tool?
Thanks.
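A minimal sketch of one answer, using trl's SFTTrainer with plain data parallelism (the dataset, hyperparameters, and output path are placeholders; the signature matches trl ~0.7):

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import SFTTrainer

model_name = "meta-llama/Llama-2-7b-hf"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)
dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    dataset_text_field="text",  # column holding the raw text
    max_seq_length=1024,
    args=TrainingArguments(
        output_dir="llama2-7b-sft",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        learning_rate=2e-5,
        bf16=True,  # A100s support bfloat16
    ),
)
trainer.train()
```

Saved as train_sft.py, this would be launched once per GPU with `accelerate launch --num_processes 8 train_sft.py`; accelerate then handles the data-parallel gradient synchronization across the eight A100s.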
-
I tried to train the Falcon-7b model based on the tutorial from huggingface (https://colab.research.google.com/drive/1BiQiw31DT7-cDp1-0ySXvvhzqomTdI-o?usp=sharing) with my own dataset.
When I loaded …
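If the truncated part concerns loading the custom dataset, a minimal sketch of pointing datasets at local files instead of the hub dataset used in the colab (the path and format are assumptions):

```python
from datasets import load_dataset

# The tutorial loads a hub dataset; for your own data, load local JSON
# lines with a "text" column (path and column name are illustrative).
dataset = load_dataset("json", data_files="my_dataset.jsonl", split="train")
```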
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [X] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
Hello! I was trying out GPU offload on an M1 Max with 32 GB of RAM to see whether it would speed things up. Replies are indeed generated faster (about 3× faster, I think), but they are nonsensical…
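For reference, a minimal sketch of toggling Metal offload through llama-cpp-python, which makes it easy to compare CPU-only and offloaded output (the model path and layer count are illustrative):

```python
from llama_cpp import Llama

# n_gpu_layers moves that many transformer layers onto the GPU;
# set it to 0 to keep everything on the CPU for comparison.
llm = Llama(model_path="models/7B/ggml-model-q4_0.bin", n_gpu_layers=32)
print(llm("Hello", max_tokens=32)["choices"][0]["text"])
```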