-
Below is the error I am getting while loading the TheBloke/llama-2-70b-chat-AWQ model:
OutOfMemoryError: CUDA out of memory. Tried to allocate 112.00 MiB (GPU 0; 22.20 GiB total capacity; 21.30 GiB alrea…
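For context, a rough back-of-the-envelope estimate (assuming ~70B parameters stored as 4-bit AWQ weights, and ignoring activations and the KV cache entirely) shows why the reported 22.20 GiB card fills up:

```python
# Approximate weight memory for a 4-bit quantized 70B-parameter model.
params = 70e9
bytes_per_param = 0.5          # 4-bit weights = half a byte per parameter
weight_gib = params * bytes_per_param / 2**30

print(f"~{weight_gib:.1f} GiB of weights vs. 22.20 GiB total capacity")
```

Even before activations or KV cache, the quantized weights alone exceed the card's capacity, so multi-GPU sharding or CPU offload would be needed for this model on a single 22 GiB GPU.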
-
# Prerequisites
Please answer the following questions for yourself before submitting an issue.
- [X] I am running the latest code. Development is very rapid so there are no tagged versions as of…
-
I get the following error when I run `docker compose up --build` on macos.
I've already tried installing build-essential.
```
langflow % docker compose up --build
[+] Building 51.1s (27/27) …
-
### Describe the bug
I have built a Docker image myself and deployed xinference on k8s. The homepage can be accessed normally. However, loading the model failed, and the error message is 'not found'.…
-
Hi, just wondering: with the new open-source models coming out, like Llama 2 or any other at that level on Hugging Face, why are we still using the API from OpenAI? If I can operate at the same level as GPT-4 using…
-
I'm getting errors with StarCoder models when I try to include any non-trivial number of tokens. I'm getting this with both my raw model (direct .bin) and my quantized model, regardless of version (pre Q4…
-
### Describe the bug
Can't load GPTQ model with ExLlamav2_HF and ExLlamav2. I have tried these two models:
- TheBloke_upstage-llama-30b-instruct-2048-GPTQ_gptq-4bit-128g-actorder_True
- TheBloke_Op…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Current Behavior
Using the example code:
``` python
from transformers import Au…
-
### Describe the bug
Load this model. No matter what settings I use (such as GPU layers), the model runs entirely on the CPU:
https://huggingface.co/dhairya0907/meta-llama-2-7b-chat-hf-gguf-v1
I did tr…
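Since the settings mentioned include GPU layers, here is a hedged sketch of what layer offloading typically looks like with llama-cpp-python (an assumption, since the loader in use is not stated; the model path below is hypothetical):

```python
from llama_cpp import Llama

# Hypothetical local path to the GGUF file from the linked repo.
llm = Llama(
    model_path="./meta-llama-2-7b-chat-hf-gguf-v1.gguf",
    n_gpu_layers=-1,  # -1 asks llama.cpp to offload all layers to the GPU
    verbose=True,     # the load log reports how many layers were offloaded
)
```

If the load log still reports 0 layers offloaded, the installed llama-cpp-python build likely lacks GPU support and would need to be reinstalled with the appropriate backend enabled.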
-
Quite similar to #1127, although this issue is triggered in a different context by a different rule, so it is probably worth a separate issue.
**Describe the bug**
`cabal_package` generates `.so` fil…