-
I'm developing an AI assistant for fiction writers. As the OpenAI API gets pretty expensive with all the inference tricks needed, I'm looking for a good local alternative for most of the inference, saving GPT-4 ju…
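A common way to structure this is a two-tier setup: bulk inference goes to a local OpenAI-compatible server (llama.cpp's server exposes a compatible `/v1/chat/completions` endpoint), and only the hardest prompts go to GPT-4. A minimal routing sketch; the endpoint URL, model names, and task-tier names below are illustrative assumptions, not anyone's real config:

```python
# Two-tier router: local server for cheap/bulk tasks, GPT-4 for the rest.
LOCAL_BASE_URL = "http://localhost:8080/v1"   # hypothetical local llama.cpp server
OPENAI_BASE_URL = "https://api.openai.com/v1"

def pick_endpoint(task: str) -> tuple[str, str]:
    """Return (base_url, model) for a given task tier."""
    # Tasks that need top-end reasoning stay on GPT-4; everything else
    # (summaries, rewrites, draft continuations) runs locally.
    if task in {"plot_outline", "continuity_check"}:
        return OPENAI_BASE_URL, "gpt-4"
    return LOCAL_BASE_URL, "local-model"

print(pick_endpoint("summarize_chapter"))
```

Because the local server speaks the same chat-completions protocol, the same client code can target either base URL, so switching tiers is just a config change.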
-
# Feature Description
Please provide a detailed written description of what you were trying to do, and what you expected `llama.cpp` to do as an enhancement.
# Motivation
It sounds like it's …
-
root@C.8174303:~/KoboldAI$ ./play.sh --model models/Aurora-Nights-103B-v1.0-5.0bpw-h6-exl2 --model_backend "ExLlama V2" --model_parameters help
Colab Check: False, TPU: False
INFO | __main__::…
-
ExLlama has implemented highly optimized CUDA kernels. We should port those kernels to see just how efficient AWQ could be with them.
https://github.com/turboderp/exllama/blob/master/exllama_ext/exllama_…
-
## Expected Behavior
When I upload a document, I should be able to refer to it in the chat and prompt the AI to perform tasks with the contents.
## Current Behavior
I upload a document and the A…
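The expected behavior above (referring to an uploaded document in chat) is typically implemented by injecting the document's text into the prompt context before the user's question. A minimal sketch under that assumption; the function name, delimiters, and naive truncation are all illustrative, and a real assistant would chunk and retrieve instead:

```python
def build_prompt(document: str, question: str, max_doc_chars: int = 4000) -> str:
    """Prepend (possibly truncated) document text so the model can answer
    questions about it. Truncation is a stand-in for proper chunking."""
    doc = document[:max_doc_chars]
    return (
        "You are given the following document:\n"
        f"---\n{doc}\n---\n"
        "Answer using only the document above.\n"
        f"Question: {question}\nAnswer:"
    )

print(build_prompt("The quarterly report shows revenue grew 12%.",
                   "How much did revenue grow?"))
```

If the AI ignores the upload, the usual cause is that the document text never reaches the context window at all, which this kind of explicit injection makes easy to verify.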
-
When converting [nemolita-21b](https://huggingface.co/win10/nemolita-21b), which is a merged model, `convert.py` runs into this error:
```shell
Traceback (most recent call last):
File "/hom…
-
Hello everyone,
I'm trying to set up exllama on an Azure ML compute instance. I followed the instructions here: https://github.com/turboderp/exllama, but unfortunately I'm getting an error when trying to call…
-
### Feature request
Integration of new 4-bit kernels
https://github.com/IST-DASLab/marlin
### Motivation
Provide faster inference than AWQ/ExLlama for batch sizes up to 32.
### Your contribut…
-
I'm sorry, but I am unable to find relevant documentation on the Internet about how to load all modules on the GPU.
I got this error message from my code:
```
Found modules on cpu/disk. Using Exllama backend requires al…
-
I'm using ExLlama with the Oobabooga text-generation UI, with the model TheBloke_llama2_70b_chat_uncensored-GPTQ.
The model works great, but using ExLlama as a loader, the model talks to itself, gen…
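Self-chat like this usually means generation runs past the assistant's turn and continues as the user. A common mitigation is cutting the output at turn markers (stop strings); text-generation-webui also exposes a custom stopping strings setting for this. A minimal sketch, where the specific marker strings are assumptions that depend on the model's prompt template:

```python
def truncate_at_stop(text: str, stops=("USER:", "### Human:")) -> str:
    """Cut generated text at the first stop marker so the model cannot
    'answer for' the user. Marker strings must match the chat template
    the model was trained on; the ones here are only examples."""
    cut = len(text)
    for s in stops:
        i = text.find(s)
        if i != -1:
            cut = min(cut, i)
    return text[:cut].rstrip()

print(truncate_at_stop("Sure, here you go.\nUSER: thanks!"))
```

If the markers match the template, everything after the model's own turn is discarded, which stops the back-and-forth with itself.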