llama-cpp Search Results

1000+ results
for llama-cpp


zed-industries/zed #18490

Expand AI Code Completion beyond Copilot and Supermaven

### Check for existing issues - [X] Completed ### Describe the feature After going through: https://zed.dev/docs/completions Zed currently supports completions via external LLM APIs like GitHub …

zerocorebeta updated 2 weeks ago
2
csag-uct/Metadata-Harmonisation-Tool #15

Remove need for API Keys

We rely on OpenAI API calls in two key areas: 1) To generate the distributions we create a prompt and then send it to the API for a response. Ideally we could just swap out this API call for a ca…

peterm790 updated 2 months ago
2
NumbersStationAI/DuckDB-NSQL #9

Simple example fails with No module named 'llama_cpp'

After installing the dependencies (adjusted per #5 and #8), and attempting to run an example such as the one from the readme, it fails on the first line when `llama_cpp` isn't installed.

jaraco updated 7 months ago
1
LlamaEdge/LlamaEdge #71

Model list on `https://huggingface.co/second-state`

### Summary - Provide k-quant models - Maintain existing gguf models - Embedding models - [x] [second-state/Nomic-embed-text-v1.5-Embedding-GGUF](https://huggingface.co/second-state/Nomic-…

apepkuss updated 3 weeks ago
2
lmstudio-ai/lmstudio-bug-tracker #186

Bug: No way to set the quantisation used for the k/v context…

I was trying to find where to set which quantisation to use for the K/V context cache and it seems you can't in LM Studio. K/V cache quantisation is required to run models context efficiently by re…

sammcj updated 1 week ago
1
OpenNMT/CTranslate2 #1650

Benchmarking common LLMs on ctranslate2, llama.cpp, and bits…

My initial testing comparing ct2 (using int8) and the ```bitsandbytes``` library at 4 and 8 bit...nicely done ctranslate2 people. Looking forward to testing GGUF in there as well. ![image](https:/…

BBC-Esq updated 6 months ago
1
NVIDIA/TensorRT-LLM #1232

Various CUDA errors when pp_size > 1.

First of all, when pp_size = 1, everything is good with tp_size = 1,2,4,8. My test on pipeline parallelism (pp_size > 1) always failed with different errors in the last few rows in this post. Firs…

jybbjybb updated 5 days ago
1
Mobile-Artificial-Intelligence/maid_llm #10

Crash occurs on low-level API Android devices

I found that the crash occurs on some low-level API (…

canluhuang updated 3 months ago
1
NVIDIA/TensorRT-LLM #1552

Cannot process new request: [TensorRT-LLM][ERROR] Assertion…

### System Info GPU 2* A30, TRT-LLM branch main, commit id: 66ef1df492f7bc9c8eeb01d7e14db01838e3f0bd ### Who can help? _No response_ ### Information - [x] The official example scripts - [ ] …

sleepwalker2017 updated 5 days ago
2
ggerganov/llama.cpp #9440

Feature Request: Pixtral by Mistral support (pixtral-12b-240…

### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.…

arch-btw updated 2 hours ago
10
