-
There are some new models that are being released only in LoRA adapter form (such as [this one](https://huggingface.co/kaiokendev/SuperCOT-LoRA/tree/main)). Since there is no merge released, th…
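For anyone who needs merged weights in the meantime, a minimal sketch of merging the adapter yourself with Hugging Face peft before converting for llama.cpp (the base model name and output path here are assumptions, not something specified by the adapter's authors):

```python
# Hedged sketch: bake a LoRA adapter into base weights, then convert/quantize
# as usual. Base model name and output path are placeholders.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("huggyllama/llama-13b")
model = PeftModel.from_pretrained(base, "kaiokendev/SuperCOT-LoRA")
merged = model.merge_and_unload()  # folds the adapter into the base weights
merged.save_pretrained("./supercot-merged")
```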
-
I downloaded some models from here for testing: https://rentry.org/nur779
CPU usage goes up to 60% for about 15 seconds, then it dies. If I don't load any character it works with the default settings, but it's really sc…
-
**Problem:**
I am aware everyone has different results; in my case I am running llama.cpp with a 4090 as the primary card and a 3090 as the secondary, so both are quite capable cards for LLMs.
I am getting around 800% s…
-
## Feature Request
#### Is your feature request related to a problem? Please describe.
Presently, in order to know whether a package update exists, one has to run `scoop status`.
But `scoop status` …
-
Obsoletes #147, #150, https://github.com/ggerganov/llama.cpp/issues/1575, https://github.com/ggerganov/llama.cpp/issues/1590, https://github.com/rustformers/llm/discussions/143, and probably some othe…
-
WHY did you guys end support for older Llama models? Why is backwards compatibility not added when you change formats? This is what pisses me off about open source, it's absolute freaking chaos, things…
-
I use the SSE streaming endpoint (/api/extra/generate/stream) in my application. I notice that with every request the prompt is not handled completely, but only a small part of it. Although in the cons…
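For reference, this is roughly how I consume the stream; a minimal sketch, assuming the default local port and a `token` field in each SSE event (both are assumptions on my side):

```python
import json
import requests

# Minimal sketch of consuming /api/extra/generate/stream; the host, port,
# payload fields, and the "token" event field are assumptions.
url = "http://localhost:5001/api/extra/generate/stream"
payload = {"prompt": "tell me a story", "max_length": 100}

with requests.post(url, json=payload, stream=True) as resp:
    for line in resp.iter_lines(decode_unicode=True):
        if line and line.startswith("data:"):
            event = json.loads(line[len("data:"):].strip())
            print(event.get("token", ""), end="", flush=True)
```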
-
Sorry if this is vague. I'm not super technical, but I managed to get everything installed and working (sort of).
Anyway, when I entered the prompt "tell me a story", the response in the webUI was "O…
-
After noticing a large, clearly visible slowdown in the ooba text UI compared to llama.cpp, I wrote a test script to profile llama-cpp-python's high-level API:
```
from llama_cpp import Llama
ll…
```
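The script was cut off above; a minimal sketch of that kind of timing harness (the model path, prompt, and parameters are placeholders, not the original script's values) looks like:

```python
import time
from llama_cpp import Llama

# Hedged sketch of timing the high-level API; model path and prompt
# are placeholders.
llm = Llama(model_path="./model.bin")

start = time.perf_counter()
out = llm("Q: Name the planets in the solar system. A:", max_tokens=64)
elapsed = time.perf_counter() - start

n_tokens = out["usage"]["completion_tokens"]
print(f"{n_tokens} tokens in {elapsed:.2f}s ({n_tokens / elapsed:.1f} tok/s)")
```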
-
So back when the project started, we had the first "unversioned" model format without the embedded tokens, with the magic 0x67676d6c ("ggml").
The problem with that was that it didn't have any versioning sup…
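Since the magic is just the first four bytes of the file, a quick way to tell the formats apart is to read it directly; a minimal sketch (the file path is a placeholder):

```python
import struct

GGML_MAGIC = 0x67676D6C  # "ggml" read as a little-endian uint32

def read_magic(path: str) -> int:
    # The old unversioned format starts with this magic and no version field.
    with open(path, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    return magic

print(hex(read_magic("model.bin")))  # 0x67676d6c for the unversioned format
```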