-
Darwin Feedloops-Mac-Studio-2.local 23.3.0 Darwin Kernel Version 23.3.0: Wed Dec 20 21:31:00 PST 2023; root:xnu-10002.81.5~7/RELEASE_ARM64_T6020 arm64
command: python -m llama_cpp.server --model ./…
-
Hey @ericcurtin, I was testing out the new changes and noticed that the ramalama container file on quay.io needs to be updated to include llama-simple-chat.
Everything worked when I built the container from …
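For anyone rebuilding the image locally, here is a rough sketch of the step that would put the missing binary in place (the clone location and install path are assumptions, not the actual Containerfile contents):

```bash
# Sketch only: build llama.cpp's simple-chat example and install it.
git clone https://github.com/ggerganov/llama.cpp
cmake -B llama.cpp/build -S llama.cpp
cmake --build llama.cpp/build --target llama-simple-chat
cp llama.cpp/build/bin/llama-simple-chat /usr/local/bin/
```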
-
Hello
I am running on the following machine:
CPU: 12th Gen Intel(R) Core(TM) i7-12700
RAM: 32GB, speed: 4400MT/s
GPU: NVIDIA RTX A2000 12GB
The model is:
llama-2-7b-chat.Q6_K.gguf
And it takes a…
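If the complaint is speed, the usual first check is whether any layers are being offloaded to the GPU. A minimal sketch, assuming a recent CUDA-enabled llama.cpp build (the flag values are illustrative):

```bash
# -ngl offloads layers to the GPU; a 7B Q6_K (~5.5 GB) fits entirely
# in the A2000's 12 GB, so all layers can be offloaded.
./llama-cli -m llama-2-7b-chat.Q6_K.gguf -ngl 99 -p "Hello" -n 128
```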
-
I want to use AWQ to quantize a model, then use llama.cpp's convert script to produce a GGUF. I followed the tutorial but got an error:
Traceback (most recent call last):
File "/root/ld/ld_project/llama.cpp/convert_m…
-
CMake Error at CMakeLists.txt:16 (add_subdirectory):
add_subdirectory given source "./llama.cpp" which is not an existing
directory.
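For reference, add_subdirectory resolves its argument relative to the CMakeLists.txt that calls it, so the directory must actually exist there. A minimal sketch of a layout that works, with the project and target names hypothetical:

```cmake
cmake_minimum_required(VERSION 3.14)
project(my_app)  # hypothetical project name

# Assumes llama.cpp has been cloned next to this CMakeLists.txt,
# e.g. as a submodule:
#   git submodule add https://github.com/ggerganov/llama.cpp llama.cpp
add_subdirectory(llama.cpp)

add_executable(my_app main.cpp)  # hypothetical target
target_link_libraries(my_app PRIVATE llama)
```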
-
### Describe the bug
Whenever I load certain GGUFs, I get the above error message in the terminal. I have seen it happen with Bartowski's Q8 quant of Llama 3 70B Instruct (a 3-part file) and llama-3-70B-…
-
### Feature request
I want to add the ability to use GGUF BERT models in transformers.
Currently the library does not support this architecture; when I try to load one, I get the error TypeError: Ar…
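For reference, a sketch of what this would enable, using the gguf_file argument that transformers already exposes for supported architectures (the repo and file names below are hypothetical):

```python
from transformers import AutoModel, AutoTokenizer

# Hypothetical GGUF BERT checkpoint; today this raises the TypeError
# above because the BERT architecture is not in the GGUF loader.
repo_id = "some-user/bert-base-uncased-gguf"
filename = "bert-base-uncased.Q8_0.gguf"

tokenizer = AutoTokenizer.from_pretrained(repo_id, gguf_file=filename)
model = AutoModel.from_pretrained(repo_id, gguf_file=filename)
```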
-
I came across a model on Hugging Face, [Bunny-Llama-3-8B-V: bunny-llama](https://huggingface.co/BAAI/Bunny-Llama-3-8B-V), that adds multimodal support to Llama 3, and I'd like to be able to deploy it using lla…
-
# Expected Behavior
The server should cache both the previous prompt and the last generation.
# Current Behavior
The cache misses at the end of the previous prompt, forcing the server to evaluate the pr…
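For context, reproduction goes through the server's /completion endpoint with prompt caching requested. A minimal sketch, assuming a llama.cpp server listening on localhost:8080:

```python
import requests

# cache_prompt asks the server to reuse the KV cache for the shared
# prompt prefix instead of re-evaluating it on every request.
resp = requests.post(
    "http://localhost:8080/completion",
    json={"prompt": "Once upon a time", "n_predict": 64, "cache_prompt": True},
)
print(resp.json()["content"])
```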
-
Hello, following the tutorial I ran yarn dev and got the following error:
⨯ ./node_modules/@kwsites/file-exists/dist/src/index.js:6:13
Module not found: Can't resolve 'fs'
https://nextjs.org/docs/messages/module-not-found
Import trace f…
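A common workaround sketch for this class of error, assuming the package that pulls in fs is only needed server-side: tell webpack not to resolve Node core modules in the client bundle via next.config.js:

```js
// next.config.js — a sketch, not a verified fix for this exact setup.
module.exports = {
  webpack: (config, { isServer }) => {
    if (!isServer) {
      // 'fs' only exists in Node; stub it out of the client bundle.
      config.resolve.fallback = { ...config.resolve.fallback, fs: false };
    }
    return config;
  },
};
```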