-
I am trying to run this through Docker with a LLaMA model (7B):
```
docker run --gpus all --shm-size 1g -p 8081:80 -v ./7B-transformed:/data/7B ghcr.io/yk/text-generation-inference:llama --model-i…
```
-
Hello, would it be possible to integrate something like GPT4All, which runs locally and, unlike OpenAI, doesn't cost anything?
-
`llama.cpp` now supports the new k-quants quantization formats, which achieve good model perplexity even at high compression ratios. See https://github.com/ggerganov/llama.cpp/pull/1684.
We should also support th…
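For reference, here is a minimal sketch of the tensor types a Rust-side loader would need to recognize after that PR. The variant names mirror the upstream `GGML_TYPE_*` constants; how `llama-rs` would actually model them is an assumption.

```rust
// Minimal sketch (assumed Rust-side mapping): GGML tensor types after
// llama.cpp PR #1684 added the k-quants.
#[allow(non_camel_case_types, dead_code)]
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum GgmlType {
    // Pre-existing types.
    F32,
    F16,
    Q4_0,
    Q4_1,
    Q5_0,
    Q5_1,
    Q8_0,
    // New k-quants (super-block quantization with per-block scales).
    Q2_K,
    Q3_K,
    Q4_K,
    Q5_K,
    Q6_K,
    Q8_K,
}

fn main() {
    // Example: a Q4_K tensor as it might appear in a model header.
    let ty = GgmlType::Q4_K;
    println!("{:?}", ty);
}
```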
-
I got a lot of error messages like this one:
```
Cannot find module 'llama-node/dist/llm/llama-rs' or its corresponding type declarations.
```
How can I install the types?
-
This may be something to keep an eye on: https://github.com/ggerganov/llama.cpp/pull/439
Looks like the corresponding code is here: https://github.com/rustformers/llama-rs/blob/bf7bdbcfff3114dcbdaf…
-
```
C:\Users\micro\Downloads\llamacord>cargo run --release
Finished release [optimized] target(s) in 0.16s
Running `target\release\llamacord.exe`
thread '' panicked at 'called `Result::un…
```
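As a side note, the message is opaque because `unwrap()` discards all context. A minimal sketch (hypothetical code, not llamacord's) of the difference `expect()` makes:

```rust
use std::fs::File;

fn main() {
    // `unwrap()` panics with only:
    //   called `Result::unwrap()` on an `Err` value: ...
    // let _f = File::open("Config.toml").unwrap();

    // `expect()` panics with a message that names the failing step,
    // which makes reports like the one above much easier to diagnose.
    // ("Config.toml" is a hypothetical file name for illustration.)
    let _f = File::open("Config.toml")
        .expect("failed to open Config.toml next to the executable");
}
```

Running with `RUST_BACKTRACE=1` also shows where the panic originated.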
-
The model runs successfully with `llama.cpp` but not with `llama-rs`.
Command:
```
cargo run --release -- -m C:\Users\Usuário\Downloads\LLaMA\7B\ggml-model-q4_0.bin -p "Tell me how cool the Rust programmin…
```
-
Trying any GPT-2 GGML model through the CLI appears to cause an immediate segfault:
```
llama-rs # cargo run --bin llm gpt2 infer -m models/gpt2/cerebras-2.7b-q4_0.bin -p "Now, this is a story all…
```
-
When using the `compression` feature, incremental builds where only the embedded files change do not trigger a rebuild. This is most obvious when using both `compression` and `debug-embed`.
I think this …
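For context, one common workaround is a `build.rs` in the consuming crate that tells Cargo to watch the asset directory; the `assets` path below is hypothetical.

```rust
// build.rs — minimal sketch: force Cargo to re-run the build (and thus
// re-expand the embedding macro) whenever anything under ./assets changes.
fn main() {
    println!("cargo:rerun-if-changed=assets");
}
```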
-
Support for Metal GPU acceleration on macOS (and, I assume, iOS) was just merged into llama.cpp master: https://github.com/ggerganov/llama.cpp/pull/1642
It would be great if this could also be employed fro…
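A minimal sketch of what the Rust-side plumbing might look like, assuming the backend ends up behind a Cargo feature; the `metal` feature name and the constants are hypothetical.

```rust
// Minimal sketch (hypothetical): selecting an accelerator at compile time
// via a Cargo feature, the way a wrapper crate might expose llama.cpp's
// new Metal backend.
#[cfg(feature = "metal")]
const BACKEND: &str = "metal";

#[cfg(not(feature = "metal"))]
const BACKEND: &str = "cpu";

fn main() {
    println!("inference backend: {BACKEND}");
}
```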