-
Hi everyone!
I've been using `llama-rs` recently and I especially love the caching session feature. I've been building an API that allows users to maintain multiple conversation threads and switch…
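For context, a minimal sketch of the kind of per-conversation session store described above (the `Session` and `SessionStore` types here are hypothetical placeholders, not the llama-rs API):
```rust
use std::collections::HashMap;

// Hypothetical stand-in for a cached inference session; in practice this
// would wrap whatever session/snapshot type the library exposes.
struct Session {
    history: Vec<String>,
}

// Keeps one cached session per conversation id so callers can switch threads.
struct SessionStore {
    sessions: HashMap<String, Session>,
}

impl SessionStore {
    fn new() -> Self {
        Self { sessions: HashMap::new() }
    }

    // Fetch the session for a conversation, creating it on first use.
    fn session_mut(&mut self, conversation_id: &str) -> &mut Session {
        self.sessions
            .entry(conversation_id.to_string())
            .or_insert_with(|| Session { history: Vec::new() })
    }
}

fn main() {
    let mut store = SessionStore::new();
    store.session_mut("thread-1").history.push("Hello".to_string());
    store.session_mut("thread-2").history.push("Hi there".to_string());
    // Switching back to thread-1 reuses its cached state.
    assert_eq!(store.session_mut("thread-1").history.len(), 1);
}
```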
-
See https://huggingface.co/Bradarr/gpt4-x-alpaca-13b-native-ggml-model-q4_0
```
cargo run --release -- -m ./gpt4-x-alpaca-13b-native-ggml-model-q4_0.bin -p "How do you do ?"
```
```
thread 'm…
-
Following the same steps works for the 7B and 13B models; with the 30B model I get
```
thread 'main' panicked at 'Could not load model: Tensor tok_embeddings.weight has the wrong size in model fil…
-
![image](https://github.com/LLukas22/llm-rs-python/assets/20523129/d74de86c-cb54-422f-8639-1144cb699e55)
The AutoConverter works and produces fp16 weights (13 GB for llama 7b); these can be loaded as wel…
-
Function for measuring context windows (a possible shape is sketched below):
1. A tokenisation algorithm
2. The number of tokens to split to
3. Returns a Vec where each string fits within the window.
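A rough sketch of one possible shape for such a helper (the `tokenize` closure stands in for whatever tokenisation algorithm is passed in; none of this is an existing llama-rs API):
```rust
/// Splits `text` into chunks whose token count does not exceed `max_tokens`,
/// using the caller-supplied tokeniser to measure each candidate chunk.
/// Whitespace-word granularity keeps the sketch simple; a real version would
/// split on token boundaries instead.
fn split_to_window<F>(text: &str, max_tokens: usize, tokenize: F) -> Vec<String>
where
    F: Fn(&str) -> Vec<u32>,
{
    let mut chunks = Vec::new();
    let mut current = String::new();

    for word in text.split_whitespace() {
        let candidate = if current.is_empty() {
            word.to_string()
        } else {
            format!("{current} {word}")
        };

        if tokenize(&candidate).len() <= max_tokens || current.is_empty() {
            // Still fits (or a single word is already over the limit): keep growing.
            current = candidate;
        } else {
            // Adding this word would overflow the window: close the chunk.
            chunks.push(std::mem::take(&mut current));
            current = word.to_string();
        }
    }

    if !current.is_empty() {
        chunks.push(current);
    }
    chunks
}

fn main() {
    // Toy tokeniser: one token per whitespace-separated word.
    let tokenize = |s: &str| s.split_whitespace().map(|_| 0u32).collect::<Vec<_>>();
    let chunks = split_to_window("one two three four five", 2, tokenize);
    assert_eq!(chunks, vec!["one two", "three four", "five"]);
}
```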
-
First, for the `revokePolicy` method that loops over all roles:
1. Check if we can revoke a policy if `numRoles` is 255 AND the user holds 0 roles
2. Check if we can revoke a policy if `numRoles` is…
-
Hi, I am trying to figure out this problem; any help would be great.
The build fails on Windows 11 at `cargo build --release`.
Error message:
`Caused by:
process didn't exit successfully: `C:\Users\touhidu…
-
Hi,
I think there is possibly a data race in the function `ggml_compute_forward_mul_mat`.
From my understanding, the problem is that we are mutating the C matrix, `cp`, but there is a possibility of dif…
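For illustration only (this is not ggml's code), here is a Rust sketch of the property the report is questioning: each thread must own a disjoint region of the output matrix, otherwise concurrent writes to `cp` would overlap.
```rust
use std::thread;

// Illustrative only: a row-parallel C = A * B (row-major) where each spawned
// thread writes a disjoint chunk of rows of C, so no two threads ever touch
// the same output element. If the per-thread ranges overlapped, that would be
// exactly the kind of race described above.
fn parallel_matmul(a: &[f32], b: &[f32], c: &mut [f32], k: usize, m: usize) {
    thread::scope(|scope| {
        // chunks_mut hands out non-overlapping &mut row slices of C.
        for (row, row_out) in c.chunks_mut(m).enumerate() {
            scope.spawn(move || {
                for col in 0..m {
                    let mut sum = 0.0;
                    for i in 0..k {
                        sum += a[row * k + i] * b[i * m + col];
                    }
                    row_out[col] = sum;
                }
            });
        }
    });
}

fn main() {
    // 2x2 identity times a 2x2 matrix returns the matrix unchanged.
    let a = [1.0f32, 0.0, 0.0, 1.0];
    let b = [1.0f32, 2.0, 3.0, 4.0];
    let mut c = [0.0f32; 4];
    parallel_matmul(&a, &b, &mut c, 2, 2);
    assert_eq!(c, b);
}
```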
-
### System Info
The error specified in the title occurs with various nvidia/cuda configurations (530/12.1, 530/11.8, 470/11.4, etc.) and with various versions of pythia (sft-1 and sft-4) and starcoder. …
-
I fixed `main.rs` to refer to `&args.model_path`, but now I get a new error:
```
Could not load model: invalid utf-8 sequence of 1 bytes from index 0
```
I created these models using the tools…