-
It'd be nice to be able to use this tool with an offline, locally running, open-source LLM instead of the OpenAI APIs. You could probably use the https://github.com/rustformers/llm crate to achie…
-
Execute: `cargo run --example llama`
I get the following error:
```
Running on CPU, to run on GPU, build this example with `--features cuda`
loading the model weights from meta-llama/Llama-2-7b-hf
Error: request …
```
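The error is cut off, but since meta-llama/Llama-2-7b-hf is a gated Hugging Face repo, one plausible cause (an assumption here, given the truncated message) is an unauthenticated weight download. A minimal Python probe with `huggingface_hub` can check repo access independently of the Rust example:

```python
from huggingface_hub import hf_hub_download

# meta-llama/Llama-2-7b-hf is gated: this download only succeeds after the
# license has been accepted on the Hub and a valid token is supplied.
# Replace the placeholder token with your own (or run `huggingface-cli login`).
path = hf_hub_download(
    repo_id="meta-llama/Llama-2-7b-hf",
    filename="config.json",
    token="hf_...",  # placeholder token
)
print(f"Access OK, downloaded to {path}")
```

If this raises a 401/403, the example will keep failing until a token is configured.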
-
It'd be good to be able to bounce ideas off each other in real time, rather than through issues, for more moment-to-moment discussion. The popular choices in the Rust world are Discord and Zulip, from wh…
-
I started a new model server using `llama_cpp.server` with the following command:
```
python3 -m llama_cpp.server --model ~/dev/models/codellama-13b.Q5_K_M.gguf --n_gpu_layers 35 --n_batch 12000
```
This sta…
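For reference, `llama_cpp.server` exposes an OpenAI-compatible HTTP API, so a server started as above can be queried directly. A minimal sketch, assuming the default host and port (`localhost:8000`) and the plain `requests` library:

```python
import requests

# POST to the OpenAI-compatible completions endpoint served by llama_cpp.server.
# Host/port are the defaults; adjust if --host/--port were passed to the server.
resp = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "prompt": "def fibonacci(n):",
        "max_tokens": 64,
        "temperature": 0.2,
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["text"])
```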
-
**Describe the bug**
Out of memory when deploying TabbyML/CodeLlama-7B on Modal with the default Modal app.py script.
**Information about your version**
```
IMAGE_NAME = "tabbyml/tabby:0.5.4"
MODEL_…
```
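For context, a 7B model needs roughly 14 GB for fp16 weights alone, so an OOM here usually means the requested GPU is too small. One way to address that is to ask Modal for a larger GPU in the function decorator. A minimal, hypothetical sketch (this is not Tabby's actual app.py, and the launch command is an assumption), assuming the Modal Python SDK:

```python
import modal

# Hypothetical Modal app, not Tabby's real app.py.
app = modal.App("tabby-codellama-7b")
image = modal.Image.from_registry("tabbyml/tabby:0.5.4")

@app.function(
    image=image,
    gpu="A100",  # request a larger-memory GPU than the default to avoid the OOM
    timeout=600,
)
def serve():
    import subprocess
    # Launch the Tabby server inside the container (command is illustrative).
    subprocess.run(
        ["tabby", "serve", "--model", "TabbyML/CodeLlama-7B", "--device", "cuda"],
        check=True,
    )
```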
-
Command:
```
lm_eval --model vllm \
    --model_args pretrained=${MODELDIR},tokenizer_mode="slow",tensor_parallel_size=$NUM_GPU,dtype=auto,gpu_memory_utilization=0.8 \
    --tasks arc_challenge \
    …
```
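The same evaluation can also be driven from Python, which can make failures easier to debug. A minimal sketch, assuming lm-evaluation-harness's `simple_evaluate` API, with placeholder values standing in for `${MODELDIR}` and `$NUM_GPU`:

```python
import lm_eval

MODEL_DIR = "/path/to/model"  # placeholder for ${MODELDIR}
NUM_GPU = 2                   # placeholder for $NUM_GPU

# Python equivalent of the CLI invocation above.
results = lm_eval.simple_evaluate(
    model="vllm",
    model_args=(
        f"pretrained={MODEL_DIR},tokenizer_mode=slow,"
        f"tensor_parallel_size={NUM_GPU},dtype=auto,gpu_memory_utilization=0.8"
    ),
    tasks=["arc_challenge"],
)
print(results["results"]["arc_challenge"])
```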
-
### System Info
Running on CPU.

#### CPU Details
```
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Address sizes: 46 bits physical, 48 bits virtual
Byte Ord…
```
-
### System Info
I deploy meta-llama/Llama-2-13b-chat-hf with TGI and see roughly 80 GB allocated, as reported by nvidia-smi:
```
+-----------------------------------------------------------------------…
```
-
### 🐛 Describe the bug
When running train.sh in colossal-llama2 under the application directory, I get the following error:
```
bash train.sh
/data_lc/envs/coloai/lib/python3.10/site-packages/colossalai/initialize.py:48: UserWarning: `config…
```
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
I have started with QLoRA on DPO datasets and would like to continue on PT datasets.
What parameters…