-
Does anyone know how to use Llama 3.1 or Llama 3 in this addon?
I've tried downloading "Meta-Llama-3.1-8B-Instruct-Q2_K.gguf" from https://huggingface.co/bullerwins/Meta-Llama-3.1-8B-Instruct-GGUF/tree/main…
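For reference, a minimal sketch of loading that GGUF file, assuming the addon wraps llama-cpp-python (the file name is the one from the report; the path, context size, and prompt are illustrative):

```python
# Minimal sketch, assuming the addon wraps llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="Meta-Llama-3.1-8B-Instruct-Q2_K.gguf",  # file named in the report
    n_ctx=8192,       # context window to allocate
    n_gpu_layers=-1,  # offload all layers to the GPU if one is available
)
resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Say hello."}],
    max_tokens=64,
)
print(resp["choices"][0]["message"]["content"])
```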
-
This occurs when using two GPUs, but not when I use just one.
I made sure to update to the Docker image used in the Dockerfile.
commit: a702c6dd2944aaf75800b11f4dfeec6fe5a9b068…
-
I was running the example script `examples/scripts/train_ppo_llama.sh`.
Basically, it's PPO on Llama 3 8B with 8×H100s, using flash_attn, ZeRO-3, gradient_checkpointing, and adam_offload, but it goes OOM after some…
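For context, a hedged sketch of what that combination implies on the DeepSpeed side: a ZeRO stage-3 config with the Adam states offloaded to CPU. The values below are illustrative, not the exact config the script generates:

```python
# Sketch of a DeepSpeed ZeRO stage-3 config with Adam offloaded to CPU,
# matching the zero3 + adam_offload combination described above.
# (Illustrative batch size and clipping; not OpenRLHF's actual config.)
ds_config = {
    "train_micro_batch_size_per_gpu": 2,
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,                  # partition params, grads, and optimizer states
        "offload_optimizer": {       # adam_offload: keep Adam states in CPU RAM
            "device": "cpu",
            "pin_memory": True,
        },
    },
    "gradient_clipping": 1.0,
}
```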
-
Hi, I have noticed that there is a huge difference in memory usage for the runtime buffers and decoder between Llama 3 and Llama 3.1. Is it possible to know why?
I have built an 8-bit quantised Llama 3 engine a…
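One plausible explanation: Llama 3.1 raised the default context length from 8k to 128k, and runtime buffers such as the KV cache scale linearly with the maximum sequence length the engine is built for. A back-of-envelope calculation, assuming standard Llama-3-8B shapes (32 layers, 8 KV heads under GQA, head_dim 128, fp16):

```python
# Rough KV-cache size, assuming Llama-3-8B shapes: 32 layers,
# 8 KV heads (GQA), head_dim 128, fp16 (2 bytes per element).
def kv_cache_bytes(seq_len, batch=1, layers=32, kv_heads=8, head_dim=128, dtype_bytes=2):
    return 2 * layers * kv_heads * head_dim * seq_len * batch * dtype_bytes  # 2x: keys and values

print(kv_cache_bytes(8_192) / 2**30)    # Llama 3 default (8k):     ~1 GiB per sequence
print(kv_cache_bytes(131_072) / 2**30)  # Llama 3.1 default (128k): ~16 GiB per sequence
```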
-
C:\Users\razvan\Downloads\mindcraft-main\mindcraft-main>node main.js
file:///C:/Users/razvan/Downloads/mindcraft-main/mindcraft-main/settings.js:8
"profiles": [
^^^^^^^^^^
SyntaxError: U…
-
**Describe the bug**
I am trying to use Meta Llama 3.1 and 3.2, which require inference profile support.
I am getting this error: `Unsupported model us.meta.llama3-1-70b-instruct-v1:0, please use models API to get…
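For what it's worth, a hedged sketch of calling that cross-region inference profile through boto3's Converse API (region and prompt are placeholders):

```python
# Sketch: invoking the cross-region inference profile via Bedrock's Converse API.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")  # placeholder region
response = client.converse(
    modelId="us.meta.llama3-1-70b-instruct-v1:0",  # profile ID from the error message
    messages=[{"role": "user", "content": [{"text": "Hello"}]}],
)
print(response["output"]["message"]["content"][0]["text"])
```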
-
I am trying to send a rather long prompt (36k tokens) to vLLM-supported models, in particular llama3_8B_Instruct. However, I am getting the error below:
scheduler.py:648] Input prompt (36893 tokens)…
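The prompt likely exceeds the engine's max_model_len. A hedged sketch of raising it is below; note that the base Llama 3 8B Instruct checkpoint was trained with an 8k context, so a 36k-token prompt also needs a long-context checkpoint (such as Llama 3.1) or RoPE scaling. The model name and limit are illustrative:

```python
# Sketch: serving with a max_model_len large enough for the 36k-token prompt.
# Assumes a long-context checkpoint; model name and limit are illustrative.
from vllm import LLM, SamplingParams

llm = LLM(
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",  # 128k-context variant
    max_model_len=40960,  # must cover prompt tokens plus generated tokens
)
outputs = llm.generate(["<36k-token prompt>"], SamplingParams(max_tokens=256))
print(outputs[0].outputs[0].text)
```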
-
**Description**
I have noticed that there is a huge difference in memory usage for the runtime buffers and decoder between Llama 3 and Llama 3.1.
**Triton Information**
What version of Triton are you usin…
-
**Is your feature request related to a problem? Please describe.**
Llama 3.2 was released, and since it has multimodal support it would be great to have it in LocalAI.
**Describe the solution you'd li…