-
After downloading the 405B model, I tried to run the tool `convert_llama_ckpt.py`, but I keep getting this error. My setup:
Compute: n2d-highmem-96 with 768 GB of memory on Vertex Workbench
Python ver…
-
### 🚀 The feature, motivation and pitch
Create a distribution for AMD ROCm GPUs, similar to distributions/meta-reference-gpu, which is based on NVIDIA GPUs.
### Alternatives
_No response_
### Additional…
-
I have set up ipex-llm by following [install ipex-llm for llama.cpp](https://github.com/intel-analytics/ipex-llm/blob/main/docs/mddocs/Quickstart/llama_cpp_quickstart.md#1-install-ipex-llm-for-llamacpp…
-
Hi, please see below.
I can download these directly with no problem, but when I try through LM Studio, it fails.
I'm on Windows 10 and have been using your amazing software for a few months now; this is happeni…
-
We did the following:
1. Took the nvidia/Llama-3.1-Nemotron-70B-Instruct-HF base model and fine-tuned it on our custom dataset for a classification task. Training completed in about 6 hours, and w…
-
## Describe the bug
Llama 3.2 11B Vision cannot start after loading the model
```
Error: DriverError(CUDA_ERROR_INVALID_PTX, "a PTX JIT compilation failed") when loading utanh_bf16
```
my…
-
I encountered an issue while trying to export a GGUF model file for Mistral Nemo and Mistral 7B fine-tunes using the `unsloth` library. The error occurs during the `save_pretrained_gguf` function call, …
-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A…
-
### Proposal to improve performance
_No response_
### Report of performance regression
Model: meta-llama/Meta-Llama-3-8B-Instruct
GPU: 1x A6000
| SamplingParams.logprobs | Generation Throughput…