-
Good day,
After loading the saved LoRA model, I save it as a merged model. Then, after loading it from the merged checkpoint, I get generation like '+++++ 1000000000000000000000000000000000000000000000000…
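For context, the merge step folds the low-rank adapter update back into the base weight. Below is a minimal numerical sketch of that arithmetic with toy shapes; the names W, A, B, alpha, and r follow the usual LoRA convention and are not from this report — in practice the merge is performed by the library (e.g. peft's merge_and_unload):

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r, alpha = 8, 8, 2, 4  # toy dimensions; rank r, scaling alpha

W = rng.standard_normal((d_out, d_in))  # base weight
A = rng.standard_normal((r, d_in))      # LoRA "A" (down-projection)
B = rng.standard_normal((d_out, r))     # LoRA "B" (up-projection)

# Merged weight: W' = W + (alpha / r) * B @ A
W_merged = W + (alpha / r) * (B @ A)

x = rng.standard_normal(d_in)
# Applying the merged weight equals the base output plus the scaled LoRA path.
assert np.allclose(W_merged @ x, W @ x + (alpha / r) * (B @ (A @ x)))
```

If generation degenerates only after the merge round-trip, it is worth checking that the merged weights were saved in the same dtype as the base model, since casting during save/reload is a common source of this kind of corruption.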
-
### System Info
TypeScript 5.5.4
transformers.js 3.0.2
Node.js v20.17.0
### Environment/Platform
- [X] Website/web-app
- [ ] Browser extension
- [X] Server-side (e.g., Node.js, Deno, Bun)
- [ ] De…
-
Dear @shewu-quic, @cccclai (please mention anyone else relevant),
Could you share the command used to create the R matrix for generating the Llama-3.2-3B & 1B SpinQuant-INT4-E08 .pth files you've releas…
-
Use case: users have pre-provisioned PVs that contain models and support the ReadOnlyMany access mode. The user is responsible for ensuring a compatible model is stored on the PV and for creating a PVC.
…
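As a sketch of the pattern described above (all names, the capacity, and the storage-class setting are illustrative assumptions, not part of the proposal), a claim against such a pre-provisioned PV could look like:

```yaml
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: model-store        # hypothetical claim name
spec:
  accessModes:
    - ReadOnlyMany         # many pods may mount the model read-only
  resources:
    requests:
      storage: 10Gi        # must fit within the pre-provisioned PV
  volumeName: model-pv     # binds directly to the pre-provisioned PV
  storageClassName: ""     # disables dynamic provisioning for this claim
```

Setting `storageClassName: ""` together with `volumeName` is the standard way to bind a claim statically rather than letting a provisioner create a new volume.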
-
### System Info
Collecting environment information...
PyTorch version: 2.2.2
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A
OS: macOS 13.6.6 (x86_64)
G…
-
We want to deploy https://huggingface.co/unsloth/Llama-3.2-1B-Instruct-bnb-4bit, which is a 4-bit quantized version of the Llama-3.2-1B model, quantized using bitsandbytes. Can we deploy this using ten…
-
### Background Description
Ref: https://github.com/ggerganov/llama.cpp/pull/7553 , required for supporting future vision models (https://github.com/ggerganov/llama.cpp/issues/8010)
I initially pla…
-
To get this to work, first you have to get an external AMD GPU working on Pi OS. The most up-to-date instructions are currently on my website: [Get an AMD Radeon 6000/7000-series GPU running on Pi 5](…
-
### System Info
Python : 3.12.4
pandasai : 2.2.14
ibm_watsonx_ai : 0.2.6
### 🐛 Describe the bug
from pandasai import SmartDataframe
import pandas as pd
from pandasai.llm import IBMwatsonx
#…