-
### What happened?
LLaMA 3 has been trained with an 8192-token context. When using a single slot with the llama.cpp HTTP server, that slot is assigned the full 8192-token context. However, when using multiple slots and n…
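A minimal sketch of the behavior being described, under the assumption that the server splits the total context evenly across parallel slots (`n_ctx // n_slots`); the function name is hypothetical, not part of llama.cpp's API:

```python
def per_slot_context(n_ctx: int, n_slots: int) -> int:
    """Context tokens available to each slot, assuming an even split."""
    return n_ctx // n_slots

# With a single slot, the slot gets the full trained context:
assert per_slot_context(8192, 1) == 8192
# With 4 slots, each slot only gets a quarter of it:
assert per_slot_context(8192, 4) == 2048
```

Under this assumption, raising the slot count shrinks the usable context per request unless `n_ctx` is increased proportionally.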
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-6.11.2-arch1-1-x86_64-with-glibc2.40
- Pyt…
-
I would like to use this library for in-browser web ML inference because, with the upcoming CPU support, it is better than
1. ggml.cpp (llama.cpp/whisper.cpp) - as it supports both CPU and GPU and can u…
-
I tried to run h2ogpt with this command:
`python generate.py --base_model=meta-llama/Meta-Llama-3.1-8B-Instruct --use_auth_token=...`
and it triggered the following error:
```The attention mask and the p…
-
Hi,
I have a question regarding the Hugging Face model weights.
I was trying to load some of your adapters and play with them, but I found that the adapters were very large (~4GB), as in the screenshot be…
-
The official ollama supports this model as of v0.3.4:
[https://github.com/ollama/ollama/releases/tag/v0.3.4](https://github.com/ollama/ollama/releases/tag/v0.3.4)
Tried with ollama in 2.1.0b20240820, …
-
Hi,
I am currently attempting to reproduce the experiments detailed in the section "Process Rewards Annotating (Taking LogiQA-v2 as an Example)" of your README.md. However, as I reach…
-
### System Info
```shell
I'm running inf2 neuron TGI on Sagemaker with optimum-neuron=0.0.25.
I'm using the SPECULATE=2 option but I get the following message in the logs:
Error: No such opt…
-
### Description of the bug:
Hi @pkgoogle ,
I used the example C++ code to run inference on the model I converted, and it shows an error.
- My command:
```
bazel run -c opt //ai_edge_torch/generative/example…
-
## Describe the bug
Download https://github.com/EricLBuehler/mistral.rs/releases/download/v0.2.2/mistralrs-server-aarch64-apple-darwin.tar.xz
Use a tool like [asitop](https://github.com/tlkh/asito…