-
### System Info
Hi everyone, when trying to update from Llama 3 8B Instruct to Llama 3.1 8B Instruct, I noticed a crash:
```bash
Args {
model_id: "meta-llama/Meta-Llama-3.1-8B-Instruct",
…
```
-
Hi:
we're trying to run summarization with a SmoothQuant Llama model, but the following was reported:
```
Loading checkpoint shards: 100%|████████████████████████████████████████████████████| 3/3 [00:13<…]
…>=4.36 and torch>=2.1.1 t…
```
-
### Describe the issue as clearly as possible:
On certain prompts, the LLM can spiral into an infinite loop, producing the same item repeatedly until stopped by the max_tokens parameter.
In that case, t…
ea167 updated
2 months ago
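For the repetition issue above, one common workaround (a minimal sketch, not taken from any particular library — the function name, window size, and repeat count are illustrative assumptions) is to stop decoding early once the tail of the output keeps repeating the same n-gram:

```python
# Sketch: detect when the tail of a generated token sequence is the same
# n-gram repeated back-to-back, so the caller can stop decoding before
# max_tokens is reached. `n` and `times` are illustrative defaults.

def repeats_last_ngram(token_ids, n=8, times=3):
    """Return True if the trailing n-gram occurs `times` times in a row."""
    if len(token_ids) < n * times:
        return False
    tail = token_ids[-n:]
    # Compare each of the last `times` windows of length n against the tail.
    return all(
        token_ids[-(i + 1) * n : -i * n or None] == tail
        for i in range(times)
    )
```

In a sampling loop you would call this after each generated token and break out of generation when it returns True.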
-
### What happened?
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
/owner/ninth/llama.cpp/ggml/src/ggml-cann.cpp:61: CANN error: E89999: In…
-
### Bug Description
I'm deploying a web app on PythonAnywhere, which is a hosting service for Python web apps on Linux machines.
I've created a virtual env with virtualenvwrapper and set it up in the web a…
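For context, a typical virtualenvwrapper setup on PythonAnywhere looks roughly like the sketch below; the env name `myapp-env`, the Python version, and the requirements file are illustrative assumptions, not details from the report:

```shell
# Create and activate a named virtualenv (illustrative name and version)
mkvirtualenv myapp-env --python=python3.10
workon myapp-env

# Install the app's dependencies into it (assumed requirements file)
pip install -r requirements.txt

# Then point the web app's "Virtualenv" field at the env's path, e.g.:
#   /home/<username>/.virtualenvs/myapp-env
```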
-
So far I've ported the following models to Java:
Llama 3 & 3.1, Mistral/Codestral/Mathstral/Nemostral (+ Tekken tokenizer), Qwen2, Phi3 and Gemma 1 & 2 ...
All models are bundled as a single ~2K li…
mukel updated
1 month ago
-
Hi, I tried to test Llama 2 on TensorRT-LLM.
My environment (based on "nvcr.io-nvidia-tritonserver-23.10-trtllm-python-py3"):
> cuda 12.2
> gpu A100 40G (1)
> python 3.10.12
> ubunt…
-
I experimented with LLaMA 2.
I want to **replicate multiple experiments**. To begin with, I ran the demo and obtained the following results.
My results:
![image](https://github.com/user-at…
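When replicating experiments like the one above, run-to-run drift often comes from unseeded RNGs. A minimal sketch, assuming a PyTorch-style setup (the helper name and the exact set of flags are my own, and version-dependent):

```python
# Sketch: pin the RNG seeds that usually vary between runs. Torch is
# treated as optional so the helper also works in CPU-only environments.
import random

import numpy as np

try:
    import torch  # only needed if the demo runs on PyTorch
except ImportError:
    torch = None


def seed_everything(seed: int = 42) -> None:
    """Seed Python, NumPy, and (if present) PyTorch RNGs."""
    random.seed(seed)
    np.random.seed(seed)
    if torch is not None:
        torch.manual_seed(seed)
        torch.cuda.manual_seed_all(seed)  # no-op without a GPU
        # Trade speed for determinism in cuDNN kernels.
        torch.backends.cudnn.deterministic = True
        torch.backends.cudnn.benchmark = False
```

Note that seeding alone may not make GPU decoding bit-identical; nondeterministic CUDA kernels and sampling temperature also matter.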
-
**Describe the bug**
llama-server exited with status code -1
**Information about your version**
Unable to get version as it will not start. Docker image used:
```
REPOSITORY …
```
-
### System Info
```Shell
latest version. tested via both `pip install -U accelerate` and `pip install git+https://github.com/huggingface/accelerate`
```
### Information
- [ ] My own modif…