-
### Describe the bug
When passing a `response_format` of type `regex` to `chat_completion`, the generated output does not always match the regex.
### Reproduction
This does not follow the regex:
```
fro…
```
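For context, the call shape being described is roughly the sketch below. It assumes a recent `huggingface_hub` `InferenceClient` backed by a TGI-style endpoint; the model name, prompt, and regex are placeholders, not taken from the truncated reproduction:

```python
# Minimal sketch of a regex-constrained chat_completion call (assumption:
# recent huggingface_hub against a TGI-style backend; model and regex are
# placeholders, not the reporter's).
from huggingface_hub import InferenceClient

client = InferenceClient("meta-llama/Meta-Llama-3-8B-Instruct")  # hypothetical model

response = client.chat_completion(
    messages=[{"role": "user", "content": "Give me a date in YYYY-MM-DD format."}],
    # The grammar constraint: the backend is supposed to force output
    # matching this regex on every call.
    response_format={"type": "regex", "value": r"\d{4}-\d{2}-\d{2}"},
    max_tokens=20,
)
print(response.choices[0].message.content)
```

With a constrained backend, the returned content should match the regex on every call; the report is that it intermittently does not.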
-
I ran the command like this:
```bash
bun x humanifyjs local responsez.js
ggml_vulkan: Found 1 Vulkan devices:
Vulkan0: NVIDIA GeForce GTX 1070 (NVIDIA) | uma: 0 | fp16: 0 | warp size: 32
[nod…
```
-
### Bug Report
GPT4All was working well before the recent update. Today I updated to v3.1.0. After that, when I load a model, it fails instead of loading the model.
### Steps to Reproduce
Open gpt…
-
### What is the issue?
I get a CUDA out-of-memory error when sending a large prompt (about 20k+ tokens) to the Phi-3 Mini 128k model on a laptop with an Nvidia A2000 with 4 GB of VRAM. At first about 3.3 GB of GPU RAM and …
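For reference, a reproduction along these lines could look like the sketch below. It assumes Ollama's REST API on the default port and the `phi3` model tag; both are assumptions, and the prompt is synthetic rather than the reporter's:

```python
# Sketch of a long-prompt request against Ollama's REST API (assumptions:
# default port 11434, "phi3" tag for Phi-3 Mini; prompt is synthetic).
import requests

long_prompt = "word " * 20000  # roughly a 20k-token prompt

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "phi3",
        "prompt": long_prompt,
        "stream": False,
        # num_ctx sets the context window Ollama allocates; large values
        # grow the KV cache, which is what can exceed 4 GB of VRAM.
        "options": {"num_ctx": 32768},
    },
    timeout=600,
)
print(resp.json())
```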
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue](ht…
-
### Your current environment
The output of `python collect_env.py`
```text
Your output of `python collect_env.py` here
```
### 🐛 Describe the bug
I deployed the vLLM server using the below…
-
Hi,
So I was training a new tokenizer from the Llama tokenizer (meta-llama/Llama-2-7b-hf) on a medium-sized corpus (a FineWeb-10BT sample: 15 million documents with an average length of 2,300 characters). A…
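For context, the standard recipe for this looks roughly like the sketch below. It assumes `transformers`' `train_new_from_iterator` and streaming FineWeb through `datasets`; the batch size and vocab size are placeholders, not the reporter's settings:

```python
# Sketch of retraining a tokenizer from an existing one (assumptions:
# transformers fast-tokenizer train_new_from_iterator, datasets streaming;
# vocab_size and batch_size are placeholders).
from datasets import load_dataset
from transformers import AutoTokenizer

base = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
corpus = load_dataset("HuggingFaceFW/fineweb", "sample-10BT",
                      split="train", streaming=True)

def batch_iterator(batch_size=1000):
    # Yield batches of raw text so the trainer never holds the full
    # 15M-document corpus in memory.
    batch = []
    for example in corpus:
        batch.append(example["text"])
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        yield batch

new_tokenizer = base.train_new_from_iterator(batch_iterator(), vocab_size=32000)
new_tokenizer.save_pretrained("llama-retrained-tokenizer")
```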
-
![image](https://github.com/user-attachments/assets/1231e002-8c11-4251-bba2-1fb02a067007)
Hi!
I am fine-tuning LLaMA3 on the hh-rlhf dataset using SimPO and noticed that the reward/chosen rewar…
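For reference, a small sketch of how the SimPO objective and its implicit "reward" terms are typically defined (following the SimPO paper's formulation; the variable names and numbers below are illustrative, not from this run). Note that `reward/chosen` here is beta times a length-normalized log-probability, so it is typically negative:

```python
# Illustrative SimPO loss (per the SimPO paper): implicit rewards are
# length-normalized log-probs scaled by beta, with a target margin gamma.
# All names and values are illustrative only.
import torch
import torch.nn.functional as F

def simpo_loss(logp_chosen, logp_rejected, len_chosen, len_rejected,
               beta=2.0, gamma=0.5):
    # Length-normalized average log-probs act as implicit rewards.
    reward_chosen = beta * logp_chosen / len_chosen
    reward_rejected = beta * logp_rejected / len_rejected
    # Bradley-Terry-style loss with a target margin gamma.
    loss = -F.logsigmoid(reward_chosen - reward_rejected - gamma)
    return loss, reward_chosen, reward_rejected

loss, r_c, r_r = simpo_loss(
    logp_chosen=torch.tensor(-120.0), logp_rejected=torch.tensor(-180.0),
    len_chosen=100, len_rejected=120,
)
print(loss.item(), r_c.item(), r_r.item())  # reward_chosen is -2.4 here
```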
-
Any ideas why it might be slow? For comparison, I'm using KoboldCPP with the same Mistral model and it answers immediately in real time, almost like ChatGPT (I have an RTX 4090). It also starts in like 15 seco…
-
When adding llama_cpp-rs to my Cargo.toml, the bundled llama.cpp seems to be locked to an older version. I'm trying to use Phi-3 128k in a project and I'm unable to, because the [PR that was merged into llama.cpp](h…