-
### Describe the issue
Issue:
I have fine-tuned the Mistral LLaVA model with a sample dataset and the training went well.
Here are the training commands:
```
deepspeed llava/train/train_mem.py -…
```
-
I am trying to finetune Qwen-2.5 Coder-7B-Instruct on my custom dataset but am getting the following error:
```
ValueError: Unsloth: Untrained tokens of [[]] found, but embed_tokens & lm_head not t…
```
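If it helps with triage: this Unsloth check fires when the tokenizer contains tokens whose embeddings were never trained while `embed_tokens` and `lm_head` are frozen. A minimal sketch of the commonly suggested workaround follows, assuming the Hugging Face repo id `Qwen/Qwen2.5-Coder-7B-Instruct` and placeholder LoRA hyperparameters; none of these values come from the report above.
```
# Sketch only: make embed_tokens and lm_head trainable so newly added tokens
# get real embeddings. All hyperparameters here are placeholders.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="Qwen/Qwen2.5-Coder-7B-Instruct",  # assumed repo id
    max_seq_length=4096,
    load_in_4bit=True,
)

model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=[
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
        "embed_tokens", "lm_head",  # include these so the untrained tokens can be learned
    ],
    use_gradient_checkpointing="unsloth",
)
```
Making the embedding and output matrices trainable raises memory use noticeably, so a smaller batch size (or removing the offending added tokens instead) may be needed.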
-
### Before submitting your bug report
- [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions
- [X] I'm not able to find an [open issue](ht…
-
### What is the issue?
Mixtral 8x22b instruct outputs are either empty or gibberish.
I have tried various quantizations: q4, q4_k_m, q5, etc. All seem problematic.
Other models (e.g., llama3, com…
-
All fine-tunes for Mistral 7B using SageMaker JumpStart are currently failing with:
"ImportError: cannot import name 'insecure_hashlib' from 'huggingface_hub.utils' (/opt/conda/lib/python3.10/site-…
-
### Describe the bug
I'm experiencing unexpected behavior when trying to load the following model:
Model name: Mistral-Large-Instruct-2407-IMat-GGUF
Quantization: Q6_K, size 100.59GB
When…
-
It might be useful to have a workbook for TripleO users which does the following:
1. Uploads a directory of playbooks to a Swift container (and runs them from there)
2. Builds the Ansible Inventor…
-
**Is your feature request related to a problem? Please describe.**
Tried to run a custom 40B model whose weights can be loaded within two 80GB GPUs' VRAM.
lmcache is able to load small models within sin…
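For comparison, a minimal sketch of how a 40B checkpoint is normally sharded across the two GPUs with plain vLLM tensor parallelism; the model path, prompt, and sampling settings are placeholders, and the LMCache connector configuration that actually fails here is deliberately left out.
```
# Sketch only: shard the weights across both 80GB GPUs with vLLM tensor parallelism.
# Paths and parameters are placeholders, not taken from the report.
from vllm import LLM, SamplingParams

llm = LLM(
    model="/path/to/custom-40b-model",  # hypothetical local checkpoint
    tensor_parallel_size=2,             # split the weights across the two GPUs
)

outputs = llm.generate(
    ["Describe what a KV cache stores."],
    SamplingParams(max_tokens=64),
)
print(outputs[0].outputs[0].text)
```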
-
env lib detail:
inf2.24xlarge
ubuntu@ip-172-31-12-212:~/vllm$ pip list|grep -i neuron
aws-neuronx-runtime-discovery 2.9
libneuronxla 2.0.755
neuro…
-
First, thanks for your great work. I've tried to fine-tune the Yi-Coder-9B-Chat model on my own dataset, but I ran into the following problems.
## Problems
'grad_norm' becomes nan when I try to finetune t…
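Not a fix for this report, but a sketch of the mitigations usually tried first when grad_norm turns NaN during fine-tuning: prefer bf16 over fp16, lower the learning rate, and clip gradients. Every value below is a placeholder rather than the configuration used in the issue.
```
# Sketch of common NaN mitigations for fine-tuning; all values are placeholders.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="yi-coder-9b-sft",  # placeholder output path
    bf16=True,                     # fp16 overflow is a frequent cause of nan grad_norm
    fp16=False,
    learning_rate=1e-5,            # assumed conservative value
    max_grad_norm=1.0,             # clip exploding gradients
    warmup_ratio=0.03,
    logging_steps=10,
)
```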