-
Hi, has anyone tried to replicate the MMLU results for Guanaco (OASST1) reported in Table 5 of the paper?
I have tried with the original training scripts provided at `./scripts/finetune_guanaco*.sh`.…
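For context, MMLU-style evaluation is usually scored by picking, for each question, the answer letter the model assigns the highest score and comparing it to the gold label. A minimal sketch of that scoring step (function and variable names here are illustrative, not from the qlora repo):

```python
def mmlu_accuracy(letter_scores, gold_labels):
    """Score MMLU-style multiple choice: for each question, pick the
    answer letter ('A'..'D') with the highest model score and compare
    it to the gold answer letter. Returns overall accuracy."""
    correct = 0
    for scores, gold in zip(letter_scores, gold_labels):
        pred = max(scores, key=scores.get)  # highest-scoring letter
        correct += (pred == gold)
    return correct / len(gold_labels)
```

Small differences in prompt formatting or in which logits are compared can shift the reported accuracy by a point or two, which may explain replication gaps.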
-
### System Info
transformers version -> 4.33
python version -> 3.10.6
I am trying to fine-tune this Hugging Face model: NousResearch/Llama-2-70b-chat-hf
with this Hugging Face dataset: mlabonne/gua…
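A 70B model generally has to be fine-tuned with 4-bit quantization plus LoRA adapters (QLoRA-style). A minimal configuration sketch with `transformers` and `peft`, assuming recent versions; all parameter values are illustrative, not the issue author's settings:

```python
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

# 4-bit NF4 quantization (QLoRA-style); values are illustrative
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

# LoRA adapter config; target_modules depend on the model architecture
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    lora_dropout=0.1,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
```

These two configs would then be passed to `AutoModelForCausalLM.from_pretrained(..., quantization_config=bnb_config)` and `peft.get_peft_model`, respectively.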
-
Hi, while following the instructions for Llama-3-8B-Instruct
in https://colab.research.google.com/drive/1XamvWYinY6FOSX9GLvnqSjjsNflxdhNc?usp=sharing
I ran into what seems like a glitch.…
-
Hello, I am trying to fine-tune Mixtral 8x7B, but after training for a while the loss stops decreasing, and the output also has some problems.
The config used is as follows:
```python
# Copyright (c) OpenMMLab. All rights reserved.
import torch
from datasets import load_dataset
from mmengine.dataset im…
-
If you run into a problem and need our help, please describe it from the following angles so that we can understand or reproduce your error (learning how to ask a good question not only helps us understand you, it is also a self-check process):
1. Which script did you use, and with what command?
CUDA_VISIBLE_DEVICES=0,1,6,7 python generate.py
2. What were your parameters (script parameters, command-line parameters)?
parser.add_argument("--model_…
-
@edbeeching and I noticed that sometimes trained SFT models do not learn to stop their generations. In other words, the model never learns to generate EOS tokens.
Upon some digging, I noticed this is mainl…
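One common mitigation (not necessarily what any particular trainer does internally) is to make sure every training sequence explicitly ends with the EOS token, so the model actually sees it during SFT. A minimal sketch with illustrative names:

```python
def append_eos(token_ids, eos_id):
    """Ensure a tokenized training example ends with the EOS token,
    so the model observes EOS during SFT and learns when to stop."""
    if not token_ids or token_ids[-1] != eos_id:
        return token_ids + [eos_id]
    return token_ids
```

If the chat template or packing logic silently drops EOS, the model only ever sees sequences truncated mid-text and has no signal for stopping.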
-
**Describe the bug**
Using the PromptNode with the HF Inference API endpoint and the model `timdettmers/guanaco-33b-merged` throws a recursion error, due to (at least) a missing `unk_token` of t…
-
**STEP 3**
Give Feedback / Get Help: https://github.com/BerriAI/litellm/issues/new
LiteLLM.Info: If you need to debug this error, use `litellm.set_verbose=True`.
AGENT ERROR:
O…
-
Is anyone else having this issue when using the [finetune_guanaco_7b.sh](https://github.com/artidoro/qlora/blob/main/scripts/finetune_guanaco_7b.sh) script? I keep seeing the evaluation loss diverge r…
-
1. Which script did you use, and with what command?
bash scripts/finetune.sh
2. What were your parameters (script parameters, command-line parameters)?
Parameters:
TOT_CUDA="0,1,3"
CUDAs=(${TOT_CUDA//,/ })
CUDA_NUM=${#CUDAs[@]}
PORT="12345"
DATA_PATH="data/newfl_data.json" #"../data…
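The `TOT_CUDA` lines above use bash parameter expansion to turn the comma-separated device list into an array and count the GPUs. A standalone sketch of the same trick:

```shell
TOT_CUDA="0,1,3"            # comma-separated GPU ids
CUDAs=(${TOT_CUDA//,/ })    # replace commas with spaces, split into an array
CUDA_NUM=${#CUDAs[@]}       # number of GPUs in the list
echo "$CUDA_NUM"
```

`CUDA_NUM` is typically passed on as `--nproc_per_node` so the launcher spawns one process per visible GPU.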