-
Context:
Environment: Google Colab (Pro version with a V100 GPU) for training.
Tool: Hugging Face AutoTrain for fine-tuning a language model.
Sequence of Events:
**Initial Training:**
…
-
Any plans to add a recipe for further pre-training on custom data with optional tokenizer vocab extension in the style of [chinese-llama](https://github.com/ymcui/Chinese-LLaMA-Alpaca/blob/main/README…
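For context, the vocab-extension half of such a recipe reduces to an embedding resize. A much-simplified sketch (Chinese-LLaMA merges a newly trained SentencePiece model rather than calling `add_tokens`, and the base model and token list below are placeholders):

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Placeholder base model; Chinese-LLaMA starts from the original LLaMA weights.
base = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Placeholder tokens; in practice these come from a SentencePiece model
# trained on the custom corpus and merged into the base vocab.
new_tokens = ["你好", "世界"]
num_added = tokenizer.add_tokens(new_tokens)

# New embedding rows are randomly initialized and must be learned
# during the continued pre-training stage.
model.resize_token_embeddings(len(tokenizer))
print(f"added {num_added} tokens; vocab size is now {len(tokenizer)}")
```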
-
**Describe the bug**
I am trying to finetune Mistral Instruct V0.2 and I keep getting this error.
`TypeError: LoraConfig.__init__() got an unexpected keyword argument 'use_rslora'`
**To Reproduc…
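(The repro is truncated; for reference, a minimal config sketch that triggers this `TypeError` on peft releases predating rsLoRA support. The hyperparameters are illustrative, and upgrading peft is the usual fix.)

```python
# Illustrative values only, not the reporter's actual config.
# `use_rslora` only exists in newer peft releases; older ones raise
# the TypeError above. Upgrading usually resolves it:
#   pip install -U peft
from peft import LoraConfig

config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    use_rslora=True,  # rank-stabilized LoRA scaling; unknown kwarg on old peft
)
```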
-
### What is the issue?
Moving from 0.3.6 to 0.3.7-rc5, Ollama no longer uses both GPUs for a single model when the model will not fit on one card. If I load two models, though, it will use the secon…
-
```python
from transformers import LlavaNextProcessor, LlavaNextForConditionalGeneration
import torch
from PIL import Image
import requests
from modelscope import snapshot_download
from transformers impo…
```
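(The snippet is cut off; a minimal LLaVA-NeXT inference sketch using these imports might look like the following. The model id, prompt template, and image URL are illustrative, not taken from the report.)

```python
# Assumed model id; the report's actual checkpoint is unknown.
model_dir = snapshot_download("llava-hf/llava-v1.6-mistral-7b-hf")
processor = LlavaNextProcessor.from_pretrained(model_dir)
model = LlavaNextForConditionalGeneration.from_pretrained(
    model_dir, torch_dtype=torch.float16, device_map="auto"
)

url = "https://example.com/image.jpg"  # placeholder image URL
image = Image.open(requests.get(url, stream=True).raw)
prompt = "[INST] <image>\nDescribe this image. [/INST]"  # Mistral-style template

inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=100)
print(processor.decode(out[0], skip_special_tokens=True))
```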
-
### What happened?
The llama.cpp tokenizer for Phi-3 has odd behavior: repeatedly re-tokenizing the same text keeps adding whitespace to the first non-BOS token. This has several issues:
…
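(A hypothetical roundtrip check with llama-cpp-python that would surface the behavior; the GGUF path is a placeholder.)

```python
from llama_cpp import Llama

llm = Llama(model_path="phi-3-mini.gguf", vocab_only=True)  # placeholder path

text = b"Hello world"
for i in range(3):
    toks = llm.tokenize(text, add_bos=False)
    text = llm.detokenize(toks)
    # The report implies whitespace accumulates on the first token
    # with each tokenize/detokenize pass.
    print(i, toks[:3], text)
```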
-
**Describe the bug**
Even after applying ZeRO-3, model parameters are replicated, not partitioned, across all the available GPUs.

**To Reproduce**
When I run the below code with this command: `deepspee…
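(The command is truncated; below is a hypothetical stage-3 sanity check, not the reporter's script. Under ZeRO-3, each rank should see a parameter's local `numel()` shrink, often to 0 outside a gather context, while `ds_numel` keeps the full count; if every rank reports the full size, the parameters were replicated.)

```python
# Launch with e.g.: deepspeed --num_gpus=2 check_zero3.py
import deepspeed
import torch

ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-5}},
    "zero_optimization": {"stage": 3},
}

model = torch.nn.Linear(4096, 4096)  # stand-in for the real model
engine, _, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

for name, p in engine.module.named_parameters():
    # ds_numel is attached by ZeRO-3; if it is missing, or local numel
    # equals the full size on every rank, nothing was partitioned.
    print(name, "local:", p.numel(), "full:", getattr(p, "ds_numel", p.numel()))
```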
-
Thank you for the handy fine-tuning guide, but I am not able to get started.
I tried using the default settings as a POC, but it ends up erroring out.
This is the output I get when using the sampl…
-
Hello,
I am testing out the SFT stage of the example on a `p3.16xlarge` machine, but it OOMs. Is there anything that I am missing in my configs?
Note: I commented out the data so that it pick…
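(Config suggestions, not from the report: a `p3.16xlarge` has 8× V100 16 GB, so typical OOM levers for an SFT stage look like the illustrative settings below.)

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,   # smallest micro-batch
    gradient_accumulation_steps=16,  # preserve the effective batch size
    gradient_checkpointing=True,     # trade compute for activation memory
    fp16=True,                       # V100 has no bf16 support
)
```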
-
### What is the issue?
Steam Deck GPU not supported (apparently)
Logs:
> time=2024-03-19T11:24:28.162Z level=INFO source=images.go:806 msg="total blobs: 54"
> time=2024-03-19T11:24:28.420Z lev…