-
Arrived here via instructions at
https://www.previousnext.com.au/blog/join-us-drupalgov-2020-code-sprint#set-up-a-development-environment--2
that use a gist that references `previousnext/php-apache:…
-
# Description
The first major difficulty in training an AI assistant is getting a dataset rich enough and big enough to start training at all.
ChatLLaMA needs three different types of da…
-
[issue]
The fine-tuning step doesn't increase the scores (it even decreases them).
Please refer to the green line in the chart below.
![image](https://user-images.githubusercontent.com/39104…
-
I know the repo's README mentions that this model apparently can't code because consecutive spaces get merged, and this has been discussed in #40.
However, I did some fine-tuning on …
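
For anyone who wants to reproduce the space-merging behaviour locally, a minimal round-trip sketch is below; the checkpoint path is a placeholder, and whether the indentation actually collapses depends on the tokenizer of the model discussed here.

```python
from transformers import AutoTokenizer

# Placeholder path; substitute the checkpoint of the model discussed in this issue.
tok = AutoTokenizer.from_pretrained("path/to/model")

code = "def add(a, b):\n    return a + b"
round_tripped = tok.decode(tok.encode(code), skip_special_tokens=True)

# If the tokenizer merges runs of whitespace, the four-space indent will not survive.
print(repr(round_tripped))
```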
-
### 📚 The doc issue
Currently, our model loading and saving involve both PyTorch and HF formats, and instructions for stage123 and inference need to be added to avoid misunderstanding and incorrect usage by users…
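
As a rough illustration of the two formats such instructions would need to cover, here is a minimal sketch using the Hugging Face `transformers` API; the paths are placeholders, and the actual stage123/inference scripts may wrap these calls differently.

```python
import torch
from transformers import AutoModelForCausalLM

# HF format: a directory containing config.json plus the weight shards.
model = AutoModelForCausalLM.from_pretrained("path/to/hf_checkpoint")
model.save_pretrained("out/hf_checkpoint")

# Plain PyTorch format: a single state_dict file.
torch.save(model.state_dict(), "out/pytorch_model.bin")
model.load_state_dict(torch.load("out/pytorch_model.bin", map_location="cpu"))
```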
-
### Contact Details
Here, on GitHub, preferably.
### Version
5.7.4
### Description
The build fails with error messages like the following on aarch64 Linux machines, which should compile wolfssl …
-
### Model description
Here is the model description:
> gte-Qwen1.5-7B-instruct is the latest addition to the gte embedding family. This model has been engineered starting from the [Qwen1.5-7B](https:…
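
If it helps with evaluating an integration, a minimal loading sketch is below; it assumes the Hugging Face repo id is `Alibaba-NLP/gte-Qwen1.5-7B-instruct` and that `trust_remote_code=True` is required, as embedding models shipping custom code usually need.

```python
from sentence_transformers import SentenceTransformer

# Assumed repo id; trust_remote_code is needed if the model ships custom modeling code.
model = SentenceTransformer("Alibaba-NLP/gte-Qwen1.5-7B-instruct", trust_remote_code=True)

embeddings = model.encode(["How do gte embeddings handle long documents?"])
print(embeddings.shape)
```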
-
`llm embed` has the following training script. I don't know how to adjust hyperparameters like `train_batch_size`, learning rate, `warmup_ratio`, ... (a sketch of where these usually live follows the command below).
```bash
torchrun --nproc_per_node=8 run_dense.py \
    --output_…
```
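
Assuming `run_dense.py` builds a standard Hugging Face `TrainingArguments` object (common for dense-embedding training scripts, but not confirmed here), the hyperparameters in question usually map to the fields sketched below; the field names come from the `transformers` API, not from the script itself.

```python
from transformers import TrainingArguments

# Sketch only: check run_dense.py's argument parser for the real flag names.
args = TrainingArguments(
    output_dir="out/dense",
    per_device_train_batch_size=8,   # often surfaced as --per_device_train_batch_size
    learning_rate=2e-5,              # --learning_rate
    warmup_ratio=0.1,                # --warmup_ratio
    num_train_epochs=3,              # --num_train_epochs
)
```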
-
Hi, I've been reading your paper, and I find it fascinating that, through grokking, a transformer model can reach high accuracy even on evaluation data. This is completely different from what I knew, namely that overfitting is bad. I …
-
### Question
I have two questions.
1. I followed the instructions in scripts/v1.5 to pre-train and fine-tune the model. After pre-training I get mm_projector.bin, and after fine-tuning I get adap…