supervised-finetuning Search Results

596 results
for supervised-finetuning

microsoft/DeepSpeedExamples #920

FileNotFoundError: [Errno 2] No such file or directory: 'num…

After the installation finished successfully, I got this error when I ran this command: python e2e_rlhf.py --actor-model facebook/opt-1.3b --reward-model facebook/opt-350m --deployment-type single_gpu ![Capture](…

zhiwentian updated 1 month ago
4
microsoft/DeepSpeedExamples #924

AttributeError: 'DeepSpeedEngine' object has no attribute 'm…

https://github.com/microsoft/DeepSpeedExamples/blob/957ae3141946daf9a6bc5731e261032a13a82f05/applications/DeepSpeed-Chat/training/step1_supervised_finetuning/main.py#L367 After training for just one epoch, I got…

lovychen updated 1 month ago
1
yangzhipeng1108/DeepSpeed-Chat-ChatGLM #9

AutoModelForCausalLM

The AutoModelForCausalLM classes do not include chatglm. How did you solve this?

Altrouge7 updated 10 months ago
4
pavancm/CONTRIQUE #1

Performance of supervised fine-tuning using CONTRIQUE?

Hello, I would like to know how supervised fine-tuning with the CONTRIQUE encoder performs compared to an ImageNet-pretrained model, but I can't find such an experiment in the paper. Can you share the results if you have done…

xinyeCH updated 2 years ago
1
shibing624/MedicalGPT #410

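When running baichuan2-13b on a Windows machine with four 3090 GPUs, the model does not seem to be distributed across the GPUs; GPU memory fills up immediately and it goes OOM.…

Hello Mr. Xu, I am using a Windows machine with four 3090 GPUs, each with 24 GB of GPU memory. Because it is a Windows platform, I wrote a bat script to run it: `@echo off set CUDA_VISIBLE_DEVICES=0,1,2,3 call python supervised_finetuning.py ^ --model_type baichuan ^ --model_name_or_p…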

Ruiruiz30 updated 1 month ago
3
facebookresearch/contriever #9

Script finetuning on MSMarco

Thanks a lot for releasing the code and the scripts for pre-training. I'm trying to reproduce the numbers on MS-Marco after fine-tuning and it would be great if you could also release the scripts f…

gangiswag updated 2 years ago
1
microsoft/DeepSpeed #3214

[BUG]Out of memory when training, and is streaming mode supp…

## description - datasets: 1.1 GB Chinese corpus, about 2 million lines - devices - CPU=8 GPU=1 Memory=320G Node=1 Type=A100-SXM-80GB ## question The training process was always killed …

wqw547243068 updated 1 year ago
5
albertan017/LLM4Decompile #20

Training budget estimation

We trained our model using [AnghaBench](https://github.com/brenocfg/AnghaBench) compilation results across four optimization levels (O0~O3), selecting samples under 1024 tokens. That gave us a total o…

QiuJYWX updated 2 months ago
3
mims-harvard/UniTS #28

If the pre-training of UniTS includes a test dataset, can it…

I have a question: does the training dataset used to pre-train UniTS contain the test dataset? If so, then the training process of time series prediction and time series completion is essentially a self-…

IkeYang updated 2 months ago
10
mims-harvard/UniTS #14

IncompatibleKeys Error when load pre-trained model to do the…

Nice work and I have a question here: I am trying to pre-train and finetune a model on my own datasets. However, some warnings were raised when loading the pre-trained model during finetuning: …

websitefingerprinting updated 5 months ago
4
