-
Hi, I am looking for the ConvNeXt-V2-H model fine-tuned with 22k supervision but without the subsequent 1k supervised fine-tuning. I want to fine-tune it on ADE20K to reproduce the result in Table 7 of the paper.
-
@shwoo93 @s9xie hi, did you make a comparison of the effect of image reconstruction between the pretrained V1 and V2 models? And how do they compare with an MAE-pretrained ViT?
-
How can I use GraphCL for fully unsupervised graph clustering?
So far, every method I've found for graph clustering is actually for node clustering, or is not a fully unsupervised learning metho…
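A hedged sketch of one common recipe (not from the GraphCL repo itself): pretrain a GraphCL encoder contrastively, read out one embedding per graph, then cluster those embeddings with k-means. The `graph_embeddings` below are illustrative toy vectors, and a tiny dependency-free k-means is inlined so the sketch is self-contained.

```python
def kmeans(points, k, iters=10):
    """Naive k-means over tuples of floats; returns one cluster label per point."""
    # naive init: spread initial centers evenly across the data
    centers = [points[i * len(points) // k] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # assign each point to the nearest center (squared Euclidean distance)
        labels = [
            min(range(k), key=lambda c: sum((p - q) ** 2 for p, q in zip(pt, centers[c])))
            for pt in points
        ]
        # recompute each center as the mean of its assigned points
        for c in range(k):
            members = [pt for pt, l in zip(points, labels) if l == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels

# toy stand-ins for GraphCL graph-level embeddings: two well-separated groups
graph_embeddings = [(0.0, 0.1), (0.1, 0.0), (5.0, 5.1), (5.1, 5.0)]
labels = kmeans(graph_embeddings, k=2)
```

In practice one would replace the toy embeddings with the encoder's pooled graph representations and use a library k-means; the point is only that no labels enter the pipeline at any stage.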
-
Dear authors,
First, congratulations on the paper's acceptance, and thank you for making the model weights available. I read your paper carefully, but I still have questions I hope you …
-
**What's the issue, what's expected?**:
Error when using MS-AMP to do LLM SFT.
MS-AMP DeepSpeed config (each `opt_level` tried separately):
```json
"msamp": {
  "enabled": true,
  "opt_level": "O1|O2|O3",
  "use_te": false
}
```
…
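For context, a minimal sketch of how the `msamp` block nests inside a full DeepSpeed config; the surrounding keys are standard DeepSpeed options, but the values here are illustrative assumptions, not taken from this report:

```json
{
  "train_batch_size": 32,
  "bf16": { "enabled": false },
  "zero_optimization": { "stage": 2 },
  "msamp": {
    "enabled": true,
    "opt_level": "O2",
    "use_te": false
  }
}
```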
-
I followed the README of DeepSpeed-Chat.
training/step1_supervised_finetuning/training_scripts/single_node/run_1.3b.sh
training/step2_reward_model_finetuning/training_scripts/single_node/run_3…
-
**Describe the bug**
I am getting the following error while attempting to run DeepSpeed-Chat step 3 with the actor model CarperAI/openai_summarize_tldr_sft (GPT-J 6B) and critic model CarperAI/openai…
-
We are following the concerns being raised about this study publicly on this forum (#23, #20, #21), on PubPeer (https://pubpeer.com/publications/C8CFF9DB8F11A586CBF9BD53402001), and privately. Mo…
-
## Paper links
- [arXiv](https://arxiv.org/abs/2304.07193)
- [github](https://github.com/facebookresearch/dinov2)
## Publication date (yyyy/mm/dd)
2023/04/14
## Summary
### Research Question
A concise statement of the question the research aims to answer…
-
### Feature request
Allow passing a 2D attention mask in `model.forward`.
### Motivation
With this feature, it would be much easier to avoid cross-context contamination during pretraining and super…
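A hedged sketch of the requested semantics (illustrative, not the actual `transformers` API): given per-token segment ids marking which packed document each token belongs to, the 2D mask lets token `i` attend to token `j` only when both tokens are in the same document and `j` is not in the future (causal LM case). This is what prevents cross-context contamination when multiple documents share one sequence.

```python
def block_causal_mask(segment_ids):
    """Build an n x n 0/1 mask: attend within the same segment, causally."""
    n = len(segment_ids)
    return [
        [1 if segment_ids[j] == segment_ids[i] and j <= i else 0
         for j in range(n)]
        for i in range(n)
    ]

# two documents packed into one row: [doc0, doc0, doc0, doc1, doc1]
mask = block_causal_mask([0, 0, 0, 1, 1])
```

With the 1D mask available today this block-diagonal structure cannot be expressed, which is why the feature request asks for a full 2D mask in `model.forward`.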