-
Hi,
I tried to further pretrain XLNet on a domain-specific corpus, as is recommended for BERT, but I got worse results. Has anyone tried further pretraining? Does it work?
Thanks!
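For reference, Hugging Face Transformers ships an example script for XLNet-style permutation language modeling that can be pointed at a domain corpus. A hedged sketch follows; the script path, flag names, and `domain_corpus.txt` are assumptions to verify against your Transformers version:

```shell
# Continue pretraining XLNet on a domain corpus with the HF example script
# (run_plm.py implements permutation language modeling, XLNet's objective).
# All paths and flags below are illustrative -- check them against your install.
python examples/pytorch/language-modeling/run_plm.py \
  --model_name_or_path xlnet-base-cased \
  --train_file domain_corpus.txt \
  --do_train \
  --per_device_train_batch_size 8 \
  --num_train_epochs 3 \
  --output_dir ./xlnet-domain-adapted
```

Worse downstream results after further pretraining often trace back to too high a learning rate or too small a domain corpus, so those are worth checking first.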
-
### Describe the feature
I am following [Colossal-LLaMA-2](https://github.com/hpcaitech/ColossalAI/tree/main/applications/Colossal-LLaMA-2) to continue pretraining. I am using an 8×A100 80GB node. And…
-
Hi, could you please share some caption examples used for pretraining on AudioSet? I'm a little confused about the [mask] token setting for the CLIP text encoder.
-
Is there a guide for fine-tuning DINO on a custom ImageNet-formatted dataset (after pretraining on custom data)?
-
Is there training code for the Stable Diffusion text-to-image model (all training steps, including pretraining of the autoencoder with a KL loss)? I could only find the inference command in the repo.
Than…
-
When I install the environment, many packages conflict with each other.
Could you please release a Docker image for training?
Recently, I read your new work “Self-Supervised Pretraining for Large-S…
-
## Problem statement
1. Find a task-agnostic way to use unlabeled data directly, as is done in NLP, rather than only indirectly.
- How NLP uses it: unsupervised pretraining -> supervised fine-tuning
2. Use unlabeled data…
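The unsupervised-pretrain then supervised-fine-tune recipe these notes reference can be illustrated with a toy NumPy sketch. The tiny tied-weight autoencoder and logistic head below are illustrative assumptions for the two-stage pipeline, not the paper's actual method:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: plenty of unlabeled 2-D points, few labeled ones.
# Labels (used only for fine-tuning) mark the sign of x0 + x1.
X_u = rng.normal(size=(500, 2))                  # unlabeled corpus
X_l = rng.normal(size=(64, 2))                   # small labeled set
y_l = (X_l.sum(axis=1) > 0).astype(float)

# --- Stage 1: unsupervised pretraining (tied-weight linear autoencoder) ---
W = rng.normal(scale=0.1, size=(2, 2))           # encoder; decoder is W.T
for _ in range(200):
    E = X_u @ W @ W.T - X_u                      # reconstruction error
    grad = 2 * (X_u.T @ E @ W + E.T @ X_u @ W) / len(X_u)
    W -= 0.01 * grad

# --- Stage 2: supervised fine-tuning (logistic head on frozen encoder) ---
H = X_l @ W                                      # frozen pretrained features
w, b = np.zeros(2), 0.0
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-(H @ w + b)))       # sigmoid predictions
    w -= 0.5 * H.T @ (p - y_l) / len(H)
    b -= 0.5 * (p - y_l).mean()

acc = (((H @ w + b) > 0) == (y_l > 0.5)).mean()  # training accuracy of the head
```

Only Stage 1 touches the large unlabeled set; Stage 2 trains a small head on the frozen representation, which is the design choice the "pretrain -> fine-tune" paradigm makes.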
-
## 🐛 Bug
When I run the fairseq-hydra-train script with the pretraining config (large) while loading the XLSR-53 checkpoint, I get a KeyError: 'max_exp_avg_sq' in fairseq/optim/adam.py after some traini…
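One plausible cause (an assumption, not confirmed from the truncated traceback): `max_exp_avg_sq` is the extra per-parameter state PyTorch's Adam keeps only when `amsgrad=True`, so an optimizer state saved without AMSGrad will lack that key if the resumed config enables it. A minimal PyTorch sketch of the difference:

```python
import torch

# Two Adam optimizers over the same parameters: one with AMSGrad, one without.
model = torch.nn.Linear(4, 2)
opt_plain = torch.optim.Adam(model.parameters())               # amsgrad=False
opt_ams = torch.optim.Adam(model.parameters(), amsgrad=True)   # amsgrad=True

model(torch.randn(8, 4)).sum().backward()
opt_plain.step()
opt_ams.step()

# Only the AMSGrad optimizer stores 'max_exp_avg_sq' per parameter,
# so its state dict has a key the plain one does not.
plain_keys = set(opt_plain.state[model.weight])
ams_keys = set(opt_ams.state[model.weight])
print("max_exp_avg_sq" in plain_keys, "max_exp_avg_sq" in ams_keys)
```

If this is the mismatch, aligning the `amsgrad` setting between the checkpoint's optimizer and the new config (or resetting the optimizer state) would be the thing to try.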
-
Dear Sir/Madam,
Thank you for such a great repo, but I am confused by the result for the EC downstream task. The result of multiview contrast in the paper "PROTEIN REPRESENTATION LEARNING BY GEOMETR…
-
Hi, what is the MVM accuracy of your pretrained model? I only got about 30% during pretraining and want to know whether that is normal.