-
Working on a keras.io guide for pretraining a keras-nlp transformer model from scratch, using the WordPiece tokenizer, transformer encoder, embedding layers, and our MLM layer helpers.
Will link a dra…
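In the meantime, a rough sketch of the kind of model the guide will cover, assuming keras-nlp's public layers (all hyperparameters below are placeholders; newer releases name the MLM helpers MaskedLMMaskGenerator/MaskedLMHead):

```python
import keras_nlp
from tensorflow import keras

# Placeholder hyperparameters, chosen only for illustration.
VOCAB_SIZE, SEQ_LEN, EMBED_DIM, NUM_MASKS = 20000, 128, 256, 32

token_ids = keras.Input(shape=(SEQ_LEN,), dtype="int32")
mask_positions = keras.Input(shape=(NUM_MASKS,), dtype="int32")

# Token + position embeddings feed a small transformer encoder stack.
x = keras_nlp.layers.TokenAndPositionEmbedding(
    vocabulary_size=VOCAB_SIZE,
    sequence_length=SEQ_LEN,
    embedding_dim=EMBED_DIM,
)(token_ids)
for _ in range(3):
    x = keras_nlp.layers.TransformerEncoder(
        intermediate_dim=512, num_heads=4
    )(x)

# The MLM head predicts vocabulary probabilities at the masked positions.
outputs = keras_nlp.layers.MaskedLMHead(
    vocabulary_size=VOCAB_SIZE, activation="softmax"
)(x, mask_positions=mask_positions)

pretrainer = keras.Model([token_ids, mask_positions], outputs)
pretrainer.compile(loss="sparse_categorical_crossentropy", optimizer="adam")
```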
-
Hello,
Thank you for the wonderful work.
I was trying to reproduce the results reported in the paper for VRD.
Currently I am getting approximately the following scores:
R@20: 0.5417 R@50: 0.6160 R@100: 0…
-
Hi, thank you for the code.
Could you please provide your CIFAR-10 code for reproduction? I have followed your supplementary material and run the code for 1000 epochs of pretraining (MoCo-based). But …
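For reference, my understanding of the MoCo-style objective is the standard InfoNCE loss over a query, its momentum-encoded positive key, and a queue of negatives; a minimal PyTorch sketch (not the authors' code, all names are mine):

```python
import torch
import torch.nn.functional as F

def moco_infonce_loss(q, k, queue, temperature=0.07):
    # q:     (N, D) query features from the online encoder
    # k:     (N, D) positive key features from the momentum encoder
    # queue: (K, D) negative keys from the memory queue
    q, k, queue = (F.normalize(t, dim=1) for t in (q, k, queue))

    # Positive logits: each query against its matching key.
    l_pos = torch.einsum("nd,nd->n", q, k).unsqueeze(-1)  # (N, 1)
    # Negative logits: each query against every queued key.
    l_neg = torch.einsum("nd,kd->nk", q, queue)           # (N, K)

    logits = torch.cat([l_pos, l_neg], dim=1) / temperature
    # The positive is always at column 0.
    labels = torch.zeros(q.size(0), dtype=torch.long, device=q.device)
    return F.cross_entropy(logits, labels)
```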
-
https://github.com/training-transformers-together/hf-website-how-to-join
Demo page (updated on push): https://training-transformers-together.github.io/
- [x] intro and motivation text
- [x] liv…
-
I am trying to reproduce the RA-CNN network's performance with PyTorch.
But there are no details about how to train the APN network.
Without pretraining, the rank loss doesn't decrease.
I am wondering whether this code w…
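For reference, my reading of the paper's inter-scale ranking loss is a simple hinge on the ground-truth-class probabilities of adjacent scales; a sketch of what I am training (the margin value is my assumption):

```python
import torch

def pairwise_rank_loss(p_coarse, p_fine, margin=0.05):
    # p_coarse, p_fine: (N,) ground-truth-class probabilities at scale s
    # and the zoomed-in scale s+1. The hinge pushes the finer scale to be
    # more confident than the coarser one by at least the margin.
    return torch.clamp(p_coarse - p_fine + margin, min=0).mean()
```

If the finer scale never becomes more confident than the coarse one, this loss simply plateaus near the margin, which matches the symptom above.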
-
self.context_mlm_trans and self.context_order_trans expect a different key structure:
RuntimeError: Error(s) in loading state_dict for BertPredictionHeadTransform:
Missing key(s) in stat…
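One possible workaround, assuming the mismatch is only in the key prefixes rather than in tensor shapes (the checkpoint path and the prefix below are hypothetical):

```python
import torch
from transformers import BertConfig
from transformers.models.bert.modeling_bert import BertPredictionHeadTransform

head = BertPredictionHeadTransform(BertConfig())

# Remap the checkpoint keys so they line up with the module's own names.
state = torch.load("checkpoint.pt", map_location="cpu")
remapped = {
    k.replace("cls.predictions.transform.", ""): v for k, v in state.items()
}

# strict=False returns, rather than raises on, mismatched keys, which
# makes it easy to see exactly which names disagree.
missing, unexpected = head.load_state_dict(remapped, strict=False)
print("missing:", missing)
print("unexpected:", unexpected)
```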
-
Hi, what file corresponds to dis_saveto?
Looking forward to your reply.
-
You did a great job in the VDU field. Congratulations!
By the way, I wonder if I can replace mBART with XLM-RoBERTa in the fine-tuning process without redoing the pretraining?
-
While working on an autoencoder I started implementing the pretraining phase. It all looked good for quite a while and the weights and biases were properly shared among layers, but when I started training the…
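For reference, the weight-sharing scheme I am using ties each decoder layer to its mirrored encoder layer by reusing the transposed weight matrix and learning only a separate bias; a minimal sketch assuming PyTorch (class name and dimensions are illustrative):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TiedAutoencoder(nn.Module):
    """One-layer autoencoder whose decoder reuses the encoder weights."""

    def __init__(self, in_dim=784, hidden_dim=64):
        super().__init__()
        self.encoder = nn.Linear(in_dim, hidden_dim)
        # Only a bias is learned for the decoder; the weight is tied.
        self.decoder_bias = nn.Parameter(torch.zeros(in_dim))

    def forward(self, x):
        h = torch.relu(self.encoder(x))
        # F.linear with the transposed encoder weight: h @ W -> (N, in_dim).
        return F.linear(h, self.encoder.weight.t(), self.decoder_bias)

model = TiedAutoencoder()
x = torch.randn(8, 784)
loss = F.mse_loss(model(x), x)
```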
-
Hi, if I wanted to reproduce the results with an Nvidia A6000, how long would it take to train the model from scratch?