pretraining Search Results

1000+ results
for pretraining

Best match

guxd/DialogBERT #12

can't load pretrained model

self.context_mlm_trans and self.context_order_trans are expecting a different key-structure RuntimeError: Error(s) in loading state_dict for BertPredictionHeadTransform: Missing key(s) in stat…

rokosbasilisk updated 2 years ago
2
jerryji1993/DNABERT #65

Error when loading pretrained model for fine-tuning from a c…

Hi, a very simple issue. The error "ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group" is displayed when I try to load a pre-trained m…

danarte updated 7 months ago
1
eminorhan/optimized-mae #1

finetune not working (fused AdamW error)

`RuntimeError: params, grads, exp_avgs, and exp_avg_sqs must have same dtype, device, and layout`

eminorhan updated 4 months ago
1
FlyEgle/MAE-pytorch #12

What are the rules for setting the parameters of vit-tiny's …

Thanks for your work! I'm pretraining vit-tiny on my own dataset, but I cannot determine the settings for the decoder's parameters (depth/embed_dim/num_heads), just consistent with vit-base/large/hug…

zzzzzzyang updated 1 year ago
2
octo-models/octo #23

offline/sim evaluation recommendations

Excited by the work, great paper and open release. I am interested in testing some ideas that will involve pretraining (e.g. architecture changes, etc.), likely without access to a real-world setu…

daniellawson9999 updated 6 months ago
1
Oneflow-Inc/DLPerf #119

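tf1.x + NGC: question about two-machine training

I started two containers on two servers and installed communication tools such as openmpi inside them. A quick test with the horovodrun command suggests the connection works? ``` horovodrun -np 8 -H localhost:8 -p 10000 echo "233" 2021-01-30 03:50:03.454606: I tensorflow/stream_executor/platform/d…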

zzt941006 updated 3 years ago
7
yoheikikuta/paper-reading #53

[2020] Don't Stop Pretraining: Adapt Language Models to Doma…

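## Paper link https://arxiv.org/abs/2004.10964 ## Publication date (yyyy/mm/dd) 2020/04/23 ## Summary A paper that organizes and experimentally evaluates pre-training methods for specializing a model to a task when solving NLP tasks with a model trained on large corpora such as wiki. Pre-training for task specialization …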

yoheikikuta updated 2 years ago
11
showlab/UniVTG #24

Training Detail for Pretrain

Hello, thanks for your great work. I want to confirm that the pretrained model is validated on the val set of the QVHighlight dataset, and that the ckpt is selected by comparing R1@0.3? What's more, could …

EasonXiao-888 updated 5 months ago
6
nokitoino/DecompilerAI #6

MSP T5 Fine Tuning

Projects like CodeT5 use masked span prediction for better context understanding. Do you think this will be necessary?

pathquester updated 5 months ago
1
harshraj22/ultramnist #4

Add more visualizations

See what initial layers of the models 'see'. Use pretraining techniques that help them to see better.

harshraj22 updated 2 years ago
1
