-
Hi, I have read this nice work and found that it achieves wonderful performance. However, I have a question about the training process: is the model trained from scratch on RSI data without any pretraining…
-
Hi, I pretrained the model on UCF101, and the linear evaluation on UCF101 is 74.0946%.
![1](https://user-images.githubusercontent.com/71969945/173597775-2230691d-dc28-45e8-8b23-832779bafba9.png)
…
-
I followed your BERT pretraining recipe. However, after one week of training, the loss is still around 7.3. I use 8 GPUs with a batch size of 14 per GPU; the rest of the settings are the defaults.
INFO - 08/01/19 14:20:20 - 2:02:04 -…
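As a side note on the stalled loss: one common culprit is an effective batch size much smaller than the one the default learning rate was tuned for. A quick sanity check, sketched in plain Python (the 1e-4 base rate at batch 256 is the common BERT default, an assumption here, not necessarily this repo's setting):

```python
# Sanity check: effective batch size and the linearly scaled learning rate.
# Hypothetical baseline: 1e-4 LR at batch 256 (common BERT default, assumed).
gpus = 8
per_gpu_batch = 14
effective_batch = gpus * per_gpu_batch  # 8 * 14 = 112

base_lr, base_batch = 1e-4, 256
scaled_lr = base_lr * effective_batch / base_batch  # linear scaling rule

print(effective_batch)     # 112
print(f"{scaled_lr:.3e}")  # 4.375e-05
```

If the run used the unscaled default LR with this much smaller batch, that alone could explain slow convergence.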
-
I'd just like you to know that code under permissive licenses with attribution requirements **is possibly unsuitable for training-set inclusion.** I'm bringing this to your attention not as a lawyer,…
ell1e updated
3 months ago
-
Could you describe the complete training process for side-tuning?
As I understand it, the training process is divided into the following stages:
- the pretraining of the base model
…
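My mental model of the resulting forward pass, sketched in plain Python (a toy stand-in for the alpha-blended combination used in side-tuning, not the authors' code; `base`, `side`, and the scalar gate are hypothetical):

```python
# Minimal sketch of a side-tuning forward pass: a frozen base model's output
# blended with a small trainable side network via a gate alpha in [0, 1].
def side_tune(base, side, alpha, x):
    """Blend frozen base features with trainable side features."""
    return alpha * base(x) + (1 - alpha) * side(x)

# Toy usage with scalar "models":
base = lambda x: 2.0 * x   # stands in for the frozen pretrained model
side = lambda x: x + 1.0   # stands in for the trainable side network
print(side_tune(base, side, 0.75, 2.0))  # 0.75*4.0 + 0.25*3.0 = 3.75
```

During training, only `side` (and possibly `alpha`) would receive gradients, which is what makes the base-model pretraining stage separable from the side-tuning stage.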
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) and didn't find any similar reports.
…
-
Hi M. H. Kwon,
Your tokenization script is really helpful.
I trained a BERT model on a custom corpus using Google's scripts such as create_pretraining_data.py, run_pretraining.py, extract_features.py…
-
Hi, thanks for your great work.
While running run_pretraining.py, I keep getting OOM errors regardless of the matrix size.
I already reduced the batch size to 1, but it didn't help.
I'm using a 960M, TensorFlow-gpu…
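For what it's worth, a back-of-the-envelope estimate suggests batch size may not be the bottleneck on a 960M (typically 2–4 GB of VRAM). The 110M figure below is BERT-base's approximate parameter count, and the 4x multiplier for weights + gradients + Adam moments is a rough rule of thumb, not a measurement:

```python
# Rough float32 memory estimate for training BERT-base (back-of-the-envelope;
# ignores activations, which add more on top of this).
PARAMS = 110_000_000  # approx. BERT-base parameter count
BYTES = 4             # bytes per float32
weights_gib = PARAMS * BYTES / 1024**3
# weights + gradients + Adam first/second moments ~= 4x the weights
train_state_gib = 4 * weights_gib
print(f"weights: {weights_gib:.2f} GiB, training: {train_state_gib:.2f} GiB")
```

Even before activations, the optimizer state alone approaches a 2 GB card's capacity, which would explain an OOM at batch size 1.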
-
In `deberta.mlm`, `MaskedLayerNorm` is not imported from `deberta.ops`, and `PreLayerNorm` is undefined.
Also, does `deberta.mlm` contain the code for pretraining?
-
Thanks for your impressive work.
Could you share the pretraining code, or explain how to implement it?