-
Hi, when I run the command `python -m torch.distributed.run --nproc_per_node=8 pretrain.py --config ./configs/Pretrain.yaml --output_dir output/Pretrain`,
it shows "ERROR:torch.distributed.elastic.m…
-
Hi, I would like to ask about incorporating, on top of the MLM task, additional training objectives that benefit downstream tasks during BGE pre-training.
Specifically, my downstream…
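To illustrate the kind of change I mean, here is a minimal sketch (the auxiliary objective, the names, and the weighting are my own assumptions, not BGE's actual code) that adds an in-batch contrastive loss on top of the MLM loss as a weighted sum:

```python
import torch
import torch.nn.functional as F

def total_loss(mlm_loss, query_emb, doc_emb, aux_weight=0.1, temperature=0.05):
    """Weighted sum of the MLM loss and an auxiliary in-batch contrastive loss."""
    q = F.normalize(query_emb, dim=-1)
    d = F.normalize(doc_emb, dim=-1)
    logits = q @ d.t() / temperature                   # (B, B) similarity matrix
    labels = torch.arange(q.size(0), device=q.device)  # positive pairs on the diagonal
    contrastive = F.cross_entropy(logits, labels)
    return mlm_loss + aux_weight * contrastive
```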
-
Hi, I saw in the article that you reported the results of pre-training on SSV2, but I could not find the pre-training script and checkpoint. Could you please provide them? Or did I miss the link? Lo…
-
Thank you for the excellent work! However, I'm having difficulty reproducing the results on DHF1k using diff-sal.
I've downloaded the pre-trained checkpoint on DHF1k provided in this repository, but I'm…
-
Do you have trained model weights? I would greatly appreciate it if you could provide them!
-
Hello! I am very inspired by your work. Following it, I have some questions about pre-training on MRI data.
I want to use brain tumor MRI scans containing four modalities for pre-training; what should I do …
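To make the setup concrete, here is a minimal sketch of the input format I have in mind (the shapes and modality names are my assumptions): stacking the four co-registered modalities along the channel axis so the encoder sees a 4-channel volume.

```python
import torch
import torch.nn as nn

# Four co-registered 3D volumes of identical shape (D, H, W),
# e.g. T1, T1ce, T2, and FLAIR.
t1, t1ce, t2, flair = (torch.randn(64, 64, 64) for _ in range(4))
x = torch.stack([t1, t1ce, t2, flair], dim=0)  # (4, D, H, W): modalities as channels
x = x.unsqueeze(0)                             # (1, 4, D, H, W): add batch dimension
stem = nn.Conv3d(in_channels=4, out_channels=32, kernel_size=3, padding=1)
features = stem(x)                             # (1, 32, D, H, W)
```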
-
Hi,
I have a question:
Will BLEU be improved if the pre-trained word vectors are embedded in the neural machine translation model for retraining?
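For concreteness, here is a minimal sketch (the framework and names are my assumptions) of what I mean: initializing the model's embedding layer from the pre-trained vectors and fine-tuning it during retraining.

```python
import torch
import torch.nn as nn

# Stand-in for the loaded pre-trained matrix (vocab_size x embedding_dim);
# in practice this would be read from word2vec/GloVe/fastText files.
pretrained_vectors = torch.randn(32000, 512)

# freeze=False lets the vectors be updated during retraining;
# freeze=True would keep them fixed.
embedding = nn.Embedding.from_pretrained(pretrained_vectors, freeze=False)

token_ids = torch.tensor([[5, 17, 42]])
vecs = embedding(token_ids)  # (1, 3, 512)
```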
Looking forward to your advice or answers.
Best…
-
Using WeNet 2.2.1, the LibriSpeech test_clean test set, and the official pre-trained u2++conformer model, I have two questions about the test results:
1. recognize.py uses attention mode to decode, and a larg…
-
Why is the loss still huge when I load the pre-trained weights for training, as if they had not been loaded at all?
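A minimal debugging sketch (the checkpoint path and the tiny model are placeholders of mine): when the loss looks as though nothing was loaded, it is worth checking whether the checkpoint keys actually match the model, since `load_state_dict(strict=False)` silently skips mismatched entries but reports them in its return value.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(10, 10))                 # stand-in for the real model
ckpt = torch.load("pretrained.pth", map_location="cpu")  # path is a placeholder
state = ckpt.get("state_dict", ckpt)                     # some checkpoints nest the weights
result = model.load_state_dict(state, strict=False)
print("missing keys:", result.missing_keys)        # parameters left randomly initialized
print("unexpected keys:", result.unexpected_keys)  # checkpoint entries that matched nothing
```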
-
Also, in the Section 4.2 ablation study of the paper, I don't find any clear evidence for why the network needs to be trained in two stages (DMTimg and DMTvid), especially since you have claimed:
> Therefore, w…