-
Hi,
I really enjoyed reading this, and I'm working on a 2D pose estimation project using PoolFormer as the backbone; I also love the MetaFormer idea. Have you thought about pretraining the model using MAE? Would you…
-
**Your question**
Hello, as far as I know, Megatron only uses a padding mask in its BERT implementation.
Yet in the Hugging Face Transformers library, the LLaMA model should also take in the paddin…
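For context, a padding mask simply flags which positions in a padded batch are real tokens versus pad filler, so attention can ignore the padding. A minimal, framework-free sketch (the pad id `0` and the helper name are hypothetical, not from either library):

```python
PAD_ID = 0  # hypothetical pad token id

def padding_mask(batch):
    """Return 1 for real tokens and 0 for padding, per sequence."""
    return [[int(tok != PAD_ID) for tok in seq] for seq in batch]

# Two sequences right-padded to length 5.
batch = [[5, 9, 2, 0, 0], [7, 3, 0, 0, 0]]
print(padding_mask(batch))  # [[1, 1, 1, 0, 0], [1, 1, 0, 0, 0]]
```

In Hugging Face Transformers this corresponds to the `attention_mask` argument that model `forward` calls accept.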
-
## Description
After https://github.com/dmlc/gluon-nlp/pull/1356 (Thanks @szha and @leezu!), GluonNLP has now fully embraced the new Gluon 2.0 API. We will no longer need to worry about the `hybrid_f…
-
Hi,
I am wondering if at any point the training code for SEED-LLaMA will be made available?
-
Is there a guide for how to fine-tune DINO on a custom imagenet-formatted dataset? (after pretraining on custom data)
-
**Is your feature request related to a problem? Please describe.**
PR https://github.com/Project-MONAI/MONAI/pull/2253 implements a generic version of resnet for spatial 1/2/3D inputs. It'd be very u…
-
Hi, it seems that the pretraining datasets for stage 1 and stage 2 mentioned in the BLIP-2 paper include COCO, CC3M, CC12M, SBU, and LAION, but the config file only includes the COCO and VG datasets. Which is true …
-
Hello. The command given in the Filtering Data for Contrastive Pretraining section of https://github.com/nomic-ai/contrastors/tree/main/scripts/text is
```sh
torchrun --nproc-per-node= --dataset…
-
Hi guys,
I trained a new SentencePiece model from scratch on my pretraining dataset, yet I still get unk tokens. Do you know why? I remember it was working smoothly last summer!
Specifically:…
-
Hi! Thanks for providing such wonderful work.
I wonder whether you have tried a ResNet backbone without ImageNet pretraining.
Is it possible that a pre-trained model might become one of the keys of the p…