-
Populate the snippet name from the selected fields. Example names of real-life snippets:
- Moco_global_2018_Profile Age 3-5 Weeks_campaign_IRL Generic v2_Stream_EN_Rel
- Moco_global_2017_c…
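One way to read the request above is that the name is the selected field values joined with underscores. The sketch below illustrates that reading only: the function name `build_snippet_name`, the separator, and the way the first example name is split into fields are all assumptions, not something the issue specifies.

```python
def build_snippet_name(fields, separator="_"):
    """Join the non-empty selected field values into one snippet name.

    Hypothetical helper: the issue does not define the field list or the
    separator; both are inferred from the example names.
    """
    # Drop unset fields, trim stray whitespace, and skip empty strings.
    cleaned = [str(value).strip() for value in fields if value is not None]
    return separator.join(part for part in cleaned if part)


# Assumed split of the first example name into its selected fields.
selected = [
    "Moco", "global", "2018", "Profile Age 3-5 Weeks",
    "campaign", "IRL Generic v2", "Stream", "EN", "Rel",
]
name = build_snippet_name(selected)
```

Spaces inside a field value are kept, since the example names contain them; only empty or unset fields are skipped.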
-
Hello,
How does your pipeline work for patch classification? After extracting features from the patches, which code needs to be run to classify them? TIA.
-
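Hello!
I have a couple of questions:
Q1: During training, how many outputs does the model ultimately produce? Are there two outputs, one from the **classifier before BatchFormer** and one from the **classifier after BatchFormer**, each then compared against the ground truth separately?
Q2: In the code, as shown below, does this line represent the output of the **classifier after BatchFormer**, or is it, for the model's two classifiers…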
-
Hi,
Thank you for your valuable work. Could you please share how long you trained your model (moco v3)?
To be specific, how many epochs (for your currently released model) and how long does…
-
I ran into an issue training ResNet-50 with moco-v3. In a distributed training setup with 16 V100 GPUs (one GPU per process, batch size 4096), I get a training loss of about 27.2 in …
-
There is no code for the Figure 7 interpolation in the GitHub repository.
-
Hello, I was wondering whether there is any documentation for training the model from scratch on any type of dataset. Thanks!
-
There are small nuances in how the dynamo runners benchmark models that can make certain torchbench models fail.
Some models might be explicitly skipped, others might fail because of some dtype conve…
-
Dear author,
I have rerun your code using 64 V100 GPUs. Every setting is the same as in the paper (batch size 4096). The end-to-end fine-tuning result is almost the same as in the paper. However, the linear pro…