-
Example: https://mila.quebec/en/publications/
It would be nice to reuse the same code as in the Mila website. Not sure if that's 'easily' possible via RTD
-
### 检查清单
- [X] 合法的、无木马植入的站点。
- [X] 有实质性原创内容的 HTTPS 站点,发布过至少 5 篇原创文章,内容题材不限。
- [X] 有独立域名,非免费域名。
### 站点信息
```json
{
"title": "Lil'Log",
"url": "https://lilianweng.github.io/",
"avatar…
-
个人理解的是:
loss=apha*LOSSsoft+(1-alpha)*LOSShard
不知道temp\*temp*2.0是什么意思
-
You have submitted the 80.7 for wrn-40-2 (Teacher Model), is it class_accuracy or ss_accuracy? I got this accuracy but as a class_accuracy, but you referreing best accuracy to the ss_accuracy that is …
-
Why LVViT_Teacher return aux_head(x[:, 1:]) instead of tokens?
LVViT_Teacher:
```
x = self.norm(x)
x_cls = self.head(x[:,0])
x_aux = self.aux_head(x[:,1:])
return x_cls, x_aux
```
VisionTr…
-
Hi,
Is there any plan to also release the learned cluster centers of the FCN? One would think they are a part of the model. I understand they are not required for inference but may be handy for fine …
-
Hi Ahmed,
I was wondering about the choice of 200 training epochs for the baselines. Have you tried training for longer, e.g. matching the training time of the KE approaches? If I train for 2200 ep…
-
Greetings,
This is Aman Goyal. I am currently pursuing research in MSU in the domain of knowledge distillation and I had come across your paper and github repo.
I actually wanted to train on BDD10…
-
Hi,
I want to do the experiment using other dataset such as VOC dataset. What should I do before executing the examples/object_detection.py script?
I converted the VOC annotation to COCO f…
-
Why do you use validation dataset (500 images Cityscapes) to select best model? Valid dataset (500 images Cityscapes) is only used for evaluation, not for model selection (for saving best model), this…