self-knowledge-distillation Search Results

241 results
for self-knowledge-distillation


neuropoly/neuro.polymtl.ca #80

List publications with label filtering

Example: https://mila.quebec/en/publications/ It would be nice to reuse the same code as in the Mila website. Not sure if that's 'easily' possible via RTD

jcohenadad updated 1 year ago
1
612901/examples #31

Lil'Log

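### Checklist - [X] A legitimate site with no trojans implanted. - [X] An HTTPS site with substantive original content that has published at least 5 original articles on any topic. - [X] Has its own domain name, not a free one. ### Site information ```json { "title": "Lil'Log", "url": "https://lilianweng.github.io/", "avatar…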

612901 updated 1 year ago
1
mepeichun/Efficient-Neural-Network-Bilibili #3

Meaning of temp * temp * 2.0 * alpha

My understanding is: loss = alpha*LOSSsoft + (1-alpha)*LOSShard. I don't know what temp\*temp*2.0 means.

Duducoco updated 2 years ago
1
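
For context on the question above: in the standard knowledge distillation loss, the soft-target term is multiplied by the square of the temperature, because the gradients from temperature-softened targets scale as 1/T². The sketch below illustrates that weighting in plain Python. It assumes `temp` and `alpha` play the roles named in the issue; the function names are hypothetical and not taken from the repository, and the extra `2.0` factor is left out because the snippet does not say where it comes from.

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax; subtract the max for numerical stability.
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    # KL(p || q) for two discrete distributions of equal length.
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def cross_entropy(probs, label):
    # Negative log-likelihood of the true class.
    return -math.log(probs[label])

def distillation_loss(student_logits, teacher_logits, label, temp=4.0, alpha=0.9):
    # Soft term: KL between softened teacher and student outputs,
    # multiplied by temp * temp to keep its gradient scale comparable
    # to the hard term. (Hypothetical helper, not the repository's code.)
    soft_student = softmax(student_logits, temp)
    soft_teacher = softmax(teacher_logits, temp)
    soft_loss = kl_divergence(soft_teacher, soft_student) * temp * temp
    # Hard term: ordinary cross-entropy at temperature 1.
    hard_loss = cross_entropy(softmax(student_logits), label)
    return alpha * soft_loss + (1.0 - alpha) * hard_loss

if __name__ == "__main__":
    print(distillation_loss([1.0, 2.0, 0.5], [3.0, 1.0, 0.2], label=0))
```

With this weighting, `alpha*LOSSsoft + (1-alpha)*LOSShard` from the question corresponds to the final line, with `temp * temp` folded into the soft term.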
winycg/HSAKD #6

The results that are submitted in the paper? is it class _ac…

You have submitted the 80.7 for wrn-40-2 (Teacher Model), is it class_accuracy or ss_accuracy? I got this accuracy but as a class_accuracy, but you are referring the best accuracy to the ss_accuracy that is …

Tariq195 updated 2 years ago
2
raoyongming/DynamicViT #12

About distill

Why LVViT_Teacher return aux_head(x[:, 1:]) instead of tokens? LVViT_Teacher: ``` x = self.norm(x) x_cls = self.head(x[:,0]) x_aux = self.aux_head(x[:,1:]) return x_cls, x_aux ``` VisionTr…

hegc updated 2 years ago
2
yaoing/DAN #15

Trained cluster centers

Hi, Is there any plan to also release the learned cluster centers of the FCN? One would think they are a part of the model. I understand they are not required for inference but may be handy for fine …

amostayed updated 2 years ago
3
ahmdtaha/knowledge_evolution #6

choice of training epochs

Hi Ahmed, I was wondering about the choice of 200 training epochs for the baselines. Have you tried training for longer, e.g. matching the training time of the KE approaches? If I train for 2200 ep…

hlml updated 2 years ago
10
NVlabs/DIODE #6

Regarding training using BDD100K dataset

Greetings, This is Aman Goyal. I am currently pursuing research in MSU in the domain of knowledge distillation and I had come across your paper and github repo. I actually wanted to train on BDD10…

AmanGoyal99 updated 2 years ago
9
yoshitomo-matsubara/torchdistill #123

How to run my own dataset using the object detection example…

Hi, I want to do the experiment using other dataset such as VOC dataset. What should I do before executing the examples/object_detection.py script? I converted the VOC annotation to COCO f…

Coldfire93 updated 2 years ago
24
microsoft/ProDA #24

Model selection

Why do you use validation dataset (500 images Cityscapes) to select best model? Valid dataset (500 images Cityscapes) is only used for evaluation, not for model selection (for saving best model), this…

manhminno updated 2 years ago
17
