-
Hello @nreimers! In section 4.3 of your paper: "Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation", you trained a student XLM-R model on JW300. Is it possible to share /…
-
Hi, thanks for your great work. Does the code contain the feature distillation part?
-
Hi,
Thank you for releasing the distilled MiniLM models from pre-trained Transformer models. I wonder if you have any plans to release sample code for the MiniLM distillation implementation in eit…
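While waiting for official sample code, here is a minimal PyTorch sketch of the two MiniLM objectives as I understand them from the paper (last-layer self-attention distribution transfer and value-relation transfer); the function names and the normalization over heads and positions are my own assumptions, not the released implementation:

```python
import torch
import torch.nn.functional as F

def attention_distribution_loss(teacher_attn, student_attn, eps=1e-9):
    """KL divergence between last-layer self-attention distributions.

    teacher_attn, student_attn: [batch, heads, seq, seq] attention
    probabilities; MiniLM assumes the head counts match.
    """
    kl = F.kl_div(student_attn.clamp_min(eps).log(), teacher_attn,
                  reduction="batchmean")
    # "batchmean" only divides by the batch size, so also average
    # over heads and query positions
    return kl / (teacher_attn.size(1) * teacher_attn.size(2))

def value_relation_loss(teacher_v, student_v, eps=1e-9):
    """KL divergence between value relations softmax(V V^T / sqrt(d_head)).

    teacher_v, student_v: [batch, heads, seq, head_dim] value vectors
    from the last self-attention layer.
    """
    def relation(v):
        scores = torch.matmul(v, v.transpose(-1, -2)) / (v.size(-1) ** 0.5)
        return F.softmax(scores, dim=-1)

    t_rel, s_rel = relation(teacher_v), relation(student_v)
    kl = F.kl_div(s_rel.clamp_min(eps).log(), t_rel, reduction="batchmean")
    return kl / (teacher_v.size(1) * teacher_v.size(2))
```

In the paper the training objective is the sum of these two terms; the real code would also need hooks to pull the attention probabilities and value vectors out of both models.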
-
Strategy 2 says it is borrowed from "Learning Efficient Object Detection Models with Knowledge Distillation". My question is: does this project's knowledge distillation leave out the hint-learning part and only use that paper's strategy of a soft-target loss on the output layer?
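For context, if the project really uses only the output-layer soft targets (no hint learning), the loss would look roughly like this Hinton-style sketch; this is my own illustration, not the repository's actual code:

```python
import torch.nn.functional as F

def soft_target_loss(student_logits, teacher_logits, T=4.0):
    """Output-layer soft-target loss: KL divergence between
    temperature-softened teacher and student distributions,
    scaled by T^2 to keep gradient magnitudes comparable."""
    log_p_student = F.log_softmax(student_logits / T, dim=-1)
    p_teacher = F.softmax(teacher_logits / T, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * (T * T)
```

Hint learning would add an extra regression loss between intermediate feature maps (with an adaptation layer when their dimensions differ), which is exactly the part I am asking whether this project includes.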
-
Thanks for the brilliant work! I am reading this legendary paper and have a question I want to discuss here.
The paper starts by introducing a new method to distill knowledge from a trained …
-
- Reviewer Yydy
Strengths
> I like the intuition of using confidence gaps (obtained through logits only) to approximate the original private model, but there shall be more details about the inver…
-
# Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer #
- Authors: Sergey Zagoruyko, Nikos Komodakis
- Origin: [http://www.gitxiv.c…
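A minimal PyTorch sketch of the activation-based attention-transfer loss described in the paper (my paraphrase, not the authors' released code):

```python
import torch.nn.functional as F

def attention_map(feat):
    """Activation-based attention map: mean of squared activations over
    channels, flattened and L2-normalized per sample.  feat: [B, C, H, W]."""
    a = feat.pow(2).mean(dim=1).flatten(1)   # [B, H*W]
    return F.normalize(a, p=2, dim=1)

def attention_transfer_loss(teacher_feat, student_feat):
    """Squared L2 distance between normalized attention maps of one
    teacher/student layer pair (spatial sizes are assumed to match)."""
    return (attention_map(teacher_feat) - attention_map(student_feat)).pow(2).mean()
```

In the paper this term is summed over several teacher/student layer pairs and added, with a weighting factor, to the regular cross-entropy (and optionally a standard KD) loss.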
-
Hi @akshaychawla, can you give me access to the GCS data? Thanks so much.
-
I recently read your paper "Generalization Matters: Loss Minima Flattening via Parameter Hybridization for Efficient Online Knowledge Distillation" and found it very inspiring, so I tried to reproduce it. Because the ImageNet dataset is quite large, I used the Tiny-ImageNet dataset for the reproduction…
-
Hello! When training the Dnet, the number of channels in the teacher feature map and the student feature map may differ; how should this be handled?
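One common workaround (I am not sure it is what this repository does) is a learnable 1x1 convolution that projects the student feature map to the teacher's channel count before the feature loss is computed; a minimal sketch, with hypothetical names:

```python
import torch.nn as nn
import torch.nn.functional as F

class ChannelAdapter(nn.Module):
    """Projects the student feature map to the teacher's channel count
    (and spatial size) before a feature-distillation loss."""

    def __init__(self, student_channels, teacher_channels):
        super().__init__()
        self.proj = nn.Conv2d(student_channels, teacher_channels, kernel_size=1)

    def forward(self, student_feat, teacher_feat):
        s = self.proj(student_feat)
        # Also align spatial resolution in case the strides differ
        if s.shape[-2:] != teacher_feat.shape[-2:]:
            s = F.interpolate(s, size=teacher_feat.shape[-2:],
                              mode="bilinear", align_corners=False)
        return F.mse_loss(s, teacher_feat)
```

The adapter's parameters are trained jointly with the student, and the teacher feature map should be detached so no gradient flows back into the teacher.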