-
As written in your article: “p” stands for using the weights in T as initialization, and “d” stands for applying knowledge distillation with T as the teacher.
My question is: Does “using weights in T as in…
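For readers unfamiliar with the two settings the question contrasts, here is a minimal PyTorch sketch of the difference; `make_model` and all other names are illustrative placeholders, not taken from the paper's code, and the sketch assumes the student and teacher share an architecture.

```python
import torch.nn as nn

def make_model():
    # Placeholder architecture standing in for the paper's networks.
    return nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 10))

teacher = make_model()   # plays the role of the pretrained model T
student = make_model()

# "p": use T's weights as the student's initialization
# (only possible when the parameter shapes match).
student.load_state_dict(teacher.state_dict())

# "d": keep T around during training and distil its predictions into the
# student, e.g. via a soft-target / KL loss on T's logits; the student's
# own weights may start from scratch in this setting.
```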
-
https://wandb.ai/balthazarneveu/geosciences-segmentation
- [x] LR scheduler (plateau)
- [x] Validation accuracy
- [x] Validation metrics: IoU, Dice coefficient (a minimal sketch follows this checklist)
- [x] Augmentations (vertical reverse, hor…
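A minimal PyTorch sketch of two of the checklist items above (Dice/IoU validation metrics and the plateau LR scheduler); shapes, thresholds, and scheduler settings are assumptions for illustration, not taken from the linked wandb runs.

```python
import torch

def dice_and_iou(pred_mask, target_mask, eps=1e-6):
    """Dice coefficient and IoU for binary masks of shape (N, H, W)."""
    pred = pred_mask.float()
    target = target_mask.float()
    inter = (pred * target).sum(dim=(1, 2))
    total = pred.sum(dim=(1, 2)) + target.sum(dim=(1, 2))
    dice = (2 * inter + eps) / (total + eps)
    iou = (inter + eps) / (total - inter + eps)
    return dice.mean(), iou.mean()

# Plateau scheduler: reduce the LR when the validation metric stops improving.
model = torch.nn.Conv2d(1, 1, kernel_size=3, padding=1)   # placeholder model
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="max", factor=0.5, patience=5)

val_iou = 0.42            # would be computed on the validation set each epoch
scheduler.step(val_iou)
```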
-
Hi! Thank you for your awesome work. There is one detail I don't quite understand: since the data is unpaired and the two labels are different, why can a KL loss be used to constrain the student model?
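For context, here is a minimal sketch of the standard soft-target KL distillation term the question refers to, written in PyTorch; the temperature value and tensor shapes are assumptions for illustration, not taken from the authors' code. Note that the term compares the teacher's and student's output distributions on the same input batch, not ground-truth labels.

```python
import torch
import torch.nn.functional as F

def kd_kl_loss(student_logits, teacher_logits, temperature=4.0):
    """KL divergence between teacher and student output distributions,
    computed on the same input batch (no labels are involved)."""
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2

# Example with random logits for a batch of 8 samples and 10 classes.
student_logits = torch.randn(8, 10)
teacher_logits = torch.randn(8, 10)
loss = kd_kl_loss(student_logits, teacher_logits.detach())
```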
-
Today I read your paper "Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs". I think the idea and method you propose are very concise and practical, and they have inspired me a lot. May I ask whether you plan to open-source the code for this work? If so, I would be very grateful.
-
### **Initial action plans**
Copying these items from the wav2vec2 repo for safekeeping.
* An immediate quantization step could be to convert the fine-tuned model using the TFLite APIs. [Post-trainin…
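A minimal sketch of that TFLite post-training quantization conversion, assuming the fine-tuned model has already been exported as a TensorFlow SavedModel; the paths and file names are placeholders.

```python
import tensorflow as tf

# Load the fine-tuned model from a SavedModel directory (placeholder path).
converter = tf.lite.TFLiteConverter.from_saved_model("path/to/fine_tuned_wav2vec2")

# Post-training dynamic-range quantization: weights are stored in int8,
# activations remain in float at inference time.
converter.optimizations = [tf.lite.Optimize.DEFAULT]

tflite_model = converter.convert()
with open("wav2vec2_quantized.tflite", "wb") as f:
    f.write(tflite_model)
```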
-
# Learning from Noisy Labels with Distillation #
- Author: Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, Jia Li
- Origin: [https://arxiv.org/abs/1703.02391v1](https://arxiv.org…
-
# TensorRT Model Optimizer - Product Roadmap
[TensorRT Model Optimizer](https://github.com/NVIDIA/TensorRT-Model-Optimizer) (ModelOpt)’s north star is to be the best-in-class model optimization toolki…
-
## Overview
Non-autoregressive decoding: iteratively refine the existing translation. The first pass is fully non-autoregressive (NAT); then, on top of that result, the N lowest-confidence tokens are masked and the masked content is re-predicted, iterating for X rounds. Recovering the masked tokens is similar to BERT. The whole paper reads smoothly and the approach sounds reasonable; the experimental section is also solid.
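A rough Python sketch of the iterative mask-and-refine loop described above; `model.predict` and the other names are placeholders for whatever interface the actual implementation exposes, and the linear schedule for the number of masked tokens is an assumption.

```python
import torch

def iterative_refine(model, src, max_len, num_iters):
    """Mask-Predict-style decoding sketch: start fully non-autoregressive,
    then repeatedly re-predict the lowest-confidence tokens."""
    # Iteration 0: predict every target position in parallel (pure NAT).
    tokens, confidences = model.predict(src, tokens=None,
                                        mask=torch.ones(max_len, dtype=torch.bool))

    for t in range(1, num_iters):
        # Mask the N least-confident tokens; N shrinks over the iterations.
        n_mask = int(max_len * (num_iters - t) / num_iters)
        mask = torch.zeros(max_len, dtype=torch.bool)
        mask[confidences.argsort()[:n_mask]] = True

        # Re-predict only the masked positions, conditioned on the rest
        # (analogous to BERT's masked-token recovery).
        new_tokens, new_conf = model.predict(src, tokens=tokens, mask=mask)
        tokens = torch.where(mask, new_tokens, tokens)
        confidences = torch.where(mask, new_conf, confidences)

    return tokens
```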
## Paper information
* Author: F…
-
Hello.
I really enjoyed reading your paper and the code. But one detail confuses me: the "Stochastic precision" and "hint-based knowledge distillation" presented in "Effective Training of Convolution…
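For reference, a minimal sketch of a hint-based (FitNets-style) distillation term, in which an intermediate student feature map is regressed onto a teacher "hint" feature map; the layer shapes and regressor are illustrative assumptions, not taken from the paper in question.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hint-based distillation: match an intermediate student feature map to a
# teacher hint layer through a small regressor (shapes are illustrative).
student_feat = torch.randn(8, 64, 16, 16)    # student's guided-layer output
teacher_feat = torch.randn(8, 128, 16, 16)   # teacher's hint-layer output

regressor = nn.Conv2d(64, 128, kernel_size=1)  # maps student dims to teacher dims
hint_loss = F.mse_loss(regressor(student_feat), teacher_feat.detach())
```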
-
[Paper](https://arxiv.org/abs/2104.14294)
[Code](https://github.com/facebookresearch/dino)
Authors:
Mathilde Caron, Hugo Touvron, etc.
Facebook AI Research (FAIR).
![](https://raw.githubusercontent.com/fac…