-
The error is the following:
File "D:\PyCharm_workspace\KD\Knowledge-Distillation-via-ND-main\CIFAR\ReviewKD++\utils.py", line 62, in project_center
loss += 1 - torch.dot(s, e_c) / max_norm
RuntimeE…
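The `RuntimeError` above is truncated, but a common cause with `torch.dot` is passing a tensor that is not 1-D (for example, a feature that still carries a batch dimension). A minimal sketch of that failure mode and a fix, under the assumption that this is the truncated error:

```python
import torch

# torch.dot only accepts 1-D tensors; a 2-D operand raises a RuntimeError
# ("1D tensors expected ..."). The shapes below are illustrative, not taken
# from the project's actual code.
s = torch.randn(1, 64)   # e.g. a student feature kept with a batch dim of 1
e_c = torch.randn(64)    # e.g. a class-center vector

try:
    torch.dot(s, e_c)
except RuntimeError:
    pass                 # 2-D `s` is rejected by torch.dot

# Flattening both operands to 1-D lets the loss term run:
max_norm = s.norm() * e_c.norm()   # assumed normalizer for illustration
loss = 1 - torch.dot(s.flatten(), e_c) / max_norm
```

If the loss is computed per sample in a batch, each sample's feature should be squeezed or flattened to 1-D before the dot product.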
-
Currently `rf.BatchNorm` decides whether to update the running statistics based on `rf.get_run_ctx().train_flag`, as in [this line](https://github.com/rwth-i6/returnn/blob/master/returnn/frontend/n…
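For context, the general pattern being described can be sketched in plain PyTorch (this is not returnn's actual code, just an illustration of gating running-statistics updates on a global train flag):

```python
import torch

class SimpleBN(torch.nn.Module):
    """Batch norm whose running stats update only when a train flag is set."""

    def __init__(self, dim, momentum=0.1, eps=1e-5):
        super().__init__()
        self.momentum, self.eps = momentum, eps
        self.register_buffer("running_mean", torch.zeros(dim))
        self.register_buffer("running_var", torch.ones(dim))

    def forward(self, x, train_flag):
        if train_flag:  # analogous to checking rf.get_run_ctx().train_flag
            mean, var = x.mean(0), x.var(0, unbiased=False)
            # Exponential moving average: running += momentum * (batch - running)
            self.running_mean.lerp_(mean, self.momentum)
            self.running_var.lerp_(var, self.momentum)
        else:           # inference: use the frozen statistics
            mean, var = self.running_mean, self.running_var
        return (x - mean) / (var + self.eps).sqrt()
```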
-
```
RTX 2080 Ti
python 3.7.7 hcff3b4d_5
cuda100 1.0 0 pytorch
pytorch 0.4.1 py37_py…
```
-
A beginner question: I just want some sentences' vectors to be closer to certain other sentences' vectors, and further from some others. Is it enough to organize the training data as query, pos, and neg, and then fine-tune?
The training data shouldn't also need the pos_scores, neg_scores, prompt, and type fields, right?
For the fine-tuning command I'm referring to
https://github.com/FlagOpen/FlagEmbedding/tree/master/example…
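For reference, a minimal training file for this kind of contrastive fine-tuning is typically JSONL with one example per line. The field names below follow the FlagEmbedding examples, but treat this as a sketch; the score/prompt/type fields are, to my understanding, only needed for specific training modes:

```python
import json

# Hypothetical minimal contrastive training data: a query, passages that
# should move closer ("pos"), and passages that should move away ("neg").
examples = [
    {
        "query": "how to renew a passport",
        "pos": ["You can renew a passport by mail or at a passport office ..."],
        "neg": ["The weather today is sunny with light wind ..."],
    },
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for ex in examples:
        f.write(json.dumps(ex, ensure_ascii=False) + "\n")
```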
-
Hi Yoon,
As mentioned in [Sequence-Level Knowledge Distillation](https://arxiv.org/pdf/1606.07947.pdf), the implementation of the distillation model is released in this repo, but I didn't find the …
-
### Description & Motivation
_No response_
### Pitch
An example for knowledge distillation, especially for loading the teacher model's weights and training the student model.
Now I have a trained t…
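The requested pattern can be sketched framework-agnostically: load a trained teacher checkpoint, freeze it, and train the student against a mix of hard labels and the teacher's softened logits. The models, checkpoint path, and loss weights below are placeholders, not part of any particular library's API:

```python
import torch
import torch.nn.functional as F

teacher = torch.nn.Linear(16, 4)   # stand-in for the trained teacher
student = torch.nn.Linear(16, 4)   # stand-in for the smaller student

# teacher.load_state_dict(torch.load("teacher.ckpt"))  # hypothetical path
teacher.eval()
for p in teacher.parameters():     # freeze the teacher during distillation
    p.requires_grad_(False)

opt = torch.optim.SGD(student.parameters(), lr=0.1)
T = 2.0                            # softmax temperature

x = torch.randn(8, 16)             # dummy batch
y = torch.randint(0, 4, (8,))      # dummy hard labels

with torch.no_grad():
    t_logits = teacher(x)          # soft targets from the frozen teacher
s_logits = student(x)

# Hinton-style KD: KL between softened distributions, scaled by T^2,
# combined with the usual cross-entropy on the hard labels.
kd = F.kl_div(F.log_softmax(s_logits / T, dim=-1),
              F.softmax(t_logits / T, dim=-1),
              reduction="batchmean") * T * T
loss = 0.5 * kd + 0.5 * F.cross_entropy(s_logits, y)

opt.zero_grad()
loss.backward()
opt.step()
```

Only the student receives gradients; the teacher acts purely as a fixed source of soft targets.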
-
Hello, we have implemented a source-free compression training function, and the gains on YOLOv7 are as follows. I want to submit a PR; is that OK?
| model | method | input size | mAPval 0.5:0.95 | predic…
-
For example, the teacher model is Faster R-CNN and the student model is YOLOv3. Where can I find out what modules the models have? When I write a random module name, I get a KeyError.
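In PyTorch, a KeyError like this usually means the requested name is not among the model's registered submodules. `named_modules()` lists every valid name, including nested ones; the toy model below is a stand-in, but the same call works on Faster R-CNN or YOLOv3:

```python
import torch

# Toy model; any nn.Module (e.g. a detector) exposes names the same way.
model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 8, 3),
    torch.nn.Sequential(torch.nn.ReLU(), torch.nn.Conv2d(8, 8, 3)),
)

# Every valid module name (the empty name is the model itself, so skip it):
names = [n for n, _ in model.named_modules() if n]
# names == ['0', '1', '1.0', '1.1']

# Look up a nested module by its dotted name:
lookup = dict(model.named_modules())
relu = lookup["1.0"]
```

Printing `names` for the actual teacher/student models shows exactly which strings are accepted as distillation hook points.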
-
![image](https://github.com/YoojLee/paper_review/assets/52986798/4133f5cb-d108-472c-86a5-2db4f4983933)
## Summary
A method that distills knowledge from open-vocabulary image classification models (VLMs) such as CLIP into a two-stage detector…
-
Hello, thanks for your excellent work and code!
In the paper, the authors claim that they use the same knowledge distillation scheme as LSQ to train the quantized models. I show the screenshot as …