-
I am having trouble figuring out how to use the code for performing ADI. Which settings do we need to configure to run ADI?
-
Hi,
I was trying out the compression library for ZeroQuant quantization (for GPT-J model). While I was able to compress the model, I didn't see any throughput/latency gain from the quantization dur…
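One quick way to confirm whether a quantized model is actually faster is to time both models on identical inputs. A minimal, library-agnostic sketch (the `model`/`compressed_model` names in the usage comment are hypothetical placeholders, not part of the compression library's API):

```python
import time

def measure_latency(fn, warmup=3, iters=10):
    """Time an inference callable; returns mean seconds per call.

    Warmup calls are excluded so one-time costs (allocation,
    compilation) do not skew the measurement.
    """
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) / iters

# Hypothetical usage: compare the original and the compressed model
# baseline  = measure_latency(lambda: model(inputs))
# quantized = measure_latency(lambda: compressed_model(inputs))
```

Note that weight-only quantization often yields no speedup unless the runtime dispatches to a fused low-precision inference kernel; timing both paths makes that visible.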
-
Hello author, I have two questions to ask you:
1. As you said in your paper, it takes about 0.64s to process each photo. Obviously, this cannot meet the real-time requirements. If I want to improve, …
-
If the pseudo labels predicted by the teacher model are inaccurate, how does the student model obtain the correct information from the unlabeled data? Why does the student model outperform the teacher…
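One common answer to this question is that noisy pseudo labels are filtered by confidence before they reach the student, so the student mostly trains on the teacher's *correct* predictions. A minimal sketch of confidence-threshold filtering (a generic technique, assumed here rather than taken from any specific paper's code):

```python
import numpy as np

def filter_pseudo_labels(probs, threshold=0.9):
    """Keep only unlabeled samples with a confident teacher prediction.

    probs: [n, num_classes] teacher softmax outputs.
    Returns (indices, hard labels) for samples above the threshold.
    """
    confidence = probs.max(axis=1)
    keep = confidence >= threshold
    return np.nonzero(keep)[0], probs.argmax(axis=1)[keep]

probs = np.array([[0.95, 0.05],   # confident  -> kept
                  [0.55, 0.45]])  # ambiguous -> dropped
idx, labels = filter_pseudo_labels(probs)
# idx -> [0], labels -> [0]
```

Combined with strong augmentation on the student side, this is one reason the student can generalize beyond the teacher: it sees more (and harder) views of the data, but only under labels the teacher was sure about.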
-
Thank you for providing your code. It is helpful for studying knowledge distillation on images.
But while I was trying to check the performance of IE-KD using this repository, there is some code that I don't u…
-
RQ-W updated 10 months ago
-
## Background
- Knowledge distillation (KD) transfers knowledge from a large model (teacher) to a small model (student), achieving model compression or faster inference while maintaining or even improving performance.
- Because KD transfers the large model's capability to the small model, the lightweight student provides faster inference while minimizing performance degradation.
- Object Detecti…
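The teacher-to-student transfer described above is usually trained with a soft-target loss. A minimal NumPy sketch of the standard Hinton-style distillation term (a generic formulation, not any particular repository's implementation; the temperature `T` and the `T**2` scaling are conventional choices):

```python
import numpy as np

def softmax(z, T=1.0):
    """Temperature-scaled softmax over the last axis."""
    z = np.asarray(z, dtype=float) / T
    z -= z.max(axis=-1, keepdims=True)  # numerical stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def kd_loss(student_logits, teacher_logits, T=4.0):
    """KL(teacher_T || student_T) * T^2, averaged over the batch.

    The T^2 factor keeps gradient magnitudes comparable across
    temperatures; in full training this term is mixed with the
    ordinary cross-entropy on ground-truth labels.
    """
    p_t = softmax(teacher_logits, T)
    log_p_s = np.log(softmax(student_logits, T))
    return (T ** 2) * np.sum(p_t * (np.log(p_t) - log_p_s), axis=-1).mean()
```

When the student's logits match the teacher's, the loss is zero; any mismatch in the softened distributions is penalized.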
-
1. For the KL loss function **_"pairwise_kl_loss"_**, why is it **"kldiv_loss_per_pair = weighted_t_softmax * ( jnp.log(weighted_t_softmax) - s_softmax_temp) # [n, m]"**?
Actually, in my understa…
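The quoted line matches the definition of KL divergence term by term, provided `s_softmax_temp` already holds the *log*-softmax of the student logits (an assumption about that variable's meaning, which the name alone does not confirm): KL(p||q) = Σ p·(log p − log q). A small NumPy illustration:

```python
import numpy as np

def kl_per_element(p, log_q):
    """Element-wise KL terms: p * (log p - log q).

    If s_softmax_temp in the quoted line is a log-softmax of the
    student logits, the expression is exactly these terms; summing
    over the class axis yields KL(teacher || student).
    """
    return p * (np.log(p) - log_q)

p = np.array([0.7, 0.3])  # teacher distribution
q = np.array([0.6, 0.4])  # student distribution
kl = kl_per_element(p, np.log(q)).sum()
# kl ≈ 0.0216 (nonnegative; zero iff p == q)
```

So there is no missing `log` around the student term: it was most likely applied earlier, when `s_softmax_temp` was computed (e.g. via a log-softmax for numerical stability).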
-
The teacher model's outputs are computed only once, before the training epochs: https://github.com/peterliht/knowledge-distillation-pytorch/blob/master/train.py#L277
This assumes that inputs are fixed in each epoch…
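Precomputing teacher outputs is only valid when each sample's input is byte-identical every epoch (no random augmentation). A minimal pure-Python sketch of the safer pattern, caching teacher outputs keyed by sample index so they are computed at most once yet stay aligned with their inputs (the toy `teacher` function is hypothetical, for illustration only):

```python
class TeacherCache:
    """Cache teacher outputs keyed by sample index.

    Valid only when a sample's input is identical every epoch;
    with random augmentation, recompute the teacher forward pass
    per batch instead of caching.
    """
    def __init__(self, teacher_fn):
        self.teacher_fn = teacher_fn
        self._cache = {}

    def __call__(self, idx, x):
        if idx not in self._cache:
            self._cache[idx] = self.teacher_fn(x)
        return self._cache[idx]

# Hypothetical usage with a toy "teacher":
calls = []
def teacher(x):
    calls.append(x)      # track how often the teacher actually runs
    return x * 2

cache = TeacherCache(teacher)
cache(0, 5)   # computes and stores
cache(0, 5)   # served from cache; teacher not called again
```

When the data loader shuffles, the index key (not the batch position) is what keeps cached outputs matched to the right samples.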
ssqfs updated 4 years ago
-
To evaluate the behavior of the two agent types—**IndividualAgent** (competitive, individualistic behavior) and **SystemAgent** (collaborative, cooperative behavior)—design a series of experiments tha…