-
I see that using 50 inference steps is mentioned in the paper, but I don't see many details about it. I'm curious if that number was arrived at through testing, or if 50 steps was picked as a reasona…
-
Use inference from FP checkpoint as teacher with L2 norm to implement knowledge distillation.
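For reference, a minimal sketch of that setup, assuming a frozen full-precision (FP) teacher and a student whose outputs have the same shape (the `distillation_step` helper and its arguments are illustrative, not an existing API in this repository):

```python
import torch
import torch.nn.functional as F

def distillation_step(student, teacher, batch, optimizer):
    """Illustrative sketch: distill the student from a frozen FP teacher
    checkpoint using an L2 (MSE) penalty between their outputs."""
    teacher.eval()
    with torch.no_grad():
        teacher_out = teacher(batch)             # FP checkpoint inference as the target
    student_out = student(batch)
    loss = F.mse_loss(student_out, teacher_out)  # L2 norm between teacher and student
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```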
-
Thank you very much for sharing the code! However, I noticed there isn't any information on which specific images were used for testing on Replica. Could you please let me know if there’s a way to acc…
-
## 🚀 Feature
Use a teacher model to train a student model that is lighter than the teacher.
It is an effective method for simplifying a model without a noticeable decrease in accuracy.
## Motivation
…
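A minimal sketch of the standard soft-target distillation loss such a feature would enable (the temperature `T` and weight `alpha` are illustrative defaults, not values proposed by this request):

```python
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Hinton-style knowledge distillation: softened teacher targets
    combined with the usual hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)                                  # rescale so gradients stay comparable across T
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```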
-
Hi.
Did you publish the code for the knowledge distillation loss? I couldn't find it in the repository.
If it is not there, could you please release it?
Thanks
-
### Feature motivation
When using the ready-made models as parts of bigger networks, it can be necessary to get the outputs of specific layers. One example is encoder-decoder style networks with skip…
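A minimal sketch of one way to do this today with forward hooks, using `resnet18` purely as an example (the layer names are specific to that model):

```python
import torch
import torchvision

# Capture the outputs of selected layers with forward hooks.
model = torchvision.models.resnet18(weights=None)
features = {}

def save_output(name):
    def hook(module, inputs, output):
        features[name] = output
    return hook

model.layer1.register_forward_hook(save_output("layer1"))
model.layer3.register_forward_hook(save_output("layer3"))

x = torch.randn(1, 3, 224, 224)
_ = model(x)
print({k: v.shape for k, v in features.items()})
```

torchvision also offers `torchvision.models.feature_extraction.create_feature_extractor` for the same purpose.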
-
# FedNTD
* **Title:** Preservation of the Global Knowledge by Not-True Distillation in Federated Learning
* **Venue:** NeurIPS 2022
* **Link to paper:** https://papers.nips.cc/paper_files/paper/2022/…
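A minimal sketch of the not-true distillation idea as described by the paper's title: the local model is regularized toward the global model's predictions over the classes that are not the ground-truth label. This is a paraphrase of the idea, not the authors' released code, and `tau` is an assumed temperature:

```python
import torch.nn.functional as F

def not_true_distillation_loss(local_logits, global_logits, labels, tau=1.0):
    """Match the local model's softened predictions to the global model's
    over every class except the true one (sketch, not the official code)."""
    num_classes = local_logits.size(1)
    true_mask = F.one_hot(labels, num_classes).bool()        # marks the ground-truth class
    local_nt = local_logits[~true_mask].view(-1, num_classes - 1)
    global_nt = global_logits[~true_mask].view(-1, num_classes - 1)
    return F.kl_div(
        F.log_softmax(local_nt / tau, dim=-1),
        F.softmax(global_nt / tau, dim=-1),
        reduction="batchmean",
    ) * (tau * tau)
```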
-
Also, what exactly do the forward KL and the reverse KL do in the second stage?
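Not specific to this repository's second stage, but as a generic illustration of the difference between the two directions (forward KL is mode-covering, reverse KL is mode-seeking), here is a minimal sketch:

```python
import torch.nn.functional as F

def forward_kl(teacher_logits, student_logits):
    """KL(p_teacher || p_student): the student is pushed to put mass
    everywhere the teacher does (mode-covering)."""
    return F.kl_div(
        F.log_softmax(student_logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )

def reverse_kl(teacher_logits, student_logits):
    """KL(p_student || p_teacher): the student concentrates on the
    teacher's high-probability modes (mode-seeking)."""
    return F.kl_div(
        F.log_softmax(teacher_logits, dim=-1),
        F.softmax(student_logits, dim=-1),
        reduction="batchmean",
    )
```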
-
Configuration:
```
!torchrun --nproc_per_node 1 \
    -m FlagEmbedding.reranker.run \
    --output_dir /bge-reranker-v2-m3-finetune \
    --model_name_or_path /bge-reranker-v2-m3/bge-reranker-v2-m3 \
    --train_data output.js…
```
-