-
Dear MobileSAM Developers,
I hope this message finds you well. I am reaching out to discuss potential enhancements to the MobileSAM framework, particularly concerning its lightweight encoder's perf…
-
### **Initial action plans**
Copying these notes over from the wav2vec2 repo for safekeeping.
* An immediate quantization step could be to convert the fine-tuned model using the TFLite APIs. [Post-trainin…
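For context, a minimal sketch of the idea behind post-training quantization (a per-tensor affine float-to-int8 mapping). This is an illustration of the concept only, with toy weights; the actual conversion would go through the TFLite converter on the fine-tuned model.

```python
# Toy illustration of post-training affine int8 quantization.
# This sketches the idea only; real conversion uses the TFLite APIs.

def quantize_int8(weights):
    """Map float weights to int8 with a per-tensor scale and zero point."""
    lo, hi = min(weights), max(weights)
    scale = (hi - lo) / 255.0 or 1.0          # avoid zero scale for constant tensors
    zero_point = round(-128 - lo / scale)     # real value 0 maps near this int
    q = [max(-128, min(127, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate float weights from the int8 representation."""
    return [(v - zero_point) * scale for v in q]

w = [-0.5, 0.0, 0.25, 1.0]                    # toy weight tensor
q, s, z = quantize_int8(w)
w_hat = dequantize(q, s, z)                   # round-trip error is at most ~scale/2
```

The round-trip error per weight is bounded by about half the quantization step, which is why post-training quantization usually costs little accuracy while shrinking the model roughly 4x.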
-
I have tried the example [【 Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation 】](https://github.com/UKPLab/sentence-transformers/tree/master/examples/training/multiling…
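For readers unfamiliar with that example, the distillation objective it implements can be sketched as follows: the student is trained so that both the source sentence and its translation map close to the teacher's embedding of the source sentence. The vectors below are toy stand-ins; real training uses sentence-transformers models.

```python
# Sketch of the multilingual knowledge-distillation objective:
# pull student("Hello") and student("Hallo") toward teacher("Hello").
# Embeddings here are toy vectors, not real model outputs.

def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def distill_loss(teacher_src, student_src, student_tgt):
    # Both student outputs are regressed onto the teacher's source embedding.
    return mse(teacher_src, student_src) + mse(teacher_src, student_tgt)

teacher_en = [0.2, -0.1, 0.7]        # teacher embedding of the English sentence
student_en = [0.19, -0.12, 0.69]     # student embedding of the English sentence
student_de = [0.21, -0.08, 0.71]     # student embedding of the German translation
loss = distill_loss(teacher_en, student_en, student_de)
```

Because both terms target the same teacher vector, translations end up close to their source sentences in the student's embedding space, which is what makes the monolingual space multilingual.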
-
Hello, your work on knowledge distillation is great!
However, I have some questions about the FitNets code.
I found that you simply sum the losses before calling backward; specifically, the `loss_feat` and `l…
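For what it's worth, summing the losses and backpropagating once is mathematically equivalent to backpropagating each loss separately and adding the gradients, since d(L1 + L2)/dw = dL1/dw + dL2/dw. A toy finite-difference check in plain Python (the two loss functions are arbitrary stand-ins for `loss_feat` and the distillation term, not the FitNets code):

```python
# Toy check that the gradient of a summed loss equals the sum of the
# individual gradients, which is why a single backward pass on the
# summed loss trains both objectives at once.

def loss_feat(w):
    return (w - 2.0) ** 2          # stand-in feature-matching loss

def loss_kd(w):
    return 0.5 * (w + 1.0) ** 2    # stand-in distillation loss

def grad(f, w, eps=1e-6):
    """Central finite-difference estimate of df/dw."""
    return (f(w + eps) - f(w - eps)) / (2 * eps)

w = 0.3
g_sum = grad(lambda x: loss_feat(x) + loss_kd(x), w)   # backward on the sum
g_parts = grad(loss_feat, w) + grad(loss_kd, w)        # separate backwards, added
```

So summing before the backward call is not a shortcut that loses anything; weighting the terms differently would only change the relative scale of the two gradient contributions.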
-
Hi, thanks for your great work. Does the code contain the feature distillation part?
-
Hi @akshaychawla, can you give me access to the GCS data? Thanks so much.
-
Hi there!
Now that adapters work, would it make sense to add support for using an adapter for the query and for sentence2 natively with model.train?
-
## Jiphyeonjeon Intermediate Study Group
- Sunday, June 26, 2022, 9:00
- Presented by 장동건, 김제우, 김종은, 이기성
- Paper link: https://arxiv.org/abs/1910.01108
> ### Abstract
> As Transfer Learning from large-scale pre-trained models becomes more pr…
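The distillation objective behind this paper matches the student's softened output distribution to the teacher's. A minimal sketch of the soft-target component with temperature scaling (toy logits in plain Python; the paper's training additionally uses other loss terms):

```python
import math

# Minimal sketch of the soft-target distillation loss: cross-entropy
# between teacher and student distributions, both softened with a
# temperature T. Logits below are toy values.

def softmax(logits, T=1.0):
    exps = [math.exp(z / T) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def soft_target_loss(teacher_logits, student_logits, T=2.0):
    p = softmax(teacher_logits, T)          # teacher's softened targets
    q = softmax(student_logits, T)          # student's softened outputs
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q))

teacher = [3.0, 1.0, 0.2]
student = [2.5, 1.2, 0.1]
loss = soft_target_loss(teacher, student)
```

Raising the temperature flattens both distributions, so the student also learns from the teacher's relative rankings of incorrect classes rather than only from the top prediction.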
-
Hi,
Thanks for releasing your code in this repository! It would be exciting to try the neural topic models with Knowledge Distillation.
I was wondering whether you would like to give a simple e…
-
## Title: Two-Head Knowledge Distillation: Enhancing Logit Utilization with an Auxiliary Head
## Link: https://arxiv.org/abs/2411.08937
## Summary:
Conventional knowledge distillation focuses on aligning the student model's predicted probabilities with both the ground-truth labels and the teacher model's predicted probabilities. However, the conversion from logits to predicted probabilities can obscure certain essential information. To address this issue…
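The claim that converting logits to probabilities loses information can be seen directly: softmax is invariant to adding a constant to every logit, so the absolute scale and offset of the logits vanish after the conversion. A small check in plain Python:

```python
import math

# Softmax is shift-invariant: adding a constant to every logit leaves
# the probabilities unchanged, so information about the logits'
# absolute magnitudes is lost after the conversion.

def softmax(logits):
    m = max(logits)                          # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

a = softmax([1.0, 2.0, 3.0])
b = softmax([101.0, 102.0, 103.0])           # the same logits shifted by 100
```

Since `a` and `b` are identical, a student matched only on probabilities never sees the difference between these two teacher outputs, which motivates distilling from the logits themselves via an auxiliary head.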