knowledge-distillation Search Results

1000+ results
for knowledge-distillation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

wantedly/machine-learning-round-table #20

[2019/10/16] Machine Learning 輪講

## Why Machine Learning 輪講は最新の技術や論文を追うことで、エンジニアが「技術で解決できること」のレベルをあげていくことを目的にした会です。 prev. #19 ## What 話したいことがある人はここにコメントしましょう！面白いものを見つけた時点でとりあえず話すという宣言だけでもしましょう！

agatan updated 5 years ago
2
sseung0703/KD_methods_with_TF #18

The results of Soft Logits fluctuate quite a lot

Hi, when train the student network using soft logits method and running the code: python3 train_w_distill2.py --Distillation=Soft_logits --train_dir=soft_logits --main_scope=Student_w_Soft_logits…

zhongshaoyy updated 5 years ago
3
huggingface/pytorch-image-models #464

[FEATURE] ReLabel ImageNet support

hiyyg updated 2 years ago
8
thu-coai/DA-Transformer #10

model miniaturization

Hi, I tried to train a miniaturized model with 6-layer encoder 3-layer decoder and 256 hidden dims, but found that the accuracy of the model declines rapidly. Is there any suggestion for model miniatu…

JunchengYao updated 1 year ago
3
huggingface/transformers #15166

Add FastSpeech2

# 🌟 New model addition ## Model description FastSpeech2 is a TTS model that outputs mel-spectrograms given some input text. From the [paper](https://arxiv.org/abs/2006.04558) abstract: > Non-…

jaketae updated 2 years ago
7
as-ideas/TransformerTTS #36

Audio Alignment

Hey, What steps should we use to allign the audios(non english). I see there is something called "Compute alignment dataset" which you guys use for the forward model. What exactly does that help in…

aayushkubb updated 4 years ago
3
microsoft/DeepSpeedExamples #577

step3_rlhf_finetuning and two tokenizers

Hello. I'm trying to train a GPT-J 6B, and as a critical model I have trained several networks of different/similar families (gpt2, gpt-neo, bloom, ...) I know that in step 3 only a tokenizer is us…

GenVr updated 1 year ago
3
facebookresearch/BLINK #69

Reproduce the recall result on Zero-shot EL dataset

Hi, I use the code and Hyper-parameters you released on github to train bert-base-uncased on the Zero-shot EL dataset, but I can't get the result you showed on paper, I want to know how should I adju…

xiepuzhao updated 3 years ago
11
tensorflow/model-optimization #644

Problem regarding loading a QAT SavedModel

Prior to filing: check that this should be a bug instead of a feature request. Everything supported, including the compatible versions of TensorFlow, is listed in the overview page of each technique. …

sayakpaul updated 3 years ago
11
BaoZhuhan/BaoZhuhan #5

个人科研准备

BaoZhuhan updated 2 months ago
6

上一页 1...26 27 28 29 30 31 32...100 下一页

1000+ results for knowledge-distillation

1000+ results
for knowledge-distillation