score-distillation Search Results

380 results
for score-distillation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/fairseq #4502

validation loss is not decreasing on NAT with zh-en data

hi, i want to train a NAT model for zh-en (about 260k) . I get about 30 BLEU on teacher model , but always overfit on student model There are the following scripts: zh-en preprocessing: `fairse…

kkeleve updated 2 years ago
4
number9473/challenge #9

OpenAI Retro Contest

# OpenAI Retro Contest # - Author: openai - Origin: https://contest.openai.com/details - Related: - Retro Contest: Results https://blog.openai.com/first-retro-contest-retrospective/ - Gott…

joyhuang9473 updated 6 years ago
21
irthomasthomas/undecidability #681

MultiAgentLLM a faithful recreation of the Small LLMs Are We…

- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm) # RichardAragon/MultiAgentLLM **DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM) (Update)…

irthomasthomas updated 8 months ago
2
arcee-ai/mergekit #286

Merging BERT-based embedding models

Hello! I've noticed that #269 introduces support for BERT-based model merging. I've tried it out on a few that I fancy, and I've been having a few issues. ### My Config ```yaml models: - mo…

tomaarsen updated 6 months ago
6
mars-sim/mars-sim #280

Food production values

I started adding onto the food production XML (fixed a typo "slat" to "salt") as well as the resources and meal XMLs, and noticed some discrepancies where the output amount did not seem to match the i…

Paculino updated 1 month ago
9
irthomasthomas/undecidability #680

self-speculative-decoding/README.md at main · dilab-zju/self…

- [ ] [self-speculative-decoding/README.md at main · dilab-zju/self-speculative-decoding](https://github.com/dilab-zju/self-speculative-decoding/blob/main/README.md?plain=1) # Self-Speculative Decod…

irthomasthomas updated 8 months ago
1
ultralytics/ultralytics #6197

I want to create a code to make a knowledge distillation mod…

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

kazu-gra updated 1 week ago
35
leela-zero/leela-zero #1835

SAI, a Sensible Artificial Intelligence that plays Go (LZ + …

https://arxiv.org/abs/1809.03928 This is an alternate approach to handling the dynamic komi issue: instead of training one output for multiple komi, try to train 2 parameters that inform the situat…

gcp updated 4 years ago
108
yanx27/2DPASS #16

Baseline training schedule

First of all, thank you for uploading this very comprehensive and capable code. It surpasses the quality of what others upload on Github in the 3D-Semantic Space. Has there been any additional trai…

L-Reichardt updated 1 year ago
10
osuossu8/kaggle-solution #1

[2019] TensorFlow 2.0 Question Answering

## Competition link https://www.kaggle.com/c/tensorflow2-question-answering ## Description In this competition, your goal is to predict short and long answer responses to real questions about…

osuossu8 updated 4 years ago
12

上一页 1...7 8 9 10 11 12 13...38 下一页

380 results for score-distillation

380 results
for score-distillation