-
hi, i want to train a NAT model for zh-en (about 260k) . I get about 30 BLEU on teacher model , but always overfit on student model
There are the following scripts:
zh-en preprocessing:
`fairse…
-
# OpenAI Retro Contest #
- Author: openai
- Origin: https://contest.openai.com/details
- Related:
- Retro Contest: Results https://blog.openai.com/first-retro-contest-retrospective/
- Gott…
-
- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm)
# RichardAragon/MultiAgentLLM
**DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM)
(Update)…
-
Hello!
I've noticed that #269 introduces support for BERT-based model merging. I've tried it out on a few that I fancy, and I've been having a few issues.
### My Config
```yaml
models:
- mo…
-
I started adding onto the food production XML (fixed a typo "slat" to "salt") as well as the resources and meal XMLs, and noticed some discrepancies where the output amount did not seem to match the i…
-
- [ ] [self-speculative-decoding/README.md at main · dilab-zju/self-speculative-decoding](https://github.com/dilab-zju/self-speculative-decoding/blob/main/README.md?plain=1)
# Self-Speculative Decod…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
https://arxiv.org/abs/1809.03928
This is an alternate approach to handling the dynamic komi issue: instead of training one output for multiple komi, try to train 2 parameters that inform the situat…
-
First of all, thank you for uploading this very comprehensive and capable code. It surpasses the quality of what others upload on Github in the 3D-Semantic Space.
Has there been any additional trai…
-
## Competition link
https://www.kaggle.com/c/tensorflow2-question-answering
## Description
In this competition, your goal is to predict short and long answer responses to real questions about…