mlcommons / algorithmic-efficiency

MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
https://mlcommons.org/en/groups/research-algorithms/
Apache License 2.0
319 stars 60 forks source link

Fix WMT variant targets #756

Closed priyakasimbeg closed 3 months ago

priyakasimbeg commented 3 months ago

The internal target setting configurations had a bug where the variants all used post-layernorm. We reran the target setting procedure with corrected configurations. These updated targets are the minimum BLEU score values of:

  1. the previously checked in targets
  2. the targets as determined by the full target setting procedure on internal experiments with corrected configs.
  3. the best eval metrics achieved with the GitHub code with the updated hparam point from the target setting procedure.

These updates result in slightly easier targets than previously checked in tarets.

github-actions[bot] commented 3 months ago

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅