mozilla / firefox-translations-training

Training pipelines for Firefox Translations neural machine translation models
https://mozilla.github.io/firefox-translations-training/
Mozilla Public License 2.0
135 stars 28 forks source link

Refactor b-cpu-xlargedisk worker pools to allow for experimentation with different configurations #674

Closed gabrielBusta closed 2 weeks ago

gabrielBusta commented 2 weeks ago

Refactor worker pools to troubleshoot #669

gabrielBusta commented 2 weeks ago

@eu9ene do I make the PR against release? Or do I land it in main and cherry-pick to release?

gabrielBusta commented 2 weeks ago

Thanks @bhearsum

eu9ene commented 2 weeks ago

@gabrielBusta I think it got stuck in "pending" state https://firefox-ci-tc.services.mozilla.com/tasks/e7q4S3z_TlmLKV9VGIGbeQ

gabrielBusta commented 2 weeks ago

@eu9ene Firefox CI is down :( https://mozilla.slack.com/archives/C030SPMMYQN/p1718302580423619

gabrielBusta commented 2 weeks ago

But these are probably pending because the config patch has not landed (it's deploying right now)

gabrielBusta commented 2 weeks ago

@eu9ene it is un-stuck now

eu9ene commented 2 weeks ago

@gabrielBusta I think changing the workers did not retrigger CI

eu9ene commented 2 weeks ago

Ok, I'm merging because I need to start testing it all together