Open j0ma opened 2 years ago
Training a bilingual EN-RU model runs into out-of-memory (OOM) errors, which slows training down a lot.