HomebrewNLP / Olmax

HomebrewNLP in JAX flavour for maintable TPU-Training
BSD 2-Clause "Simplified" License
45 stars 5 forks source link

MESA/SAM #59

Closed ClashLuke closed 2 years ago

ClashLuke commented 2 years ago

Sharpness Aware Minimization does not seem to help. This pull request exists to record this failure and its code: ![Uploading grafik.png…]()