AnswerDotAI / bert24

Apache License 2.0
25 stars 3 forks source link

Add support for FA2 deterministic mode #80

Open warner-benjamin opened 3 days ago

warner-benjamin commented 3 days ago

Changes

This PR adds a config option to enable Flash Attention 2's deterministic mode. Determinism will slow down FA2, so I have disabled it by default for pretraining configs but enabled it for all evaluation configs.

Tests