Closed rhaps0dy closed 2 months ago
Running into this issue as well.
Working on SDPA support over here https://github.com/huggingface/transformers/pull/31031
Should I notify you guys if it has been merged?
Working on SDPA support over here huggingface/transformers#31031
Should I notify you guys if it has been merged?
Yeah that would be nice, thanks.
Running the README command
python -m sae EleutherAI/pythia-160m togethercomputer/RedPajama-Data-1T-Sample
gives error:The reason is that
attn_implementation="sdpa"
. Would a PR to make this configurable be welcome?