Open Jack-Khuu opened 2 months ago
First surfaced in https://github.com/pytorch/torchchat/pull/1057, the replace_attention_with_custom_sdpa_attention function, used when exporting models in torchchat, can be replaced with the equivalent API provided in the Excecutorch https://github.com/pytorch/executorch/blob/main/examples/models/llama2/source_transformation/sdpa.py
replace_attention_with_custom_sdpa_attention
Task: Swap the torchchat implementation with that of ExecuTorch's. Delete the then defunct code from torchchat
No response
I think #1057 resolved this. Can we close?
Not quite, #1057 was the Pr the flagged it
Should be easy PR, just needs testing
🚀 The feature, motivation and pitch
First surfaced in https://github.com/pytorch/torchchat/pull/1057, the
replace_attention_with_custom_sdpa_attention
function, used when exporting models in torchchat, can be replaced with the equivalent API provided in the Excecutorch https://github.com/pytorch/executorch/blob/main/examples/models/llama2/source_transformation/sdpa.pyTask: Swap the torchchat implementation with that of ExecuTorch's. Delete the then defunct code from torchchat
Alternatives
No response
Additional context
No response
RFC (Optional)
No response