stanford-crfm / levanter

Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
https://levanter.readthedocs.io/en/latest/
Apache License 2.0
520 stars 82 forks source link

Fix transformer-engine attention import #795

Closed jennifgcrl closed 2 weeks ago

jennifgcrl commented 3 weeks ago

Renamed upstream

dlwh commented 2 weeks ago

Thank you! I've been procrastinating fixing this