huggingface / optimum-tpu

Google TPU optimizations for transformers models
Apache License 2.0

☝️ Update Jetstream Pytorch revision #91

Closed: tengomucho closed this 2 weeks ago

tengomucho commented 3 weeks ago

What does this PR do?

This is a refactoring PR. It updates Jetstream/Pytorch to a newer revision that allows passing the sampler function as a parameter to the prefill and generate functions, as we requested. This simplifies our code and makes HfEngine redundant, so we can drop it.
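
For readers unfamiliar with the pattern, here is a minimal sketch of what "passing the sampler as a parameter" means. It is not the Jetstream/Pytorch API: `ToyEngine`, `decode`, `greedy_sampler`, `make_temperature_sampler`, and the `sampler` argument shape are all hypothetical names chosen for illustration, and the "model" is a fake that just favors the next token id. The point is only that once prefill and generate accept a sampler callable, the caller picks the sampling strategy without subclassing the engine.

```python
import math
import random
from typing import Callable, List, Sequence

# A sampler maps a vector of logits to a single token id.
Sampler = Callable[[Sequence[float]], int]


def greedy_sampler(logits: Sequence[float]) -> int:
    """Return the index of the largest logit."""
    return max(range(len(logits)), key=lambda i: logits[i])


def make_temperature_sampler(temperature: float, seed: int = 0) -> Sampler:
    """Build a sampler that draws from softmax(logits / temperature)."""
    rng = random.Random(seed)

    def sample(logits: Sequence[float]) -> int:
        scaled = [value / temperature for value in logits]
        peak = max(scaled)  # subtract the max for numerical stability
        weights = [math.exp(value - peak) for value in scaled]
        return rng.choices(range(len(logits)), weights=weights, k=1)[0]

    return sample


class ToyEngine:
    """Hypothetical stand-in for an inference engine (assumption, not a real API)."""

    def __init__(self, vocab_size: int) -> None:
        self.vocab_size = vocab_size

    def _logits(self, last_token: int) -> List[float]:
        # Fake model: strongly favor (last_token + 1) modulo the vocabulary size.
        favored = (last_token + 1) % self.vocab_size
        return [5.0 if i == favored else 0.0 for i in range(self.vocab_size)]

    def prefill(self, prompt: Sequence[int], sampler: Sampler) -> int:
        # The sampler is supplied by the caller instead of being baked in.
        return sampler(self._logits(prompt[-1]))

    def generate(self, last_token: int, sampler: Sampler) -> int:
        return sampler(self._logits(last_token))


def decode(engine: ToyEngine, prompt: Sequence[int], steps: int, sampler: Sampler) -> List[int]:
    """Run one prefill step followed by (steps - 1) generate steps."""
    token = engine.prefill(prompt, sampler)
    output = [token]
    for _ in range(steps - 1):
        token = engine.generate(token, sampler)
        output.append(token)
    return output
```

With this shape, switching strategy is a one-argument change at the call site, for example `decode(ToyEngine(8), [1, 2, 3], 4, greedy_sampler)` versus `decode(ToyEngine(8), [1, 2, 3], 4, make_temperature_sampler(0.7))`, which is the kind of simplification that removes the need for a wrapper engine class.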