huggingface / optimum-tpu

Google TPU optimizations for transformers models
Apache License 2.0
75 stars 19 forks source link

☝️ Update Jetstream Pytorch revision #91

Closed tengomucho closed 2 months ago

tengomucho commented 2 months ago

What does this PR do?

This is a refactoring PR. It updates the Jetstream/Pytorch version to a newer revision, that allows to pass the sampler function as parameter to the prefill and generate functions, as we requested. This simplifies our code, making the HfEngine usage redundant, so we can drop it.