Open parikshitsaikia1619 opened 1 year ago
Hello @Ki6an,
I am working on speeding up a finetuned t5-mini batch cpu inference.
On the batch size = 10, sequence length = 300 tokens:
Maybe I am doing something wrong, but after fastT5 it was supposed to be faster right?
pytorch:
fastT5
Collab notebook link: Link
Please let me know your thoughts.
Hello @Ki6an,
I am working on speeding up a finetuned t5-mini batch cpu inference.
On the batch size = 10, sequence length = 300 tokens:
Maybe I am doing something wrong, but after fastT5 it was supposed to be faster right?
pytorch:
fastT5
Collab notebook link: Link
Please let me know your thoughts.