huggingface / optimum-habana

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
Apache License 2.0
153 stars 202 forks source link

Add warmup time and compile time log for the eval/prediction. #1489

Open jiminha opened 1 week ago

jiminha commented 1 week ago

What does this PR do?

We are adding log prints for compile time in our evaluation/prediction loop for all inference test cases. To get the compile time, use the --throughput_warmup_steps flag (same logic as in text_generation/run_generation.py). This will also increase the throughput numbers since it will remove the warmup time from the throughput calculation.

HuggingFaceDocBuilderDev commented 1 week ago

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.