buckets = [64, 128, 256, 512, 1024, 2048, 4096, 8192]
This change will pad the input length to the nearest upper bound in buckets, which will avoid creating many graphs for different lengths to leverage the HPU accelerator.
[x] pad_to_buckets is a new argument for LMEvalParser
[x] auto device is enabled if pad_to_buckets is not set.
[x] add hpu evaluation support in example.
[x] install evaluation when install neural_compressor_pt.
Expected Behavior & Potential Risk
lm_eval example test gives the same result as before.
Type of Change
lm_eval evaluation enhancement
Description
buckets = [64, 128, 256, 512, 1024, 2048, 4096, 8192] This change will pad the input length to the nearest upper bound in buckets, which will avoid creating many graphs for different lengths to leverage the HPU accelerator.
pad_to_buckets
is a new argument forLMEvalParser
pad_to_buckets
is not set.Expected Behavior & Potential Risk
lm_eval example test gives the same result as before.