QA's benchmark scripts set this env var TEXT_GENERATION_SERVER_IGNORE_EOS_TOKEN=true but it's not used any place.
This change overwrite ignore_eos_token in the stopping criteria when the env var is true. It will help benchmarks get a fixed number of output tokens.
QA's benchmark scripts set this env var TEXT_GENERATION_SERVER_IGNORE_EOS_TOKEN=true but it's not used any place. This change overwrite ignore_eos_token in the stopping criteria when the env var is true. It will help benchmarks get a fixed number of output tokens.