Open ZJLi2013 opened 1 month ago
Hi, please check:
Do you find a file named rank0_managed_weights.safetensors or so inside the engine dir?
Is there a field named manage_weights in the plugin_config part of config.json?
It seems that you are building from a model config without weights, not a checkpoint. In such cases TRT-LLM generates random weights, but that is not supported by fast_build yet.
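The two checks above can be scripted. The following is only a minimal sketch, not an official TRT-LLM tool: the helper names find_key and inspect_engine_dir are hypothetical, the file is matched with a glob because the exact name is only given as "rank0_managed_weights.safetensors or so", and plugin_config is searched for anywhere in config.json because its exact nesting is an assumption here.

```python
import json
from pathlib import Path


def find_key(node, key):
    """Return the first value stored under `key` anywhere in nested JSON, else None."""
    if isinstance(node, dict):
        if key in node:
            return node[key]
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def inspect_engine_dir(engine_dir):
    """Report managed-weights files and the manage_weights field for an engine dir.

    Hypothetical helper: the glob pattern and the location of plugin_config
    are assumptions based on the wording of the reply, not a documented layout.
    """
    engine_dir = Path(engine_dir)
    # Match loosely, since the exact file name may differ per rank or version.
    weight_files = sorted(p.name for p in engine_dir.glob("rank*_managed_weights*"))

    manage_weights = None
    config_path = engine_dir / "config.json"
    if config_path.is_file():
        config = json.loads(config_path.read_text())
        plugin_config = find_key(config, "plugin_config")
        if isinstance(plugin_config, dict):
            # None here means the field is absent from plugin_config.
            manage_weights = plugin_config.get("manage_weights")

    return {"managed_weight_files": weight_files, "manage_weights": manage_weights}
```

Calling inspect_engine_dir on the engine output directory returns a small dict; an empty managed_weight_files list or a manage_weights value of None would mean the corresponding check came back negative.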
System Info
DGX H100
Who can help?
When building the engine with:
and then benchmarking with gptManagerBenchmark, it reports:
Is this expected behavior with fast-build?
Btw, without --fast-build, the engine build and benchmark look all right. Thanks
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
Expected behavior
The fast-build flag should also build workable engines.
actual behavior
With the fast-build flag, the transformer layers are ignored somehow.
additional notes
no more