Referring to [src/optimum/nvidia/export/converter.py] -> class 'TensorRTModelConverter' this could 'Take a local model and create the TRTLLM checkpoint and engine'
Questions:
What are applicable local model format? e.g. JAX, HuggingFace, DeepSpeed
How to use this script individually to generate TRTLLM checkpoint/engine? Could you please share if any tutorial?
Referring to [src/optimum/nvidia/export/converter.py] -> class 'TensorRTModelConverter' this could 'Take a local model and create the TRTLLM checkpoint and engine' Questions:
Thank you.