Open hpk23 opened 11 months ago
Hi @hpk23, we're working on a Triton backend example for the T5 structure. Progress is tracked under https://github.com/NVIDIA/TensorRT-LLM/issues/800. We appreciate your patience for a few more weeks while we finalize the structure.
I'm trying to use the mT5 model. For mT5, the tensorrt-llm build creates separate engines for the encoder and the decoder. How should I organize the directory structure in this case? (All of the existing model examples seem to cover only the decoder-only case.)