Open CallmeZhangChenchen opened 1 month ago
We have the LLM API for end-to-end generation, you may have a try. Here are the demos for it.
@Superjomn The LLM Api doesn't feel right to use directly, now I'm using the source code to add support for the TTS model, Met a operator ‘RelPositionMultiHeadedAttention’ , and There is no implementation of this in TensoRT-LLM, whether can consider to add
But I'm not sure there's any existing code, set_rel_attn_table, precompute_relative_attention_bias
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."
https://github.com/FunAudioLLM/CosyVoice/blob/main/cosyvoice/llm/llm.py
In the llm code of TTS like this, Is TensorRT-LLM suitable for use? Are advised to refer to which Demo?