Open sfc-gh-zhwang opened 1 year ago
Hi, is there a tutorial we can follow to serve a DeBERTa model with FasterTransformer in Triton? I think the steps would be:
However, I only see step 1, with a TensorFlow example.
https://github.com/NVIDIA/FasterTransformer/pull/725