triton-inference-server / fastertransformer_backend

BSD 3-Clause "New" or "Revised" License
411 stars 133 forks source link

triton support using factertransfer backend for flan-ul2 and flan-ul2-alpaca-lora #138

Open ma-siddiqui opened 1 year ago

ma-siddiqui commented 1 year ago

How we can run triton with fastertransfer backend for flan-ul2-alpaca-lora?

Please share the steps. how to do this?