triton-inference-server / fastertransformer_backend

BSD 3-Clause "New" or "Revised" License

run end_to_end_test_llama.py error #134

Open SherronBurtint opened 1 year ago

SherronBurtint commented 1 year ago

Running python3 tools/end_to_end_test_llama.py fails with the following error:

[400] HTTP end point doesn't support models with decoupled transaction policy
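
For context, Triton reports this error when a model whose configuration sets a decoupled transaction policy is called over HTTP. Such models are normally reached through the gRPC streaming API instead. The sketch below is not taken from end_to_end_test_llama.py. It assumes the tritonclient Python package is installed and that the server's gRPC endpoint is reachable. The helper names (uses_decoupled_policy, stream_infer), the URL, and the model name are hypothetical, and the input tensors must match whatever the deployed model's config.pbtxt declares.

```python
import queue


def uses_decoupled_policy(model_config: dict) -> bool:
    """Return True if a model config dict enables the decoupled policy.

    Accepts either the bare config or a dict that wraps it under a
    "config" key (an assumption about how the caller obtained it).
    """
    cfg = model_config.get("config", model_config)
    policy = cfg.get("model_transaction_policy") or {}
    return bool(policy.get("decoupled", False))


def stream_infer(url, model_name, inputs, timeout_s=60.0):
    """Send one request over a gRPC stream and return the first response.

    `inputs` maps input tensor names to numpy arrays. The names and dtypes
    are not checked here; they must match the model's own configuration.
    """
    # Imported lazily so the helper above works without tritonclient.
    import tritonclient.grpc as grpcclient
    from tritonclient.utils import np_to_triton_dtype

    responses = queue.Queue()

    def on_response(result, error):
        # The stream callback receives either a result or an error.
        responses.put(error if error is not None else result)

    infer_inputs = []
    for name, array in inputs.items():
        item = grpcclient.InferInput(
            name, list(array.shape), np_to_triton_dtype(array.dtype)
        )
        item.set_data_from_numpy(array)
        infer_inputs.append(item)

    client = grpcclient.InferenceServerClient(url=url)
    client.start_stream(callback=on_response)
    try:
        client.async_stream_infer(model_name, infer_inputs)
        # A decoupled model may send several responses; this sketch
        # only waits for the first one.
        first = responses.get(timeout=timeout_s)
    finally:
        client.stop_stream()
        client.close()

    if isinstance(first, Exception):
        raise first
    return first
```

A call might look like stream_infer("localhost:8001", "fastertransformer", {...}), where 8001 is Triton's default gRPC port and the model name is a placeholder. Whether the test script itself should switch to gRPC or the model should be served without the decoupled policy depends on the deployment.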