Closed by fpetrini15 10 months ago
Simple POC for launching the server in the background and running inference.
Steps:
triton repo add -m opt125 --source hf:facebook/opt-125m
triton bench run -m opt125
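
The two steps above could be scripted roughly as follows. This is a minimal sketch, not part of the PR: the helper names `build_steps` and `run_steps` are hypothetical, and the only commands used are the two listed in the steps. It defaults to a dry run that prints the commands, since actually executing them assumes the `triton` CLI is installed and on `PATH`.

```python
import shlex
import subprocess

# Values taken from the steps in this PR description.
MODEL_NAME = "opt125"
MODEL_SOURCE = "hf:facebook/opt-125m"


def build_steps(model=MODEL_NAME, source=MODEL_SOURCE):
    """Return the two CLI invocations from the steps, as argument lists."""
    return [
        ["triton", "repo", "add", "-m", model, "--source", source],
        ["triton", "bench", "run", "-m", model],
    ]


def run_steps(steps, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in steps:
        print("$ " + shlex.join(cmd))
        if not dry_run:
            # check=True stops at the first failing step, so the benchmark
            # is not attempted if adding the model to the repo fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_steps(build_steps(), dry_run=True)
```

Calling `run_steps(build_steps(), dry_run=False)` would run both commands in order, assuming the CLI is available.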