Open cblmemo opened 9 months ago
Another user bumped this. On PyTriton, different ports are used for HTTP, gRPC, and metrics: https://triton-inference-server.github.io/pytriton/0.5.2/guides/deploying_in_clusters/#exposing-ports
To discuss: Can we allow a syntax where each replica can open multiple ports, and treat the first listed port as the primary port that traffic is routed to?
One user also needed to view the Ray dashboard on each service replica.
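A rough sketch of how the "first port is primary" rule could be parsed, just to make the proposal concrete. Everything here is hypothetical: `ReplicaPorts` and `parse_replica_ports` are made-up names, not existing APIs, and the port numbers in the usage note are only example values.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass(frozen=True)
class ReplicaPorts:
    """Hypothetical container for the ports a replica opens."""
    primary: int      # port that traffic would be routed to
    extra: List[int]  # other ports opened on the replica (metrics, dashboards, ...)


def parse_replica_ports(value: Union[int, List[int]]) -> ReplicaPorts:
    """Accept a single port or a list of ports; the first entry is the primary."""
    ports = [value] if isinstance(value, int) else list(value)
    if not ports:
        raise ValueError("at least one port is required")
    for port in ports:
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(port, bool) or not isinstance(port, int):
            raise ValueError(f"port must be an integer, got {port!r}")
        if not 1 <= port <= 65535:
            raise ValueError(f"port out of range: {port}")
    if len(set(ports)) != len(ports):
        raise ValueError("duplicate ports are not allowed")
    return ReplicaPorts(primary=ports[0], extra=ports[1:])
```

With this, `parse_replica_ports([8000, 8001, 8002])` would route traffic to 8000 and only open 8001 and 8002, while a bare `parse_replica_ports(8080)` keeps today's single-port behavior.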