Open cblmemo opened 11 months ago
Another user bumped this. On PyTriton, different ports are used for HTTP, gRPC, and metrics: https://triton-inference-server.github.io/pytriton/0.5.2/guides/deploying_in_clusters/#exposing-ports
To discuss: Can we allow a syntax where each replica can open multiple ports, and treat the first listed port as the primary port that traffic is routed to?
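To make the proposal concrete, here is a rough sketch of the "first port is primary" rule. Everything in it is hypothetical: `ReplicaPorts` and `split_replica_ports` are illustrative names, not existing APIs, and the port numbers are only an example modeled on the HTTP/gRPC/metrics split mentioned above.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class ReplicaPorts:
    """Hypothetical container: one routed port plus extra exposed ports."""
    primary: int      # port the load balancer would route traffic to
    extra: List[int]  # ports that are opened on the replica but not routed


def split_replica_ports(ports: Sequence[int]) -> ReplicaPorts:
    """Treat the first listed port as primary and the rest as extras."""
    if not ports:
        raise ValueError("at least one port is required")
    for port in ports:
        if not 1 <= port <= 65535:
            raise ValueError(f"invalid port: {port}")
    if len(set(ports)) != len(ports):
        raise ValueError("ports must be unique")
    return ReplicaPorts(primary=ports[0], extra=list(ports[1:]))


# Example values only: HTTP first (routed), then gRPC and metrics.
replica = split_replica_ports([8000, 8001, 8002])
print(replica.primary, replica.extra)
```

With this rule, existing single-port specs keep working unchanged, since a one-element list simply has no extra ports.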
This issue is stale because it has been open 120 days with no activity. Remove stale label or comment or this will be closed in 10 days.
Not stale, any plans for this? @concretevitamin
We should get to it. Which ports do you need on each replica, and for what purposes? How do you work around it today?
One user needed to see the Ray dashboard on each service replica.