Currently, when models are served via the "Launch" tab of the MLX UI, the container port is fixed and assumed to be 5000. This works for the containerized MAX models, all of which serve inference requests on port 5000. However, any other model to be registered in MLX that does not listen on port 5000 would have to be rebuilt before it can be registered and deployed in MLX.
This PR introduces a new optional field, `container_port`, to the model YAML so that users can specify the port on which the containerized model listens for inference requests.
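For illustration, a model YAML entry using the new field might look like the sketch below; apart from `container_port`, the field names are hypothetical and may not match the actual MLX model schema:

```yaml
# Hypothetical model YAML sketch -- only container_port is the new
# optional field introduced by this PR; the other fields are illustrative.
name: my-custom-model
container_image: myorg/my-custom-model:latest
# Port on which the containerized model listens for inference requests;
# when omitted, MLX keeps assuming the previous default of 5000.
container_port: 8080
```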
There is PR #371 on the `mlx` repo to make use of the `container_port` parameter from the model YAML in the `model-config` component that is used in the model deployment pipeline in MLX.

/cc @Tomcli @rafvasq
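For context, a hedged sketch of how the deployment created by the pipeline could pick up this value; the actual manifests generated by the `model-config` component may differ:

```yaml
# Hypothetical sketch only: the real templates used by the MLX
# deployment pipeline may look different.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-custom-model
spec:
  replicas: 1
  selector:
    matchLabels:
      app: my-custom-model
  template:
    metadata:
      labels:
        app: my-custom-model
    spec:
      containers:
        - name: my-custom-model
          image: myorg/my-custom-model:latest
          ports:
            # Previously hard-coded to 5000; now taken from the
            # container_port field of the model YAML when present.
            - containerPort: 8080
```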