milvus-io / milvus

A cloud-native vector database, storage for next generation AI applications
https://milvus.io
Apache License 2.0
29.02k stars 2.79k forks source link

[Feature]: Implement serving ONNX model in Milvus #24955

Open xiaofan-luan opened 1 year ago

xiaofan-luan commented 1 year ago

Is there an existing issue for this?

Is your feature request related to a problem? Please describe.

ONNX is a computation backend serving models.

Milvus will need to run ONNX cases in following scenarios:

  1. User defined metrics
  2. Ranking/Reranking
  3. Embedding finetuning -> UDF function to finetune the embeddings ..... more suggestion on the usages

Describe the solution you'd like.

No response

Describe an alternate solution.

No response

Anything else? (Additional Context)

No response

xiaofan-luan commented 1 year ago

/assign @soothing-rain