Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature stores, nearest neighbor search, and exploration strategies) into end-to-end recommendation pipelines that can be served with Triton Inference Server.
Apache License 2.0
90
stars
30
forks
source link
[BUG] Unable to serve a topK session-based model on Triton #383
Bug description
I am trying to serve a session-based topk_model that is generated using
to_top_k_encoder
method using Merlin models library.How ever I am getting the following error from the line below:
Steps/Code to reproduce bug
Please run the code in this gist to repro the issue.
Expected behavior
We should be able to serve a topk_model on Triton and be able to return topK scores and indices.
Environment details
I am using merlin-tensorflow:23.06 and pulling
main
branches for each library.