tensorlakeai / indexify

A realtime serving engine for Data-Intensive Generative AI Applications
https://docs.tensorlake.ai
Apache License 2.0
906 stars, 112 forks

make gpu_memory vec #855

Closed: maxkozlovsky closed this 2 months ago

maxkozlovsky commented 2 months ago

There can be more than one GPU unit, so make gpu_memory a vector.
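As a rough illustration of the change described above, here is a minimal Rust sketch. The thread does not show the actual code, so everything except the field name gpu_memory is an assumption: the struct name `HostResources`, the helper methods, the use of bytes as the unit, the one-entry-per-GPU layout, and even the choice of Rust are hypothetical and are not taken from Indexify.

```rust
/// Hypothetical resource description. Only the field name `gpu_memory`
/// comes from the thread; the struct and its methods are illustrative.
#[derive(Debug, Clone, PartialEq, Default)]
pub struct HostResources {
    /// Assumed layout: memory in bytes, one entry per GPU unit.
    /// An empty vector means no GPU is present.
    pub gpu_memory: Vec<u64>,
}

impl HostResources {
    /// Number of GPU units described by this value.
    pub fn gpu_count(&self) -> usize {
        self.gpu_memory.len()
    }

    /// Sum of the memory of all GPU units.
    pub fn total_gpu_memory(&self) -> u64 {
        self.gpu_memory.iter().sum()
    }

    /// True if at least one single GPU unit has `required` bytes or more.
    pub fn fits_on_one_gpu(&self, required: u64) -> bool {
        self.gpu_memory.iter().any(|&m| m >= required)
    }
}
```

A vector keeps per-device sizes separate, which a single scalar cannot do: two GPUs of 16 and 24 units give a total of 40, yet a workload needing 32 on one device still does not fit.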