superduper-io / superduper

Superduper: Integrate AI models and machine learning workflows with your database to implement custom AI applications, without moving your data. Including streaming inference, scalable model hosting, training and vector search.
https://superduper.io
Apache License 2.0
4.65k stars 448 forks source link

Optimize model writing and loading during prediction #2368

Closed blythed closed 1 month ago

blythed commented 1 month ago

We should have separate processes writing model outputs, while other processes are loading data and predicting. This is particularly obvious when the inputs and outputs are large (order of magnitude of work in I/O comparable to prediction).