I have a few points to discuss on the custom API bit around the thought
Since the format of a custom API is implementation dependent, this should probably not be a part of the library itself. Everyone is free to add thier own implementation in their codebase as they need it. Maybe a documentation and example would suffice.
My ask - could you separate this into two PRs? One adding support for Ollama and the other for local embedings? We can get the Ollama part merged and the other bit can be added post more thought.
I have a few points to discuss on the custom API bit around the thought
My ask - could you separate this into two PRs? One adding support for Ollama and the other for local embedings? We can get the Ollama part merged and the other bit can be added post more thought.