Open k8si opened 7 months ago
Rejigged your request to get a better sense of what you are asking about. Hope it matches.
Request:
Add an API endpoint /info that returns a JSON dictionary with:
1) the llamafile version, so client code can adapt to each release (e.g. the change in /embedding endpoint output between v0.6.2 and later versions).
2) model metadata/configuration (e.g. the model's max sequence length).
Use Case:
Currently, there's no way to retrieve this information via an API, leading to potential integration issues and inefficient prompt management. An /info endpoint would enhance adaptability and usability.
It would be useful to have an API endpoint like /info that would return a JSON dict containing 1) the version of llamafile and 2) model metadata/configuration.
Llamafile version allows me to adapt client code to work with each release. E.g. I think the output of the /embedding endpoint changed between release v0.6.2 and what's currently on master. This will be a breaking change for my LlamaIndex integration -- v0.6.2 llamafiles will work but v0.6.3 llamafiles will not (in certain cases).
It would also be useful to have a way to get model metadata via an endpoint. E.g. user systems need to know the model's max sequence length in order to truncate/batch prompts appropriately. Currently there's no way to get this info.
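To make the request concrete, here is a rough sketch of how a client might use such an endpoint. Everything about the response is an assumption for illustration: /info does not exist yet, and the field names "version", "model" and "max_sequence_length" are hypothetical, as are the helper function names. The v0.6.2 cutoff is taken from the guess above that the /embedding output changed after that release.

```python
import json
from urllib.request import urlopen


def fetch_info(base_url: str) -> dict:
    """GET the proposed /info endpoint (hypothetical) and decode its JSON body."""
    with urlopen(f"{base_url}/info") as resp:
        return json.load(resp)


def parse_version(version: str) -> tuple[int, ...]:
    """Turn 'v0.6.2' or '0.6.2' into (0, 6, 2) so versions compare numerically."""
    return tuple(int(part) for part in version.lstrip("v").split("."))


def uses_new_embedding_format(info: dict) -> bool:
    """Assumption: the /embedding output changed in releases after v0.6.2."""
    return parse_version(info["version"]) > (0, 6, 2)


def truncate_tokens(tokens: list[int], info: dict) -> list[int]:
    """Clip a token list to the model's max sequence length (hypothetical field)."""
    return tokens[: info["model"]["max_sequence_length"]]
```

With something like this, an integration could call fetch_info once at startup (for example against a local llamafile server's base URL), then pick the right /embedding response parser and truncate prompts before sending them, instead of hard-coding either per release.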