elastic / elasticsearch

Free and Open Source, Distributed, RESTful Search Engine
https://www.elastic.co/products/elasticsearch
Other
69.56k stars 24.62k forks source link

[ML] Add configurable chunking options to create endpoint Inference API #111856

Open dan-rubinstein opened 1 month ago

dan-rubinstein commented 1 month ago

Description

Currently, the Inference API uses a word based chunker that can not be configured by the user. This change would allow users calling the Inference API:

  1. Set and configure a chunking strategy when creating an inference endpoint. This chunking strategy will be used by default when performing inferences.
  2. Provide a chunking strategy when performing an inference to override the inference endpoint's configured chunking strategy.

If no chunking strategy is provided, the API will continue to function as is using the existing word based chunking strategy.

elasticsearchmachine commented 1 month ago

Pinging @elastic/ml-core (Team:ML)