elastic / kibana

Your window into the Elastic Stack
https://www.elastic.co/products/kibana

ELSER / inference endpoint installation and management #192461

Open legrego opened 1 week ago

legrego commented 1 week ago

This is a prerequisite for the unified knowledge base.

Using semantic search within a cluster has some prerequisites: an inference endpoint must be deployed on the cluster, and deploying an inference endpoint requires at least one ML node on the cluster. Deploying an inference endpoint, even with ELSER, isn’t necessarily trivial in some scenarios (e.g. air-gapped mode).

For internal uses of semantic search (such as retrieval from the knowledge bases), our solution teams use our internal ELSER model. To do so, however, they’re forced to set up the inference endpoint and manage its lifecycle themselves.
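To make the two prerequisites above concrete, here is a minimal sketch, not the proposed implementation. It assumes the nodes info response (`GET /_nodes`) lists each node's `roles`, and that an ELSER endpoint is created through the Elasticsearch inference API with the `elser` service. The helper names, the endpoint id, and the allocation settings are hypothetical, and the service name may differ between Elasticsearch versions. The code only builds the request; it does not send anything.

```typescript
// Minimal shape of a nodes info response (GET /_nodes); only `roles` is used here.
interface NodesInfo {
  nodes: Record<string, { roles?: string[] }>;
}

// Returns true when at least one node advertises the "ml" role.
function hasMlNode(info: NodesInfo): boolean {
  return Object.keys(info.nodes).some((id) =>
    (info.nodes[id].roles ?? []).includes("ml")
  );
}

// Generic description of an Elasticsearch REST request (hypothetical helper type).
interface EsRequest {
  method: "GET" | "PUT" | "POST" | "DELETE";
  path: string;
  body?: Record<string, unknown>;
}

// Builds (but does not send) a request that creates an ELSER inference endpoint.
// Assumption: the "elser" service and these settings match the target ES version.
function buildElserEndpointRequest(inferenceId: string): EsRequest {
  return {
    method: "PUT",
    path: `/_inference/sparse_embedding/${encodeURIComponent(inferenceId)}`,
    body: {
      service: "elser",
      service_settings: { num_allocations: 1, num_threads: 1 },
    },
  };
}
```

A caller would first check `hasMlNode(...)` and only then send the request returned by `buildElserEndpointRequest(...)` with whatever Elasticsearch client is available.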

elasticmachine commented 1 week ago

Pinging @elastic/appex-ai-infra (Team:AI Infra)

pgayvallet commented 2 days ago

From https://github.com/elastic/kibana/issues/188554:

Note there are also requirements to allow for the automated configuration of inferences, which would require credentials with the manage_inference cluster privilege. This is out of scope of the current discussion.

Apparently, today the kibana system user does not have the manage_inference cluster privilege. This is going to be a blocker for us here.

Note that we don't strictly need to be able to manage any inference endpoint, but we will need the Kibana user to at least be able to create (and then manage) the ELSER-based inference endpoints we're planning to use for the KB indices.
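One way to detect the missing privilege at runtime is the Elasticsearch has-privileges API (`POST /_security/user/_has_privileges`), which reports per-privilege results for the calling user. The sketch below is an illustration only: the helper names are hypothetical, and it builds the request and interprets a response without contacting a cluster.

```typescript
// Cluster privilege discussed in this thread.
const INFERENCE_PRIVILEGE = "manage_inference";

// Subset of the has-privileges response that this check relies on.
interface HasPrivilegesResponse {
  has_all_requested: boolean;
  cluster: Record<string, boolean>;
}

// Builds (but does not send) the has-privileges request for the current user.
function buildPrivilegeCheckRequest(): {
  method: "POST";
  path: string;
  body: { cluster: string[] };
} {
  return {
    method: "POST",
    path: "/_security/user/_has_privileges",
    body: { cluster: [INFERENCE_PRIVILEGE] },
  };
}

// Returns true only when the response explicitly grants manage_inference.
function canManageInference(res: HasPrivilegesResponse): boolean {
  return res.cluster[INFERENCE_PRIVILEGE] === true;
}
```

If `canManageInference` returns false for the Kibana user, automated endpoint creation would have to be skipped or surfaced as a setup error rather than attempted.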

spong commented 1 day ago

Apparently, today the kibana system user does not have the manage_inference cluster privilege. This is going to be a blocker for us here.

Here's some prior art from when we tried to enable it a few months back: https://github.com/elastic/elasticsearch/pull/108262 https://github.com/elastic/elasticsearch/pull/108386

The gist was that we would let the Search team drive the need and work with Kibana Security on enabling it. I think that with the requirements we have now, we can drive the need too 🙂


Tangentially, since I haven't seen this mentioned: some of the Search UIs, such as those within Search Connectors, already support flows that install ELSER via an Inference Endpoint. This has some interesting interactions with how the assistants currently deploy ELSER via the trainedModels API. I go over those details in this Slack thread here, so just a heads up!