POST /_bulk
{ "index": { "_index": "index-test" } }
{ "name": "permission", "test": "Writing a list of random sentences is harder than I initially thought it would be.", "doc_keyword": "workable", "doc_index": 4976 }
{ "index": { "_index": "index-test" } }
{ "name": "sister", "test": "The fifty mannequin heads floating in the pool kind of freaked them out", "doc_keyword": "angry"}
{ "index": { "_index": "index-test" } }
{ "name": "hair", "test": "Too many prisons have become early coffins", "doc_keyword": "likeable", "doc_index": 2351 }
{ "index": { "_index": "index-test" } }
{ "name": "editor", "test": "Greetings from the real universe", "doc_index": 9871 }
{ "index": { "_index": "index-test" } }
{ "name": "statement", "test": "People keep telling me orange but I still prefer pink", "doc_keyword": "entire", "doc_index": 8242 }
Check for documents with an empty passage_embedding value. If the search returns any hits (>= 1), there are documents without embeddings. This is not the expected behavior: all documents were ingested with embeddings, and the only operation that could have removed them was the update.
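One way to run this check is a bool query that excludes every document where the field exists. This is a minimal sketch, not the exact query from the report: it assumes the index name index-test from the bulk request above and that passage_embedding is mapped so that an exists query can match it.

```
# Hypothetical check: return documents that have no passage_embedding value
GET /index-test/_search
{
  "query": {
    "bool": {
      "must_not": {
        "exists": { "field": "passage_embedding" }
      }
    }
  }
}
```

If hits.total.value in the response is 0, every document still has its embedding. Any value of 1 or more reproduces the problem.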
Describe the bug
Document values are updated after an update_by_query call even though an ingest pipeline is configured and one of the processors in that pipeline has failed.
Related component
Indexing
To Reproduce
Expected behavior
Because the processor has been configured with ignore_failure set to false, we expect the update call to fail and no changes to be stored.
Additional Details
Plugins: ml-commons, k-NN, neural-search
Host/Environment (please complete the following information):
Additional context
I've tried the same scenario without the exclude setting for the passage_embedding field, and it works as expected.
I assume that behind the scenes the document is still updated, but because all fields are included, it copies the passage_embedding field value from the original document.