We should investigate whether we need to keep indexing the features1 and features2 vectors in the catalogue pipeline images_indexed index. The relevant portion of the mapping file currently looks like this (both locally, and in the production ES cluster):
The current version of Elasticsearch adds "index": true by default, but this might not have been the case in older versions. There is a suspicion that we specified these mappings while working with an ES version which did not index them by default, and then updated to a version which started indexing them by default.
This suspicion arose from the fact that until recently, we were running automated tests with a version of Elasticsearch (8.5) which did not support indexing such large vectors in the first place.
The question we should answer is: "Is there a good reason for indexing these two vectors?" If the answer is no, we should stop indexing them because doing so is expensive.
We should investigate whether we need to keep indexing the
features1
andfeatures2
vectors in the catalogue pipelineimages_indexed
index. The relevant portion of the mapping file currently looks like this (both locally, and in the production ES cluster):However, the same portion looked different in the original mapping file we defined (note the absence of the
"index": true
property):The current version of Elasticsearch adds
"index": true
by default, but this might not have been the case in older versions. There is a suspicion that we specified these mappings while working with an ES version which did not index them by default, and then updated to a version which started indexing them by default.This suspicion arose from the fact that until recently, we were running automated tests with a version of Elasticsearch (8.5) which did not support indexing such large vectors in the first place.
The question we should answer is: "Is there a good reason for indexing these two vectors?" If the answer is no, we should stop indexing them because doing so is expensive.