elastic / elasticsearch

Free and Open Source, Distributed, RESTful Search Engine
https://www.elastic.co/products/elasticsearch
Other
977 stars 24.82k forks source link

Use Well Known Binary Format for `geo_point` synthetic source #108990

Open salvatore-campagna opened 5 months ago

salvatore-campagna commented 5 months ago

Description

Currently geo_ponit doc values use quantisation when storing doc values. As a result, reconstructing documents in synthetic source using doc values results in accuracy loss and in the reconstructed document not matching the original source. We can fix the accuracy issue by using the WKB format and consider giving users a storage versus accuracy tradeoff by taking advantage of the store option.

The idea is to make store: true the default and using WKB for the stored field. When store is true we can reconstruct the document in synthetic source by using the stored field without accuracy loss. When store: false is used we can use the. quantized representation in doc values by trading accuracy for storage reduction.

elasticsearchmachine commented 5 months ago

Pinging @elastic/es-storage-engine (Team:StorageEngine)

lkts commented 5 months ago

Same as #108981 ?