Closed by rendel 8 years ago
Going back to the original idea: it is generally better to leave control to the user. Also, Kibi can easily detect when a field can be encoded with integers and recommend it to the user.
Should I open an issue in Kibi to support this feature, so that it can give a warning?
Yes, I think we need an umbrella issue for the integration of the siren-join plugin with Kibi. At the moment Kibi is missing:
We can automatically detect when terms can be encoded as integers instead of longs by using the max value returned by the field stats. In addition, when integer encoding applies, we could use a vint encoding to compress the set of terms: these integers are less likely to follow a random distribution than long values, which are likely to be generated hashes.
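To make the idea concrete, here is a rough sketch in Python (not the plugin's actual code). It assumes the field stats give us a min and a max value for the field, and it uses the common 7-bits-per-byte variable-length layout with the high bit as a continuation flag. The names `choose_encoding`, `encode_vint`, and `encode_terms` are hypothetical and only serve the illustration.

```python
INT32_MIN = -2**31
INT32_MAX = 2**31 - 1


def choose_encoding(min_value, max_value):
    """Pick 'integer' when the whole value range fits a signed 32-bit int.

    Assumption: min_value and max_value come from the field stats.
    """
    if INT32_MIN <= min_value and max_value <= INT32_MAX:
        return "integer"
    return "long"


def encode_vint(value):
    """Encode one non-negative integer with 7 payload bits per byte.

    The high bit of each byte is set when more bytes follow, so small
    values take a single byte.
    """
    if value < 0:
        raise ValueError("this sketch only handles non-negative values")
    out = bytearray()
    while value >= 0x80:
        out.append((value & 0x7F) | 0x80)  # low 7 bits, continuation bit set
        value >>= 7
    out.append(value)  # final byte, continuation bit clear
    return bytes(out)


def encode_terms(terms):
    """Concatenate the vint encoding of every term in the set."""
    out = bytearray()
    for term in terms:
        out += encode_vint(term)
    return bytes(out)
```

With this layout, small identifiers take one or two bytes each instead of a fixed four or eight, which is where the gain would come from if the terms are not hash-like.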