As a first solution for #466, we need to force users to add the columnStats when indexing Tables with the following characteristics:
Underlying data source changes constantly.
DataFrame contains non-deterministic columns to index.
DataFrame contains non-deterministic predicates.
ColumnStats would infer the data's min/max values before the DataFrame Analysis, which can produce inconsistent results when loading the DataFrame twice for Indexing in any of the above use cases.
As a first solution for #466, we need to force users to add the
columnStats
when indexing Tables with the following characteristics:ColumnStats would infer the data's min/max values before the DataFrame Analysis, which can produce inconsistent results when loading the DataFrame twice for Indexing in any of the above use cases.