Currently, high-cardinality categorical variables are extremely slow to process. The following optimizations would be helpful:
When constructing a categorical aggregator, detect whether all values are selected in the denominator. If so, we can short-circuit the denominator subset check.
Only check the denominator if the value is not found in the numerator.
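
A minimal sketch of both optimizations, not the project's actual code. The class name `CategoricalAggregator`, its constructor parameters, and the counter fields are all hypothetical, and the sketch assumes the numerator selection is always a subset of the denominator selection:

```python
from typing import Hashable, Iterable, Set


class CategoricalAggregator:
    """Hypothetical aggregator that counts numerator and denominator hits."""

    def __init__(
        self,
        categories: Iterable[Hashable],
        numerator: Iterable[Hashable],
        denominator: Iterable[Hashable],
    ) -> None:
        self.numerator: Set[Hashable] = set(numerator)
        self.denominator: Set[Hashable] = set(denominator)
        # Optimization 1: decide once, at construction time, whether the
        # denominator selects every category. If it does, per-value
        # denominator lookups can be skipped entirely.
        self.all_in_denominator: bool = self.denominator.issuperset(categories)
        self.num_count = 0
        self.den_count = 0

    def update(self, value: Hashable) -> None:
        if value in self.numerator:
            # Assumption: the numerator is a subset of the denominator,
            # so a numerator hit is also a denominator hit.
            self.num_count += 1
            self.den_count += 1
        # Optimization 2: the denominator is consulted only when the value
        # was not found in the numerator, and only if the flag is False.
        elif self.all_in_denominator or value in self.denominator:
            self.den_count += 1


agg = CategoricalAggregator(
    categories={"a", "b", "c"},
    numerator={"a"},
    denominator={"a", "b", "c"},
)
for v in ["a", "b", "a", "c"]:
    agg.update(v)
```

With this layout the cost of detecting the "all values selected" case is paid once per aggregator rather than once per value, which is where the saving for high-cardinality variables would come from.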