Open cmenguy opened 1 year ago
Hey @cmenguy, I'm happy to hear that you found it helpful.
Could you please share some more details:
Thanks @Gelerion here are some details:
Dataframe
API as val hllSketches = data.groupBy('dimensionId).agg(hll_sketch_build('userId).as("sketch"))
spark-sketches
I used 1.0.0 which was I think the latest available in Maven Central at the time.Thx for the details.
To ensure compatibility with the 3.2.x
branch, it is recommended that you use 0.9.0
.
This is due to internal Spark changes in the 3.3.x branch that are not compatible with the 3.2.x branch.
Anyways, didn't expect an NPE error, I will check it.
Great library, would love to see more in that area!
I am able to use it with Theta sketches just fine, but on the same data with HLL I am seeing the following NPE error: