mikemccand / luceneutil

Various utility scripts for running Lucene performance tests
Apache License 2.0
205 stars 115 forks source link

Added NAD facets to nightly benchmark #183

Closed mdmarshmallow closed 2 years ago

mdmarshmallow commented 2 years ago

Added python wrapper for the NAD facets and added to nightly benchmark.

jpountz commented 2 years ago

What does "NAD" mean?

mdmarshmallow commented 2 years ago

NAD is the National Address Database (for the United States) which holds a large amount of addresses (over 60M) in the US. It a natural hierarchy we could use for a hierarchical faceting test, and there were lots of data points so it worked well for a "high cardinality" hierarchical faceting test.