Added basic test to verify that faceting works with high cardinality …

mikemccand / luceneutil

Various utility scripts for running Lucene performance tests

Apache License 2.0

…fields, files added in this commit are not accurate benchmarks

Wrote a script that reads the National Address Database (NAD), which can be downloaded here: https://www.transportation.gov/gis/national-address-database/national-address-database-nad-disclaimer, then indexes it and runs some basic faceting tests. This is not an accurate benchmark, but it can probably be used as the basis for a real high-cardinality faceting benchmark in the future. It also serves as a test that faceting still works with high-cardinality facets, even if the benchmark timings are not accurate.

This change also needs to be used in conjunction with the SSDV hierarchical field changes: https://github.com/apache/lucene/pull/509
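For readers unfamiliar with hierarchical faceting, the following is a minimal, library-free sketch of the kind of counting such a test exercises. It is not the script added in this PR and does not use Lucene. The column names (`State`, `County`, `City`), the sample rows, and the helper names `facet_path`, `count_facets`, and `top_children` are all hypothetical; the real NAD schema may differ.

```python
import csv
import io
from collections import Counter

# Hypothetical column names; the real NAD schema may differ.
HIERARCHY = ["State", "County", "City"]

# Tiny made-up sample standing in for the NAD export.
SAMPLE = "\n".join([
    "State,County,City",
    "TX,Travis,Austin",
    "TX,Travis,Austin",
    "TX,Travis,Pflugerville",
    "TX,Harris,Houston",
    "OH,Franklin,Columbus",
    "OH,Franklin,",
])


def facet_path(row, columns):
    """Build a hierarchical path from the row, stopping at the first blank value."""
    path = []
    for col in columns:
        value = (row.get(col) or "").strip()
        if not value:
            break
        path.append(value)
    return tuple(path)


def count_facets(rows, columns):
    """Count rows under every prefix of each row's hierarchical path."""
    counts = Counter()
    for row in rows:
        path = facet_path(row, columns)
        for depth in range(1, len(path) + 1):
            counts[path[:depth]] += 1
    return counts


def top_children(counts, parent, n):
    """Return up to n direct children of `parent`, most frequent first."""
    parent = tuple(parent)
    depth = len(parent) + 1
    children = [
        (path[-1], count)
        for path, count in counts.items()
        if len(path) == depth and path[:-1] == parent
    ]
    # Sort by descending count, then by label for a stable order.
    children.sort(key=lambda item: (-item[1], item[0]))
    return children[:n]


if __name__ == "__main__":
    rows = csv.DictReader(io.StringIO(SAMPLE))
    counts = count_facets(rows, HIERARCHY)
    print(top_children(counts, (), 10))
    print(top_children(counts, ("TX", "Travis"), 10))
```

A high-cardinality test would feed many millions of rows with a very large number of distinct paths through the same shape of computation; the point of the sketch is only to show what "top children under a path" means.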

Added basic test to verify that faceting works with high cardinality … #156