ngds / ngds-metadatabase-v4

MIT License
3 stars 1 forks source link

Remove some high-hit keywords #6

Closed aazaff closed 3 years ago

aazaff commented 4 years ago

Some of the keywords are really useless because they are so generic. For example, "geothermal" doesn't really make sense given that the entire database is geothermal. Also there are many metadata keywords like usginres:document:im... that are more or less non-sensical.

We could either:

  1. Create a small list of keywords to be filtered out of the facets that you could hard-code in.
  2. Maybe change the facet algorithm to ignore keywords with more than 5,000 hits or something like that.