osm-search / Nominatim

Open Source search based on OpenStreetMap data
https://nominatim.org
GNU General Public License v3.0
2.98k stars 701 forks source link

Implement a lighter format for wikipedia importance tables #3424

Closed lonvia closed 1 month ago

lonvia commented 1 month ago

Adds support for the new simpler CSV format for wikipedia importance values. This also comes with a much simplified table structure: redirects and articles are now in the same table and all unnecessary information has been dropped leaving only wikipedia article, wikidata ID and importance.

Support for the old-style wikipedia importance dumps remains in place for now. There will be official CSV dumps once we have removed the last obstacles in the generation process in https://github.com/osm-search/wikipedia-wikidata.

mtmail commented 1 month ago

I think nominatim/tools/check_database.py needs to check for the new database table, too.

lonvia commented 1 month ago

The latest commit should be implementing that. Or do you have a different place in mind?

mtmail commented 1 month ago

ah I see it now