openeventdata / Dictionaries

PETRARCH actor, agent and verb dictionaries

Big changes to the dictionaries #7

Closed by ahalterman 10 years ago

ahalterman commented 10 years ago

This reorganization is aimed at bringing the dictionaries up to date and, more importantly, making them easier to understand and edit so that people besides me can make changes.

I did a few big things.

  1. Re-did some of the formatting to make it Petry
  2. Renamed files to get rid of dates in filenames and to set up for future consolidation of the dictionaries into just a few files (with as much in Countries as possible).

Some experiments:

  1. Added a bunch of offices and titles to the dictionaries for the US (things like "The State Department" = USAGOV). Before, they covered people's names pretty comprehensively, but no titles. I need to get an interactive PETRARCH instance up so I can test dictionary changes and see how much the little things need tweaking.
  2. I added explicit section breaks to the US dictionaries, labeling what each section includes. This should make it easier to make sure everything's up to date and should probably be extended to other countries, too.
  3. Began a post-2011 Egypt update but man, that's a lot of work.
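
Until an interactive PETRARCH instance is available, a rough way to sanity-check phrase-to-code mappings like the ones above is a small standalone lookup. The sketch below is not PETRARCH's parser: it assumes a simplified entry shape of `PHRASE_WITH_UNDERSCORES [CODE]` with `#` comment lines serving as section breaks, and the sample entries, section labels, and function names are hypothetical.

```python
import re

# Assumed, simplified entry shape: PHRASE_WITH_UNDERSCORES [CODE]
# (the real dictionary syntax has more features than this sketch handles).
ENTRY = re.compile(r"^(?P<phrase>[^\[\]#]+?)\s*\[(?P<code>[^\]]+)\]")


def load_entries(text):
    """Map each normalized phrase to its code, skipping comments and blanks."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # blank line or section-break comment
        match = ENTRY.match(line)
        if match:
            phrase = match.group("phrase").replace("_", " ").strip().upper()
            entries[phrase] = match.group("code").strip()
    return entries


def lookup(entries, phrase):
    """Return the code for a phrase, or None if the phrase is not covered."""
    return entries.get(phrase.strip().upper())


# Hypothetical sample content, modeled on the examples in this description.
SAMPLE = """\
# --- US government offices and titles (hypothetical section label) ---
THE_STATE_DEPARTMENT [USAGOV]
# --- Agents (hypothetical section label) ---
AIRLINER [CVL]
"""

entries = load_entries(SAMPLE)
print(lookup(entries, "The State Department"))
print(lookup(entries, "airliner"))
```

A lookup that returns `None` flags a phrase with no coverage, which is a quick way to spot missing titles before running a full coding pass.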

Some production fixes:

  1. Added some "airliner = CVL" stuff into the agents file, along with a few other updates, and deleted some weird artifacts from WordNet.
  2. Added the most minimal coverage of Serbia, Bosnia, Croatia, Montenegro, and Slovenia. They weren't in Countries at all.