srophe / britishLibrary-data

GNU General Public License v3.0

Entity reconciliation for persName, placeName, bibl, and ref (for titles) #90

Open wlpotter opened 2 years ago

wlpotter commented 2 years ago

As with authors and works (cf. #89 for the latter), we will need to create a spreadsheet for reconciling entities tagged with persName, placeName, bibl, or ref elements.

These will be trickier as they can appear in a number of locations both in msContents and in additions.

The hardest might be ref elements, as we use this element for more than just titles. This will need further research.
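
As a starting point for that spreadsheet, here is a minimal sketch of pulling the tagged entities out of one record. It assumes the records are TEI XML in the standard TEI namespace and that a URI, where present, sits in @ref (or @target for ref elements). The function names, column names, and sample record are made up for illustration and are not taken from the repo.

```python
import csv
import io
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ENTITY_TAGS = ("persName", "placeName", "bibl", "ref")
SECTIONS = ("msContents", "additions")  # the two locations named in this issue


def collect_entities(tei_xml: str) -> list[dict]:
    """Return one row per tagged entity found under msContents or additions."""
    root = ET.fromstring(tei_xml)
    rows = []
    for section in SECTIONS:
        for container in root.iter(f"{{{TEI_NS}}}{section}"):
            for tag in ENTITY_TAGS:
                for el in container.iter(f"{{{TEI_NS}}}{tag}"):
                    # Collapse whitespace so multi-line element content is one cell.
                    text = " ".join("".join(el.itertext()).split())
                    # Assumption: any existing URI lives in @ref or @target.
                    uri = el.get("ref") or el.get("target") or ""
                    rows.append(
                        {"section": section, "element": tag, "text": text, "uri": uri}
                    )
    return rows


def rows_to_csv(rows: list[dict]) -> str:
    """Serialize the rows as CSV text that can be opened as a spreadsheet."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["section", "element", "text", "uri"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Invented sample record, only for trying the functions out.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <msDesc>
    <msContents>
      <msItem>
        <title>Letter to <persName ref="https://example.org/person/1">Jacob</persName></title>
      </msItem>
    </msContents>
    <physDesc>
      <additions>
        <item>Note mentioning <placeName>Edessa</placeName></item>
      </additions>
    </physDesc>
  </msDesc>
</TEI>"""

if __name__ == "__main__":
    print(rows_to_csv(collect_entities(SAMPLE)))
```

Keeping the section and element name as columns should make it easier to filter out the ref elements that are not titles once that research is done.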

wlpotter commented 2 years ago

Not a priority for first release, I don't think. But what we might want to do first is look at how to clean up some of this data. For example, if we have an author element, I don't think that same entity should be tagged with a persName in the title element. I also don't think we want to have patronymics tagged?

There are also cases of tagging things like 'saints' or 'departed' (the latter was me, I think...) when we may instead want to have these as eventual subject tags in the work record. The data normalization should take precedence over entity reconciliation, I believe.
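
To get a sense of how often the author/title overlap happens, something like the following could flag candidates for cleanup. This is a rough sketch, not a settled rule: it assumes TEI XML in the standard namespace, looks only at author and title elements that are direct children of an msItem, and treats a persName as redundant when its @ref or its normalized text matches the author's. Both the matching criterion and the function names are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"


def _keys(el: ET.Element) -> set[str]:
    """Identifiers used for matching: the @ref value and the normalized text."""
    text = " ".join("".join(el.itertext()).split()).casefold()
    return {value for value in (el.get("ref"), text) if value}


def find_redundant_persnames(tei_xml: str) -> list[str]:
    """List persName texts inside a title that duplicate the msItem's author."""
    root = ET.fromstring(tei_xml)
    findings = []
    for item in root.iter(f"{{{TEI_NS}}}msItem"):
        author_keys: set[str] = set()
        for author in item.findall(f"{{{TEI_NS}}}author"):
            author_keys |= _keys(author)
        if not author_keys:
            continue  # no author element, so nothing to compare against
        for title in item.findall(f"{{{TEI_NS}}}title"):
            for pers in title.iter(f"{{{TEI_NS}}}persName"):
                if _keys(pers) & author_keys:
                    findings.append(" ".join("".join(pers.itertext()).split()))
    return findings


# Invented sample record, only for trying the function out.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <msItem>
    <author ref="https://example.org/person/1">Ephrem</author>
    <title>Hymns of <persName ref="https://example.org/person/1">Ephrem</persName>
      on <persName>Mary</persName></title>
  </msItem>
</TEI>"""

if __name__ == "__main__":
    print(find_redundant_persnames(SAMPLE))
```

The output is only a list of candidates to review by hand; whether a given persName should actually be untagged is still an editorial decision.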

wlpotter commented 1 month ago

persName, placeName, and bibl were started during workshop proofreading. ref for titles should be split into a separate issue.