HeardLibrary / vandycite

0 stars 0 forks source link

Gallery title cleanup #22

Closed baskaufs closed 1 year ago

baskaufs commented 2 years ago

Split off of #3

Correct languages, figure out languages for those without. Identify parts of titles that are actually translations and remove to use for labels in other languages. Compare titles with labels to see if there is any pattern for mistakes where the titles are in en but got missed. See if there is any pattern for titles identified as en but which are actually fr or some other language. How to handle titles that have a non-en title, but translation in parens -- can we use that to build labels in other languages?

baskaufs commented 2 years ago

This is addressed in the work_flow.ipynb script, in the Process titles for language section. Marymae is working on correcting the output for eventual upload as title properties for works that don't already have them.