HeardLibrary / vandycite

0 stars 0 forks source link

Decide what to do about English labels/descriptions captured from Commons #15

Closed baskaufs closed 2 years ago

baskaufs commented 2 years ago

The titles from ACT are clean and succinct, so by default I used them as the English labels for the future Wikidata items. However, there was additional information I gleaned from Commons that is presented in the column after the label_en column. Are there any cases where these are better labels than the ones from ACT? Or should they just be deleted and forgotten about?

baskaufs commented 2 years ago

After examining the hundred or so "label_commons" values in the output of the processing script, they are so all over the place that I don't think there is any systematic way to make use of them. I think that it is enough that they are available for reference while cleaning up the labels and any other metadata fields that their information might inform. They can just be deleted when the table is ready for upload. If the output is pushed to GitHub, they could always be re-examined if necessary.