internetarchive / openlibrary

One webpage for every book ever published!
https://openlibrary.org
GNU Affero General Public License v3.0
5.11k stars 1.34k forks source link

Reimport not correcting or warning mismatches #1622

Open tfmorris opened 5 years ago

tfmorris commented 5 years ago

This record was apparently imported incorrectly originally, with an extra space in the title, but on reimport was neither corrected nor flagged as a problem: https://openlibrary.org/books/OL22235433M/Gra_mmaire_au_secondaire_100?m=history

The reimport source record has the correct title(s): https://ia801502.us.archive.org/fetchmarc.php?path=%2F8%2Fitems%2Fgrammaire100ause0000chen%2Fgrammaire100ause0000chen_marc.xml as does the original source record: https://openlibrary.org/show-records/marc_laurentian/openlibrary.mrc:730901965:548

xayhewalo commented 4 years ago

@hornc This seems related to your work. I've labeled it high priority since it seems critical to the current data clean-up efforts. Have there been any improvements to the import process since this issue was made? Also are you willing to be assignee?