cdli-gh / data

This is a copy of the daily dump of catalogue and ATF data from the Cuneiform Digital Library Initiative (http://cdli.ucla.edu)
http://cdli.ucla.edu/bulk_data

Catalogue data for P277115 and P277116 run together #15

Closed rillian closed 5 years ago

rillian commented 5 years ago

There seems to be some corruption in the catalogue data export for P277115 and P277116. On line 124209 of cdli_catalogue_1of2.csv, the sub-genre comments column of the first tablet stops abruptly, without a closing quotation mark, and is followed by the entry for the second tablet on the same line.

[...],"Account; payments of shekel of ?; 10x16x2(u.e.)x2(le.e.,,,,21198/zz001w65nd,"no atf",[..]
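
A quick way to look for other rows with the same symptom is to flag lines that contain an odd number of double quotes. This is only a minimal sketch: it assumes each catalogue record sits on one physical line (a legitimate multi-line quoted field would also be flagged), and the function name and sample rows, including the IDs, are hypothetical and not taken from the real export.

```python
def find_unbalanced_quote_lines(lines):
    """Return 1-based numbers of lines containing an odd number of double quotes.

    Escaped quotes inside a CSV field are written as "" and so keep the
    count even; an odd count suggests a field that was never closed.
    """
    suspects = []
    for number, line in enumerate(lines, start=1):
        if line.count('"') % 2 == 1:
            suspects.append(number)
    return suspects


# Hypothetical sample: the third line mimics two records run together
# because the comments field is missing its closing quotation mark.
sample = [
    'id,comments,atf',
    'P000001,"Account; complete",ok',
    'P000002,"Account; 10x16x2(u.e.)x2(le.e.,,,,P000003,"no atf",x',
]
print(find_unbalanced_quote_lines(sample))
```

For a real file, the same function can be given an open file object, for example `find_unbalanced_quote_lines(open("cdli_catalogue_1of2.csv", encoding="utf-8"))`, assuming the file is UTF-8 encoded.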

rillian commented 5 years ago

The archive page for P277115 (N 2004) has slightly more data in this column, but still looks a little strange.

Account; payments of shekel of ?; 10x16x2(u.e.)x2(le.e. lines

Maybe there's a special character in the database which is causing escaping problems with the export?
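
To illustrate the kind of escaping being guessed at here: the thread does not say how the export is written or what actually went wrong, so the following is only a sketch of how Python's standard `csv` module quotes fields containing quotation marks or line breaks so that they survive a round trip. The `export_rows` helper and the sample rows are hypothetical.

```python
import csv
import io


def export_rows(rows):
    """Serialise rows to CSV text, letting csv.writer handle quoting."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, quoting=csv.QUOTE_MINIMAL, lineterminator="\n")
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical records; the first comment field contains both a quotation
# mark and an embedded line break, which naive string joining would mangle.
rows = [
    ["P000001", 'Account; 10x16x2(u.e.)x2(le.e.) "lines"\nsecond line', "no atf"],
    ["P000002", "Letter", "no atf"],
]

text = export_rows(rows)
parsed = list(csv.reader(io.StringIO(text)))
print(parsed == rows)  # True when every field survives the round trip
```

If the export instead builds each line by concatenating strings, a field with such a character could plausibly produce the run-together row described above, but that is an assumption rather than something confirmed in this issue.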

epageperron commented 5 years ago

Merged #18 in. @rillian, you're not alone :) I'll try to fix this ASAP.

epageperron commented 5 years ago

I've updated the entry, updated the catalogue in MySQL, and run the export script and the commit script. Please try again anytime. @rillian, @jnovotny-lmu

rillian commented 5 years ago

Confirmed fixed. Thanks!