LafayetteCollegeLibraries / metadb-api

A migration API for (legacy) MetaDB
1 stars 1 forks source link

Bad date values exist for "biol101" and East Asia Image Collection Project Items #24

Open jrgriffiniii opened 7 years ago

jrgriffiniii commented 7 years ago
lafayette-preserve/app/models/concerns/lafayette_concerns/work_behavior.rb:54:in `parse': invalid date (ArgumentError)

is found for values such as 1962-02

jrgriffiniii commented 7 years ago

This is also found within EAIC Projects:

{:title_japanese=>["(北海道アイヌ風俗) 疾走せんとする馬上の(アイヌ)人"], :description_indicia=>["[1/2 divided back] 郵便はがき", "[printed in brown]", "POST CARD"], :date_image_upper=>["1930"], :coverage_location=>[], :date_original=>[], :date_search=>[], :creator_maker=>[], :creator_company=>[], :creator_digital=>[], :description_text_english=>[], :description_critical=>[], :description_inscription_english=>[], :description_inscription_japanese=>[], :description_text_japanese=>[], :format_digital=>[], :format_extent=>[], :publisher_digital=>[], :relation_is_part_of=>[], :rights_digital=>[], :title_chinese=>[], :title_korean=>[], :description_ethnicity=>["Ainu"], :relation_seealso=>["[ip2145]"], :contributor=>["Michaela Kelly", "Rika Kamiyasu"], :format_medium=>["Picture postcard"], :coverage_location_country=>["Japan"], :title_english=>["[ip2146] Ainu on horse"], :date_artifact_upper=>["1945-08-15"], :date_artifact_lower=>["1933-02-15"], :date_image_lower=>[], :description_citation=>["(Nihon chiri fūzoku taikei vol. 14, 1930, p. 55)"], :subject_ocm=>["130 GEOGRAPHY", "136 FAUNA", "230 ANIMAL HUSBANDRY", "290 CLOTHING", "530 ARTS", "531 DECORATIVE ART"], :identifier_url_download=>[], :title=>["[ip2146] Ainu on horse"], :resource_type=>["image/jp2"]}

Where a datestamp is (erroneously) formatted as just 1930 for date_image_upper

jrgriffiniii commented 7 years ago

Another case from the EAIC Projects can be found in the following:

{:contributor=>["Ashutosh Tamrakar"], :coverage_location=>["Izu"], :coverage_location_country=>["Japan"], :creator_company=>[], :creator_digital=>["The East Asia Image Collection is a joint project between Paul D. Barclay, General Editor and Associate Professor of History, and Digital Scholarship Services, Skillman Library, Lafayette College. "], :creator_maker=>[], :date_artifact_lower=>[], :date_artifact_upper=>[], :date_image_lower=>[], :date_image_upper=>[], :date_original=>["19500100"], :date_search=>[], :description_citation=>[], :description_critical=>[], :description_ethnicity=>[], :description_indicia=>[], :description_inscription_english=>[], :description_inscription_japanese=>[], :description_text_english=>["Island off Izu penin."], :description_text_japanese=>[], :format_digital=>["Master TIF image captured with at least 4000 pixels across the long edge using SilverFast AI Studio 6.6 software and a Nikon 5000D slide scanner.  Online display image was converted to JPG format."], :format_extent=>["35mm"], :format_medium=>["Photographic slide"], :publisher_digital=>["Special Collections & College Archives, Skillman Library, Lafayette College"], :relation_is_part_of=>["East Asia Image Collection", "Gerald & Rella Warner Japan Slide Collection"], :relation_seealso=>[], :rights_digital=>["This image is posted publicly for non-profit educational use, excluding print publication.  For additional information, please see http://digital.lafayette.edu/copyright for our Reproduction, Use, and Copyright Guidelines."], :subject_ocm=>["130 GEOGRAPHY", "133 TOPOGRAPHY AND GEOLOGY", "500 WATER AND AIR TRANSPORT", "501 BOATS"], :title_chinese=>[], :title_english=>["[js0535] Island off Izu"], :title_japanese=>[], :title_korean=>[], :identifier_url_download=>["warner-slides-japan-00535"], :title=>["warner-slides-japan-00535"], :resource_type=>["image/jp2"]}

Where 19500100 is used for date_original

jrgriffiniii commented 7 years ago

Values for the fields date, date.original.display, date.original.search, date.approximate, and date.range can also contain problematic values between various projects:

Most of the ranges specified by cases such as 1925,[...],1930 appear to be contiguous