emory-libraries / dlp-curate

Digital curation and preservation workbench for the Emory Preservation Repository.

Run preprocessor on Office of Alumni Pubs photos box 10 #1628

Closed kmichaelis closed 3 years ago

kmichaelis commented 3 years ago

Please run the preprocessor on the CSV for the files in box 10 of the Office of Alumni Publications Photographs.

CSV file

This can be run in Langmuir mode.

See the Langmuir notes in the Rake Tasks Tutorial.

When finished, please save the CSV in the FY21 processed folder.

bwatson78 commented 3 years ago

@kmichaelis I'm going to fix this filename, which is producing errors: EUA0179_B010_F013_I016_P002ARCH.tif

bwatson78 commented 3 years ago

@kmichaelis The preprocessor is expecting metadata for this item, but none was found (note :metadata=>nil):

{:metadata=>nil, :filesets=>{2=>#<CSV::Row "source_row":124 "deduplication_key":"EUA0179_B010_F002_I0029" "type":"fileset" "fileset_label":nil "preservation_master_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F002/ARCH/EUA0179_B010_F002_I0029_P002_ARCH.tif" "intermediate_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F002/PROD/EUA0179_B010_F002_I0029_P002_PROD.tif" "other_identifiers":nil "abstract":nil "administrative_unit":nil "local_call_number":nil "creator":nil "date_created":nil "Desc - Date Created - Date Precision":nil "date_issued":nil "content_genres":nil "holding_repository":nil "institution":nil "publisher":nil "emory_rights_statements":nil "rights_statement":nil "subject_names":nil "subject_geo":nil "subject_topics":nil "title":nil "content_type":nil "data_classifications":nil "Ingest.workflow_notes":nil "Digital Object - Parent Identifier":nil "visibility":nil "Directory Path":nil "File Size":nil "Filename":nil "Path":nil "Ingest.workflow_rights_basis":nil "Ingest.workflow_rights_basis_date":nil "Ingest.workflow_rights_basis_note":nil "Accession.workflow_rights_basis":nil "Accession.workflow_rights_basis_date":nil "Accession.workflow_rights_basis_reviewer":nil "Accession.workflow_rights_basis_note":nil "sensitive_material":nil "sensitive_material_note":nil "extent":nil "sublocation":nil "source_collection_id":nil>}}
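
The dump above shows :metadata=>nil next to a fileset row, which suggests that this deduplication key has no row carrying descriptive metadata. A minimal sketch of a pre-check that could catch such keys before running the preprocessor follows. It assumes (this is not confirmed in the thread) that a metadata row is any row whose type is something other than fileset; the method name and the sample keys are hypothetical.

```ruby
require "csv"

# Returns the deduplication keys that have fileset rows but no metadata row.
# Assumption: any row whose "type" is not "fileset" carries the item's metadata.
def keys_missing_metadata(csv_text)
  rows = CSV.parse(csv_text, headers: true)
  groups = rows.group_by { |row| row["deduplication_key"] }
  orphaned = groups.reject do |_key, group|
    group.any? { |row| row["type"] != "fileset" }
  end
  orphaned.keys
end

# Hypothetical sample data: the second key has a fileset row only.
SAMPLE = <<~ROWS
  deduplication_key,type,title
  EXAMPLE_I0001,work,Example title
  EXAMPLE_I0001,fileset,
  EXAMPLE_I0002,fileset,
ROWS

puts keys_missing_metadata(SAMPLE).inspect
# prints ["EXAMPLE_I0002"]
```

Running a check like this over the whole CSV would list every affected key at once instead of surfacing them one preprocessor run at a time.
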
kmichaelis commented 3 years ago

@bwatson78 The file has been edited and reuploaded.

bwatson78 commented 3 years ago

@kmichaelis Same issue with this line:

{:metadata=>nil, :filesets=>{1=>#<CSV::Row "source_row":1109 "deduplication_key":"EUA0179_B010_F011_I0052" "type":"fileset" "fileset_label":nil "preservation_master_file":nil "intermediate_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F011/PROD/EUA0179_B010_F011_I0052_P001_PROD.tif" "other_identifiers":nil "abstract":nil "administrative_unit":nil "local_call_number":nil "creator":nil "date_created":nil "Desc - Date Created - Date Precision":nil "date_issued":nil "content_genres":nil "holding_repository":nil "institution":nil "publisher":nil "emory_rights_statements":nil "rights_statement":nil "subject_names":nil "subject_geo":nil "subject_topics":nil "title":nil "content_type":nil "data_classifications":nil "Ingest.workflow_notes":nil "Digital Object - Parent Identifier":nil "visibility":nil "Directory Path":nil "File Size":nil "Filename":nil "Path":nil "Ingest.workflow_rights_basis":nil "Ingest.workflow_rights_basis_date":nil "Ingest.workflow_rights_basis_note":nil "Accession.workflow_rights_basis":nil "Accession.workflow_rights_basis_date":nil "Accession.workflow_rights_basis_reviewer":nil "Accession.workflow_rights_basis_note":nil "sensitive_material":nil "sensitive_material_note":nil "extent":nil "sublocation":nil "source_collection_id":nil>, 2=>#<CSV::Row "source_row":1111 "deduplication_key":"EUA0179_B010_F011_I0052" "type":"fileset" "fileset_label":nil "preservation_master_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F011/ARCH/EUA0179_B010_F011_I0052_P002_ARCH.tif" "intermediate_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F011/PROD/EUA0179_B010_F011_I0052_P002_PROD.tif" "other_identifiers":nil "abstract":nil "administrative_unit":nil "local_call_number":nil "creator":nil "date_created":nil "Desc - Date Created - Date Precision":nil "date_issued":nil "content_genres":nil "holding_repository":nil "institution":nil "publisher":nil "emory_rights_statements":nil "rights_statement":nil "subject_names":nil "subject_geo":nil "subject_topics":nil "title":nil "content_type":nil "data_classifications":nil "Ingest.workflow_notes":nil "Digital Object - Parent Identifier":nil "visibility":nil "Directory Path":nil "File Size":nil "Filename":nil "Path":nil "Ingest.workflow_rights_basis":nil "Ingest.workflow_rights_basis_date":nil "Ingest.workflow_rights_basis_note":nil "Accession.workflow_rights_basis":nil "Accession.workflow_rights_basis_date":nil "Accession.workflow_rights_basis_reviewer":nil "Accession.workflow_rights_basis_note":nil "sensitive_material":nil "sensitive_material_note":nil "extent":nil "sublocation":nil "source_collection_id":nil>}}
kmichaelis commented 3 years ago

@bwatson78 I edited and reuploaded the CSV.

bwatson78 commented 3 years ago

@kmichaelis Same issue with this line:

{:metadata=>nil, :filesets=>{2=>#<CSV::Row "source_row":1417 "deduplication_key":"EUA0179_B010_F012_I0091" "type":"fileset" "fileset_label":nil "preservation_master_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F012/ARCH/EUA0179_B010_F012_I0091_P002_ARCH.tif" "intermediate_file":"dmfiles/MARBL/Archives/EUA_0179_AlumniPubs/B010/F012/PROD/EUA0179_B010_F012_I0091_P002_PROD.tif" "other_identifiers":nil "abstract":nil "administrative_unit":nil "local_call_number":nil "creator":nil "date_created":nil "Desc - Date Created - Date Precision":nil "date_issued":nil "content_genres":nil "holding_repository":nil "institution":nil "publisher":nil "emory_rights_statements":nil "rights_statement":nil "subject_names":nil "subject_geo":nil "subject_topics":nil "title":nil "content_type":nil "data_classifications":nil "Ingest.workflow_notes":nil "Digital Object - Parent Identifier":nil "visibility":nil "Directory Path":nil "File Size":nil "Filename":nil "Path":nil "Ingest.workflow_rights_basis":nil "Ingest.workflow_rights_basis_date":nil "Ingest.workflow_rights_basis_note":nil "Accession.workflow_rights_basis":nil "Accession.workflow_rights_basis_date":nil "Accession.workflow_rights_basis_reviewer":nil "Accession.workflow_rights_basis_note":nil "sensitive_material":nil "sensitive_material_note":nil "extent":nil "sublocation":nil "source_collection_id":nil>}}
bwatson78 commented 3 years ago

@kmichaelis Can you also rename EUA0179_B010_F013_I016_P002ARCH.tif to EUA0179_B010_F013_I016_P002_ARCH.tif in your next edit?
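
For reference, a rough sketch of how a name like this could be repaired automatically. It assumes (this is inferred only from the filenames in this thread) that the part number (P followed by digits) should always be separated from the ARCH or PROD suffix by an underscore; the constant and method names are hypothetical.

```ruby
# Matches a part number immediately followed by ARCH or PROD at the end of a
# .tif filename, i.e. the case where the separating underscore is missing.
# Assumption: the naming convention is ..._P###_ARCH.tif / ..._P###_PROD.tif.
MISSING_UNDERSCORE = /(P\d+)(ARCH|PROD)(?=\.tif\z)/

# Inserts the missing underscore; names that already follow the convention
# are returned unchanged.
def fix_part_suffix(filename)
  filename.sub(MISSING_UNDERSCORE, '\1_\2')
end

puts fix_part_suffix("EUA0179_B010_F013_I016_P002ARCH.tif")
# prints EUA0179_B010_F013_I016_P002_ARCH.tif
```

The same pattern could be used with match? to flag offending rows rather than rewrite them.
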

kmichaelis commented 3 years ago

@bwatson78 Fixed and reuploaded the CSV.

bwatson78 commented 3 years ago

@kmichaelis The processed file is saved in the folder indicated above.

kmichaelis commented 3 years ago

Thanks @bwatson78 !