ucsdlib / damspas-rd

A Digital Collections application based on Hyrax
MIT License
3 stars 2 forks source link

Support batch ingest of non-Work records #48

Open mcritchlow opened 7 years ago

mcritchlow commented 7 years ago

Descriptive summary

DAMS Manager currently provides support for batch ingesting subjects. This feature needs to be ported to horton, with support for new linked record object types:

Currently assuming the file format for ingest is Excel .xlsx

Rationale

The pre-ingesting of records that Works will link to is part of the current DAMS4 workflow that @ucsdlib/domm would like to continue and port to DAMS5.

This feature would be used (optionally) prior to the delimited batch ingest for Works.

Related work

46

ghost commented 7 years ago

Yes, we will continue to use Excel .xlsx for the ingest file. Updated template to be provided.

New feature for Pre-ingest validation

ghost commented 7 years ago

Here is the sample import Excel file. I added alternateLabel for cases where we are changing the label from the data provider to a different form - we still want access via that other form. [Uploading non-object_import.xlsx…]()

lsitu commented 7 years ago

@GregReser It looks like something is wrong with the sample import Excel file "Uploading non-object_import.xlsx…" that you attached. Could you take a look?

ghost commented 7 years ago

@lsitu Can't get the spreadsheet to upload, so I sent it to you in an email

lsitu commented 7 years ago

Thanks @GregReser. Yes, I see we can't attach the XML format files in the past and I have to use a different file extensions like .txt. But it looks like it should support files types like PNG, GIF, JPG, DOCX, PPTX, XLSX, TXT, PDF, or ZIP.

lsitu commented 7 years ago

@GregReser I see the following Agent types in the sample import Excel file: Agent:Person Agent:Group Agent:Organization

From the data dictionary, I am a little confusing with their complexities:

ghost commented 7 years ago

@lsitu We have added several new columns to the template. I have created a draft for our discussions at: https://docs.google.com/a/ucsd.edu/spreadsheets/d/1EVfMBmWfamqEXZWE9mgWMZPGT_7zN8wGtYArZisJUXA/edit?usp=sharing

Since we are still discussing some issues, like ORCID, there may be more changes coming. :)