ualbertalib / jupiter

Jupiter is a University of Alberta Libraries-based initiative to create a sustainable and extensible digital asset management system. This is phase 2 (Digitization).
https://era.library.ualberta.ca/
MIT License
23 stars 10 forks source link

ERA database cleaning -- "Temporary Community/Collection" -- non-migrated items #1337

Open abombak opened 4 years ago

abombak commented 4 years ago

This is not so much a bug as an investigation of ~200 ERA items that did not migrate successfully into Jupiter. The items were identified and tagged with a "Temporary Community/Collection" label and their metadata exported into a spreadsheet for investigation/re-entry. Student Holly was able to re-ingest all but 69 or so by using the existing record information to add missing data, assign a corrected Community/Collection, etc. Each successful/unsuccessful re-ingest has been noted in the spreadsheet. The remaining 68 failed items still need to be further investigated and corrected or deleted. The remaining fails seem to related to license selection, missing files, and incomplete records. The Temp C/C spreadsheet is open to all with the link: https://docs.google.com/spreadsheets/d/106B7FRNK0oLYPjNCDwDapMNG-V-owtwyACLff8_uswE/edit?usp=sharing

leahvanderjagt commented 4 years ago

@pbinkley @weiweishi I need to know if I can have a student access and review the failed items or if we are trying to locate them. We have the capacity to do this review, I need to ensure we are delivering data where promised in the application, please let us know if there is any way for one of our SLIS students to access the 68 failed item files.

abombak commented 4 years ago

Hi Leah, the Issue on GitHub has a more complete explanation -- this Temp C/C list has already been checked by student Holly and resolved most of the deposit issues (around 137?, from memory) -- these are the remaining items that will have to be investigated from the back end, I believe. Reasons for item failure are noted in the spreadsheet. I'm happy to provide more details during our ERA Cleaning meeting tomorrow w/ you, me, and Weiwei.

Anna

On Mon, Oct 28, 2019 at 12:48 PM leahvanderjagt notifications@github.com wrote:

@pbinkley https://github.com/pbinkley @weiweishi https://github.com/weiweishi I need to know if I can have a student access and review the failed items or if we are trying to locate them. We have the capacity to do this review, I need to ensure we are delivering data where promised in the application, please let us know if there is any way for one of our SLIS students to access the 68 failed item files.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/ualbertalib/jupiter/issues/1337?email_source=notifications&email_token=ADYF3CFNLVHRBUUUCB2RH43QQ4XYRA5CNFSM4JE4E7GKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOECN7R6Y#issuecomment-547092731, or unsubscribe https://github.com/notifications/unsubscribe-auth/ADYF3CDPO2WRTELNEHFIMG3QQ4XYRANCNFSM4JE4E7GA .

--

Anna Bombak, MLISDigital Content SpecialistUniversity of Alberta780-492-2202

leahvanderjagt commented 4 years ago

Sounds good, Antony is looking for a work so I am just hoping to get someone on this asap. LV

On Mon, Oct 28, 2019 at 1:16 PM abombak notifications@github.com wrote:

Hi Leah, the Issue on GitHub has a more complete explanation -- this Temp C/C list has already been checked by student Holly and resolved most of the deposit issues (around 137?, from memory) -- these are the remaining items that will have to be investigated from the back end, I believe. Reasons for item failure are noted in the spreadsheet. I'm happy to provide more details during our ERA Cleaning meeting tomorrow w/ you, me, and Weiwei.

Anna

On Mon, Oct 28, 2019 at 12:48 PM leahvanderjagt notifications@github.com wrote:

@pbinkley https://github.com/pbinkley @weiweishi https://github.com/weiweishi I need to know if I can have a student access and review the failed items or if we are trying to locate them. We have the capacity to do this review, I need to ensure we are delivering data where promised in the application, please let us know if there is any way for one of our SLIS students to access the 68 failed item files.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub < https://github.com/ualbertalib/jupiter/issues/1337?email_source=notifications&email_token=ADYF3CFNLVHRBUUUCB2RH43QQ4XYRA5CNFSM4JE4E7GKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOECN7R6Y#issuecomment-547092731 , or unsubscribe < https://github.com/notifications/unsubscribe-auth/ADYF3CDPO2WRTELNEHFIMG3QQ4XYRANCNFSM4JE4E7GA

.

--

Anna Bombak, MLISDigital Content SpecialistUniversity of Alberta780-492-2202

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/ualbertalib/jupiter/issues/1337?email_source=notifications&email_token=ABMJ4XCKSQKTSJBD3N4GYRLQQ43ARA5CNFSM4JE4E7GKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOECOCJQY#issuecomment-547103939, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABMJ4XGZ2KZIZKIL3NTZ7LTQQ43ARANCNFSM4JE4E7GA .

weiweishi commented 4 years ago

With the 69 remaining ones that need to be investigated, there are some needs to be investigated on the backend (the ones resulted in error messages), I will look into them. Others will need decisions on what license can be assigned, and whether they can be safely deleted. I wonder how we can approach those. Happy to discuss more at our meeting tomorrow.