sul-dlss / FOLIO-Project-Stanford

Task management for Stanford’s analysis of FOLIO.
2 stars 0 forks source link

Extract data for WITHDRAWN and SHADOW items #520

Closed dlrueda closed 10 months ago

dlrueda commented 1 year ago

We want to extract some information about these items/titles, many of which will not be migrated to FOLIO.

Suggested fields to retain https://docs.google.com/document/d/1rPN4xACFj09WiVUvski0nfhBEMwmS5LGNKaLmfrZyR0/edit

Work with Irina on data format, I think the plan is to supply her data and she will either work with IndexData to load to metadb, or use tab-delimited format and be able to include in Tableau queries/dashboards perhaps.

dlrueda commented 12 months ago

Scripts /s/SUL/Bin/folio_symphony_extract/Bibs/get_withdrawn_shadow_data.ksh and item_xinfo_wd.ksh written and called manually to generate 3 tsv files in /s/SUL/Dataload/folio_symphony_extract/Bibs/Withdrawn_data: SHADOW-WITHDRAWN.items.techstaff.tsv SHADOW-WITHDRAWN.tagdata.tsv SHADOW-WITHDRAWN.items.tsv

A couple working files made it in there too but didn’t bother to clean everything up perfectly since it’s a one-time job.

Header added to bib tag file manually instead of adding to script; could easily just add to script if want to be able to run from cron.

dlrueda commented 12 months ago

Delivered tsv files G-drive for External data for metadb

and @trapido notified for review