sul-dlss / FOLIO-Project-Stanford

Task management for Stanford’s analysis of FOLIO.
2 stars 0 forks source link

Need to account for non-"36105*" barcodes for BW-PARENT #354

Closed shelleydoljack closed 1 year ago

shelleydoljack commented 1 year ago

The find_bwparents.pl script greps and substitutes from the bw child 590 fields:

my $barcode_regex = '\|b\s*[0-9]+';
my $sed_substitution = 's/^.\+|b\(361[0-9]*\) .*$/\1/';

to get a file of BW-PARENT barcodes. But not all BW-PARENT barcodes follow this regex. There are some, like 001AAE2400, 48849-1001, 001AAZ0657.

shelleydoljack commented 1 year ago

@dlrueda i updated the find_bwparents.pl script and am running it with the /s/SUL/Dataload/folio_symphony_extract/Bibs/Bibs_workdir/bwchild_items_all to see that it works. I hope to have it working for the extract we want to do this evening.

shelleydoljack commented 1 year ago

Some barcodes that look like 001AGV5216 or like *-1001 might be BW-CHILD or BW-PARENT. Need to get a count of these and see what we're looking at as issues.

shelleydoljack commented 1 year ago

Number of non-36105 barcodes for BW-PARENTs: 1,155. Number of non-36105 barcodes for BW-CHILD: 211,440.

shelleydoljack commented 1 year ago

Number of non-36105 barcodes from BW-CHILD marc records' 590 fields that are BW-PARENTs: 6 Number of non-36105 barcodes from BW-CHILD marc records' 590 fields that are BW-CHILD: 1,599 non_36105_590_bcs_icat1.txt

shelleydoljack commented 1 year ago

Not sure what should be done here. Wait for Data Control to fix the 1,599 that are BW-CHILD in 590 $b? Modify find_bwparents.pl in the folio_symphony_extract scripts to not filter these out and let incorrect boundwith relationships get created in FOLIO for now (but supposedly will get corrected by the time we migrate to FOLIO)?

shelleydoljack commented 1 year ago

During the meeting with Vitus, we decided that I will send him the file of non-36105* barcodes from BW-CHILD marc records' 590 fields that are BW-CHILD and include the library too. Data Control or spec-coll will resolve the issues in Symphony so that we can expand the find_bwparents.pl to include non-36105 barcodes. If the item id in $b is a copy note and not a bw note (or if it is both), they will move it to subfield d.

shelleydoljack commented 1 year ago

Updated find_bwparents.pl to include non-"real" barcodes when evaluating the 590 subfield b fields to get the BW-PARENT barcode.