WGBH-MLA / ams

Archival Management System to support the American Archive of Public Broadcasting
GNU General Public License v3.0

List of Digitized, Not Online Items from KCAW #887

Closed ekemeyer closed 5 months ago

ekemeyer commented 5 months ago

Details

Could I get an Excel spreadsheet identifying everything from Alaska station KCAW that is digitized but not available online? The station manager requested it. It should be about 40 items, I think. Let me know if you have any questions - thanks!

Submitted by: Michelle
CC in communications:
Priority: Medium (within this month)
URL: https://americanarchive.org/catalog?f%5Bcontributing_organizations%5D%5B%5D=KCAW+%28AK%29&f[access_types][]=digitized
Slack message thread:

foo4thought commented 5 months ago

Done; the GUIDs you seek are in the attached file.

Gory details:

mylocal:~ kevin_carter$ start=0; numFound=$(curl -sS 'https://americanarchive.org/api.json?fl=id&rows=1&q=%28contributing_organizations:%22KCAW+%28AK%29%22%20AND%20access_types:digitized%29%20NOT%20%28access_types:online%29&start='$start | jq -r '.response.numFound'); while [ "$start" -lt "$numFound" ]; do curl -sS 'https://americanarchive.org/api.json?fl=id&rows=10&q=%28contributing_organizations:%22KCAW+%28AK%29%22%20AND%20access_types:digitized%29%20NOT%20%28access_types:online%29&start='$start | jq -r '.response.docs[].id' | tr '_' '-' >> Desktop/guids_KCAW_online_not_digitized.txt; start=$((start+10)); done

mylocal:~ kevin_carter$ wc -l Desktop/guids_KCAW_online_not_digitized.txt

  40 Desktop/guids_KCAW_online_not_digitized.txt
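For readers who want to reuse the one-liner, here is a rough unpacking of it into two functions. This is a sketch rather than what was actually run: the names `page_starts` and `list_guids` are made up for illustration, while the URL, the encoded query, and the JSON fields (`.response.numFound`, `.response.docs[].id`) are the ones the command above uses. It assumes `curl` and `jq` are installed.

```shell
# Sketch only: function names are illustrative, not part of any AAPB tooling.
API='https://americanarchive.org/api.json'
Q='%28contributing_organizations:%22KCAW+%28AK%29%22%20AND%20access_types:digitized%29%20NOT%20%28access_types:online%29'

# page_starts TOTAL ROWS: print the start offset of each page, one per line.
# The body runs in a subshell so its variables do not leak to the caller.
page_starts() (
  total=$1 rows=$2 start=0
  while [ "$start" -lt "$total" ]; do
    echo "$start"
    start=$((start + rows))
  done
)

# list_guids [ROWS]: fetch every page and print one GUID per line,
# with '_' mapped to '-' as in the original command. Needs network access.
list_guids() (
  rows=${1:-10}
  total=$(curl -sS "${API}?fl=id&rows=1&q=${Q}&start=0" | jq -r '.response.numFound') || exit 1
  # Stop early if the API did not return a plain number.
  case $total in ''|*[!0-9]*) echo "unexpected numFound: $total" >&2; exit 1 ;; esac
  for start in $(page_starts "$total" "$rows"); do
    curl -sS "${API}?fl=id&rows=${rows}&q=${Q}&start=${start}" |
      jq -r '.response.docs[].id' | tr '_' '-'
  done
)
```

Something like `list_guids 10 > guids.txt` followed by `wc -l guids.txt` would then mirror the session above, which reported 40 lines.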

Kevin Carter Senior Systems Analyst/Programmer 617-300-5923


From: Erica Titkemeyer
Sent: Friday, May 31, 2024 3:09 PM
To: WGBH-MLA/ams
Cc: Kevin Carter; Assign
Subject: Re: [WGBH-MLA/ams] List of Digitized, Not Online Items from KCAW (Issue #887)

Assigned #887 (https://github.com/WGBH-MLA/ams/issues/887) to @foo4thought (https://github.com/foo4thought).



foo4thought commented 5 months ago

  1. Convert the catalog controller query into an equivalent query against the API controller.
  2. Use curl and jq in Bash to collect the GUIDs in batches until done.
  3. Send email with the results.
  4. Type this.
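Step 1 amounts to turning the two catalog facets (contributing organization and access type) into one Solr-style query string. A minimal sketch of that, under stated assumptions: `solr_query` and `count_matches` are hypothetical helper names, the query text is simply the decoded form of the `q` parameter in the commands in this thread, and `curl -G --data-urlencode` is used here to do the percent-encoding instead of encoding by hand.

```shell
# Sketch only: helper names are illustrative.

# solr_query ORG: print the unencoded query for one contributing
# organization: digitized, but not online.
solr_query() {
  printf '(contributing_organizations:"%s" AND access_types:digitized) NOT (access_types:online)\n' "$1"
}

# count_matches ORG: ask the API how many records match.
# Needs curl, jq, and network access. With -G, curl appends each
# --data-urlencode pair to the URL as an encoded query parameter.
count_matches() {
  curl -sS -G 'https://americanarchive.org/api.json' \
    --data-urlencode "q=$(solr_query "$1")" \
    --data-urlencode 'fl=id' \
    --data-urlencode 'rows=1' |
    jq -r '.response.numFound'
}
```

Calling `count_matches 'KCAW (AK)'` should then report the same count that the batch loop uses as its stopping point, and swapping in another organization name would reuse the query for a different station.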

Bonus: since the number of results was small, here they are:
cpb-aacip-190-03qv9snr
cpb-aacip-190-08hdr8mc
cpb-aacip-190-106wwqzq
cpb-aacip-190-12m63zwr
cpb-aacip-190-149p8f85
cpb-aacip-190-15bcc424
cpb-aacip-190-16c2fsj7
cpb-aacip-190-19s1rq6k
cpb-aacip-190-22h70v08
cpb-aacip-190-27mpg6wr
cpb-aacip-190-31qftx7n
cpb-aacip-190-322bvt62
cpb-aacip-190-36tx997m
cpb-aacip-190-386hdvqv
cpb-aacip-190-38jdfrm8
cpb-aacip-190-40xpp0g9
cpb-aacip-190-41zcrp65
cpb-aacip-190-44pk0t66
cpb-aacip-190-48sbchfw
cpb-aacip-190-49t1g5xn
cpb-aacip-190-5269pdxg
cpb-aacip-190-53jwt0h3
cpb-aacip-190-569324jf
cpb-aacip-190-5918986m
cpb-aacip-190-612ngmcx
cpb-aacip-190-676t1nwq
cpb-aacip-190-6986716v
cpb-aacip-190-77fqzdqq
cpb-aacip-190-82x3fpjk
cpb-aacip-190-83xsjc8h
cpb-aacip-190-86b2rm41
cpb-aacip-190-88qbzvd3
cpb-aacip-190-89d51n3x
cpb-aacip-190-92g79nww
cpb-aacip-190-93ttf79w
cpb-aacip-190-95j9kpc1
cpb-aacip-190-95w6mkbw
cpb-aacip-190-98z8wm8f
cpb-aacip-190-01pg4f9t
cpb-aacip-190-47rn8tst

foo4thought commented 5 months ago

Hi Kevin,

Awesome, thanks! Is there any easy way to get a list of the titles attached to these GUIDs? The station manager wants to compare the list to the list of items he wants to send for Mellon.

Thanks, Michelle

From: Kevin Carter
Date: Tuesday, June 4, 2024 at 7:06 AM
To: WGBH-MLA/ams
Cc: Assign, Michelle Kelley
Subject: Re: [WGBH-MLA/ams] List of Digitized, Not Online Items from KCAW (Issue #887)

Done; the GUIDs you seek are in the attached file.


Michelle Kelley Media Historian and Curator

foo4thought commented 5 months ago

Done; see attached.

It was this easy:

mylocal:Desktop kevin_carter$ start=0; numFound=$(curl -sS 'https://americanarchive.org/api.json?fl=id&rows=1&q=%28contributing_organizations:%22KCAW+%28AK%29%22%20AND%20access_types:digitized%29%20NOT%20%28access_types:online%29&start='$start | jq -r '.response.numFound'); while [ "$start" -lt "$numFound" ]; do curl -sS 'https://americanarchive.org/api.json?fl=xml&rows=1&q=%28contributing_organizations:%22KCAW+%28AK%29%22%20AND%20access_types:digitized%29%20NOT%20%28access_types:online%29&start='$start | jq -r '.response.docs[].xml' | xpath -e '//pbcoreIdentifier[@source="http://americanarchiveinventory.org"]/text()|//pbcoreTitle/@titleType|//pbcoreTitle/text()' 2>/dev/null | tr -s '\n' '\t' | sed 's#\(cpb-aacip\).#\1-#g' | tee -a guid_titles_KCAW.txt; echo >> guid_titles_KCAW.txt; start=$((start+1)); done
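The tail of that pipeline (everything after `xpath`) flattens the nodes selected for one record into a single line and rewrites the GUID prefix. A small sketch of just that stage, assuming the XPath step emits one node per line: `one_record_line` is a hypothetical name, and `paste -s` is swapped in for `tr -s '\n' '\t'` so that each record ends with a newline without a separate `echo`. The `sed` expression uses escaped parentheses, which basic regular expressions need for a capture group.

```shell
# Sketch only: one_record_line is an illustrative name.
# Reads the selected nodes for ONE record on stdin (one per line) and prints
# them as a single tab-separated line, turning "cpb-aacip" plus any one
# separator character into "cpb-aacip-".
one_record_line() {
  grep -v '^[[:space:]]*$' |            # drop blank lines from the XPath output
    paste -s - |                        # join lines; paste separates with tabs by default
    sed 's#\(cpb-aacip\).#\1-#'         # normalize the GUID prefix
}
```

Inside the loop, this would take the place of `tr -s '\n' '\t' | sed ...`, with the result appended straight to `guid_titles_KCAW.txt`.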



From: Michelle Kelley
Sent: Tuesday, June 4, 2024 9:18 AM
To: Kevin Carter; WGBH-MLA/ams
Cc: Assign
Subject: Re: [WGBH-MLA/ams] List of Digitized, Not Online Items from KCAW (Issue #887)

Hi Kevin,

Awesome, thanks! Is there any easy way to get a list of the titles attached to these GUIDs? The station manager wants to compare the list to the list of items he wants to send for Mellon.

Thanks,

Michelle


cpb-aacip-190-03qv9snr titleType="Series" Dr White
cpb-aacip-190-08hdr8mc titleType="Program" The Carnatic Music Tradition
cpb-aacip-190-106wwqzq titleType="Raw Footage" Debut Concert
cpb-aacip-190-12m63zwr titleType="Raw Footage" Raven Radio 1st Birthday
cpb-aacip-190-149p8f85 titleType="Program" Cloak & Dagger - Part I
cpb-aacip-190-15bcc424 titleType="Segment" Various Xeno Pieces
cpb-aacip-190-16c2fsj7 titleType="Program" ANB Basketball Special
cpb-aacip-190-19s1rq6k titleType="Program" The Last Merry-Go-Round
cpb-aacip-190-22h70v08 titleType="Program" Lunch with a Sitkan
cpb-aacip-190-27mpg6wr titleType="Segment" APC Story
cpb-aacip-190-31qftx7n titleType="Program" Last ATC
cpb-aacip-190-322bvt62 titleType="Series" Dr White
cpb-aacip-190-36tx997m titleType="Program" Bombing of Angoon
cpb-aacip-190-386hdvqv titleType="Program" The Carnatic Music Tradition
cpb-aacip-190-38jdfrm8 titleType="Program" Rich McClear's Last Show
cpb-aacip-190-40xpp0g9 titleType="Program" W. Christianson Interview
cpb-aacip-190-41zcrp65 titleType="Segment" Rofkar Interview
cpb-aacip-190-44pk0t66 titleType="Program" Lovers
cpb-aacip-190-48sbchfw titleType="Series" Dr White
cpb-aacip-190-49t1g5xn titleType="Segment" Various Xeno Pieces
cpb-aacip-190-5269pdxg titleType="Program" A Conversation with John Cage in India
cpb-aacip-190-53jwt0h3 titleType="Program" Alaskan Arts Southeast
cpb-aacip-190-569324jf titleType="Program" Air Traffic Control
cpb-aacip-190-5918986m titleType="Raw Footage" Alaska Native Women's Conference
cpb-aacip-190-612ngmcx titleType="Segment" Will Swagel Stories
cpb-aacip-190-676t1nwq titleType="Series" Dr White
cpb-aacip-190-6986716v titleType="Series" Dr White
cpb-aacip-190-77fqzdqq titleType="Raw Footage" Mt Edgecombe HS Opens
cpb-aacip-190-82x3fpjk titleType="Segment" Staff Tribute
cpb-aacip-190-83xsjc8h titleType="Promo" The Carnatic Music Tradition / Promos
cpb-aacip-190-86b2rm41 titleType="Program" The Carnatic Music Tradition
cpb-aacip-190-88qbzvd3 titleType="Program" Alaska Day Special / Oral History
cpb-aacip-190-89d51n3x titleType="Segment" Why the Tides Ebb and Flow
cpb-aacip-190-92g79nww titleType="Segment" Laurie Hill Pieces / 1
cpb-aacip-190-93ttf79w titleType="Series" Dr White
cpb-aacip-190-95j9kpc1 titleType="Raw Footage" Alaska Native Women's Conference
cpb-aacip-190-95w6mkbw titleType="Program" The Promise
cpb-aacip-190-98z8wm8f titleType="Series" Dr White
cpb-aacip-190-01pg4f9t titleType="Segment" Sam/Brady Interview
cpb-aacip-190-47rn8tst titleType="Program" Jake & Mary Retrospective
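Since the original request was for a spreadsheet, a listing in this shape could be turned into CSV that Excel opens directly. The following is only a sketch under assumptions: `to_csv` and the column names are made up, each input line is assumed to hold exactly one record in the form `GUID titleType="..." Title`, and nothing like this appears in the thread itself.

```shell
# Sketch only: to_csv and the header names are illustrative.
# Reads lines like:  GUID titleType="Type" Title   (one record per line)
# Writes CSV with a header row; double quotes in titles are doubled.
to_csv() {
  echo 'guid,title_type,title'
  awk '
    NF == 0 { next }                                   # skip blank lines
    {
      guid  = $1
      rest  = substr($0, index($0, $1) + length($1))   # text after the GUID
      ttype = ""; title = rest
      if (match(rest, /titleType="[^"]*"/)) {
        ttype = substr(rest, RSTART + 11, RLENGTH - 12) # text between the quotes
        title = substr(rest, RSTART + RLENGTH)          # text after the attribute
      }
      gsub(/^[ \t]+|[ \t]+$/, "", title)               # trim surrounding whitespace
      gsub(/"/, "\"\"", title)                         # CSV-escape embedded quotes
      printf "%s,\"%s\",\"%s\"\n", guid, ttype, title
    }'
}
```

For example, `to_csv < guid_titles_KCAW.txt > guid_titles_KCAW.csv` would produce a three-column file; the `.csv` file name is just a placeholder.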

foo4thought commented 5 months ago

Amazing, thank you!

From: Kevin Carter
Date: Tuesday, June 4, 2024 at 10:28 AM
To: Michelle Kelley, WGBH-MLA/ams
Cc: Assign
Subject: Re: [WGBH-MLA/ams] List of Digitized, Not Online Items from KCAW (Issue #887)

Done; see attached.
