sul-dlss / happy-heron

Self-Deposit for the Stanford Digital Repository (SDR): H2 is a Rails web application enabling users to deposit scholarly content into SDR
Apache License 2.0

Generate report of Hydrus items to be migrated to H2 with Hydrus item types "conference paper/presentation", "class project", or "audio - music" #1290

Closed amyehodge closed 3 years ago

amyehodge commented 3 years ago

We need the druids for these items and which item type they are (one option is three different lists, the other is a second field that indicates the item type -- either is fine). It would also be helpful to have the collection druid and the collection name.

We need to make sure we have new classifications for these items selected for the "real" migration to H2, and we have someone available to work on this project right now.

jmartin-sul commented 3 years ago

@justinlittman would something that uses the DSA Report class be the way to go? or is that not the right tool? e.g.: https://github.com/sul-dlss/dor-services-app/pull/2563 (i just stumbled on that PR, and remembered that this ticket was on the prod priorities board, apologies if i'm off-base there)

jcoyne commented 3 years ago

@jmartin-sul in the past we've done these reports by running jq on the migration export dump from Hydrus.

jmartin-sul commented 3 years ago

> @jmartin-sul in the past we've done these reports by running jq on the migration export dump from Hydrus.

ah, good to know, thanks @jcoyne

ndushay commented 3 years ago

Type "conference paper / presentation"

39 druids:

bg405cn7261
cx844yp3164
cz459xn4262
dv537dc2889
dw464nx2179
gt921ck4066
gx295qj8615
hp802gp8515
jt476kt9091
jx225zq1407
kq234ry7019
kv589hn5123
mn521kw5258
mv421rg3034
nd901zk5193
nk436sr5232
nv468td2285
pj369hf5457
pr103tf2767
pv779rt4759
py865tw6670
pz232fg1727
qb052kj0451
qb779jh9716
qq075gd5297
qr172kx7896
qt647gg7872
qv848kx4545
rr563mg4637
rz180xn4072
sz249ym9704
vr813gk6824
wc333hh5393
wg827ks1643
wp525cy3737
wr653xj5115
xk848ts1579
yv187wv5370
zb349td2077

ndushay commented 3 years ago

Type "audio - music"

7 of these:

bp210jw8496
fr255ss8264
gq839ym6983
ny703sg0564
rv324yc5503
wn714rn8040
wx775sq0683

ndushay commented 3 years ago

Type "class project"

217 of these

bd257ry0198
bg674gb7182
bk468mz8701
bn645ms7799
bq136mn9632
bq514xy6347
bq949jx3878
bz956qd2228
cd039yv5615
cf231ff4889
cf376gf4387
cj994wv9209
ck076py9387
ck693dy1144
cm550jb1674
cq110bm5728
cq717gy4240
cz591rp9623
dc887wj7942
dg327ng8260
dj108dd7814
dj561gb3422
dj976ms2312
dk711xt5587
dm242zn8219
dm781nw8672
dp886ky3020
dr224dj7512
dt026vw5129
dv518bd5967
dw947kz5195
dy270yj7833
dy344zb9809
dz214cf9700
fg851fy8121
fk278kj3254
fn608sh0928
fp015cm7917
fp759qt8876
fp834ty0524
fs139jt1720
fs604dj8228
fs705pc2615
ft295fb4704
fv398xd2044
fv675yx5851
fv941xg6266
fw589fr1729
fx000fk3114
fy320xn2931
fy799pw0096
gb572dc5913
gd455wp4677
gh880qx4380
gk257xf6317
gm402ft1737
gp769cq3718
gq933wy1656
gr767nd3884
gv732wh5892
gw177bx5881
gw697cc5138
gx479bb7600
gz384sy3895
hc201tk2429
hd588yd8158
hk397dr9783
hm222bn8688
hm991rt5124
hn951xz7163
hs429mr6109
hx382vg0150
hy929hg5110
hz624jy4688
jc152df6888
jd045wt1319
jm692nq7002
jp938zf3925
jr426sf0743
js780pq1046
jt538gk0858
jx805mb7872
jx857dp6014
jy090tm9223
jy645rc0297
jz088xy0649
jz546ky4132
kb563dm7664
kd111tb1513
kh419rj2062
kn739yp2655
kq227vw2007
kw319xt6304
kz399yh9465
mc387ng9724
mk991cw1850
mn973mz7665
mt383sb2445
mz129gm4554
nb840zt5577
nc661tt5414
nd391jb6114
ng858zt7694
nh560hz2167
nk347ns1208
nm285bw4093
nn760bj8992
nq056kr6430
ns197jk3080
ns731nj4483
nv657tt7387
pb771vh1182
pc656kw5299
ph548xz9766
ph593rb2535
ph594dj5574
pt732wb3548
pv355sr7217
pw220df6697
pz096fn6930
pz927kv2918
qc361sc8497
qd780gf9821
qd966yz4379
qh633mt1707
qj199sc8055
qk712xd8602
qq843gd3617
qt429km6702
qy007ht7512
qz450qt9772
rd819fy3516
rf642dj5262
rh549nv9529
rj923rs7122
rq984fn7118
rz313bp1096
sb333zd2595
sc020qp6018
sd594yd8504
sg281dj9441
sm461gq2713
sm862mg6771
sn711vd1677
sq073bv1790
sr777yx9772
ss205nw6650
ss220mq0904
sv819td9283
sw321tg6624
sx492rh2858
sx590rg4630
sy416rh3970
sz273tt1006
tb139gn2220
tb756jg0064
tc625tb6216
td341fd9085
td603xj1523
td675km9733
tm098cy6884
tm461wy4388
tp025mq4329
tp962mq3896
tt077rt2826
tt285pn0136
tw797wb4650
tx509sr6198
tz766vs5402
vd066ct9425
vf728sh2937
vq436sq9834
vr646xy0142
vr835kx1498
vs132mc7125
vv534gd3897
wc053rq8118
wc921mf2460
wd685zw8578
wf839km6990
wg655ry3881
wh741sb6416
wh783qn6746
wj733zq9620
wk816qp3769
wn953rh3588
wp445sb1858
wq244qx7599
wr266qx9067
wt128vz0037
wy024by6643
wz402qb2020
xc152vm6227
xc503rz2123
xh402kw4203
xm004kv6381
xm721vf4813
xn454xj0011
xn623tj0433
xn661rs3028
xq002wt7817
xr386rg2530
xs713rg8006
xt137vp7092
xx122kh9331
xy547ss8747
xy899kw0142
yd935rz8116
yj940py7511
yp641rm9054
yr390pd3005
yy594jf5798
yy915qh3725
zh110wz9727
zh653qs2040
zp684gc4352
zy305gk4285

ndushay commented 3 years ago

The above all came from a migration dump on the h2 stage box dated Feb 26 (which i scp'ed to my laptop):

-rw-rw-r--   1 ndushay  staff  12038948 Feb 26 09:55 items.jsonl

In case it is needed in the future:

Step 1: Find the number of objects with the desired type:

$ ack '"item_type":"audio - music' *.jsonl | wc -l
       7
$ ack '"item_type":"class project' *.jsonl | wc -l
     217
$ ack '"item_type":"conference' *.jsonl | wc -l
      39
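The same counts can be cross-checked without ack by parsing each line as JSON, which compares the whole item_type value instead of a text prefix. The following is only a sketch: the method name `item_type_counts` and the sample records (including their druids) are made up for illustration, and `tally` assumes Ruby 2.7 or later.

```ruby
require 'json'

# Count records per item_type in JSONL text (one JSON object per line).
# Blank lines are skipped so a stray empty line does not raise a parse error.
def item_type_counts(jsonl_text)
  jsonl_text.each_line
            .reject { |line| line.strip.empty? }
            .map { |line| JSON.parse(line)['item_type'] }
            .tally
end

# Made-up sample records (hypothetical druids) standing in for items.jsonl.
SAMPLE_ITEMS = <<~JSONL
  {"druid":"druid:aa111aa1111","item_type":"audio - music","collection":"druid:cc333cc3333"}
  {"druid":"druid:bb222bb2222","item_type":"class project","collection":"druid:cc333cc3333"}

  {"druid":"druid:dd444dd4444","item_type":"class project","collection":"druid:cc333cc3333"}
JSONL

item_type_counts(SAMPLE_ITEMS).each { |type, count| puts "#{count}\t#{type}" }
```

Against the real dump, the call would presumably be `item_type_counts(File.read('items.jsonl'))`.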

Step 2: Use gron to capture the array index of each matching record (the number in brackets is the record's zero-based position in the file, not a 1-based line number):

$ gron -s items.jsonl | grep 'item_type' | grep -E 'audio - music'
json[680].item_type = "audio - music";
json[681].item_type = "audio - music";
json[1458].item_type = "audio - music";
json[2348].item_type = "audio - music";
json[2422].item_type = "audio - music";
json[2469].item_type = "audio - music";
json[3763].item_type = "audio - music";

$ gron -s items.jsonl | grep 'item_type' | grep -E 'audio - music' >>audio_lines.txt

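Step 3: Use the captured indices (the numbers in brackets) in a bash script to retrieve the druids; shown here for class projects, and the other types follow the same pattern: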

for line in `cat class_project_lines.txt | cut -d '[' -f2 | cut -d ']' -f1`; do gron -s items.jsonl | grep "json\[$line\]" | grep druid | grep -v collection | cut -d ':' -f2 | cut -d '"' -f1; done >class_project.druids.txt

Step 4: Sort the file of druids:

sort class_project.druids.txt
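
Steps 2 through 4 rerun gron once per matching record, which is slow on a 12 MB dump. A single pass in Ruby can select, strip, and sort the druids directly. This is a sketch under assumptions: `druids_for_type` is a hypothetical helper name, the sample records are made up, and the field names `druid` and `item_type` are taken from the export as used elsewhere in this thread.

```ruby
require 'json'

# Return the sorted druids (without the "druid:" prefix) of all records
# whose item_type equals the given type, reading the JSONL text only once.
def druids_for_type(jsonl_text, type)
  jsonl_text.each_line
            .reject { |line| line.strip.empty? }
            .map { |line| JSON.parse(line) }
            .select { |record| record['item_type'] == type }
            .map { |record| record['druid'].delete_prefix('druid:') }
            .sort
end

# Made-up sample records (hypothetical druids) standing in for items.jsonl.
SAMPLE_EXPORT = <<~JSONL
  {"druid":"druid:zz999zz9999","item_type":"class project","collection":"druid:cc333cc3333"}
  {"druid":"druid:aa111aa1111","item_type":"audio - music","collection":"druid:cc333cc3333"}
  {"druid":"druid:bb222bb2222","item_type":"class project","collection":"druid:cc333cc3333"}
JSONL

puts druids_for_type(SAMPLE_EXPORT, 'class project')
```

Unlike the grep pipeline, this compares the parsed item_type value exactly, so a record that merely mentions the type in another field would not match.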

ndushay commented 3 years ago

@amyehodge I'm going to let you close this, or let me know if I didn't fulfill the ticket.

amyehodge commented 3 years ago

@ndushay Can we get at least the collection druid for each of these items as well? I had included this in the original request above. Particularly for the longer list of class projects. Thanks.

ndushay commented 3 years ago

Sure Amy - will do. Sorry I didn't read carefully enough.

ndushay commented 3 years ago

CSV for all requested

druid,coll druid,coll name,item type
druid:bp210jw8496,druid:kx581zg6755,MUSIC 200A: Proseminar in Musicology and Music Bibliography,audio - music
druid:fr255ss8264,druid:pn826mx7934,Chris Chafe Collection,audio - music
druid:gq839ym6983,druid:pn826mx7934,Chris Chafe Collection,audio - music
druid:ny703sg0564,druid:vw848pw8430,Southward,audio - music
druid:rv324yc5503,druid:tz959sb6952,Graduate School of Education Open Archive,audio - music
druid:wn714rn8040,druid:hn730ks3626,Stanford University Libraries staff publications and research,audio - music
druid:wx775sq0683,druid:rr255gf6892,Notation in Science Communication ePortfolios,audio - music

druid:bd257ry0198,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:bg674gb7182,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:bk468mz8701,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:bn645ms7799,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:bq136mn9632,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:bq514xy6347,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:bq949jx3878,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:bz956qd2228,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:cd039yv5615,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:cf231ff4889,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:cf376gf4387,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:cj994wv9209,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:ck076py9387,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:ck693dy1144,druid:qk912vj7408,Engineering Student Papers,class project
druid:cm550jb1674,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:cq110bm5728,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:cq717gy4240,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:cz591rp9623,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dc887wj7942,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dg327ng8260,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dj108dd7814,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dj561gb3422,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dj976ms2312,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dk711xt5587,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dm242zn8219,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dm781nw8672,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dp886ky3020,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:dr224dj7512,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dt026vw5129,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dv518bd5967,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:dw947kz5195,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dy270yj7833,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:dy344zb9809,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:dz214cf9700,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:fg851fy8121,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:fk278kj3254,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:fn608sh0928,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fp015cm7917,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:fp759qt8876,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:fp834ty0524,druid:nr325kt8685,Sensory Ecology of Marine Animals,class project
druid:fs139jt1720,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fs604dj8228,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fs705pc2615,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:ft295fb4704,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fv398xd2044,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fv675yx5851,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fv941xg6266,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:fw589fr1729,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:fx000fk3114,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fy320xn2931,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:fy799pw0096,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:gb572dc5913,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:gd455wp4677,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:gh880qx4380,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:gk257xf6317,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:gm402ft1737,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:gp769cq3718,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",class project
druid:gq933wy1656,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:gr767nd3884,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",class project
druid:gv732wh5892,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:gw177bx5881,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:gw697cc5138,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:gx479bb7600,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:gz384sy3895,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:hc201tk2429,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:hd588yd8158,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:hk397dr9783,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:hm222bn8688,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:hm991rt5124,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:hn951xz7163,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:hs429mr6109,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:hx382vg0150,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:hy929hg5110,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:hz624jy4688,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:jc152df6888,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:jd045wt1319,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:jm692nq7002,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:jp938zf3925,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:jr426sf0743,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:js780pq1046,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:jt538gk0858,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:jx805mb7872,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:jx857dp6014,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:jy090tm9223,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:jy645rc0297,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:jz088xy0649,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:jz546ky4132,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:kb563dm7664,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:kd111tb1513,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:kh419rj2062,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:kn739yp2655,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:kq227vw2007,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:kw319xt6304,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:kz399yh9465,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:mc387ng9724,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:mk991cw1850,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:mn973mz7665,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:mt383sb2445,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:mz129gm4554,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:nb840zt5577,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:nc661tt5414,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:nd391jb6114,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:ng858zt7694,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:nh560hz2167,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:nk347ns1208,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:nm285bw4093,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:nn760bj8992,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:nq056kr6430,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:ns197jk3080,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:ns731nj4483,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:nv657tt7387,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",class project
druid:pb771vh1182,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:pc656kw5299,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:ph548xz9766,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:ph593rb2535,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:ph594dj5574,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:pt732wb3548,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:pv355sr7217,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:pw220df6697,druid:fk441ps0303,Sounds of Stanford Collection - ARS.0154,class project
druid:pz096fn6930,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:pz927kv2918,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:qc361sc8497,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:qd780gf9821,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:qd966yz4379,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:qh633mt1707,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:qj199sc8055,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:qk712xd8602,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:qq843gd3617,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:qt429km6702,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:qy007ht7512,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:qz450qt9772,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:rd819fy3516,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:rf642dj5262,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:rh549nv9529,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:rj923rs7122,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",class project
druid:rq984fn7118,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",class project
druid:rz313bp1096,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sb333zd2595,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sc020qp6018,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sd594yd8504,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sg281dj9441,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sm461gq2713,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sm862mg6771,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sn711vd1677,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:sq073bv1790,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sr777yx9772,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:ss205nw6650,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:ss220mq0904,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sv819td9283,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sw321tg6624,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sx492rh2858,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:sx590rg4630,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sy416rh3970,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:sz273tt1006,druid:bs335mc1127,"Boothe Prize Winners, Stanford University",class project
druid:tb139gn2220,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:tb756jg0064,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:tc625tb6216,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:td341fd9085,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:td603xj1523,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:td675km9733,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:tm098cy6884,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:tm461wy4388,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:tp025mq4329,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:tp962mq3896,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:tt077rt2826,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:tt285pn0136,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:tw797wb4650,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:tx509sr6198,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:tz766vs5402,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:vd066ct9425,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:vf728sh2937,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:vq436sq9834,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:vr646xy0142,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:vr835kx1498,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:vs132mc7125,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:vv534gd3897,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wc053rq8118,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wc921mf2460,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wd685zw8578,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wf839km6990,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:wg655ry3881,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wh741sb6416,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:wh783qn6746,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:wj733zq9620,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wk816qp3769,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:wn953rh3588,druid:nr325kt8685,Sensory Ecology of Marine Animals,class project
druid:wp445sb1858,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:wq244qx7599,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:wr266qx9067,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:wt128vz0037,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:wy024by6643,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:wz402qb2020,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xc152vm6227,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xc503rz2123,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xh402kw4203,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xm004kv6381,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xm721vf4813,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xn454xj0011,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xn623tj0433,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:xn661rs3028,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xq002wt7817,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xr386rg2530,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xs713rg8006,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:xt137vp7092,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xx122kh9331,druid:fk441ps0303,Sounds of Stanford Collection - ARS.0154,class project
druid:xy547ss8747,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:xy899kw0142,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:yd935rz8116,druid:qk912vj7408,Engineering Student Papers,class project
druid:yj940py7511,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:yp641rm9054,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:yr390pd3005,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:yy594jf5798,druid:gm265cj0081,Stanford@SEA -- Student papers,class project
druid:yy915qh3725,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:zh110wz9727,druid:rr255gf6892,Notation in Science Communication ePortfolios,class project
druid:zh653qs2040,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:zp684gc4352,druid:kq629sd5182,ME310 Project Based Engineering Design,class project
druid:zy305gk4285,druid:gm265cj0081,Stanford@SEA -- Student papers,class project

druid:bg405cn7261,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:cx844yp3164,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:cz459xn4262,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:dv537dc2889,druid:qk912vj7408,Engineering Student Papers,conference paper / presentation
druid:dw464nx2179,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:gt921ck4066,druid:nj669yc9949,Lighting the Way project deliverables,conference paper / presentation
druid:gx295qj8615,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:hp802gp8515,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:jt476kt9091,druid:kv708vv4482,Library Staff Presentations,conference paper / presentation
druid:jx225zq1407,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:kq234ry7019,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:kv589hn5123,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:mn521kw5258,druid:hm410dp4419,East Asia Library materials,conference paper / presentation
druid:mv421rg3034,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:nd901zk5193,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:nk436sr5232,druid:gs860rq5377,"Policy, Organization Leadership Studies (POLS) Program Field Projects, Graduate School of Education",conference paper / presentation
druid:nv468td2285,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:pj369hf5457,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:pr103tf2767,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:pv779rt4759,druid:xs573nh8717,CIFE Publications,conference paper / presentation
druid:py865tw6670,druid:qk912vj7408,Engineering Student Papers,conference paper / presentation
druid:pz232fg1727,druid:nj669yc9949,Lighting the Way project deliverables,conference paper / presentation
druid:qb052kj0451,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:qb779jh9716,druid:xs573nh8717,CIFE Publications,conference paper / presentation
druid:qq075gd5297,druid:nj669yc9949,Lighting the Way project deliverables,conference paper / presentation
druid:qr172kx7896,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:qt647gg7872,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:qv848kx4545,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:rr563mg4637,druid:nj669yc9949,Lighting the Way project deliverables,conference paper / presentation
druid:rz180xn4072,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:sz249ym9704,druid:kv708vv4482,Library Staff Presentations,conference paper / presentation
druid:vr813gk6824,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:wc333hh5393,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:wg827ks1643,druid:tz959sb6952,Graduate School of Education Open Archive,conference paper / presentation
druid:wp525cy3737,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:wr653xj5115,druid:kv708vv4482,Library Staff Presentations,conference paper / presentation
druid:xk848ts1579,druid:hn730ks3626,Stanford University Libraries staff publications and research,conference paper / presentation
druid:yv187wv5370,druid:jp429tw5870,Lighting the way,conference paper / presentation
druid:zb349td2077,druid:jp429tw5870,Lighting the way,conference paper / presentation
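
A note for anyone loading this into a spreadsheet: a few collection names in the list contain commas, so each row needs that field quoted to keep four columns. One way to get this automatically is Ruby's standard csv library. The sketch below uses a hypothetical helper name, `report_row`, with one real row from the list above as input.

```ruby
require 'csv'

# Build one CSV row; CSV.generate_line quotes any field that contains a comma.
def report_row(druid, coll_druid, coll_name, item_type)
  CSV.generate_line([druid, coll_druid, coll_name, item_type])
end

puts report_row('druid:sz273tt1006', 'druid:bs335mc1127',
                'Boothe Prize Winners, Stanford University', 'class project')
```

Parsing the generated line back with `CSV.parse_line` yields four fields, with the collection name intact.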

amyehodge commented 3 years ago

No worries, @ndushay . Thanks!

amyehodge commented 3 years ago

Thanks so much, @ndushay !

ndushay commented 3 years ago

For posterity, from rails console:

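require 'json' # already loaded in a Rails console; needed in plain Ruby

# Read each JSONL export, dropping blank lines before parsing (collections.jsonl had one)
item_lines = File.readlines('../hydrus-migration/items.jsonl').reject { |l| l.strip.empty? }
item_hashes = item_lines.map { |l| JSON.parse(l) }
coll_lines = File.readlines('../hydrus-migration/collections.jsonl').reject { |l| l.strip.empty? }
coll_hashes = coll_lines.map { |l| JSON.parse(l) }

audio_hashes = item_hashes.select { |h| h['item_type'] == 'audio - music' }
result = audio_hashes.map do |h|
  # look up the collection record; the name is left empty if no collection matches
  coll = coll_hashes.find { |c| c['druid'] == h['collection'] }
  "#{h['druid']},#{h['collection']},#{coll && coll['name']},#{h['item_type']}"
end
puts result.sort
# repeat for other item types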
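
Rather than repeating the snippet per item type, the three types can be handled in one pass, with the collection names looked up from a hash instead of a `find` per item. This is a sketch, not what was actually run: `parse_jsonl`, `migration_report`, the constant names, and the sample records are all hypothetical, and `to_h` with a block assumes Ruby 2.6 or later.

```ruby
require 'csv'
require 'json'

REPORT_ITEM_TYPES = ['audio - music', 'class project', 'conference paper / presentation'].freeze

# Parse JSONL text into an array of hashes, skipping blank lines.
def parse_jsonl(text)
  text.each_line.reject { |line| line.strip.empty? }.map { |line| JSON.parse(line) }
end

# Build a CSV report (druid, coll druid, coll name, item type) for the wanted
# item types, sorted by item type and then druid.
def migration_report(items, collections, types = REPORT_ITEM_TYPES)
  names = collections.to_h { |c| [c['druid'], c['name']] }
  rows = items.select { |i| types.include?(i['item_type']) }
              .sort_by { |i| [i['item_type'], i['druid']] }
              .map { |i| [i['druid'], i['collection'], names[i['collection']], i['item_type']] }
  CSV.generate do |csv|
    csv << ['druid', 'coll druid', 'coll name', 'item type']
    rows.each { |row| csv << row }
  end
end

# Made-up sample data (hypothetical druids and names) standing in for the exports.
REPORT_ITEMS_JSONL = <<~JSONL
  {"druid":"druid:bb222bb2222","item_type":"class project","collection":"druid:cc333cc3333"}
  {"druid":"druid:dd444dd4444","item_type":"thesis","collection":"druid:cc333cc3333"}
  {"druid":"druid:aa111aa1111","item_type":"audio - music","collection":"druid:cc333cc3333"}
JSONL
REPORT_COLLS_JSONL = <<~JSONL
  {"druid":"druid:cc333cc3333","name":"Sample Collection, Vol. 1"}
JSONL

puts migration_report(parse_jsonl(REPORT_ITEMS_JSONL), parse_jsonl(REPORT_COLLS_JSONL))
```

Because the rows go through the csv library, collection names containing commas come out quoted, and an item whose collection is missing from the collections export gets an empty name instead of raising.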