sul-dlss / dor-services-app

A Rails application exposing Digital Object Registry functions as a RESTful HTTP API
https://sul-dlss.github.io/dor-services-app/
Other
3 stars 2 forks source link

report on usage of groupedValue #4244

Open ndushay opened 1 year ago

ndushay commented 1 year ago

Arcadia:

for dros and collections

run a report on groupedValue? That was a MODSism.

ndushay commented 1 year ago

I ran using property_existence*.rb reports.

DROs - no occurrences. Collections - 10 occurrences.

Also here:

collection_druid,catkey,collection_name
druid:cx804cx9497,13161135,"al-  Shāmil fī-al-ṣināʻah al-ṭibbiyah, 13th cent"
druid:fd953pg7906,7913308,"Zhongguo wu ci xian ji ren kou pu cha shu ju"
druid:gy247sz1687,2922639,"[Collection of anti-Napoleon broadsides and broadsides on the proposed French invasion of England in 1803]"
druid:nh800mm2361,7649813,"China 2000 township population census data with maps"
druid:mq015fg9108,6640186,"Jing ji ban yue kan"
druid:sf111ch8577,11415872,"Guo ji mao yi dao bao"
druid:sh973sy2698,4235167,"Sea-floor images and data from multibeam surveys in San Francisco Bay, Southern California, Hawaii, the Gulf of Mexico, and Lake Tahoe, California--Nevada"
druid:sk296jm6766,411101,"Who's who in East Africa"
druid:wy489qg9472,11668039,"Jean and Juliette Logerais papers, 1885-1910s"
ndushay commented 1 year ago

This DRO druid says I got it wrong:

https://argo.stanford.edu/items/druid:jc764sf5117.json

image

ndushay commented 1 year ago

Okay, now showing 44K occurrences in DROs:

drosGroupedValue.csv

$ wc -l drosGroupedValue.csv 
44389 drosGroupedValue.csv

$ head drosGroupedValue.csv 
item_druid,catkey,collection_druid,collection_name
druid:wf711dr8563,3139502,,""
druid:wf697mc3225,3421337,,""
druid:wf707db3686,409884,,""
druid:wf835rv6544,372841,,""
druid:wf742hv8540,1040631,,""
druid:wf756wk1315,3974080,,""
druid:wf825mr3467,3352168,,""
druid:wf828pf4428,190789,,""
druid:wf714gm0334,372497,,""

$ tail drosGroupedValue.csv 
druid:wf586rk5853,1663237,,""
druid:wf593xm4029,353788,,""
druid:wf602pw7864,3747764,,""
druid:wf605ty9067,4698244,,""
druid:wf607dv8805,2854335,,""
druid:wf609hp1935,9751459,,""
druid:wf624pf3032,9381970,,""
druid:wf656gb0171,6712616,,""
druid:wf670gc1060,5056647,,""
druid:wf684fn0436,1610706,,""

none of them have a collection, some don't have catkeys

arcadiafalcone commented 1 year ago

I looked at the first three and the last three and all but one would lose the groupedValue if refreshed from source.

ndushay commented 1 year ago

Maybe refresh "all" from source and rerun reports?

arcadiafalcone commented 1 year ago

Something is wrong with getting the collection name - wf742hv8540 e.g. is in Google Books (as I suspect many of these are).

Working on the refresh.

ndushay commented 1 year ago

Something is wrong with getting the collection name - wf742hv8540 e.g. is in Google Books (as I suspect many of these are).

yes - Mike found that in his PR review; I'm running the report again. Sorry about that.

ndushay commented 1 year ago

updated dros report - now with collection info!

40,975 lines with header.

drosGroupedValue.csv