NCEAS / fairdataone

DataONE FAIR manuscript
Other
1 stars 0 forks source link

create table for dialects used by each repository #13

Open jeanetteclark opened 3 years ago

jeanetteclark commented 3 years ago

should be easy to obtain via Solr query

jeanetteclark commented 3 years ago

the function get_mn_dialects() returns the table below with columns for the number of documents in each dialect, plus a column for the min and max upload date to the CN from each member node. I also added a column guessing whether the node is active based on whether there have been any new uploads in 2021. We should check this list against what is expected (related to issue 9, to make sure repos that seem inactive are being harvested correctly)

mn EML ISO DublinCore FGDC Dryad min_date max_date active
urn:node:LTER 80191 0 0 0 0 2004-03-06 2021-04-14 TRUE
urn:node:PISCO 72302 0 0 0 0 2003-03-26 2021-04-14 TRUE
urn:node:NEON 21692 0 0 0 0 2019-07-03 2019-12-01 FALSE
urn:node:TERN 14631 0 0 0 0 2015-05-21 2019-06-21 FALSE
urn:node:ARCTIC 13514 4054 52 0 0 2009-11-18 2021-04-15 TRUE
urn:node:KNB 9087 0 0 0 0 2001-10-24 2021-04-10 TRUE
urn:node:FEMC 6585 0 0 0 0 2017-09-29 2020-03-27 FALSE
urn:node:TFRI 3149 0 0 0 0 2006-10-21 2020-05-24 FALSE
urn:node:SANPARKS 1625 0 0 0 0 2004-06-09 2014-03-24 FALSE
urn:node:PPBIO 1408 0 0 0 0 2010-06-11 2020-12-16 FALSE
urn:node:ESS_DIVE 961 0 0 1 0 2018-03-28 2021-04-12 TRUE
urn:node:EDI 927 0 0 0 0 2013-10-17 2021-04-15 TRUE
urn:node:GOA 584 0 0 0 0 2012-09-11 2019-02-28 FALSE
urn:node:LTER_EUROPE 343 0 0 0 0 2013-07-16 2016-01-14 FALSE
urn:node:CA_OPC 278 0 0 0 0 2017-08-16 2021-04-12 TRUE
urn:node:ONEShare 199 0 0 0 0 2012-12-19 2016-11-01 FALSE
urn:node:KUBI 172 0 0 0 0 2013-12-10 2013-12-10 FALSE
urn:node:ESA 157 0 0 0 0 2006-03-29 2013-08-10 FALSE
urn:node:METAGRIL 88 0 0 0 0 2019-01-18 2021-04-14 TRUE
urn:node:IOE 85 0 0 0 0 2014-06-28 2016-06-09 FALSE
urn:node:CAS_CERN 57 0 0 0 0 2018-09-26 2018-10-25 FALSE
urn:node:OTS_NDC 41 0 0 0 0 2018-02-09 2018-02-22 FALSE
urn:node:USANPN 4 0 0 0 0 2013-10-25 2014-02-11 FALSE
urn:node:NKN 1 9 0 1 0 2015-05-28 2016-06-17 FALSE
urn:node:CLOEBIRD 1 0 0 0 0 2017-12-18 2017-12-18 FALSE
urn:node:PANGAEA 0 507546 0 0 0 2017-08-04 2018-05-02 FALSE
urn:node:NCEI 0 50967 0 0 0 2016-03-24 2018-04-06 FALSE
urn:node:ARM 0 10931 0 0 0 2019-12-17 2021-04-11 TRUE
urn:node:IEDA_MGDL 0 9733 0 0 0 2019-03-04 2021-01-18 TRUE
urn:node:GRIIDC 0 8581 0 0 0 2016-11-21 2020-02-28 FALSE
urn:node:NRDC 0 2226 0 0 0 2015-11-23 2018-04-06 FALSE
urn:node:R2R 0 1787 0 0 0 2016-12-09 2017-01-26 FALSE
urn:node:IEDA_EARTHCHEM 0 888 0 0 0 2019-03-04 2021-01-18 TRUE
urn:node:IEDA_USAP 0 695 0 0 0 2019-03-04 2021-01-15 TRUE
urn:node:RW 0 335 0 0 0 2017-07-24 2021-04-09 TRUE
urn:node:TDAR 0 0 34484 0 0 2009-05-14 2019-05-29 FALSE
urn:node:IARC 0 0 611 0 0 2015-05-08 2016-12-23 FALSE
urn:node:US_MPC 0 0 258 0 0 2014-12-16 2014-12-16 FALSE
urn:node:BCODMO 0 0 7 0 0 2016-11-13 2016-11-13 FALSE
urn:node:FIGSHARE_CARY 0 0 5 0 0 2019-02-21 2019-02-21 FALSE
urn:node:CDL 0 0 0 31603 0 2012-07-05 2014-10-20 FALSE
urn:node:USGS_SDC 0 0 0 21984 0 2015-10-27 2018-01-27 FALSE
urn:node:ORNLDAAC 0 0 0 1237 0 2012-06-19 2015-02-17 FALSE
urn:node:NMEPSCOR 0 0 0 1217 0 2015-05-19 2017-08-22 FALSE
urn:node:EDACGSTORE 0 0 0 357 0 2013-10-14 2014-03-25 FALSE
urn:node:RGD 0 0 0 272 0 2014-09-25 2014-09-25 FALSE
urn:node:SEAD 0 0 0 110 0 2012-10-28 2020-11-10 FALSE
urn:node:EDORA 0 0 0 28 0 2014-09-24 2014-09-24 FALSE
urn:node:DRYAD 0 0 0 0 187131 2007-12-06 2018-09-18 FALSE
urn:node:mnUCSB1 NA NA NA NA NA 2020-08-06 2021-04-14 TRUE