subugoe / r-recipes

R recipes to tackle everyday data analytics problems
MIT License

Cross-comparison with my attempt #1

Open rossmounce opened 6 years ago

rossmounce commented 6 years ago

Hi @njahn82

really interesting code & results. It seems to differ quite a bit from the conclusions I drew here: https://github.com/rossmounce/opening-up-citations

Maybe I should re-run my code? But surely that much can't have changed since I did it (2018-03-26). Maybe I made an error somewhere?

Perhaps we should clarify exactly what our units of measure are here. I am looking at and counting CrossRef Member IDs (these are analogous to, but not quite 1:1 with, 'publishers').

When doing my analysis, in terms of CR Member IDs I found only 363 with "open" citations. In this Wikimedia blog post 490 "publishers" are quoted as participating: https://blog.wikimedia.org/2018/04/02/initiative-for-open-citations-birthday/

Your code seems to reach the conclusion that it is 1,062 publishers participating with open citations?

What is the explanation for these wildly different answers all produced/published within days of each other! 😃

rossmounce commented 6 years ago

Is it possible you are counting DOI prefixes? [oh, clearly NO. You are counting by CR Member IDs]

I note the situation is quite complicated for some CrossRef Member IDs: a single CR Member ID can cover many DOI prefixes, and in some instances some of those prefixes are 'open' while others are 'closed'.

For instance, CR Member ID 11818 (Duke University Libraries) represents the DOI prefixes 10.17602, 10.29266 and 10.7924.

10.29266 is set to “limited” (default, not open), whilst 10.7924 and 10.17602 (Morphosource) are set to “open”. One can see this visually here: http://browse.crossref.org/members/11818#prefixes
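To make the counting question concrete, here is a minimal sketch in R of how a mixed member like this one can be counted two ways. The three rows are the Duke prefixes listed above; the table layout and the column names (`member_id`, `prefix`, `reference_visibility`) are assumptions for illustration, not the output of either of our scripts.

```r
# Hypothetical prefix-level table. The three rows for member 11818 come from
# the comment above; the column names are assumptions.
prefixes <- data.frame(
  member_id = c(11818, 11818, 11818),
  prefix = c("10.17602", "10.29266", "10.7924"),
  reference_visibility = c("open", "limited", "open"),
  stringsAsFactors = FALSE
)

# Per member: is at least one prefix open? Are all prefixes open?
is_open <- prefixes$reference_visibility == "open"
any_open <- tapply(is_open, prefixes$member_id, any)
all_open <- tapply(is_open, prefixes$member_id, all)

members_any_open <- sum(any_open)  # members with at least one open prefix
members_all_open <- sum(all_open)  # members whose prefixes are all open
```

Under the "at least one open prefix" rule this member counts as open; under the "all prefixes open" rule it does not, so the two rules can give different totals.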

How are you counting "publishers" participating? Thanks for making me think harder about my/our work!

rossmounce commented 6 years ago

Huh... I'm glad I dated the run of my code.

I've just re-run my code and I'm getting your numbers. Since 2018-03-26, a duration of just ten days, a whopping 700 members have opened up their citations!!!! WHAT?!?!?!

njahn82 commented 6 years ago

Awesome, but also a bit too good to be true?!

njahn82 commented 6 years ago

Maybe Crossref can explain this large increase that happened within a couple of days.

rossmounce commented 6 years ago

I am trying to work out a 'diff' now between my results then and now, but it's not easy as I only saved the CLOSED list, not the open. So fascinating!

I previously had 9552 member IDs as "closed" but now see only 8923 as "closed" so that would seem to indicate that 629 publishers have flipped from closed to open.

The remaining 71 publishers could simply be new publisher members that have been added to the CR database and are open from the start(?)
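For reference, the arithmetic behind those figures, plus a sketch of how the flipped members could be named from two saved "closed" lists using base R's `setdiff()`. The counts are the ones quoted in this thread; the ID vectors are made-up placeholders, not real Crossref member IDs. Note that 629 is a net change: it assumes no new closed members joined in between.

```r
closed_before <- 9552  # closed member IDs on 2018-03-26 (from this thread)
closed_now <- 8923     # closed member IDs in the re-run (from this thread)
newly_open <- 700      # approximate rise in open members (from this thread)

flipped <- closed_before - closed_now  # net drop in closed members
remainder <- newly_open - flipped      # increase not explained by the drop

# With both closed lists saved, setdiff() would name the members that flipped.
# Placeholder IDs only, for illustration.
closed_ids_before <- c(1, 2, 3, 4, 5)
closed_ids_now <- c(2, 4)
flipped_ids <- setdiff(closed_ids_before, closed_ids_now)
```

Here `flipped` is 629 and `remainder` is 71, matching the numbers above; `flipped_ids` holds the placeholder IDs that were closed before but are no longer closed.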

I suspect it's a whole load of tiny university based members that have suddenly gone open (somehow).

[1] "PPPM STIKES Jen. Achmad Yani Yogykarta" 
[2] "Politeknik APP Jakarta"                 
[3] "Crimean Astrophysical Observatory"      
[4] "Bryansk State Technical University BSTU"
[5] "Universitas Djuanda"                    
[6] "STEI Tazkia"     

even so it clearly warrants explanation!