IQSS / miniverse

Reference/Debug use: Using the Django ORM to explore the Dataverse database
2 stars 7 forks source link

Make metrics above installation map on dataverse.org not misleading #65

Closed jggautier closed 5 years ago

jggautier commented 6 years ago

The count of dataverses, datasets and file downloads above the Dataverse installation map look like they're from all installations, but they're from Harvard Dataverse only. We have no automated way to update those metrics with counts from the known installations.

screen shot 2017-11-28 at 6 04 14 pm

How can we make those numbers not misleading?

(Sorry for the weirdly-phrased titled. Trying not to use it to prescribe a solution.)

jggautier commented 6 years ago

Can we remove those numbers for now (and just have the count of installations in the center)?

pdurbin commented 6 years ago

Can we remove those numbers (and just have the count of installations in the center)?

@jggautier I love this idea.

pdurbin commented 6 years ago

The numbers are misleading. Check out this tweet from Jochen Apel at https://twitter.com/Duhem_/status/935133423877976065

"Quick, why does this website https://dataverse.org/ say that there are~50k datasets over all installations if Harvard Dataverse alone lists ~75k datasets?"

pdurbin commented 6 years ago

See this post for some confusion over the gray dots: https://groups.google.com/d/msg/dataverse-community/eQlSLFgzQXI/t6FANVynBgAJ

I think we should simply remove the gray dots for now. Like @jggautier said above, let's just show the orange dots and list the number of installations. The three non-installation metrics (dataverses, datasets, and files) are causing confusion. Perhaps in the future, we can try again with metrics that are not so specific to Harvard Dataverse, but rather reflect the entire Dataverse community. On my "Flagship Bias" doc at https://docs.google.com/document/d/1-LtDv_Yaf_3EPuMWcg5ZxH7M2ACrmCzPzgmbwbt8gMQ/edit?usp=sharing this is what I wrote:

Metrics also appear above the map on the project home page. The number of installations around the world is fantastic to highlight but the number of dataverse, datasets, and downloads apply only to the "flagship" installation.

pdurbin commented 6 years ago

I just approved pull request #66. The change makes sense. @mallove simply commented out the inaccurate metrics. Thanks!

pdurbin commented 6 years ago

This looks so much better:

screen shot 2018-01-23 at 12 34 05 pm

Thank you, @mallove !!

@jggautier you opened this issue. Are you ready to close it?

jggautier commented 6 years ago

Closing! Thanks @mallove!

pdurbin commented 6 years ago

It was noticed during SpinachCon 2018 ( https://github.com/IQSS/dataverse/issues/4505 ) that the misleading numbers are back so I'm re-opening this issue. Here's a screenshot from today:

screen shot 2018-04-18 at 4 22 27 pm

pdurbin commented 5 years ago

I spoke with @djbrooke today about this and he said I could go ahead and create an issue in the main issue tracker so I just did: https://github.com/IQSS/dataverse/issues/5429

Closing this one in favor of the new one.