gbif / analytics

The analytics scripts used to calculate and generate all time series (etc.) graphs
https://www.gbif.org/analytics/global
Apache License 2.0

GBIF Sweden gap analysis - include publisher in the invasive species query #3

Closed by sdaume 9 years ago

sdaume commented 9 years ago

Is it possible to include the publisher in the invasive species query ("query 4")? I would like to explore whether there are strong geographical and temporal biases in the records with regard to the publishers.

timrobertson100 commented 9 years ago

Sure thing. Do you want the publisher itself (which will be an ID for an institution, and in some cases will be difficult to resolve for legacy data), or do you mean the country of the publisher?

I'd recommend the latter, as I doubt a per-institution analysis makes much sense.

sdaume commented 9 years ago

I was actually thinking about country + institution (ID and/or name possibly). My guess would be that for invasive species observations the number of reporting institutions per country is fairly limited, but maybe I am wrong. Let me know if this is feasible or if we may end up with a very large dataset.

timrobertson100 commented 9 years ago

Fixed with https://github.com/gbif/analytics/commit/e3874bb260b27c6bee80b1485753b37963485560

sdaume commented 9 years ago

I actually used the wrong query number for this request. Query 4 ("pollinators") now has 'publisher_id' and 'publisher_country' added. This should remain, as I would like to explore this for pollinators as well, but could we add 'publisher_id' and 'publisher_country' to query 5 as well?
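
For illustration only, here is a minimal sketch of how the two added columns might be used to look for publisher-related bias once the query results are exported. Only 'publisher_id' and 'publisher_country' come from this thread. The `year` field, the sample rows, and both helper function names are assumptions made for the example, not part of the actual query output.

```python
from collections import Counter, defaultdict

# Hypothetical rows: only 'publisher_id' and 'publisher_country' are named
# in the thread; 'year' and all values are made up for illustration.
records = [
    {"publisher_id": "pub-a", "publisher_country": "SE", "year": 2010},
    {"publisher_id": "pub-a", "publisher_country": "SE", "year": 2011},
    {"publisher_id": "pub-b", "publisher_country": "SE", "year": 2011},
    {"publisher_id": "pub-c", "publisher_country": "NO", "year": 2011},
]


def records_per_country_year(rows):
    """Count records for each (publisher_country, year) pair."""
    return Counter((r["publisher_country"], r["year"]) for r in rows)


def publishers_per_country(rows):
    """Count distinct publisher IDs reporting from each country."""
    seen = defaultdict(set)
    for r in rows:
        seen[r["publisher_country"]].add(r["publisher_id"])
    return {country: len(ids) for country, ids in seen.items()}


if __name__ == "__main__":
    print(records_per_country_year(records))
    print(publishers_per_country(records))
```

The per-country publisher count gives a quick check on whether the number of reporting institutions is small enough for a per-institution breakdown to stay manageable.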