gbif / ipt

GBIF Integrated Publishing Toolkit (IPT)
https://www.gbif.org/ipt
Apache License 2.0
127 stars 57 forks source link

Requesting a summary report generation feature for IPT #1541

Open nbekolay opened 3 years ago

nbekolay commented 3 years ago

Good afternoon,

There's been considerable discussion among members of the OBIS Canada team re: best practices for tracking and reporting on the number of datasets hosted on the OBIS Canada IPT and the data that they contain.

Would it be possible for GBIF to create a summary report function that administrators could use to generate .xlsx/.csv tables listing all datasets hosted on their respective IPTs, along with the number of event and occurrence records (both total and present, if possible) contained within each dataset?

If this report could also list the last publication date, the publication status (i.e. "published" or "not published") and the visibility status (i.e. "private" or "public") of each dataset, that would be ideal. In other words, a table similar to the view presented in the "Manage Resources" tab with the addition of a tally of occurrence records (total and present) would be ideal. A summary like this would make the reporting process far more efficient and straightforward.

Thanks in advance! Nicholas Bekolay Fisheries and Oceans Canada/OBIS Canada

nbekolay commented 3 years ago

This is a quick follow-up re: our request for an IPT tracking report. When can we expect this request to be reviewed and assigned? And if a tracking report is to be introduced to IPT, when might this feature be available for use and/or testing by administrators?

Please include @cornthwaitem on all future replies as I will be assigned to a new team after April 1. Thank you!

ahahn-gbif commented 3 years ago

Hi @nbekolay, @cornthwaitem, thanks for your feedback. Just a preliminary response, as there there is currently no active IPT development again, yet.

I am not sure I completely understand the present situation of the installation (http://ipt.iobis.org/obiscanada/), maybe you can help: of the 164 datasets running on the installation, currently only 20 appear to be registered to and indexed by GBIF. Is that intentional, or an oversight?

Most of what you sketch out above should be possible in principle (data evaluation e.g. for presence / absence excepted). For datasets shared through GBIF, all this would be available through the API. Future plans for IPT development have not been drawn up and prioritized yet. It is primarily a product to support the needs of the GBIF publisher community that is also open to use by others.

pieterprovoost commented 3 years ago

@nbekolay While I realize that this does not solve the published / not published and public / private part of your request, I would like to add that event and occurrence statistics for publicly published datasets are available from the OBIS API:

nbekolay commented 3 years ago

@pieterprovoost Thank you, Pieter. These are definitely helpful tools for quantifying the broad-strokes metrics of OBIS Canada's recent contributions. To facilitate internal reporting within our organization, we've been manually tracking stats for individual publications. We were hoping to automate that process through a reporting feature, if and when possible.

nbekolay commented 3 years ago

@ahahn-gbif I'm going to defer to Maria (@cornthwaitem), the Canadian node manager for OBIS, for a response re: intentions and a timeline for GBIF publication.