codeforIATI / iati-ideas

💡 Ideas for new codeforIATI projects and blogs
https://ideas.codeforiati.org
0 stars 0 forks source link

Dashboard comparison of activities between endpoints #41

Open matmaxgeds opened 3 years ago

matmaxgeds commented 3 years ago

Rationale

It would be useful to easily see that the registry has X activities, DSv2 has y activities (no v1 or validator fails, no duplicates - just first instance, can be a short processing delay), datadump has Z (same as registry) activities, DSClassic has A activities (missing duplicate IDs etc), D-portal has C (no secondary publishers)....which ones have cached/historial data........this way users can judge which source best meets their needs - maybe as part of the guide to using IATI data - but ideally via an auto updated thing as it will go out of date fast.

Proposal

notshi commented 3 years ago

Here are some links for d-portal that will get you some of the data you need. These are generated directly from the database so they are always up to date.

  1. Total unique activity count json
  2. Total activity count json - includes duplicate ids
  3. Total publisher count json
  4. Total v1 count json - slow search as it's quite intensive ~18 seconds
  5. List of publishers json
  6. List of unique identifiers in csv
  7. List of identifiers in csv - includes duplicate ids

Getting the list of identifiers in json is intensive and will need paging so have opted for csv which is relatively quick.

matmaxgeds commented 3 years ago

Thanks @notshi, also linking up to @andylolz's gist which has many of the others: https://gist.github.com/andylolz/18649e999f2bfca17174ae3e90bb37a0

notshi commented 3 years ago

Woah, that's great! Really interesting to see these numbers on the same page. Hopefully we can get a clearer picture of the IATI ecosystem.

siemvaessen commented 3 years ago

I like this idea. As part of our efforts for AIDA we are thinking about setting up a /stats page with all the endpoints etc crunching some basic stats. Could we create a useful doc of stats needed/handy?