The-Academic-Observatory / academic-observatory-workflows

Telescopes, Workflows and Data Services for the Academic Observatory
https://academic-observatory-workflows.readthedocs.io
Apache License 2.0

Data Aggregation Improvements #22

Open rhosking opened 3 years ago

rhosking commented 3 years ago

A list of useful improvements to the DOI/Entity Aggregation Pipeline. This list also replaces and organises a few issues that have been in the backlog for a while and need addressing. Closing The-Academic-Observatory/observatory-platform#272, The-Academic-Observatory/observatory-platform#146, The-Academic-Observatory/observatory-platform#129, The-Academic-Observatory/observatory-platform#110 and The-Academic-Observatory/observatory-platform#70, as they are now covered here.

Cross cutting issues

Grids

Groups

Countries

Regions

Publishers, Funders and Journals

Funders

Citations

Events

Diversity

bechandcock commented 3 years ago

Group Aggregation / Dashboard Sandbox-Dev

The dashboard for Sandbox-Dev may have some data aggregation issues on the Groups tab. One example is "us_btaa_chicago", which has 3 GRIDs, one of them with no data.
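The exact failure is not pinned down above. As a minimal sketch of how a group rollup could tolerate a member GRID with no data, assuming per-GRID output counts keyed by year are already available, something like the following might help with debugging. The function name, the GRID ids and the counts are all hypothetical and are not taken from the pipeline.

```python
from collections import defaultdict


def aggregate_group(member_grids, outputs_by_grid):
    """Sum per-year output counts over a group's member GRIDs.

    Members with no data are skipped and reported separately, so an empty
    member cannot silently distort or blank out the group total.
    """
    totals = defaultdict(int)
    missing = []
    for grid_id in member_grids:
        counts = outputs_by_grid.get(grid_id)
        if not counts:  # covers both an absent key and an empty dict
            missing.append(grid_id)
            continue
        for year, n in counts.items():
            totals[year] += n
    return dict(totals), missing


# Hypothetical group with three member GRIDs, one of which has no data.
members = ["grid.A", "grid.B", "grid.C"]
outputs = {
    "grid.A": {2019: 10, 2020: 12},
    "grid.B": {2020: 3},
}
totals, missing = aggregate_group(members, outputs)
```

Returning the list of empty members alongside the totals makes it easy to see whether a gap on the Groups tab comes from the source data or from the aggregation step.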

bechandcock commented 3 years ago

DOI: MAG: Some of the GRIDs for MAG author affiliations are not being assigned correctly. It is unclear to me whether microsoft_academic_graph.authors.authors.GridId in the DOI table is assigned by us or comes directly from MAG, as this is where the "raw" author affiliation is standardised. For example:

DOI: 10.1080/13658816.2018.1521523

bechandcock commented 3 years ago

Discipline Aggregation

Disciplines need to be aggregated to coarser levels, e.g. the 19 Microsoft Academic Graph level-0 Fields of Study could be aggregated as follows:
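Independently of which grouping is chosen, the rollup could be sketched roughly as below. This is only an illustration of the mechanics: the coarse labels ("group_1", "group_2") are placeholders rather than a proposed grouping, the counts are invented, and the function name is hypothetical.

```python
from collections import Counter


def rollup_disciplines(level0_counts, coarse_of, default="Unmapped"):
    """Aggregate level-0 field counts into coarser discipline groups.

    Fields absent from the mapping are collected under `default` so that
    nothing is dropped silently.
    """
    coarse = Counter()
    for field, n in level0_counts.items():
        coarse[coarse_of.get(field, default)] += n
    return dict(coarse)


# Placeholder mapping: the coarse labels are NOT a proposed grouping.
COARSE_OF = {
    "Physics": "group_1",
    "Chemistry": "group_1",
    "History": "group_2",
}

# Invented counts for illustration only.
counts = {"Physics": 5, "Chemistry": 7, "History": 2, "Art": 1}
coarse_counts = rollup_disciplines(counts, COARSE_OF)
```

One caveat with summing counts this way: an output tagged with several level-0 fields that map to the same coarse group would be counted more than once, so a production version would probably need to deduplicate by output before summing.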