nlpsandbox / participation-dashboard

A flexdashboard that reports participation metrics for the NLP Sandbox
Apache License 2.0
0 stars 0 forks source link

New dashboard - Dahboard that reports the performance of the best solution of each task #24

Closed tschaffter closed 3 years ago

tschaffter commented 3 years ago

Goal

The NLP Sandbox decomposed the PHI annotation task into smaller, modular tasks like the date annotation task, person name annotation task, etc. One of the motivation is to enable tool developers to identify where their time would be best invested by looking at the leaderboard of each task. For example, if there is multiple solutions with a near perfect score for the date annotation task but no satisfying solution yet for the person name annotation task, this will indicates to the developer that their time would be best spent working on a new solution for the person name annotation task.

Instead of visiting all the leaderboards in order to obtain this information, we could compile this information in a small dashboard made of tiles (square or rectangle), one for each task. Each tile should include the following information:

Therefore, just by looking at the color of the tiles, one would be able to identify the challenging tasks for which no satisfying solution has been submitted yet (red-orange colors).

@andrewelamb We can discuss offline how to get the above information if needed.

Note that for each of the current task, we report a score for two datasets, so we can either:

Also since we report more than one performance metric, we would need to select one.

And to complicate further, we have task like the Location annotation task that report scores for two variants of the task. :)

Prototype

andrewelamb commented 3 years ago

Screenshot from 2021-09-01 08-20-15

andrewelamb commented 3 years ago

@tschaffter How does the color palette look to you? Some other options:

andrewelamb commented 3 years ago

Screenshot from 2021-09-09 09-57-06