rpodcast / prod-shiny-workshop

11 stars 0 forks source link

Choose data sets used in app examples #3

Open rpodcast opened 2 years ago

rpodcast commented 2 years ago

Ideally we can find data sets that have broad appeal and easily accessible.

rpodcast commented 2 years ago

Assorted ideas after some long searching:

rpodcast commented 2 years ago

Thanks to @chendaniely , we have a new contender: The Metropolitan Museum of Art Collection API and associated metadata. Important details:

These data will need a bit of cleaning and munging to prepare for an eventual application, but there are quite a few concents that we can illustrate effectively here. Such as executing APIs on demand, hosting data in a database, custom theming, etc.

rpodcast commented 2 years ago

I've spent a fair bit of time reviewing the MET art collection API and associated CSV of the data. I'm still working through the best ways to clean the data to make it easier for my workshop apps, but I'm thinking the app(s) at a high level could answer a few of the following questions and let the users drill down certain aspects of the data:

Artwork Questions / Summaries

Artist Questions / Summaries

General App Requirements

rpodcast commented 2 years ago

Update on data exploration: I have filtered for only public domain records and those art with only a single artist (i.e. entries that do not have multiple artists split by |). We still have almost 200k rows to play with, which is good enough for me!