rfordatascience / tidytuesday

Official repo for the #tidytuesday project
Creative Commons Zero v1.0 Universal

Know your meme dataset #458

Open: opened by gkaramanis 2 years ago

gkaramanis commented 2 years ago

Dataset kym21_03_2022.zip (JSON):

https://owncloud.ut.ee/owncloud/s/2LosgCo4bTjGM8n

Article: https://knowyourmeme.com/editorials/insights/where-do-memes-come-from-the-top-platforms-from-2010-2022

Seen at: https://s2.washingtonpost.com/camp-rw/?trackId=61b504ca9bbc0f79fd77b746&s=63135e17ab732227d00897ce

jonthegeek commented 1 year ago

What is the source of the dataset? We need to make sure we can track the usage rights of any datasets we use. The Washington Post link is dead, and the Article doesn't share a dataset.

gkaramanis commented 1 year ago

It’s most probably scraped. The other dataset I had found was https://www.kaggle.com/datasets/podsyp/a-lot-of-memes-info-stats

The link was for the "How to read this chart" newsletter. It had images from the article, but no dataset.

lgibson7 commented 1 month ago

Hi @gkaramanis. Thanks for submitting this issue. Would you be willing to submit the dataset through a PR? You can find the instructions on how to do so here.