Daniel-Mietchen / ideas

A dumping ground for halfbaked ideas, some of which will hopefully be worked on soon
Other
26 stars 6 forks source link

Look into Zika-related Wikipedia pageview data #1305

Open Daniel-Mietchen opened 4 years ago

Daniel-Mietchen commented 4 years ago

as per The impact of news exposure on collective attention in the United States during the 2016 Zika epidemic and https://analytics.wikimedia.org/published/datasets/one-off/zika-research/ .

Something similar for COVID is probably a good idea, and perhaps already underway.

Daniel-Mietchen commented 4 years ago

Archived the data via http://web.archive.org/web/20200920011811/https://analytics.wikimedia.org/published/datasets/one-off/zika-research/ and http://web.archive.org/web/20200920011819if_/http://web.archive.org/screenshot/https://analytics.wikimedia.org/published/datasets/one-off/zika-research/

Daniel-Mietchen commented 4 years ago

The README was not properly archived — have to retry.

Daniel-Mietchen commented 4 years ago

Next try sits at http://web.archive.org/web/20200920014248/https://analytics.wikimedia.org/published/datasets/one-off/zika-research/ — seems to load forever, with the same result as above.

So I am going to paste the content of the README here:

"Dataset 1: 
Content of each line of the dataset: day, pageview_count
The dataset contains the daily number of pageview counts of 128 different Wikipedia pages related to the Zika virus (aggregated and summed to total) originated in the United States, from January 1st to December 31st, 2016.

Dataset 2:
Content of each line of the dataset: day, pageview_count, state
The dataset contains the daily number of pageview counts of 128 different Wikipedia pages related to the Zika virus (aggregated and summed to total) originated in the United States, disaggregated by state, from January 1st to December 31st, 2016.

Dataset 3:
Content of each line of the dataset:
US_city, pageview_count_Zika,pageview_count_total
The dataset contains the total number of pageview counts of 128 different Wikipedia pages related to the Zika virus (pageview_count_Zika) originated in 788 cities (US_city) of the United States with population larger than 40,000 in 2016.
The dataset also contains the total number of pageview counts to all Wikipedia pages (all Wikipedia projects, pageview_count_total) originated in 788 cities (US_city) of the United States with population larger than 40,000 in 2016."