Use correct data sets 2006-2022

valeriehase commented 8 months ago

We have two data sets we need to combine: 2_data_preperation_data_old.csv & 1_read_newdata_data_new.csv

old data set (2006-2021, N = 188,291)

the GEC data set:
- originally 1998-2018, reduced to 2006-2018 for this analysis since 2006 is the first year for which we could resample all outlets via Factiva
- N = 153,203
- search terms: "global warming", "climate change", "greenhouse effect"

the Flottes/Festschrift data set:
- includes new files resampled for 2019-2021
- N = 35,088 (ONLY new files 2019-2021)
- search terms: "global warming", "climate change", "greenhouse effect"

--> both are in "2_data_preperation_data_old.csv", which is read in step 2 (_2_datapreperation.ipynb)
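As a quick sanity check, the total for the old data set should equal the sum of its two parts. A minimal sketch: the counts are the ones quoted in this issue, while the variable names are purely illustrative and not taken from the notebooks.

```python
# Counts quoted in this issue; variable names are illustrative only.
N_GEC = 153_203           # GEC data set, 2006-2018
N_FLOTTES = 35_088        # Flottes/Festschrift data set, new files 2019-2021
N_OLD_EXPECTED = 188_291  # reported size of the old data set

n_old = N_GEC + N_FLOTTES
assert n_old == N_OLD_EXPECTED, f"expected {N_OLD_EXPECTED}, got {n_old}"
```

The same kind of assertion could be run against the row count of the loaded file in step 2.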

new data set (2006-2022, N = 36,648)

includes old data resampled with new search terms (2006-2021) and new data with old and new search terms (2022)
search terms:

(a) Top 3 urgent terms (2006-2022):
- FACTIVA: (climate crisis OR climate emergency OR climate catastrophe) NOT (climate change OR global warming OR greenhouse effect)
- NEXIS: (climate crisis ODER climate emergency ODER climate catastrophe) UND NICHT (climate change ODER global warming ODER greenhouse effect)

(b) All neutral and urgent terms (2006-2022):
- FACTIVA: (climate warming OR climatic change OR greenhouse warming OR warming climate OR climatic disruption OR climate catastrophe OR climate chaos OR climate crisis OR climate disaster OR climate emergency OR global heating OR climate breakdown OR climate threat) NOT (climate change OR global warming OR greenhouse effect)
- NEXIS: (climate warming ODER climatic change ODER greenhouse warming ODER warming climate ODER climatic disruption ODER climate catastrophe ODER climate chaos ODER climate crisis ODER climate disaster ODER climate emergency ODER global heating ODER climate breakdown ODER climate threat) UND NICHT (climate change ODER global warming ODER greenhouse effect)

(c) Neutral terms (2022):
- FACTIVA: climate change OR global warming OR greenhouse effect
- NEXIS: climate change ODER global warming ODER greenhouse effect
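The Factiva and Nexis queries differ only in their Boolean operators (OR / NOT versus ODER / UND NICHT), so they can be generated from one term list to avoid copy-paste drift. A minimal sketch, assuming a hypothetical helper `build_query` that is not part of the repository; the term lists are the ones from searches (a) and (c).

```python
# Hypothetical helper, not part of the repository: builds the same Boolean
# query for either database from a single list of terms.
OPERATORS = {
    "factiva": {"or": "OR", "and_not": "NOT"},
    "nexis": {"or": "ODER", "and_not": "UND NICHT"},
}

NEUTRAL_TERMS = ["climate change", "global warming", "greenhouse effect"]
TOP3_URGENT_TERMS = ["climate crisis", "climate emergency", "climate catastrophe"]


def build_query(include, exclude=None, database="factiva"):
    """Join `include` with OR; optionally exclude texts matching `exclude`."""
    ops = OPERATORS[database]
    joiner = f" {ops['or']} "
    included = joiner.join(include)
    if not exclude:
        return included
    return f"({included}) {ops['and_not']} ({joiner.join(exclude)})"


# Search (a) for Factiva and search (c) for Nexis
print(build_query(TOP3_URGENT_TERMS, NEUTRAL_TERMS, database="factiva"))
print(build_query(NEUTRAL_TERMS, database="nexis"))
```

Search (b) would use the same call with the longer list of neutral and urgent terms as `include`.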

--> in "1_read_newdata_data_new.csv" which is created in step 1 (_1_readnewdata.ipynb)
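Combining the two files could look roughly like the sketch below. It is not the code from the notebooks: the function name `combine_data_sets`, the `source` column, and the `id`/`year` columns in the stand-in data are all assumptions made for illustration, and it uses only the standard library so that it runs anywhere.

```python
import csv
import io


def combine_data_sets(old_file, new_file):
    """Stack the rows of both CSV files and tag each row with its origin."""
    combined = []
    for label, handle in (("old", old_file), ("new", new_file)):
        for row in csv.DictReader(handle):
            row["source"] = label  # hypothetical provenance column
            combined.append(row)
    return combined


# Tiny in-memory stand-ins with made-up columns; for the real data, pass
# open("2_data_preperation_data_old.csv", newline="", encoding="utf-8") and
# open("1_read_newdata_data_new.csv", newline="", encoding="utf-8") instead.
old_csv = io.StringIO("id,year\n1,2006\n2,2021\n")
new_csv = io.StringIO("id,year\n3,2022\n")

rows = combine_data_sets(old_csv, new_csv)
print(len(rows))
```

If no rows are dropped or deduplicated during the merge, the combined data should contain 188,291 + 36,648 = 224,939 texts, which makes a convenient check after loading.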

valeriehase commented 8 months ago

I replaced the "old" data file in Teams with the correct one. The previous file also included data from 1996, but only texts in which the "neutral" terms occurred at least twice.

valeriehase commented 8 months ago

Agreed and done!

XiaoyueYXY / Climate-Compounds

Use correct data sets 2006-2022 #8