fivethirtyeight / russian-troll-tweets

770 stars 215 forks source link

character encoding for content column #6

Closed eyejosh closed 6 years ago

eyejosh commented 6 years ago

seems the character encoding is off for the "content" column. special chars are showing up as weird text:

Ð?Ñ?иÑ?ина #118 ЯÑ?овая пÑ?осиÑ? Ð?асилÑ?евÑ? создаÑ?Ñ? меÑ?од для Ñ?Ñ?иÑ?елей по вÑ?явлениÑ? инÑ?еÑ?неÑ?-зависимÑ?Ñ? деÑ?ей

convert to UTF-8 maybe?

edsu commented 6 years ago

Is this just for IRAhandle_tweets_6.csv?

eyejosh commented 6 years ago

I think it’s for all of them

dmil commented 6 years ago

Thanks for your note. Please see #5