fivethirtyeight / russian-troll-tweets


Desperately Seeking Schema #22

Open jpallas opened 6 years ago

jpallas commented 6 years ago

It would be nice if the table in the README could be updated with the type of each field. In particular, for fields that are enumerated constants (such as post_type and account_type), list the set of valid values, and for all fields indicate whether they are nullable. Since the data is not in raw Twitter format, a link to https://help.salesforce.com/articleView?id=mc_ss_csv_report_headers.htm&type=5 might be helpful, too.

bet4a commented 6 years ago

I’m not 100% certain about which fields may be nullable. But here’s a list of enumerated constant field values, for what it’s worth:
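Nullability and enumerated values can both be checked against the CSV files directly. What follows is a minimal sketch of one way to do that, not an official schema: the helper name `profile_columns`, the `max_enum_values` cutoff, and the sample rows are all assumptions made up for illustration. Only the column names post_type and account_type come from this thread.

```python
import csv
import io
from collections import defaultdict


def profile_columns(rows, max_enum_values=10):
    """For each column, report whether blanks occur and which distinct values appear.

    `rows` is any iterable of dicts, e.g. a csv.DictReader.
    Columns with more than `max_enum_values` distinct values are not
    treated as enumerations (enum_values is None for them).
    """
    has_blank = defaultdict(bool)
    values = defaultdict(set)
    for row in rows:
        for name, value in row.items():
            if name is None:
                # DictReader stores surplus fields under the key None; skip them.
                continue
            if value is None or value.strip() == "":
                has_blank[name] = True
            else:
                values[name].add(value)

    profile = {}
    for name in set(values) | set(has_blank):
        distinct = values[name]
        profile[name] = {
            "nullable": has_blank[name],
            "enum_values": sorted(distinct) if len(distinct) <= max_enum_values else None,
        }
    return profile


# Hypothetical sample rows; the values are placeholders, not real dataset values.
SAMPLE = """author,post_type,account_type
user_a,,GROUP_X
user_b,KIND_A,GROUP_X
user_c,KIND_B,GROUP_Y
"""

if __name__ == "__main__":
    reader = csv.DictReader(io.StringIO(SAMPLE))
    for name, info in sorted(profile_columns(reader).items()):
        print(name, info)
```

To run it against a real file, pass `csv.DictReader(open(path, newline="", encoding="utf-8"))` instead of the in-memory sample. A blank found this way only shows that a field is empty in the data at hand; it does not prove the field was designed to be nullable.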

EvanCarroll commented 6 years ago

Schema can be found in my project, https://github.com/EvanCarroll/russian-troll-tweets/blob/master/PostgreSQL/create.psql

EvanCarroll commented 6 years ago

New version 2.0 schema for PostgreSQL. We now have primary keys (unique Twitter IDs) and int8 account IDs. https://github.com/EvanCarroll/russian-troll-tweets/blob/version_2/PostgreSQL/create.psql
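For anyone loading the data into a schema like this, the two constraints mentioned can be checked before import: a primary key needs unique values, and PostgreSQL's int8 is a signed 64-bit integer. The sketch below is an illustration only and is not taken from the linked project. The function name `check_ids` and the sample ID strings are made up.

```python
# Bounds of a signed 64-bit integer, which is what PostgreSQL's int8 (bigint) stores.
INT8_MIN = -(2**63)
INT8_MAX = 2**63 - 1


def check_ids(raw_ids):
    """Classify ID strings that would break a unique int8 column.

    Returns three lists of the offending raw strings:
    (duplicates, out_of_range, unparseable).
    """
    seen = set()
    duplicates, out_of_range, unparseable = [], [], []
    for raw in raw_ids:
        try:
            value = int(raw)
        except ValueError:
            # Covers blanks and values mangled into scientific notation.
            unparseable.append(raw)
            continue
        if not INT8_MIN <= value <= INT8_MAX:
            out_of_range.append(raw)
        if value in seen:
            duplicates.append(raw)
        seen.add(value)
    return duplicates, out_of_range, unparseable


if __name__ == "__main__":
    # Hypothetical ID strings for illustration only.
    sample = [
        "914580356430536707",
        "914580356430536707",    # repeated on purpose
        "9.06E+17",              # not a plain integer
        "99999999999999999999",  # larger than int8 can hold
    ]
    dups, too_big, bad = check_ids(sample)
    print("duplicates:", dups)
    print("out of range:", too_big)
    print("unparseable:", bad)
```

Parsing with `int` rather than `float` matters here: 64-bit IDs can exceed 2**53, beyond which floating-point values no longer represent every integer exactly.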