DocNow / twarc-csv

A plugin for twarc2 for converting tweet JSON into DataFrames and exporting to CSV.
MIT License
31 stars 10 forks source link

Add usernames from expansions to columns #47

Closed igorbrigadir closed 1 year ago

igorbrigadir commented 2 years ago

Currently there are user IDs only, but it would help to also have user names. Not all user columns should be added, as this would make an unreasonably wide dataframe (would need all author columns for each quoted, retweeted, etc user) but just adding names is enough to make things easier.

Additionally, document extracting and converting user objects too maybe.

igorbrigadir commented 2 years ago

On second look this is significantly more awkward - but might still be possible.

igorbrigadir commented 1 year ago

I went with a simpler approach, just usernames there are now extra columns:

in_reply_to_username
retweeted_username
quoted_username

Which should cover all the possibilities (A tweet can be both a reply and a quote for example)