godkingjay / selenium-twitter-scraper

This is a Twitter Scraper which uses Selenium for scraping tweets. It is capable of scraping tweets from home, user profile, hashtag, query or search, and advanced searches.
Apache License 2.0
177 stars 47 forks source link

(request) Scrape each post/tweet [TWEET_URL], optionally [POSTER_TWITTER_ID], [TWEET_ID] #5

Closed badjoerichards closed 1 year ago

badjoerichards commented 1 year ago

The [TWEET_URL] is important to reference the every post/tweet that has been scraped. Can this be done?

Additionally, [POSTER_TWITTER_ID] and [TWEET_ID] is also useful data if possible

This should make the scraper very useful and powerful.

godkingjay commented 1 year ago

It's possible. However, for the [POSTER_TWITTER_ID] and [TWEET_ID] we have to save them into a string format like this:

[POSTER_TWITTER_ID]

user_id:{user_id}

[TWEET_ID]

tweet_id:{tweet_id}

To save the full id in the csv.

commits: doc: update additional data 'pd' description feat: scrape tweet link, tweet id, and user id feat(ipynb): Add tweet link, tweet id, and user id

badjoerichards commented 1 year ago

This has become an incredible tool. thank you!