shaikhsajid1111 / twitter-scraper-selenium

Python's package to scrap Twitter's front-end easily
https://pypi.org/project/twitter-scraper-selenium
MIT License
305 stars 47 forks source link

Install false #24

Open hoangthangbk51 opened 2 years ago

hoangthangbk51 commented 2 years ago

image Please check . Some thinks wrong . Thanks for suppot

shaikhsajid1111 commented 2 years ago

You're using Python 3.10?

shaikhsajid1111 commented 2 years ago

You can clone this repository and try to install it from the source as well.

  1. Clone the repository
  2. Open Terminal in the repository directory
  3. Use the command pip install .
hoangthangbk51 commented 2 years ago

Ok thanks for support . I will try . And feedback. So what is vesion of python requirement ?

hoangthangbk51 commented 2 years ago

image

hoangthangbk51 commented 2 years ago

After install it show "name "ex" not defined ." Please check on picture

shaikhsajid1111 commented 2 years ago

Can you please share what code you're using that causes this error?. I just tried and it is working fine for me but might be something which is happening at your side

hoangthangbk51 commented 2 years ago

from twitter_scraper_selenium import scrap_profile scrap_profile(twitter_username="LauraHa51649745",output_format="csv",browser="chrome",tweets_count=10,filename="microsoft",directory="home\use\downloads") this is my code .

shaikhsajid1111 commented 2 years ago

You're probably hitting the authentication page here. I got it too few times, image

hoangthangbk51 commented 2 years ago

OK thanks . i got it . I have a question . i have a list of usenam id with txt file , and i would like to get the lastest tweet link and the lastest retweet link of all use on the list . Is posible ?

shaikhsajid1111 commented 2 years ago

Yeah, it is possible

hoangthangbk51 commented 2 years ago

OH , I i happy to hear that. Can you give me some instruction

shaikhsajid1111 commented 2 years ago

There are several ways I see,

  1. You can use twitter's API. But It often has a rate limit and is sometimes hard to deal with.
  2. You can use this library with the proxy of the country which might have loose internet laws. You're hitting auth wall frequently probably that's the reason. I tried the same username twice or thrice and it worked for me.
  3. You can even check this library https://github.com/JustAnotherArchivist/snscrape. Not sure if is still working but last year it was working though, it uses Twitter's API I assume.

If you're planning to create your own web crawler, you can't use simple HTTP requests to scrape because front-end code is injected via JavaScript. You must use a Web browser in an unofficial way but still, you'll have to deal with auth wall. Have a look at their API documentation, they've tweet id in their API response which they call the status field, you can just use that single field to generate a tweet's URL. Tweet's URL can be generated just by simple string concatenation https://twitter.com/{twitter_username}/status/{status} . I think they probably will have some similar mechanism for retweets as well. Your data for requirements are much simpler and can be resolved via API itself, if you need more data which is not present in their API then you should go with this library.

hoangthangbk51 commented 2 years ago

Thankyou so much. I will learn and try