twintproject / twint

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
MIT License
15.75k stars 2.72k forks source link

difference in tweets #344

Open Arwa200 opened 5 years ago

Arwa200 commented 5 years ago

Issue Template

Please use this template!

Initial Check

[QUESTION]

I am wondering if you have ever notice that there is difference between recorded numbers in twitter and output numbers from twint tool, I notice that in favorite, media and tweets and ,with a consideration that some tweets has been removed, or posts from private accounts still there is a clear difference between them! Thank You

Make sure you've checked the following:

  • [] Python version is 3.6;
  • [] Updated Twint with pip3 install --upgrade -e git+https://github.com/twintproject/twint.git@origin/master#egg=twint;
  • [] I have searched the issues and there are no duplicates of this issue/question/request.

Environment Details

Using OS

pielco11 commented 5 years ago

May you provide an example? (like with which user, etc.) Thank you in advance!

Arwa200 commented 5 years ago

sure, I will get you back soon as i took the screen shots..

Arwa200 commented 5 years ago

for followers I just seen another issue and you label it a twitter flow tried with @TEDTalks for likes got "3184" Screen Shot 2019-06-14 at 6 52 32 AM

for media got more than original number! "5764" Screen Shot 2019-06-14 at 6 52 46 AM

did this simultaneously on Screen Shot 2019-06-14 at 6 52 53 AM

Thank you for your time

pielco11 commented 5 years ago

About likes, it could be that your target liked tweets of private accounts, which are protected and not visible to everyone. Otherwise Twitter may have just blocked your requests

About media, that's quite strange. Did you get this issue with other users too? -------- Messaggio originale -------- On 17 Giu 2019, 03:40, Arwa ha scritto:

for followers I just seen another issue and you label it a twitter flow tried with @tedtalks for likes got "3184" Screen Shot 2019-06-14 at 6 52 32 AM

for media got more than original number! "5764" Screen Shot 2019-06-14 at 6 52 46 AM

did this simultaneously on Screen Shot 2019-06-14 at 6 52 53 AM

Thank you for your time

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

Arwa200 commented 5 years ago

yes, tried with Journalism2ls, got 4708 out of Screen Shot 2019-06-18 at 7 31 43 AM

on Screen Shot 2019-06-18 at 7 32 10 AM

at the end of cod I got this in terminal Screen Shot 2019-06-18 at 7 46 54 AM

pielco11 commented 5 years ago

Timeout error is about a connection issue, not Twint

I'll dig deeper as soon as possible

Thanks for reporting -------- Messaggio originale -------- On 18 Giu 2019, 06:48, Arwa ha scritto:

yes, tried with Journalism2ls, got 4708 out of Screen Shot 2019-06-18 at 7 31 43 AM

on Screen Shot 2019-06-18 at 7 32 10 AM

at the end of cod I got this in terminal Screen Shot 2019-06-18 at 7 46 54 AM

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.