Closed dnikola closed 3 years ago
Hi @JustAnotherArchivist
after some research and installing development version i see that option --jsonl works fine so i got answer on question 1.
what i have seen i search for some hashtag on instagram which include letters like Š Ć Č and when url is retrieved also see that this letters are converted Š in S , Č in C, Ć in C what is wrong.
I get only few post's and the rest are skipped because letters are converted ?
Thanks
Hi @JustAnotherArchivist
Thanks for your fast replay, please check mine
edit
after pasting link second one i have seen that this characters has been converted so tried to search for hashtag like that and got 26 result
?
snscrape --jsonl --verbose instagram-hashtag ispričajpričudokraja
works as expected for me and returns 26 posts.
- Right. What you get as JSONL output is everything that snscrape extracts currently. The modules/scrapers differ strongly in that respect. If you have specific suggestions what should be added, please file an issue for that.
- That sounds like an issue with your Windows terminal. You probably have to enable UTF-8 (cf. UnicodeEncodeError on Windows command prompt when UTF-8 output is produced #122).
snscrape --jsonl --verbose instagram-hashtag ispričajpričudokraja
works as expected for me and returns 26 posts.
Hi @JustAnotherArchivist
thanks for your replay.
- Facebook doesn't let users list posts with hashtags unless you're logged in. snscrape doesn't support logging in.
- No tricks. That scraper doesn't seem to work at all for me at the moment, although it might depend on the profile page (as some use the old design and others the new). Facebook is a mess and will also quickly ban your IP, so fixing it is tricky.
hi let me replay
could you please check adding Facebook hashtag? I can share our current scripts with you.
Regards Nikola
Huh. I just took a look again, and it seems like https://www.facebook.com/hashtag/hashtag does work now. It only returned a few posts the last time I checked (which was admittedly a long time ago, probably over a year). I assume that's what your scripts are based on as well? I'll file a separate issue on adding support for this.
Huh. I just took a look again, and it seems like https://www.facebook.com/hashtag/hashtag does work now. It only returned a few posts the last time I checked (which was admittedly a long time ago, probably over a year). I assume that's what your scripts are based on as well? I'll file a separate issue on adding support for this.
Yes it is based on that :) also, when you visit profile / page and scroll down you can load more post... possible to make your script scrape more post from page / profile ?
Reopened #31
snscrape already handles the scrolling. If it stops mid-feed, that normally means your IP is banned, although snscrape is supposed to throw an error in that situation (cf. #208, #250).
Reopened #31
snscrape already handles the scrolling. If it stops mid-feed, that normally means your IP is banned, although snscrape is supposed to throw an error in that situation (cf. #208, #250).
it just return first two post, no error :) so i think that doesn't scroll
Facebook probably changed how the scrolling works then. It definitely used to work. I'll see if I can debug that sometime soon. Facebook's code is a massive mess to work with though.
Closing this as the questions have been answered. I'll look into the potential Facebook scraper bugs when I have time.
Hello
this library look amazing. I would have few questions if possible please repaly:
Best regards!