JustAnotherArchivist / snscrape

A social networking service scraper in Python
GNU General Public License v3.0
4.31k stars 698 forks source link

Facebook module mistakenly ignores some photo URLs #42

Closed JustAnotherArchivist closed 5 years ago

JustAnotherArchivist commented 5 years ago

For example, scraping DanoveAOdvodoveTipyJozefaMihala after a while yields the warning:

2019-05-26 00:43:57.507  WARNING  snscrape.modules.facebook  Ignoring odd link: https://www.facebook.com/photo.php?fbid=10207652901634185&set=a.1131882710860&type=3&__xts__%5B0%5D=...

It should yield https://www.facebook.com/photo.php?fbid=10207652901634185 instead.

JustAnotherArchivist commented 5 years ago

Another example: https://www.facebook.com/photo.php?fbid=186962201667236 when scraping manfred.weber.

One thing worth noting is that both of these examples have a different originator account name than the one I was running the scrape on. Not sure if that's just a coincidence or not.