kevinzg / facebook-scraper

Scrape Facebook public pages without an API key
MIT License
2.27k stars 611 forks source link

Does not see "text" posts at all #86

Closed TearsOfTheEarth closed 3 years ago

TearsOfTheEarth commented 3 years ago

Hi,

The parser does not pick "large text" posts, here is an example of such: https://www.facebook.com/groups/aerosokol/permalink/3425362160831169/

TearsOfTheEarth commented 3 years ago

Quick fix

--- /tmp/extractors.py  2020-07-13 20:02:51.990000000 +0300
+++ facebook-scraper/facebook_scraper/extractors.py     2020-07-13 20:22:45.070000000 +0300
@@ -136,7 +136,7 @@
                 response = self.request(url)
                 element = response.html.find('.story_body_container', first=True)

-        nodes = element.find('p, header')
+        nodes = element.find('p, header, span[role=presentation]')
         if nodes:
             post_text = []
             shared_text = []