apurvmishra99 / facebook-scraper-selenium

GNU General Public License v3.0
210 stars 80 forks source link
automated-scraper facebook-crawler facebook-groups facebook-groups-scraper facebook-pages-scraper facebook-scraper facebook-scraper-software facebook-scraper-tool facebook-scraping facebook-status-scraper selenium selenium-python selenium-scraper

FaceBook-Scraper [2020]

Scrape posts from any group or user into a .csv file without needing to register for any API access


How to use it?

Firstly, make sure you have selenium >= 3.141.0, GeckoDriver and FireFox installed.

Store your email and password for Facebook login in credentials.txt.

Use scraper.py to collect the data.

usage: fb-scraper/scraper.py [-h] [--pages PAGES [PAGES ...]] [--groups GROUPS [GROUPS ...]][-d DEPTH]
Data Collection
arguments:
  -h, --help            show this help message and exit
  -p, --pages PAGES [PAGES ...]
                        List the pages you want to scrape
                        for recent posts

  -g, --groups GROUPS [GROUPS ...]
                        List the groups you want to scrape
                        for recent posts

  -d DEPTH, --depth DEPTH
                        How many recent posts you want to gather in
                        multiples of (roughly) 8.

Example: python scraper.py --pages feelzesty -d 20


The output is posts.csv inside the script folder.

Output is in three columns: PosterName(Author), uTime and Text