williamxie11 / glassdoor-interview-scraper

Web scraper for Glassdoor interview review data
33 stars 30 forks source link

glassdoor-interview-scraper

Glassdoor web crawler and scraper providing interview experience data for Decoding The Interview.

This was an academic project for CS 410 - Text and Information Systems at UIUC and is no longer maintained.

Installation

Usage

  1. Open the scraper Python script with a text editor of your choice.
  2. Add your Glassdoor account username and password username and password
  3. Specify the number of pages, the company name, and the URL of the interviews page for the company on Glassdoor with your specified filters selected scraper settings
  4. Run the scraper
    $ python scraper_v1.2.py

NOTE: Glassdoor will require you to insert CAPTCHA on login or during the scraping process. The script will poll until CAPTCHA is entered during scraping.

Results

response

The web scraper will output a JSON with the name "[company name].json" in the same directory. Each data point in the JSON corresponds to one interview review on Glassdoor with attributes (see above) for each portion of the review.

Changelog

v1.2

v1.1

v1.0