EssamWisam / cmp-docs

A comprehensive guide for prospective, current and past students in the computer engineering department of Cairo university.
https://cmp-docs.pages.dev
52 stars 8 forks source link

⏰ Reminder to Run LinkedIn Scraper #65

Closed github-actions[bot] closed 2 weeks ago

github-actions[bot] commented 2 weeks ago

It's been two weeks.

It's time to run the LinkedIn script to get the latest titles and current positions of CMP students and graduates.

To run the script follow the steps below and if it's not your first time running the script, you can just start from step 4:

Steps for running the LinkedIn script:

  1. Make sure you have Python 3 downloaded on your device. You can check by running the command below in your bash terminal and it should display the Python version if it is already installed.
    python --version
  2. Install all the needed Python packages using the requirements.txt present in the scripts/linkedin-scraper directory.
    pip install -r "scripts/linkedin-scraper/requirements.txt"
  3. Download the Chrome Driver that is compatible with your OS and Chrome Version from this link. It should be a zip file of about 10 MBs or less. Extract it using WinRAR or a similar archive manager. Then copy the chromedriver.exe file to the scripts/linkedin-scraper directory.
  4. Set the enivronment variables with valid LinkedIn credentials in the bash terminal as following:
    export LINKEDIN_SCRAPER_EMAIL=<email>
    export LINKEDIN_SCRAPER_PASSWORD=<password>

    and replace <email> and <password> with the actual LinkedIn credentials. Note, you should probably avoid using your main LinkedIn account credentials to avoid running the risk of it being banned by LinkedIn after multiple scraping.

  5. Finally, you can run the script on all the class yaml files using the command below:
    python "scripts/linkedin-scraper/run.py" 

    and if you want to run the script for a certain class only, use the command below and replace 20XX with the graduation year of said class:

    python "scripts/linkedin-scraper/linkedin-scraper.py" "public/department/Extras/Classes/C20XX.yaml"

Last Notes: