austinoboyle / scrape-linkedin-selenium

`scrape_linkedin` is a Python package that lets you scrape personal LinkedIn profiles and company pages, turning the data into structured JSON.
MIT License
447 stars, 162 forks

Job Crawler/Scraper/Parser #6

Open austinoboyle opened 6 years ago

austinoboyle commented 6 years ago

Scrape jobs by various filters:

First Use Case: Scrape all jobs in Kingston

Relevant URL https://www.linkedin.com/jobs/search/?keywords=&location=Kingston%2C%20Ontario%2C%20Canada&sortBy=DD
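The search URL above is a base path plus three query parameters (`keywords`, `location`, `sortBy`). A minimal sketch of building it for an arbitrary location, using only the standard library, could look like this. The helper name `build_job_search_url` and its defaults are assumptions for illustration and are not part of `scrape_linkedin`.

```python
from urllib.parse import quote, urlencode

# Base path taken from the URL in this issue.
BASE_URL = "https://www.linkedin.com/jobs/search/"


def build_job_search_url(location, keywords="", sort_by="DD"):
    """Hypothetical helper: build a job search URL for a location.

    Only the three parameters that appear in the URL above are used.
    """
    params = {"keywords": keywords, "location": location, "sortBy": sort_by}
    # quote_via=quote encodes spaces as %20 (not '+'), matching the URL above.
    return BASE_URL + "?" + urlencode(params, quote_via=quote)


url = build_job_search_url("Kingston, Ontario, Canada")
print(url)
```

Passing a different `location` string is the only change needed to target another city, assuming LinkedIn keeps accepting the same parameter names.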

Process:

  1. Scrape Basic Info for All Jobs
  2. Based on Basic Scrape (job_id), run parallel scrape to get detailed info on all jobs
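The two-step process above could be sketched roughly as follows, with a thread pool standing in for the "parallel scrape". Both scraping functions are hypothetical placeholders that return canned data; a real version would drive a browser or issue HTTP requests, and nothing here reflects the package's actual API.

```python
from concurrent.futures import ThreadPoolExecutor


def scrape_basic_info():
    """Step 1 (placeholder): return basic records, each carrying a job_id."""
    return [
        {"job_id": "101", "title": "Job A"},
        {"job_id": "102", "title": "Job B"},
    ]


def scrape_job_details(job_id):
    """Step 2 (placeholder): a real version would load the job's own page."""
    return {"job_id": job_id, "description": f"details for {job_id}"}


def scrape_all_jobs(max_workers=4):
    basic = scrape_basic_info()
    job_ids = [job["job_id"] for job in basic]
    # pool.map runs the detail scrapes concurrently and preserves input order,
    # so each detail record lines up with its basic record.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        details = list(pool.map(scrape_job_details, job_ids))
    # Merge basic and detailed fields into one record per job.
    return [{**b, **d} for b, d in zip(basic, details)]


jobs = scrape_all_jobs()
print(jobs)
```

Threads are used here because scraping is mostly I/O-bound; whether several browser sessions can safely run side by side is a separate question this sketch does not address.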

Basic Fields

Detailed Info

simarpreetsingh-019 commented 3 years ago

@austinoboyle I am working on a similar issue for my project. I have mostly found which classes I should parse to extract the info, but I got stuck when I tried to download the page source. I ran:

`
import requests
from bs4 import BeautifulSoup

r = requests.get('https://linkedin.com/jobs/')
html_content = r.content

print(html_content)

print()
soup = BeautifulSoup(html_content, 'html.parser')
print(soup)
`

to which I got this output: `

`

Could you or anyone else help me get the exact page source?

It would be helpful for this issue as well. I know this is an older issue, but I did not see a reason to create a new one when a similar issue already exists. If needed, I will open a new one.
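One way to narrow down an empty result like the one above is to look at what the server actually returned before parsing it. The sketch below is only a diagnostic aid, not a fix: `summarize_response` is a hypothetical helper, and it makes no assumption about why the body is empty.

```python
def summarize_response(status_code, body, final_url):
    """Hypothetical helper: summarize an HTTP response for debugging.

    `body` may be bytes (e.g. `r.content`) or str (e.g. `r.text`).
    """
    if isinstance(body, bytes):
        text = body.decode("utf-8", errors="replace")
    else:
        text = body
    return {
        "status_code": status_code,
        "final_url": final_url,         # differs from the request URL after redirects
        "body_length": len(text),
        "is_empty": not text.strip(),   # True when only whitespace came back
    }


# Example with made-up values; no network request is made here.
summary = summarize_response(200, b"  \n", "https://example.com/")
print(summary)
```

With `requests`, this could be called as `summarize_response(r.status_code, r.content, r.url)`. A non-200 status or an unexpected final URL would suggest the request was redirected or rejected before any job HTML was served.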

anilabhadatta commented 2 years ago

I opened a pull request that adds Jobs and People to CompanyScraper. If possible, please test it on a temporary LinkedIn account.