PaulMcInnis / JobFunnel

Scrape job websites into a single spreadsheet with no duplicates.
MIT License
1.78k stars 210 forks source link

upgraded requirements and setup for python 3.9 #146

Closed rambalachandran closed 2 years ago

rambalachandran commented 2 years ago

Pull Request Template

Description

Please include a summary of the change. Please also include relevant motivation and context. List any additional libraries that will be affected. List any developers that will be affected or those who you had merge conflicts with.

Context of change

Please add options that are relevant and mark any boxes that apply.

Type of change

Please mark any boxes that apply.

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

Checklist:

Please mark any boxes that have been completed.

thebigG commented 2 years ago

Hey @rambalachandran, thanks for the update!

I'm curious: is JobFunnel working on your end?

rambalachandran commented 2 years ago

@thebigG No not working at this time. I'm also getting the error mentioned in Issue #144. You were working on a fix. Is that complete? If not, can you please let me know if there is an old commit when it was working that I can revert to and upgrade the requirements and test.

PaulMcInnis commented 2 years ago

Unfortunately this project has been more or less killed by that issue, until someone with some interest can help us out. Captcha has effectively broken our cli tool.

thebigG commented 2 years ago

Just like @PaulMcInnis said: CAPTCHA and dynamically-loaded sites have unfortunately rendered this project useless. It is unfortunate, but the CAPTCHA is something we just simply cannot solve by just writing code. If they want to give us a CAPTCHA, they will give us CAPTCHA. There isn't much we can do about it.

I tried getting around it, but couldn't really figure anything out. If anyone is interested in what I was working on(which I'm not anymore), you can look here:https://github.com/thebigG/JobFunnel/tree/dynamic_scraping

If anyone wants to give it a go, go ahead by all means. But honestly I'm not sure if this issue can really be solved in code. Though I hope I'm wrong about that.

PaulMcInnis commented 2 years ago

Seems our pipeline isn't getting any runners... hmm.

PaulMcInnis commented 2 years ago

Closing this until we resolve #148