Closed troy-conte closed 8 months ago
glassdoor the problem is the cookies need to be generated on initialization of scraping. it's hardcoded. Indeed has no rate limiting, we have the api key. but it's good addition for linkedin and ziprecruiter in particular and could be used for all the modules. how do you envision the interface for the user?
Ok interesting. Ill dig into glassdoor and see if theres a way to do it another way. I'm getting blocked by the api on indeed, too many requests. Even after I've tried to change IP but there really is not point if they know exactly which key is causing the commotion lol. LinkedIn usually works and ziprecruiter I don't find helpful so might not be worth it for just linkedIn. Also it seems you randomly chose a variable rate limit within a frequency band so honestly that is already the best solution right now.
Yea I meant to say there is indeed rate limiting when doing the searches as I haven't transitioned the repo to use the API for the job search. But fetching the descriptions when we use the api key doesn't have rate limits.
If y'all don't care about the easy apply filter and the time range filter for indeed, we can switch to the api.
i personally don't use easy apply, I'm more interested in jobs people can't easily apply for. What are the limits to the API? If everyone is using the same key won't they block/limit search requests?
glassdoor the problem is the cookies need to be generated on initialization of scraping. it's hardcoded. Indeed has no rate limiting, we have the api key. but it's good addition for linkedin and ziprecruiter in particular and could be used for all the modules. how do you envision the interface for the user?
I think you mean ziprecruiter that was the only one that I found that had cookies hardcoded. Looking to generating new ones...
I believe the Indeed API key is the global public key used by everyone on the site, could be wrong here though, haven't checked in a minute.
I believe the Indeed API key is the global public key used by everyone on the site, could be wrong here though, haven't checked in a minute.
"on the site" meaning on indeed or jobspy?
I believe the Indeed API key is the global public key used by everyone on the site, could be wrong here though, haven't checked in a minute.
"on the site" meaning on indeed or jobspy?
Indeed
closing as I only see issues with LinkedIn and increasing the delay still results in being blocked.
Can you make a rate limiter adjustable on the request polling so we can adjust to attempt to avoid being blocked? I also don't see rate limiters on indeed and glassdoor?