cermak-petr / actor-zillow-api-scraper

Apify actor for extracting data about homes from Zillow.com using its internal API.
https://apify.com/petr_cermak/zillow-api-scraper
Apache License 2.0
46 stars, 40 forks

Can't scrape a list of ZPIDs. #14

Closed fmarcanob closed 3 years ago

fmarcanob commented 3 years ago

Hi there! After a couple hundred extractions, the log starts saying "Data extraction failed - zpid: xxxxxxx" repeatedly, and this continues until it has gone through the whole list. Is there anything that can be done? Thank you very much.

I have attached log file and screenshot.

r7Y43IxfnODwpu9q5.log Screenshot 2021-02-09 at 16 57 13

pocesar commented 3 years ago

@fmarcanob you can set your version to beta on the platform. It has ongoing changes and should work better.

fmarcanob commented 3 years ago

> @fmarcanob you can set your version to beta on the platform. It has ongoing changes and should work better.

Thank you for your answer. I wasn't able to do much with beta, because it doesn't allow custom proxies. Is it possible to change this? With the stable version, I was able to extract between 250 and 800 items per run, instead of the full 8k (using 250 USA proxies). In the end, every run ends up at that "Data extraction failed" error, and I have to stop and start again without the IDs that were already processed successfully.
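For the stop-and-restart step described above, here is a minimal sketch of how the retry list could be built. It assumes the log lines contain the message exactly as quoted in this issue ("Data extraction failed - zpid: ...") and that ZPIDs are purely numeric. The function names `failed_zpids` and `remaining_zpids` are hypothetical helpers, not part of the actor.

```python
import re
from typing import Iterable, List

# Assumed log format, based only on the message quoted in this issue.
FAILED_RE = re.compile(r"Data extraction failed - zpid: (\d+)")


def failed_zpids(log_text: str) -> List[str]:
    """Return ZPIDs reported as failed, in first-seen order, without duplicates."""
    return list(dict.fromkeys(FAILED_RE.findall(log_text)))


def remaining_zpids(all_zpids: Iterable, processed: Iterable) -> List[str]:
    """Return the ZPIDs still to be scraped, keeping the original order."""
    done = {str(z) for z in processed}
    # dict.fromkeys removes duplicates while preserving input order.
    unique = dict.fromkeys(str(z) for z in all_zpids)
    return [z for z in unique if z not in done]
```

The output of `remaining_zpids` can be used as the ZPID list for the next run, while `failed_zpids` gives a quick way to see which IDs a given log reported as failed.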

pocesar commented 3 years ago

Your proxies are wrong: you have a lot of tabs after the URLs. I couldn't find any run using beta. The task is also configured for a maximum of 200 items.

I've added a new task for you (called beta-zillow, check the tasks section) with changes to the input.
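Trailing tabs after proxy URLs, as mentioned above, usually come from pasting out of a spreadsheet. A minimal sketch of cleaning such a list before putting it into the input, assuming one HTTP(S) proxy URL per line; `clean_proxy_urls` is a hypothetical helper and the hosts in the usage example are placeholders:

```python
from typing import List
from urllib.parse import urlsplit


def clean_proxy_urls(raw: str) -> List[str]:
    """Strip stray whitespace from a pasted proxy list and reject malformed lines."""
    cleaned = []
    for line in raw.splitlines():
        url = line.strip()  # removes leading/trailing tabs, spaces, and \r
        if not url:
            continue  # skip blank lines
        parts = urlsplit(url)
        # Assumption: only http:// and https:// proxies are expected here.
        if parts.scheme not in ("http", "https") or not parts.hostname:
            raise ValueError(f"Not a usable proxy URL: {url!r}")
        cleaned.append(url)
    return cleaned


if __name__ == "__main__":
    pasted = "http://user:pass@proxy1.example.com:8000\t\t\nhttp://proxy2.example.com:8000\t\n"
    for proxy in clean_proxy_urls(pasted):
        print(repr(proxy))
```

Printing with `repr` makes any leftover whitespace visible, which is handy when checking a list by eye.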