Bunsly / HomeHarvest

Python package for scraping real estate property data
https://tryhomeharvest.com/
MIT License
301 stars, 66 forks

Zillow/Redfin removed? #54

Closed: cradcore closed this issue 6 months ago

cradcore commented 6 months ago

In the GitHub keywords, previous issues/pull requests, and the git history, I see mentions of both Zillow and Redfin scraping. However, at first glance it looks like the scrapers were removed in version 3.0 (commit 29664e4eee3db0ef978b68d28ffe45155142d7dc).

Are these sites no longer supported, or is there a different way to scrape them that I'm missing?

ZacharyHampton commented 6 months ago

We removed support for them. There isn't much purpose in aggregating real estate listings, other than for FSBO.

cradcore commented 6 months ago

@ZacharyHampton Isn't that the point of HomeHarvest? I was looking to use HomeHarvest to scrape the data from these sites and get rent trends in my area. While realtor.com does have some rentals, the vast majority are on Zillow.

ZacharyHampton commented 6 months ago

That was the previous purpose, yes. But we realized there isn't much point in aggregating the data, as it's mostly the same. Realtor.com is much more scalable for open source; doing the same with Zillow is a bit harder.

cradcore commented 5 months ago

If anyone is still looking for Zillow support, I have created a fork with Zillow scraping added back in: https://github.com/cradcore/HomeHarvest

sabatale commented 2 weeks ago

@cradcore It doesn't work. It returns HTTP 202.

cradcore commented 2 weeks ago

@sabatale

#!/usr/bin/env python3

from homeharvest import scrape_property

# Zillow support here comes from the fork linked above
data = scrape_property(
    location="90210",
    site_name="zillow",
    listing_type="for_rent",
)
print("Properties: {}".format(len(data)))

Output:

Properties: 321

It's working just fine for me, but you can open an issue on my GitHub with more details, including what you're trying to run and the logs, if you want.

sabatale commented 2 weeks ago

My bad, it looks like NordVPN IPs are blacklisted. That's why.

Cheers! (P.S. Issues on your repo are disabled.)
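
For anyone who hits the same 202, here is a minimal sketch of how a status code could be turned into a troubleshooting hint. The helper name `describe_status` is hypothetical and is not part of HomeHarvest or the fork. The 202 hint is based only on what was reported in this thread (a blocked VPN IP), so treat it as a starting point rather than a diagnosis.

```python
from http import HTTPStatus


def describe_status(status_code: int) -> str:
    """Return a short hint for an HTTP status code (hypothetical helper)."""
    if status_code == HTTPStatus.OK:
        return "ok"
    if status_code == HTTPStatus.ACCEPTED:
        # 202 means the request was accepted but no listing data came back.
        # In this thread that coincided with a blocked VPN IP address.
        return "202 Accepted: no data returned; try without a VPN or proxy"
    try:
        phrase = HTTPStatus(status_code).phrase
    except ValueError:
        # Not a status code the standard library knows about
        phrase = "Unknown status"
    return "{} {}".format(status_code, phrase)


if __name__ == "__main__":
    for code in (200, 202, 403):
        print(describe_status(code))
```

The function only looks at the status code, so it can be tried without any network access. How you obtain the status code from a failed scrape depends on the scraper's own error output.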