Bunsly / HomeHarvest

Python package for scraping real estate property data
https://tryhomeharvest.com/
MIT License
280 stars 60 forks source link

Apartment style type listings return NaN values #86

Open itsjacobhere opened 3 weeks ago

itsjacobhere commented 3 weeks ago

Nearly all returned features for listings with the Apartment style type return Nan for nearly all features including: list price, beds, full_baths, sqft, year_built, etc.. This doesn't occur for every Apartment style listing, some do return features with filled values but the majority are NaN.

see example of the proportion of NaN values for each style type: Screenshot 2024-06-21 225855

ZacharyHampton commented 2 weeks ago

Thanks for the issue.

I'll take a look and see if this is an issue on our end.

itsjacobhere commented 2 weeks ago

Yeah I figured it could be an issue with the data source, I was able to find the listings that returned Nan on realtor.com and all the data is there, so I'm not sure if it's an issue retrieving it or something else.

ZacharyHampton commented 2 weeks ago

Likely an issue with us then. Can you send a code example where you're getting many NaNs and a listing in the bunch that has the proper data on site?

itsjacobhere commented 2 weeks ago

Yes I can send a more detailed example when I get home Monday or Tuesday! If I recall it was happening with pretty simple queries with the "for_rent" type and the returned apartments were mostly Nan values. I was primarily looking at San Diego, CA but it was happening for other locations too. I was able to just Google the full address and find the listing on realtor.com with everything there