whchien / funda-scraper

FundaScaper scrapes data from Funda, the Dutch housing website. You can find listings from house-buying or rental market, and historical data. 🏡
GNU General Public License v3.0
113 stars 53 forks source link
houseprice python webscraping

FundaScraper 🏡

Project Status: Active – The project has reached a stable, usable state and is being actively developed. Build Status codecov Downloads PyPI version PEP8

FundaScaper provides the easiest way to perform web scraping on Funda, the Dutch housing website. You can find houses either for sale or for rent, and access historical data from the past few years.

Please note:

  1. Scraping this website is ONLY allowed for personal use (as per Funda's Terms and Conditions).
  2. Any commercial use of this Python package is prohibited. The author holds no liability for any misuse of the package.

Install

Install with pip:

pip install funda-scraper

Clone the repository:

git clone https://github.com/whchien/funda-scraper.git
cd funda-scraper
export PYTHONPATH=${PWD}
python funda_scraper/scrape.py --area amsterdam --want_to rent --page_start 1 --n_pages 3 --save

Quickstart

from funda_scraper import FundaScraper

scraper = FundaScraper(
    area="amsterdam", 
    want_to="rent", 
    find_past=False, 
    page_start=1, 
    n_pages=3, 
    min_price=500, 
    max_price=2000
)
df = scraper.run(raw_data=False, save=True, filepath="test.csv")
df.head()

image

Customizing Your Scraping

You can pass several arguments to FundaScraper() for customized scraping:

The scraped raw result contains following information:

To fetch the data without preprocessing, specify scraper.run(raw_data=True).

Note: Information regarding listing dates is no longer available since Q4 2023. Funda requires users to log in to see this information.

More information

Check the example notebook for further details. If you find this project helpful, please give it a star.