FundaScaper
provides the easiest way to perform web scraping on Funda, the Dutch housing website. You can find houses either for sale or for rent, and access historical data from the past few years.
Please note:
pip install funda-scraper
git clone https://github.com/whchien/funda-scraper.git
cd funda-scraper
export PYTHONPATH=${PWD}
python funda_scraper/scrape.py --area amsterdam --want_to rent --page_start 1 --n_pages 3 --save
from funda_scraper import FundaScraper
scraper = FundaScraper(
area="amsterdam",
want_to="rent",
find_past=False,
page_start=1,
n_pages=3,
min_price=500,
max_price=2000
)
df = scraper.run(raw_data=False, save=True, filepath="test.csv")
df.head()
if __name__ == "__main__":
before your script.You can pass several arguments to FundaScraper()
for customized scraping:
area
: Specify the city or specific area you want to look for, e.g. Amsterdam, Utrecht, Rotterdam, etc.want_to
: Choose either buy
or rent
to find houses either for sale or for rent.find_past
: Set to True
to find historical data; the default is False
.page_start
: Indicate which page to start scraping from; the default is 1
. n_pages
: Indicate how many pages to scrape; the default is 1
. min_price
: Indicate the lowest budget amount.max_price
: Indicate the highest budget amount.min_floor_area
: Indicate the minimum floor area.max_floor_area
: Indicate the maximum floor area.days_since:
: Specify the maximum number of days since the listing date.property_type
: Specify the desired property type(s).sort
: Specify sorting criteria.The scraped raw result contains following information:
To fetch the data without preprocessing, specify scraper.run(raw_data=True)
.
Note: Information regarding listing dates is no longer available since Q4 2023. Funda requires users to log in to see this information.
Check the example notebook for further details. If you find this project helpful, please give it a star.