Closed JuntingHe closed 3 years ago
Function 2 takes in the scraped tibble scarping from Function 1 and returns a cleaned tibble object containing information like listing url, price, number of bedroom, area in sqft, and city and ready for filtering.
In designing the function, I tried to use databases of the cities like world.cities or canada.cities to verify they is actually city information in the city
column but in fact, some cities like "Burnaby" are missing in those datasets. Hence, I decided to use city list instead of those database.
Function to clean web-scraped data with Pandas and Regex