Closed Fitzclutchington closed 3 days ago
Not blocked per se, but current issue is that using requests
to get the site data only returns the <head>
element. My trusty advisor (chatGPT) mentioned this may be due to the site being rendered with javascript. A solution there is to use seleneium. Will look into this next.
A few data issues, some weights/ capacities are empty when they shouldn't be, but we have enough data coverage to handle. I'm also sick of data scraping for the moment.
Scrape and process safes from this url: https://empiresafe.com/pre-owned/all/
Create a CSV that can uploaded to the DB
Post processing is mostly extracting the individual dimensions from the dimension string.