Need to test web scrapers to ensure that our scrapers aren't broken due to changes in the scraping sites design. Using metacritic as an example in the following.
If metacritic changes the carat sign for the next page, it will break our code. We need to be notified when this happens.
Proposed idea - Parse a few games from metacritic and store it as the desired result. Do automated testing to run current code and generate a new csv. Do a file diff comparison between the two. If the two files are the same, the code is running as expected.
This is an expensive slow process but it is important to know the health of our scrapers. Could also be exposed as an endpoint which notifies developers and consumers when a scraper is broken.
Need to test web scrapers to ensure that our scrapers aren't broken due to changes in the scraping sites design. Using metacritic as an example in the following.
If metacritic changes the carat sign for the next page, it will break our code. We need to be notified when this happens.
Proposed idea - Parse a few games from metacritic and store it as the desired result. Do automated testing to run current code and generate a new csv. Do a file diff comparison between the two. If the two files are the same, the code is running as expected.
This is an expensive slow process but it is important to know the health of our scrapers. Could also be exposed as an endpoint which notifies developers and consumers when a scraper is broken.