rephrased unittest instruction for new parser as it was sa bit unclear
added normalization for windows newlines \r\n
added encoding when saving html for testing. The previous implementation just saved the byte stream that requests fetched and then upon opening it was decoded as utf-8. This does not work in all cases and caused me serious headaches. It should not be assumed that byted object fetched by requests is always utf-8 encoded.
Hi,
Adding scraper for kuchynalidla.sk.
Also a few minor changes.
\r\n
html
for testing. The previous implementation just saved the byte stream that requests fetched and then upon opening it was decoded asutf-8
. This does not work in all cases and caused me serious headaches. It should not be assumed that byted object fetched by requests is alwaysutf-8
encoded.