I'm trying to crawl a page (like http://021-online.com) which has charset=gb2312.
When I try to read head/title field using cheerio (that's embedded) I got, let say, crap instead of proper chars.
Am I missing some configuration, or is it a bug that prevents from properly crawling non-utf8 pages?
It may also be a problem of cheerio rather than roboto itself.
Hi,
I'm trying to crawl a page (like http://021-online.com) which has charset=gb2312. When I try to read head/title field using cheerio (that's embedded) I got, let say, crap instead of proper chars.
Am I missing some configuration, or is it a bug that prevents from properly crawling non-utf8 pages? It may also be a problem of cheerio rather than roboto itself.