Closed RealXuChe closed 3 years ago
Yeah thanks.
For now, you can pass the HTML content via the `html` parameter.
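A minimal sketch of that workaround, assuming the page is downloaded with requests directly and the correct encoding is already known. `fetch_html` and `build_from_html` are hypothetical helper names, not part of AutoScraper; the only AutoScraper call used is `build` with its `html` and `wanted_list` arguments.

```python
def fetch_html(url, encoding, **request_args):
    """Download a page and decode it with an explicitly chosen encoding.

    Hypothetical helper for illustration; not part of AutoScraper.
    """
    import requests  # third-party, imported lazily

    resp = requests.get(url, **request_args)
    resp.raise_for_status()
    # Override whatever encoding requests inferred from the HTTP headers.
    resp.encoding = encoding
    return resp.text


def build_from_html(html, wanted_list):
    """Hand already-decoded HTML to AutoScraper instead of a URL."""
    from autoscraper import AutoScraper  # third-party, imported lazily

    scraper = AutoScraper()
    result = scraper.build(html=html, wanted_list=wanted_list)
    return scraper, result
```

Usage would look roughly like `html = fetch_html(url, "gb2312")` followed by `build_from_html(html, wanted_list=[...])`, where the `wanted_list` entries are sample strings copied from the correctly decoded page.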
Can you share the website?
Here's the website: https://www.wenku8.net/novel/2/2231/index.htm Also, I made a mistake: the encoding of this site is GB2312, not BIG5.
Should be fixed now. (v1.1.12)
I'm working with a legacy Chinese site that uses BIG5 text encoding, and I can't set the text encoding by passing arguments through `request_args`, because requests doesn't support it. So the results I get are garbled, like this: '¡¸ÔÚÕâ¸öÊÀ½ç¸æÖÕÒÔÇ°©¤©¤A¡¹-promise/result-'. The encoding can only be set by writing to the `encoding` property of the requests response object (according to this). So maybe adding an `encoding` param and setting the encoding in `_get_soup` in `auto_scraper.py` would be a good idea.
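To illustrate why the output is garbled and what an explicit `encoding` parameter would change, here is a small sketch using only the standard library. It assumes the page bytes were decoded as ISO-8859-1, which is what requests falls back to for text responses whose Content-Type header names no charset. `decode_html` is a hypothetical stand-in, not AutoScraper's `_get_soup`, and the sample text is a guess at the first few characters of the garbled string above.

```python
def decode_html(raw, encoding=None, fallback="iso-8859-1"):
    """Decode raw page bytes, preferring an explicit encoding when given.

    Hypothetical stand-in for the proposed `encoding` parameter.
    """
    return raw.decode(encoding or fallback, errors="replace")


# Bytes as a GB2312-encoded site would send them.
raw = "在这个世界".encode("gb2312")

# Without an explicit encoding the bytes are read as ISO-8859-1,
# which yields mojibake instead of Chinese text.
garbled = decode_html(raw)

# With the correct encoding the original text is recovered.
fixed = decode_html(raw, encoding="gb2312")
```

The design choice here is that an explicit value always wins over the fallback, so callers who know the site's encoding never depend on header-based detection.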