amsteel closed this issue 3 years ago
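from autoscraper import AutoScraper
import requests

scraper = AutoScraper()
url = 'https://www.ptwxz.com/html/11/11014/'
# Sample text to learn from: a chapter title on the page (roughly "Chapter 1: Imprisonment")
wanted_list = ['第一章 牢狱之灾']

# Fetch the page ourselves so the encoding can be corrected before parsing
r = requests.get(url)
# Use the encoding detected from the body rather than the header-derived default
r.encoding = r.apparent_encoding
# Hand the correctly decoded HTML to build() through the html argument
result = scraper.build(url, html=r.text, wanted_list=wanted_list)
print(result)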
Thank you. It works, so apparently it was just an encoding issue. I have just found a similar report among the closed issues. It may be worth adding a note about this to the docs/wiki.
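For anyone landing here later, a minimal offline sketch of why a wrong encoding makes the match fail. It assumes the page is served as GBK without a charset in its headers (not confirmed in this thread); in that case requests falls back to ISO-8859-1 for text responses, and the decoded HTML no longer contains the wanted string. The `raw` bytes here are a stand-in for the response body, not the real page.

```python
# Stand-in for the wanted text and the raw response body.
title = '第一章 牢狱之灾'
raw = title.encode('gbk')  # assumption: the site serves GBK-encoded bytes

# Decoding with the fallback charset yields mojibake, so nothing can match.
wrong = raw.decode('iso-8859-1')
# Decoding with the actual charset recovers the original text.
right = raw.decode('gbk')

print(title in wrong)
print(title in right)
```

Setting `r.encoding = r.apparent_encoding` before reading `r.text` is what switches the decoding from the first case to the second.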
Fixed in v1.1.12
build() returns an empty result for this URL:
https://www.ptwxz.com/html/11/11014/