Open pantasa opened 2 months ago
You have to rate limit in the initializer of the crawler script written for the source. e.g:
class sixnineshu(Crawler):
base_url = [
"https://69shuba.cx"
]
def initialize(self):
self.init_parser("html.parser")
self.init_executor(ratelimit=20)
Let us know
Novel URL: <your novel url or query> https://www.69shuba.pro/book/49986.htm App Location: PIP | EXE | Discord | Telegram pip|exe App Version: x.y.z newest
Describe this issue
domain is not supported eventhough I've modify the code but it's only successfully scrape the first 100 chapter after that got ConnectTimeout: HTTPSConnectionPool(host='www.69shuba.pro', port=443) i think it's got ban by IP
'Connection to www.69shuba.pro timed out. (connect timeout=7)')) ConnectTimeout: HTTPSConnectionPool(host='www.69shuba.pro', port=443): Max retries exceeded with url: /txt/43616/38188989 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x0000025E38E8DCD0>, 'Connection to www.69shuba.pro timed out. (connect timeout=7)')) ConnectTimeout: HTTPSConnectionPool(host='www.69shuba.pro', port=443): Max retries exceeded with url: /txt/43616/38188990 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x0000025E38EADE20>, 'Connection to www.69shuba.pro timed out. (connect timeout=7)')) ConnectTimeout: HTTPSConnectionPool(host='www.69shuba.pro', port=443): Max retries exceeded with url: /txt/43616/38193667 (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object a
t 0x0000025E38EAEFC0>, 'Connection to www.69shuba.pro timed out. (connect timeout=7)'))