Open anotatta opened 2 days ago
Thanks, great addon. I am trying out the cli version to crawl.
Win10 environment, running the single-file cli version 2.045 Command:
single-file https://www.wikipedia.org -browser-wait-until=load --browser-executable-path "C:\Program Files (x86)\Google\Chrome\Application\chrome.exe" --crawl-links=true --crawl-inner-links-only=false --crawl-external-links-max-depth=1 --crawl-rewrite-rule="^.*wikipedia.*$"
Error:
Unreachable URL: file:///Users/gildas/Desktop/Dev/project-single-file/single-file-cli/single-file-cli-api.js
URL: file:///Users/gildas/Desktop/Dev/project-single-file/single-file-cli/single-file-cli-api.js
Stack: Error: Unreachable URL: file:///Users/gildas/Desktop/Dev/project-single-file/single-file-cli/single-file-cli-api.js
at Connection.onFrameNavigated (file:///Users/gildas/Desktop/Dev/project-single-file/single-file-cli/lib/cdp-client.js:306:15)
at innerInvokeEventListeners (ext:deno_web/02_event.js:754:7)
at invokeEventListeners (ext:deno_web/02_event.js:801:5)
at dispatch (ext:deno_web/02_event.js:658:9)
at Connection.dispatchEvent (ext:deno_web/02_event.js:1043:12)
at Connection.#onMessage (https://jsr.io/@simple-cdp/simple-cdp/1.8.5/mod.js:205:18)
at WebSocket.<anonymous> (https://jsr.io/@simple-cdp/simple-cdp/1.8.5/mod.js:168:83)
at innerInvokeEventListeners (ext:deno_web/02_event.js:754:7)
at invokeEventListeners (ext:deno_web/02_event.js:801:5)
at dispatch (ext:deno_web/02_event.js:658:9)
Why is it referencing your file .../gildas/...?
I want to add that if I let the program run to the end, even with the error, it did produce the files for all the external links. I am not sure what the error is about.