Closed peterk closed 5 years ago
@peterk thanks for opening up the issue. Would you mind (and are able to) sharing the docker file used when you encountered this issue?
I am currently attempting to reproduce.
Thank's for looking in to it. Like this:
FROM zenika/alpine-chrome
USER root
RUN apk add git
RUN apk add --update make
RUN apk add --update g++
RUN apk add --update vim
RUN apk add --update bash
RUN git clone https://github.com/N0taN3rd/Squidwarc.git
WORKDIR Squidwarc
RUN npm install
USER chrome
ENTRYPOINT ["chromium-browser", "--headless", "--disable-gpu", "--disable-software-rasterizer", "--no-sandbox"]
(sorry, missed the last line in the first copy/paste)
Process:
docker build -t squidwarc:0.7 .
docker run squidwarc:0.7
(in other terminal - id whatever id it got when you started it)
docker exec -it 809f5544ba2d /bin/bash
(in container)
./run-crawler.sh -c conf.json
@peterk it appears that zenika/alpine-chrome
already launches chrome but does not launch chrome with --remote-debugging-port=9222
which is required for Squidwarc to operate properly.
My suggestion would to be, when using that image, to not use the default entry point rather simply run node index.js -c conf.json
.
Also you will need to modify the conf file to enable headless mode by adding
"headless": true
to it
I have also updated the README.md file to explain usage of Squidwarc@1.2.0
Great! Thank you!
Are you submitting a bug report or a feature request?
Bug report
What is the current behavior?
Tried setting up a docker image based on the zenika/alpine-chrome image. Copied over Squidwarc. Headless Chrome starts correctly but trying to run the included conf.json crawl script I get the following error:
What is the expected behavior?
The crawl script should execute correctly.
What's your environment?
Other information
Thank you for what looks like a great project!