Closed noghartt closed 9 months ago
@noghartt ,
I see you are using localhost
in your start_urls
. That doesn't work since you are running the scrapper inside a container. Inside the container, localhost
points to the containers internal network, and there's nothing running on port 3000
on that network. That's why you are getting connection refused
.
What you should do instead is use the special docker domain that points to the host network. On mac os, it is host.docker.internal
. So your start_urls
should be:
"start_urls": [
"http://host.docker.internal:3000"
],
But, since you are using a port, the crawling functionality might not work as expected. You might need to run your site server on port 80
on localhost. See #50 for details.
@noghartt ,
I see you are using
localhost
in yourstart_urls
. That doesn't work since you are running the scrapper inside a container. Inside the container,localhost
points to the containers internal network, and there's nothing running on port3000
on that network. That's why you are gettingconnection refused
.What you should do instead is use the special docker domain that points to the host network. On mac os, it is
host.docker.internal
. So yourstart_urls
should be:"start_urls": [ "http://host.docker.internal:3000" ],
Hey, @wanderanimrod! It's works!
I appreciate your help, thanks!
Description
I'm trying to run the Typesense DocSearch Scraper on a Docusaurus build locally (
http://localhost:3000
). But I'm facing an issue that seems related toscrapy
:Steps to reproduce
Run the command
Expected Behavior
Run command and start scrapping putting in the DB:
The env file is:
Actual Behavior
Facing this error message:
Metadata
Docusaaurus Scraper Config file
Typesense Version:
0.25.1
Typesense Scraper Version:
0.9.1
OS:
macOS Sonoma