When we execute the below command,
langstream docker run test -app examples/docker-chatbot -s ./secrets.yaml
we get the following error,
15:23:56.896 [crawler-webcrawler-source-1-runner-465eeb4a-f140-4b8d-b683-ee51ee76f401] INFO a.l.a.webcrawler.WebCrawlerSource -- The last cycle didn't produce any new documents
15:23:56.896 [crawler-webcrawler-source-1-runner-465eeb4a-f140-4b8d-b683-ee51ee76f401] INFO a.l.a.webcrawler.crawler.WebCrawler -- Crawling url: https://aws.amazon.com/about-aws/whats-new/2023/11
15:23:57.086 [crawler-webcrawler-source-1-runner-465eeb4a-f140-4b8d-b683-ee51ee76f401] WARN a.l.a.webcrawler.crawler.WebCrawler -- A redirection to a forbidden domain happened (from https://aws.amazon.com/about-aws/whats-new/2023/11 to /about-aws/whats-new/2023/11/)
Workaround
Appending a trailing slash (/) to the seed-urls and allowed-domains values fixed the error.
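As a rough illustration of the workaround, the crawler agent's configuration might look like the fragment below. The seed URL is the one from the log above with a slash appended. The agent name, the `type` value, and the `allowed-domains` value are assumptions, since the original configuration is not shown here.

```yaml
# Sketch only: name, type, and allowed-domains are assumed, not taken from the report.
pipeline:
  - name: "Crawl the website"
    type: "webcrawler-source"
    configuration:
      # Trailing slash added, so the crawler is not redirected from .../2023/11 to .../2023/11/
      seed-urls: ["https://aws.amazon.com/about-aws/whats-new/2023/11/"]
      # Trailing slash added here as well (hypothetical value)
      allowed-domains: ["https://aws.amazon.com/"]
```

With both values ending in a slash, the seed URL already matches the form the server redirects to, so the redirect that triggered the "forbidden domain" warning should no longer occur.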