jina-ai / reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
https://jina.ai/reader
Apache License 2.0

Error fetching webpage content via code but successful via browser #40

Open: hzaaron opened this issue 6 months ago

hzaaron commented 6 months ago

When I fetch webpage content via code, I consistently get an error, yet the same URL loads successfully in a browser. The error message is:

Error data from response: { data: null, path: 'url', code: 400, name: 'ParamValidationError', status: 40001, message: 'TypeError: Invalid URL', readableMessage: 'ParamValidationError(url): TypeError: Invalid URL' }

URL: https://www.trustpilot.com/review/cortexi.io

I've tried troubleshooting this but haven't been able to pinpoint the cause. Any insights or suggestions on how to resolve it would be greatly appreciated. Thank you!

Manamama commented 6 months ago

FYI, it works for me:

...
URL Source: https://www.trustpilot.com/review/cortexi.io
Markdown Content:
Write a review
Image 1: a row of white stars on a black background
5-star

Etc
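Since the same URL works for one person and returns `ParamValidationError(url): TypeError: Invalid URL` for another, one thing that may be worth ruling out is how the request URL is assembled in code. The thread does not confirm the cause, so the following is only a minimal sketch under assumptions: it uses Python's standard library, and `build_reader_url` and `fetch_markdown` are hypothetical helper names, not part of Reader. It trims stray whitespace, checks that the target is an absolute http(s) URL, and only then prepends the `https://r.jina.ai/` prefix.

```python
from urllib.parse import urlsplit
from urllib.request import urlopen

# Prefix as given in the project description.
READER_PREFIX = "https://r.jina.ai/"


def build_reader_url(target: str) -> str:
    """Hypothetical helper: validate the target URL, then prepend the Reader prefix."""
    cleaned = target.strip()  # drop stray spaces or newlines around the URL
    parts = urlsplit(cleaned)
    # Require an absolute http(s) URL with a host.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not an absolute http(s) URL: {target!r}")
    return READER_PREFIX + cleaned


def fetch_markdown(target: str, timeout: float = 30.0) -> str:
    """Hypothetical helper: fetch the Reader output for the target URL as text."""
    with urlopen(build_reader_url(target), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


# Example call (performs a network request):
# print(fetch_markdown("https://www.trustpilot.com/review/cortexi.io")[:500])
```

If `build_reader_url` raises for the value your code passes in, the problem is likely on the calling side (for example a missing scheme or an empty variable) rather than with the target site.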