jina-ai / reader

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
https://jina.ai/reader
Apache License 2.0
7.05k stars 555 forks source link

Jina Reader doesn't work for a website #1099

Closed laloneo closed 5 days ago

laloneo commented 6 days ago

Hello I have tried every possible setting but can't get content of

https://r.jina.ai/https://artbat.komi.io/

Website: https://artbat.komi.io/

Any help would be much appreciated

Andreas

nomagick commented 5 days ago

For some reason, Reader's internal browser cannot successfully load this website. It appears to be a bug in the implementation of this website. However, it could also be some unknown bot prevention mechanism which deliberately causes this error to block access to bots like Reader.

If you can get the HTML content of this website using other tools, you can submit it directly to Reader with a POST request with url and html set in HTTP body. Providing html stops Reader from loading the URL with the internal browser.