Closed by sherlcok314159 3 weeks ago
You can try https://github.com/lwthiker/curl-impersonate/, which sometimes helps. Otherwise you will need a more sophisticated system.
Thanks. But how can I combine this with FreshRSS?
A typical way is to use a system such as RSS Bridge, which outputs an RSS feed that FreshRSS can consume. But the first step is to find an approach that works manually.
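To illustrate the "outputs an RSS feed" part: once you can fetch the content manually, any script that writes a valid RSS 2.0 document to a URL or file that FreshRSS can reach will do. Below is a minimal sketch using only the Python standard library. The function name `build_rss` and the item dictionary keys are hypothetical, chosen for this example; this is not how RSS Bridge itself is implemented.

```python
import xml.etree.ElementTree as ET
from email.utils import format_datetime


def build_rss(title, link, description, items):
    """Return an RSS 2.0 document as a string.

    Each item is a dict with 'title' and 'link', and optionally
    'description' and 'published' (a timezone-aware datetime).
    """
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description

    for entry in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = entry["title"]
        ET.SubElement(item, "link").text = entry["link"]
        # Use the link as the unique identifier so readers can deduplicate.
        ET.SubElement(item, "guid").text = entry["link"]
        if entry.get("description"):
            ET.SubElement(item, "description").text = entry["description"]
        if entry.get("published"):
            # RSS expects RFC 822 style dates.
            ET.SubElement(item, "pubDate").text = format_datetime(entry["published"])

    # Save the result as UTF-8 so it matches the declaration.
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + ET.tostring(rss, encoding="unicode")
```

Write the returned string to a file served by any web server (or a path FreshRSS can read over HTTP), then subscribe to that URL in FreshRSS like any other feed.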
Some sites cannot be scraped without JavaScript
Try the feedless tool. It can help in some cases.
Thanks for the replies above. My solution is to run a local headless browser, driven from Python, to handle this. It is quite lightweight.
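For anyone landing here later, a headless-browser approach could look roughly like the sketch below. It assumes Playwright for Python (`pip install playwright`, then `playwright install chromium`); the reply above does not say which library was used, so treat that choice as an assumption. The helper names `render_page`, `LinkCollector`, and `extract_links` are hypothetical, and the link extraction is deliberately naive, so you would adapt it to the markup of the site you scrape.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


def render_page(url, timeout_ms=30000):
    """Load url in headless Chromium and return the HTML after scripts have run."""
    # Imported lazily so the parsing helpers below work without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            # "networkidle" waits until the page stops making network requests.
            page.goto(url, wait_until="networkidle", timeout=timeout_ms)
            return page.content()
        finally:
            browser.close()


class LinkCollector(HTMLParser):
    """Collect (text, absolute URL) pairs for every <a href> in a document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def extract_links(html, base_url):
    """Return a list of (link text, absolute URL) tuples found in html."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

A typical run would be `extract_links(render_page(url), url)`, with the result turned into feed items and written out as RSS on a schedule (cron, for example) so FreshRSS can poll it.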
Some sites cannot be scraped without JavaScript. I tried different user agents, such as curl/8.21, and all of them failed.
Site: https://rsshub.app/zhubai/posts/havefun