Closed pschupp closed 7 months ago
Did you try to change the used user agent in tour feed settings ?
I did try that.
On Thu, Jul 21, 2022 at 10:32:54PM -0700, Romain de Laage wrote:
Did you try to change the used user agent in tour feed settings ? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: <miniflux/v2/issues/1491/ 1192199412 ZjQcmQRYFpfptBannerStart This Message Is From an External Sender This message came from outside your organization.
ZjQcmQRYFpfptBannerEnd
Did you try to change the used user agent in tour feed settings ?
— Reply to this email directly, [1]view it on GitHub, or [2]unsubscribe. You are receiving this because you authored the thread.*Message ID: <miniflux/ @.***>
References:
[1] https://urldefense.com/v3/__https://github.com/miniflux/v2/issues/1491*issuecomment-1192199412__;Iw!!LQC6Cpwp!vgnfqjT25X_CWEdz1WGw6DP9iBstvYB-qhJr6MXuWoqEXaK6dzzDaqVo7G2BG9ivfPWsWEPYvlydo2x6V95wo8fX3D6jeEo$ [2] https://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/AI5X6LKOYHFZTLDEVFDTYZDVVIXANANCNFSM5254CD6A__;!!LQC6Cpwp!vgnfqjT25X_CWEdz1WGw6DP9iBstvYB-qhJr6MXuWoqEXaK6dzzDaqVo7G2BG9ivfPWsWEPYvlydo2x6V95wo8fXrGg3Wlc$
I just tried to change the user agent with curl with the one in your example, it didn't work. But it worked with wget. I use HTTP 1.1 and the same headers in both... I don't understand
Anyway, thanks for trying. The following works for me:
curl 7.74.0 (x86_64-pc-linux-gnu) libcurl/7.74.0 OpenSSL/1.1.1n zlib/1.2.11 brotli/1.0.9 libidn2/2.3.0 libpsl/0.21.0 (+libidn2/2.3.0) libssh2/1.9.0 nghttp2/1.43.0 librtmp/2.3
curl --user-agent 'Mozilla/5.0 (X11; Linux x86_64; rv:95.0) Gecko/20100101 Firefox/95.0)' 'https://www.fiercebiotech.com/rss/biotech/xml'
GNU Wget 1.21 built on linux-gnu. (-cares +digest -gpgme +https +ipv6 +iri +large-file -metalink +nls +ntlm +opie +psl +ssl/gnutls)
wget --user-agent 'Mozilla/5.0 (X11; Linux x86_64; rv:95.0) Gecko/20100101 Firefox/95.0)' 'https://www.fiercebiotech.com/rss/biotech/xml'
Either without the useragent returns the the captcha for curl or gives a 403 for wget. Sorry I can't be more helpful, but I'm afraid this stuff is out of my depth!
I did try that.
This feed (https://www.fiercebiotech.com/rss/biotech/xml
) works for me if you disable HTTP/2 to avoid fingerprinting. Requires Miniflux >= 2.0.1.
The RSS feed in question is: (https://www.fiercebiotech.com/rss/biotech/xml)
I'm well aware this issue has been disussed in various other posts with solutions being:
I have tried all three solutions with Miniflux and they have not changed the outcome. I have tried pproxy and tinyproxy on my local host as proxies.
Using curl, the default UserAgent returns the captcha, but using (Mozilla/5.0 (X11; Linux x86_64; rv:95.0) Gecko/20100101 Firefox/95.0) returns the feed properly.
My questions are: