eafer / rdrview

Firefox Reader View as a command line tool
Apache License 2.0
844 stars 36 forks source link

fw.com urls fail #40

Open tvraman opened 3 months ago

tvraman commented 3 months ago

as an example: https://www.fool.com/investing/2024/07/03/robinhood-golds-red-flag/?source=iedfolrf0000001 rdrview says coult not fetch web page. Used to work until about a week ago.

eafer commented 2 months ago

That's weird, that page is working fine for me. What's the exact command that you used? Have you seen this happen multiple times? Have you tried fetching the page with curl directly and piping it to rdrview?

tvraman commented 2 months ago

Ernesto Fernández @.***> writes:

see below:

19:20:29 rdrview $ gh issue list

Showing 8 of 8 open issues in eafer/rdrview

ID TITLE LABELS UPDATED

40 fw.com urls fail about 4 hours ago

12 Convert to text about 27 days ago

19:20:34 rdrview $ gh issue view 40 fw.com urls fail eafer/rdrview#40 Open • tvraman opened about 2 days ago • 1 comment

as an example: coult not fetch web page. Used to work until about a week ago.

eafer (Owner) • 4h • Newest comment

That's weird, that page is working fine for me. What's the exact command that you used? Have you seen this happen
multiple times? Have you tried fetching the page with curl directly and piping it to rdrview?

View this issue on GitHub: https://github.com/eafer/rdrview/issues/40 19:20:58 rdrview $ curl 'https://www.fool.com/investing/2024/07/03/robinhood-golds-red-flag/?source=iedfolrf0000001' | wc % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 100 131k 0 131k 0 0 23718 0 --:--:-- 0:00:05 --:--:-- 30497

curl fetched it happily.

2209 8825 134526

19:24:47 ~ $ rdrview -T "title,sitename,body" -H 'https://www.fool.com/investing/2024/07/03/robinhood-golds-red-flag/?source=iedfolrf0000001 rdrview says' rdrview: couldn't fetch the webpage# rdrview failed

eafer commented 2 months ago

19:24:47 ~ $ rdrview -T "title,sitename,body" -H 'https://www.fool.com/investing/2024/07/03/robinhood-golds-red-flag/?source=iedfolrf0000001 rdrview says' rdrview: couldn't fetch the webpage# rdrview failed

There is a weird "rdrview says" inside the url, that's why the fetch fails.

tvraman commented 2 months ago

that is weird, still fails without that cruft:

rdrview -T "title,sitename,body" -H 'https://www.fool.com/investing/2024/07/03/robinhood-golds-red-flag/?source=iedfolrf0000001'

rdrview: couldn't fetch the webpage --

Thanks,

--Raman(I Search, I Find, I Misplace, I Research) ♉ Id: kg:/m/0285kf1 🦮 LOC: https://id.loc.gov/authorities/names/n97059241.html