mozilla / readability

A standalone version of the readability lib

`isProbablyReaderable` returns true on an unreadable page #887

Open tianzhich opened 3 months ago

tianzhich commented 3 months ago

I found that when a page contains multiple `article` tags, `isProbablyReaderable` gives the wrong result. Should it check only the first `article` tag?
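To illustrate what I suspect is happening, here is a rough model of the scoring loop. It is a sketch, not the library source: `sketchIsProbablyReaderable` is a hypothetical name, it takes plain text lengths instead of DOM nodes, and the defaults (`minContentLength` of 140, `minScore` of 20) are my understanding of the library defaults. The real function also applies visibility and class-name filters, which this sketch leaves out.

```javascript
// Simplified model of the scoring loop (assumption: not the real implementation).
// textLengths stands in for the trimmed text length of each matched node
// (e.g. each <article> teaser on a front page).
function sketchIsProbablyReaderable(
  textLengths,
  { minContentLength = 140, minScore = 20 } = {} // assumed defaults
) {
  let score = 0;
  for (const length of textLengths) {
    // Nodes with too little text contribute nothing.
    if (length < minContentLength) continue;
    // Every long-enough node adds to one shared running score.
    score += Math.sqrt(length - minContentLength);
    if (score > minScore) return true;
  }
  return false;
}

// One 300-character teaser: sqrt(160) is about 12.6, below the threshold.
console.log(sketchIsProbablyReaderable([300])); // false

// Two such teasers: about 25.3 in total, above the threshold.
console.log(sketchIsProbablyReaderable([300, 300])); // true
```

If the score really accumulates across nodes like this, a front page with several medium-length `article` teasers can pass the check even though no single block looks like a full article.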

For example, on https://thebaffler.com/, `isProbablyReaderable` returns true, yet when I open the page in Firefox, the address bar does not show the reader icon. I'm not sure whether Firefox's check is based on `isProbablyReaderable`.
