mozilla / readability

A standalone version of the readability lib
Other
8.95k stars 606 forks source link

https://www.cnbc.com articles show cookies information #722

Open LunarisDream42 opened 2 years ago

LunarisDream42 commented 2 years ago

Similar to #623 but different website. Issue affects all articles.

Example: https://www.cnbc.com/2017/04/07/you-dont-need-to-upgrade-your-smartphone.html image image

tthhtao commented 2 years ago

Hi, this is Jintao. I would like to work on this bug.

JohnCido commented 1 year ago

Any updates on this? I just encountered this problem.

ivanlabsii commented 3 months ago

I could reproduce this. I've tried to workaround the issue by saving the html file from the Safari, loading the HTML file and executing the script. This has partially resolved the problem and there is no cookie notice and whole the content is grabbed, but with tons of garbage text added both at the start in the middle and at the end. I am not sure if this should be treated as this issue, new issue, or no issue at all.