postlight / parser

📜 Extract meaningful content from the chaos of a web page
https://reader.postlight.com
Apache License 2.0

Not able to extract full content from wired.com #590

Open mrgodhani opened 3 years ago

mrgodhani commented 3 years ago

Expected Behavior

The article at https://www.wired.com/story/what-comes-after-the-international-space-station should be parsed successfully, as should any other wired.com article.

Current Behavior

Full content is not extracted from any current wired.com article.

Steps to Reproduce

  1. Parse the wired.com link https://www.wired.com/story/what-comes-after-the-international-space-station
  2. Notice that the returned content is blank and the full article is not extracted.
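
The steps above could be scripted roughly as follows. This is only a sketch, not part of the parser: `isBlankContent` and `checkExtraction` are hypothetical helper names, and the result is assumed to be an object with a `content` field holding the extracted HTML. The parse function is passed in as an argument so the check does not depend on any particular install.

```javascript
// Hypothetical helper, not part of the parser: returns true when the
// extracted HTML has no visible text once tags and whitespace are stripped.
function isBlankContent(html) {
  if (typeof html !== 'string') return true;
  const text = html
    .replace(/<[^>]*>/g, '')   // drop tags
    .replace(/&nbsp;/g, ' ')   // treat non-breaking spaces as whitespace
    .trim();
  return text.length === 0;
}

// Hypothetical check: `parse` is any function that takes a URL and resolves
// to an object with a `content` field (an assumed result shape).
async function checkExtraction(parse, url) {
  const result = await parse(url);
  const content = result ? result.content : null;
  return { url, blank: isBlankContent(content) };
}
```

With the library installed, `parse` could presumably be a thin wrapper such as `(url) => Parser.parse(url)`, where `Parser` is imported from `@postlight/parser`. A `blank: true` result for the wired.com link would match the behavior described here.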

amirsol81 commented 3 years ago

I can duplicate this with almost all Wired articles I've tried.