postlight / parser

📜 Extract meaningful content from the chaos of a web page
https://reader.postlight.com
Apache License 2.0
5.35k stars, 436 forks

feat: remove obsolete custom extractors #712

Closed by sdoire 1 year ago

sdoire commented 1 year ago

This PR removes three custom extractors that are no longer needed, because the generic extractor now works on these sites' new page layouts. Extractors removed for:

- NJ.com
- CinemaBlend
- HowToGeek

Note: The diff for dist/mercury.js also contains some additional changes. These come from recent changes to the source after which the dist file had not yet been rebuilt.

The video below shows the results of using the generic extractor on those three sites.

https://user-images.githubusercontent.com/10479709/201155516-051e9ad1-02c0-4d9b-95ee-0352fac4dda9.mov
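For anyone who wants to repeat this kind of check without watching the video, here is a minimal sketch of one way to do it. It is not part of this PR. The helper name `checkGenericExtraction`, the `REQUIRED_FIELDS` list, and the stub parser are all hypothetical, and the choice of `title` and `content` as the fields that must be present is an assumption about what counts as a good enough result.

```javascript
// Hypothetical helper (not in the parser repo): runs a parse function over a
// list of URLs and reports which results are missing the fields we care about.
// ASSUMPTION: a result is "good enough" when it has non-empty title and content.
const REQUIRED_FIELDS = ['title', 'content'];

async function checkGenericExtraction(parse, urls) {
  const report = [];
  for (const url of urls) {
    const result = await parse(url);
    // A field counts as missing when the result is absent or the field is falsy.
    const missing = REQUIRED_FIELDS.filter(
      (field) => !result || !result[field]
    );
    report.push({ url, ok: missing.length === 0, missing });
  }
  return report;
}

// Demo with a stub parser so the sketch runs offline. The URLs are placeholders.
const stubParse = async (url) => ({
  title: `Title for ${url}`,
  content: '<p>Article body</p>',
});

checkGenericExtraction(stubParse, [
  'https://example.com/article-1',
  'https://example.com/article-2',
]).then((report) => console.log(report));
```

To run it against real pages, the stub could be swapped for the library's own entry point, for example `const Parser = require('@postlight/parser')` and then passing `(url) => Parser.parse(url)` as the `parse` argument. Taking the parse function as a parameter keeps the check independent of the network, so it can be tested with a stub.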