postlight / parser

📜 Extract meaningful content from the chaos of a web page
https://reader.postlight.com
Apache License 2.0
5.37k stars, 443 forks

custom parser for cbc.ca #699

Closed (zhemaituk closed this 1 year ago)

zhemaituk commented 1 year ago

Adds a custom parser for cbc.ca. Specifically, it improves parsing of publishedDate (which was not extracted out of the box), author across different page layouts, dek, and title.
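
For readers unfamiliar with how site-specific parsers are declared in this repository, here is a rough sketch of the general shape such an extractor takes. It is not the code from this PR: every CSS selector below is a hypothetical placeholder, the variable name `WwwCbcCaExtractor` only follows the naming pattern used by other extractors, and the `date_published` key is assumed to be the extractor field that corresponds to the publishedDate mentioned above.

```javascript
// Illustrative sketch only: the selectors are hypothetical placeholders,
// not the ones added in this PR.
const WwwCbcCaExtractor = {
  // Hostname the extractor applies to.
  domain: 'www.cbc.ca',

  // Each field lists candidate selectors, tried in order.
  title: {
    selectors: ['h1'],
  },

  // Several selectors cover different page layouts (hypothetical class names).
  author: {
    selectors: ['.authorText', '.bylineDetails'],
  },

  // A [selector, attribute] pair reads an attribute instead of the node text.
  date_published: {
    selectors: [['meta[name="date"]', 'value']],
  },

  dek: {
    selectors: ['.deck'],
  },

  content: {
    selectors: ['.story'],
    // Selectors for elements to strip from the extracted content.
    clean: [],
  },
};
```

Listing more than one selector per field is how a single extractor can handle several layouts, which matches the "author for different layouts" improvement described above. In the repository itself an object like this would normally be exported from its own module under the custom extractors directory.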