postlight / parser

📜 Extract meaningful content from the chaos of a web page
https://reader.postlight.com
Apache License 2.0
5.4k stars 442 forks source link

Mercury Reader and API issues with videogamer.com #618

Open Organizer21 opened 3 years ago

Organizer21 commented 3 years ago

Platform

Expected Behavior

I was expecting a cleaned up version of the articles

Current Behavior

I'm only seeing the headline and no text + unrelated articles from the left side-column

Steps to Reproduce

Parse or use with this e.g. article https://www.videogamer.com/news/star-wars-squadrons--virtua-fighter-5-ultimate-showdown-among-playstation-plus-games-for-june

Detailed Description

First time checking in here and fairly new to Mercury. Not sure we're supposed to report things like this for the core engine to get improved and/or if this means someone needs to create a custom extractor for this (likely too tech for me).

Possible Solution

Updates to the core or a custom extractor I'd presume.