A known faiilure mode of the story extraction for some news sitees (currently excluded form config.json) is to extract the story twice i.e. each story parargraph is heard twice. Fix would lead to wider range of usable sites, further investigataion is needed.
A known faiilure mode of the story extraction for some news sitees (currently excluded form config.json) is to extract the story twice i.e. each story parargraph is heard twice. Fix would lead to wider range of usable sites, further investigataion is needed.
Example sites to be provided in comments.