mozilla / readability

A standalone version of the readability lib
Other
8.9k stars 604 forks source link

The pages from cnet.com are partially missing byline information in Reader View #161

Open avaida opened 9 years ago

avaida commented 9 years ago

Affected:

Sample pages:

Screenshot: screenshot from 2015-04-17 15 07 11

leplatrem commented 9 years ago

This is very similar to #153 and #45

There are several author tags in this page, and once we find one, we ignore the next ones. https://github.com/mozilla/readability/blob/master/Readability.js#L537

<span class="authors">
<span class="author">By: <a href="/profiles/dcarnoy/" rel="author"><span>David Carnoy</span></a></span>
<span class="author"><span class="separator">,</span>
<a href="/profiles/theaudiophiliac/" rel="author"><span>Steve Guttenberg</span></a></span>
</span>