xissy / chrome-three-sentences

A Google Chrome extension that summarizes a TL;DR webpage in just three sentences.
10 stars, 1 fork

Isolated case - summarizes bio instead of the article #1

Open lukeseo opened 11 years ago

lukeseo commented 11 years ago

Like the extension!

Try this page, it only summarizes the author's bio.

http://techcrunch.com/2013/09/18/yahoo-updates-its-flagship-iphone-app-with-cinemagraphs-read-later-feature-more-news-and-tumblr/

xissy commented 11 years ago

@lukeseo thanks for your feedback!

I confirmed the issue. To summarize a web page, the program first has to extract the article body. 3-Sentences uses Boilerpipe as its article body extractor. Unfortunately, on TechCrunch pages Boilerpipe gets confused about where the article section is, and it returns the author's bio section instead of the article section.

I guess a lot of people want to summarize TechCrunch articles, like you and me. So I'll add some special handling for TechCrunch in the next version of either the 3-Sentences Chrome extension or the server side.

Furthermore, because similar special cases are likely to come up on other popular websites, I'm going to devise a more elegant solution that can identify the sites needing special handling and route them accordingly.
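
To make the idea concrete, here is a minimal sketch of what such a per-site filter might look like. It is not the actual 3-Sentences code. The `SITE_RULES` table, the marker strings, and the function names are all hypothetical, and `generic_extract` is only a stand-in for the Boilerpipe call. The point is just the shape: look up the host, apply a site-specific rule if one exists, and otherwise fall back to the generic extractor.

```python
from urllib.parse import urlparse

# Hypothetical per-site rules: hostname -> markers that delimit the article body.
# The marker strings are illustrative only, not TechCrunch's real markup.
SITE_RULES = {
    "techcrunch.com": {"start": '<div class="article-entry">', "end": "</div>"},
}


def generic_extract(html: str) -> str:
    """Stand-in for the generic extractor (Boilerpipe in 3-Sentences)."""
    return html


def rule_for(url: str):
    """Return the special-case rule for this URL's host, or None."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return SITE_RULES.get(host)


def extract_article(url: str, html: str) -> str:
    """Use a site-specific rule when one matches, else the generic extractor."""
    rule = rule_for(url)
    if rule is not None:
        start = html.find(rule["start"])
        if start != -1:
            start += len(rule["start"])
            end = html.find(rule["end"], start)
            if end != -1:
                return html[start:end].strip()
    # No rule, or the markers were not found: fall back to the generic path.
    return generic_extract(html)
```

Keeping the rules in a plain table means a new problem site could be handled by adding one entry, and any page whose markup changes simply falls through to the generic extractor.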