MojoJolo / textteaser

TextTeaser is an automatic summarization algorithm.
MIT License
1.97k stars 251 forks source link

Different results running locally vs. textteaser.com #23

Closed jpreed00 closed 10 years ago

jpreed00 commented 10 years ago

Hey all, I saw this software running on reddit.com as a summarizing bot and I thought I would give it a try. I've got it running locally (via SimpleREPL), though the summary I'm getting is different than the one I get when I use the textteaser.com website.

The only "options" I've run across that seem to make any difference are the Summarizer private variables _summarySize and _keywordsSize. What settings/setup do I need in order to match the output from textteaser.com? Or is it just a matter of the website having a larger/different corpus? If so, where can I find/download a copy of the corpus?

MojoJolo commented 10 years ago

TextTeaser.com is running continuously and all the articles it summarized are saved in the database. Previous articles are used to determine the keywords and might affect the result.