karlhigley / lexrank-summarizer

A Spark-based LexRank extractive summarizer for text documents
MIT License
19 stars 4 forks source link

Use accumulators to quantify boilerplate removal #44

Open karlhigley opened 8 years ago

karlhigley commented 8 years ago

It would be nice to know how many or what proportion of sentences are removed by the boilerplate filtering. Spark's accumulator variables would provide a good way to track that.