Open karlhigley opened 8 years ago
It would be nice to know how many or what proportion of sentences are removed by the boilerplate filtering. Spark's accumulator variables would provide a good way to track that.
It would be nice to know how many or what proportion of sentences are removed by the boilerplate filtering. Spark's accumulator variables would provide a good way to track that.