miso-belica / sumy

Module for automatic summarization of text documents and HTML pages.
https://miso-belica.github.io/sumy/
Apache License 2.0

Is sumy utilising any corpus? #33

Closed: houmie closed this issue 9 years ago

houmie commented 9 years ago

I was wondering whether a corpus, e.g. the Brown corpus, is used to achieve better LSA results. Or are the corpora used only for tagging?

miso-belica commented 9 years ago

Hi, unfortunately I don't understand you. I don't think any corpus would bring better results to the LSA summarization method. The method is not trained on any corpus. The implemented LSA is an unsupervised summarization method, and currently there is no way to train it on a corpus.
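To illustrate what "unsupervised" means here, below is a minimal sketch of LSA-style sentence ranking that uses nothing but the document being summarized. It is not sumy's implementation: the function names (`split_sentences`, `term_sentence_matrix`, `dominant_right_singular_vector`, `lsa_rank`) are hypothetical, the sentence splitter is naive, and as a simplification it scores sentences by the dominant singular vector only (found by power iteration) rather than a full SVD.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter (assumption): break after '.', '!' or '?' plus whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def term_sentence_matrix(sentences):
    # Rows are terms, columns are sentences. The vocabulary comes only from
    # the input document; no external corpus is consulted.
    counts = [Counter(re.findall(r"[a-z]+", s.lower())) for s in sentences]
    vocab = sorted(set().union(*counts))
    return [[c[term] for c in counts] for term in vocab]


def dominant_right_singular_vector(matrix, iterations=100):
    # Power iteration on the Gram matrix A^T A (sentence by sentence).
    # Its dominant eigenvector is the first right singular vector of A.
    n = len(matrix[0])
    gram = [[sum(row[i] * row[j] for row in matrix) for j in range(n)]
            for i in range(n)]
    v = [1.0 / math.sqrt(n)] * n
    for _ in range(iterations):
        w = [sum(gram[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:
            break
        v = [x / norm for x in w]
    return v


def lsa_rank(text, count):
    # Return the `count` sentences with the largest weight in the dominant
    # latent topic, kept in their original document order.
    sentences = split_sentences(text)
    if not sentences:
        return []
    matrix = term_sentence_matrix(sentences)
    if not matrix:
        return sentences[:count]
    weights = dominant_right_singular_vector(matrix)
    ranked = sorted(range(len(sentences)), key=lambda i: weights[i], reverse=True)
    return [sentences[i] for i in sorted(ranked[:count])]


if __name__ == "__main__":
    doc = "Cats chase mice. Cats chase birds. Cats chase mice and birds. Stocks fell."
    for sentence in lsa_rank(doc, 2):
        print(sentence)
```

Everything the ranking needs is derived from the input text itself, so there is no training step where the Brown corpus or any other corpus could be plugged in.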