cadmiumcr / cadmium

Natural Language Processing (NLP) library for Crystal
https://cadmiumcr.com
MIT License
205 stars 15 forks source link

Adds Luhn summarizer and abstract summarizer class. Fixes #18 #20

Closed rmarronnier closed 5 years ago

rmarronnier commented 5 years ago

Performance is not great. Can't tell if this comes from the algorithm itself or my implementation.

watzon commented 5 years ago

How is the performance looking after using WordTokenizer?

rmarronnier commented 5 years ago

40 % faster :-) I think I'm done for now for this one. I'll add another summarizer in another PR and then starts investigating how to use / reuse the tfidf class. It's not clear for me yet.

watzon commented 5 years ago

Awesome. Well the specs are passing so I'll go ahead and merge. Thanks for your hard work!