xkxx / algodb

Mad (╯°□°)╯'ing
MIT License
11 stars 3 forks source link

Problem with keyword extraction on markdown #13

Closed xkxx closed 8 years ago

xkxx commented 8 years ago

RAKE does not behave well on markdown files. Suspected reasons:

Proposed solutions:

grant commented 8 years ago

I'm using a markdown to html package, could you use that? On Sun, Dec 13, 2015 at 22:49 xkxx notifications@github.com wrote:

RAKE does not behave well on markdown files. Suspected reasons:

  • Too many special characters (like this)
  • Line breaks do not represent end of lines in markdown. Rake is upset about it

Proposed solutions:

  • convert markdown into plaintext (hard)
  • use special characters as stopwords

— Reply to this email directly or view it on GitHub https://github.com/xkxx/throwtable/issues/13.