keredson / wordninja

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
MIT License

Split on hyphen (-) #29

Open peamjo opened 1 year ago

peamjo commented 1 year ago

Is it possible to keep hyphenated words from being split? For example, "Post-Punk" is split into "Post" and "Punk" even though I have already added "Post-Punk" to my custom dictionary. Thank you.

404Huang commented 4 months ago

Same problem here.
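
One possible workaround, as a rough sketch rather than anything the library provides: pull the hyphenated terms you want to keep out of the text first, and pass only the remaining pieces to the splitter. This assumes the hyphen is dropped before the word list is ever consulted, which would explain why adding "Post-Punk" to a custom dictionary has no effect; I have not confirmed that against the source. The helper name `split_keep_hyphens` and its `protected` parameter are made up for this example. The splitter is passed in as an argument so the helper does not depend on any particular model.

```python
import re
from typing import Callable, Iterable, List


def split_keep_hyphens(
    text: str,
    protected: Iterable[str],
    splitter: Callable[[str], List[str]],
) -> List[str]:
    """Split `text` with `splitter`, but emit each protected term untouched.

    Hypothetical helper, not part of wordninja. `protected` is the list of
    hyphenated terms (e.g. "Post-Punk") that must never be broken up.
    """
    terms = [t for t in protected if t]
    if not terms:
        return splitter(text)

    # Longest terms first so "Post-Punk-Revival" wins over "Post-Punk".
    terms.sort(key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(t) for t in terms), re.IGNORECASE)

    result: List[str] = []
    pos = 0
    for match in pattern.finditer(text):
        before = text[pos:match.start()]
        if before.strip():
            # Only the text between protected terms goes to the splitter.
            result.extend(splitter(before))
        # Keep the term exactly as it appeared in the input.
        result.append(match.group(0))
        pos = match.end()

    tail = text[pos:]
    if tail.strip():
        result.extend(splitter(tail))
    return result
```

With the library installed, it would be called as `split_keep_hyphens(text, ["Post-Punk"], wordninja.split)`, or with the `split` method of a custom `wordninja.LanguageModel`. The obvious limitation is that the protected terms have to be listed up front; hyphenated words that are not in the list are still handed to the splitter as before.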