ottokart / punctuator2

A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
http://bark.phon.ioc.ee/punctuator
MIT License
657 stars 195 forks source link

word break #40

Closed dmhorbachov closed 5 years ago

dmhorbachov commented 5 years ago

Hi, Ottokar

Great product! I'm not experienced in neural networks, so I have a question. Is it possible to keep words the same as from the input? I.e. the one is broken in such situation:

Unpunctuated text:

you're gonna get left behind in twenty sixteen everyone starting a podcast everyone is starting a podcast it's gonna be like bugs it's you you used to just be able to have a blog and it was cool and people read it

Result

You'Re gon na get left behind in twenty sixteen everyone starting a podcast. Everyone is starting a podcast, it's gon na be like bugs it's you. You used to just be able to have a blog, and it was cool and people read it

Thanks