Koziev / rulemma

Lemmatizer for Russian-language texts
MIT License

Would you be open to porting this to Javascript? #4

Closed by giorgosera 4 years ago

giorgosera commented 4 years ago

Hey @Koziev,

I stumbled upon your project here while searching for Russian lemmatizer implementations in JS.

Would you be open to porting this to Javascript? I can start working on this but I wanted to check with you first.

Let me know what you think! Great work on this project by the way 💯

Koziev commented 4 years ago

Hi,

"Would you be open to porting this to Javascript?"

It would be very difficult, I suppose. The lemmatizer requires two other tools, a part-of-speech tagger and a dictionary, to be ported as well, and the POS tagger is built on the CRFSuite package. Last but not least, I'm not a JS developer, so I couldn't write or support it :(
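To make the dependency chain described above concrete, here is a minimal sketch of why a lemmatizer of this kind needs both a POS tagger and a dictionary. It is not rulemma's actual code: `TOY_DICTIONARY`, `toy_tagger`, and `lemmatize` are hypothetical names, the dictionary entries are illustrative, and the one-line tagging rule merely stands in for a trained CRF model.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical toy dictionary: (word form, POS tag) -> lemma.
# Illustrative entries only; a real dictionary is far larger.
TOY_DICTIONARY: Dict[Tuple[str, str], str] = {
    ("мама", "NOUN"): "мама",
    ("мыла", "VERB"): "мыть",   # "washed"
    ("мыла", "NOUN"): "мыло",   # "of soap"
    ("раму", "NOUN"): "рама",
}


def toy_tagger(tokens: List[str]) -> List[str]:
    """Stand-in for a trained POS tagger (assumption: a single toy rule).

    The token right after "мама" is tagged VERB; everything else is NOUN.
    """
    tags = []
    for i, _ in enumerate(tokens):
        prev = tokens[i - 1].lower() if i > 0 else ""
        tags.append("VERB" if prev == "мама" else "NOUN")
    return tags


def lemmatize(
    tokens: List[str],
    tagger: Callable[[List[str]], List[str]],
    dictionary: Dict[Tuple[str, str], str],
) -> List[str]:
    """Tag the tokens, then look up each (form, tag) pair in the dictionary.

    Unknown pairs fall back to the lowercased word form.
    """
    tags = tagger(tokens)
    return [
        dictionary.get((tok.lower(), tag), tok.lower())
        for tok, tag in zip(tokens, tags)
    ]


if __name__ == "__main__":
    print(lemmatize(["Мама", "мыла", "раму"], toy_tagger, TOY_DICTIONARY))
    print(lemmatize(["кусок", "мыла"], toy_tagger, TOY_DICTIONARY))
```

The same form "мыла" resolves to different lemmas depending on its tag, so a port would need the tagger and the dictionary as well as the lemmatizer itself.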

giorgosera commented 4 years ago

Hey Ilya,

Thanks for the reply. I was actually proposing to do it myself but I thought I'd first get your opinion 👍

Yes, indeed: since the lemmatizer has some dependencies, the task might be harder than I expected.

Thanks anyways! :)

Koziev commented 4 years ago

By the way, look at https://github.com/koorchik/node-mystem3, for example. It is a kind of wrapper for the MyStem POS tagger and lemmatizer.

giorgosera commented 4 years ago

That works indeed. Nice find! Thank you :)