pemistahl / lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Apache License 2.0
706 stars 63 forks source link

About python version and train own data #86

Closed limitime closed 3 years ago

limitime commented 3 years ago

I think the project is excellent. I wonder is it possible to convert it to a python version? Right now I use jpype to use this package in python. But the performance is not stable. So is it possible to convert the model to a python version? What's more, I wonder how do you get the train data. I want to train my own data, can you provide train code example?

pemistahl commented 3 years ago

I'm not planning to port the library to plain Python. It is possible, however, to provide Python bindings for the Rust version of the library. I think this is something I'm going to follow but I cannot tell you when exactly.

The readme mentions the source of my training data in section 4.

Thrameos commented 3 years ago

What in particular is not stable in Jpype? If there is something that needs to be addressed you can put in an issue.