pemistahl / lingua

The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Apache License 2.0

Add Oromo #143

Closed. kwhumphreys closed this 2 years ago

kwhumphreys commented 2 years ago

Adds Oromo, using data from https://dumps.wikimedia.org/omwiki/latest/omwiki-latest-pages-articles.xml.bz2 that was cleaned with https://github.com/attardi/wikiextractor

kwhumphreys commented 2 years ago

Updated the README instructions to address some of the issues I encountered.

pemistahl commented 2 years ago

Hello Kevin @kwhumphreys, please excuse my late response. Thanks a lot for your language addition. This is great work. :) I'll gladly merge it so that Oromo will be part of the next major release 1.3.0.