optimaize / language-detector

Language Detection Library for Java
Apache License 2.0

Azerbaijani support #109

Open bgbashiro opened 3 years ago

bgbashiro commented 3 years ago

Frequencies were extracted from the azwiki dump using the CLI. I had to add a null check on the tag in TagExtractor, because it was throwing errors when parsing the XML dump: the string comparison against the closing tag fails if this.tag_ is null.
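
For reviewers, here is a rough sketch of the kind of guard described above. It is not the actual TagExtractor code from this repository: the class name `NullSafeTagExtractor` and its fields and methods are hypothetical stand-ins, and the only point is that the closing-tag comparison is skipped when no tag has been set.

```java
import java.util.Objects;

/** Hypothetical stand-in for a tag extractor; NOT the library's actual class. */
class NullSafeTagExtractor {
    private final String target;                        // tag whose text we collect (assumed name)
    private String tag;                                 // currently open tag; null until setTag() is called
    private final StringBuilder buf = new StringBuilder();
    private int count = 0;

    NullSafeTagExtractor(String target) {
        this.target = target;
    }

    void setTag(String tag) {
        this.tag = tag;
    }

    /** Buffers text only while the target tag is open. */
    void add(String text) {
        if (tag != null && Objects.equals(tag, target) && text != null) {
            buf.append(text);
        }
    }

    /** Returns the buffered text if the closed tag was the target, otherwise null. */
    String closeTag() {
        String result = null;
        // Explicit null check first: calling tag.equals(target) on a null tag
        // would throw a NullPointerException.
        if (tag != null && Objects.equals(tag, target) && buf.length() > 0) {
            result = buf.toString();
            count++;
        }
        buf.setLength(0);   // reset state for the next element
        tag = null;
        return result;
    }

    int count() {
        return count;
    }
}
```

With a guard like this, a closing tag that arrives before any opening tag has been recorded is ignored instead of raising an exception.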