ebasaran / language-detection

Automatically exported from code.google.com/p/language-detection
0 stars 0 forks source link

Profile for Korean is incorrect #56

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
The Korean profile included in this project is incorrect and not usable. 

This is apparently due to the replacing of Korean UnicodeBlock.HANGUL_SYLLABLES 
characters in NGram.java.

I've re-generated the attached profile for Korean using --genprofile with the 
problematic part of NGram.java removed.

Original issue reported on code.google.com by robert.m...@gmail.com on 30 Jun 2013 at 11:12

Attachments:

GoogleCodeExporter commented 9 years ago
Oh, I see.
Then I replaced ko into yours.
I'll modify NGram.java later.
Very Thanks!

Original comment by nakatani.shuyo on 24 Jul 2013 at 8:55