wooorm / franc

Natural language detection
https://wooorm.com/franc/
MIT License

How to contribute languages #24

Closed by smari 9 years ago

smari commented 9 years ago

Hi! I'm curious to know how to contribute new languages to franc. Do you have a standard method for creating the definition lines?

wooorm commented 9 years ago

Contact the Unicode Consortium: http://unicode.org/udhr/contributing.html. It’ll probably take years though.

Out of curiosity, which language would you like to add? (Does it have an Ethnologue entry?)

smari commented 9 years ago

I'd like to add Icelandic (which I have the knowledge to do), but I'm also looking for support for Georgian.

(For fun: I just threw this together -- probably not useful for most people, but it's useful for my purposes and others might benefit too: https://github.com/occrp/Francserver )

wooorm commented 9 years ago

Icelandic is supported in the franc-all version rather than the default build, as it has only ± 300,000 speakers. Read this for more info on how to include it.

Also, cool server! :+1:

smari commented 9 years ago

Ah, thanks for that. How would I use that from node though?

wooorm commented 9 years ago

I believe the file supports CommonJS, so you should be able to load it directly with require. Otherwise, you would need to fork the project.
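A minimal sketch of what loading a CommonJS build directly from node could look like. The helper name `loadDetector` and the path `./franc-all.js` are hypothetical, and the assumption that the build exports either the detection function itself or a `franc` property has not been checked against any particular release.

```javascript
const path = require('path');

// Hypothetical helper: load a CommonJS build of franc from a local file
// and return its detection function.
function loadDetector(file) {
  const mod = require(path.resolve(file));
  // Assumption: the build either exports the function directly
  // or exposes it as a `franc` property.
  const detect = typeof mod === 'function' ? mod : mod.franc;
  if (typeof detect !== 'function') {
    throw new TypeError('No detection function exported by ' + file);
  }
  return detect;
}

// Usage, assuming a local copy of the build at this hypothetical path:
// const franc = loadDetector('./franc-all.js');
// console.log(franc('Alþingi er löggjafarþing Íslands.'));
```

Resolving the path first means the file is looked up relative to the current working directory rather than relative to the calling module, which is usually what you want for a file you downloaded by hand.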