komodojp / tinyld

Simple and Performant Language detection library for NodeJS
https://komodojp.github.io/tinyld/
MIT License
415 stars 12 forks

Add more languages #8

Closed kefniark closed 4 months ago

kefniark commented 2 years ago

Description

Add more languages, the goal would be to have:

So far, it seems a bit pointless to me to go beyond 100 languages. Past that point it really becomes niche usage, and the accuracy on those smaller languages just degrades.

Totistar commented 1 year ago

Could you add Catalan (ISO codes: ca, cat), please?

kefniark commented 1 year ago

I will take a look, but it's typically the kind of language I usually avoid 😄 It's not in the top 100 languages by number of speakers, and it's really close to Spanish and French, so it tends to create false positives with those languages on short sentences.

Another idea I need to experiment with is providing bigger profiles with more languages, along with a way for people to recompile the library with only the subset of 20~30 languages they need.
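To make the subset idea concrete, here is a rough sketch only. It does not reflect tinyld's actual profile format or build step: the `Profile` type, `subsetProfile`, and `detectWith` are hypothetical names, and the trigram scoring is a simplified stand-in for whatever the library really does.

```typescript
// Hypothetical profile shape: language code -> (trigram -> weight).
// This is an assumption for illustration, not tinyld's real data format.
type Profile = Record<string, Record<string, number>>;

// Build a smaller profile that keeps only the requested languages.
// Codes that are missing from the full profile are silently skipped.
function subsetProfile(full: Profile, keep: string[]): Profile {
  const out: Profile = {};
  for (const lang of keep) {
    const grams = full[lang];
    if (grams !== undefined) {
      out[lang] = grams;
    }
  }
  return out;
}

// Simplified scorer: sum the weights of every trigram found in the text
// and return the best-scoring language, or "" when nothing matched.
function detectWith(profile: Profile, text: string): string {
  const normalized = text.toLowerCase();
  let best = "";
  let bestScore = 0;
  for (const lang of Object.keys(profile)) {
    const grams = profile[lang];
    let score = 0;
    for (let i = 0; i + 3 <= normalized.length; i++) {
      score += grams[normalized.slice(i, i + 3)] ?? 0;
    }
    if (score > bestScore) {
      best = lang;
      bestScore = score;
    }
  }
  return best;
}
```

With something like this, a user who only cares about a few languages would call `subsetProfile(fullProfile, ["ca", "es", "fr"])` once at build time and ship the smaller result, so unrelated languages can no longer show up as false positives.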

Totistar commented 1 year ago

It would be great to have it. We are well aware of how close Catalan is to Spanish and French, but it has 7 million speakers, similar to Danish for example, so I'm sure it would be useful for many people. I encourage you to add it.