get-alex / alex

Catch insensitive, inconsiderate writing
https://alexjs.com
MIT License

Internationalisation #202

Open wooorm opened 6 years ago

wooorm commented 6 years ago

Subject of the issue

Alex, for now, supports just English, but retext could work with other Latin-script languages.

Do you know a language other than English?

...and are you able (it’ll take some time) and willing to help? Create a new issue for your language and let’s start working on this!

Translations

The following table describes the status of translations:

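| Language | profanities | cuss | retext-profanities | retext-equality |
| --- | --- | --- | --- | --- |
| English | Yes | Yes | Yes | Yes |
| Arabic (Latin-script) | Yes | Yes | Yes | No |
| Spanish | Yes | Yes | Yes | No |
| French | Yes | Yes | Yes | No |
| Portuguese (Brazilian) | Yes | Yes | Yes | No |
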
CRomano31415 commented 6 years ago

I see the issue in retext-profanities. I'll start putting a list in Spanish together 👍🏼

JekRock commented 6 years ago

Does it support only Latin languages? I can help with Russian and Ukrainian.

deatheguard commented 6 years ago

I can help with French and/or English!

baezor commented 6 years ago

I can help with Spanish! Is someone working on Spanish here?

wooorm commented 6 years ago

@CRomano31415 Please do! Feel free to open a new issue / PR about Spanish

@JekRock Under the hood, mainly Latin-script languages are supported. Cyrillic could potentially work. See parse-latin for the main parts about it.

@deatheguard French would be great. Looks like you found GH-207 already. For English, you can just help with retext-profanities and retext-equality already!

@baezor Please do as well! Let’s check if @CRomano31415 opens an issue so y’all can collaborate :)

AhmedRedaAmin commented 6 years ago

Can I add support for Arabic Latin-script slang? (Used heavily on social media.) Note: it includes numbers for some Arabic sounds, in case that causes problems for parse-latin.

wooorm commented 6 years ago

@AhmedRedaAmin Yes, feel free to do that, and create a new issue for it!

Note: it includes numbers for some Arabic sounds, in case that causes problems for parse-latin.

I’m not sure, I don’t know Arabic, let’s try it out!
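One way to "try it out" before opening an issue is to pre-screen a candidate word list for terms that mix Latin letters with digits. The sketch below is only a rough illustration: `classifyTerm` and the sample words are hypothetical, it handles single words only, and it does not call parse-latin or predict how parse-latin tokenises such terms.

```typescript
// Hypothetical pre-check for a word list; this does NOT call parse-latin.
type ScriptClass = "latin" | "latin-with-digits" | "other";

// Built with the RegExp constructor so the Unicode property escapes are
// resolved at runtime (Node.js supports \p{Script=...} with the "u" flag).
const latinOnly = new RegExp("^\\p{Script=Latin}+$", "u");
const latinAndDigits = new RegExp("^[\\p{Script=Latin}0-9]+$", "u");
const hasLatin = new RegExp("\\p{Script=Latin}", "u");

function classifyTerm(term: string): ScriptClass {
  // Only Latin-script letters, e.g. "habibi".
  if (latinOnly.test(term)) return "latin";
  // Latin letters mixed with ASCII digits, e.g. romanised Arabic "7abibi".
  if (latinAndDigits.test(term) && hasLatin.test(term)) return "latin-with-digits";
  // Anything else: other scripts, digits only, spaces, punctuation.
  return "other";
}

// Sample terms chosen for illustration only.
const samples = ["habibi", "7abibi", "3arabi", "привет", "한국어"];
for (const term of samples) {
  console.log(term, classifyTerm(term));
}
```

Terms classified as `latin-with-digits` are the ones worth testing by hand against the real tooling, since that is the open question here.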

inesbenomar18 commented 6 years ago

I can help with Arabic and French as well! :)

wooorm commented 6 years ago

@inesbenomar18 Awesome! For French, start with GH-207. And feel free to open a new issue about Arabic (note though that I think Arabic-script may not be properly supported)

AhmedRedaAmin commented 6 years ago

Okay, so I noticed there were 2 approaches to adding new languages, so I'll take @CRomano31415's approach. I already started by borrowing her repo template. I hope you don't mind, Claudia!

rampagesang commented 6 years ago

@wooorm Do you know a language other than English? ...and are you able (it’ll take some time) and willing to help? Create a new issue for your language and let’s start working on this!

I can help with Korean!~

wooorm commented 6 years ago

@rampagesang Awesome! Unfortunately I don’t think languages that use a script other than the Latin-script can work with the current technical setup :(

Yangeok commented 6 years ago

Can I help with translating into Korean and German?

wooorm commented 6 years ago

@Yangeok Hey, that’s cool! I don’t think Korean will work (see the comment before yours), but German would definitely work!

PaoloWeishaupt commented 6 years ago

I can help with Italian, Spanish, French and German.

luigicorreia commented 6 years ago

I can help with Portuguese.

toucedam commented 6 years ago

I can help with Spanish! Is someone working on Spanish here?

yes

GledsonAfonso commented 6 years ago

I can help with Portuguese too (pt-BR).

waaghree commented 6 years ago

Do you know a language other than English?

...and are you able (it’ll take some time) and willing to help? Create a new issue for your language and let’s start working on this!

How about Urdu? It is mostly written in Arabic script with added characters; however, to a large extent people use the English character set to write on social media, in something called "Roman Urdu". @AhmedRedaAmin are you working on something similar in Arabic?

wooorm commented 6 years ago

@waaghree Yep, romanised can work. For inspiration, see the Arabic cuss file added in https://github.com/words/cuss/pull/16!

AhmedRedaAmin commented 6 years ago

@waaghree Yes, I worked on something very similar. You can refer to @wooorm's link as well as the issue titled Arabic Latin-Script on this repo. Feel free to ping me if you want to ask about anything. Glad to see Arabic script being transliterated for more than one language :D Good luck, mate.

GledsonAfonso commented 6 years ago

@wooorm @AhmedRedaAmin Apart from the words/cuss project, is there another way we can help with internationalisation? If so, how?

AhmedRedaAmin commented 6 years ago

@GledsonAfonso Here is how it works, as far as I know. You add the words to the words/cuss project and rate them for profanity; they then get used in get-alex/alex to detect profane words. As far as I know, that is how the YAML files identify the inappropriate words. The "insensitive" phrases are different: they get added to retext-equality directly, by hand, together with the suggested alternatives, whereas flat-out cusses obviously don't have alternatives suggested. This is what I understood, at least. Short answer: the second-best way to help with internationalisation is to add insensitive phrases from your native language to retext-equality, along with their suggested alternatives. Good luck, mate! :D
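The two flows described above can be sketched roughly as follows. This is a self-contained illustration under assumptions, not the real cuss, retext-equality, or alex API: the data shapes, the `check` and `lookup` functions, the sample entries, and the 0 to 2 rating scale used here are all placeholders chosen for the example.

```typescript
// Hypothetical data shapes; the real projects keep their lists in their own files.
type ProfanityRating = 0 | 1 | 2; // assumed scale: 0 = unlikely, 2 = likely profane

interface Finding {
  word: string;
  message: string;
}

// Flow 1: rated word list (placeholder entries, not taken from words/cuss).
const profanities: Record<string, ProfanityRating> = { darn: 0, heck: 1 };

// Flow 2: insensitive terms with hand-written alternatives
// (placeholder entry, not taken from retext-equality).
const insensitive: Record<string, string[]> = { fireman: ["firefighter"] };

// Own-property lookup, so words like "constructor" never match by accident.
function lookup<T>(table: Record<string, T>, key: string): T | undefined {
  return Object.prototype.hasOwnProperty.call(table, key) ? table[key] : undefined;
}

function check(text: string, minRating: ProfanityRating = 1): Finding[] {
  const findings: Finding[] = [];
  // Naive tokeniser for the sketch; the real tooling parses text properly.
  const words = text.toLowerCase().match(/[a-z0-9']+/g) ?? [];
  for (const word of words) {
    const rating = lookup(profanities, word);
    if (rating !== undefined && rating >= minRating) {
      // Profanities are only flagged; no alternative is suggested.
      findings.push({ word, message: `"${word}" may be profane (rating ${rating})` });
    }
    const alternatives = lookup(insensitive, word);
    if (alternatives !== undefined) {
      // Insensitive terms carry suggested alternatives.
      findings.push({ word, message: `"${word}" may be insensitive, consider: ${alternatives.join(", ")}` });
    }
  }
  return findings;
}

for (const finding of check("Heck, the fireman said darn")) {
  console.log(finding.message);
}
```

The point of the split is that adding a language means contributing to two separate lists: rated words on one side, and phrase-plus-alternative pairs on the other.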

GledsonAfonso commented 6 years ago

@AhmedRedaAmin Thank you for the explanation! I think it would be interesting if we had a section in the get-alex/alex project (a wiki, maybe? With a mention in the README file and all the cool stuff) about this... or maybe this already exists and I'm just making a fool of myself here, haha.

Anyway, thanks again for the heads up. I'll see if I can add some phrases in retext-equality as soon as I get the gist of it. Cheers!

wooorm commented 6 years ago

@GledsonAfonso We could definitely use a section on internationalisation in the contributing.md file!

GledsonAfonso commented 6 years ago

@wooorm Great! Do we need to create an issue for that?

wooorm commented 6 years ago

Sure, you could create a separate issue, or if you’d like to work on it feel free to take a stab at it!

OtacilioN commented 6 years ago

Hey Folks, I would love to help with Portuguese!

GledsonAfonso commented 6 years ago

@wooorm Okay! I will create an issue for that now and see if I can manage to work on it later, in case no one takes it first. Thanks!

@OtacilioN Thanks for creating the issue for the language.