aradzie / keybr.com

The smartest way to learn touch typing and improve your typing speed.
https://www.keybr.com/
GNU Affero General Public License v3.0
1.43k stars 133 forks source link

Add Turkish Q and Turkish F layouts #117

Closed hanefff closed 3 months ago

hanefff commented 6 months ago

I learned fast typing on keybr and now I'm willing to improve my both Q and F layout Turkish writing skills. I'd be so happy if I can do it here as well. They are both ISO layouts.

aradzie commented 3 months ago

Hi, I added the Turkish language and layouts c27a557f

The change is already live, can you please test it?

I am not sure I got all the rules right, especially about the dotless I, so I am looking for the help of a native speaker.

Also, we have an unfinished Turkish translation, maybe you can translate a few hundred words?

hanefff commented 3 months ago

Hi, yes I have created it and did some translations, will continue as soon as I have time.

Thanks a lot for adding the layout! We have two I's, one with dot and one without dot. They are separate letters and they have their own capital letters. I can happily test it!

aradzie commented 3 months ago

FYI, the list of Turkish letters is "abcçdefgğhıiklmnoöprsştuüvyz". I included dotless I, and removed Q, W and X. Is that ok? It seems like the removed letters are only used in the borrowed words, and have a very low frequency, which breaks the pseudo-word generator.

aradzie commented 3 months ago

Ah, and thank you for your translation!

hanefff commented 3 months ago

We also have "j" , removed ones are correct

hanefff commented 3 months ago

I've noticed only one issue, when a word starts with capital dotted i "İ" it shows as I.

For example; it appears as "Istiyorum" but it should be "İstiyorum" I didn't notice any other issue, works flawless!

aradzie commented 3 months ago

Ah, yes, "J" is also removed. So the full list of removed letters is "J", "Q", "W" and "X".

I took a list of words from subtitles.

The problem is, there is a lot of obscene, vulgar and profane language in subtitles, so it must be censored, and usually I get help from native speakers. However, I managed to do something on my own.

Here is the dictionary BEFORE I censored it.

English language is common is subtitles, so I excluded English words from the above list.

Also, I found a list of bad words on the Interned and removed these as well.

The final, censored word list is here (it's a gzip archive). Or, for your convenience, the same list as a spreadsheet. I think it would be a good idea to review the first couple of thousands of words, and if you find anything offensive, just update the spreadsheet and let me know, I'll update the website.

hanefff commented 3 months ago

No the opposite actually, "J" should be added, it wasn't on the list of alphabet you shared.

Sure, I'm checking the list now

hanefff commented 3 months ago

Checked first 5500 words, removed half words and would be weird to have ones. You can update the website

aradzie commented 3 months ago

Thanks!

I have updated the dictionary, here is the difference.

The letter "J" is also included. Although among the first 3000 words there are only four that have it: "ajan", "mesaj", "majesteleri", "enerji".

I also know why the I letters are not capitalized properly:

const a = "AaIıİi";
console.log(a.toUpperCase());           // "AAIIİI", wrong!
console.log(a.toLocaleUpperCase("tr")); // "AAIIİİ", ok!
console.log(a.toLowerCase());           // "aaiıi̇i", wrong!
console.log(a.toLocaleLowerCase("tr")); // "aaııii" ok!

I'll fix this issue and update the website tomorrow.