kriskowal / tengwarjs

A Tengwar (J.R.R. Tolkien’s Elvish alphabet) transcriber for ES5 and HTML5
http://tengwar.3rin.gs
MIT License
58 stars 9 forks source link

List of common words to add that trick the transliterator #59

Open dreamingfifi opened 5 years ago

dreamingfifi commented 5 years ago

So, I have been thinking of this for a while, and I think it would be a good partial step between just having an entire dictionary in the transliterator and letting it be fooled by every little spelling weirdness that English does.

Why not have a list of function words added to it? These are words like determiners, pronouns, helping verbs, prepositions, pronouns and so on. We wouldn't have to add all of them either, just ones that wouldn't be rendered correctly because they "trick" the transliterator.

I found a handy list of function words: https://semanticsimilarity.wordpress.com/function-word-lists/

If there is anything I'm good at, it's gathering lists and transliterating them. It'd take me a couple hours tops. Hell, before you get a chance to read this I may have already finished and posted the list.

It should be easy to implement too... you're already doing this with the shorthands of "of, the, of the, and" so this would basically be the same thing.

dreamingfifi commented 4 years ago

Here is a list of tricky words. list-of-function-words.pdf