-
Hi,
we are using indic-trans in order to make transliteration from Hindi to Roman/Eng.
After applying your model, I have good results in general, but there are still some errors, as some Hindi peop…
-
Following Axelle's advice, let us use
``
instead of
``
1. to tag Grantha syllables in a Tamil text
2. to tag Sanskrit in Grantha sentences/phrases in a Tamil text
I guess we will thus use
…
-
It's feasible to **remove multiple whitespace** from the source text.
i.e. - Replace each occurrence of multiple spaces by a single space.
The total is **1298** each in both the Cherokee and the t…
-
`title_case` parameter has to be script-specific: it is only relevant when used with a target script that makes an upper-lower case distinction, but it will be language-independent (no matter what lan…
-
### Actual behavior
Search always return empty result for cyrillic keywords
### Expected behavior
It should work the same as for latin text/keywords
### Steps to reproduce the behavior
…
xTNTx updated
5 years ago
-
ISO 24229 was the basis for Interscript system codes.
With the progress of Interscript we are now aware that the original code "system" doesn't quite work:
i.e. the pattern `"{authority code}-{lan…
-
My PC and mobile's default languages are set to Japanese, and I have Spotify set to English. But Receiptify is displaying the Japanese transliterations of some non-Japanese artists' names. I was wonde…
-
Task 1: We need to compile a dataset of KPS9566 encoded documents. Extract from a Red Star OS dump.
https://archive.org/details/RedStarOS
Task 2: Make python aware of the encoding.
https://d…
-
As mentioned in https://github.com/jgm/pandoc/issues/807#issuecomment-310831480 and https://github.com/jgm/pandoc/issues/807#issuecomment-310831794 ,
I'd like to have the option to transliterate no…
-
https://github.com/interscript/interscript/issues/284 in the document attached,
1-
a)defined article such as "عبدالرحمن" Abd-al-Rahman
is sometimes having an Hyphen as a separation
as here
![ima…