-
Search for las, observe the list of results that goes like: las, las 'bo, ..., las 'bras, ..., las 'byung, ..., las 'char, ... - which is obviously sorted in ascii order
-
do you have any examples for tokenizer of Tibetan Scripts?
-
Is there any plan to support CoNLL-U's **SpaceAfter=No** attribute that can be put in its MISC column? I am writing annotation guidelines for Tibetan, which like Chinese does not put whitespace betwee…
heacu updated
7 years ago
-
### Description
This document outlines the proposed changes to the application architecture to transmit tokens via cookies rather than sending them in the loader GET request. The motivation for this …
-
## Defect Report
### Font
`NotoSerifTibetan-Regular.ttf`
### Where the font came from, and when
https://github.com/googlefonts/noto-fonts/raw/404f9f1a7a3929c939cb566d249b9477f24d18cb/hint…
-
U+0F39 TIBETAN MARK TSA -PHRU is in USE subclass CMAbv. U+0F71 TIBETAN VOWEL SIGN AA is overridden to Indic_Syllabic_Category=Nukta, so it is in subclass CMBlw. Other vowel signs are in subclasses VAb…
-
Here is our use case:
We are dealing with languages that can be displayed in various manners, for instance:
- Tibetan displayed in:
- Unicode Tibt script
- EWTS transliteration
- anoth…
eroux updated
4 years ago
-
![330558086_617021590244976_5870843549161514781_n](https://user-images.githubusercontent.com/2863444/223897247-845f6075-a65a-43ab-a3c5-14e10fa3c773.jpg)
---
tag: vertical-text Tibetan in vertica…
-
Here are some remarks from someone who uses OpenPecha on BUDA and is also uses Google OCR directly:
> Oh, as I start to use the e-text facility of
> https://library.bdrc.io/
> I notice that occa…
eroux updated
2 years ago
-
The current transliteration button (allowing to switch between Wylie and Unicode) is really good for Tibetan but should be adjusted for non-Tibetan etexts such as https://library.bdrc.io/show/bdr:UTIE…