khmer-nlp Search Results

ljvmiranda921/comments.ljvmiranda921.github.io #54

Guest lecture @ UNC Charlotte: Labeling with LLMs

# Guest lecture @ UNC Charlotte: Labeling with LLMs A few weeks ago, I held a guest lecture at University of North Carolina Charlotte on how we can use large language models for annotation in the con…

utterances-bot updated 8 months ago

typesense/typesense #228

Support for writing systems without spaces between words

## Description I'm trying to use Typesense with my content in Thai. What's special is that Thai (and a few other languages) doesn't use spaces to separate words. Typesense seems to care about that.…

artt updated 2 months ago

notofonts/notofonts.github.io #46

Some combination glyphs in Khmer return inconsistent width v…

## Defect Report I use NotoSansKhmer and uharfbuzz together with fpdf2 to create a pdf document. To right align the text I need the width of a string after being adjusted by the font shaping engine…

kreier updated 5 months ago

python-poetry/poetry #9476

Custom optional = false group in local dependency not gettin…

### Description For a local dependency with custom non optional group, dependencies listed do not get installed when installing main project. #### here's the local package `znlp_translate` #####…

DivyanshuBhoyar updated 4 months ago

tidyverse/stringr #542

str_split not splitting correctly on Unicode character

I am trying to split Burmese Unicode characters in stringr::str_split() but not return the correct values. `str_split("စမ်းသပ်မှု", "")[[1]]` it returns: > [1] "စ" "မ်" "း" "သ" "ပ်" "မှု" …

alexanderbeatson updated 4 months ago

koreader/koreader #11701

FR: Thai language word break doesn't work

## Problem KOReader use [libunibreak](https://github.com/adah1972/libunibreak) find location to break lines. However, it doesn't support breaking SEA languages: Thai, Burmese, Lao, Khmer outlined in…

inganault updated 6 months ago

SEACrowd/seacrowd-datahub #109

Create dataset loader for Asian Language Treebank Parallel C…

Dataloader name: `parallel_asian_treebank/parallel_asian_treebank.py` DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?parallel_asian_treebank | Dataset| parallel_asian_tr…

SamuelCahyawijaya updated 7 months ago

SEACrowd/seacrowd-datahub #424

Create dataset loader for Bactrian-X

Dataloader name: `bactrian_x/bactrian_x.py` DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?bactrian_x | Dataset| bactrian_x | |-------------|---| | Description | The B…

SamuelCahyawijaya updated 7 months ago

cldf/cldf #98

How to model paradigms in CLDF

While paradigmatic data can be modeled somewhat using pure `Wordlist`s, this means that a lot of information may get lost (or at least be only informally added) - and in particular not be readily avai…

xrotwang updated 1 year ago

common-voice/common-voice #3666

Requesting to add Khmer in common voice

# Welcome to the Common Voice Community ! > Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labelled voice data that is representative of l…

ksoky updated 2 years ago

13 results for khmer-nlp

13 results
for khmer-nlp