-
Dataloader name: `uit_viquad/uit_viquad.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?uit_viquad
| Dataset| uit_viquad |
|-------------|---|
| Description | Vietnamese…
-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…
-
I am trying to rerun the https://github.com/fastai/course-nlp/blob/master/nn-vietnamese.ipynb Vietnamese notebook and am getting the file not found error at
get_wiki(path,lang)
This seems to be …
-
I try to use newspaper for vietnam's news, everything seem good but keyword extraction.
For example, with http://dantri.com.vn/xa-hoi/tphcm-thu-hoi-nha-cua-ong-tran-van-truyen-1002764.htm
, extracted …
-
Would the AUDIO version of this logically fall under the same commit?
vgx32 updated
9 years ago
-
👩💼 As an NLP engineer, you know that accurate and reliable language resources are essential for building effective natural language processing systems. 📚 The Vietnamese Dictionary project is a compre…
-
i train work embedding 300 dimension with 12000 word from https://nlp.stanford.edu/projects/nmt/data/iwslt15.en-vi/.
but not work.. help me tks you very much.
---------------------------------------…
-
**Issue by [monday0rsunday](https://github.com/monday0rsunday)**
_Fri Dec 5 07:09:24 2014_
_Originally opened as https://github.com/codelucas/newspaper/issues/93_
----
I try to use newspaper for v…
-
Now, after some digging up there was this project a while ago: http://www.ustarconsortium.com/qws/slot/u50227/index.html which has some documentation on https://www.nict.go.jp/en/asean_ivo/lde9n200000…
-
## Description
Hello @mishig25 ,
I am writing to raise a concern about the lack of support for automatic fill-in of internal TOCs (Tables of Contents) for non-English languages in the huggingfac…