-
- uid: vinbigdata_asr_vlsp_2020
- type: processed
- description:
- name: VinBigData ASR VLSP 2020
- description: 100 hours of speech data in Vietnamese provided by VinBigData for the VLSP ASR Cha…
-
- uid: MT_Vi_Mono_VLSP2020
- type: processed
- description:
- name: Vietnamese VLSP 2020 Machine Translation Monolingual Dataset
- description: Vietnamese monolingual dataset consisting of 2 mill…
-
- uid: vietnamese_MT_EV_VLSP2020
- type: processed
- description:
- name: Vietnamese EN-VI Machine Translation VLSP 2020
- description: Bilingual Dataset EN-VI. Consisting of 20k samples. Domains…
-
Hi @mmcauliffe - thanks for this great tool!
I'm getting a lot of OOVs because it appears that word-final colons and single quotes are being stripped, so while my lexicon has something like `iahote…
-
Cho em hỏi corpus dataset ban đầu dùng để train cho model này nhóm lấy ở đâu? có thể chia sẻ không?
-
Thanks for your great Vietnamese NLP toolkits .
I want to get the VLSP 2013 POS tagging dataset and the VLSP 2016 NER dataset .
How can I get these datasets since the official website does not pr…
-
**Please be sure to add the Anthology ID in the title**
[See here](https://www.aclweb.org/anthology/info/corrections/) to read about the three types of corrections.
## Metadata correction: pleas…
sonvx updated
3 years ago
-
Thanks for creating such a wonderful tool. However, I cannot seem to find the POS tag corpus and mapping table anywhere, therefore the result of POS tagging is a bit confusing to me. Could you guide m…
-
Hi,
Could download the PhoBERT package with transformers; would you know then how to use post tagging afterwards?
Thanks!
Pierre
-
### Bug Description
I use branch v0.2 to train and convert model [stream convnet](https://github.com/facebookresearch/wav2letter/tree/v0.2/recipes/models/streaming_convnets).
After 65 epochs i stop …