PyThaiNLP / nlpforthai.com

NLP For Thai
http://nlpforthai.com/
Apache License 2.0
25 stars 7 forks source link

[TODO] Todo: Add link #9

Open wannaphong opened 2 years ago

wannaphong commented 2 years ago

ASR

Dataset

LM

Speech

Text corpus

Coreference resolution

wannaphong commented 3 months ago

mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus

Paper: https://arxiv.org/abs/2406.08707 Dataset: https://huggingface.co/datasets/oscar-corpus/mOSCAR