Open wannaphong opened 2 years ago
Dataset
mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus
Paper: https://arxiv.org/abs/2406.08707
Dataset: https://huggingface.co/datasets/oscar-corpus/mOSCAR