monolingual-corpora Search Results

133 results
for monolingual-corpora

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

SEACrowd/seacrowd-datahub #35

Create dataset loader for MIRACL

Dataloader name: `miracl/miracl.py` DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?miracl | Dataset| miracl | |-------------|---| | Description | MIRACL is a multilingual d…

SamuelCahyawijaya updated 10 months ago
6
clarin-eric/ParlaMint #708

IS: missing lang attribute in language

https://github.com/clarin-eric/ParlaMint/blob/535dae3f802d20ea053e76899ddcf6ab805049c0/Data/ParlaMint-IS/ParlaMint-IS.xml#L124-L127 should be: ```XML English Icelandic ``` O…

matyaskopp updated 1 year ago
4
furukawa-ai/deeplearning_papers #87

Unsupervised Machine Translation Using Monolingual Corpora O…

unsupervised NMTモデルその２（Facebook、2017-10-31にarxivに投稿、ICLR2018狙い） https://arxiv.org/abs/1711.00043 >Machine translation has recently achieved impressive performance thanks to recent advances in de…

msrks updated 1 year ago
1
iflytek/cino #29

What is the pre-training dataset

論文入面好似冇提到預訓練資料集係乜嘢

ayaka14732 updated 1 year ago
1
furukawa-ai/deeplearning_papers #86

UNSUPERVISED NEURAL MACHINE TRANSLATION

unsupervised NMTモデルその１（2017-10-30にarxivに投稿、ICLR2018狙い） https://arxiv.org/abs/1710.11041 >In spite of the recent success of neural machine translation (NMT) in standard benchmarks, the lack of la…

msrks updated 1 year ago
1
mozilla/firefox-translations #602

Please support for Catalan language

Hello Please consider adding Catalan language. In this repository you have a large collection of open source aligned parallel corpus that you can use to train your system: https://github.com/…

jordimas updated 1 year ago
10
AI4Bharat/IndicBERT #1

Extending IndicBert V2

I have some data for three low resource languages, two of them are not in the list of 24 languages of IndicBERT V2 and for one I may have some more data. I want to continue training on this data from …

singhakr updated 1 year ago
6
ezosa/M3L-topic-model #2

Missing data

Hello! Thanks for your great job! I find that the data folder is missing. If possible, can you release the dataset or the preprocessing script? Thanks all.

liuh236 updated 1 year ago
9
fly51fly/aicoco #3

爱可可老师24小时热门分享

微博内容精选

fly51fly updated 2 weeks ago
1906
AI4Bharat/indicnlp_corpus #10

The Corpus is not downloadable

Hi, I'm trying to download the corpus for Hindi Language using the link in Readme.md, but getting the following Error: ```bash wget https://storage.googleapis.com/ai4bharat-public-indic-nlp-corpor…

swapnil3597 updated 2 years ago
5

上一页 1...2 3 4 5 6 7 8...14 下一页

133 results for monolingual-corpora

133 results
for monolingual-corpora