-
https://github.com/CLARIAH/usecases/blob/master/cases/hipaco.md
-
-
## In a nutshell
A method for obtaining multilingual distributed representations (LASER). It is built on a bi-directional LSTM encoder/decoder: sentences processed by the encoder are max-pooled over time, and at decode time the language ID is always concatenated to the decoder input. Training divides the work so that the encoder learns a language-independent representation while the decoder handles language-specific reconstruction.
![image](https://us…
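A minimal sketch of the architecture described above, assuming hypothetical class names and dimensions (this is not the official LASER implementation): a BiLSTM encodes the sentence, max-pooling over time produces a fixed-size, language-independent sentence embedding, and the decoder input concatenates a language-ID embedding at every step.

```python
import torch
import torch.nn as nn

class LaserStyleEncoder(nn.Module):
    """BiLSTM encoder; max-pool over time gives a fixed-size sentence embedding."""
    def __init__(self, vocab_size=1000, emb_dim=64, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, bidirectional=True, batch_first=True)

    def forward(self, tokens):
        # tokens: (batch, seq_len) -> states: (batch, seq_len, 2*hidden)
        states, _ = self.lstm(self.embed(tokens))
        # Max-pool over the time dimension -> (batch, 2*hidden)
        return states.max(dim=1).values

class LaserStyleDecoderInput(nn.Module):
    """Per-step decoder input: [token emb ; sentence emb ; language-ID emb]."""
    def __init__(self, vocab_size=1000, emb_dim=64, n_langs=4, lang_dim=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lang_embed = nn.Embedding(n_langs, lang_dim)

    def forward(self, tokens, sent_emb, lang_id):
        _, seq_len = tokens.shape
        tok = self.embed(tokens)                                      # (B, T, emb)
        sent = sent_emb.unsqueeze(1).expand(-1, seq_len, -1)          # (B, T, 2*hidden)
        lang = self.lang_embed(lang_id).unsqueeze(1).expand(-1, seq_len, -1)
        # The language ID is concatenated at every decoding step
        return torch.cat([tok, sent, lang], dim=-1)

enc = LaserStyleEncoder()
dec_in = LaserStyleDecoderInput()
tokens = torch.randint(0, 1000, (2, 7))
sent_emb = enc(tokens)                   # (2, 256): language-independent embedding
lang_id = torch.tensor([0, 2])           # target-language IDs
x = dec_in(tokens, sent_emb, lang_id)    # (2, 7, 64 + 256 + 8)
```

Because the sentence embedding is a single pooled vector, the decoder can only reconstruct the sentence from information the encoder packed into it, which pushes the encoder toward a language-neutral representation.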
-
I have no idea how to fix this; any help, or at least some guidance, is appreciated.
And here is my current log for a new job.
It seems to be trying to `Collecting translated mono src dataset` before tra…
-
**Paper**
Noising and Denoising Natural Language: Diverse Back Translation for Grammar Correction
**Introduction**
This research proposes a solution for data sparsity (noisy and clean pairs) in g…
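To make the data format concrete, here is a simplified illustration of generating synthetic (noisy, clean) training pairs. Note this is an assumption-laden sketch: the paper derives diversity from noised back-translation, whereas the `noise` function below uses simple rule-based corruptions (drop / duplicate / swap words) purely to show what such pairs look like.

```python
import random

def noise(sentence, rng, p=0.3):
    """Rule-based corruption (illustrative only, not the paper's method):
    with total probability p, drop, duplicate, or swap words."""
    words = sentence.split()
    out, i = [], 0
    while i < len(words):
        r = rng.random()
        if r < p / 3:                       # drop a word
            i += 1
            continue
        if r < 2 * p / 3:                   # duplicate a word
            out.extend([words[i], words[i]])
            i += 1
            continue
        if r < p and i + 1 < len(words):    # swap adjacent words
            out.extend([words[i + 1], words[i]])
            i += 2
            continue
        out.append(words[i])                # keep the word unchanged
        i += 1
    return " ".join(out)

rng = random.Random(0)
clean = "the cat sat on the mat"
# Synthetic (noisy, clean) pairs: noisy source, clean target
pairs = [(noise(clean, rng), clean) for _ in range(3)]
```

Each clean sentence can yield several distinct noisy sources, which is the "diverse" part: the correction model sees many error patterns mapped to the same clean target.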
-
- ML test-bench style
- streaming data? or fixed corpus? (transfer costs excessive on Azure - probably want a fixed corpus)
- two corpora: [DL S2 DLSR, S1] and [L2A, EES1]
- pytorch
- pytorch par…
-
Thanks to the original author for their work.
But: "This repository is over its data quota. Purchase more data packs to restore access." This problem is really frustrating! It's like eating steak with a nail c…
-
Retraining from a checkpoint works perfectly with on-the-fly tokenization, but breaks when using nanoset: training restarts with a different lr, which does not match the one in lr_schedule.pt.
We also have…
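A common cause of a resumed run restarting at the wrong lr is that the scheduler's state is not saved and restored along with the model and optimizer. A minimal PyTorch sketch (hypothetical checkpoint layout, not this project's actual code):

```python
import torch

model = torch.nn.Linear(4, 4)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
sched = torch.optim.lr_scheduler.StepLR(opt, step_size=1, gamma=0.5)

# Train for a few steps; the schedule decays the lr each step.
for _ in range(3):
    opt.step()
    sched.step()

ckpt = {
    "model": model.state_dict(),
    "optimizer": opt.state_dict(),
    "lr_schedule": sched.state_dict(),  # the part that is often forgotten
}

# --- resume in a fresh process ---
model2 = torch.nn.Linear(4, 4)
opt2 = torch.optim.SGD(model2.parameters(), lr=0.1)
sched2 = torch.optim.lr_scheduler.StepLR(opt2, step_size=1, gamma=0.5)
model2.load_state_dict(ckpt["model"])
opt2.load_state_dict(ckpt["optimizer"])
sched2.load_state_dict(ckpt["lr_schedule"])

# lr continues from where the schedule left off instead of restarting
assert sched2.get_last_lr() == sched.get_last_lr()
```

If only the model and optimizer are restored, the scheduler starts from step 0 and the resumed lr silently diverges from the one recorded in lr_schedule.pt.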
-
(usage scenario)
```
korpora parallel \
--corpus_names aihub open_subtitles_2018 \
--output_dir path/to/train/ \
--target_lang en \
--save_each
```
```
korpora parallel \
--corpu…
lovit updated 3 years ago
-
Add multilingual corpus available from https://github.com/danielinux7/Multilingual-Parallel-Corpus