monolingual-corpora Search Results

arXivTimes/arXivTimes #631

Unsupervised Machine Translation Using Monolingual Corpora O…

## 一言でいうと教師なしで翻訳を行う試み。ソース・ターゲットでそれぞれノイズを入れた文を復元するEncoder-Decoderを作成し、翻訳結果(＋ノイズ)をターゲットのEncoderで潜在表現にしたものがソースのDecoderで復元できるよう学習する。ソース/ターゲットの潜在空間が近しくなるよう、敵対的学習のlossを加えている ![image](https://user-ima…

icoxfog417 updated 6 years ago

howardyclo/papernotes #1

Unsupervised Machine Translation using Monolingual Corpora O…

### Metadata - Authors: Guillaume Lample, Ludovic Denoyer, Marc'Aurelio Ranzato - Organization: Facebook AI Research - Conference: ICLR 2018 - Link: https://openreview.net/forum?id=rkYTTf-AZ

howardyclo updated 3 years ago

google/corpuscrawler #78

Add Wikipedia crawler ? (300+ languages)

A [quick search](https://github.com/google/corpuscrawler/search?q=wikipedia) shows you that CorpusCrawler does not crawl or use Wikipedia. I don't know Python but it seems feasible, either from scratc…

hugolpz updated 8 months ago

yukiar/distil_wic #1

parameters missing?

Hi :) I've read your paper quite interestingly, and I would like to try running the create_training_corpora_monolingual.sh, but it seems like number_of_lines_in_corpus or path_to_corpus is missing. Co…

TheoSeo93 updated 2 years ago

mozilla/firefox-translations-training #368

Snakemake pipeline is not in the right order

I have no idea how to fix this, any help or at least guidance is appreciated. And here is my current log for a new job. It seems to be trying to `Collecting translated mono src dataset` before tra…

AmitMY updated 9 months ago

Intelligent-Systems-Phystech/2018-Project-12 #1

Разобраться с кодом по машинному переводу.

Разобраться с кодом одной из реализации статьи. 1. https://github.com/IlyaGusev/UNMT 2. https://github.com/sobamchan/unsupervised-machine-translation-using-monolingual-corpora-only-pytorch 3. htt…

bahleg updated 6 years ago

mozilla/firefox-translations-training #524

[meta] Train easy to segment LTR languages

In the short term we are focusing on building up our language list by training easy to segment LTR languages, as they don't require changes to the training pipeline, and are immediately supported in F…

gregtatum updated 5 months ago

owos/afri_augs #6

Create a script to perform back translations

Back translation involves the use of monolingual data to generate more training data for MT task. A backward intermediate model is trained on the available corpora and then used to generate synthetic …

Iambusayor updated 9 months ago

google/corpuscrawler #79

Use available corpora for opensubtitles (63 languages)

### Research * J. Tiedemann, 2016, Finding Alternative Translations in a Large Corpus of Movie Subtitles. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LRE…

hugolpz updated 8 months ago

facebookresearch/LASER #49

issue with mine_bitexts.py

Hi, I am trying to mine some parallel sentences from two large monolingual corpora (over 40M sentences each). In the first step I encoded the two sides and then called `mine_bitexts.py` to do the mag…

afarajian updated 1 year ago

133 results for monolingual-corpora

133 results
for monolingual-corpora