parallel-corpus Search Results

1000+ results for parallel-corpus

clarin-eric/resource-families-issues #48

English-Luganda Parallel Corpus

http://hdl.handle.net/11372/LRT-560 - Metadata issue: - [x] Unknown licence

jakoble updated 2 months ago
1
acoli-repo/acoli-corpora #9

more parallel corpus data

- [ ] http://bilinguis.com/book/baskerville/es/en/c14/

chiarcos updated 3 months ago
1
mozilla/translations #905

Limit the amount of data used for distillation

In #771 I ran an experiment to see the effects of the size of the distillation corpus for the change in the COMET score for the students. Adding more data to this step did not affect the COMET score b…

gregtatum updated 6 days ago
3
thammegowda/mtdata #80

Add parallel bible corpus

First appeared here in https://aclanthology.org/L14-1215/ which references link: http://paralleltext.info/data/ but that link is no longer available. However, recently https://arxiv.org/pd…

thammegowda updated 2 years ago
1
clarin-eric/resource-families-issues #56

The Norwegian-Spanish Parallel Corpus

http://hdl.handle.net/11509/73 - Metadata issue: - [ ] Unclear alignment/annotation

jakoble updated 3 years ago
2
clarin-eric/resource-families-issues #50

Polish-Bulgarian-Russian Parallel Corpus

https://hdl.handle.net/11321/308 - Metadata issues: - [x] Unknown size - [ ] Unclear alignment/annotation - [x] Unknown licence

jakoble updated 3 years ago
1
google/clusterfuzz #2413

About corpus of parallel fuzzing

I think this strategy is also good for clusterfuzz. https://github.com/google/fuzzbench/pull/1197#issuecomment-880810941 https://www.fuzzbench.com/reports/experimental/2021-08-05-parallel/index.htm…

gtt1995 updated 3 years ago
2
danielinux7/anana #15

[NMT] Parallel Corpus clean-up

**Discussion** The current parallel corpus has been extracted from various sources (ebooks, websites...) **Difficulties** The sentences are automatically lined up. We come across these issues…

danielinux7 updated 2 years ago
3
mozilla/translations #915

Reduce monolingual data for en-lt to investigate distillatio…

In #771 I tested the effects of reducing the distillation data to understand that expensive part of our pipeline. However, we should do it again for the `base` student model, as the other one was done…

gregtatum updated 1 hour ago
1
inception-project/inception #4706

Support Pharaoh format for parallel corpus alignment

For example, [awesome-align](https://github.com/neulab/awesome-align) supports generating word-by-word parallel corpus alignments, i.e. Pharaoh format files. Or can we even achieve this in the cur…

fishfree updated 6 months ago
1
