issues
Helsinki-NLP/OPUS-ingest
4
stars
0
forks
add Interlingua datasets
#36
jorgtied
opened
1 month ago
0
add Kurdish BLARK dataset
#35
jorgtied
opened
1 month ago
0
new version of JParaCrawl
#34
jorgtied
opened
3 months ago
0
Multi-Parallel Corpus of North Levantine Arabic
#33
jorgtied
opened
9 months ago
0
Add datasets from https://github.com/Softcatala/nmt-softcatala
#32
marco-c
opened
10 months ago
0
Add CoVoST dataset
#31
marco-c
opened
10 months ago
0
macocu datasets
#30
jorgtied
closed
10 months ago
1
chore: add dependency on pybind11-dev
#29
SethFalco
closed
11 months ago
0
datasets collected in NLLB from various sources
#28
jorgtied
opened
1 year ago
0
ELRA-W0232 is empty
#27
kpu
opened
1 year ago
1
add TALPCo dataset
#26
jorgtied
opened
1 year ago
0
Add en-th dataset
#25
jorgtied
closed
11 months ago
1
MDN Web Docs
#24
graemenail
closed
11 months ago
1
chores: clean up repo
#23
SethFalco
closed
1 year ago
0
feat: add tldr-pages corpus
#22
SethFalco
closed
11 months ago
4
There are two template files for each type
#21
SethFalco
closed
1 year ago
1
NLLB dataset
#20
jorgtied
closed
11 months ago
1
bug: unable to clone all submodules
#19
SethFalco
closed
1 year ago
1
Add CLUVI corpus for Galician>Spanish, English
#18
onadegibert
opened
1 year ago
0
wmt21 multilingual data set
#17
jorgtied
opened
1 year ago
0
LoResMT data sets
#16
jorgtied
opened
1 year ago
0
gourmet swahili english does not show in opus api
#15
jorgtied
closed
1 year ago
2
Add Multilingual corpus of Caucasian languages
#14
jorgtied
opened
1 year ago
2
alignments missing?
#13
jorgtied
closed
1 year ago
1
JW300 alignment problems
#12
jorgtied
closed
1 year ago
1
Invalid xml
#11
miau1
opened
5 years ago
0
improve makefiles
#10
jorgtied
opened
5 years ago
0
release filtered/unfiltered commoncrawl and rapid corpus
#9
jorgtied
closed
1 year ago
1
add mediawiki translation corpus
#8
jorgtied
closed
5 years ago
0
UD compatible pre-processing
#7
jorgtied
closed
1 year ago
1
update parsed data
#6
jorgtied
closed
1 year ago
1
better use of disk space and temp directories
#5
jorgtied
closed
1 year ago
0
multiparallel and updated ParaCrawl corpus
#4
jorgtied
closed
1 year ago
0
update various outdated corpora
#3
jorgtied
closed
1 year ago
0
update tatoeba
#2
jorgtied
closed
1 year ago
0
add new UN corpus
#1
jorgtied
closed
1 year ago
2