issues
search
Helsinki-NLP
/
OpusTools
67
stars
17
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix opusfilter interface
#43
svirpioj
closed
1 month ago
0
Problems in OpusRead interface with moses preprocessing
#42
svirpioj
closed
1 month ago
0
Cannot download resource due to `DH_KEY_TOO_SMALL`
#41
cgbahk
opened
7 months ago
0
change in OPUS yaml files
#40
jorgtied
closed
2 months ago
1
Spaces before punctation marks on opus_read output
#39
keith555
opened
1 year ago
1
Recreate sample files shown in OpenSubtitles corpus
#38
keith555
closed
1 year ago
1
Add yield tuple write mode
#37
larrylawl
opened
1 year ago
0
support search with 3-letter language codes or BCP-47
#36
jorgtied
opened
1 year ago
0
DB for off-line search
#35
jorgtied
closed
1 year ago
1
fix ZeroDivisionError bug in progress printing
#34
svirpioj
closed
1 year ago
0
Bump numpy from 1.16.4 to 1.22.0
#33
dependabot[bot]
opened
2 years ago
0
opus_read fails to extract CCMatrix
#32
Waino
opened
2 years ago
3
opus_get downloads *all* corpora with just the -s switch
#31
dumitrescustefan
closed
2 years ago
1
convert newlines to spaces when outputting to moses formats
#30
svirpioj
closed
2 years ago
0
Is it possible to download all corpus associate with the given language pair?
#29
BrightXiaoHan
closed
2 years ago
1
Misleading logging information in opus_express
#28
aarnetalman
closed
2 years ago
1
Add progress indicator to opus_express
#27
aarnetalman
closed
2 years ago
2
Using opus_read with -az, -sz, -tz options
#26
pluiez
closed
3 years ago
1
malformed tmx from opus_read
#25
keith555
closed
2 years ago
1
Format of downloaded files does not match the format expected by opus_read
#24
keith555
closed
2 years ago
1
What is the tokenizer for all languages?
#23
SefaZeng
opened
3 years ago
0
Missing alignment data for English(en) - Oromo(om)?
#22
ashaltu
closed
3 years ago
1
Memory Issue: opus_read fails to extract MultiCCAligned
#21
aflueckiger
closed
2 years ago
1
Where are the missing language pairs?
#20
icaswell
opened
3 years ago
1
PyPI wheel includes old files
#19
compwiztobe
closed
2 years ago
2
Alignment problem with JW300 corpora?
#18
sklampfl
opened
3 years ago
1
opus_express without confirmation?
#17
ZJaume
closed
3 years ago
1
opus_express not checking correctly root directory
#16
ZJaume
closed
3 years ago
1
fix opus_express shuffle broken due to missing import
#15
ymyt
closed
3 years ago
0
create_bash_script.py : this file enable to create command lines for …
#14
Sohyo
closed
3 years ago
0
List of datasets | Monolingual raw files
#13
jchwenger
closed
4 years ago
5
Opus_read: SentenceParserError
#12
Stamenov
closed
4 years ago
5
Query to get list of existing corpora (by language)
#11
dumitrescustefan
closed
4 years ago
1
Automatically switch token delimiter for languages not using whitespace
#10
pks
closed
4 years ago
3
problem with os.rename in opus_langid
#9
jorgtied
closed
2 years ago
1
OPUS returns no data
#8
george-roussos
closed
4 years ago
2
Question: monolingual dialogs (Finnish language)
#7
remotejob
closed
4 years ago
2
preserve inline tags
#6
jorgtied
closed
5 years ago
1
Can you adjust the paths so that the users don't have to manually enter "y" and the download will begin automatically? Thanks!
#5
miau1
closed
5 years ago
1
not sure if I can ask questions here, but I got stuck on this when I try to download a TMX from the OPUS JW300 set. It has nothing to do with the set, I think.
#4
miau1
closed
5 years ago
11
implement opus-filter
#3
jorgtied
closed
4 years ago
5
option for adding document boundaries
#2
jorgtied
closed
5 years ago
1
tool for language identification
#1
jorgtied
closed
5 years ago
3