rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
MIT License
2.18k stars
464 forks
issues
Use a single regex match with optional operator
#70
alvations
closed
5 years ago
3
re.split can catch groups and save the delimiter
#69
alvations
closed
5 years ago
3
how to restore the original encoding from BPE encoding after translation?
#68
shshen-closer
closed
5 years ago
0
Vocabulary size / convention
#67
Kyubyong
closed
5 years ago
1
UnicodeEncodeError when using subword-nmt learn-bpe with verbose mode
#66
leoxiao2012
closed
5 years ago
1
Skip special tokens
#65
voidmagic
closed
5 years ago
2
Post Processing
#64
jigyasa06
closed
5 years ago
1
encoding issue?
#63
kmario23
closed
6 years ago
2
pass `total_symbols` to learn_bpe
#62
bastings
closed
6 years ago
1
Number of merge operations
#61
Anupama94
closed
6 years ago
2
Fix best practice instruction
#60
bastings
closed
6 years ago
2
Removing the BPE
#59
kpiyush16
closed
6 years ago
0
Add tqdm in learn_bpe
#58
universome
closed
6 years ago
1
enable unicode separator/glossaries in cli
#57
jsenellart
closed
6 years ago
3
Extending --glossaries to handle regex
#56
Proyag
closed
6 years ago
1
apply_bpe gives fewer segments than before
#55
hsajjad
closed
6 years ago
2
Subtract characters
#54
phikoehn
closed
6 years ago
3
Chinese word segmentation
#53
parahaoer
closed
6 years ago
7
Improve library usability
#52
lfurrer
closed
6 years ago
1
Pip version bug - ModuleNotFoundError: No module named 'learn_bpe'
#51
sai-prasanna
closed
6 years ago
1
Adding to pypi
#50
mjpost
closed
6 years ago
1
How to use 'glossaries' field?
#49
mzeidhassan
closed
6 years ago
6
apply_bpe.py doubles empty lines
#48
emjotde
closed
6 years ago
2
Continue processing if the bpe codes file has some invalid lines
#47
lovasoa
closed
6 years ago
2
learn_bpe.py generates an invalid bpe file
#46
lovasoa
closed
6 years ago
4
IndexError: tuple index out of range
#45
yapingzhao
closed
6 years ago
4
byte pair encoding
#44
jigyasa06
closed
6 years ago
2
support
#43
bhagat02
closed
6 years ago
2
Apply.bpe is giving duplicates in Vocabulary result file (German-English)
#42
mohammedayub44
closed
6 years ago
3
New version of subword-nmt can't handle certain sentences
#41
edunov
closed
6 years ago
1
Fixing the utf-8 related error
#40
goswamig
closed
6 years ago
2
Fix a bug when second time reading file breaks version
#39
maksymbevza
closed
6 years ago
2
apply_bpe.py repeats last character twice (if not EOL symbol)
#38
maksymbevza
closed
6 years ago
3
Packed library into package
#37
universome
closed
6 years ago
4
Test case fails on fresh clone of repo.
#36
se4u
closed
6 years ago
0
add up repeated entries with --dist-input
#35
obo
closed
6 years ago
0
Fixes
#34
ozancaglayan
closed
6 years ago
0
no pair has frequency >= 2. Stopping
#33
yapingzhao
closed
6 years ago
2
learn_joint_bpe_and_vocab: Fix parameter passing
#32
ozancaglayan
closed
6 years ago
0
Option to apply fewer BPE operations than learned
#31
Proyag
closed
6 years ago
0
Subword for Arabic
#30
kmario23
closed
6 years ago
1
apply_bpe.py produces extra lines on specific file / codecs module
#29
gugray
closed
6 years ago
8
Is it suitable for any language?
#28
atricfox
closed
7 years ago
2
Serious bug in apply_bpe (mutable default parameter)
#27
sklampfl
closed
7 years ago
2
BPE and FST
#26
loretoparisi
closed
7 years ago
2
results on newstest2014
#25
apeterswu
closed
7 years ago
3
README typo
#24
karishmamalkan
closed
7 years ago
1
chmod +x apply_bpe.py
#23
jvdbogae
closed
7 years ago
1
Feat/glossaries
#22
dmesq
closed
7 years ago
3
Adding some documentation to the statistics updates
#21
alvations
closed
7 years ago
1