bitextor/bicleaner
Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
GNU General Public License v3.0
150 stars
22 forks
issues
#81 Bump scikit-learn from 1.1.3 to 1.5.0 (dependabot[bot], opened 4 months ago, 0 comments)
#80 Conda package not up-to-date (OrianeN, closed 9 months ago, 5 comments)
#79 Cannot install bicleaner due to hunspell error (bhaddow, closed 10 months ago, 11 comments)
#78 Installing Bicleaner in Colab (AlexSkrn, closed 1 year ago, 2 comments)
#77 Issues with bicleaner-classify (dbradley407, closed 1 year ago, 6 comments)
#76 Tests don't pass (onadegibert, closed 1 year ago, 2 comments)
#75 Bump numpy from 1.21.6 to 1.22.0 (dependabot[bot], closed 1 year ago, 3 comments)
#74 Bump requirements (specially scikit-learn) (lpla, closed 1 year ago, 0 comments)
#73 Specific Tokenizer is not working (jgcb00, closed 2 years ago, 2 comments)
#72 Non-deterministic training when --seed is provided (cgr71ii, closed 2 years ago, 5 comments)
#71 Bicleaner models for Paracrawl bonus languages (MaximumEntropy, closed 2 years ago, 3 comments)
#70 Conda build (cgr71ii, closed 2 years ago, 0 comments)
#69 ModuleNotFoundError: No module named 'pycld2' (cgr71ii, closed 2 years ago, 2 comments)
#68 Tests don't pass (cgr71ii, closed 2 years ago, 2 comments)
#67 Bicleaner 0.15 (ZJaume, closed 2 years ago, 0 comments)
#66 Improving probabilistic dictionary (jgcb00, closed 2 years ago, 2 comments)
#65 Does disable_hardrules also disable_lm_filter ? (jgcb00, closed 2 years ago, 5 comments)
#64 Add headers to input and output files (cgr71ii, closed 2 years ago, 0 comments)
#63 Performance improvement of bicleaner (hlhlpfp, closed 2 years ago, 1 comment)
#62 Model Architecture (hlhlpfp, closed 2 years ago, 1 comment)
#61 Can't understand what third column(numbers generated when testing bicleaner) (hlhlpfp, closed 2 years ago, 1 comment)
#60 Performance evaluation (hlhlpfp, closed 3 years ago, 1 comment)
#59 Error while installing KenLM (hlhlpfp, closed 3 years ago, 7 comments)
#58 Probabilistic dictionaries (hlhlpfp, closed 3 years ago, 6 comments)
#57 Training Corpus (hlhlpfp, closed 3 years ago, 10 comments)
#56 scipy needs to be upgraded to 1.7.1, and joblib to 1.0.1 (lpla, closed 3 years ago, 4 comments)
#55 dict_pruner.py freezes (nataliamakhamalkina, closed 3 years ago, 6 comments)
#54 Bump pyyaml from 5.1.2 to 5.4 (dependabot[bot], closed 3 years ago, 0 comments)
#53 Issue with training a bicleaner model (jgcb00, closed 3 years ago, 16 comments)
#52 bicleaner-classify cannot serialize '_io.TextIOWrapper' (pmoda, closed 3 years ago, 3 comments)
#51 Bicleaner consumes all available memory (cgr71ii, closed 3 years ago, 10 comments)
#50 Porn sentences through bicleaner (jgcb00, closed 3 years ago, 3 comments)
#49 Buffered tokenizer (kpu, closed 3 years ago, 3 comments)
#48 OSError: Cannot read model; lmplz: not found (xpertasks, closed 4 years ago, 6 comments)
#47 Miquel changes to Bicleaner 0.15 (mbanon, closed 4 years ago, 0 comments)
#46 Missing strip() causes inconsistent results (zuny26, closed 4 years ago, 2 comments)
#45 Typo in ‘train your model’ (djshowtime, closed 4 years ago, 1 comment)
#44 Bicleaner 0.14 (mbanon, closed 4 years ago, 0 comments)
#43 Training failed with the -S and -T options. (cidrugHug8, closed 4 years ago, 2 comments)
#42 Training failed. (cidrugHug8, closed 4 years ago, 4 comments)
#41 Sacremoses (mbanon, closed 4 years ago, 0 comments)
#40 Replacing mosestokenizer with sacremoses (mbanon, closed 4 years ago, 0 comments)
#39 Use word replacement and cutting sentences noise function (ZJaume, closed 4 years ago, 0 comments)
#38 Porn removal (ZJaume, closed 4 years ago, 0 comments)
#37 Spco refinements (mbanon, closed 4 years ago, 0 comments)
#36 Classifier and feature improvements [WIP] (ZJaume, closed 4 years ago, 2 comments)
#35 Consider sacremoses (kpu, closed 4 years ago, 5 comments)
#34 split the dictionary by tabs (Bachstelze, closed 4 years ago, 3 comments)
#33 OSError: Cannot read model (Bachstelze, closed 4 years ago, 6 comments)
#32 Hardrule for porn videos (ZJaume, closed 4 years ago, 1 comment)