issues
search
SYSTRAN
/
fuzzy-match
Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.
MIT License
45
stars
8
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fix systran/fuzzymatch docker image: Boost 1.74
#70
monsieurzhang
closed
4 months ago
0
ST5107: add segment alphabet for Thai and Burmese
#69
monsieurzhang
closed
4 months ago
0
ST-5107: support Thai and Burmese
#68
monsieurzhang
closed
4 months ago
1
update cpp standard + new way of linking with find_package
#67
minhthuc2502
opened
10 months ago
0
indication of bm25 branch in readme
#66
Maxwell1447
closed
1 year ago
0
Incremental index
#65
tihanyi
opened
1 year ago
2
Fix compilation warnings
#64
guillaumekln
closed
1 year ago
0
Update actions/checkout to v3 to silence the warning
#63
guillaumekln
closed
1 year ago
0
Remove options that break the compilation
#62
guillaumekln
closed
1 year ago
0
Bm25
#61
Maxwell1447
closed
1 year ago
0
Use test files from the current branch, not master
#60
guillaumekln
closed
1 year ago
0
removed irrelevant filter
#59
Maxwell1447
closed
1 year ago
0
Irrelevant filter
#58
Maxwell1447
closed
1 year ago
0
Custom edit costs + contrastive retrieval
#57
Maxwell1447
closed
1 year ago
0
Pin the Ubuntu version in the CI job
#56
guillaumekln
closed
1 year ago
0
refs RB-327: PN9 integration for Windows environment
#55
duydq12
closed
1 year ago
0
Production experience?
#54
aehlke
closed
1 year ago
1
Add include<stdexcept>, necessary to use std::invalid_argument, for s…
#53
ClementChouteau
closed
2 years ago
0
Remove the OpenNMT Tokenizer include from the public header
#52
guillaumekln
closed
2 years ago
0
Use pre-built NFC normalizer from ICU
#51
guillaumekln
closed
2 years ago
0
Remove non needed class declaration
#50
guillaumekln
closed
2 years ago
0
Simplify compare_ngrams function
#49
guillaumekln
closed
2 years ago
0
Remove mutex and unused methods in VocabIndexer
#48
guillaumekln
closed
2 years ago
0
Fix debug build and add this build type in the CI
#47
guillaumekln
closed
2 years ago
0
Factorize IDF penalty computation
#46
guillaumekln
closed
2 years ago
0
Cleanup Costs structure
#45
guillaumekln
closed
2 years ago
0
Iterate on sentences from longest to shortest match
#44
guillaumekln
closed
2 years ago
0
Use a linear search to find pattern words in the sentence
#43
guillaumekln
closed
2 years ago
0
Update the cost upper bound to return early from edit_distance
#42
guillaumekln
closed
2 years ago
0
Only resolve the longest match when registering suffixes
#41
guillaumekln
closed
2 years ago
0
Clarify attributes name of Subseq structure
#40
guillaumekln
closed
2 years ago
0
Cleanup SuffixArray getters
#39
guillaumekln
closed
3 years ago
0
Remove unused method SuffixArray::clear()
#38
guillaumekln
closed
3 years ago
0
Replace comparison struct by a lambda function
#37
guillaumekln
closed
3 years ago
0
Remove unnecessary SuffixArray constructors and destructor
#36
guillaumekln
closed
3 years ago
0
Remove unused constructor argument
#35
guillaumekln
closed
3 years ago
0
Use a custom integer hash function for mapping sentence IDs
#34
guillaumekln
closed
3 years ago
0
Fix and clarify the marking of matched words in the pattern
#33
guillaumekln
closed
3 years ago
0
Simplify register_ranges method
#32
guillaumekln
closed
3 years ago
0
Remove intermediate ngram vector
#31
guillaumekln
closed
3 years ago
2
Remove duplicated suffix range
#30
guillaumekln
closed
3 years ago
0
Make max_tokens_in_pattern configurable
#29
guillaumekln
closed
3 years ago
0
Set ios_base::open_mode to binary in import_binarized_fuzzy_matcher
#28
panosk
closed
3 years ago
0
Fix compilation with Boost 1.58
#27
guillaumekln
closed
3 years ago
0
Move min_seq_len to NGramMatches constructor.
#26
ClementChouteau
closed
3 years ago
2
Reduce number of calls to edit_distance_char
#25
guillaumekln
closed
3 years ago
0
Improve reliability with fuzz testing
#24
ClementChouteau
opened
3 years ago
0
Use linear time algorithm for suffix array construction
#23
ClementChouteau
opened
3 years ago
0
Small optimizations and cleanup in edit distance functions
#22
guillaumekln
closed
3 years ago
1
Reserve vector in VocabIndexer::getIndex
#21
guillaumekln
closed
3 years ago
0
Next