issues
search
rth
/
vtext
Simple NLP in Rust with Python bindings
Apache License 2.0
147
stars
11
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Bump scipy from 1.4.1 to 1.10.0 in /ci
#86
dependabot[bot]
opened
1 year ago
0
Bump numpy from 1.17.3 to 1.22.0 in /ci
#85
dependabot[bot]
opened
2 years ago
0
Update dependencies
#84
rth
opened
3 years ago
0
Treebank word tokenizer from NLTK
#83
rth
opened
4 years ago
0
Feature/kskip-ngram
#82
joshlk
opened
4 years ago
10
Use approx create for tests
#81
rth
closed
4 years ago
0
Fine-tune tokenizers
#80
rth
opened
4 years ago
0
Standardize language option
#79
rth
opened
4 years ago
0
Add StopWordFilter
#78
rth
opened
4 years ago
3
Fix clippy warnings
#77
rth
closed
4 years ago
2
Improve error handling
#76
rth
closed
4 years ago
0
Renamed `UnicodeSegmentTokenizer` to `UnicodeWordTokenizer`.
#75
rth
closed
4 years ago
0
Add CHANGELOG.md
#74
rth
closed
4 years ago
0
Add pickling support for Python tokenizers
#73
rth
closed
4 years ago
0
Rename UnicodeSegmentTokenizer to UnicodeWordTokenizer
#72
rth
closed
4 years ago
1
Sentence tokenizers benchmarks
#71
rth
closed
4 years ago
0
Punctuation sentence tokenizer
#70
joshlk
closed
4 years ago
9
Update to PyO3 0.10 and rust-numpy 0.9
#69
rth
closed
4 years ago
0
Update rust version used in CI
#68
rth
closed
4 years ago
0
Sentence tokenization using Unicode segmentation (Python package)
#67
joshlk
closed
4 years ago
3
Sentence tokenization using Unicode segmentation
#66
joshlk
closed
4 years ago
3
MAINT Build wheels for Python 3.8
#65
rth
closed
4 years ago
0
MAINT Update dependencies
#64
rth
closed
4 years ago
0
Make to_ascii_lowercase optional
#63
technic
opened
4 years ago
4
BLD Build for the wasm target
#62
rth
closed
4 years ago
1
MAINT Make rayon dependency optional
#61
rth
closed
4 years ago
0
PY Implement get_params methods for tokenizers
#60
rth
closed
4 years ago
0
MNT Update dependencies versions
#59
rth
closed
4 years ago
0
TST Use hypothesis in python tests
#58
rth
closed
4 years ago
0
API Set parameters with the builder pattern
#57
rth
closed
4 years ago
0
Update to PyO3 0.7
#56
rth
closed
5 years ago
3
Parallel CountVectorizer
#55
rth
closed
5 years ago
0
TST add float_cmp crate for tests
#54
jbowles
closed
4 years ago
1
Tokenizers dispatch in vectorizers
#53
rth
closed
5 years ago
1
General architecture feedback
#52
rth
opened
5 years ago
2
Add sentence splitter
#51
rth
closed
4 years ago
8
Better support of configuration parameters in vectorizers
#50
rth
closed
4 years ago
2
ENH Improve CountVectorizer performance
#49
rth
closed
5 years ago
0
Add tokenizer trait
#48
rth
closed
5 years ago
2
Migrate to PyO3 0.6.0
#47
rth
closed
5 years ago
0
ENH Avoid copying tokens in tokenizers in Python
#46
rth
closed
4 years ago
1
Add CharacterTokenizer
#45
rth
closed
5 years ago
0
Relicense under Apache license 2.0
#44
rth
closed
5 years ago
0
Add Levenshtein Edit distance
#43
rth
closed
5 years ago
0
DOC Add function signatures
#42
rth
closed
5 years ago
0
ENH Jaro Winkler similarity
#41
rth
closed
5 years ago
0
Character n-grams
#40
rth
opened
5 years ago
4
Add Jaro similarity
#39
rth
closed
5 years ago
0
Add Sørensen-Dice string similarity
#38
rth
closed
5 years ago
0
Update python readme and rename python package
#37
rth
closed
5 years ago
0
Next