issues
search
Bergvca
/
string_grouper
Super Fast String Matching in Python
MIT License
364
stars
76
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Feature/upgrade topn
#98
icfly2
opened
6 months ago
0
Update string_grouper.py for pandas
#97
crc32
opened
9 months ago
0
value too large to convert to int , help, How can just a few tens of thousands of strings not work? TKS ~~
#96
honglitao
opened
10 months ago
2
Ensure requirements are more explicit to create reproducible builds
#95
robinknox
closed
1 year ago
0
Change applymap to map to remove "FutureWarning: DataFrame.applymap h…as been deprecated. Use DataFrame.map instead."
#94
robinknox
closed
1 year ago
0
sparse_dot_topn_for_blocks dependency broken with Cython 3.0
#93
alexiskaldany
opened
1 year ago
8
Ngram re-use
#92
hyshandler
opened
1 year ago
1
Module compiled against API version 0x10 but this version of numpy is 0xf
#91
jpswagner
closed
1 year ago
1
Installation not working: Massive error message
#90
MatthewMishek
opened
1 year ago
3
Import Error
#89
ZhimingMei
closed
1 year ago
4
Tips for working with large datasets
#88
ryangdar
opened
2 years ago
1
How to cite this work?
#87
skwolvie
opened
2 years ago
1
Unable to allocate 8.41 GiB for an array with shape (2258174000,) and data type int32
#86
BudiZhu
opened
2 years ago
0
Numpy version issue?
#85
arnicas
opened
2 years ago
2
Some general questions about the package
#84
eneszv
opened
2 years ago
0
added ability to cache matrix in queries across which `master` is constant
#83
ParticularMiner
opened
2 years ago
8
updated CHANGELOG.md to contain certain missing notable changes
#82
ParticularMiner
closed
2 years ago
1
Different matching behavior across versions
#81
probablyfine
closed
2 years ago
9
Question about version string_grouper group_similar_strings
#80
dariswan
opened
2 years ago
4
using string_grouper with a lookup column in orginal source data
#79
campbellalex321
closed
2 years ago
13
able to change default cosine similarity of .8?
#78
campbellalex321
closed
2 years ago
2
Could not install string-grouper
#77
campbellalex321
closed
2 years ago
2
Question / suggestion to use multiple n-grams to get more features
#76
iibarant
opened
3 years ago
0
Formula for optimal matrix block-size
#75
ParticularMiner
opened
3 years ago
0
added guesstimate for n_blocks
#74
ParticularMiner
closed
3 years ago
4
Error When matching Chinese name
#73
ZhihaoMa
opened
3 years ago
9
added blocking capabilities
#72
ParticularMiner
closed
3 years ago
18
Jupyter Notebook installation not working
#71
junotown
opened
3 years ago
1
Do the groups returned by String_Grouper follow the transitive property?
#70
bmanczak
closed
3 years ago
2
Question: How to have built StringGrouper corpus persist across multiple match_string calls in a programming session
#69
justasojourner
opened
3 years ago
5
Error when importing
#68
okkyadhi
opened
3 years ago
4
raise StringLengthException if vectoriser is applied to strings that …
#67
gw00207
opened
3 years ago
6
how to handle 'ValueError: empty vocabulary; perhaps the documents only contain stop words' in group_similar_strings
#66
gw00207
opened
3 years ago
2
match_string on small data series
#65
berndnoll
closed
3 years ago
2
Question: How to achieve matching for multiple fields and priorities
#64
berndnoll
opened
3 years ago
81
Get an error message while running match_string
#63
iibarant
opened
3 years ago
7
included string_grouper_utils package specification in setup.py (bug fix)
#62
ParticularMiner
closed
3 years ago
1
[question] How to import string_grouper_utils?
#61
MauKruisheer
opened
3 years ago
5
set max_n_matches=1 in match_most_similar() for a significant performance boost
#60
ParticularMiner
closed
3 years ago
0
Installation problem
#59
skwolvie
closed
3 years ago
2
updated dependency on new version of sparse_dot_topn (v0.3.1)
#58
ParticularMiner
closed
3 years ago
6
[question] Partial matching of strings
#57
andrei-volkau
opened
3 years ago
2
created a more user-friendly error message when bad data is found
#56
ParticularMiner
opened
3 years ago
2
integrated external package sparse_dot_topn into string_grouper as dedicated sub-package string_grouper_topn
#55
ParticularMiner
closed
3 years ago
2
updated string_grouper_utils.py to quell unittest deprecated warnings
#54
ParticularMiner
closed
3 years ago
0
optimized code and fixed bug related to input Series of length 1 (discovered by user)
#53
ParticularMiner
closed
3 years ago
4
group_similar_strings with series length 1 produces ValueError
#52
gw00207
closed
3 years ago
2
boosted _symmetrize_matches_list() (5x) and _get_matches_list() (33x)
#51
ParticularMiner
closed
3 years ago
1
fixed pypi README.md issues
#50
ParticularMiner
closed
3 years ago
1
[question] Does string_grouper is for English only?
#49
andrei-volkau
closed
3 years ago
2
Next