dedupeio/dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
https://docs.dedupe.io
MIT License
4.15k stars, 551 forks
issues
Add support for record id of bytes type
#1152
lmores
opened
1 year ago
2
Bump pypa/cibuildwheel from 2.12.0 to 2.12.1
#1151
dependabot[bot]
closed
1 year ago
2
Can't import 'dedupe_dataframe' because of numpy
#1150
jordy-moddit
closed
1 year ago
1
Doc update to add reference for cluster scoring
#1148
jaime-varela
closed
1 year ago
0
Change predicate function signatures
#1147
lmores
closed
1 year ago
13
Improve cpredicates.pyx
#1145
lmores
closed
1 year ago
12
extend index predicates to whole model
#1144
fgregg
opened
1 year ago
0
New Index Predicate types using Embeddings
#1143
fgregg
opened
1 year ago
0
fix: loading training pairs from an existing training.json file
#1142
regel
opened
1 year ago
0
Change IDF formula
#1141
lmores
opened
1 year ago
5
purify id types
#1138
fgregg
closed
1 year ago
2
dedupe Is Not Opening Question window To Provide Label
#1137
ragvendra3898
closed
1 year ago
4
Improve typing of Data
#1136
fgregg
closed
1 year ago
0
Bump pypa/cibuildwheel from 2.11.3 to 2.12.0
#1135
dependabot[bot]
closed
1 year ago
1
Lmores fix/console label
#1134
fgregg
closed
1 year ago
1
Fix levenshtein search dep
#1133
fgregg
closed
1 year ago
1
back up dependency for Levenshtein-search to 1.4.4
#1132
fgregg
closed
1 year ago
0
Update pythonpackage.yml
#1131
fgregg
closed
1 year ago
0
Bump levenshtein-search from 1.4.5 to 1.4.6
#1130
dependabot[bot]
closed
1 year ago
5
Installation breaks because Levenshtein_search version 1.4.5 is no more listed on PyPi
#1129
fsal
closed
1 year ago
5
Levenshtein_search GPL 3-licensed Revisited
#1128
sarmohamed
closed
1 year ago
4
Fix console_label()
#1127
lmores
closed
1 year ago
0
About Inverse Document Frequency implementation
#1126
lmores
closed
1 year ago
1
About CanopyIndex implementation
#1125
lmores
closed
1 year ago
1
Bump pypa/cibuildwheel from 2.11.3 to 2.11.4
#1124
dependabot[bot]
closed
1 year ago
2
Bump pypa/cibuildwheel from 2.11.2 to 2.11.3
#1123
dependabot[bot]
closed
1 year ago
1
Reference plugin variables using "module:class" strings
#1122
NickCrews
closed
5 months ago
4
setuptools plugin solution for variables
#1121
fgregg
closed
5 months ago
9
Bump dessant/lock-threads from 3 to 4
#1120
dependabot[bot]
closed
1 year ago
1
raise BlockingError( dedupe.core.BlockingError: No records have been blocked together. Is the data you are trying to match like the data you trained on? If so, try adding more training data.
#1119
sowmyahnstreamforce
closed
1 year ago
0
PermissionError: [WinError 32] The process cannot access the file because it is being used by another process
#1118
paulmakeraiimi
closed
1 year ago
3
memory leak
#1117
Pobby321
closed
1 year ago
2
memory leak
#1116
Pobby321
closed
2 years ago
0
Fix incompatible change for random.sample() in python 3.11
#1115
lmores
closed
1 year ago
2
Unmap file to prevent PermissionError when deleting temp file on Win
#1114
PaulM5406
closed
1 year ago
3
Is incremental clustering supported?
#1113
lmores
closed
1 year ago
3
Bump pypa/cibuildwheel from 2.10.1 to 2.11.2
#1112
dependabot[bot]
closed
1 year ago
0
Dedupe prepare_training() error for more than 5K records
#1111
SantyGator
closed
1 year ago
1
Syntax Error in dedupe/api.py
#1110
sushantpatil99
closed
2 years ago
1
Bump pypa/cibuildwheel from 2.10.1 to 2.11.1
#1109
dependabot[bot]
closed
2 years ago
2
ValueError in `numpy.concatenate` during active labeling in Record Linkage and Gazeteer examples
#1108
manusturla
closed
2 years ago
2
udpate readme for cloning repo
#1107
f-hafner
closed
2 years ago
1
ValueError: Iteration of zero-sized operands is not enabled
#1106
Pythonspoofer
closed
2 years ago
6
Bump pypa/cibuildwheel from 2.10.1 to 2.10.2
#1105
dependabot[bot]
closed
2 years ago
2
Can we overhaul internals of Variables
#1104
NickCrews
opened
2 years ago
1
Blocking as a feature for scoring
#1103
fgregg
opened
2 years ago
1
Prep DataModel for removal
#1102
NickCrews
opened
2 years ago
2
Remove usage of DataModel from core.py and labeler.py
#1101
NickCrews
closed
2 years ago
1
Update numpy requirement to >=1.20
#1100
benmanns
closed
2 years ago
1
Bump pypa/cibuildwheel from 2.9.0 to 2.10.1
#1099
dependabot[bot]
closed
2 years ago
1