issues
search
kosukeimai
/
fastLink
R package fastLink: Fast Probabilistic Record Linkage
272
stars
48
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
R aborts during linkage of two files when n rows > 25k
#86
wbakerrobinson
opened
2 months ago
0
Using a pre-existing EM object does not work unless all comparison levels are present
#85
zmbc
opened
2 months ago
0
Subscript out of bounds error while deduping dataset
#84
wbakerrobinson
closed
2 months ago
3
Linking / deduplication with a list of possible candidates
#83
BERENZ
opened
6 months ago
5
Relaxing the conditional independence assumption?
#82
zmbc
opened
6 months ago
4
Log(X) NaNs produced
#81
kslungaardmumma
opened
6 months ago
15
Window blocking errors when the variable in `window.block` is integer
#80
etiennebacher
opened
9 months ago
0
NA handling Issue in expectation maximization function
#79
jw2249a
closed
10 months ago
1
dedupeMatches does not consider exact matches
#78
jw2249a
opened
10 months ago
2
Improving gamma.R funs
#77
jw2249a
opened
11 months ago
2
Performance (gamma*() functions)
#76
jw2249a
opened
11 months ago
3
Dealing with aliases in FastLink
#75
jkafka
opened
1 year ago
1
Cran patch
#74
bfifield
closed
1 year ago
0
Use of gc() creates a constant overhead to calling fastLink()
#73
zmbc
opened
1 year ago
3
blockData – Error: Vector memory exhausted (limit reached?)
#72
itsmevictor
opened
1 year ago
10
cut.a for stringdist.method == "lv"
#71
wbakerrobinson
closed
1 year ago
4
window blocking performance
#70
bengoehring
opened
1 year ago
1
aggconfusion development update
#69
SamShin
closed
1 year ago
2
How to return multiple matches
#68
msghankinson
closed
2 years ago
2
Seemingly odd partial matching behavior
#67
bengoehring
closed
1 year ago
5
Looking for a way to feed threshold cutoffs to individual variables
#66
ajw5296
opened
2 years ago
5
Q: Database size limit for duplicate removal?
#65
gbdias
opened
2 years ago
15
The documentation for return.all does not seem to match the function default?
#64
zross
opened
2 years ago
2
Extracting matches when using blocking
#63
jamesmartherus
closed
2 years ago
4
Using reweight.names in fastlink() returns only completely NA rows
#62
brittlh
opened
2 years ago
5
Guidance on improving chances EM algorithm will converge?
#61
zross
opened
2 years ago
9
Running time
#60
MAranzazuRU89
opened
2 years ago
5
Blocking strategy with millions of rows
#59
lucasmalherbe
closed
3 years ago
6
Run times for field comparison variables
#58
marialma
closed
3 years ago
4
Measure distance to nearest group
#57
shamahutoto
opened
3 years ago
2
NA values create error in getMatches()
#56
datafj
opened
3 years ago
7
Exact match on certain column
#55
shamahutoto
opened
3 years ago
4
nameReweight NA issue
#54
EmericA570
opened
3 years ago
1
Long runtime on sampled data
#53
emcghee73
opened
3 years ago
3
Matching Dates
#52
geneh0
opened
3 years ago
3
Custom/adjusted string comparison functions
#51
jrtran
closed
3 years ago
6
Uninformative error in fastLink during imputation
#50
emma-klein
closed
3 years ago
1
Weighting the matching fields
#49
paull71
opened
3 years ago
4
How scalable fastlink with mln rows tables?
#48
Ibrokhimsadikov
closed
3 years ago
2
Col::subvec() error with some data
#47
muranyia
opened
4 years ago
25
Running fastLink on several cores/threads on
#46
felixhaass
closed
4 years ago
3
question / documentation
#45
kalakaru
opened
4 years ago
3
Question: window blocking conditional on another variable
#44
froukehe0
opened
4 years ago
5
New feature request: Matthews correlation coefficient
#43
aalexandersson
opened
5 years ago
1
Vignette: Missing "gender" variable in the example datasets
#42
Najsztub
opened
5 years ago
1
not all patterns with NA counted?
#41
timbp
closed
5 years ago
3
[Accidentally Opened]
#40
jgaeb
closed
5 years ago
0
fixes dedupe, closes #37
#39
bfifield
closed
5 years ago
0
question - matching multiple datasets
#38
ghost
opened
5 years ago
1
getMatches() warning / deduplication
#37
felixhaass
closed
5 years ago
2
Next