-
I was attempting a simple record linkage as follows, and I noticed something very strange. When I specify the ```partial.match``` argument, the outcome of the match is different from when it is not sp…
-
for our own db we need to be able to match strings to some certain percentage
-
Hi,
First let me thank you for the great library, it is pleasure to use it.
In my work project I use DBScan class to cluster a bag of strings.
I have noticed that by default identical objects are ig…
-
Currently when executing comparison of result-sets, it's possible to apply a tolerance for numeric and dateTime values but not for string. It should be possible to support a tolerance based on an appr…
-
Some demographics needs to be phonetically standardized before hashing.
-
We are using Jaro-Winkler instead of that Damerau-Levenstain.
Is it possible to switch between these two?
-
Hello!
Can the java-string-similarity be applied to many languages such as Germany,French,Chinese and so on.Can it be is relative to language type?
-
The jaro algorithm implementation appears to be flawed,
it returns 1.0 for disctinct strings.
Adding the following test condition to
\spec\jaro-winkler_spec.js
it('should treat simple d…
jfseb updated
7 years ago
-
Join first to Who's On First based on the common GeoNames concordance and harvest Wikidata IDs from the Who's On First concordances. Verify the result by joining with OpenStreetMap and make one-off ed…
-
The search in the database is not very reliable. I already changed it to be lower-case only, which helped. Also, MetaNetX has mapping for things that are redundant, however I can only find one source …