larsga / Duke

Duke is a fast and flexible deduplication engine written in Java
Apache License 2.0

Specify which source --singlematch should keep unique #229

Open ravivmg opened 8 years ago

ravivmg commented 8 years ago

When I run record linkage with the --singlematch option, I get unique records for only one of the sources. In other words, Duke seems to produce a one-to-many match.

How do I tell Duke which of the two input files should have unique records in the output match file?

Thanks!
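
To make the one-to-many behaviour concrete, here is a minimal sketch of a post-processing step that keeps only the best match per record on a chosen side. It is not Duke's code or API: the function name `keep_best_per_side`, the `(id1, id2, confidence)` tuple layout, and the sample IDs are all assumptions made for illustration.

```python
def keep_best_per_side(matches, unique_side):
    """Keep only the highest-confidence match for each record on one side.

    matches: iterable of (id1, id2, confidence) tuples (hypothetical layout,
             not a format Duke is known to emit).
    unique_side: 0 to make source-1 IDs unique, 1 to make source-2 IDs unique.
    """
    best = {}
    for match in matches:
        key = match[unique_side]
        # Replace the stored match only if this one scores strictly higher.
        if key not in best or match[2] > best[key][2]:
            best[key] = match
    return sorted(best.values())


# Made-up sample data: a1 matches two records, and b1 is matched twice.
matches = [
    ("a1", "b1", 0.95),
    ("a1", "b2", 0.80),
    ("a2", "b1", 0.70),
]

unique_a = keep_best_per_side(matches, 0)  # each source-1 ID appears once
unique_b = keep_best_per_side(matches, 1)  # each source-2 ID appears once
```

With this sample, `unique_a` still contains `b1` twice and `unique_b` still contains `a1` twice, which is the asymmetry described above: making one side unique leaves the other side free to repeat.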