larsga / Duke

Duke is a fast and flexible deduplication engine written in Java
Apache License 2.0

No identity for records #263

Closed. xinelim closed this issue 5 years ago.

xinelim commented 5 years ago

Hi, I am encountering the following error:

Exception in thread "main" no.priv.garshol.duke.DukeException: No identity for record [CompactRecord {}]

The same commands worked well with my previous dataset. What could be causing this? Thank you!

larsga commented 5 years ago

You probably have a record with no value for the identity field. Judging from the error message (the record prints as CompactRecord {}), that record has no values at all. Duke can't work with such a record, because there's no way to identify it.
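
One way to track down such a record before running Duke is to scan the input file for rows whose identity column is missing or blank. The following is only a minimal sketch, not part of Duke: the class `EmptyIdentityCheck`, the method `findRowsWithoutId`, and the column name `ID` are hypothetical, and it assumes a plain comma-separated file with a header line and no quoted commas. Whether a blank or truncated line is what actually produced the empty record in this issue is also an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Duke: pre-checks CSV input for rows
// that have no value in the identity column.
public class EmptyIdentityCheck {

    // Returns the 1-based line numbers of data rows whose identity column
    // is missing or blank. The first line is treated as the header.
    // Assumption: simple comma-separated values without quoted commas.
    public static List<Integer> findRowsWithoutId(List<String> lines, String idColumn) {
        List<Integer> bad = new ArrayList<>();
        if (lines.isEmpty()) {
            return bad;
        }

        // Locate the identity column in the header.
        List<String> header = new ArrayList<>();
        for (String name : lines.get(0).split(",", -1)) {
            header.add(name.trim());
        }
        int idIndex = header.indexOf(idColumn);
        if (idIndex < 0) {
            throw new IllegalArgumentException("No column named " + idColumn);
        }

        // Flag rows that are too short or have a blank identity value.
        // A completely empty line is flagged too.
        for (int i = 1; i < lines.size(); i++) {
            String[] fields = lines.get(i).split(",", -1);
            if (idIndex >= fields.length || fields[idIndex].trim().isEmpty()) {
                bad.add(i + 1);
            }
        }
        return bad;
    }

    public static void main(String[] args) {
        List<String> sample = Arrays.asList(
            "ID,NAME",
            "1,Alice",
            "",        // blank line: no values at all
            ",Bob",    // row with an empty identity value
            "4,Carol");
        System.out.println(findRowsWithoutId(sample, "ID")); // prints [3, 4]
    }
}
```

For a real file, the lines could be read with `java.nio.file.Files.readAllLines` and passed to the same method. Any reported line number points at a row to fix or remove before handing the data to Duke.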

xinelim commented 5 years ago

Problem solved. I downloaded my dataset again and it works now.