larsga / Duke

Duke is a fast and flexible deduplication engine written in Java
Apache License 2.0
614 stars, 194 forks

Retrieve original text rather than the cleaned text #252

Open bino013 opened 6 years ago

bino013 commented 6 years ago

Hi,

Is there a way to retrieve the original text, rather than the cleaned text, after deduplication or record linkage?

Thank you!