elliotchance / gedcom

👪 A Go library and CLI tools for encoding, decoding, traversing, merging, comparing, querying and publishing GEDCOM files.
MIT License
97 stars 22 forks source link

Publish: Add option to exclude duplicate names #297

Closed metaben closed 4 years ago

metaben commented 4 years ago

Have you got any thoughts on de-duplicating these? Not sure how much logic you want to build in.

image

I wasn't expecting to see this, bit of a surprise, but I looked at the raw gedcom and I see the values are there. Maybe FTM hides them. If FTM does, here's the workflow its skewed me to:

Name (already known and sourced): John Johnson Name (in new source material but totally wrong): Jon Gonson

FTM would let you "discard" the name of Jon Gonson, but allow you to link the source up to the good name of John Johnson. So you're improving quality of research, but without polluting/diluting with digitisation mistakes (which might lead to manual corrections). Does this make sense? Is this evidenced in the gedcom?

elliotchance commented 4 years ago

Yep. That's super easy. I'll add a -hide-duplicate-names or similar flag to gedcom publish.