openhatch / oh-greenhouse

Greenhouse -- a (temporary?) project to help track new contributors to open source projects. Might migrate to oh-mainline eventually.
GNU Affero General Public License v3.0

Deduplicate email addresses using GPG fingerprints #19

Closed: daveeloo closed this issue 11 years ago

daveeloo commented 11 years ago

Add a column original_email to People, defaulting to null, and a column original_email_changer to Upload (make it an EmailField instead of a TextField while at it), then change build_people_db to handle both. Add a "unify" button to the person page that pops up a div with a jQuery autocomplete of names. Write an algorithm that makes these changes.
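A minimal sketch of what the "unify" step might do, using plain dataclasses as stand-ins for the People and Upload models. It assumes original_email and original_email_changer are meant to preserve the pre-merge address while email and email_changer are rewritten to the chosen canonical one; the issue does not spell this out, and the `unify` function and its signature are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Person:
    # Stand-in for the People model, not the project's actual Django class.
    name: str
    email: str
    original_email: Optional[str] = None  # stays None until the record is unified


@dataclass
class Upload:
    # Stand-in for the Upload model, reduced to the fields discussed here.
    email_changer: str
    original_email_changer: Optional[str] = None


def unify(duplicate: Person, canonical: Person, uploads: List[Upload]) -> int:
    """Point `duplicate` and its uploads at the canonical email.

    Returns the number of uploads that were rewritten.
    """
    old_email = duplicate.email
    if old_email == canonical.email:
        return 0  # nothing to merge

    # Record the pre-merge address only once, so a later unify
    # does not overwrite the true original (assumed behaviour).
    if duplicate.original_email is None:
        duplicate.original_email = old_email
    duplicate.email = canonical.email

    changed = 0
    for upload in uploads:
        if upload.email_changer == old_email:
            if upload.original_email_changer is None:
                upload.original_email_changer = old_email
            upload.email_changer = canonical.email
            changed += 1
    return changed
```

Keeping the original address in a separate nullable column would make a merge reversible, which seems useful if a "unify" turns out to be a mistake.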

daveeloo commented 11 years ago

Potential heuristic: most duplicated identities share the same name but use a different email domain.

Notes: use the fingerprint, not the key_id. If the uploader and changer are the same, that person must be a Debian developer. The same-name, different-email problem gets bigger as a person gets more experienced.
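The same-name, different-domain heuristic could be sketched roughly as follows. This reads "name" as the person's display name (an assumption; it could also mean the local part of the address), and `candidate_duplicates` is a hypothetical helper, not something from the codebase. It only proposes candidates for a human to confirm.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def candidate_duplicates(people: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Group (name, email) pairs by normalized name.

    Only names whose emails span more than one domain are returned.
    """
    by_name = defaultdict(set)
    for name, email in people:
        # Normalize case and collapse runs of whitespace in the name.
        key = " ".join(name.lower().split())
        by_name[key].add(email.lower())

    result = {}
    for key, emails in by_name.items():
        # Everything after the last "@" is treated as the domain.
        domains = {e.rpartition("@")[2] for e in emails}
        if len(domains) > 1:
            result[key] = sorted(emails)
    return result
```

For example, two records named "Jane Doe" at example.org and example.com would be grouped, while two addresses for the same name at a single domain would not. Common names will produce false positives, so the output is better suited to pre-filling the "unify" autocomplete than to merging automatically.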

paulproteus commented 11 years ago

Assigning to next milestone, b/c we talked about it at Monday meeting.

daveeloo commented 11 years ago

The fingerprint is for the sponsor, which means it is not useful because the People objects are built from email_changer.