andymeneely / chromium-history

Scripts and data related Chromium's history
11 stars 4 forks source link

Match scraped elipsized emails with known Developer emails #172

Closed kaylaerdmann closed 9 years ago

kaylaerdmann commented 10 years ago

We will be parsing both the owner and commenter ids and emails from the json. This is fine temporarily, but we want to be able to match developers through their emails between code reviews and bug issues. Let's find matches and throw them into a google spreadsheet for now.

andymeneely commented 10 years ago

We'll need the following done for this:

We're going to get A LOT more developers in the bug data than in code reviews, so we can't trace all of them manually. We'll need to do some automated work by looking up the ellipsized emails and seeing if there's a unique result. Or, maybe there's another way?

andymeneely commented 9 years ago

Ok, I'm reviving this task and assigning it to @kayladavis. Let's revisit this problem and try to come up with a creative solution, or come to a better understanding of our limitations.

One option might be that we cannot mine developers emails from the bug data unless we have the raw email to begin with, say, from the commit logs. But, I'd obviously rather get the emails if we can.

andymeneely commented 9 years ago

In retrospect, we now have a ton of great research questions that don't require this data. Let's shelve this and close it.