Closed vmarkovtsev closed 5 years ago
As https://github.com/src-d/eee-identity-matching/issues/28 suggests, we need to count commits per name for each identity with more than one name and perform the detection.
We have to generate the second Parquet file:
Please note: the primary name should not be lower-cased. The easiest thing to do is to capitalize the first letter in each word.
Assigning myself because @Guillemdb is busy with the commit time series, @irinakhismatullina is on the demo and @r0mainK is doing his school report.
As https://github.com/src-d/eee-identity-matching/issues/28 suggests, we need to count commits per name for each identity with more than one name and perform the detection.
We have to generate the second Parquet file:
Please note: the primary name should not be lower-cased. The easiest thing to do is to capitalize the first letter in each word.