cmu-db / cmdbac

CMDBAC - Carnegie Mellon Database Application Catalog
http://cmdbac.cs.cmu.edu
Apache License 2.0
35 stars 7 forks source link

Detecting fork when crawling repositories #2

Open zeyuanxy opened 8 years ago

zeyuanxy commented 8 years ago

We should ignore the forked repositories if they are the same with the original one. One Idea is to calculate the hash value (e.g, MD5) to avoid too much computation.