polarisZhao / awesome-face

😎 face releated algorithm, dataset and paper
MIT License
891 stars 182 forks source link

About face recognition dataset #5

Closed SueeH closed 5 years ago

SueeH commented 5 years ago

Great conclusion for face recognition and detection! I am confused about these dataset. Do you know MS-Celeb-1M and Trillion Pairs( MS-Celeb-1M-v1c & Asian-Celeb) have any relationship? Is trillion pairs is part of MS-Celeb-1M(do some cleaning)?

polarisZhao commented 5 years ago

You can see some information for here: http://trillionpairs.deepglint.com/overview trillion pairs Datasets include training set and testing set: Training set MS-Celeb-1M-v1c with 86,876 ids/3,923,399 aligned images cleaned from MS-Celeb-1M dataset. This dataset has been excluded from both LFW and Asian-Celeb. Asian-Celeb 93,979 ids/2,830,146 aligned images. This dataset has been excluded from both LFW and MS-Celeb-1M-v1c. Testing set Trillion Pairs Trillion Pairs is consisted of the following two parts. ELFW: Face images of celebrities in LFW name list. There are 274k images from 5.7k ids. DELFW: Distractors for ELFW. There are in total 1.58 million face images from Flickr.