deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project
https://insightface.ai
22.81k stars 5.34k forks source link

Share Celeb-500k dataset via torrent #1451

Closed codonna9 closed 3 years ago

codonna9 commented 3 years ago

Hello, I'm currently experimenting with robust face recognition algorithms like Subcenter-Arcface. Can anyone, who has the Celeb-500k dataset, share it via torrents? Thanks so much in advance!

AGenchev commented 3 years ago

Why not use Glint360k ? Authors say this is their best (for 112x112). They share it on magnet:?xt=urn:btih:e5f46ee502b9e76da8cc3a0e4f7c17e4000c7b1e&dn=glint360k They also give baidu link but I can not use baidu because they do not know my country and its phone code exist.

codonna9 commented 3 years ago

Sorry for the late reply. For some unknown reason, I didn't got the notification from github. Yeah, I downloaded the Glint360k by torrent and I'm very thankful for the authors releasing it. The reason I ask for for Celeb-500k is I'd like to test some new noise-tolerant algorithms to see how far they can push the performance of face recognition models.

AGenchev commented 3 years ago

I see this is closed, I can tell you there is a lot of noise in MSCeleb1M as well. Infact I was planning to merge it into VGGFace, but this will destroy opportunity to use LFW as verification dataset, because many celebs are shared between them, e.g. train/test will overlap. Beware the case with glint369k is the same - possible overlapping with LFW.