I trained the sphereface64 network with the same hyperparameter settings, but it cannot reach 99.% accuracy on LFW, whereas the sphereface20 network gets 99.20% accuracy on LFW. This confuses me: why does the deeper network perform worse than the shallower one?