gicheonkang closed 6 years ago
It is the same as what is used in the article: the error is smaller when the eyes are closer together.
Thank you for your reply. I have one more question. The paper gives no reason why the vanillaCNN model uses the absolute hyperbolic tangent as its activation function.
I tested ReLU, but the accuracy was worse. Can you explain why?
@gicheonkang Most issues with this database are due to having very little data. ReLU has more dynamic range and gives better results where enough data is available. Here, my guess is that the limited dynamic range of the absolute tanh (fewer degrees of freedom) prevents over-fitting.
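To make the dynamic-range point concrete, here is a minimal sketch in plain Python comparing the two activations on a few sample inputs. The function names `abs_tanh` and `relu` and the sample values are chosen only for illustration; this is not code from the vanillaCNN repository.

```python
import math


def abs_tanh(x: float) -> float:
    """Absolute hyperbolic tangent: mathematically bounded to [0, 1)."""
    return abs(math.tanh(x))


def relu(x: float) -> float:
    """Rectified linear unit: zero for negative inputs, unbounded above."""
    return max(0.0, x)


# Illustrative inputs only (an assumption, not taken from the model).
inputs = [-5.0, -1.0, 0.0, 1.0, 5.0]
abs_tanh_out = [abs_tanh(x) for x in inputs]
relu_out = [relu(x) for x in inputs]

for x, a, r in zip(inputs, abs_tanh_out, relu_out):
    print(f"x={x:+.1f}  abs_tanh={a:.4f}  relu={r:.1f}")
```

The absolute tanh output saturates just below 1 and is symmetric in the sign of the input, while the ReLU output grows with the input. That bounded range is the "little dynamic range" referred to above; whether it is actually what limits over-fitting here remains a guess.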
Hi, I have a question about evaluating the error rate. While profiling, I saw code like the below (in 'testAFW_TestSet').
You use mse_normlized with the ground truth and the prediction. Is there any particular reason why you use MSE?
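For readers following along: the body of mse_normlized is not shown in this thread, so the sketch below is not that function. It only illustrates one common way to normalize a landmark error, dividing the mean point-to-point distance by the inter-ocular distance, so that the score does not depend on face size in pixels. The helper name `interocular_normalized_error`, its parameters, and the landmark ordering (eyes at indices 0 and 1) are all assumptions made for this example.

```python
import math


def interocular_normalized_error(gt, pred, left_eye=0, right_eye=1):
    """Mean landmark distance divided by the ground-truth eye distance.

    gt, pred: sequences of (x, y) points in the same order.
    left_eye, right_eye: indices of the eye landmarks (assumed ordering).
    Hypothetical helper, not the repository's mse_normlized.
    """
    if len(gt) != len(pred):
        raise ValueError("gt and pred must have the same number of landmarks")
    eye_dist = math.dist(gt[left_eye], gt[right_eye])
    if eye_dist == 0:
        raise ValueError("eye landmarks coincide; cannot normalize")
    mean_dist = sum(math.dist(g, p) for g, p in zip(gt, pred)) / len(gt)
    return mean_dist / eye_dist


# Made-up landmarks: every prediction is off by one pixel vertically.
gt = [(0.0, 0.0), (10.0, 0.0), (5.0, 5.0)]
pred = [(0.0, 1.0), (10.0, 1.0), (5.0, 6.0)]
err = interocular_normalized_error(gt, pred)

# The same face at twice the scale gives the same normalized error.
gt2 = [(2 * x, 2 * y) for x, y in gt]
pred2 = [(2 * x, 2 * y) for x, y in pred]
err2 = interocular_normalized_error(gt2, pred2)
```

Without the division, the second face would report twice the pixel error even though the prediction is equally good relative to the face.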