Closed rjgarciar closed 9 months ago
can you share the data set?
it might be a rounding problem. in the comment line it says "metric_value": 0.0191
Shouldn't it be around 0.5 looking at the plot?
The data set has these columns:
I know this is out of topic, it should be in your deepface package, but it's the reason I was trying to stablish a threshold: it is relatively common that:
Perhaps this is the reason of "metric_value": 0.0191...
Data set is available here
Data set size is really large and i cannot download it. Could subsample it and share here again?
I have uploaded here faces_3.csv a 50% subsampling of original data.
For large datasets the code isn't evaluating every possible partition (presumably due to performance). Instead it's using mean and +/- 1-3 std deviations. This subsampling is implemented in processContinuousFeatures.
I have a CSV with pre-calculated cosine distance between face embeddings of people images in my dataset like this:
And I use this script to calculate findDecision tree:
The results I get are:
The plot is:
and outputs/rules/rules.py:
As you can see, it gives me a 0.0 threshold when it should be around 0.68.
Am I doing something wrong?
Regards