When selecting hard samples, the input labels are obtained based on the index of the maximum value in each row of y. For example, for the prominent pose 1 of front_raise, the label is [10000000], and its maximum value index is 0. For the prominent pose 2 of front_raise, the label is [00000000], and the maximum value index is also 0. This would make them considered matching samples. How are the two distinguished?
When selecting hard samples, the input labels are obtained based on the index of the maximum value in each row of y. For example, for the prominent pose 1 of front_raise, the label is [10000000], and its maximum value index is 0. For the prominent pose 2 of front_raise, the label is [00000000], and the maximum value index is also 0. This would make them considered matching samples. How are the two distinguished?