positve pairs and negitive pairs?

mimbres / neural-audio-fp

MIT License

179 stars 25 forks source link

@kasireddygariDineshKumarReddy

As you know, the goal of fingerprinting is to identify the source audio file, not the sound generating source. The same dogs' barking every time will differ in audio signal, and of course, they can't be positive pair in this project. In fact, your scenario is the same as standard sound event detection.
Perhaps your question is about the figure 2 in our paper. Any sample in the training batch has a chance to be anchored once. For example, the red original circle is an anchor to be compared with others in the first row. You may find the red circle in the second rows but the anchor in the row is pink circle. We compute softmax crossentropy on each row.

mimbres / neural-audio-fp