Closed NickleDave closed 2 years ago
@yardencsGitHub maybe this is something we can better address with examples from canary song
Will this tend to work against the correct labeling of rare variants? For example if the sequence 'abc' is very prevalent in a song, and the sequence 'abb' is very rare, will the labeling of 'abb' be biased towards the labels 'abc'?
Based on most recent results and follow-up discussions with @yardencsGitHub we will want to say something like:
"generally speaking, we chose a bin size that was just smaller than the shortest duration silent gaps between syllables, because a larger bin size would have prevented our model from producing correct segments in cases where the true gaps were shorter than our bin size. In initial studies we experimented with even smaller bin sizes but found that the network tended to over-segment. We address possible reasons for this in the discussion."
I went ahead and added this sentence pretty much verbatim to "generating spectrograms" in Methods.
We should make sure to point this out in response to reviewers.
Closing as done.
key point for this issue is that:
window size will be addressed by follow-up experiments, e.g. #69
but for bin size, as discussed with @yardencsGitHub, we will want to make clear that we are thinking of the model as implicitly learning to segment, and so bin size will impact its ability to segment well
heuristically: going below < 1 ms will probably give any additional benefits, and going > 5 ms will probably be too noisy