Hi @igor-panteleev, great job. I want to train my own model for two or three particular classes. I'm planning the following training pipeline:
128 dim embedding --> a classifier --> classes
I've seen that the Google AudioSet data is provided as 128-dim embeddings in TFRecord files, but I couldn't find it in a downloadable form on the site. I did find a frame-by-frame tensorflow.SequenceExample file of 2.4 GB. Is that the same data you used? Please help me with this.
For evaluation, the pipeline will be:
WAV format --> VGGish --> 128 dim embedding --> a classifier --> prediction label
Am I missing something?
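
To make the classifier stage concrete, here is a minimal sketch of what I have in mind. It is not code from this repository. It assumes each clip arrives as a (num_frames, 128) array of embeddings, averages the frames into a single 128-dim vector, and fits a plain softmax classifier with NumPy. The names `pool_clip`, `train_softmax_classifier`, and `predict` are hypothetical, and random numbers stand in for real embeddings, so the predictions themselves are meaningless.

```python
import numpy as np

EMBEDDING_DIM = 128  # size of one embedding frame in the pipeline above
NUM_CLASSES = 3      # "two or three particular classes"


def pool_clip(frames):
    """Average a (num_frames, 128) array of frame embeddings into one 128-dim vector."""
    return np.asarray(frames, dtype=np.float64).mean(axis=0)


def softmax(logits):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def train_softmax_classifier(features, labels, num_classes, lr=0.1, epochs=200):
    """Fit multinomial logistic regression with batch gradient descent."""
    num_samples, dim = features.shape
    weights = np.zeros((dim, num_classes))
    bias = np.zeros(num_classes)
    one_hot = np.eye(num_classes)[labels]
    for _ in range(epochs):
        probs = softmax(features @ weights + bias)
        grad = (probs - one_hot) / num_samples  # gradient of mean cross-entropy w.r.t. logits
        weights -= lr * (features.T @ grad)
        bias -= lr * grad.sum(axis=0)
    return weights, bias


def predict(features, weights, bias):
    """Return the most probable class index for each row of features."""
    return softmax(features @ weights + bias).argmax(axis=1)


# Synthetic stand-in data (assumption): 30 clips of 10 frames each, random labels.
rng = np.random.default_rng(0)
clips = [rng.normal(size=(10, EMBEDDING_DIM)) for _ in range(30)]
labels = rng.integers(0, NUM_CLASSES, size=30)

features = np.stack([pool_clip(clip) for clip in clips])
weights, bias = train_softmax_classifier(features, labels, NUM_CLASSES)
predictions = predict(features, weights, bias)
```

At evaluation time the same `pool_clip` and `predict` calls would run on the embeddings that VGGish produces from a WAV file. Averaging the frames is only one possible choice; a classifier that consumes the frame sequence directly would replace `pool_clip`.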