devicehive / devicehive-audio-analysis

Apache License 2.0
196 stars 80 forks source link

TF record files for training custom new model #10

Open sleebapaul opened 6 years ago

sleebapaul commented 6 years ago

Hi @igor-panteleev, great job. I want to train my own model for two or three particular classes. I'm planning the following training pipeline.

128 dim embedding --> a classifier --> classes

Though I've seen the Google Audio set data is provided in 128 dim tf.records, I couldn't find it in a downloadable form in the site. I found a frame by frame tensorflow.SequenceExample file of 2.4 GB. But is that the same data you've used? Please help me on this.

On evaluation, the pipeline will be

WAV format --> VGGish --> 128 dim embedding --> a classifier --> prediction label

Do I miss something?