devicehive / devicehive-audio-analysis

Apache License 2.0
196 stars 80 forks source link

Creating own dataset #8

Open Wild1234 opened 6 years ago

Wild1234 commented 6 years ago

Good night.

I want to create my own dataset with my own labels. is it possible for this repository?

Thanks

igor-panteleev commented 6 years ago

Hi, Yes, you can use your own dataset.

You need to extract features from raw data. To do this take a look to vggish lib here.

Also you can find how i did this on the fly.

Hope it will help.

Lelo123 commented 6 years ago

Thanks!

But isn't necessary to train a model with youtube 8M?

igor-panteleev commented 6 years ago

Short answer - yes. Two models have been used here. vggish - as feature extractor. youtube8m - as classifier.

So if you want to use different labels you need to extract features using vggish and then train youtube 8m model with these features.

hei9gag commented 5 years ago

Hi,

I still have some questions about creating own data set.

The training script provided by youtube8m is using .tfrecord extension. Do you know how to generate this format for audio?

Also, how do I add my custom labels using the youtube8m training model? Many thanks.