QutEcoacoustics / audio-analysis

The audio analysis code (AnalysisPrograms.exe) for the QUT Ecoacoustics Research Group
https://ap.qut.ecoacoustics.info/
Apache License 2.0
53 stars 12 forks source link

New feature: offline training analysis #162

Open atruskie opened 6 years ago

atruskie commented 6 years ago

We're interested in incorporating learned models into our workflow, which we intend to use for a new class of recognizers. Here we use the term offline to denote that each instance of an offline training analysis will sample data from many files to build a model; it will not produce any classification results.

Conversely, online is the term we will use when processing one audio file to get predictions.

Expected behaviour

AP.exe will have a new entry point named "model-learning".

This entry point will accept:

This analysis should have common options enabled that define:

This analysis will implement the IAnalyzer2 interface. The interfaces stages will map as such:

Analyze is the map stage and will download each audio segment (using analysis coordinator), and do feature extraction.

SummarizeResults is the reduce stage and will be responsible for training and then producing the model.

This will be used by @mkholghi and @towsey - comments welcome.

towsey commented 6 years ago

This sounds like a good approach to me, although I am not across the finer points.