automatically detect labels and bins

felixbur / nkululeko

Machine learning speaker characteristics

MIT License

I meant that the labels [anger, disgust, happy...] are already in the data.

Yes, this should be the default when no `labels` option is given in the [DATA] section of the INI file. If `labels` is set in the [DATA] section, the defined labels should be used instead.
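To make the proposed default concrete, here is a minimal sketch of the fallback logic. It is not nkululeko's actual code: `resolve_labels` is a hypothetical helper, and the list-literal format of the `labels` value and the `target` option in the sample config are assumptions made for illustration.

```python
import ast
import configparser


def resolve_labels(config, observed):
    """Use [DATA] labels when configured, else detect them from the data.

    Hypothetical helper, assuming `labels` is written as a Python list literal.
    """
    if config.has_option("DATA", "labels"):
        return list(ast.literal_eval(config.get("DATA", "labels")))
    # No labels configured: fall back to the unique values found in the data.
    return sorted(set(observed))


# Toy label column, standing in for the values already present in the data.
observed = ["happy", "anger", "disgust", "happy", "anger"]

# Case 1: no `labels` option, so the labels are detected automatically.
auto = configparser.ConfigParser()
auto.read_string("[DATA]\ntarget = emotion\n")
detected_labels = resolve_labels(auto, observed)

# Case 2: `labels` is defined, so the configured labels take precedence.
manual = configparser.ConfigParser()
manual.read_string("[DATA]\ntarget = emotion\nlabels = ['anger', 'happy']\n")
configured_labels = resolve_labels(manual, observed)
```

Sorting the detected labels keeps the label order stable across runs, which matters if the order is later used for encoding.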

For regression, we should treat the task as real regression, i.e., predicting a continuous score. Take the IEMOCAP, MSP-IMPROV and MSP-Podcast datasets as examples: the data format is usually "file, valence, arousal, dominance, naturalness", where the last four columns (valence to naturalness) are continuous scores. The output should be a continuous score as well. In this case the label is required (the name of the column header to predict).
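As a rough sketch of what "label is required" could mean in practice, the snippet below reads a table in the format described above and extracts one continuous target column. `load_targets` is a hypothetical helper, not part of nkululeko, and the rows are made-up placeholder values rather than real dataset entries.

```python
import csv
import io


def load_targets(csv_text, target):
    """Map each file to the continuous score in the `target` column.

    Hypothetical helper: for regression the target name cannot be guessed,
    because several continuous columns are candidates.
    """
    if not target:
        raise ValueError("regression needs the name of the column to predict")
    reader = csv.DictReader(io.StringIO(csv_text), skipinitialspace=True)
    if target not in (reader.fieldnames or []):
        raise ValueError(f"column {target!r} not found in the data")
    return {row["file"]: float(row[target]) for row in reader}


# Placeholder rows in the "file, valence, arousal, dominance, naturalness" layout.
sample = (
    "file,valence,arousal,dominance,naturalness\n"
    "a.wav,2.5,3.0,3.5,4.0\n"
    "b.wav,4.0,1.5,2.0,4.5\n"
)

valence = load_targets(sample, "valence")
```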

Binning can be added too, to map continuous regression scores to classes and enable further analysis.
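One possible way to bin scores is sketched below. The `to_bins` helper, the edge values and the bin names are all assumptions for illustration, not an existing nkululeko option.

```python
from bisect import bisect_right


def to_bins(scores, edges, names):
    """Map continuous scores to class names using ascending bin edges.

    Hypothetical helper: a score equal to an edge falls into the upper bin.
    """
    if len(names) != len(edges) + 1:
        raise ValueError("need exactly one more bin name than edges")
    return [names[bisect_right(edges, score)] for score in scores]


# Two edges split the score range into three classes.
binned = to_bins([1.2, 2.5, 3.0, 4.8], edges=[2.0, 3.5], names=["low", "mid", "high"])
```

Applying the same edges to both the true and the predicted scores would let the regression output be analysed with classification tools such as a confusion matrix.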

automatically detect labels and bins #61