MixedEmotions / up_emotions_audio

This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The output is the predicted emotion in terms of Arousal and Valence within the JSON-LD format.
GNU General Public License v3.0

up_emotions_audio

This RESTful web service predicts arousal and valence from audio. The input is either an audio/video file uploaded to the server or a URL. The output is the predicted emotion, expressed as Arousal and Valence, in JSON-LD format.

To set up the module, you need:

Example:

http://localhost:8888/er/aer/getdims?dims=arousal,valence&url=http://tv-download.dw.com/dwtv_video/flv/wikoe/wikoe20151114_wiruebli_sd_avc.mp4&timing=9,15;147,152

where:

getdims: the endpoint; its dims parameter lists the desired dimensions, separated by commas (arousal,valence)

url: the URL of the video/audio, or the name of the uploaded file

timing: the start and end of each segment, in seconds, as start1,end1;start2,end2. It can also be 'asr' if ASR is available.
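The request URL described above can be assembled programmatically. The following is a minimal Python sketch, not part of the module itself: the helper names format_timing and build_getdims_url are hypothetical, and the choice to leave ',', ';', ':' and '/' unescaped is an assumption made only so that the result matches the example URL character for character.

```python
from urllib.parse import quote, urlencode


def format_timing(segments):
    """Render segments as 'start1,end1;start2,end2'; pass 'asr' through unchanged."""
    if segments == "asr":
        return "asr"
    return ";".join(f"{start},{end}" for start, end in segments)


def build_getdims_url(base, dims, source, segments):
    """Build a getdims request URL (hypothetical helper, not the module's API)."""
    params = {
        "dims": ",".join(dims),
        "url": source,
        "timing": format_timing(segments),
    }
    # Assumption: keep ',', ';', ':' and '/' literal, as in the README example.
    query = urlencode(params, safe=",;:/", quote_via=quote)
    return f"{base}/er/aer/getdims?{query}"
```

Calling build_getdims_url("http://localhost:8888", ["arousal", "valence"], source, [(9, 15), (147, 152)]) with the video address from the example as source reproduces the example URL.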

To upload an audio/video file use curl:

Windows: curl -v -H "Content-Type:multipart/form-data" --user meuser -i -X POST -F "file=@D:\path\to\sample.wav" http://localhost:8888/er/aer/upload

Linux: curl -v -H "Content-Type:multipart/form-data" --user meuser -i -X POST -F 'file=@./sample.wav' http://localhost:8888/er/aer/upload
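For clients that cannot shell out to curl, the same upload can be approximated with the Python standard library. This is a sketch under assumptions: build_upload_request is a hypothetical helper, the password argument stands in for whatever curl would prompt for after --user meuser, and HTTP Basic authentication is assumed because that is curl's default. The form field name "file" is taken from the curl commands above.

```python
import base64
import mimetypes
import uuid
from urllib.request import Request


def build_upload_request(url, filename, data, user, password):
    """Build (but do not send) a multipart/form-data POST with one 'file' field."""
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    # Part header for the uploaded file, followed by the raw bytes and the closing boundary.
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        f"Content-Type: {content_type}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    # Assumption: the server accepts HTTP Basic credentials.
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    headers = {
        "Content-Type": f"multipart/form-data; boundary={boundary}",
        "Authorization": f"Basic {token}",
    }
    return Request(url, data=head + data + tail, headers=headers, method="POST")
```

The returned request can be sent with urllib.request.urlopen(request); building it separately keeps the encoding step easy to inspect without a running server.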

Moreover, this repository handles the fusion of audio and video outputs. To fuse the results of the audio and video outputs, run:

wget "localhost:8080/er/general/fuse?video=cat json_video_plain.txt&audio=cat json_audio_plain.txt"

The files should have the following entities. Note: keep ':time=start,end' in the "@id" section.
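The fusion request can also be built in Python. The sketch below rests on one reading of the command above, namely that the video and audio query parameters carry the JSON text stored in json_video_plain.txt and json_audio_plain.txt; that reading, the percent-encoding of the JSON, and the helper names build_fuse_url and build_fuse_url_from_files are all assumptions rather than documented behaviour.

```python
from pathlib import Path
from urllib.parse import urlencode


def build_fuse_url(base, video_json, audio_json):
    """Build a fuse request URL from two JSON strings (hypothetical helper)."""
    # Assumption: the service accepts percent-encoded JSON in the query string.
    query = urlencode({"video": video_json, "audio": audio_json})
    return f"{base}/er/general/fuse?{query}"


def build_fuse_url_from_files(base, video_path, audio_path):
    """Read the two result files and build the fuse URL from their contents."""
    video_json = Path(video_path).read_text(encoding="utf-8")
    audio_json = Path(audio_path).read_text(encoding="utf-8")
    return build_fuse_url(base, video_json, audio_json)
```

For example, build_fuse_url_from_files("http://localhost:8080", "json_video_plain.txt", "json_audio_plain.txt") yields a URL that can be fetched with wget or urllib.request.urlopen.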

See http://localhost:8888/er/general for more information

Licenses:

openSMILE: distributed free of charge for research and personal use (http://www.audeering.com/research-and-open-source/files/openSMILE-open-source-license.txt)

WEKA: GPL 3

If you use this module, please cite the following paper: http://ieeexplore.ieee.org/abstract/document/8269329/