aris-ai / Audio-and-text-based-emotion-recognition

A multimodal approach to emotion recognition using audio and text.
Apache License 2.0

Audio-and-text-based-emotion-recognition

A PyTorch implementation of the paper

Objective

The model recognizes emotion from variable-length audio inputs and text.
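As an illustration of what "variable-length audio inputs" can mean in PyTorch, the sketch below pads a batch of utterances to a common length and packs them so a recurrent layer ignores the padding. It is not taken from this repository: the helper `collate_variable_length`, the 40-dimensional frame features, the GRU, and the hidden size of 64 are all assumptions chosen for the example.

```python
import torch
from torch.nn.utils.rnn import pad_sequence, pack_padded_sequence


def collate_variable_length(features):
    """Pad a list of (time, feat_dim) tensors to the longest one.

    Hypothetical helper, not part of this repository.
    Returns the padded batch and the original lengths.
    """
    lengths = torch.tensor([f.shape[0] for f in features])
    # Shorter sequences are zero-padded: (batch, max_time, feat_dim)
    padded = pad_sequence(features, batch_first=True)
    return padded, lengths


# Assumed inputs: three utterances of different lengths, 40 features per frame
torch.manual_seed(0)
batch = [torch.randn(n, 40) for n in (120, 75, 200)]
padded, lengths = collate_variable_length(batch)

# Assumed encoder: a single-layer GRU (the repository's audio model may differ)
rnn = torch.nn.GRU(input_size=40, hidden_size=64, batch_first=True)

# Packing tells the GRU where each sequence really ends,
# so the padded frames do not influence the final hidden state.
packed = pack_padded_sequence(
    padded, lengths, batch_first=True, enforce_sorted=False
)
_, h_n = rnn(packed)

# One fixed-size vector per utterance, in the original batch order
utterance_embedding = h_n[-1]
print(padded.shape, utterance_embedding.shape)
```

The fixed-size `utterance_embedding` is the kind of representation that could then be combined with a text representation, whatever the input duration.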

Datasets

We used the IEMOCAP dataset for this project. It can be downloaded from https://sail.usc.edu/iemocap/. We also omitted one-second audio data from the dataset.
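One possible reading of the omission step is that clips lasting one second or less are dropped before training. The sketch below shows such a duration filter under that assumption. The function `keep_clip`, the 16 kHz sample rate, the threshold, and the utterance names are all hypothetical and should be adjusted to the actual preprocessing.

```python
def keep_clip(num_samples, sample_rate=16000, min_seconds=1.0):
    """Return True if a clip is strictly longer than min_seconds.

    Hypothetical helper: the sample rate and threshold are assumptions.
    """
    return num_samples / sample_rate > min_seconds


# Assumed example data: utterance id -> number of audio samples
clips = {"utt_a": 8000, "utt_b": 16000, "utt_c": 48000}

# utt_a lasts 0.5 s and utt_b exactly 1.0 s, so both are dropped
kept = [name for name, n in clips.items() if keep_clip(n)]
print(kept)
```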

Methodology

Audio model

Text model

Multimodal approach