PiotrSobczak / speech-emotion-recognition

Multi-modal Speech Emotion Recognition on IEMOCAP dataset

Update recipe for linguistic model #15

Open pbrusco opened 4 years ago

pbrusco commented 4 years ago

Hi,

Please update the readme and source code for training the linguistic model.

I assumed I had to go into data_loader and execute generate_transcriptions. I also had to download the DeepSpeech models once I figured out I needed them. I also think deepspeech_generator is deprecated, so I had to make a system call.

Thanks again!

pbrusco commented 4 years ago

I found out that the "deep_generator" was a script in the speech_emotion_recognition folder. My bad. Anyway, the steps for generating the linguistic models would be appreciated :). Thanks again.

I managed to generate the ASR transcriptions by running:

$ python
>>> import speech_emotion_recognition.data_loader as dl
>>> from speech_emotion_recognition import *
>>> dl.generate_transcriptions()

leijue222 commented 3 years ago

I have downloaded the DeepSpeech model.

DeepSpeech 0.5.1 has to be run on Ubuntu. After that, generate_transcriptions() works.

But now there is a new problem: how do I get the embeddings_array.numpy and word_to_index.pickle files?

https://github.com/PiotrSobczak/speech-emotion-recognition/blob/8ecbcf11a6dbc6d6802e51f7f797fcd85fdb7992/speech_emotion_recognition/word2vec_wrapper.py#L14-L15 @PiotrSobczak

cui0721 commented 3 years ago

Hello, I would like to ask how the NPY file is generated.

ADOLoooo commented 2 years ago

I have the same question about how to generate embeddings_array.numpy and word_to_index.pickle... Thanks.
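
For anyone with the same question: the thread does not say how these two files were originally produced, so the following is only a sketch under assumptions, not the author's method. It assumes that embeddings_array.numpy holds a 2-D array with one row per word, that word_to_index.pickle holds a dict mapping each word to its row number, and that the source vectors come from a text file with one `word v1 v2 ...` entry per line. The function names `parse_text_embeddings` and `save_artifacts` are hypothetical and do not exist in the repository. Check word2vec_wrapper.py to confirm how the files are actually loaded before relying on this.

```python
import pickle
from typing import Dict, Iterable, List, Tuple


def parse_text_embeddings(
    lines: Iterable[str],
) -> Tuple[Dict[str, int], List[List[float]]]:
    """Parse 'word v1 v2 ...' lines into a word -> row mapping and row vectors.

    Hypothetical helper: the input format is an assumption, not taken
    from the repository.
    """
    word_to_index: Dict[str, int] = {}
    vectors: List[List[float]] = []
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 3:
            # Skips blank lines and an optional "<count> <dim>" header line.
            continue
        word, values = parts[0], parts[1:]
        if word in word_to_index:
            # Keep the first vector seen for a duplicated word.
            continue
        word_to_index[word] = len(vectors)
        vectors.append([float(v) for v in values])
    return word_to_index, vectors


def save_artifacts(
    word_to_index: Dict[str, int],
    vectors: List[List[float]],
    array_path: str = "embeddings_array.numpy",
    index_path: str = "word_to_index.pickle",
) -> None:
    """Write the array and the mapping under the file names from this thread."""
    import numpy as np  # only needed for writing the array

    embeddings = np.asarray(vectors, dtype=np.float32)
    # Pass an open file object: np.save() given a plain file name would
    # append ".npy" and produce "embeddings_array.numpy.npy" instead.
    with open(array_path, "wb") as f:
        np.save(f, embeddings)
    with open(index_path, "wb") as f:
        pickle.dump(word_to_index, f)
```

A possible way to use it, with `vectors.txt` standing in for whatever pretrained embedding file you have: open the file, pass the file object to `parse_text_embeddings`, then pass both results to `save_artifacts`. Whether the model expects a particular vocabulary, dimensionality, or special tokens is not stated in this thread.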