-
Hi, thank you very much for the code you shared.
I tried to run the code with the following arguments:
`python main.py -lr=5e-5 -ep=40 -mod=tav -bs=8 --img-interval=500 --early-stop=6 --loss=bce…
-
![微信截图_20240205001611](https://github.com/LeMei/UniMSE/assets/71129884/d076dcf0-df0d-42ee-8f3a-f818f454c668)
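I am trying to run main.py, but found that some files are still missing. I did find a way to generate those files, which brought me to this step: when running preprocess.py in the iemocap folder, I found that it outputs the n… shown in the screenshot above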
-
Hello, I am very interested in your project, but I couldn't find descriptions for the Vanilla_Transformer and localTranformer2 methods in the code.
-
How would it work on real-world data such as reviews or interviews?
-
The dataset is not loaded into the file when I try another speech dataset (SAVEE/CASIA/IEMOCAP). How can I solve this issue?
-
Many thanks for the contribution,
Although utterance segmentation is not part of your work (the IEMOCAP emotion dataset is already segmented into utterances), do you have any idea about any too…
-
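Hello, I am very interested in your paper. I ran into some problems while reproducing it and would appreciate your guidance!
When reproducing the results on the IEMOCAP dataset, my ACC and F1 only reach 70, whereas the paper reports 72. I am also using an RTX 3090, and the experiment parameters match your settings. The gap may come from different parameters having different effects, so I am still experimenting.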
![image](https://github.com/feiyuchen7/M3NET/assets/126675907/80ac7d1d-1…
-
# Speech Emotion Captioning
Speech emotion captioning describes the emotion in speech using natural language.
## Task Objective
Compared with traditional speech emotion recognition(wher…
-
Hello, I have some questions about the input data. First, I see that when the original features you provided are loaded, the features of all three modalities have three dimensions, the first dimension is batc…