Closed bagustris closed 2 years ago
Hi @bagustris , thanks for your attention. You can check the role of *_fixed.txt in here. Simply saying, this is to fix the script manually, and you may just create blank file at ./preparation/iemocap_fixed.txt
if you don't have any of it.
Thank @keonlee9420. It fixed that step.
It seems also you reorganized the original dataset structure. In this case, I didn't see sessions
directory in the original dataset as required in argument config.
My original structure is only like this:
$ pwd
/data/IEMOCAP_full_release
$ tree -L 2 .
├── Session1
│ ├── dialog
│ ├── sentences
├── Session2
│ ├── dialog
│ ├── sentences
├── Session3
│ ├── dialog
│ ├── sentences
├── Session4
│ ├── dialog
│ ├── sentences
├── Session5
│ ├── dialog
│ ├── sentences
All those sessions you moved into directory sessions
, right?
Also, what is the difference between corpus_path
and raw_path
? When I run the prepare_data
, I saw that it processed data from raw_path
, not corpus_path
. It will be helpful to show the structure of your dataset and your working directory like the tree's output above.
sessions
is used for an intermediate directory.corpus_path
. Then prepare_align.py
will automatically prepare the dataset (e.g., resampling the audio and cleaning the text) and then gather them into raw_path
directory as follows:
IEMOCAP
├── sessions
│ ├── Ses01F_impro01
│ ├── Ses01F_impro02
So the data flows through corpus_path
(raw data) -> raw_path/IEMOCAP
-> preprocessed_data/IEMOCAP
during the preprocessing phase described in README.md.
Hope it helps.
Hi @keonlee9420,
Thanks for your clear explanation. I can make it works now. However, it takes a long time if I tried with CPU. It runs for more than a week in my machine. For IEMOCAP dataset, how long did you run in GPU? Also, it will be beneficial to tell your GPU type for raw comparison.
Sorry for the late reply (maybe I'm too late). Using GPU is recommended. I trained the model on a single TITAN RTX, and it took about a day to get intelligible speech. Full convergence may take more than that.
Hi @bagustris , thanks for your attention. You can check the role of *_fixed.txt in here. Simply saying, this is to fix the script manually, and you may just create blank file at
./preparation/iemocap_fixed.txt
if you don't have any of it.
thks @keonlee9420 for sharing your work. I've been stucked by this problem too, maybe u can put this fix into README.md, that would be help.
close due to inactivity
Hey, thanks for a great repository!
I tried to replicate your experiment for IEMOCAP data. I already have this dataset and experiment with this data for speech emotion recognition. However, I got the following error when running command:
I checked the original dataset and there is no file named iemocap_fixed. Where is this file come from?