SpectData / MONAH

Multi-Modal Narratives for Humans
MIT License

Bug - Duplicate Logic Error in run_weaving_talkturn.py #21

Closed joshkyh closed 3 years ago

joshkyh commented 3 years ago

The function run_weaving_talkturn in execute_weaveing_talkturn.py, which converts word_transcripts.csv into the weaved talkturns.csv, produces a duplicate row, as shown in the screenshots. This causes a downstream problem at line 212 of word_pitch_vol.py on the dev-pitch branch.

image

image

The video pair from Ses01F_impro01_F.

Could we also take this opportunity to improve the commenting detail of run_weaving_talkturn? :)

mfmakahiya commented 3 years ago

Hi Joshua! While examining the error, I realized it might be specific to the IEMOCAP split videos, because of the audio quality of the recordings we have (each speaker can be heard in the other speaker's recording). So this is not a single instance; it happens multiple times across the weaved talkturn result. I'm afraid there is no workaround here, and the only solution I can suggest is to use two new examples in which the speakers are not heard in each other's recordings.

joshkyh commented 3 years ago

Hi Marriane, if the other speaker were audible in a speaker's file, I would expect the same words to appear under different speakers. In the screenshot, however, the duplicate appears under the same speaker (man). Could you have a look please? :)
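
The distinction above (repeats under one speaker versus repeats across speakers) can be checked mechanically. The following is only a minimal sketch, not the logic in run_weaving_talkturn: it assumes each weaved talkturn is a row with hypothetical `speaker` and `text` fields, which may not match the real column names in talkturns.csv, and the example rows are made up.

```python
from collections import defaultdict


def classify_duplicates(rows):
    """Split repeated talkturn texts into same-speaker and cross-speaker repeats.

    rows: iterable of dicts with hypothetical keys "speaker" and "text".
    Returns (same_speaker, cross_speaker), two sorted lists of normalised texts.
    """
    speakers_by_text = defaultdict(list)
    for row in rows:
        # Normalise case and whitespace so trivially different copies still match.
        key = " ".join(row["text"].lower().split())
        speakers_by_text[key].append(row["speaker"])

    same_speaker, cross_speaker = [], []
    for text, speakers in speakers_by_text.items():
        if len(speakers) < 2:
            continue  # text occurs once, so it is not a duplicate
        if len(set(speakers)) < len(speakers):
            same_speaker.append(text)  # at least one speaker repeats this text
        if len(set(speakers)) > 1:
            cross_speaker.append(text)  # more than one speaker has this text
    return sorted(same_speaker), sorted(cross_speaker)


# Made-up rows for illustration only, not taken from Ses01F_impro01_F.
example_rows = [
    {"speaker": "man", "text": "Is there a problem?"},
    {"speaker": "man", "text": "Is there a problem?"},
    {"speaker": "man", "text": "Next."},
    {"speaker": "woman", "text": "Next."},
    {"speaker": "woman", "text": "Okay."},
]
same, cross = classify_duplicates(example_rows)
```

Same-speaker repeats would point to the weaving logic, while cross-speaker repeats are what microphone bleed would produce. A real check would probably also compare timestamps, since a speaker can legitimately say the same phrase twice.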

mfmakahiya commented 3 years ago

image

Hi Joshua! I can't seem to replicate the results you got above. These are the only results I get, even after running it several times. Talkturns 118 and 119 are very close to each other, but unlike in your screenshot they come from different speakers. I haven't edited the script yet, so I'm not sure what's actually going on.

I checked the other talkturns as well, and indeed each speaker can be heard in the other's recording. As a result, we have duplicate talkturns, but from different speakers. At a quick glance, I didn't notice any duplicate talkturns coming from the same speaker.

joshkyh commented 3 years ago

Let's park this since you didn't find any duplicates from the same speaker. The duplicates due to audio being heard in two microphones can't be solved.