Background:
I'm working on Violence Detection using Multimodal approach in my Internship. I want to feed it Spectrograms of Audio of video. I have been trying to understand the code to make it 3 Stream CNN. I can't find the code where you calculate the optical flow and feed it to the net object.
Can you please help with these things:
In which file optical flow is being calculated?
I want to make it 3 Stream CNN. I want to feed it Spectrograms generated from video Audio. Do you have any suggestions & guidance to implement this feature?
I'll highly appreciate your response.
Thanking you in anticipation.
Greetings,
Background: I'm working on Violence Detection using Multimodal approach in my Internship. I want to feed it Spectrograms of Audio of video. I have been trying to understand the code to make it 3 Stream CNN. I can't find the code where you calculate the optical flow and feed it to the net object.
Can you please help with these things:
I'll highly appreciate your response. Thanking you in anticipation.