Closed WinstonDeng closed 4 years ago
How can you run your network on arbitrary audio durations at test time?
To do this, we input the audio as is (plus padding) and then resize a hidden layer so that its output matches the required pose sequence length.
For more details, please refer to the test script of our code, which is compatible with arbitrary audio length: https://github.com/amirbar/speech2gesture/blob/master/audio_to_multiple_pose_gan/predict_audio.py
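For readers who want the idea before opening the script, here is a minimal NumPy sketch of "pad the audio, then resize a hidden layer along time". It is not the code from `predict_audio.py`. The helper names (`pad_audio`, `resize_time_axis`), the sample rate, the pose frame rate, the padding multiple, the use of linear interpolation, and the reshape standing in for the network's encoder are all assumptions made for illustration.

```python
import numpy as np

def pad_audio(audio, multiple):
    """Zero-pad a 1-D waveform so its length is a multiple of `multiple`."""
    remainder = len(audio) % multiple
    if remainder == 0:
        return audio
    return np.pad(audio, (0, multiple - remainder))

def resize_time_axis(features, target_len):
    """Linearly interpolate a (time, channels) array to `target_len` time steps."""
    src_len, channels = features.shape
    src_pos = np.linspace(0.0, 1.0, src_len)
    dst_pos = np.linspace(0.0, 1.0, target_len)
    columns = [np.interp(dst_pos, src_pos, features[:, c]) for c in range(channels)]
    return np.stack(columns, axis=1)

# Hypothetical settings, not taken from the repository.
sample_rate = 16000   # audio samples per second (assumed)
pose_fps = 15         # pose frames per second (assumed)
hop = 512             # padding multiple (assumed)

# An audio clip of arbitrary length (about 7 seconds here).
rng = np.random.default_rng(0)
audio = rng.standard_normal(sample_rate * 7 + 1234)

padded = pad_audio(audio, multiple=hop)

# Stand-in for the encoder: one 8-channel feature vector per `hop` samples.
hidden = padded.reshape(-1, hop)[:, :8]

# Number of pose frames needed to cover the original (unpadded) audio.
target_len = int(round(len(audio) / sample_rate * pose_fps))

# Resize the hidden features so the decoder emits one pose per frame.
resized = resize_time_axis(hidden, target_len)
print(hidden.shape, "->", resized.shape)
```

Because the resize step fixes the output length, the layers before it can accept any input duration, which is what lets a convolutional model run on audio of any length at test time.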
Thanks for your reply!
What are the details of the testing implementation?