joonson / syncnet_trainer

Disentangled Speech Embeddings using Cross-Modal Self-Supervision
MIT License
154 stars, 26 forks

What does test_path mean? #23

Closed: 6eternal6 closed this issue 2 weeks ago

6eternal6 commented 8 months ago

In the speaker recognition task, two of the parameters are test_list and test_path. What path should test_path point to? The code uses "inp1 = loadWAV(os.path.join(test_path,file), self.__max_frames__*4, evalmode=True, num_eval=num_eval).cuda()", but some of the entries in the list file are just a string of digits, while others are full audio file paths.
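To make the question concrete, here is a minimal sketch of how the join in that line behaves for different kinds of entries. The directory and entry values are hypothetical, not taken from the repository or its data, and `resolve` is a helper name made up for illustration. Whether the digit-only entries are meant to be file names at all (rather than, say, a label column) is an open assumption here.

```python
import posixpath  # os.path resolves to this on Linux/macOS; used directly so results match on any OS


def resolve(test_path, entry):
    """Mimic os.path.join(test_path, file) from the quoted line (hypothetical helper)."""
    return posixpath.join(test_path, entry)


# Hypothetical root directory, for illustration only.
test_path = "/data/test_wav"

# A relative entry is appended to test_path.
print(resolve(test_path, "id0001/clip/00001.wav"))  # /data/test_wav/id0001/clip/00001.wav

# An absolute entry makes join discard test_path entirely.
print(resolve(test_path, "/mnt/audio/00001.wav"))   # /mnt/audio/00001.wav

# A bare digit string is treated like any other relative name.
print(resolve(test_path, "1"))                      # /data/test_wav/1
```

If the entries are relative, test_path would presumably need to be the directory they are relative to. If they are already absolute, the value of test_path has no effect on the joined result.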