jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
MIT License
786 stars 43 forks source link

Semantic Representation #55

Open Uneasy-Z opened 4 hours ago

Uneasy-Z commented 4 hours ago

How can I get semantic representations from wavtokenizer? As the paper reads: We extract embeddings corresponding to the discrete codebooks of an acoustic codec model as its respective representations and evaluate the classification accuracy of the codec model on ARCH datasets using its representations.

wish you all the best!

jishengpeng commented 3 hours ago

How can I get semantic representations from wavtokenizer? As the paper reads: We extract embeddings corresponding to the discrete codebooks of an acoustic codec model as its respective representations and evaluate the classification accuracy of the codec model on ARCH datasets using its representations.

wish you all the best!

README, wavtokenizer.encode_infer(). Thank you for your attention.

Uneasy-Z commented 3 hours ago

ok~ Thank you for your prompt reply~