Open treya-lin opened 4 days ago
Examples:(github does not accept wav so I had to upload as webm sorry...)
in this example, moshi only greets but didn't reply meaningful content, it greets and then remained silent till the end A_4.webm output_moshi_4.webm
This seems to be the very few times when it did reply, but it does not consistently respond like this, sometimes it only greets. And I don't understand why it greets when the input is talking? A_0.webm output_moshi.webm
Due diligence
Topic
The pytorch implementation
Question
Hello, thanks for your great work. I am trying the python API to see if I can use existing audio files to simulate streaming input obtain moshi's reply, but it didn't work as expected so I assume I am not using it the proper way. Could you kindly take a look?
my main question:
Some other questions:
my code that I used when I tried to solve question 1:
I tried many times with many audio of different length but it always just returned moshi saying something like "hey what'up" or "hey how's it going". There is once or twice that it replied something meaningful after greeting, but still, I hope it can just "listen " to my words and reply without always greating first . I am trying to look into the code too, but I think I am not doing it the proper way. Could you please give more guide on how to use the API to play around it? Thank you! Any suggestion is much appreciated!