cjyaddone / ChatWaifu

Combined ChatGPT with Moegoe TTS to create a Chatting Waifu
MIT License
803 stars 89 forks source link

chatwaifu生成语音苹果不能听见,windows/安卓可以 #8

Open ShinoKana opened 1 year ago

ShinoKana commented 1 year ago

不知道是否和生成文件的格式有关?chatwaifu是作为插件加入到qchatgpt的,苹果的话qq语音听不到,用tim也是。 Guessed Channel Layout for Input Stream #0.0 : mono Input #0, wav, from 'D:\qqbot-02\QChatGPT\voice-file\voice_f269610d5f370c8f2ffa938318917a96.wav': Duration: 00:00:02.47, bitrate: 705 kb/s Stream #0:0: Audio: pcm_f32le ([3][0][0][0] / 0x0003), 22050 Hz, 1 channels, flt, 705 kb/s Stream mapping: Stream #0:0 -> #0:0 (pcm_f32le (native) -> pcm_s16le (native)) Press [q] to stop, [?] for help Output #0, s16le, to 'D:\qqbot-02\QChatGPT\voice-file\voice_f269610d5f370c8f2ffa938318917a96.pcm': Metadata: encoder : Lavf59.36.100 Stream #0:0: Audio: pcm_s16le, 22050 Hz, mono, s16, 352 kb/s Metadata: encoder : Lavc59.58.100 pcm_s16le size= 106kB time=00:00:02.46 bitrate= 354.5kbits/s speed= 552x video:0kB audio:106kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.000000% ** Silk Encoder (Fixed Point) v 1.0.9.6 **** ** Compiled for 32 bit cpu *** Input: D:\qqbot-02\QChatGPT\voice-file\voice_f269610d5f370c8f2ffa938318917a96.pcm Output: D:\qqbot-02\QChatGPT\voice-file\voice_f269610d5f370c8f2ffa938318917a96.silk API sampling rate: 24000 Hz Maximum internal sampling rate: 24000 Hz Packet interval: 20 ms Inband FEC used: 0 DTX used: 0 Complexity: 2 Target bitrate: 25000 bps Packets encoded: 113 File length: 2.260 s Time for encoding: 0.040 s (1.766% of realtime) Average bitrate: 24.485 kbps Active bitrate: 25.661 kbps

MuBai-He commented 1 year ago

这个需要silkv3转换才能使用,比较麻烦。群文件里面有解决方法!