Open sih4sing5hong5 opened 6 years ago
問到一個很酷的看不懂的模擬方式
先餵 LPF, cornet at 3.8kHz, attenuation > 60dB, downsample by 2. #聽不懂我就不解釋了
有時間會研究一下
報告隊長,我加了 avconv 需要的套件 https://github.com/twgo/siann1-hak8_boo5-hing5/blob/5a50e1e6703a50eceb2625bed0ff99ba61633525/Dockerfile#L10
並依你的寫法在wav.scp每行尾巴多pipe執行avconv https://github.com/twgo/siann1-hak8_boo5-hing5/blob/5a50e1e6703a50eceb2625bed0ff99ba61633525/Dockerfile#L83
但似乎是無法轉的
我在想
我不應去改wav.scp
而是再它之前把wav壓過
讓wav.scp抓壓過的wav?
試了
sox -G /usr/local/pian7sik4_gi2liau7/TW01/M0/TW01M0P0000/tbw1010.wav -b 16 -c 1 -r 16k -t wav - | avconv -i - -ar 8000 -
回應
Guessed Channel Layout for Input Stream #0.0 : mono Input #0, wav, from 'pipe:': Duration: N/A, bitrate: 256 kb/s Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, 1 channels, s16, 256 kb/s [NULL @ 0xd09300] Unable to find a suitable output format for 'pipe:' pipe:: Invalid argument
看起來是參數要調整
Unable to find a suitable output format for 'pipe:'
加 -f wav
Yang-Hsiang Chang notifications@github.com 於 2018年6月20日週三 上午10:21 寫道:
試了
sox -G /usr/local/pian7sik4_gi2liau7/TW01/M0/TW01M0P0000/tbw1010.wav -b 16 -c 1 -r 16k -t wav - | avconv -i - -ar 8000 -
回應
Guessed Channel Layout for Input Stream #0.0 : mono Input #0, wav, from 'pipe:': Duration: N/A, bitrate: 256 kb/s Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, 1 channels, s16, 256 kb/s [NULL @ 0xd09300] Unable to find a suitable output format for 'pipe:' pipe:: Invalid argument
看起來是參數要調整
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/twgo/siann1-hak8_boo5-hing5/issues/41#issuecomment-398603253, or mute the thread https://github.com/notifications/unsubscribe-auth/AFuAC7sxNJN9q-yUptucn7YjnEnYd2ABks5t-bGcgaJpZM4Unrol .
試了
/usr/local/kaldi/egs/taiwanese/s5c/data/train# sox -G /usr/local/pian7sik4_gi2liau7/TW01/M0/TW01M0P0000/tbw1010.wav -b 16 -c 1 -r 16k -t wav - | avconv -f wav -i - -ar 8000 -
回
Guessed Channel Layout for Input Stream #0.0 : mono Input #0, wav, from 'pipe:': Duration: N/A, bitrate: 256 kb/s Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, 1 channels, s16, 256 kb/s [NULL @ 0x20d1320] Unable to find a suitable output format for 'pipe:' pipe:: Invalid argument
抱歉,我這樣直接改wav.scp對嗎?
RUN sed -i -z 's/\n/avconv -f wav -i - -ar 8000 - |\n/g' $KALDI_S5C/data/train/wav.scp
你看一下 -f
的參數,這是用佇output的,所以愛khng佇input後壁
tw01+tw02_8k
8k wav : sox -G /usr/local/gi2_liau7_khoo3/音檔/MH/MaternalHome-003.wav -b 16 -c 1 -r 16k -t wav - | avconv -i - -f wav -ar 8000 - |
tw01+tw02_8k a-law
| avconv -i - -f alaw -ar 8000 - | avconv -i - -f wav -ar 8000 -
仝語者_8k 仝語者_8k+a-alaw
tw01+tw02 tw01+tw02_8k tw01+tw02_8k_alaw
仝語者 仝語者_8k 仝語者_8k+a-alaw