noise-robust-asr Search Results

62 results
for noise-robust-asr

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

RoboTutorLLC/RoboTutor_2019 #298

1.8.9.1 try using NoiseSuppressor to improve ASR

See if NoiseSuppressor makes ASR more robust wrt noise. [https://developer.android.com/reference/android/media/audiofx/NoiseSuppressor](https://developer.android.com/reference/android/media/audiofx/N…

JackMostow updated 5 years ago
3
hirofumi0810/neural_sp #343

How to decode empty target files which only contain backgrou…

I have some background noise wav files in my testset, in order to test the robustness of models. But I find it would remove the empty examples in bin/asr/eval.py when I remove lines 114-115 in "uti…

CSLujunyu updated 3 years ago
1
YuanGongND/whisper-at #16

'Whisper' object has no attribute 'transcribe_audio'

File "/opt/whisper/whisper-at/src/noise_robust_asr/intermediate_feat_extract/as_full/extract_as_full_whisper_all.py", line 35, in extract_audio _, audio_rep = mdl.transcribe_audio(wav) File …

herbiel updated 9 months ago
8
insunhwang89/StyleVC #2

About the speech rate of generated voice

Hi. I tested the model with the inference jupyter file your provided. It's amazing that the model can still generate good voice even if a Mandarin source file is fed as input. However, I notice that…

Charlottecuc updated 1 year ago
7
ftshijt/CALL-proto #21

cnn_tdnnf

您好，我在您的代码里看到了有cnn_tdnn的脚本，想请教一下在训练声学模型的时候是否可以替换为该脚本，因为最近在尝试chime6的相关内容，对后端了解不深。谢谢

shanhaidexiamo updated 4 years ago
1
YuanGongND/whisper-at #5

How to Use Temporal Pooling Layer?

I use “time_pooling = nn.AvgPool2d((60,1))” for whisper large pre-trained model（encoder out size is [batch,1500,1280]）as Temporal Pooling Layer, but for 'last_mlp' and 'last_tr' methods cannot achiev…

Yunlei-AI updated 1 year ago
3
ufal/whisper_streaming #105

New Fork: Web client + WebSocket + own VAD impl.

I have created [fork](https://github.com/marcinmatys/whisper_streaming/blob/main/README2.md) of whisper_streaming , so I took the liberty of writing about it here. We may close this issue soon as it…

marcinmatys updated 2 days ago
9
yongxuUSTC/sednn #54

关于噪声估计的问题

您好，我看了徐勇2015年的博士毕业论文，里面噪声告知训练的部分，在连续7帧即（7,257）的样本中通过取连续几帧的带噪语音的平均功率作为估计的平均噪声功率，这个依据是什么有些看不明白这部分的内容

qq274943639 updated 4 years ago
1
YuanGongND/whisper-at #29

the question about dataset feature

Hi Yuan, The features of the ESC dataset you provided seem to only have whisper-large-v1，But it seems that the provided code includes features from more than one model. Thanks

LithiumZhou updated 5 months ago
5
ggerganov/whisper.cpp #137

Faster streaming support

Have you tried building the spectrogram and encoder output in smaller chunks and appending? I think the spectrogram should generate fairly easily with minimal noise depending on the size of the chunk,…

ameenba updated 7 months ago
27

上一页 1...1 2 3 4 5 6 7...7 下一页

62 results for noise-robust-asr

62 results
for noise-robust-asr