-
Dear @sanchit-gandhi,
I was following your tutorial, [Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers](https://huggingface.co/blog/fine-tune-whisper), to fine-tune Whisper with a dataset i…
-
Transformers 4.35 only supports speculative decoding for batch size == 1. In order to use speculative decoding for batch size > 1, please make sure to use this branch: https://github.com/huggingface/t…
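For reference, a minimal batch-size-1 sketch of assisted (speculative) decoding via `generate(assistant_model=...)`; the Whisper/Distil-Whisper checkpoints and the `audio_array` input are illustrative assumptions:
```python
from transformers import AutoModelForSpeechSeq2Seq, AutoProcessor

# Main model and a smaller assistant (draft) model; checkpoints are illustrative
processor = AutoProcessor.from_pretrained("openai/whisper-large-v2")
model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-large-v2")
assistant = AutoModelForSpeechSeq2Seq.from_pretrained("distil-whisper/distil-large-v2")

# audio_array: a 1-D float array sampled at 16 kHz (assumed input)
inputs = processor(audio_array, sampling_rate=16000, return_tensors="pt")

# Passing assistant_model enables speculative (assisted) decoding
generated_ids = model.generate(inputs.input_features, assistant_model=assistant)
print(processor.batch_decode(generated_ids, skip_special_tokens=True))
```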
-
Notice: In order to resolve issues more efficiently, please raise your issue following the template and include supporting details.
## ❓ Questions and Help
### Before asking:
1. search the iss…
-
Could you add native speech-to-speech / audio-to-audio support, with an encoder (tokenizer) and a decoder (back to audio waveforms)?
I was able to implement a decoder-only model: I first used an audio codec to…
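Not a native API, but one way to sketch the codec step described above is EnCodec in 🤗 Transformers; the checkpoint and the `waveform` array are assumptions:
```python
from transformers import EncodecModel, AutoProcessor

# EnCodec acts as the audio "tokenizer": waveform -> discrete codes -> waveform
model = EncodecModel.from_pretrained("facebook/encodec_24khz")
processor = AutoProcessor.from_pretrained("facebook/encodec_24khz")

# waveform: 1-D float array at 24 kHz (assumed input)
inputs = processor(raw_audio=waveform, sampling_rate=24000, return_tensors="pt")

# Encoder: waveform -> discrete codes a decoder-only LM could be trained on
encoded = model.encode(inputs["input_values"], inputs["padding_mask"])

# Decoder: discrete codes -> an audio waveform again
decoded = model.decode(encoded.audio_codes, encoded.audio_scales, inputs["padding_mask"])
audio_out = decoded.audio_values
```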
-
Following the documentation at https://github.com/modelscope/FunASR/blob/main/runtime/http/readme_zh.md, I built an HTTP server myself, and it starts normally.
However, calling it with `curl -F "file=@example.wav" 127.0.0.1:80` produces an error:
basic_string::_M_construct null not v…
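For debugging, the same request can be issued from Python; this mirrors the curl call above and assumes the server expects a multipart field named `file`:
```python
import requests

with open("example.wav", "rb") as f:
    resp = requests.post("http://127.0.0.1:80", files={"file": f})
print(resp.status_code, resp.text)
```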
-
I need to make certain modifications to the code, such as converting the sampling rate of the WAV file before reading it and then transcribing the speech. However, if I run transcribe_wav.py directly, it …
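As a sketch, the sampling-rate conversion could be done with torchaudio before the file is handed to transcribe_wav.py; the 16 kHz target and file names are assumptions:
```python
import torchaudio

# Load the original file and resample to 16 kHz if needed (target rate assumed)
waveform, sr = torchaudio.load("input.wav")
if sr != 16000:
    waveform = torchaudio.transforms.Resample(orig_freq=sr, new_freq=16000)(waveform)
torchaudio.save("input_16k.wav", waveform, 16000)
```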
-
In Unit 4: Pretrained models for audio classification
We’ll load an official [Audio Spectrogram Transformer](https://huggingface.co/docs/transformers/model_doc/audio-spectrogram-transformer) checkpo…
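A minimal sketch of loading such a checkpoint with the audio-classification pipeline; the Speech Commands checkpoint name below is an assumption based on the course unit:
```python
from transformers import pipeline

# Checkpoint name assumed; replace with the one from the unit if it differs
classifier = pipeline(
    "audio-classification",
    model="MIT/ast-finetuned-speech-commands-v2",
)
print(classifier("sample.wav"))  # sample.wav is a placeholder 16 kHz file
```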
-
I am trying to use the feature extractor on my audio files.
My audio files are all 16000 Hz and 5 seconds long,
so `waveform.shape[1]` is 80000 (16000 samples/s × 5 s).
```python
input_values = feature_extractor(waveform, sampli…
```
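For comparison, a complete call of this shape, assuming a Wav2Vec2-style feature extractor at 16 kHz (the checkpoint is illustrative):
```python
from transformers import Wav2Vec2FeatureExtractor

feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base-960h")

# waveform: (1, 80000) array = 5 s at 16000 Hz (assumed input)
features = feature_extractor(waveform, sampling_rate=16000, return_tensors="pt")
print(features.input_values.shape)  # (1, 80000): length is unchanged by default
```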
-
### Describe the bug
```python
import torch
from transformers import pipeline
import bentoml

pipe = pipeline(
    "automatic-speech-recognition",
)
bentoml.transformers.save_model(
    "automatic-s…
```
-
Hello! I am trying to reproduce the results that were achieved by pretrained models described in [librispeech_example.md](https://github.com/pytorch/fairseq/blob/main/examples/speech_to_text/docs/libr…