espnet / espnet_model_zoo

ESPnet Model Zoo
Apache License 2.0
243 stars 41 forks source link

Missing getitem on huggingface page: #68

Open boeddeker opened 1 year ago

boeddeker commented 1 year ago

Hi, I tried a model from Huggingface (https://huggingface.co/espnet/simpleoier_librispeech_asr_train_asr_conformer7_wavlm_large_raw_en_bpe5000_sp) and copied the code from the "Use in ESPnet" button. The example was broken, I had to change

text, *_ = model(speech)

to

text, *_ = model(speech)[0]

According to the readme of espnet_model_zoo, the user has to use the getitem first. I don't know, how to fix that. Could you fix the example on huggingface?

Here, the examples from huggingface and github with the mismatch of the expected output of Speech2Text: image

image