CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6
GNU Affero General Public License v3.0
1.55k stars 96 forks source link

时间戳对齐无效&&未来可能会支持sensevoice吗? #241

Open SerDavil opened 6 days ago

SerDavil commented 6 days ago

日志如下: ==========TimeStample_Alignment==========

TimeStample alignment transform transcript result... process audio... load wav2vec2 model...

Wav2Vec2ForCTC requires the PyTorch library but it was not found in your environment. Checkout the instructions on the installation page: https://pytorch.org/get-started/locally/ and follow the ones that match your environment. Please note that you may need to restart your runtime after installation.

Error loading model from huggingface, check https://huggingface.co/models for finetuned wav2vec2.0 models alignment Error Error: The chosen align_model "jonatasgrosman/wav2vec2-large-xlsr-53-japanese" could not be found in huggingface (https://huggingface.co/models) or torchaudio (https://pytorch.org/audio/stable/pipelines.html#id14) UPdata DataModel

======================================= 按理说,软件应该是包含所有需要的python环境了吧?ummmm,是不是打包什么的漏了些内容呢?

另外,未来可能会支持sensevoice吗?

andeziod commented 5 days ago

223

之前和其他人讨论过,0.8.1有bug,得退回0.8.0才能用时间戳。期待下一个版本修复吧。