The whisper models can transcribe and translate at the same time, if configured correctly.
The models were trained on either English-only data or multilingual data. The English-only models were trained on the task of speech recognition. The multilingual models were trained on both speech recognition and speech translation. For speech recognition, the model predicts transcriptions in the same language as the audio. For speech translation, the model predicts transcriptions to a different language to the audio.
source
If we add this feature to VRCT, combined with the feature of running these models on the GPU, it could result in very fast, high quality translations.
Google Translate:
ウィスパー モデルは、正しく構成されていれば、文字起こしと翻訳を同時に行うことができます。
The whisper models can transcribe and translate at the same time, if configured correctly.
If we add this feature to VRCT, combined with the feature of running these models on the GPU, it could result in very fast, high quality translations.
Google Translate: ウィスパー モデルは、正しく構成されていれば、文字起こしと翻訳を同時に行うことができます。
この機能を VRCT に追加し、これらのモデルを GPU で実行する機能と組み合わせると、非常に高速で高品質の翻訳が可能になります。