def transcribe(vocals_file_path):
audio = whisperx.load_audio(vocals_file_path)
result = model.transcribe(audio, batch_size=batch_size, task="translate", language="ja")
model_a, metadata = whisperx.load_align_model(language_code=result["language"], device=device)
result = whisperx.align(result["segments"], model_a, metadata, audio, device, return_char_alignments=False)
print(result["segments"])
return result
transcribe("studentvideo.mp4")`
it downloaded this model at this location see "C:\Users\Andriken.cache\huggingface\hub\models--jonatasgrosman--wav2vec2-large-xlsr-53-japanese"
but i'm getting this as a segment results below
when specifying language as "en" I get proper words in word timestamps instead of letters, the letters issue occurs only when specifying language as "ja"
see my code below
it downloaded this model at this location see "C:\Users\Andriken.cache\huggingface\hub\models--jonatasgrosman--wav2vec2-large-xlsr-53-japanese" but i'm getting this as a segment results below
when specifying language as "en" I get proper words in word timestamps instead of letters, the letters issue occurs only when specifying language as "ja"