Closed andriken closed 2 weeks ago
if all the vocabulary in your text are included in the model's vocabulary, then you need to use romanize=False
, romanization turns all languages into roman letters which probably do not exist in a Japanese model
yeah now it works, but why am i getting the output like this always
[{'start': 1.22, 'end': 1.22, 'text': 'そ', 'score': 0.0}, {'start': 1.32, 'end': 1.32, 'text': 'れ', 'score': 0.0}, {'start': 1.34, 'end': 1.36, 'text': ' ', 'score': 0.0}, {'start': 1.4, 'end': 1.4, 'text': 'か', 'score': 0.0}, {'start': 1.52, 'end': 1.52, 'text': 'ら', 'score': 0.0}, {'start': 1.54, 'end': 1.56, 'text': ' ', 'score': 0.0}, {'start': 1.82, 'end': 1.82, 'text': '母', 'score': 0.0}, {'start': 1.86, 'end': 1.86, 'text': 'さ', 'score': 0.0}, {'start': 1.88, 'end': 1.9, 'text': ' ', 'score': 0.0}
shouldn't it be "text" and then the segments shown in the json output in the ReadMD file like this below
{ "text": "This is a sample text to be aligned with the audio.", "segments": [ { "start": 0.000, "end": 1.234, "text": "This" },
It should when outputting the results to a json file, if you are using it in python your output would be the correct one
This is my python usage below
so I got this below error, I confirm that It downloaded the model successfully and didn't do anything, then I ran again the code and got this error