Open sribich opened 3 months ago
full_get_segment_bytes exists, but not full_get_token_bytes.
full_get_segment_bytes exists
full_get_token_bytes
For non English languages, whisper can split tokens on non-valid UTF8 boundaries making finer grained parsing impossible with the current text methods.
text
full_get_segment_bytes exists
, but notfull_get_token_bytes
.For non English languages, whisper can split tokens on non-valid UTF8 boundaries making finer grained parsing impossible with the current
text
methods.