linto-ai/whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
GNU Affero General Public License v3.0
2.01k stars
156 forks
issues
#26 | Sometimes segments with "no_speech_prob" larger than "speech threshold" and "avg_logprob" lower than "logprob_threshold" still appear | ItakeLs | closed | 1 year ago | 2 comments
#25 | Verbose option does not support None (no output) | kamranjon | closed | 1 year ago | 1 comment
#24 | Inconsistent number of segments: whisper_segments (462) != timestamped_word_segments (461) | ItakeLs | closed | 1 year ago | 12 comments
#23 | huggingface_hub.utils._validators.HFValidationError | boxabirds | closed | 1 year ago | 7 comments
#22 | Clarifying whisperX limitations | m-bain | closed | 1 year ago | 6 comments
#21 | Fail with "***.en" models | alexgain | closed | 1 year ago | 6 comments
#20 | Change licence | damienlaine | closed | 1 year ago | 0 comments
#19 | Possible failure "AssertionError: Fatal Error: Got inconsistent logprob for segment <...>: <...> != <...>" | Jeronymous | closed | 1 year ago | 0 comments
#18 | Suggestion: Print timestamped words on-the-fly with option verbose=True | vedran | opened | 1 year ago | 3 comments
#17 | transcribe fails with error: `Argument #4: Padding size should be less than the corresponding input dimension, but got: padding (200, 200) at dimension 2 of input [1, 1, 144]` | eschmidbauer | closed | 1 year ago | 6 comments
#16 | Transcription contains duplicated fragments | romanzoniit | closed | 1 year ago | 5 comments
#15 | Got infinite logprob | tkorchagin | closed | 1 year ago | 9 comments
#14 | "TypeError: can't convert cuda:0 device type tensor to numpy" with option --plot / plot_word_alignment on CUDA | WentingMiao | closed | 1 year ago | 8 comments
#13 | AssertionError: Got inconsistent logprob at index <N>: <XXX> != <YYY> | Jeronymous | closed | 1 year ago | 0 comments
#12 | AssertionError: Got inconsistent logprobs length : 23 != 22 | MohammedMehdiTBER | closed | 1 year ago | 17 comments
#11 | How can align sentences instead of words? | MohammedMehdiTBER | closed | 1 year ago | 1 comment
#10 | Suggestion: Support beam search / nbest decoding (temperature fallback...) | Jeronymous | closed | 1 year ago | 0 comments
#9 | "Got infinite logprob" assertion failure, with option condition_on_previous_text=False | ItakeLs | closed | 1 year ago | 2 comments
#8 | Preparing metadata (setup.py) | romanzoniit | closed | 1 year ago | 1 comment
#7 | Cannot run multiple transcriptions without reloading the model | KMC07 | closed | 1 year ago | 1 comment
#6 | start + end outside length of audio | doublex | opened | 1 year ago | 9 comments
#5 | CSV output changed to better match Gentle output | seanth | closed | 1 year ago | 4 comments
#4 | AssertionError: Inconsistent number of segments: whisper_segments (12) != timestamped_word_segments (11) | romanzoniit | closed | 1 year ago | 16 comments
#3 | word.strip() | doublex | closed | 1 year ago | 3 comments
#2 | Update requirements.txt | seanth | closed | 1 year ago | 0 comments
#1 | WARNING:whisper_timestamped:Inconsistent number of segments: | tkorchagin | closed | 1 year ago | 8 comments