issues
search
linto-ai
/
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
GNU Affero General Public License v3.0
2.01k
stars
156
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Trouble with timings
#91
boxabirds
closed
1 year ago
4
I want to get high performance - azure
#90
rairavi
closed
1 year ago
1
Deprecation warning
#89
aeschylus
closed
1 year ago
1
error for long (1 hr) hindi video - used large-v2 whisper model
#87
rairavi
closed
1 year ago
3
Words are correct but regular subtitles appear too early and linger?
#85
2600box
closed
1 year ago
3
Can't install on M2 Mac
#83
boxabirds
closed
1 year ago
3
[Idea] Basic timestamp validation
#82
misutoneko
opened
1 year ago
8
Improve Whisper transcription using transcript
#81
lenaten
closed
1 year ago
14
AssertionError with --vad, only with medium model
#80
misutoneko
closed
1 year ago
2
Try VAD with auditok
#78
Jeronymous
closed
11 months ago
1
High Memory Usage and "Killed: 9" Error.
#75
mcgreenwood
closed
1 year ago
5
VAD does not handle almost complete silence
#74
freddyertl
closed
11 months ago
23
--vad impacts recognition accuracy
#72
freddyertl
closed
1 year ago
6
End time of one word is the start time of the next one
#71
konradipipan
closed
1 year ago
1
Regular whisper model is still downloaded when using hugginface models
#70
blueskyleaf
closed
1 year ago
3
Install whisper-timestamped with only CPU dependencies
#68
agorman
closed
1 year ago
3
The result is almost lost when I use whisper timestamped.
#67
YeDaxia
closed
1 year ago
5
Can we get a speed boost of %80?
#66
FlowDownTheRiver
closed
1 year ago
4
Merge
#65
Jeronymous
closed
1 year ago
0
Inconsistent number of segments error
#64
olevanss
closed
8 months ago
27
Weird repetition on transcript
#63
catalwaysright
closed
1 year ago
2
[Bug] remove_last_null_duration_words
#62
mmichelli
closed
1 year ago
7
AssertionError "assert len(segment_tokens_check) < len(segment["tokens"])" with option --accurate
#61
agorman
closed
1 year ago
3
Option --vad not working offline (when VAD torch model has been loaded already)
#60
misutoneko
closed
1 year ago
1
Inconsistent number of segments: whisper_segments (1352) != timestamped_word_segments (1350)
#59
jeremymatt
closed
1 year ago
5
AssertionError: Got empty transcription!
#57
agorman
closed
1 year ago
4
[Bug] Unable to use multiple output formats directly (without "all")
#56
misutoneko
closed
1 year ago
2
Warning of onnxruntime "Removing initializer 'XXX'. It is not used by any node and should be removed from the model." with option --vad
#55
Jeronymous
closed
1 year ago
0
Run Silero VAD before transcribing with Whisper (to reduce hallucinations)
#54
Jeronymous
closed
1 year ago
0
Compatibility issues with openai-whisper version 20230306
#53
Jeronymous
closed
1 year ago
1
Update to Whisper version 20230306
#51
Jeronymous
closed
1 year ago
0
Make Whisper Requirement more flexible to be able to use a specific Whisper version (as some breakages were introducted in 20230306)
#48
kamranjon
closed
1 year ago
7
Fatal Error: Got inconsistent text for segment 10
#47
maptz
closed
1 year ago
5
Spot (probable) disfluencies in transcription
#46
Jeronymous
closed
1 year ago
0
How to write SRT file? Are models the same as whisper?
#42
Adsc58
closed
1 year ago
4
Do not rely on whisper timestamps
#41
Jeronymous
closed
1 year ago
0
Consider Supporting CTranslate2 for faster inference
#40
kamranjon
opened
1 year ago
15
Question: How to efficiently get attention weights with beam search decoding?
#39
ItakeLs
closed
1 year ago
3
Different results with whisper and whisper_timestamped
#38
skanda1005
closed
1 year ago
9
Suggestion: Problem with small words in SRT files
#37
xaxole98
closed
1 year ago
3
Suggestion: Use VAD to improve over Whisper's segment timestamps estimation
#36
Jeronymous
closed
1 year ago
6
Suggestion: Add Speaker Diarization
#35
ubanning
opened
1 year ago
3
Word level output is combined for Languages that don't use spaces
#34
kamranjon
closed
1 year ago
1
Whisper_timestamped does not transcript all the video?
#33
aliscie
closed
1 year ago
8
why whisper_timestamped does not transcript the entire video?
#32
aliscie
closed
1 year ago
0
Add a max_line_length parameter to subtitle files
#31
ubanning
closed
1 year ago
11
Word Timing Accuracy Falls Off After Pauses Example
#30
kamranjon
closed
1 year ago
4
Specific functions don't work.
#29
xaxole98
closed
1 year ago
3
Issue of duplicate word lines
#28
MohammedMehdiTBER
closed
1 year ago
4
Delay in the word level transcription
#27
ItakeLs
closed
1 year ago
3
Previous
Next