linto-ai whisper-timestamped issues

linto-ai / whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

GNU Affero General Public License v3.0

2.01k stars 156 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Trouble with timings

#91 boxabirds closed 1 year ago
4
I want to get high performance - azure

#90 rairavi closed 1 year ago
1
Deprecation warning

#89 aeschylus closed 1 year ago
1
error for long (1 hr) hindi video - used large-v2 whisper model

#87 rairavi closed 1 year ago
3
Words are correct but regular subtitles appear too early and linger?

#85 2600box closed 1 year ago
3
Can't install on M2 Mac

#83 boxabirds closed 1 year ago
3
[Idea] Basic timestamp validation

#82 misutoneko opened 1 year ago
8
Improve Whisper transcription using transcript

#81 lenaten closed 1 year ago
14
AssertionError with --vad, only with medium model

#80 misutoneko closed 1 year ago
2
Try VAD with auditok

#78 Jeronymous closed 11 months ago
1
High Memory Usage and "Killed: 9" Error.

#75 mcgreenwood closed 1 year ago
5
VAD does not handle almost complete silence

#74 freddyertl closed 11 months ago
23
--vad impacts recognition accuracy

#72 freddyertl closed 1 year ago
6
End time of one word is the start time of the next one

#71 konradipipan closed 1 year ago
1
Regular whisper model is still downloaded when using hugginface models

#70 blueskyleaf closed 1 year ago
3
Install whisper-timestamped with only CPU dependencies

#68 agorman closed 1 year ago
3
The result is almost lost when I use whisper timestamped.

#67 YeDaxia closed 1 year ago
5
Can we get a speed boost of %80?

#66 FlowDownTheRiver closed 1 year ago
4
Merge

#65 Jeronymous closed 1 year ago
0
Inconsistent number of segments error

#64 olevanss closed 8 months ago
27
Weird repetition on transcript

#63 catalwaysright closed 1 year ago
2
[Bug] remove_last_null_duration_words

#62 mmichelli closed 1 year ago
7
AssertionError "assert len(segment_tokens_check) < len(segment["tokens"])" with option --accurate

#61 agorman closed 1 year ago
3
Option --vad not working offline (when VAD torch model has been loaded already)

#60 misutoneko closed 1 year ago
1
Inconsistent number of segments: whisper_segments (1352) != timestamped_word_segments (1350)

#59 jeremymatt closed 1 year ago
5
AssertionError: Got empty transcription!

#57 agorman closed 1 year ago
4
[Bug] Unable to use multiple output formats directly (without "all")

#56 misutoneko closed 1 year ago
2
Warning of onnxruntime "Removing initializer 'XXX'. It is not used by any node and should be removed from the model." with option --vad

#55 Jeronymous closed 1 year ago
0
Run Silero VAD before transcribing with Whisper (to reduce hallucinations)

#54 Jeronymous closed 1 year ago
0
Compatibility issues with openai-whisper version 20230306

#53 Jeronymous closed 1 year ago
1
Update to Whisper version 20230306

#51 Jeronymous closed 1 year ago
0
Make Whisper Requirement more flexible to be able to use a specific Whisper version (as some breakages were introducted in 20230306)

#48 kamranjon closed 1 year ago
7
Fatal Error: Got inconsistent text for segment 10

#47 maptz closed 1 year ago
5
Spot (probable) disfluencies in transcription

#46 Jeronymous closed 1 year ago
0
How to write SRT file? Are models the same as whisper?

#42 Adsc58 closed 1 year ago
4
Do not rely on whisper timestamps

#41 Jeronymous closed 1 year ago
0
Consider Supporting CTranslate2 for faster inference

#40 kamranjon opened 1 year ago
15
Question: How to efficiently get attention weights with beam search decoding?

#39 ItakeLs closed 1 year ago
3
Different results with whisper and whisper_timestamped

#38 skanda1005 closed 1 year ago
9
Suggestion: Problem with small words in SRT files

#37 xaxole98 closed 1 year ago
3
Suggestion: Use VAD to improve over Whisper's segment timestamps estimation

#36 Jeronymous closed 1 year ago
6
Suggestion: Add Speaker Diarization

#35 ubanning opened 1 year ago
3
Word level output is combined for Languages that don't use spaces

#34 kamranjon closed 1 year ago
1
Whisper_timestamped does not transcript all the video?

#33 aliscie closed 1 year ago
8
why whisper_timestamped does not transcript the entire video?

#32 aliscie closed 1 year ago
0
Add a max_line_length parameter to subtitle files

#31 ubanning closed 1 year ago
11
Word Timing Accuracy Falls Off After Pauses Example

#30 kamranjon closed 1 year ago
4
Specific functions don't work.

#29 xaxole98 closed 1 year ago
3
Issue of duplicate word lines

#28 MohammedMehdiTBER closed 1 year ago
4
Delay in the word level transcription

#27 ItakeLs closed 1 year ago
3

Previous Next