-
I aligned my audio with text, breaking at the sentence level. I open Audacity and utilize the start and end segment times to review the alignment. On multiple occasions, I've found the last word fra…
-
I assumed `turbo ` model is supported by the latest `stable-ts`.
I get the following error. I guess the error message indicates incompatibility of alignment heads:
```
Traceback (most recent call…
-
First of all, thanks so much for all your work on this and making it open source! It would be cool if it were possible to do a fragment search using an existing SRT transcription without having to re-…
-
**Is your feature request related to a problem? Please describe.**
Need Phoneme Timings...
**Describe the solution you'd like**
I used Gentle before to get phoneme timestamps but it was hard to i…
-
we can set the numbering format of a list style manaually in the word, but is there any way that I can use office.js to set the numbering format by code? I used `const style = context.document.addStyl…
-
Hi @nkilm! I found your repo, which does most of what I want. Did you happen to implement the confidence score functionality from the [whisperX](https://github.com/m-bain/whisperX) library? If not, I'…
-
### Description
Looking at the value tuple type `(bool, Nullable)` i expect the following layout:
1 byte for the bool + 3 padding bytes
1 byte for the bool of the nullable + 3 padding bytes
4 bytes …
-
(See prior discussion on ![](https://github.trello.services/images/mini-trello-icon.png) [As a reader, I can view the transcription side by side with a word level alignment](https://trello.com/c/3mx94…
-
# Task Name
Localization of named entities in the spoken utterance
## Task Objective
The goal of NEL is to predict the start and end times of any named entities in a spoken utterance. NEL is …
-
I want to have the time codes at the word level, but for the need of inference time optimization, I just want to retrieve the time codes of faster whisper and not use the alignment model. Has anyone d…