-
**Is your feature request related to a problem? Please describe.**
The current acoustic training works from scratch and the resulting model feels like it force-guesses the phonetic marker placements …
-
I can use your command to generate alignment based on bpe level. But how to generate alignment based on token level?
-
## Story Explanation
### User Story
As an aligner, I want machine-generated suggestions clearly distinguishable from alignment memory so that I can deal with each category of data appropriately.…
-
When working with C libraries/specifications, you sometimes have to align data of specific types in memory. I came across the need for this while I was learning how to share variables between GLSL and…
-
This comes mostly from me working on re-alignment. I'm moving the grand design I had in #26 to here, and make that pull request more about just supporting alignment info passthrough.
1. For alignme…
-
Hello!
I've been using the WhisperX large-v2 model in English on a project to transcribe vocals taken from songs, which I derive using source separation with spleeter. If it matters, I've been runn…
-
Hi,
First of all, thanks for sharing the source code and data.
I encounter some trouble when reproducing the process of using FastAlign to obtain word-level alignments. By using the code in scri…
-
-
Hi,
I just started using WhispherX on a short (~5min) HCP movie for the word-level timestamp.
I followed the GitHub steps and installed it in Python environment, and tried the following command:
…
-
I am trying to use whisperx for word alignment.