-
it says
------
Welcome back, to speak press RIGHT_CTRL.
Connecting to VTube Studio!
You (mic) >Authentication request sent to VTube Studio, please click allow.
VTube Studio connected! at port 800…
-
Hi Andreas,
thanks for the great project! 🚀
## Description
I would like to request a new feature: an integrated **cost estimation feature**. This feature would provide users with an estimated c…
ka1h updated
4 months ago
-
One of the most annoying features of the crowdsourced subtitles from Viki is that they insist on transcribing and translating _every single k-pop song_. All of them.
They are kind enough, however, …
-
If I pass in `mps` to device option it will crush. Would be wonderful if M1 GPU can be supported
```
❯ whisperx assets/test.mp3 --device mps --model large-v2 --vad_filter --align_model WAV2VEC2_AS…
-
I am processing hour wav as below. there is a few part that transcribed as repetitive words such as :
[SPEAKER01]
As in there As in there As in there As in there As in there As in there As in …
-
Some manuscripts transmit both the root text as well as a commentary. Is there any way, apart from transcribing them into separate TEI files, of assigning different sigla to the manuscript, depending…
-
I am transcribing some 1600’s marriages right now. And it occurs to me how strange the spellings of the first names and surnames are.
How will they fair with our Search tool?
A search on Surname onl…
-
Hi! I am using Layla for baseline detection as a part of Loghi. I've noticed that there are times when, despite being a whole line in the input image, the Laypa model recognizes this line as two separ…
-
This is very much a rough draft, or even worse, just some notes. However I wanted to share them to see if there might be demand for a adding feature like this to seedsigner. Feedback and comments welc…
-
![image](https://github.com/m-bain/whisperX/assets/23407436/7567c6e6-8fc7-499c-b778-6f3223cfdc73)
The segments must be around between 1 - 5 seconds. The subtitles are not readable like this. I susp…