Closed gd-three closed 6 days ago
@gd-three No Buzz does not have this feature. If you need an api see this discussion on some APIs available in the cloud and some that you can run on your computer
@gd-three No Buzz does not have this feature. If you need an api see this discussion on some APIs available in the cloud and some that you can run on your computer
Now, I use a string in my program (buzz add --task transcribe --language zh --model-type whisper --model-size small --txt file path)CMD to perform audio recognition, which starts a new buzz process each time, which is inefficient
@gd-three Please explain what is inefficient and what you would like to be different. If you explain more about your process, I may be able to find a more efficient solution. Where do the audio files come from and how you use the transcription results. Is it something automated?
If the transcription speed is slow, you will need a more powerful hardware or you will need to send your transcription to some API.
This should transcribe using OpenAI API, you will need a API token. Prices according to their price list.
buzz add --task transcribe --language zh --model-type openaiapi --openai-token ABC123DEF456 --txt file path
@gd-three Please explain what is inefficient and what you would like to be different. If you explain more about your process, I may be able to find a more efficient solution. Where do the audio files come from and how you use the transcription results. Is it something automated?
If the transcription speed is slow, you will need a more powerful hardware or you will need to send your transcription to some API.
This should transcribe using OpenAI API, you will need a API token. Prices according to their price list.
buzz add --task transcribe --language zh --model-type openaiapi --openai-token ABC123DEF456 --txt file path
I want to get the results of real-time speech recognition in the program I wrote myself, and then do some things of my own, but the recognition results now only have UI display, there is no storage path, if buzz can provide the path, it is great, it will solve my problem.
@gd-three Buzz stores all transcription results in a SQLite database. Location of the database is printed out in the logs. To see the logs please see this section https://github.com/chidiwilliams/buzz/blob/main/CONTRIBUTING.md#troubleshooting
On Windows you can paste this %USERPROFILE%\AppData\Local\Buzz\Buzz
in the address par of your file manager and get to the location Buzz stores it's internal data.
@gd-three Buzz stores all transcription results in a SQLite database. Location of the database is printed out in the logs. To see the logs please see this section https://github.com/chidiwilliams/buzz/blob/main/CONTRIBUTING.md#troubleshooting
On Windows you can paste this
%USERPROFILE%\AppData\Local\Buzz\Buzz
in the address par of your file manager and get to the location Buzz stores it's internal data.
ok, I know this directory before, I use version 0.8 so I didn't see what you said
I am currently downloading version 1.1 in China very slowly, and only found version 1.0, which has a flash back in real-time recognition.
log===========================================
[2024-11-09 17:58:35,558] locale.
Is there an address in China where i can download it quickly? Or does version 1.1 still have this bug?
Is there an address in China where i can download it quickly? Or does version 1.1 still have this bug?
@gd-three If you were using the old version less than 1.0 before and then upgraded to some latest version you may get a crash on startup. Fix for this bug is to delete the old recording history (everything in the directory mentioned above) OR use the latest development version 1.2.0 from latest builds from here https://github.com/chidiwilliams/buzz/actions/workflows/ci.yml?query=branch%3Amain
Select the latest build, scroll down to the artifacts section and download the installation file. You need to be logged in the Github to download link. This download will be slow unfortunately.
I delete the old recording history and then solve the problem by reinstalling it,thank you for your patience, for you answer.
still, Perferences---->Folder Watch,What is this feature?Does it periodically monitor folders for transcribe?
still, Perferences---->Folder Watch,What is this feature?Does it periodically monitor folders for transcribe?
I found a bug. When there were multiple audio files in the input directory, buzz would create multiple duplicate transcription tasks. After buzz transcribed an audio file, it would move it to the output directory, resulting in subsequent duplicate tasks failing to find the file during transcription.Then lead to an error
@gd-three If you were using the old version less than 1.0 before and then upgraded to some latest version you may get a crash on startup. Fix for this bug is to delete the old recording history (everything in the directory mentioned above) OR use the latest development version 1.2.0 from latest builds from here https://github.com/chidiwilliams/buzz/actions/workflows/ci.yml?query=branch%3Amain
Select the latest build, scroll down to the artifacts section and download the installation file. You need to be logged in the Github to download link. This download will be slow unfortunately.
please see above 👆
@gd-three I am unable to replicate this on latest development version 1.2.0. There was such bug but it was fixed.
Can you please ensure you are using 1.2.0
and provide detailed steps on how to replicate.
Test 1:
This worked, files from input folder were processed.
Test 2:
This also worked, all files from input folder were processed, no duplicates and no errors.
@gd-three I am unable to replicate this on latest development version 1.2.0. There was such bug but it was fixed.
Can you please ensure you are using
1.2.0
and provide detailed steps on how to replicate.Test 1:
- Have a separate input and output folders
- Have 3 files in input folder
- Enable folder watch in preferences
This worked, files from input folder were processed.
Test 2:
- Have a separate input and output folders
- Turn off Buzz
- Add files to input folder
- Start buzz
This also worked, all files from input folder were processed, no duplicates and no errors.
In that case, it's okay, I'm using version 1.0.1, not developer version 1.2. Ignore this problem.
I want to do speech recognition through the program API, Is there a way or method to do it now? (anxious)