-
https://github.com/tesseract-ocr/tessdata/tree/3a94ddd47be0
@theraysmith
,
How to present those 'best' files to our users?
https://github.com/tesseract-ocr/tesseract/wiki/Data-Files
Do you p…
-
Is there direct command line to call to convert text to phonemes? i don't want the alignments to the audio, just the phonemes. The use case is after training with a TTS model, in the inference time w…
-
**Is your feature request related to a problem? Please describe.**
I regularely make a lot of typos in code comments, docs, but also in variable names. It is always annoying if pull-requests get post…
rkusa updated
13 minutes ago
-
```bash
python s2s_pipeline.py --local_mac_optimal_settings
```
It seems this is done running setup and ready for me to start speaking? My mic is set to MacBook Pro Microphone. I say somethin…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Contact Details
_No response_
### What should this feature add?
Prompts are mostly written in english. Since…
-
I want to make a large Japanese data set for training, but I encountered some problems while making it.
1、I got a lot of free ttf and otf files from two websites and successfully made them into png…
-
usage: whisper [-h]
[--model {tiny.en,tiny,base.en,base,small.en,small,medium.en,medium,large}]
[--device DEVICE] [--output_dir OUTPUT_DIR] [--verbose VERBOSE]
…
-
Post questions here for this week's fundamental readings:
J. Evans and B. Desikan. 2022. “Deep Learning?” and “Deep Neural network models of text”, Thinking with Deep Learning, chapter 1, 9
Ash…
lkcao updated
7 months ago
-
Hi. I am using exactly the same code as yours in run_sft.sh:
```
#!/bin/bash
CUR_DIR=`pwd`
ROOT=${CUR_DIR}
export PYTHONPATH=${ROOT}:${PYTHONPATH}
VISION_MODEL=openai/clip-vit-large-pa…
-
I have come across this draft specification only very recently, and I can see that a lot of effort has gone in to it. I’ve read through it and have some thoughts around its design, which I’ve put into…