-
As it turns out, the English sets in CommonVoice v2.0 contain out-of-alphabet (of the English 26 letter one plus space plus apostrophe) characters - `[':', 'ú', 'l', "'", 'é', 'á', ';', '’', 'ō', '`&#…
-
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: Ubuntu 16.04
- **TensorFlow installed from (our builds, or upstream TensorFlow)**: mozilla tensorflow
- **TensorFlow version (use comma…
-
A subtitles API, allowing users to import a subtitle for a given video via the PeerTube API, could have a significant impact in the PeerTube network accessibility, and some interesting projects could …
-
@tilmankamp this error kills my jobs seemingly at random, and it happens so frequently it really slows down my workflow.
The job doesn't die instantaneously, but only after a minute or more. This …
-
commonvoice/s5/local/prepare_dict.sh uses the following path to sequitur:
sequitur=$KALDI_ROOT/tools/sequitur
But now sequitur is installed as $KALDI_ROOT/tools/sequitur-g2p. So, it should be:
se…
-
Sondage pour choisir une date: https://framadate.org/foGyOuwuwlXVCYnB
-
For support and discussions, please use our [Discourse forums](https://discourse.mozilla.org/c/deep-speech).
If you've found a bug, or have a feature request, then please create an issue with the f…
-
`87369eabded58abe351f3cca1af834d39de0ba420ea27a51e815ae8c079f3956cd097472a3481a58d7de5be87b6c38a9a07178bb26bb38203e6b5b0baab1c1c8` in `en/test.tsv` of CommonVoice corpus v2.0 has an empty transcript (…
-
Thanks for @nicolaspanel's work, there is a dataset of roughly 200 hours already available that we should be able to use: https://github.com/nicolaspanel/TrainingSpeech
It'd be great to:
- write …
-
@JRMeyer
something like the following from reuben's code in .compute (utf8 branch)
```
#!/bin/bash
set -xe
data="${SHARED_DIR}/data"
fis="${data}/LDC/fisher"
swb="${data}/LDC/LDC97S62/…