Open wwyl2000 opened 2 years ago
Positive refers to audio clips that contain the target keyword (fire). Negative refers to audio clips that do not contain the target keyword (fire).
Training on the negative set helps decrease the false positive rate.
Unfortunately, there isn't a good way of generating a dataset for a custom wake word. If it is made up of common words such as hey, hi, or cat, data generation using the Mozilla dataset should work.
However, generating a dataset for a non-standard word such as hakunamatata is not yet supported.
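To illustrate the positive/negative split, here is a minimal sketch. It is not the package's actual data generation code. It assumes you already have a list of (audio path, transcript) pairs from some transcribed speech corpus; the function name `split_by_keyword` and the file names are hypothetical.

```python
import re


def split_by_keyword(samples, keyword):
    """Partition (audio_path, transcript) pairs into positive and negative sets.

    A clip is positive when its transcript contains the keyword as a whole
    word, and negative otherwise. Hypothetical helper for illustration only.
    """
    keyword = keyword.lower()
    positive, negative = [], []
    for audio_path, transcript in samples:
        # Lowercase and split into words so "fired" does not match "fire".
        words = re.findall(r"[a-z']+", transcript.lower())
        if keyword in words:
            positive.append(audio_path)
        else:
            negative.append(audio_path)
    return positive, negative


# Made-up example entries; a real corpus would supply these pairs.
samples = [
    ("clip_001.wav", "The fire alarm went off"),
    ("clip_002.wav", "She was fired yesterday"),
    ("clip_003.wav", "Open the window"),
]
positive, negative = split_by_keyword(samples, "fire")
```

With a word that never appears in the transcripts, such as hakunamatata, the positive list comes back empty, which is why this kind of approach only works for common words.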
Hi ljj7975, Many thanks for your information. Best, wwy
Dear Author, Thanks for sharing your package. In your example for generating the dataset, "fire" has two parts of data, positive and negative. What is the positive data? Was it pre-recorded? Also, if I have a new word to detect, for example hakunamatata, how can I obtain the datasets?
Thanks, WWY