castorini / howl

Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
Mozilla Public License 2.0
199 stars 30 forks source link

data sets #119

Open wwyl2000 opened 2 years ago

wwyl2000 commented 2 years ago

Dear Author, Thanks for sharing your package. In your example to generate the data set, "fire" has 2 parts of data, positive and negative. What is the positive data? Was it pre-recorded? Also, if i have a new word to detect, for example, hakunamatata, how to obtain the datasets?

Thanks, WWY

ljj7975 commented 1 year ago

positive refers to audios with target keyword (fire). negative are the audios without target keyword (fire).

training on negative set helps decreasing false positive rate.

Unfortunately, there isn't a good way of generating a dataset for custom wakeword. If it is made up of common word such as hey, hi, cat. Data generation using Mozilla Dataset should work.

However, generating a dataset for non-standard word such as hakunamatata is not yet supported.

wwyl2000 commented 1 year ago

Hi ljj7975, Many thanks for your informaion. Best, wwy