A Thai-language dataset for spoof detection. It consists of genuine speech signals and several types of spoofed speech signals. The spoofed speech is generated using text-to-speech tools for the Thai language, synthesis tools, and speech modification tools. Accessing the dataset requires creating a free account on the AI for Thai portal.
Subsets
-
Languages
tha
Tasks
Hoax Detection, Spoken Language Understanding
License
Creative Commons Attribution Non Commercial Share Alike 3.0 (cc-by-nc-sa-3.0)
Dataloader name
thai_spoof/thai_spoof.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?thai_spoof