MEDISCO is a Medical Indonesian Speech Corpus. The medical text corpus is collected from five Indonesian online medical consultation websites. From the text corpus, we created a speech corpus that consists of 360 sentences read by 13 speakers. In total, our speech corpus contains 731 medical terms and consists of 4,680 utterances with a total duration of 10 hours.
Dataloader name:
medisco/medisco.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?medisco