DagsHub / audio-datasets

open-source audio datasets
https://dagshub.com/DagsHub/audio-datasets

Coswara Dataset #32

Closed by mertbozkir 3 years ago

mertbozkir commented 3 years ago

Claim Dataset: Coswara

About Dataset: The COVID-19 pandemic presents global challenges transcending boundaries of country, race, religion, and economy. The current gold standard method for COVID-19 detection is reverse transcription-polymerase chain reaction (RT-PCR) testing. However, this method is expensive, time-consuming, and violates social distancing. Also, as the pandemic is expected to stay for a while, there is a need for an alternative diagnostic tool that overcomes these limitations and is deployable at a large scale. The prominent symptoms of COVID-19 include cough and breathing difficulties. We foresee that respiratory sounds, when analyzed using machine learning techniques, can provide useful insights, enabling the design of a diagnostic tool.

Towards this, the paper presents an early effort in creating (and analyzing) a database, called Coswara, of respiratory sounds, namely, cough, breath, and voice. The sound samples are collected via worldwide crowdsourcing using a website application. The curated dataset is released as open access. As the pandemic is evolving, data collection and analysis are a work in progress. We believe that insight from the analysis of Coswara can be effective in enabling sound-based technology solutions for point-of-care diagnosis of respiratory infection, and in the near future, this can help to diagnose COVID-19.