gveres / donateacry-corpus

An infant cry audio corpus that's being built through the Donate-a-cry campaign - see http://donateacry.com

donateacry-corpus

An infant cry audio corpus that has been built through the Donate-a-cry campaign (no longer active)

Source of these files

This repository contains user-uploaded audio samples in their original, unmodified, unchecked form. The audio samples were uploaded using the Donate-a-cry mobile applications for Android and iOS.

Using the files

The database is published under the ODbL; see the License section below. If you work with the corpus in any way, please drop us a line at hello@newparentsapps.com.

File naming convention

The audio files should contain baby cry samples, with the corresponding tagging information encoded in the filenames. The samples were tagged by the contributors themselves. So here's how to parse the filenames.

iOS:
0D1AD73E-4C5E-45F3-85C4-9A3CB71E8856-1430742197-1.0-m-04-hu.caf
app instance uuid (36 chars)-unix epoch timestamp-app version-gender-age-reason

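So, the above translates to:

app instance uuid: 0D1AD73E-4C5E-45F3-85C4-9A3CB71E8856
unix epoch timestamp: 1430742197 (in seconds), i.e. 2015-05-04 12:23:17 UTC
app version: 1.0
gender tag: m
age tag: 04
reason tag: hu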

Android:
0c8f14a9-6999-485b-97a2-913c1cbf099c-1431028888092-1.7-m-26-sc.3gp
The structure is the same, except that the unix epoch timestamp is in milliseconds rather than seconds.
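
The naming convention above can be parsed mechanically. The following is a minimal Python sketch, not part of the corpus tooling: the names `CrySample`, `FILENAME_RE` and `parse_filename` are hypothetical, and the rule for telling seconds from milliseconds (13 or more digits means milliseconds) is an assumption based on the two example filenames. The gender, age and reason tags are returned as raw strings. Because the files are unchecked, filenames that do not fit the layout raise a `ValueError`.

```python
import re
from datetime import datetime, timedelta, timezone
from typing import NamedTuple


class CrySample(NamedTuple):
    """Fields encoded in a corpus filename (hypothetical container)."""
    app_instance_uuid: str
    recorded_at: datetime  # timezone-aware, UTC
    app_version: str
    gender: str            # raw tag, e.g. "m"
    age: str               # raw tag, e.g. "04"
    reason: str            # raw tag, e.g. "hu"
    extension: str         # e.g. "caf" or "3gp"


# uuid (36 chars)-timestamp-app version-gender-age-reason.extension
FILENAME_RE = re.compile(
    r"^(?P<uuid>[0-9A-Fa-f-]{36})"
    r"-(?P<timestamp>\d+)"
    r"-(?P<version>[^-]+)"
    r"-(?P<gender>[^-]+)"
    r"-(?P<age>[^-]+)"
    r"-(?P<reason>[^-.]+)"
    r"\.(?P<ext>\w+)$"
)

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def parse_filename(name: str) -> CrySample:
    """Split a corpus filename into its tagged fields."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected filename layout: {name!r}")

    raw = match["timestamp"]
    # Assumption: 13+ digits means milliseconds (Android),
    # fewer digits means seconds (iOS).
    millis = int(raw) if len(raw) >= 13 else int(raw) * 1000

    return CrySample(
        app_instance_uuid=match["uuid"],
        recorded_at=EPOCH + timedelta(milliseconds=millis),
        app_version=match["version"],
        gender=match["gender"],
        age=match["age"],
        reason=match["reason"],
        extension=match["ext"],
    )


if __name__ == "__main__":
    sample = parse_filename(
        "0D1AD73E-4C5E-45F3-85C4-9A3CB71E8856-1430742197-1.0-m-04-hu.caf"
    )
    print(sample.recorded_at.isoformat())  # 2015-05-04T12:23:17+00:00
    print(sample.gender, sample.age, sample.reason)  # m 04 hu
```

Keeping the tags as raw strings leaves their interpretation to the Tags section below, and working in integer milliseconds avoids floating-point rounding when converting Android timestamps.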

Tags

Gender

Age

Reason

License

This donateacry-corpus is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/dbcl/1.0/