MTG / freesound-datasets

A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.
https://annotator.freesound.org/
GNU Affero General Public License v3.0
134 stars 12 forks source link

Download button unavailable - Early request of FSD dataset #157

Closed jerpint closed 5 years ago

jerpint commented 5 years ago

I'm currently logged in to the website, https://datasets.freesound.org/fsd/discussion/ , I see the Download tab but it is not active (it doesn't link to anything). Is the dataset not yet available for download? Is it possible to request it temporarily?

xavierfav commented 5 years ago

The dataset is not available for download yet. However, there is a resulting dataset with a fewer number of categories that is available to download here.

It is possible to provide you the ground truth annotations generated in the Freesound Datasets platform, but you will have to download the corresponding audio content from Freesound (with the API).

Let us know if you are interested.

jerpint commented 5 years ago

@xavierfav I would be interested in getting the ground truth annotations you speak of, how would that work? Thank you!

xavierfav commented 5 years ago

We can share a Json file:

{
  "<AudioSet_category_ID>": 
    {
      "PP": [<freesound_id>, ...],
      "PNP": [<freesound_id>, ...],
    }, 
  "<AudioSet_category_ID>":
    { ...
    ....
}

Where for each category you have a list of Present and Predominant (PP) & Present but Not Predominant (PNP) Freesound sound ids. Categories come from the AudioSet Ontology.

It is possible to include more stuff if you want, such as the candidate annotations generated by our automatic method, its quality estimate, the sounds that are considered Not Present, the raw votes, etc... We use annotator agreement to consider that a sound category is present or not in an audio sample. We use some tests in the forms to discard votes from bad annotators, or spam. We also propagate the annotations from the leaf categories to some of its parents. Depending on your application, you can focus on very specific categories, broader ones, or all at once.

Then, for the audio content, you can get it from the Freesound API (as well as metadata and acoustic features). For that, you can find the documentation here, and some clients libraries for different languages here.

Send me an email: xavier.favory[at]upf.edu

xavierfav commented 5 years ago

We finally decided we were not providing any early release of the FSD dataset. We are working on the first release and the data will be available soon (a couple of months).

zengqingfa commented 5 years ago

Is there a way to download the bulk download of the relevant data? image Like this one where I want to download all the audio of the car honking instead of one by one, what do I do?