MTG / mtg-jamendo-dataset

Metadata, scripts and baselines for the MTG-Jamendo dataset
Apache License 2.0
264 stars 37 forks source link

Dataset in a lossless format #31

Closed gogo9th closed 2 years ago

gogo9th commented 2 years ago

Hi,

Thanks very much for this great dataset. I am working on a research project that requires lossless music files as input. I wonder if there is any way for us (or me) to get a .wav or .flac lossless version of your dataset by any chance.

alastair commented 2 years ago

Hi, You have two options to obtain these files in flac format. One is to download the mp3 version, and convert them to wav, e.g. using lame.

If you want a non-compressed version, you can also download the audio files again from jamendo using their API: https://developer.jamendo.com/v3.0/tracks/file

for example, the track_id field of our data files includes the jamendo id, which you can use in their API: https://raw.githubusercontent.com/MTG/mtg-jamendo-dataset/master/data/autotagging_moodtheme.tsv

Note that the data in jamendo may have changed since we created the dataset, and some files may no longer exist.

gogo9th commented 2 years ago

I am sorry for my late reply.. and thanks very much for your useful tip. I'm glad to learn that uncompressed high-quality tracks are available.

About your website's APIs, I wonder if you could please give me a tip for the following 2 types of user requests:

  1. Retrieve all tracks of an uncompressed format
  2. An example of downloading a particular track among the tracks retrieved from the tracks from step 1.

I really appreciate your support.

alastair commented 2 years ago

Hi, Please note that this dataset is not an official product of Jamendo, who hosts the data. This dataset is an independent project developed by an academic research group.

You may find this repository useful for some example python code to download uncompressed jamendo audio: https://github.com/AudioCommons/jamendo-downloader

gogo9th commented 2 years ago

Thanks for the URL of the new repository.

When I run it, I get the following :

:Desktop/jamendo-downloader$ python3 getplaylist.py track_0000948 out.json
offset 0
results is empty
Done

Do you have any idea? I wonder if this is because I have to set the API key to some value (?) in config.py.

gogo9th commented 2 years ago

I created an account from Jamendo and got the client ID 4044a455. Now I can download tracks. Thanks very much!