tylerchr / parallel-database

An experimental parallelized database optimized for read performance
2 stars 0 forks source link

Create tool to convert Million Song data to our database format #1

Closed tylerchr closed 8 years ago

tylerchr commented 8 years ago

Here is the million song thing:

http://labrosa.ee.columbia.edu/millionsong/pages/example-track-description

tdecker91 commented 8 years ago

I converted the 1 million song file using the generator tool. Even after changes to the way the hasher worked it was still incredibly slow on my laptop. But it was able to finish in like 2 minutes on my desktop. The resulting data.db file is 1GB so I recommend generating rather than hosting it something for us to download.

tdecker91 commented 8 years ago

Here's a link to the full database file https://www.dropbox.com/s/gyto9l22u01978l/data.db?dl=0