dessalines / torrenttunes-client

DEPRECATED: A BitTorrent-based music streaming service.
GNU General Public License v3.0

Use NFO-data instead of Musicbrainz for initial scraping #48

Closed. tritnaha closed this issue 8 years ago

tritnaha commented 8 years ago

Instead of having to query musicbrainz for info with every single file, you could scan the directory for the existence of nfo-files and scrape the data off those if they exist, at least initially.

nfo-files are used within the scene and contain all the valuable information about the release, meaning you wouldn't really have to ping someplace like musicbrainz like crazy. Quality is also guaranteed, as releases that are sub-par or break the scene rules (FLAC or MP3) will automatically be nuked.
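
A minimal sketch of the directory scan suggested above, in Python. It only finds which audio directories have an nfo file next to them; it does not parse the nfo contents, since those are free-form text. The function names and the list of audio extensions are assumptions made for illustration, not part of torrenttunes.

```python
from pathlib import Path

# Assumption: which extensions count as audio files for this sketch.
AUDIO_EXTENSIONS = {".mp3", ".flac"}


def find_nfo_files(library_root):
    """Map each directory that holds audio files to the .nfo files beside them."""
    result = {}
    for path in Path(library_root).rglob("*"):
        if path.is_file() and path.suffix.lower() in AUDIO_EXTENSIONS:
            directory = path.parent
            if directory not in result:
                result[directory] = sorted(
                    p for p in directory.iterdir()
                    if p.is_file() and p.suffix.lower() == ".nfo"
                )
    return result


def split_by_nfo(library_root):
    """Return (directories with an nfo, directories without one)."""
    with_nfo, without_nfo = [], []
    for directory, nfos in find_nfo_files(library_root).items():
        (with_nfo if nfos else without_nfo).append(directory)
    return sorted(with_nfo), sorted(without_nfo)
```

Directories in the second list would still need a musicbrainz lookup; those in the first could be tagged from the nfo initially.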

dessalines commented 8 years ago

Do these nfo files contain the musicbrainz id for each song within them?

tritnaha commented 8 years ago

No, they do not; they don't use any external service at all in that regard. There's no real need, as (of course, mistakes can and will happen) the information is almost always correct. It is usually fetched via exactly something like musicbrainz or discogdb, usually a combination of both.

My suggestion would be to use the info off the nfos initially, as this should be reliable enough; doing this would speed up library creation significantly. I do not know if this is possible, but wouldn't it be a decent idea to either offload musicbrainz by hosting a local copy/dump someplace ourselves, or ship torrenttunes with a standard, recent musicbrainz db that it can use for faster indexing/tagging? You could probably work some rsync/zsync magic into it to fetch daily/weekly/monthly updates of said database, or query a musicbrainz server that we would run ourselves.

dessalines commented 8 years ago

No real need.

Musicbrainz is the holy grail of open source music information. Those .nfo files are pretty much useless unless they link to it. Since I've decided to use it to keep everything well organized and tagged, I can't use .nfos.

For example, here's a link to a song I have on torrenttunes: http://torrenttunes.ml/?song=a8322e8f-cee8-49d8-808c-2578c444d656 Which has a musicbrainz link at: http://musicbrainz.org/recording/a8322e8f-cee8-49d8-808c-2578c444d656
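
To illustrate why the musicbrainz id matters here, a small Python sketch: one recording id yields both links. The two URL patterns are inferred only from the example links in this comment, and `song_links` is a hypothetical helper, not a torrenttunes function.

```python
import uuid

# Assumption: URL patterns inferred from the two example links in this thread.
TORRENTTUNES_SONG_URL = "http://torrenttunes.ml/?song={mbid}"
MUSICBRAINZ_RECORDING_URL = "http://musicbrainz.org/recording/{mbid}"


def song_links(mbid):
    """Build both links from one musicbrainz recording id (a UUID)."""
    # uuid.UUID raises ValueError if the id is not a well-formed UUID.
    canonical = str(uuid.UUID(mbid))
    return {
        "torrenttunes": TORRENTTUNES_SONG_URL.format(mbid=canonical),
        "musicbrainz": MUSICBRAINZ_RECORDING_URL.format(mbid=canonical),
    }


links = song_links("a8322e8f-cee8-49d8-808c-2578c444d656")
print(links["musicbrainz"])
# prints http://musicbrainz.org/recording/a8322e8f-cee8-49d8-808c-2578c444d656
```

An nfo file without that id gives no such key to join on, which is the objection raised above.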

The musicbrainz DB is pretty massive, and although they do have docs for setting it up and replicating it yourself, I don't think sub-1-second song tagging is required yet.

You'd be surprised, going through my own collection, how many songs are mis-tagged. I guesstimate about 30% of my albums were misnamed, and probably 10% of songs were. I usually scan one artist at a time, fixing the tags and making sure musicbrainz is able to identify them.

tritnaha commented 8 years ago

I wish you the best of luck!

I suppose you could look at magnet links instead of creating copious amounts of torrents too; that'd be even neater.
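
For reference, a rough Python sketch of what building a magnet link looks like. It assumes a 40-character hex BitTorrent v1 info hash is already known; `magnet_link` is a hypothetical helper, and the hash, name, and tracker below are made-up placeholders.

```python
from urllib.parse import quote

HEX_DIGITS = set("0123456789abcdef")


def magnet_link(info_hash, name=None, trackers=()):
    """Build a magnet URI from a 40-character hex info hash."""
    info_hash = info_hash.lower()
    if len(info_hash) != 40 or not set(info_hash) <= HEX_DIGITS:
        raise ValueError("expected a 40-character hex info hash")
    parts = ["xt=urn:btih:" + info_hash]
    if name:
        # dn is the display name; percent-encode everything unsafe.
        parts.append("dn=" + quote(name, safe=""))
    for tracker in trackers:
        parts.append("tr=" + quote(tracker, safe=""))
    return "magnet:?" + "&".join(parts)


# Placeholder values, not a real torrent.
print(magnet_link("0123456789abcdef0123456789abcdef01234567", name="Some Album"))
# prints magnet:?xt=urn:btih:0123456789abcdef0123456789abcdef01234567&dn=Some%20Album
```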