hummingbird-me / kitsu-tools

:hammer: The tools we use to build Kitsu, the coolest platform for anime and manga
https://kitsu.app
Apache License 2.0
2.09k stars 265 forks source link

Support for lookup of series from file hash? #87

Closed pharaun closed 10 years ago

pharaun commented 10 years ago

Basically one of the biggest draw to AniDB for ex is the support for hashing the file and then using this to look up information on the series from the file hash.

The unfortunate fact is lots of time files can be renamed, scrambled up, or so forth so that the information on the file name is not valid or even useful. Hashing enables us to find out what group released the file and other related metadata.

Another useful aspect of hashing off the filename is it allows you to programically re-organize your collection by fetching all form of useful information out of say AniDB and then use that to move/cleanup and rename all of the files into something that is more consistent and uniform and re-structure your directory-tree based off a useful schema.

NuckChorris commented 10 years ago

Similar to #82 this would require them to embrace fansubbing directly which, like it or not, is just not gonna happen.

While I can understand the frustration with this as a programmer (hashing seems like a good solution from that perspective) you also have to understand the human angle here:

This would take a lot of manpower. Crowdsourcing could probably get the manpower, but why bother? As I stated in the Hacker News comments it's really not necessary: parsing filenames is surprisingly simple. Even when you've renamed files, I don't think you'd remove the series and episode number from it. And even if you do, MKV fansubs usually have easy-to-parse metadata attached to them. We could pull from that.

Basically, it's a lot of work for a very very very very rare case when it won't parse correctly.