demophoon / trello-concert-tracker

Track concerts and post them to trello

Investigate "Likely Artist" heuristic or service #12

Open rattboi opened 7 years ago

rattboi commented 7 years ago

Right now, some artists get dropped because venues format their listings inconsistently. It'd be cool to either write a solid heuristic to remove cruft from artist names, or perhaps farm it out to another service that could do "Oh, did you mean ?".

Example: Roseland has "Pixies (Night One)" and "Pixies (Night Two)". Last.fm didn't know what to do with that, so it was skipped.
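
For that specific case, a minimal sketch of a cleanup heuristic could look like the following. The pattern and the function name `strip_trailing_note` are hypothetical, not code from this repo, and the rule would also mangle artists whose real name ends in a parenthetical.

```python
import re

# Hypothetical rule: a single trailing "(...)" or "[...]" note is cruft.
TRAILING_NOTE = re.compile(r"\s*[\(\[][^\)\]]*[\)\]]\s*$")


def strip_trailing_note(name: str) -> str:
    """Remove one trailing parenthetical such as '(Night One)'."""
    return TRAILING_NOTE.sub("", name).strip()


print(strip_trailing_note("Pixies (Night One)"))  # prints "Pixies"
```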

rattboi commented 7 years ago

Initial research didn't come up with anything. I think we're stuck with heuristics.

Some of my venue scrapers have the beginnings of heuristics to cut cruft. Perhaps a recursive method that tries each cut against Last.fm might get us what we're looking for.
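
A rough sketch of that recursive idea, under some assumptions: the rules in `CUTS` are made-up examples rather than the ones in the existing scrapers, and `lookup` is a stand-in callable that returns true when the name is recognised. In practice it would wrap whatever Last.fm query the tracker already makes.

```python
import re
from typing import Callable, List, Optional, Set

# Hypothetical cut rules; each returns the name with one kind of cruft removed.
CUTS: List[Callable[[str], str]] = [
    # Trailing "(...)" note, e.g. "(Night One)".
    lambda s: re.sub(r"\s*\([^)]*\)\s*$", "", s),
    # Support acts after "with", "w/" or "feat."; keep the headliner.
    lambda s: re.split(r"\s+(?:with|w/|feat\.?)\s+", s, maxsplit=1, flags=re.I)[0],
    # Leading "Presenter: " prefix.
    lambda s: re.sub(r"^[^:]+:\s*", "", s),
]


def find_artist(
    name: str,
    lookup: Callable[[str], bool],
    seen: Optional[Set[str]] = None,
) -> Optional[str]:
    """Return the first variant of `name` that `lookup` accepts, else None."""
    seen = set() if seen is None else seen
    name = name.strip()
    if not name or name in seen:
        return None  # Empty, or already tried via another order of cuts.
    seen.add(name)
    if lookup(name):
        return name
    for cut in CUTS:
        candidate = cut(name)
        if candidate != name:  # Only recurse when the cut removed something.
            found = find_artist(candidate, lookup, seen)
            if found is not None:
                return found
    return None


# Stand-in for the real service: a set of known artist names.
known = {"Pixies"}
print(find_artist("Pixies (Night One)", known.__contains__))  # prints "Pixies"
```

Because every cut either shortens the string or leaves it alone, and `seen` skips names already tried, the recursion terminates. The obvious cost is one lookup per candidate, so caching results per name would probably be worth it against a rate-limited API.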