wuvt / trackman

WUVT's track logging web-app
GNU Affero General Public License v3.0

Grouping for weekly new music chart email should be case insensitive #68

Status: Closed (closed by g-i-g 5 years ago)

g-i-g commented 5 years ago

Currently, the grouping for the weekly email that gets sent out is case-sensitive, so the same release shows up as separate entries:

1 : JEFF The Brotherhood - Magick Songs
...
1 : Jeff The Brotherhood - Magick Songs

Ideally, this should be counted as two plays under the same artist.

The trivial solution, though maybe not the best one, would be to just lowercase everything:

2 : jeff the brotherhood - magick songs

There could be a little extra logic to re-capitalize words once the chart is tallied, if we wanted to make things look pretty. Just an idea.
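A rough sketch of that idea in Python, not trackman's actual code: `tally_case_insensitive` and the `(artist, album)` tuple input are made-up names for illustration. Instead of re-capitalizing afterwards, it groups on a case-folded key and displays whichever spelling was entered most often.

```python
from collections import Counter, defaultdict


def tally_case_insensitive(plays):
    """Group (artist, album) plays without regard to case.

    Hypothetical helper: returns (count, artist, album) rows sorted by
    count, highest first, showing the most frequently entered spelling.
    """
    # Map each case-folded key to a counter of the spellings seen for it.
    spellings = defaultdict(Counter)
    for artist, album in plays:
        key = (artist.casefold(), album.casefold())
        spellings[key][(artist, album)] += 1

    chart = []
    for variants in spellings.values():
        total = sum(variants.values())
        (artist, album), _ = variants.most_common(1)[0]
        chart.append((total, artist, album))

    # Highest play count first; ties ordered alphabetically, ignoring case.
    chart.sort(key=lambda row: (-row[0], row[1].casefold(), row[2].casefold()))
    return chart
```

With this, `JEFF The Brotherhood` and `Jeff The Brotherhood` land in one row whose count is the sum of both, and nobody has to guess how a band wants its name capitalized.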

echarlie commented 5 years ago

One potential concern (at the risk of bikeshedding): this won't solve the problem of DJs entering the wrong Unicode code points (e.g. https://en.wikipedia.org/wiki/22,_A_Million). Those will still need to be handled manually.
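To illustrate where the line falls, here is a minimal sketch using Python's standard `unicodedata` module. `normalize_key` is a hypothetical name, not something in trackman. Unicode normalization merges characters that Unicode itself treats as equivalent (fullwidth letters, decomposed accents), but a look-alike character from another script remains a different key.

```python
import unicodedata


def normalize_key(text):
    """Hypothetical comparison key: NFKC-normalized, case-folded,
    with runs of whitespace collapsed to a single space."""
    folded = unicodedata.normalize("NFKC", text).casefold()
    # Case folding can occasionally denormalize, so normalize once more.
    folded = unicodedata.normalize("NFKC", folded)
    return " ".join(folded.split())


# Fullwidth letters and a decomposed accent fold to the same key...
same_a = normalize_key("\uff2a\uff25\uff26\uff26") == normalize_key("jeff")
same_b = normalize_key("Beyonce\u0301") == normalize_key("Beyonc\u00e9")

# ...but a Cyrillic look-alike (U+0435 in place of Latin "e") does not.
different = normalize_key("J\u0435ff") != normalize_key("Jeff")
```

All three comparisons evaluate to `True`. Entries in the last category, and the stylized symbols on a release like the one linked above, are the ones that would still need a manual fix.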

I also don't know if lowercasing is the correct form of normalization. Since this is for a report to https://naccchart.com/, what is the "correct" form of normalization they want?