ajayyy / DeArrow

Crowdsourcing better titles and thumbnails on YouTube
https://dearrow.ajay.app
GNU General Public License v3.0
1.41k stars 38 forks

Automatic capitalization: issues with unicode #32

Closed KMikeeU closed 1 year ago

KMikeeU commented 1 year ago

Automatic capitalization sometimes generates odd titles when a word starts with a non-ASCII Unicode letter.

Original title: Ski Aggu über Crazy Frog, "Party Sahne", Southstar & Domiziana – Interview mit Aria Nejati

Auto generated: Ski Aggu üBer Crazy Frog, "Party Sahne", Southstar & Domiziana – Interview Mit Aria Nejati

The issue is the capital B in "üBer" (originally "über").

ajayyy commented 1 year ago

Should be fixable by changing the "is letter" check to include Unicode letters. The reason for the check is to exclude quotes, brackets, etc.
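
For illustration, the change to the "is letter" check might look roughly like the sketch below. This is not taken from the DeArrow source: the function names `isAsciiLetter`, `isUnicodeLetter`, and `capitalizeFirstLetter` are hypothetical, and it assumes the capitalization step uppercases the first character that passes the check. The Unicode property escape `\p{L}` needs the `u` flag and an ES2018+ runtime.

```typescript
// Hypothetical helpers for illustration only; not DeArrow's actual code.

// ASCII-only check: "ü" is treated as a non-letter, just like a quote or bracket.
function isAsciiLetter(char: string): boolean {
    return /^[a-zA-Z]$/.test(char);
}

// Unicode-aware check: \p{L} matches a letter from any script.
// Built with the RegExp constructor so the pattern is only parsed at runtime.
const unicodeLetter = new RegExp("^\\p{L}$", "u");
function isUnicodeLetter(char: string): boolean {
    return unicodeLetter.test(char);
}

// Uppercase the first character accepted by `isLetter`, lowercase everything else.
function capitalizeFirstLetter(word: string, isLetter: (char: string) => boolean): string {
    let result = "";
    let capitalized = false;
    // Array.from splits by code point, so characters outside the BMP stay intact.
    for (const char of Array.from(word)) {
        if (!capitalized && isLetter(char)) {
            result += char.toUpperCase();
            capitalized = true;
        } else {
            result += char.toLowerCase();
        }
    }
    return result;
}

console.log(capitalizeFirstLetter("über", isAsciiLetter));     // üBer (the reported bug)
console.log(capitalizeFirstLetter("über", isUnicodeLetter));   // Über
console.log(capitalizeFirstLetter('"party', isUnicodeLetter)); // "Party (quote still skipped)
```

With the Unicode-aware check, leading quotes and brackets are still skipped, but "ü" counts as the first letter, so the "b" is no longer capitalized.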

Also, auto-caps is going to be disabled by default in the next version if the main language is not English, since many expressed concern that the title case rules don't apply in other languages. Will have to determine a good long-term solution for that.