yosi-dediashvili / SubiT

Subtitles downloading, the right (click) way.
http://subit-app.sourceforge.net
GNU General Public License v3.0

Improvements to the normalization module #22

Open yosi-dediashvili opened 10 years ago

yosi-dediashvili commented 10 years ago

We need to add another normalization method that concatenates words from the title. The new method will come right after the 2nd normalization, and for will:

This step will actually return a list of normalized strings and not a single one.
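
A rough sketch of what such a step might look like. The exact concatenation rule is not spelled out above, so this assumes that each variant joins exactly one pair of adjacent words; the function name `concatenate_adjacent_words` is hypothetical and not part of the existing normalization module.

```python
def concatenate_adjacent_words(title):
    """Return title variants in which one pair of adjacent words is joined.

    Assumption: only one adjacent pair is joined per variant. The issue
    does not state the exact rule, so treat this as an illustration.
    """
    words = title.split()
    variants = []
    for i in range(len(words) - 1):
        # Join word i with word i + 1 and keep the rest of the title as-is.
        joined = words[:i] + [words[i] + words[i + 1]] + words[i + 2:]
        variants.append(" ".join(joined))
    return variants


if __name__ == "__main__":
    print(concatenate_adjacent_words("spider man 2"))
```

Because every adjacent pair produces its own variant, the result is naturally a list. A single-word title has no pairs and yields an empty list, so the caller would still need to keep the original string around.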

yosi-dediashvili commented 10 years ago

Another normalization method: Replace any number with its string representation:

We should limit ourselves to normalizing numbers up to 20. Past that, the numbers are converted to two words, so it gets too complicated.
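
A minimal sketch of the number replacement, assuming that only standalone digit sequences are converted and that anything above 20 is left untouched. The names `NUMBER_WORDS` and `numbers_to_words` are made up for illustration and do not exist in the current code.

```python
import re

# Words for 0..20 only; larger numbers need two words and are skipped.
NUMBER_WORDS = [
    "zero", "one", "two", "three", "four", "five", "six", "seven",
    "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
    "fifteen", "sixteen", "seventeen", "eighteen", "nineteen", "twenty",
]


def numbers_to_words(title):
    """Replace standalone numbers from 0 to 20 with their English words."""
    def replace(match):
        value = int(match.group())
        # Leave numbers above 20 exactly as they appear in the title.
        return NUMBER_WORDS[value] if value <= 20 else match.group()

    # \b keeps digits embedded in a word (e.g. "se7en") from matching.
    return re.sub(r"\b\d+\b", replace, title)


if __name__ == "__main__":
    print(numbers_to_words("Ocean's 11"))
    print(numbers_to_words("21 Jump Street"))
```

Whether this step should return only the converted string or both the original and the converted form is left open here.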