mideind / Tokenizer

A tokenizer for Icelandic text
Other
27 stars 6 forks source link

Fix/colon time correct spaces #50

Closed gardarjuto closed 4 weeks ago

gardarjuto commented 4 weeks ago

Improved handling for colon-separated times and durations in correct_spaces. Added tests for this too. Previously it added spaces after all colons resulting in wrong time formats, e.g. "kl. 9:40" -> "kl. 9: 40" and "3:40:15" -> "3: 40: 15".

vthorsteinsson commented 4 weeks ago

Approved!