typst / hypher

Separates words into syllables.
Apache License 2.0
109 stars 9 forks source link

Support for Malayalam (ml) Language - Hyphenation #8

Open gcafepgdi opened 1 year ago

gcafepgdi commented 1 year ago

This is to request the Malayalam language hyphenation support in Typst.

"Malayalam is one of the main languages spoken (by 40 million people) in South India, with a rich cultural and literary history. It is one of the oldest languages in India with classical language status."

Issue

Malayalam text output by Typst has too much spacing between some words.

Sample in Typst

nicola-tesla-ml-typst

Sample Text in Latex with Proper hyphenation and justification

nicola-tesla-ml-latex

References:

laurmaedje commented 1 year ago

We excluded Indic languages because we couldn't find any evidence that hyphenation was actually used with them in practice. Perhaps that wasn't true. Do you have any sources of non-TeX related places where Malayalam is hyphenated?

tilman-schieber commented 9 months ago

If you don't speak the language It is hard to find evidence of hyphenation as hyphens are not commonly used. The words are just broken at end of line without any symbol. Firefox and Chromium have added support for hyphenation in indic languages, see for example here. Here is a blog that documents that indesign, scribus, libreoffice and so on all support hyphenation for indic languages.

The author of this blog also has a repo with all his hyphenation patterns for indic languages: https://github.com/santhoshtr/hyphenation