Closed Rimole closed 9 months ago
But some sources suggest otherwise:
The Vietnamese language is predominantly a monosyllabic language in which the majority of words have one syllable. The language does have some disyllabic and polysyllabic words, however. Often a syllable is repeated with or without variation of either one sound or a tonal aspect (Thompson, 1965).
But of course this is independent of whether hyphenation should ever be used in Vietnamese typography (even for foreign words).
Patterns are declared in TeX distributions, even if empty. For example, in TeXLive the file language.dat
contains the line:
arabic zerohyph.tex
Very likely a similar line should be added for Vietnamese (and perhaps for some other languages).
Where, or to whom should I approach to request to add a pattern to language.dat
? I imagine there is a memory limit to the number of patterns that can be loaded to the format, is it anywhere close to the current situation?
Don’t worry. I’ll do it myself in the next few days.
Great thanks! just to make sure, can you do that for Hebrew as well?
@Udi-Fogiel Sure.
Karl Berry fixed it - thank you!
Fixed what, and how/where?
@car222222 In TeXLive: https://tug.org/pipermail/tex-live/2023-September/049480.html.
Understood.
Is Karl aware that there are some (maybe older) typographic traditions used in Vietnam that do have some limited hyphenation?
Hyphenation is maintained by @reutenauer. I don’t know he’s aware, but in principle Vietnamese isn’t hyphenated, and in case a set of patters is contributed the only change is to replace zerohyph
by the corresponding file.
The real issue here isn’t the warning, which is just noisy, but the fact with babel
languages (in the actual sense, not \language
) often share patterns and a \hyphenation
is applied to all of them (I presume because of the limitations of the original TeX). Furthermore, if babel
doesn’t find a \l@language
, it sets it to \language=0
, ie, English, which is a very bad idea, IMO (too late to change it, I think).
With this change \hyphenation
can now be used to add hyphenation points to Vietnamese without touching other languages.
From https://tex.stackexchange.com/questions/373049/babel-warning-about-the-non-existance-of-vietnamese-hyphenation-patterns
pdflatex
ed this prints in the log file:At StackExchange, egreg suggested to replace
\usepackage[vietnamese]{babel}
byElsewhere I read
\IfPackageLoadedTF{inputenc}{\addto\extrasvietnamese{\inputencoding{utf8}}}{}
. It was suggested that babel should have "empty" hyphenation rules for Vietnamese (becauseAs a native speaker, I can confirm that all native Vietnamese words are monosyllabic. Hyphenation would only be useful for scientific words that are transcribed from another language, but since there are too many conventions to write them (I can recall at least 4), probably one should best do manual hyphenating. McSinyx Dec 2, 2017 at 13:01
).