notofonts / kannada

Noto Kannada
SIL Open Font License 1.1
1 stars 1 forks source link

Noto Sans Kannada: issue with nukta and consonant conjugate #28

Closed MayuraVerma closed 5 years ago

MayuraVerma commented 7 years ago

Title

Noto Sans Kannada: issue with nukta and consonant conjugate

Font

'NotoSansKannada-Regular.ttf'.

Where the font came from, and when

http://kannadafont.blogspot.com/2017/10/case-10-nukta-with-consonant-conjugate.html With Nukta at the end of letters, breaks the letters FAILS-> ಕ್ಕ಼ (single consonant conjugate with nukta at the end) ಕ್ಕ್ರ಼ (double consonant conjugate)

Font Version

Version 1.03 uh

OS name and version

Mac OS High Sierra GM

Application name and version

Google Chrome, Version 61.0.3163.100 (Official Build) (64-bit)

Issue

Summarize the issue briefly -- one paragraph preferred

  1. Steps to reproduce
  2. Observed results
  3. Expected results
  4. Additional information

    Unicode chart, technical specs, shaping info, comparison with non-Noto fonts, comparison with earlier version of the same font (regression cases)

Character data

Please include real character data to illustrate your issue-- Unicode codepoints are helpful. This makes it possible for developers who don't know the language or script to copy/paste the text to reproduce the issue.

Screenshot

If possible, include a screenshot or an image illustrating the issue. Annotations are also helpful.

Tools for reporting bugs

Useful tools for reporting bugs are available at: https://github.com/googlei18n/

Harfbuzz hb-view and hb-shape

These are part of the HarfBuzz distribution and can help isolate if an issue is in the app/OS, shaping engine, or font.

For example:

  hb-view --font-file {path to font} --text-file {path to text file} --output-file '{sample}.png'

Fontview

Fontdiff

MayuraVerma commented 7 years ago

Please review this below page for more combinations

http://kannadafont.blogspot.com/2017/09/case-6c-nukta-combination-with-google.html

marekjez86 commented 7 years ago

added https://github.com/googlei18n/noto-source/blob/master/test/Kannada/kn-971.html as a test

JelleBosmaMT commented 7 years ago

The nukta should be entered right behind the consonant.

untitled 2

See this image showing that it works that way.

With the given examples the Safari browser gives dotted circles, indicating invalid unicode sequences. I don't quite understand why the Chrome browser doesn't.

MayuraVerma commented 7 years ago

@JelleBosmaMT Could you please confirm this in the Unicode document. I can't see reference for Kannada, my bad! But if this is true, then it can be controlled/constrained in the native Kannada keyboard. Please advise.

JelleBosmaMT commented 7 years ago

Not sure what Unicode document you are talking about?

MayuraVerma commented 7 years ago

http://www.unicode.org/versions/Unicode10.0.0/ch12.pdf

JelleBosmaMT commented 7 years ago

I am afraid, I am just a type designer, not one of the authors of the Unicode specification. Chapter 12 gives a lot of information in the section about Devanagari. It would have been better to have a separate section on common principles of the Indic scripts, rather than relying on people reading about Devanagari and then translate the render rules to the other scripts where applicable.

In Kannada we have ಜ and ಫ known to combine with the nukta, and sometimes ಸ. Noto Sans Kannada tries to support all consonants. But I haven't seen conjuncts with 2 or more consonants where the nukta is used for a consonant which isn't the first. So consonants with nukta's are not rendered with their subscript form, but always in full.

MayuraVerma commented 7 years ago

@JelleBosmaMT Thank you. I am not an expert in unicode standard. I am helping the Kannada community with Unicode fonts and keyboard drivers. Legal unicode sequence for Kannada Nukta is not clearly written in Unicode specs. When I asked this question with Kannada linguistics, they felt its better to leave it outside the consonant conjugate and diacritics. This way when user search for "ಕ್ನ್ಕೇ" they get both "ಕ್ನ್ಕೇ" and "ಕ್ನ್ಕೇ಼". But if the Nukta sits inside "ಕ಼್ನ್ಕೇ" the cluster, then we can't find the text. Nirmala font (windows 7/10 UI) supports both with few exceptions.

I clearly don't know which one is correct form. Is there way to get clarity on this from unicode consortium. If we have to follow the Devanagiri method, thats fine. But somewhere this needs to be documented.

It could outside the font issue, but related! Please advise.

roozbehp commented 7 years ago

The easiest way for asking Unicode to document something is contacting them using the contact form, here: https://www.unicode.org/reporting.html

MayuraVerma commented 7 years ago

@roozbehp @JelleBosmaMT Thank you.

sridatta1 commented 7 years ago

I was about to file issue on nukta on subscripts I Kannada nukta is generally used on for f ಫ಼ z ಜ಼ in English and Perso-arabic words. In words like pizza ಪಿಜ಼್ಜ಼ಾ gulf ಗಲ್ಫ಼್ waqf ವಕ್ಫ಼್ In these words consonant with nukta is a non-initial consonant of the cluster. Subscript forms with nukta should also be supported for f z, though these aren't commonly seen in other fonts thus these words are also written without nukta.

MayuraVerma commented 7 years ago

@sridatta1 If in pizza, we need nukta both base and conjugate "ಜ" in ಪಿಜ್ಜಾ. It could be very complicated. We need to approach Unicode consortium to get this written in the manual.

MayuraVerma commented 7 years ago

@sridatta1 if you see possibility of having nukta in the ottakshara, then we need to specify in the Unicode standard that the nukta needs to sit right next to the consonant. It is not clear in the standard.

sridatta1 commented 7 years ago

The common principles of Indic scripts are specified in Devanagari Chapter See R9 pg 457 of chapter 12.1 of core specification Subscript/ottu forms with nukta should be rendered as below mainly for ಫ಼ ಜ಼ (Murty Kannada font) nukta

MayuraVerma commented 6 years ago

@sridatta1 Thank you. Most of the Kannada fonts would fail with these combinations.

Can you please add if more words to justify the change with other fonts.

Also, do you know how to approach the Microsoft to get this rectified in Tunga and Nirmala.

JelleBosmaMT commented 6 years ago

I had to think about this.

There are two reasons why Noto Sans Kannada doesn't have subscript forms with nukta.

So now in 2017 when I try adding subscript Za and Fa and try them, the document version of the regular font looks like this in the Chrome, Safari (El Capitan) and Edge Browsers:

document nukta

Only Chrome treats conjuncts with nukta the same as those without. The Microsoft shaper doesn't re-order the vowel sign before the subscript as it would normally do. It will require a work-around in the font.

On the Mac the fonts are displayed heavier than they really are. But even then I think I should move the nukta a bit further away from the subscript Ja and Pha, in order to improve legibility at small sizes.

But that will be difficult in the UI version of the fonts, where there is no room to put anything below the existing subscripts. So these have to be moved up to make room and then it looks like this: ui nukta

The plan is to add some additional characters to Noto Sans Kannada, such as the new candrabindu's, and to add additional weights and widths: from thin to black and from normal width to extra condensed. Actually I send Google a design proposal last week. I will modify this proposal to add support for subscript Fa and Za and wait to hear if they and their expert reviewers agree. I am not sure if it makes sense to add support for any combination of subscript with nukta. For the UI version that seems hopeless, as it would require 3 separate glyphs for each combination. A lot of work for combinations that may never be used, may not be legible when they are used, and I have less time to spend on characters that are used. For the normal document version, it might be possible to add a nukta that is displayed below the other subscript consonants. But I am not sure if it a good idea. It may be correct in following the render rules of Kannada, but I am not sure if it is user friendly. I can imagine it will be more often than not a misspelling that needs to be corrected, and then using the full form the user can see there is a nukta, rather than a subscript with a blur?

JelleBosmaMT commented 6 years ago

Huh, I just see that in the samples above the syllable "Z-Jaa" is different in Chrome, because the Ja subscript isn't moved below the vowel sign, as it does in Edge and Safara. So Google and Apple and Microsoft have a bug rendering nukta, it is just three different bugs!! 😢

kalapi commented 6 years ago

I can confirm that this has been incorporated in the latest build of Noto Serif Kannada without the need of below-base nuke variant glyphs. Additionally, as seen in the last four clusters, the below base consonants shift rightward when the base consonant has a nukta so that they don't collide.

screen shot 2017-10-12 at 6 48 55 pm screen shot 2017-10-12 at 6 48 43 pm screen shot 2017-10-12 at 6 48 05 pm screen shot 2017-10-12 at 6 48 29 pm

Tested on the following platforms:

OSX 10.11.6:

Windows 8.1:

MayuraVerma commented 6 years ago

is it possible to get a copy of the new build for testing? Also, please check the same with libreoffice. Although libreoffice uses harfbuzz, there seems to be some issue with complex text.

kalapi commented 6 years ago

@MayuraVerma

Unfortunately i'm unable to provide a binary at this time, but it should be available soon. I'm also reviewing all Kannada specific issues in this repo and testing against the latest files.

With regards to LibreOffice:

This is Version: 5.3.6.1 on OSX 10.11.6 with Kannada selected under Preferences>Language Settings>Languages>Complex text layout (CTL)

screen shot 2017-10-16 at 2 32 23 pm

JelleBosmaMT commented 6 years ago

@kalapi (off topic) I assume the sample showing thin to black are not at the same point size, with the thin at the largest. But even then the virama of the Thin weight looks too large. Mostly the top half being very large and more in a constructed style as the intent of the design (It was me who made the original design proposal in 2014, so I am pardoned to have an opinion).

kalapi commented 6 years ago

@JelleBosmaMT Sorry for the late reply. I've now fixed this in the latest version. Thank you for pointing out this issue. While testing the design my sample texts did not contain words which ended with a viral which is why I missed this. I have now included this in my standard test.

Example 1: screen shot 2017-11-09 at 11 03 29 pm

Example 2: screen shot 2017-11-09 at 10 58 55 pm

MayuraVerma commented 6 years ago

In mac OS beta "10.13.2 Beta (17C79a)”, nukta is supported for subscript.

I have tested the Noto Sans Kannada. It is rendering as expected.

Noto Sans Kannada in Pages on mac OS public beta 4. screen shot 2017-11-19 at 7 30 01 pm

I request please consider supporting nukta for all consonants in subscript form as its done in "Murty" font.

puneethk4 commented 6 years ago

Not sure if this is the right place to ask this query. Wanted to know how to write pizza, gulf, waqf in kannada as in above post.

(Please let me know the exact keystrokes, I am using Inscript Keyboard Layout)

I am already having Noto Sans Kannada in Pages on latest mac OS. Also Libre Office 5.4.5 with CTL selected.

Max I could write was only as below using ZWNJ (zero width non joiner character)

nukta_issue

MayuraVerma commented 6 years ago

@puneethk4 Noto Sans Kannada updated test version is posted here https://github.com/googlei18n/noto-fonts/issues/973#issuecomment-336591932 Only this version supports the nukta in vattu. Or you can use Murty-Kannada font.

In Pages, Numbers and Keynotes: Please check the mac OS version, it should be above 10.13.2 for apple applications to render this text correctly. Apple applications uses Apple Core-Text shaping engine. Check the font and mac OS version.

In LibreOffice, you need version above 5.4.2, which you already have. Libreoffice should render correctly independent of mac OS version though, because it uses harfbuzz shaping engine. Check the font you are using.

FYI: Font and shaping engine both must support the feature to render OpenType font.

puneethk4 commented 6 years ago

@MayuraVerma Thank You. It worked in Pages/Murthy Font. Murthy Font download website was down over the weekend, hence couldn't use it earlier and was only checking with officially available Noto Sans which wasn't working.

Any Idea on when above mentioned Noto Sans Kannada will be officially available?

~Thanks, Puneeth

sridatta1 commented 6 years ago

Also check the latest alpha fonts of Noto Serif Kannada, they might support nukta with subscripts https://github.com/googlei18n/noto-fonts-alpha/tree/master/from-pipeline/unhinted/otf/serif/NotoSerifKannada

sequence- Ja+ nukta+ Virama+Ja+nukta + vowel sign AA and similarly others

On Tue 20 Mar, 2018 7:54 pm puneethk4, notifications@github.com wrote:

@MayuraVerma https://github.com/mayuraverma Thank You. It worked in Pages/Murthy Font. Murthy Font download website was down over the weekend, hence couldn't use it earlier and was only checking with officially available Noto Sans which wasn't working.

Any Idea on when above mentioned Noto Sans Kannada will be officially available?

~Thanks, Puneeth

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/googlei18n/noto-fonts/issues/971#issuecomment-374616321, or mute the thread https://github.com/notifications/unsubscribe-auth/AWL56_rTbPQhee8HDE0FYCgLK5T12Ts0ks5tgREUgaJpZM4PqFCE .