Closed Tarobish closed 8 years ago
Where is this coming from (a pdf loaded into indesign is problematic, pdf is not made for this kind of thing)? Also we don't do specific things for Uyghur and just a tiny bit for Urdu.
Can you reproduce this in the live testing?
I will update the Generated documents in a minute.
Here is the sample from "persian-arabic2_Katibeh-Regular_18.pdf” also still i can see the extension parts here im not able to type this :(
On Feb 4, 2016, at 11:13 AM, Lasse Fister notifications@github.com wrote:
Where is this coming from (a pdf loaded into indesign is problematic, pdf is not made for this kind of thing)? Also we don't do specific things for Uyghur and just a tiny bit for Urdu.
Can you reproduce this in the live testing?
I will update the Generated documents in a minute.
— Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-180007410.
copy and paste it from here: https://github.com/Tarobish/Katibeh/blob/master/Document-Sources/persian-arabic2.txt
Uh, and pic is missing. Which page is it on?
I think we have to leave this issue let it be open i can see the same problem from our Source
On Feb 4, 2016, at 1:09 PM, Lasse Fister notifications@github.com wrote:
Uh, and pic is missing. Which page is it on?
— Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-180050023.
it was the last page from "persian-arabic2_Katibeh-Regular_18.pdf”
On Feb 4, 2016, at 1:09 PM, Lasse Fister notifications@github.com wrote:
Uh, and pic is missing. Which page is it on?
— Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-180050023.
Yes! Exactly
On Feb 4, 2016, at 1:39 PM, Lasse Fister notifications@github.com wrote:
Is this about right: http://tarobish.github.io/Jomhuria/#live?eyJ2YWx1ZSI6Itis24fardqv2Ygg2KrYp9qtINiz24fZhNin2YTZidiz2Ykg2K/blduL2LHZidqv25Ug2YPbldmE2q/bldmG2K/blSAoNjE4IOKAkyA5MDcpINiz24fYrNin24vYpyDZitin2LHZidiq2YnZviDYqNuV2LHar9uV2YYg2KrZiNmC2YLbh9iyINmK24jYsduI2LQg2YXbh9iy2YnZg9inINiz25DYs9iq2YnZhdmJ2LPZiSDYptin2LPYp9iz2YnYr9inIDY0MCDigJMg2YrZidmE2YkgMTAg2YrbiNix24jYtCDZhduH2LLZidmD2Kcg2 LPbkNiz2KrZidmF2YnYs9mJ2YbZiSDYqNuV2LHZvtinINmC2YnZhNiv2YkuINio24fZhNin2LEg2KrbhtuL25XZhtiv2YnZg9mJ2obblSA6IDEuINmD24fahtin2LEg2YXbh9iy2YnZg9mJ2LPZiSDYjCAyLiDZgtuV2LTZgtuV2LEg2YXbh9iy2YnZg9mJ2LPZiSDYjCAiLCJiaWRpIjoicnRsIiwibGFuZyI6ImFyIn0= http://tarobish.github.io/Jomhuria/#live?eyJ2YWx1ZSI6Itis24fardqv2Ygg2KrYp9qtINiz24fZhNin2YTZidiz2Ykg2K/blduL2LHZidqv25Ug2YPbldmE2q/bldmG2K/blSAoNjE4IOKAkyA5MDcpINiz24fYrNin24vYpyDZitin2LHZidiq2YnZviDYqNuV2LHar9uV2YYg2KrZiNmC2YLbh9iyINmK24jYsduI2LQg2YXbh9iy2YnZg9inINiz25DYs9iq2YnZhdmJ2LPZiSDYptin2LPYp9iz2YnYr9inIDY0MCDigJMg2YrZidmE2YkgMTAg2YrbiNix24jYtCDZhduH2LLZidmD2Kcg2LPbkNiz2KrZidmF2YnYs9mJ2YbZiSDYqNuV2LHZvtinINmC2YnZhNiv2YkuINio24fZhNin2LEg2KrbhtuL25XZhtiv2YnZg9mJ2obblSA6IDEuINmD24fahtin2LEg2YXbh9iy2YnZg9mJ2LPZiSDYjCAyLiDZgtuV2LTZgtuV2LEg2YXbh9iy2YnZg9mJ2LPZiSDYjCAiLCJiaWRpIjoicnRsIiwibGFuZyI6ImFyIn0= https://cloud.githubusercontent.com/assets/393132/12830529/0fc4785c-cb90-11e5-8599-5a3fe92b3dbc.png — Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-180063499.
Can you please make me a live testing page with broken words? One line per word. It would really help me to investigate this further. Also, you say "Dotless Y" is broken, but I see several broken letters. I think I should know all broken letters eventually, to make sure I repair them all...
http://tarobish.github.io/Katibeh/html/live-testing.html#?eJwljj0OgkAQha9ithazbAgitTfQytgQRUOCaMgmGo0VIFltbDgDxsYgveeYcS/jLHbzvZ/MO7HVNpHMZ3uHc9ZnMjzIcZSSkMqYONxMomNIKJydJI6DZE1E14J6YVeFVpeYQ405KmhRzZMevHSlb9Cg+tJFAma6wtwAXqCT8Aq1CcATFX5M56EraP4RsgtdwpusDGpjtvoObZct/l9owjKQ3TRuuxYXFvem9sgXrm97g6EtZuz8A/Pxa3k= there are three characters one it has to be "uniFBE8" and "uniFBE9" (alefMaksura-ar.medi and alefMaksura-ar.init) another one its "uniFEEA" or "uniFBA7" (heh-ar.fina or hehgoal-ar.fina)
سۇلالىسى دەۋرىگە كەلگەندە يارىتىپ بەرگەن مۇزىكا سېستىمىسى
one it has to be "uniFBE8" and "uniFBE9" (alefMaksura-ar.medi and alefMaksura-ar.init)
These seem to be missing from the font. I can't find them.
another one its "uniFEEA" or "uniFBA7" (heh-ar.fina or hehgoal-ar.fina)
These exist in the font
From your example, the letter marked in pink here:
It is actually encoded as uni06D5 ARABIC LETTER AE (Uighur, Kazakh, Kirghiz)
I will replace it just in fina
with: heh-ar.fina (uniFEEA)? On an article at wikipedia about Kazakh I found some backup for that. I was wondering if it also uses the initial and medial forms of Heh, but it doesn't, as it seems.
I think that the hehgoal-ar.fina (uniFBA7
) character that you mention is a red herring.
The isolated form of uniFBA7
is uni06C1
06C1 ARABIC LETTER HEH GOAL. We have substitutions for it in init, medi and fina to uniFBA8
, uniFBA9
and uniFBA7
So I think there's no indication that uni06C1
is broken
lets try it then :) but we do have them, here they are from UFO
Oh I see, I just pulled from github and there they are
You added them today?
Great :) lets finish it
On Feb 8, 2016, at 7:38 PM, Lasse Fister notifications@github.com wrote:
Oh I see, I just pulled from github and there they are
— Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-181691423.
Yesterday :) i thought it may be something wrong with characters :) a bit research and then fire :)
On Feb 8, 2016, at 7:38 PM, Lasse Fister notifications@github.com wrote:
You added them today?
— Reply to this email directly or view it on GitHub https://github.com/Tarobish/Katibeh/issues/79#issuecomment-181691690.
Ah, so, unfortunately we do can't use the features generated by glyphs. So Our feature files need to be hand-updated when new glyphs are added. That's maybe also why uniFEEA is not coming up.
Bad thing is when we miss making these updates.
OH :( should i do this and how?
What exactly? I am adding the features for this issue at the very moment. But if you add any new glyphs that need features, you should open a new issue to inform me.
If you added new glyphs in the past, it would be cool if you know which ones, so we can update our features.
The features for "uniFBE8" and "uniFBE9" are not created by glyphs anyways, I just checked.
i mean adding the features which you are doing it :) I didn't know that, actually i know what glyphs please check these uni08A8, uni08A8.fina, uni08A8.init, uni08A8.medi
First things first:
Awesome :) just perfect
please check these uni08A8, uni08A8.fina, uni08A8.init, uni08A8.medi
They are also not exported by glyphs. It seems also that we should decompose them, so that they will trigger some of our ligatures.
No don't do that let them be the way they are
ok.
What's up with this one from your initial description?
Oh, totally forgot that, let me check it
Please just replace it with initial form, we do have the characters
Which name/unicode is the isolated form and which is the final form.
It has to be "uni0679.init" and "uni0679.medi" if I'm not mistaken (I'm not sure) and here is the iso and final "uni06BB" and "uni06BB.fina" but it risky i couldn't even find the init and medi from "Noto"
06BB ARABIC LETTER RNOON • Sindhi
There's also a letter 0679 ARABIC LETTER TTEH • Urdu
That looks alike https://en.wikipedia.org/wiki/%E1%B9%ACe
Wikipedia says:
Some layout engines do not properly generate medial and final forms (which should look like ـٹـ and ﭨ) and will render the isolate form ٹ, without joining.
See the wikipedia page for the example letters and some more infos.
If it's uni0679
it should be replaced by uni0679.medi
and uni0679.fina
and uni0679.init
. But for the replacements I can also use whatever is the right glyph, like uni06BB.fina
The same for uni06BB
, I can replace it with what you suggest if you want me to.
06BB has a quite good wikipedia article https://de.wikipedia.org/wiki/%E1%B9%86un
it says init should be FBA2, medi should be FBA3 and fina should be FBA1
Uh, that's the German wikipedia :-D didn't notice.
Here is the stuff for 0679 TTEH:
FB67 ARABIC LETTER TTEH FINAL FORM
≈
FB68 ARABIC LETTER TTEH INITIAL FORM
≈
FB69 ARABIC LETTER TTEH MEDIAL FORM
≈
Haha :))) exactly, it was my problem because i found it next to ARABIC LETTER TTEH :) while i was looking for ARABIC LETTER RNOON :)
OK :) if you think its right to replace them, do this please
Here's RNOON from the unicode PDF
ok can we wrap this up. what should be replaced by what?
in the picture we have the isol form it has to be replace by the inti so uni06BB replace by uniFB68 later ill check the pdf to see all its right
Ok, so it's all about TTEH, was not all clear to me :-)
To be honest to me too :) i don't get it, how two letters with the same shape, making different sound :) its like to have letter R and N with the same shapes :)))))
Yeah, it's kind of stupid. Maybe they don't even make another sound, but that's not how unicode. There is not really a system for unicode encodings I think, they just try to do their best.
SO we have "uni0679.init" and "uni0679.medi". Do you also have the fina form for it? On wikipedia it says TTHE is derived from Ta (062A) So I think we could also make it by decomposition: uni066E.fina + uni0615
we already have it by "uni0679" and "uniFB67" for final and isol
I just found out. I'll use uniFB67 uniFB68 uniFB69 and I guess you don't want to risk having ligatures of these?
Yes :) we don't want to risk it
Something its broken in letter (Dotless Y
eh, Heh.fin and rnoon-ar.fina