Princeton-CDH / geniza

version 4.x of the Princeton Geniza Project
https://geniza.princeton.edu
Apache License 2.0
11 stars 2 forks source link

"Combining dot above" rendered incorrectly in transcriptions #670

Open blms opened 2 years ago

blms commented 2 years ago

testing notes

Review the second transcription (by Goitein and Friedman) on PGPID 9083 linked below and check if the combining dot renders correctly. Recommend testing in multiple browsers, since before we saw different behavior on this with different browsers.


Unicode Character “◌̇” (U+0307) Combining Dot Above is missing from Frank Ruhl 1924 MF, so it does not render correctly in transcriptions.

Example: https://geniza.princeton.edu/en/documents/9083/

Screen Shot 2022-02-28 at 1 52 26 PM
blms commented 2 years ago

Additional missing glyphs in Arabic: https://geniza.princeton.edu/en/documents/9140/

blms commented 2 years ago

Seems to behave differently on different browsers, and is not resolved by Unicode normalization. Another example: https://geniza.princeton.edu/en/documents/4767/

May be related to fallback font behavior. Noting some different results of testing with browser tools here:

Default (Frank Ruhl): Screen Shot 2022-09-29 at 10 38 51 AM

font-family: serif or font-family: Georgia (what I thought our fallback was already!) Screen Shot 2022-09-29 at 10 38 18 AM

font-family: 'Greta Sans Hebrew Regular' Screen Shot 2022-09-29 at 10 38 41 AM

font-family: Times New Roman Screen Shot 2022-09-29 at 10 38 29 AM

All tested on Chrome.

kseniaryzhova commented 2 years ago

@blms @rlskoeser combining dot is not rendering for me (PC, Chrome) image

kseniaryzhova commented 2 years ago

@rlskoeser @blms It works partially (same transcription, test site) image

rlskoeser commented 2 years ago

Maybe this is a Windows-specific rendering problem...

Here's current versions of the Goitein + Friedman transcription on PGPID 9083:

production Screen Shot 2022-10-24 at 3 38 47 PM

test Screen Shot 2022-10-24 at 3 39 27 PM

richmanrachel commented 9 months ago

@kseniaryzhova - is this still an issue for you on a PC? The Ju-Ar ones look fine to me on my Mac, but the Arabic one (https://geniza.princeton.edu/en/documents/9140/) is still showing a weird box.

kseniaryzhova commented 9 months ago

@richmanrachel @blms For me it's uneven, sometimes the combining dot works and sometimes it doesn't (this is from PGPID 9083)

image

I do get the empty squares for the Arabic script.

blms commented 9 months ago

@richmanrachel It's interesting; looking more closely at PGPID 9140, I think that might be an error in the transcription. If it is meant to be a combining dot above, it's using a different Unicode symbol for that than the others:  U+F0C2 rather than the typical ◌̇ U+0307 combining dot above. It's the only transcription on the site that uses that character.

@kseniaryzhova Sounds like it's still just a PC issue with the correct (U+0307) character. Maybe it will resolve itself with later Windows versions of browsers…

richmanrachel commented 9 months ago

@blms - Yes you are right. Here's the transcription editor view:

image

@kseniaryzhova - should I just ask Yusuf or Amel to fix this one? I'm not confident enough in the Arabic paleography to do it myself, but most of the lines I looked at it seems like the symbol is just a mistake and not even representing a letter. In the last line some double slashes might be needed for above the line insertion