sanskrit-lexicon / GreekInSanskrit

Provide missing Greek text for the Cologne digitizations of Sanskrit dictionaries.
0 stars 1 forks source link

PWG Greek section headings v. Greek text #13

Open funderburkjim opened 9 years ago

funderburkjim commented 9 years ago

We have mentioned this before, but in processing the Greek text for PWG I encountered this question again.

Here's the situation:

I'm not sure how to resolve the issue.

When I think about a resolution, the first thing that comes to mind is another, superficially unrelated, aspect of the PWG digitization that makes working with it quite awkward. This other issue is that the records of the digitization are often very long maybe many thousands of characters in length.
The PWG digitization was done before the idea to correspond lines of the text digitization and lines of the printed text.

So, this issue will remain open until and when a way to address it becomes clearer.

gasyoun commented 9 years ago

It's big, but not crucial and is a RegEx issue, not of knowing Greek. So let it be there documented as it is, it's part of the inner cleaning in the 10 year list to-come. It's part of the perfection plan. It's too high above and the practical value is minimal.

Andhrabharati commented 2 years ago

The greek headings in PWG were resolved long back (when was it exactly?!), and the issue can be closed now.

funderburkjim commented 2 years ago

Let's reopen this pwg greek section headings question. I think we need to examine single-letter greek text. For example (page5-0710)

OLD
vgl. 1, {%b%},
<lang n="greek">α</lang>): {#vANmAtrAmaDura#} {%nur in Worten%}
NEW
vgl. 1, {%b%},
α): {#vANmAtrAmaDura#} {%nur in Worten%}

BUT from Page1-0118, this probably requires no change (not a section heading):

<lang n="greek">α</lang> besprochene {#aTa vA#} ist das {#aTa vA pakzAntare#}

There aren't many cases (only 5 alphas, for instance). They should be examined individually, and changed where needed. Then we can close the issue.

Andhrabharati commented 2 years ago

This other issue is that the records of the digitization are often very long maybe many thousands of characters in length.\nThe PWG digitization was done before the idea to correspond lines of the text digitization\nand lines of the printed text.

So, this issue will remain open until and when a way to address it becomes clearer.

I think this point should keep the issue open, if not for the Greek letters.

gasyoun commented 9 months ago

I think we need to examine single-letter greek text.

@AnnaRybakovaT it seems there are some Greek letter left without care )

Andhrabharati commented 9 months ago

I have done this at my end already; waiting for Jim to come to PWG from pwk.

gasyoun commented 9 months ago

waiting for Jim to come

Is there a full list of where are you waiting for him? Was not aware that Greek was in that list as well