Edelweiss / beehive

Compile data for Berichtigungsliste
http://www.uni-heidelberg.de/fakultaeten/philosophie/zaw/papy/index.html
MIT License
1 stars 0 forks source link

Coptic words, names often missing. #53

Open jcowey opened 10 months ago

jcowey commented 10 months ago

It appears to be the case that many (almost all?) Coptic words letters (in other words the Coptic font used) have not made it out of the CD into Alles.html in properly readable form.

http://beehive.zaw.uni-heidelberg.de/correction/show/76343

we have "δ(ιὰ) | [κλ(ήρου?) Παρό]ου. Γεῶργε → δ(ιὰ) | [κλ(ηρονόμου) Παρό]ου Γεώργε (= Γεωργίου), ,,by Teshnè, heiress of Parous, son of George", J. Shelton, Enchoria 17 (1990), S. 113-114."

it should be "δ(ιὰ) ⲧⲉϣⲛⲏ | [κλ(ήρου?) Παρό]ου. Γεῶργε → δ(ιὰ) ⲧⲉϣⲛⲏ | [κλ(ηρονόμου) Παρό]ου Γεώργε (= Γεωργίου), ,,by Teshnè, heiress of Parous, son of George", J. Shelton, Enchoria 17 (1990), S. 113-114."

We might be able to harvest these using Alles.html

jcowey commented 10 months ago
<TD class="tekst">

<span class="grieks">
&#x03B4;(&#x03B9;&#x1F70;)]
</span>

<IMG SRC="images/BL9-302_5.jpg" class="inlinefig" />

<span class="grieks">
&#x03BA;&#x03BB;(&#x1F75;&#x03C1;&#x03BF;&#x03C5;?) &#x03A0;&#x03B1;&#x03C1;&#x1F79;&#x03BF;&#x03C5;
</span>

&rarr;

<span class="grieks">
&#x03B4;(&#x03B9;&#x1F70;)]
</span>

<IMG SRC="images/BL9-302_5.jpg" class="inlinefig" />

<span class="grieks">
&#x03BA;&#x03BB;(&#x03B7;&#x03C1;&#x03BF;&#x03BD;&#x1F79;&#x03BC;&#x03BF;&#x03C5;) &#x03A0;&#x03B1;&#x03C1;&#x1F79;&#x03BF;&#x03C5;
</span>
, „by Teshnè, heiress of Parous”,  
<span class="italic">
J. Shelton
</span>
, Enchoria 17 (1990),  S. 113-114.
</TD>
<TD class="vol">BL9</TD>
<TD class="page">302</TD>
</TR>

Search for class="inlinefig",

return by moving up to <TD class="tekst"> then in a row in a googlespreadsheet please as follows

It would, I envisage look like this:

  | δ(ιὰ) \| [ | IMG SRC="images/BL9-302_5.jpg" class="inlinefig" | κλ(ήρου?) Παρό]ου. Γεῶργε | → | δ(ιὰ) \| [ | IMG SRC="images/BL9-302_5.jpg" class="inlinefig" | κλ(ηρονόμου) Παρό]ου Γεώργε (= Γεωργίου), ,,by Teshnè, heiress of Parous, son of George", J. Shelton, Enchoria 17 (1990), S. 113-114. | BL9 | 302 |   -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | --
jcowey commented 10 months ago

In many cases the regex / script will not have to deal with as many as nine cells

jcowey commented 9 months ago

This sheet shows 128 unique images.

This spreadsheet has a list of all 1262 instances of these 128 images.

The Good (as opposed to the Bad and the Ugly)

Edelweiss commented 9 months ago

That’s how it’s supposed to look like:

Bildschirmfoto 2024-02-09 um 09 16 34

If it doesn’t appear like this, (in Firefox) click »Ansicht« and »Textkodierung reparieren«

jcowey commented 9 months ago

The Good (as opposed to the Bad and the Ugly) is very helpful. Thank you.