Open jcowey opened 10 months ago
<TD class="tekst">
<span class="grieks">
δ(ιὰ)]
</span>
<IMG SRC="images/BL9-302_5.jpg" class="inlinefig" />
<span class="grieks">
κλ(ήρου?) Παρόου
</span>
→
<span class="grieks">
δ(ιὰ)]
</span>
<IMG SRC="images/BL9-302_5.jpg" class="inlinefig" />
<span class="grieks">
κλ(ηρονόμου) Παρόου
</span>
, „by Teshnè, heiress of Parous”,
<span class="italic">
J. Shelton
</span>
, Enchoria 17 (1990), S. 113-114.
</TD>
<TD class="vol">BL9</TD>
<TD class="page">302</TD>
</TR>
Search for class="inlinefig", return by moving up to <TD class="tekst">, then put the result in a row in a Google spreadsheet, please, as follows (one cell per line):
content of <span class="grieks"> (returned as unicode Greek characters, please)
<IMG SRC="images/[^"]+" class="inlinefig" />
content of <span class="grieks"> (returned as unicode Greek characters, please)
→
content of <span class="grieks"> (returned as unicode Greek characters, please)
<IMG SRC="images/[^"]+" class="inlinefig" />
content of <span class="grieks"> (returned as unicode Greek characters, please)
</TD>
<TD class="vol">
<TD class="page">
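The extraction requested here could be sketched in Python roughly as below. This is only a minimal sketch, not a tested harvesting script: it assumes every row in Alles.html has the same shape as the BL9 302 example in this issue, that the file has already been read as Unicode text, and the names `rows_with_inline_images` and `to_csv` are made up for illustration.

```python
import csv
import html
import io
import re

# Assumed row shape, copied from the BL9 302 example: a "tekst" cell
# followed by a "vol" cell and a "page" cell.
ROW_RE = re.compile(
    r'<TD class="tekst">(?P<tekst>.*?)</TD>\s*'
    r'<TD class="vol">(?P<vol>.*?)</TD>\s*'
    r'<TD class="page">(?P<page>.*?)</TD>',
    re.DOTALL | re.IGNORECASE,
)

# One token per spreadsheet cell: a Greek span, an inline image, or the arrow.
TOKEN_RE = re.compile(
    r'<span class="grieks">(?P<greek>.*?)</span>'
    r'|<IMG SRC="(?P<img>images/[^"]+)" class="inlinefig" />'
    r'|(?P<arrow>→)',
    re.DOTALL | re.IGNORECASE,
)


def rows_with_inline_images(page):
    """Yield one list of cells per "tekst" cell that contains an inline image."""
    for row in ROW_RE.finditer(page):
        tekst = row.group("tekst")
        if 'class="inlinefig"' not in tekst:
            continue  # entries without an inline image are not wanted
        cells = []
        for token in TOKEN_RE.finditer(tekst):
            if token.group("greek") is not None:
                # Decode HTML entities and trim the surrounding line breaks.
                cells.append(html.unescape(token.group("greek")).strip())
            elif token.group("img") is not None:
                cells.append(token.group("img"))
            else:
                cells.append("→")
        cells += [row.group("vol").strip(), row.group("page").strip()]
        yield cells


def to_csv(rows):
    """Serialise the rows as CSV text that a spreadsheet can import."""
    buffer = io.StringIO()
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue()
```

For the example entry this gives nine cells (Greek, image, Greek, →, Greek, image, Greek, volume, page). Entries with fewer spans or images simply give shorter rows. Text outside the Greek spans, such as the translation and the reference, is ignored by this sketch.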
It would, I envisage, look like this:
In many cases the regex / script will not have to deal with as many as nine cells.
This sheet shows 128 unique images.
This spreadsheet has a list of all 1262 instances of these 128 images.
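Counts like these can be checked with a few lines of Python. The following is a sketch only: the function name `count_inline_images` is hypothetical, and it assumes every inline image is written exactly in the form `<IMG SRC="images/..." class="inlinefig" />`, as in the example above.

```python
import re
from collections import Counter

# Assumed tag form, taken from the example entry in this issue.
IMG_RE = re.compile(r'<IMG SRC="(images/[^"]+)" class="inlinefig" />')


def count_inline_images(page):
    """Map each inline image path to the number of times it occurs."""
    return Counter(IMG_RE.findall(page))
```

With `counts = count_inline_images(text)`, `len(counts)` is the number of unique images and `sum(counts.values())` is the number of instances.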
The Good (as opposed to the Bad and the Ugly)
That’s how it’s supposed to look:
If it doesn’t appear like this, in Firefox click »Ansicht« (View) and then »Textkodierung reparieren« (Repair Text Encoding).
It appears that many (almost all?) Coptic words and letters (in other words, anything set in the Coptic font) have not made it from the CD into Alles.html in properly readable form.
http://beehive.zaw.uni-heidelberg.de/correction/show/76343
we have "δ(ιὰ) | [κλ(ήρου?) Παρό]ου. Γεῶργε → δ(ιὰ) | [κλ(ηρονόμου) Παρό]ου Γεώργε (= Γεωργίου), ,,by Teshnè, heiress of Parous, son of George", J. Shelton, Enchoria 17 (1990), S. 113-114."
it should be "δ(ιὰ) ⲧⲉϣⲛⲏ | [κλ(ήρου?) Παρό]ου. Γεῶργε → δ(ιὰ) ⲧⲉϣⲛⲏ | [κλ(ηρονόμου) Παρό]ου Γεώργε (= Γεωργίου), ,,by Teshnè, heiress of Parous, son of George", J. Shelton, Enchoria 17 (1990), S. 113-114."
We might be able to harvest these using Alles.html.
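Once a Unicode transcription is known for an image, putting it back into the text could be done mechanically. The sketch below is an illustration under assumptions, not an agreed workflow: the function name `replace_inline_images` and the lookup table are hypothetical, and the single entry in the usage note is only inferred from comparing the two versions of the BL9 302 text quoted above.

```python
import re

# Assumed tag form, taken from the example entry in this issue.
IMG_RE = re.compile(r'<IMG SRC="(images/[^"]+)" class="inlinefig" />')


def replace_inline_images(fragment, transcriptions):
    """Replace each inline image that has a known transcription with its text.

    Images missing from the lookup table are left untouched, so they can
    still be found and transcribed later.
    """
    def substitute(match):
        return transcriptions.get(match.group(1), match.group(0))

    return IMG_RE.sub(substitute, fragment)
```

For example, with the (inferred) table `{"images/BL9-302_5.jpg": "ⲧⲉϣⲛⲏ"}`, the fragment `δ(ιὰ) <IMG SRC="images/BL9-302_5.jpg" class="inlinefig" /> |` becomes `δ(ιὰ) ⲧⲉϣⲛⲏ |`.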