CopticScriptorium / corpora

Public repository for Coptic SCRIPTORIUM Corpora Releases

Coptic OT for fall 2018 publication #21

Closed: ctschroeder closed 5 years ago

ctschroeder commented 6 years ago

This is the publication thread for the new Coptic OT "base text" donated by the Coptic OT project in Goettingen. @ctschroeder & @amir-zeldes are coordinating on the following items:

ctschroeder commented 6 years ago

The csv file mapping the LXX translation to CoptOT is in the bible-dev repo. Please see the commit note. Thanks!

ctschroeder commented 5 years ago

New files for mapping translations are in the bible-dev repo, @amir-zeldes. The mapping is based on the CoptOT September 2018 text data.

1) A csv file mapping most of the KNOWN verses where the LXX translation versification does not align with the CoptOT data versification. The mapping does not cover all books/chapters/verses; it only maps known problems. This file is NOT ready to go. Please read...
2) My notes file. This has all my notes on how I found misalignments, what's going on in each book with a known misalignment, and a list of books/chapters with problems for which I have not yet figured out the solution.

We may need to go over this material in person.

ctschroeder commented 5 years ago

See the mapping file + notes in the OT folder. The decision was to null out the English translation wherever it was difficult to match up the Coptic data with the Brenton translation. We may hire a student to work on that this fall.
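
The approach described in this thread (remap the known misaligned verses, null out the translation where no reliable match exists) can be sketched roughly as follows. This is only an illustration: the column names `coptot_ref` and `lxx_ref`, the `BookA` references, and both function names are hypothetical, and the actual csv layout in bible-dev may differ.

```python
import csv
import io

# Hypothetical mapping data: one known misalignment between the two versifications.
MAPPING_CSV = "coptot_ref,lxx_ref\nBookA 1:2,BookA 1:3\n"


def load_mapping(csv_text):
    """Read a two-column csv into a dict: CoptOT reference -> LXX translation reference."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["coptot_ref"]: row["lxx_ref"] for row in reader}


def align_translations(coptot_refs, translations, mapping, unresolved):
    """Attach a translation to each CoptOT verse, or None where matching is unreliable."""
    aligned = {}
    for ref in coptot_refs:
        if ref in unresolved:
            # Null out rather than risk showing the wrong verse's translation.
            aligned[ref] = None
        else:
            # Verses without a mapping entry are assumed to share the same reference.
            source_ref = mapping.get(ref, ref)
            aligned[ref] = translations.get(source_ref)
    return aligned


mapping = load_mapping(MAPPING_CSV)
translations = {
    "BookA 1:1": "verse one",
    "BookA 1:3": "verse three",
    "BookA 2:1": "uncertain",
}
aligned = align_translations(
    ["BookA 1:1", "BookA 1:2", "BookA 2:1"],
    translations,
    mapping,
    unresolved={"BookA 2:1"},
)
print(aligned)
```

Verses listed in `unresolved` end up with no translation at all, which leaves room for filling them in by hand later.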

ctschroeder commented 5 years ago

From email (so I don't lose this thread): items still under discussion

Data

Metadata corpus AND document levels

Metadata Corpus level

Metadata Document level

amir-zeldes commented 5 years ago

Should be good to go; a test version is behind login on ANNIS.

ctschroeder commented 5 years ago

This looks really good. There are some odd “book names” that I didn’t notice before, like “epistle of Jeremiah”, which isn’t an epistle (it’s the prophetic book Jeremiah, right?), but I assume that comes from the CoptOT data?

amir-zeldes commented 5 years ago

Thanks - there are a couple of new books, all based on the CoptOT book designations. I think the Epistle of Jeremiah is a separate work not included in all canons, see https://en.wikipedia.org/wiki/Letter_of_Jeremiah

ctschroeder commented 5 years ago

Ah ok. Obv I did not look closely. Let me check the abbreviations in the urns.

ctschroeder commented 5 years ago

@amir-zeldes do you have a table you use to produce the urns? Looks like some of them may need adjusting. Letter of Jeremiah docs have the same abbreviation and urns as Jeremiah. (Abbreviation for the former is usually Ep Jer according to SBL.) I'm not seeing anything else needing changing at a glance, but if you have a table I can look at, it would probably be more effective than trying to spot-check docs. Thanks!

amir-zeldes commented 5 years ago

Actually, that was just a bug: Jeremiah was overwriting the similar urn key from Epistle. I'll fix that. Here's the mapping; I think it was supposed to follow SBL, which is where I got it from:

# Maps CoptOT book designations to the abbreviations used in the urns
urn_dict = {
    "Genesis": "gen",
    "Exodus": "exod",
    "Leviticus": "lev",
    "Numbers": "num",
    "Deuteronomy": "deut",
    "Joshua": "josh",
    "Judges": "judg",
    "Ruth": "ruth",
    "I Samuel": "1sam",
    "II Samuel": "2sam",
    "I Kings": "1kgs",
    "II Kings": "2kgs",
    "Job": "job",
    "Proverbs": "prov",
    "Psalms": "pss",
    "Ecclesiastes": "eccl",
    "Song of Solomon": "song",
    "Isaiah": "isa",
    "Jeremiah": "jer",
    "Lamentations": "lam",
    "Ezekiel": "ezek",
    "Daniel": "dan",
    "Hosea": "hos",
    "Amos": "amos",
    "Joel": "joel",
    "Obadia": "obad",
    "Jonah": "jonah",
    "Micah": "mic",
    "Nahum": "nah",
    "Habakkuk": "hab",
    "Zephaniah": "zeph",
    "Haggai": "hag",
    "Zechariah": "zech",
    "Tobit": "tob",
    "Baruch": "bar",
    "I Esdras": "1esd",
    "I Maccabees": "1macc",
    "II Maccabees": "2macc",
    "III Maccabees": "3macc",
    "IV Maccabees": "4macc",
    "Prayer of Manasses": "prman",
    "Prayer of Azariah": "prazar",
    "Bel and the Dragon": "bel",
    "Susanna": "sus",
    "Wisdom": "wis",
    "Sirach": "sir",
    "Epistle of Jeremiah": "epjer",
    "Judith": "jdt",
}
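
For illustration, here is one way a collision like this could arise, along with a quick check that would catch it. This is a guess at the failure mode, not the actual pipeline code: the substring matching is an assumption, and `lookup_by_substring`, `lookup_exact`, and `duplicate_abbreviations` are hypothetical names.

```python
from collections import defaultdict

# Two entries from the mapping, in the same order as in the full table.
urn_sample = {
    "Jeremiah": "jer",
    "Epistle of Jeremiah": "epjer",
}


def lookup_by_substring(title, mapping):
    """Hypothetical buggy lookup: return the first key found inside the title."""
    for book, abbr in mapping.items():
        if book in title:
            return abbr
    return None


def lookup_exact(title, mapping):
    """Exact-match lookup: a title resolves only to its own key."""
    return mapping.get(title)


def duplicate_abbreviations(abbr_by_title):
    """Return the abbreviations that were assigned to more than one title."""
    titles_by_abbr = defaultdict(list)
    for title, abbr in abbr_by_title.items():
        titles_by_abbr[abbr].append(title)
    return {abbr: titles for abbr, titles in titles_by_abbr.items() if len(titles) > 1}


titles = ["Jeremiah", "Epistle of Jeremiah"]
buggy = {t: lookup_by_substring(t, urn_sample) for t in titles}
fixed = {t: lookup_exact(t, urn_sample) for t in titles}

print(duplicate_abbreviations(buggy))  # {'jer': ['Jeremiah', 'Epistle of Jeremiah']}
print(duplicate_abbreviations(fixed))  # {}
```

Running a duplicate check like this over all generated urns would be an alternative to spot-checking individual documents.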
ctschroeder commented 5 years ago

Ok thanks!  Looks good!