openscriptures / morphhb

Open Scriptures Hebrew Bible
https://hb.openscriptures.org

Accent Catalog #30

Closed jcuenod closed 7 years ago

jcuenod commented 7 years ago

Hi, I'm trying to parse the ETCBC data to build accentuation information. I have found your accent catalog (https://github.com/openscriptures/morphhb/blob/master/structure/OshbVerse/Script/AccentCatalog.js) particularly helpful for that (maybe I will submit a couple of other issues once I've used it a bit more). One thing that has just struck me as I've been testing my own code is that I searched for "Merkha with Atnach" (\u05A5\u0591), a poetic accent, and found only one instance but it was in Jer 46:4. Am I correct in understanding this to be a mistaken categorisation? Thanks
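
A search like the one described above can be sketched in Python. This is only an illustration, not the catalog's own code: the helper names `mark_sequence` and `is_merkha_with_atnach` are hypothetical, and treating U+0591..U+05AE plus meteg (U+05BD) as the marks to collect is an assumption about how the accents are extracted.

```python
import re

MERKHA = "\u05A5"
ETNAHTA = "\u0591"  # also called Atnach

# Assumption: collect cantillation marks U+0591..U+05AE plus meteg (U+05BD).
MARKS = re.compile("[\u0591-\u05AE\u05BD]")


def mark_sequence(word: str) -> str:
    """Return the accents (and metegs) of a word in stored order."""
    return "".join(MARKS.findall(word))


def is_merkha_with_atnach(word: str) -> bool:
    """True when the word's only marks are Merkha followed by Etnahta."""
    return mark_sequence(word) == MERKHA + ETNAHTA


# Synthetic test word (not a real form): alef + merkha, bet + etnahta.
synthetic = "\u05D0\u05A5\u05D1\u0591"
print(is_merkha_with_atnach(synthetic))
```

Because the comparison is exact, a word with any additional mark between the two accents does not match.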

DavidTroidl commented 7 years ago

Thanks for catching that. I confirmed the reference and updated the file.

jcuenod commented 7 years ago

Hmm, okay now I just realised that Job 32:11 has the word תְּב֥וּנֹֽתֵיכֶ֑ם which nearly matches but has a meteg between the other two accents. Sorry I didn't catch this earlier.

DavidTroidl commented 7 years ago

Fixed again. In the demo, I left the meteg out, when identifying accents, and sacrificed identifying silluq, just to avoid the multiplicity of combinations.
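
Leaving the meteg out when identifying accents could look roughly like the sketch below. It is an assumption-laden illustration, not the demo's code: `accent_sequence` is a hypothetical helper, and the Job 32:11 word is spelled with code-point escapes whose mark order within each letter is assumed.

```python
import re

METEG = "\u05BD"  # silluq shares this code point

# Assumption: cantillation marks U+0591..U+05AE, plus meteg.
ACCENTS_AND_METEG = re.compile("[\u0591-\u05AE\u05BD]")


def accent_sequence(word, ignore_meteg=True):
    """List a word's accent marks, optionally dropping meteg."""
    marks = ACCENTS_AND_METEG.findall(word)
    if ignore_meteg:
        marks = [m for m in marks if m != METEG]
    return marks


# The Job 32:11 word from the thread, written with escapes
# (mark order per letter is an assumption).
JOB_32_11 = (
    "\u05EA\u05BC\u05B0\u05D1\u05A5\u05D5\u05BC\u05E0\u05B9\u05BD"
    "\u05EA\u05B5\u05D9\u05DB\u05B6\u0591\u05DD"
)

print(accent_sequence(JOB_32_11))
print(accent_sequence(JOB_32_11, ignore_meteg=False))
```

Since silluq and meteg share the code point U+05BD, dropping one also drops the other, which is the trade-off mentioned above.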

jcuenod commented 7 years ago

Does this give you 100% coverage? I'm still getting a number of other combinations (especially with maqqef). I also ignore metegs if there's not a pattern, and I'm thinking of doing that with maqqef too, but I need to take a closer look at the actual words. The weird thing is that it's not always the last accent, which is what I was expecting. These are the outstanding combinations for me:

Outstanding combinations:
[Darga, Maqaf]
[Darga, Meteg, Tevir]
[Darga, Munah]
[Darga, Telisha Qetana]
[Darga, Tevir]
[Dehi]
[Dehi, Meteg, Maqaf]
[Etnahta, Geresh]
[Etnahta, Meteg, Sof Pasuq]
[Etnahta, Pashta]
[Etnahta, Segol]
[Etnahta, Sof Pasuq]
[Etnahta, Zaqef Qatan]
[Geresh, Etnahta]
[Geresh Muqdam]
[Geresh Muqdam, Meteg]
[Geresh Muqdam, Revia]
[Geresh Muqdam, Telisha Qetana]
[Geresh, Tipeha]
[Geresh, Zaqef Qatan]
[Gershayim, Revia]
[Mahapakh, Maqaf]
[Mahapakh, Maqaf, Qadma, Pashta]
[Mahapakh, Merkha]
[Mahapakh, Meteg, Maqaf]
[Mahapakh, Munah]
[Mahapakh, Paseq, Pashta]
[Mahapakh, Qadma]
[Mahapakh, Revia]
[Maqaf, Etnahta]
[Maqaf, Geresh]
[Maqaf, Mahapakh]
[Maqaf, Merkha]
[Maqaf, Meteg, Sof Pasuq]
[Maqaf, Munah]
[Maqaf, Pashta]
[Maqaf, Qadma]
[Maqaf, Qadma, Pashta]
[Maqaf, Revia]
[Maqaf, Telisha Gedola]
[Maqaf, Telisha Qetana]
[Maqaf, Tevir]
[Maqaf, Tipeha]
[Maqaf, Zaqef Gadol]
[Maqaf, Zaqef Qatan]
[Maqaf, Zinor]
[Merkha, Mahapakh]
[Merkha, Maqaf]
[Merkha, Maqaf, Tipeha]
[Merkha, Merkha]
[Merkha, Meteg, Maqaf]
[Merkha, Munah]
[Merkha, Pashta]
[Merkha, Qadma, Pashta]
[Merkha, Telisha Qetana]
[Merkha, Tipeha, Merkha, Meteg, Sof Pasuq]
[Merkha, Zinor]
[Meteg, Etnahta, Sof Pasuq]
[Meteg, Geresh, Pashta]
[Meteg, Maqaf, Darga]
[Meteg, Maqaf, Etnahta]
[Meteg, Maqaf, Geresh]
[Meteg, Maqaf, Gershayim]
[Meteg, Maqaf, Mahapakh]
[Meteg, Maqaf, Merkha]
[Meteg, Maqaf, Merkha, Tipeha]
[Meteg, Maqaf, Meteg, Sof Pasuq]
[Meteg, Maqaf, Munah]
[Meteg, Maqaf, Pashta]
[Meteg, Maqaf, Qadma]
[Meteg, Maqaf, Qadma, Pashta]
[Meteg, Maqaf, Revia]
[Meteg, Maqaf, Tevir]
[Meteg, Maqaf, Tipeha]
[Meteg, Maqaf, Zaqef Qatan]
[Meteg, Maqaf, Zinor]
[Meteg, Merkha, Maqaf]
[Meteg, Munah, Maqaf]
[Meteg, Munah, Segol]
[Meteg, Ole]
[Meteg, Ole, Maqaf]
[Meteg, Qadma, Maqaf]
[Meteg, Revia, Sof Pasuq]
[Meteg, Segol, Sof Pasuq]
[Meteg, Tipeha, Pashta]
[Meteg, Zaqef Qatan, Sof Pasuq]
[Meteg, Zaqef Qatan, Zinor]
[Munah, Maqaf]
[Munah, Maqaf, Telisha Gedola]
[Munah, Merkha]
[Munah, Meteg, Maqaf]
[Munah, Meteg, Maqaf, Etnahta]
[Munah, Meteg, Maqaf, Pazer]
[Munah, Munah]
[Munah, Pashta]
[Munah, Qadma, Pashta, Paseq]
[Munah, Segol]
[Munah, Telisha Gedola]
[Munah, Telisha Qetana]
[Munah, Tevir]
[Munah, Tipeha]
[Munah, Zinor]
[Ole, Maqaf]
[Ole, Meteg, Maqaf]
[Pashta, Meteg, Zaqef Qatan]
[Pashta, Zaqef Qatan]
[Pazer, Geresh]
[Pazer, Zaqef Qatan]
[Qadma, Geresh, Telisha Qetana]
[Qadma, Mahapakh, Pashta]
[Qadma, Maqaf]
[Qadma, Merkha]
[Qadma, Munah]
[Qadma, Munah, Pashta, Paseq]
[Qadma, Qadma]
[Qadma, Tipeha]
[Qadma, Tipeha, Pashta]
[Revia, Etnahta]
[Revia, Meteg, Sof Pasuq]
[Telisha Gedola, Mahapakh]
[Telisha Qetana, Qadma]
[Telisha Qetana, Telisha Qetana]
[Tevir, Telisha Qetana]
[Tipeha, Geresh]
[Tipeha, Maqaf]
[Tipeha, Merkha, Meteg, Sof Pasuq]
[Tipeha, Meteg, Maqaf]
[Tipeha, Munah]
[Tipeha, Munah, Etnahta]
[Tipeha, Munah, Paseq]
[Tipeha, Pashta]
[Tipeha, Qadma, Pashta]
[Tipeha, Zinor]
[Yetiv, Zaqef Qatan]
[Zaqef Gadol, Maqaf]
[Zaqef Gadol, Pashta]
[Zaqef Qatan, Etnahta]
[Zaqef Qatan, Geresh]
[Zaqef Qatan, Merkha]
[Zaqef Qatan, Meteg, Sof Pasuq]
[Zaqef Qatan, Pashta]
[Zaqef Qatan, Pazer]
[Zaqef Qatan, Revia]
[Zarqa, Zinor]

DavidTroidl commented 7 years ago

We aren't treating maqqef as an accent, but as a connector, like a hyphen. The words on either side of the maqqef are treated as separate. I don't know how that will affect your list.

Also, are you working directly with the OSHB, or is this based on the other text? There are places where the WLC has double accents on each word, notably in the Ten Commandments, to indicate two separate traditions. In the OSHB, we separate these traditions, listing one in the main text and the other in notes after each such word. This may be a source of unusual combinations of accents, especially something like [Etnahta, Meteg, Sof Pasuq].
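
Treating maqqef as a connector rather than an accent, as described above, might be sketched like this. The function name `split_words` is hypothetical, and the sketch assumes the text is a plain string with words separated by whitespace or by maqqef (U+05BE).

```python
MAQAF = "\u05BE"


def split_words(text):
    """Split on whitespace and on maqqef, so each side is its own word."""
    words = []
    for chunk in text.split():
        # Drop empty parts, e.g. when a chunk ends with maqqef.
        words.extend(part for part in chunk.split(MAQAF) if part)
    return words


# Synthetic example: alef, maqqef, bet, space, gimel.
print(len(split_words("\u05D0\u05BE\u05D1 \u05D2")))
```

With the words separated this way, accent combinations are collected per word, and maqqef never appears in a combination.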

jcuenod commented 7 years ago

You're quite right. I'm using the ETCBC data and I've just modified the way that I'm looking for accents (which means I'm handling words now) and ignoring maqqef. I also hadn't factored in the double accent tradition in the decalogue. Nevertheless, with these changes and ignoring the decalogue I still have 227 instances unresolved with 46 combinations:

Composite accents:
[Darga, Meteg, Tevir]
[Darga, Munah]
[Darga, Telisha Qetana]
[Darga, Tevir]
[Dehi]
[Etnahta, Pashta]
[Etnahta, Sof Pasuq]
[Etnahta, Zaqef Qatan]
[Geresh Muqdam]
[Geresh Muqdam, Meteg]
[Geresh Muqdam, Revia]
[Geresh Muqdam, Telisha Qetana]
[Geresh Muqdam, Tevir]
[Geresh Muqdam, Tipeha]
[Mahapakh, Paseq, Pashta]
[Merkha, Merkha]
[Merkha, Pashta]
[Merkha, Qadma, Pashta]
[Merkha, Tipeha, Merkha, Meteg, Sof Pasuq]
[Merkha, Zinor]
[Meteg, Meteg]
[Meteg, Munah, Segol]
[Meteg, Ole]
[Munah, Merkha]
[Munah, Munah]
[Munah, Segol]
[Munah, Telisha Gedola]
[Munah, Telisha Qetana]
[Munah, Tipeha]
[Munah, Zinor]
[Pashta, Meteg, Zaqef Qatan]
[Pashta, Zaqef Qatan]
[Qadma, Merkha]
[Qadma, Qadma]
[Telisha Gedola, Mahapakh]
[Telisha Gedola, Telisha Gedola]
[Telisha Qetana, Qadma]
[Telisha Qetana, Telisha Qetana]
[Tipeha, Merkha, Meteg, Sof Pasuq]
[Tipeha, Munah]
[Tipeha, Munah, Etnahta]
[Tipeha, Pashta]
[Yetiv, Zaqef Qatan]
[Zaqef Gadol, Pashta]
[Zaqef Qatan, Merkha]
[Zarqa, Zinor]
All 227 are listed here with references:
('Genesis', 4, 19) [Meteg, Meteg]
('Genesis', 15, 10) [Meteg, Meteg]
('Genesis', 21, 14) [Meteg, Meteg]
('Genesis', 21, 21) [Meteg, Meteg]
('Genesis', 24, 7) [Meteg, Meteg]
('Genesis', 30, 37) [Meteg, Meteg]
('Genesis', 32, 32) [Meteg, Meteg]
('Genesis', 34, 16) [Meteg, Meteg]
('Genesis', 35, 22) [Zaqef Gadol, Pashta]
('Genesis', 35, 22) [Tipeha, Pashta]
('Genesis', 35, 22) [Etnahta, Zaqef Qatan]
('Genesis', 36, 13) [Munah, Munah]
('Genesis', 36, 39) [Munah, Munah]
('Genesis', 38, 23) [Meteg, Meteg]
('Genesis', 41, 45) [Meteg, Munah, Segol]
('Genesis', 48, 7) [Telisha Qetana, Telisha Qetana]
('Exodus', 2, 3) [Meteg, Meteg]
('Exodus', 4, 13) [Meteg, Meteg]
('Exodus', 14, 9) [Pashta, Meteg, Zaqef Qatan]
('Exodus', 21, 10) [Meteg, Meteg]
('Leviticus', 1, 3) [Geresh Muqdam]
('Leviticus', 15, 14) [Meteg, Meteg]
('Leviticus', 15, 29) [Meteg, Meteg]
('Numbers', 11, 35) [Darga, Meteg, Tevir]
('Numbers', 25, 19) [Etnahta, Sof Pasuq]
('Numbers', 30, 9) [Meteg, Meteg]
('Numbers', 30, 13) [Meteg, Meteg]
('Numbers', 32, 38) [Darga, Tevir]
('Deuteronomy', 21, 13) [Telisha Qetana, Telisha Qetana]
('Deuteronomy', 22, 7) [Meteg, Meteg]
('Deuteronomy', 28, 9) [Meteg, Meteg]
('Joshua', 7, 2) [Merkha, Qadma, Pashta]
('Joshua', 13, 6) [Merkha, Qadma, Pashta]
('Joshua', 13, 17) [Tipeha, Merkha, Meteg, Sof Pasuq]
('Joshua', 13, 27) [Munah, Telisha Qetana]
('Joshua', 15, 25) [Mahapakh, Paseq, Pashta]
('Joshua', 15, 61) [Yetiv, Zaqef Qatan]
('Joshua', 18, 22) [Darga, Meteg, Tevir]
('Joshua', 19, 27) [Munah, Segol]
('Joshua', 21, 11) [Telisha Qetana, Qadma]
('Joshua', 21, 32) [Merkha, Pashta]
('Judges', 1, 5) [Merkha, Qadma, Pashta]
('Judges', 17, 2) [Meteg, Meteg]
('Judges', 17, 7) [Merkha, Qadma, Pashta]
('Judges', 17, 8) [Merkha, Qadma, Pashta]
('Judges', 17, 9) [Merkha, Qadma, Pashta]
('Judges', 19, 1) [Meteg, Meteg]
('Judges', 19, 17) [Darga, Telisha Qetana]
('1_Samuel', 17, 12) [Merkha, Qadma, Pashta]
('1_Samuel', 20, 20) [Meteg, Meteg]
('1_Samuel', 22, 12) [Meteg, Meteg]
('1_Samuel', 26, 19) [Meteg, Meteg]
('2_Samuel', 2, 8) [Merkha, Qadma, Pashta]
('2_Samuel', 2, 15) [Merkha, Qadma, Pashta]
('2_Samuel', 3, 8) [Zarqa, Zinor]
('1_Kings', 1, 51) [Meteg, Meteg]
('2_Kings', 15, 29) [Munah, Zinor]
('2_Kings', 16, 7) [Telisha Gedola, Mahapakh]
('2_Kings', 17, 13) [Geresh Muqdam, Telisha Qetana]
('2_Kings', 17, 24) [Telisha Gedola, Telisha Gedola]
('2_Kings', 20, 12) [Munah, Telisha Gedola]
('2_Kings', 25, 27) [Munah, Telisha Qetana]
('Isaiah', 7, 3) [Tipeha, Munah]
('Isaiah', 8, 1) [Merkha, Tipeha, Merkha, Meteg, Sof Pasuq]
('Isaiah', 8, 3) [Merkha, Tipeha, Merkha, Meteg, Sof Pasuq]
('Isaiah', 39, 1) [Munah, Telisha Gedola]
('Isaiah', 50, 2) [Merkha, Zinor]
('Jeremiah', 11, 10) [Telisha Qetana, Telisha Qetana]
('Jeremiah', 15, 18) [Etnahta, Pashta]
('Jeremiah', 27, 13) [Etnahta, Pashta]
('Jeremiah', 37, 20) [Meteg, Meteg]
('Jeremiah', 38, 20) [Meteg, Meteg]
('Jeremiah', 39, 3) [Munah, Telisha Gedola]
('Jeremiah', 44, 17) [Meteg, Meteg]
('Jeremiah', 48, 30) [Qadma, Qadma]
('Jeremiah', 52, 31) [Munah, Telisha Qetana]
('Ezekiel', 3, 15) [Munah, Telisha Gedola]
('Ezekiel', 47, 10) [Merkha, Qadma, Pashta]
('Ezekiel', 48, 1) [Munah, Telisha Qetana]
('Hosea', 4, 11) [Meteg, Meteg]
('Hosea', 4, 17) [Meteg, Meteg]
('Amos', 7, 2) [Meteg, Meteg]
('Zechariah', 3, 8) [Meteg, Meteg]
('Psalms', 2, 2) [Geresh Muqdam]
('Psalms', 8, 7) [Geresh Muqdam]
('Psalms', 14, 2) [Geresh Muqdam]
('Psalms', 18, 15) [Geresh Muqdam]
('Psalms', 18, 38) [Geresh Muqdam]
('Psalms', 21, 6) [Geresh Muqdam]
('Psalms', 21, 9) [Geresh Muqdam, Meteg]
('Psalms', 21, 14) [Geresh Muqdam, Meteg]
('Psalms', 24, 3) [Geresh Muqdam]
('Psalms', 24, 9) [Geresh Muqdam]
('Psalms', 28, 6) [Geresh Muqdam]
('Psalms', 30, 11) [Geresh Muqdam]
('Psalms', 31, 5) [Geresh Muqdam]
('Psalms', 35, 9) [Geresh Muqdam]
('Psalms', 35, 20) [Geresh Muqdam]
('Psalms', 35, 28) [Geresh Muqdam]
('Psalms', 40, 9) [Meteg, Meteg]
('Psalms', 46, 4) [Meteg, Meteg]
('Psalms', 46, 8) [Geresh Muqdam]
('Psalms', 46, 12) [Geresh Muqdam]
('Psalms', 47, 3) [Geresh Muqdam]
('Psalms', 48, 1) [Geresh Muqdam]
('Psalms', 49, 17) [Geresh Muqdam]
('Psalms', 56, 10) [Qadma, Merkha]
('Psalms', 60, 2) [Munah, Zinor]
('Psalms', 60, 3) [Munah, Merkha]
('Psalms', 63, 5) [Geresh Muqdam]
('Psalms', 64, 7) [Meteg, Meteg]
('Psalms', 66, 1) [Geresh Muqdam]
('Psalms', 68, 34) [Geresh Muqdam]
('Psalms', 73, 20) [Geresh Muqdam]
('Psalms', 78, 35) [Geresh Muqdam]
('Psalms', 78, 38) [Geresh Muqdam]
('Psalms', 81, 9) [Meteg, Meteg]
('Psalms', 94, 16) [Geresh Muqdam]
('Psalms', 97, 7) [Geresh Muqdam]
('Psalms', 98, 9) [Meteg, Meteg]
('Psalms', 105, 15) [Geresh Muqdam]
('Psalms', 105, 24) [Geresh Muqdam, Meteg]
('Psalms', 106, 46) [Geresh Muqdam, Tevir]
('Psalms', 107, 9) [Geresh Muqdam]
('Psalms', 109, 8) [Meteg, Meteg]
('Psalms', 109, 9) [Geresh Muqdam]
('Psalms', 113, 1) [Geresh Muqdam, Meteg]
('Psalms', 118, 15) [Geresh Muqdam]
('Psalms', 118, 16) [Geresh Muqdam]
('Psalms', 118, 19) [Geresh Muqdam]
('Psalms', 119, 21) [Geresh Muqdam]
('Psalms', 119, 42) [Geresh Muqdam]
('Psalms', 119, 155) [Geresh Muqdam]
('Psalms', 124, 1) [Geresh Muqdam]
('Psalms', 129, 1) [Geresh Muqdam]
('Psalms', 129, 2) [Geresh Muqdam]
('Psalms', 131, 3) [Geresh Muqdam]
('Psalms', 132, 1) [Geresh Muqdam]
('Psalms', 134, 2) [Geresh Muqdam]
('Psalms', 135, 1) [Geresh Muqdam, Meteg]
('Psalms', 135, 7) [Geresh Muqdam]
('Psalms', 139, 17) [Geresh Muqdam]
('Psalms', 140, 6) [Meteg, Meteg]
('Psalms', 141, 5) [Meteg, Meteg]
('Psalms', 143, 8) [Geresh Muqdam]
('Psalms', 147, 1) [Geresh Muqdam]
('Job', 1, 11) [Meteg, Meteg]
('Job', 2, 5) [Meteg, Meteg]
('Job', 2, 8) [Meteg, Meteg]
('Job', 5, 10) [Geresh Muqdam]
('Job', 5, 24) [Geresh Muqdam]
('Job', 6, 29) [Geresh Muqdam]
('Job', 8, 2) [Geresh Muqdam]
('Job', 12, 9) [Geresh Muqdam]
('Job', 14, 1) [Meteg, Meteg]
('Job', 15, 17) [Meteg, Meteg]
('Job', 22, 7) [Meteg, Meteg]
('Job', 22, 10) [Geresh Muqdam, Meteg]
('Job', 24, 18) [Geresh Muqdam]
('Job', 26, 4) [Geresh Muqdam]
('Job', 28, 12) [Geresh Muqdam]
('Job', 29, 3) [Geresh Muqdam]
('Job', 29, 8) [Geresh Muqdam, Meteg]
('Job', 31, 15) [Geresh Muqdam, Meteg]
('Job', 31, 20) [Geresh Muqdam]
('Job', 32, 20) [Meteg, Meteg]
('Job', 33, 31) [Meteg, Meteg]
('Job', 33, 33) [Meteg, Meteg]
('Job', 36, 3) [Meteg, Meteg]
('Job', 38, 29) [Geresh Muqdam]
('Job', 39, 25) [Geresh Muqdam]
('Job', 40, 19) [Geresh Muqdam]
('Job', 42, 3) [Meteg, Ole]
('Job', 42, 3) [Dehi]
('Job', 42, 3) [Geresh Muqdam, Revia]
('Job', 42, 4) [Meteg, Meteg]
('Job', 42, 4) [Dehi]
('Job', 42, 4) [Geresh Muqdam, Revia]
('Job', 42, 5) [Geresh Muqdam, Revia]
('Job', 42, 6) [Dehi]
('Proverbs', 7, 25) [Geresh Muqdam]
('Proverbs', 10, 10) [Munah, Tipeha]
('Proverbs', 10, 14) [Geresh Muqdam]
('Proverbs', 12, 11) [Meteg, Meteg]
('Proverbs', 14, 13) [Geresh Muqdam, Tipeha]
('Proverbs', 14, 34) [Meteg, Meteg]
('Proverbs', 15, 13) [Geresh Muqdam]
('Proverbs', 16, 24) [Geresh Muqdam]
('Proverbs', 17, 4) [Geresh Muqdam]
('Proverbs', 19, 22) [Geresh Muqdam]
('Proverbs', 20, 13) [Meteg, Meteg]
('Proverbs', 22, 24) [Geresh Muqdam]
('Proverbs', 22, 29) [Meteg, Meteg]
('Proverbs', 22, 29) [Geresh Muqdam]
('Proverbs', 26, 10) [Meteg, Meteg]
('Proverbs', 26, 27) [Geresh Muqdam]
('Proverbs', 27, 4) [Geresh Muqdam]
('Proverbs', 28, 19) [Meteg, Meteg]
('Proverbs', 28, 19) [Meteg, Meteg]
('Proverbs', 28, 22) [Geresh Muqdam]
('Proverbs', 30, 22) [Meteg, Meteg]
('Proverbs', 31, 2) [Geresh Muqdam]
('Proverbs', 31, 5) [Geresh Muqdam, Meteg]
('Proverbs', 31, 6) [Geresh Muqdam]
('Ruth', 1, 1) [Darga, Munah]
('Ruth', 1, 11) [Meteg, Meteg]
('Ruth', 1, 21) [Merkha, Merkha]
('Lamentations', 5, 5) [Meteg, Meteg]
('Esther', 6, 13) [Telisha Qetana, Telisha Qetana]
('Daniel', 6, 17) [Meteg, Meteg]
('Daniel', 6, 21) [Meteg, Meteg]
('Ezra', 2, 59) [Merkha, Qadma, Pashta]
('Nehemiah', 7, 61) [Merkha, Qadma, Pashta]
('Nehemiah', 12, 29) [Pashta, Zaqef Qatan]
('Nehemiah', 13, 15) [Meteg, Meteg]
('1_Chronicles', 2, 19) [Meteg, Meteg]
('1_Chronicles', 2, 54) [Merkha, Qadma, Pashta]
('1_Chronicles', 2, 54) [Tipeha, Munah, Etnahta]
('1_Chronicles', 4, 32) [Zaqef Qatan, Merkha]
('1_Chronicles', 5, 23) [Darga, Tevir]
('2_Chronicles', 2, 2) [Meteg, Meteg]
('2_Chronicles', 2, 6) [Meteg, Meteg]
('2_Chronicles', 2, 7) [Meteg, Meteg]
('2_Chronicles', 8, 18) [Meteg, Meteg]
('2_Chronicles', 11, 18) [Meteg, Meteg]
('2_Chronicles', 18, 2) [Meteg, Meteg]
('2_Chronicles', 19, 2) [Zarqa, Zinor]

Thanks for the help on this!
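
For anyone wanting to check totals such as "227 instances with 46 combinations" against a list in the format above, a minimal tally could look like the following. It assumes each line has exactly the shape `('Book', chapter, verse) [Accent, Accent]`; `parse_line` and `tally` are hypothetical names, and the sample is just four lines copied from the list.

```python
import ast
from collections import Counter


def parse_line(line):
    """Split "('Genesis', 4, 19) [Meteg, Meteg]" into (reference, combination)."""
    ref_text, _, combo_text = line.strip().partition(") [")
    reference = ast.literal_eval(ref_text + ")")
    names = combo_text.rstrip("]").split(",")
    combination = tuple(name.strip() for name in names)
    return reference, combination


def tally(lines):
    """Count how often each accent combination occurs."""
    return Counter(parse_line(line)[1] for line in lines if line.strip())


sample = [
    "('Genesis', 4, 19) [Meteg, Meteg]",
    "('Genesis', 35, 22) [Zaqef Gadol, Pashta]",
    "('Genesis', 36, 13) [Munah, Munah]",
    "('Genesis', 36, 39) [Munah, Munah]",
]
counts = tally(sample)
print(sum(counts.values()), "instances,", len(counts), "combinations")
```

Run over the full list, `sum(counts.values())` gives the number of instances and `len(counts)` the number of distinct combinations.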

DavidTroidl commented 7 years ago

I worked through the Torah so far. I consolidated the list, by omitting the double Meteg references. I am accumulating changes to the Accent Catalog, before I commit them to GitHub. Here are the results so far:

Genesis.35.22 [Zaqef Gadol, Pashta] double accent tradition
Genesis.35.22 [Tipeha, Pashta] double accent tradition
Genesis.35.22 [Etnahta, Zaqef Qatan] double accent tradition
Genesis.36.13 [Munah, Munah] addition
Genesis.36.39 [Munah, Munah] בַּ֣עַל חָנָ֣ן two words
Genesis.41.45 [Meteg, Munah, Segol] צָֽפְנַ֣ת פַּעְנֵחַ֒ two words
Genesis.48.7 [Telisha Qetana, Telisha Qetana] addition
Exodus.14.9 [Pashta, Meteg, Zaqef Qatan] פִּי֙ הַֽחִירֹ֔ת two words
Leviticus.1.3 [Geresh Muqdam] addition
Numbers.11.35 [Darga, Meteg, Tevir] מִ/קִּבְר֧וֹת הַֽתַּאֲוָ֛ה two words
Numbers.25.19 [Etnahta, Sof Pasuq] no Sof Pasuq in WLC (26:1 also has no Atnach)
Numbers.32.38 [Darga, Tevir] בַּ֧עַל מְע֛וֹן two words
Deuteronomy.21.13 [Telisha Qetana, Telisha Qetana] +1

As you can see, there are differences between your text and the WLC that cause some of the discrepancies.

jcuenod commented 7 years ago

Thanks, you've really helped me make a lot of progress on this (https://github.com/jcuenod/tf-accent-data). I really appreciate the help, I'm down to about 108 issues (but I haven't dealt with double traditions yet and I need to figure out a better way of handling maqqef - I don't know whether you noticed this but I think all the instances of double meteg precede a maqqef)

DavidTroidl commented 7 years ago

I finished going through the list, and posted the update to Accent Catalog. In resolving these issues, I found Job 42:1-6 has to be interpreted as poetic. I updated Accent Interpretation to reflect this. Here is the list with notes:

Genesis.35.22 [Zaqef Gadol, Pashta] double accent tradition
Genesis.35.22 [Tipeha, Pashta] double accent tradition
Genesis.35.22 [Etnahta, Zaqef Qatan] double accent tradition
Genesis.36.13 [Munah, Munah] addition
Genesis.36.39 [Munah, Munah] בַּ֣עַל חָנָ֣ן two words
Genesis.41.45 [Meteg, Munah, Segol] צָֽפְנַ֣ת פַּעְנֵחַ֒ two words
Genesis.48.7 [Telisha Qetana, Telisha Qetana] addition
Exodus.14.9 [Pashta, Meteg, Zaqef Qatan] פִּי֙ הַֽחִירֹ֔ת two words
Leviticus.1.3 [Geresh Muqdam] addition
Numbers.11.35 [Darga, Meteg, Tevir] מִ/קִּבְר֧וֹת הַֽתַּאֲוָ֛ה two words
Numbers.25.19 [Etnahta, Sof Pasuq] no Sof Pasuq in WLC (26:1 also has no Atnach)
Numbers.32.38 [Darga, Tevir] בַּ֧עַל מְע֛וֹן two words
Deuteronomy.21.13 [Telisha Qetana, Telisha Qetana] +1
Joshua.7.2 [Merkha, Qadma, Pashta] בֵּ֥ית אָ֨וֶן֙ two words
Joshua.13.6 [Merkha, Qadma, Pashta] מִשְׂרְפֹ֥ת מַ֨יִם֙ two words
Joshua.13.17 [Tipeha, Merkha, Meteg, Sof Pasuq] וּ/בֵ֖ית בַּ֥עַל מְעֽוֹן׃ three words
Joshua.13.27 [Munah, Telisha Qetana] בֵּ֣ית הָרָם֩ two words
Joshua.15.25 [Mahapakh, Paseq, Pashta] וְ/חָצ֤וֹר ׀ חֲדַתָּה֙ two words
Joshua.15.61 [Yetiv, Zaqef Qatan] בֵּ֚ית הָעֲרָבָ֔ה two words
Joshua.18.22 [Darga, Meteg, Tevir] וּ/בֵ֧ית הָֽעֲרָבָ֛ה two words
Joshua.19.27 [Munah, Segol] בֵּ֣ית דָּגֹן֒ two words
Joshua.21.11 [Telisha Qetana, Qadma] קִרְיַת֩ אַרְבַּ֨ע two words
Joshua.21.32 [Merkha, Pashta] חַמֹּ֥ת דֹּאר֙ two words
Judges.1.5 [Merkha, Qadma, Pashta] אֲדֹנִ֥י בֶ֨זֶק֙ two words
Judges.17.7 [Merkha, Qadma, Pashta] מִ/בֵּ֥ית לֶ֨חֶם֙ two words
Judges.17.8 [Merkha, Qadma, Pashta] מִ/בֵּ֥ית לֶ֨חֶם֙ two words
Judges.17.9 [Merkha, Qadma, Pashta] מִ/בֵּ֥ית לֶ֨חֶם֙ two words
Judges.19.17 [Darga, Telisha Qetana] no Telisha Qetana in WLC
1_Samuel.17.12 [Merkha, Qadma, Pashta] מִ/בֵּ֥ית לֶ֨חֶם֙ two words
2_Samuel.2.8 [Merkha, Qadma, Pashta] אִ֥ישׁ בֹּ֨שֶׁת֙ two words
2_Samuel.2.15 [Merkha, Qadma, Pashta] אִ֥ישׁ בֹּ֨שֶׁת֙ two words
2_Samuel.3.8 [Zarqa, Zinor] addition
2_Kings.15.29 [Munah, Zinor] תִּגְלַ֣ת פִּלְאֶסֶר֮ two words
2_Kings.16.7 [Telisha Gedola, Mahapakh] תִּ֠גְלַת פְּלֶ֤סֶר two words
2_Kings.17.13 [Geresh Muqdam, Telisha Qetana] שֻׁ֝֠בוּ Geresh Muqdam, Telisha Gedola in WLC (added)

jcuenod commented 7 years ago

This is great, thanks for the update! I am in the process of working through the update. One thing that I still don't understand from before is how you are determining whether an accent combination is conjunctive or disjunctive. There are two accents that could be either conjunctive or disjunctive: \u05A5 and \u05AB (I haven't checked for more since the update). I can see that it has something to do with the n value that is sometimes on w nodes, but the meaning of n is not clear to me.

jcuenod commented 7 years ago

Having run through my text with your updates, I only have 8 instances that I can't explain. All but one of them are simply the issue of a different text (which you have noted, thanks!). The exception is Psalms.56.10 [Qadma, Merkha]. You have added Merkha with Qadma to poetry, but this instance is the other way around. Is this a mistake? (Actually, I realise that the accents are on the same letter, but I would have thought there would be a conventional order that is supposed to be followed or something.)

DavidTroidl commented 7 years ago

The n attributes were added specifically to indicate disjunctive breaks in the verse, and their relation to each other. They are what are used to do the accent structure in the demo.

DavidTroidl commented 7 years ago

The WLC generally follows the mark order given in the SBL Hebrew User Manual, p. 1. Here below marks precede above marks. I recorded what I found in the WLC, so this represents another disagreement between the texts.
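
One way to keep stored mark order from causing mismatches between texts is to read each letter's accents below-first before comparing. The sketch below is only an illustration of that idea: `accents_below_first` is a hypothetical helper, and the `BELOW` set is an assumed list of the accents written under the letter, not something taken from the Accent Catalog.

```python
import re

# Assumption: accents written below the letter (Etnahta, Tipeha, Yetiv,
# Tevir, Atnah Hafukh, Munah, Mahapakh, Merkha, Merkha Kefula, Darga,
# Yerah Ben Yomo, Dehi). All other accents are treated as above marks.
BELOW = set("\u0591\u0596\u059A\u059B\u05A2\u05A3\u05A4\u05A5\u05A6\u05A7\u05AA\u05AD")

ACCENT = re.compile("[\u0591-\u05AE]")
# A cluster is one Hebrew letter plus everything up to the next letter.
CLUSTER = re.compile("[\u05D0-\u05EA][^\u05D0-\u05EA]*")


def accents_below_first(word):
    """List a word's accents, with each letter's below marks first."""
    ordered = []
    for cluster in CLUSTER.findall(word):
        marks = ACCENT.findall(cluster)
        # sorted() is stable; below marks get key False and sort first.
        ordered.extend(sorted(marks, key=lambda m: m not in BELOW))
    return ordered


# Synthetic letter carrying Qadma then Merkha, and the reverse order.
qadma_first = "\u05D0\u05A8\u05A5"
merkha_first = "\u05D0\u05A5\u05A8"
print(accents_below_first(qadma_first) == accents_below_first(merkha_first))
```

Only marks on the same letter are reordered; the sequence of accents across letters is left untouched.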

jcuenod commented 7 years ago

Interesting, so how do you tell the difference - is any ole followed by a yored a combined accent?

DavidTroidl commented 7 years ago

Yes. As far as I know, Ole never appears without Yored, forming Ole We Yored. Only the Yored appears alone, when necessary, because the Ole has nowhere to go.
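
The rule stated above (an Ole followed by a Yored forms Ole We Yored) could be checked along these lines. This is a hedged sketch, not the catalog's logic: `find_ole_we_yored` is a hypothetical name, "followed by" is assumed to mean "the very next accent mark", and the input is assumed to be a list of words already split on whitespace and maqqef.

```python
import re

OLE = "\u05AB"
YORED = "\u05A5"  # same code point as Merkha
ACCENT = re.compile("[\u0591-\u05AE]")


def find_ole_we_yored(words):
    """Return (word_index, split_across_words) for each Ole + Yored pair."""
    # Flatten to (word index, accent) pairs in reading order.
    flat = [
        (index, mark)
        for index, word in enumerate(words)
        for mark in ACCENT.findall(word)
    ]
    pairs = []
    for (i, first), (j, second) in zip(flat, flat[1:]):
        if first == OLE and second == YORED:
            pairs.append((i, i != j))
    return pairs


# Synthetic words: Ole and Yored on one word, then split across two words.
print(find_ole_we_yored(["\u05D0\u05AB\u05D1\u05A5"]))
print(find_ole_we_yored(["\u05D0\u05AB", "\u05D1\u05A5"]))
```

The second element of each result flags pairs whose Ole and Yored sit on different words.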

jcuenod commented 7 years ago

The ETCBC guys do a lot of stuff with jupyter notebooks to process their data. I put together this notebook to find ole weyoreds that are split across words. If you get a chance, I'd be grateful if you would take a look at it: https://github.com/jcuenod/tf-notebooks/blob/master/splitOleWeyoreds.ipynb

DavidTroidl commented 7 years ago

I'm not familiar with jupyter notebooks. Looking over the list, I find that 41 out of 44 instances have Yored on the first letter. The other three are a little surprising. However, looking at the list led me to make a few corrections to Accent Catalog. Thanks.