openscriptures / morphhb

Open Scriptures Hebrew Bible
https://hb.openscriptures.org

Parsing inseparable prepositions with the definite article #4

Closed dowens76 closed 6 years ago

dowens76 commented 11 years ago

Currently the schema parses inseparable prepositions with the article in a strange way that could cause problems. As an example, consider: לַ/חֹ֣דֶשׁ The lemma is l/2320. This term includes two segments, the preposition and the noun. Its parsing, however, would involve three: HR/Tp/Ncmsa. Perhaps it would be better to parse the definite article as an attribute of the preposition, so: HRd/Ncmsa, with Rd representing "preposition with the definite article."
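The mismatch described above can be checked mechanically. The following is only a minimal sketch, assuming that segments are separated by "/" and that the leading "H" on the morphology code is a language prefix rather than part of the first segment (as the examples in this issue suggest). The function names are hypothetical and are not part of the project's parser.

```python
def segment_counts(lemma: str, morph: str) -> tuple[int, int]:
    """Return (lemma segment count, morphology segment count).

    Assumption: segments are separated by "/" and a leading "H"
    on the morphology code marks the language, not a segment.
    """
    lemma_segments = lemma.split("/")
    # Drop the assumed language prefix before splitting.
    body = morph[1:] if morph.startswith("H") else morph
    morph_segments = body.split("/")
    return len(lemma_segments), len(morph_segments)


def is_aligned(lemma: str, morph: str) -> bool:
    """True when the lemma and morphology code have equally many segments."""
    lemma_count, morph_count = segment_counts(lemma, morph)
    return lemma_count == morph_count


# Current scheme: two lemma segments against three morphology segments.
print(is_aligned("l/2320", "HR/Tp/Ncmsa"))
# Proposed scheme: the article is folded into the preposition segment.
print(is_aligned("l/2320", "HRd/Ncmsa"))
```

With the current scheme the two counts differ (2 against 3), while the proposed HRd form keeps one morphology segment per lemma segment.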

DavidTroidl commented 11 years ago

Something has to be done, one way or the other. The site won't allow HR/Td/Ncmsa. Would allowing HRd/Ncmsa cover the b, k and l prepositions? I'd have to redo the morphology codes and the code parser to make it work. It shouldn't be a problem. Otherwise it would mess up the segmentation. Let me know.


dowens76 commented 11 years ago

I believe it would work for all three of the inseparable prepositions. They behave the same. I think we should do this.

dowens76 commented 11 years ago

I am assuming that this is the consensus now, using "HRd" for prepositions with the definite article. The default is without the article and needs no further code. I am closing this issue and will modify the parsing schema. The necessary changes will need to be made on the website.

DavidTroidl commented 11 years ago

I agree that this is the right solution. I'll have to look at the mechanics of the morph parser, to see how to fit this in.

David


dowens76 commented 11 years ago

In light of a private email conversation, we have not resolved this yet. In the interest of simplicity, we do not want to add an attribute to the preposition part of speech. Currently we have a particle type:

p = definite article with inseparable preposition

If we want to use that, we should bring all prepositions under particles and just use two types. For the preposition we cannot use "Tr" because that currently signifies a relative. I think it would be easier to make a regular preposition "p" because it is memorable. What about these two particle types:

p = preposition
q = inseparable preposition with the definite article

That keeps the two close together, and "q" is a little bit like "p+." At least, it makes sense to me.
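To make the proposal in this comment concrete, here is a rough sketch of how the two particle types might be looked up. It assumes that particle segments have the form "T" plus a one-letter type, as in the "Tp" and "Tr" codes mentioned in this thread. The table and function names are hypothetical, and the sketch reflects only this proposal, not necessarily the codes that were finally adopted.

```python
# Particle types as proposed in this comment, plus the existing
# relative ("Tr") that the comment mentions. Illustrative only.
PROPOSED_PARTICLE_TYPES = {
    "p": "preposition",
    "q": "inseparable preposition with the definite article",
    "r": "relative",
}


def describe_particle(segment: str) -> str:
    """Describe a particle segment such as "Tp" or "Tq".

    Assumption: a particle segment is "T" followed by one type letter.
    """
    if len(segment) != 2 or segment[0] != "T":
        raise ValueError(f"not a particle segment: {segment!r}")
    type_letter = segment[1]
    if type_letter not in PROPOSED_PARTICLE_TYPES:
        raise ValueError(f"unknown particle type: {type_letter!r}")
    return PROPOSED_PARTICLE_TYPES[type_letter]


print(describe_particle("Tp"))
print(describe_particle("Tq"))
```

Under this proposal, a form like the one in the opening comment would presumably be coded with a single "Tq" segment before the noun, which keeps the segment count equal to that of the lemma.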

DavidTroidl commented 6 years ago

This has been resolved in the Hebrew Morphology Codes.