Open anacastrosalgado opened 2 years ago
I see that all values of metalmark/@function
are entryTypographiConvention
. Is there a more fine-grained typology?
<metamark function="entryTypographiConvention">,</metamark> or <metamark function="entryTypographiConvention">;</metamark>
DOUBT: Probably "entry" is not a good choice.
Ex.:
<form type="lemma">
<orth>FE’DO</orth>
</form>
<metamark function="entryTypographiConvention">,</metamark>
<gramGrp>```
```<metamark function="senseTypographiConvention">§</metamark>```
Ex.:
```<sense xml:id="MORAIS_1.DLP.JANTAR_s3">
<def>Porção de dinheiro , que as Villas , e Cidades davão aos Reis , quando hião de correição para ſuſtento de ſua comitiva</def>
<pc>.</pc>
</sense>```
```<metamark function="usgTypographiConvention">,</metamark>```
Ex.:
```<sense xml:id="MORAIS_1.DLP.FEDO_s1">
<def>feio</def>
<pc>.</pc>
<cit type="example" xml:lang="pt">
<bibl type="attestation" source="#M._L._Monarchia_Luſitana">
<title>Luz da Medicina</title>
</bibl>
<pc>,,</pc>
<quote>lepra , e outros achaques fédos</quote>
</cit>
<metamark function="usgTypographiConvention">,</metamark>
<usg type="frequency">p. usado</usg>
<pc>.</pc>
</sense>```
OK. This has to be fine-tuned!
Basic lexicographic article structure without explicit part of speech and where the number appears before the gender (plur. femin.)
<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt">
<form type="lemma">
<orth>ABADERNAS</orth>
<metamark function="entryTypographiConvention">,</metamark>
</form>
<gramGrp>
<gram type="number">plur.</gram>
<gram type="gender">femin.</gram>
</gramGrp>
<sense xml:id="MOR1.DLP.ABADERNAS_s.1">
<usg type="domain">naut.</usg>
<def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def>
<pc>.</pc>
</sense>
</entry>
@laurentromary After talking with Toma, I agree on using Delimiter instead of Typographic Convention. In any case, I believe that both of you will still discuss this point together.
<metamark function="posDelimiter">,</metamark>
<metamark function="senseDelimiter">§</metamark>
<metamark function="usageDelimiter">,</metamark>
Basic lexicographic article structure without explicit part of speech and where the number appears before the gender (plur. femin.)
<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt"> <form type="lemma"> <orth>ABADERNAS</orth> <metamark function="entryTypographiConvention">,</metamark> </form> <gramGrp> <gram type="number">plur.</gram> <gram type="gender">femin.</gram> </gramGrp> <sense xml:id="MOR1.DLP.ABADERNAS_s.1"> <usg type="domain">naut.</usg> <def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def> <pc>.</pc> </sense> </entry>
Just thinking aloud about this example, shouldn't the metamark and pc elements be outside the form and sense elements respectively? That would be cleaner from a content point of view and definitely makes sense for the metalmark: it makes the boundary between for and the rest...
@laurentromary You're totally right, and I regret having this kind of issue in my encoding :( Probably I have to change my glasses!
<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt">
<form type="lemma">
<orth>ABADERNAS</orth>
</form>
<metamark function="lemmaDelimiter">,</metamark>
<gramGrp>
<gram type="number">plur.</gram>
<gram type="gender">femin.</gram>
</gramGrp>
<sense xml:id="MOR1.DLP.ABADERNAS_s.1">
<usg type="domain">naut.</usg>
<def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def>
</sense>
<pc>.</pc>
</entry>
Instead of having posDelimiter, I'm just wondering if it is not better to have lemmaDelimiter. It delimits the lemma from the POS...
The syntax is different (after)...
lemmaDelimiter: after a lemma
usageDelimiter: before a usage label
senseDelimiter before a new sense
The lexicographic articles in Morais 1st edition present some traditional typographic features, such as: the headword in upper case letter, followed by a comma (a conventional typographical delimiter) preceded by a blank space, the abbreviated grammatical information (part of speech and gender usually) and finally the lexicographic definition.
ESTOJO [case; cover; kit], example of a basic lexicographic article structure
The commas, through the dictionary, are encoded as a when they separate different elements or as an if is just a punctuation character.
Sometimes (it is worth remembering that we are dealing with a historical dictionary, from the 18th century, all written by hand), instead of the comma as the delimiter, a semicolon appears. Obviously, the TEI element remains the same:
JALDE [yellow color], example of an article with a semicolon delimiting the lemma from the POS