anacastrosalgado / TEI

1 stars 0 forks source link

Basic lexicographic article structure: ESTOJO” [case; cover; kit] / JALDE [yellow color] #2

Open anacastrosalgado opened 2 years ago

anacastrosalgado commented 2 years ago

The lexicographic articles in Morais 1st edition present some traditional typographic features, such as: the headword in upper case letter, followed by a comma (a conventional typographical delimiter) preceded by a blank space, the abbreviated grammatical information (part of speech and gender usually) and finally the lexicographic definition.

ESTOJO_1 ESTOJO [case; cover; kit], example of a basic lexicographic article structure

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS_1.DLP.ESTOJO" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>ESTOJO</orth>
   </form>
   <metamark function="entryTypographiConvention">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <sense xml:id="MORAIS_1.DLP.ESTOJO_s1">
      <def>caixinha de coiro , ou papélão com repartimentos para navalhas , teſouras , facas , canivetes , &amp;</def>
      <pc>.</pc>
   </sense>
</entry>

The commas, through the dictionary, are encoded as a when they separate different elements or as an if is just a punctuation character. Sometimes (it is worth remembering that we are dealing with a historical dictionary, from the 18th century, all written by hand), instead of the comma as the delimiter, a semicolon appears. Obviously, the TEI element remains the same:

JALDE_1

JALDE [yellow color], example of an article with a semicolon delimiting the lemma from the POS

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS_1.DLP.JALDE" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>JALDE</orth>
   </form>
   <metamark function="entryTypographiConvention">;</metamark>
   <gramGrp>
      <gram type="pos" norm="ADJECTIVE">adj.</gram>
   </gramGrp>
   <sense xml:id="MORAIS_1.DLP.JALDE_s1">
      <def>còr amarella acceza</def>
      <pc>.</pc>
   </sense>
</entry>
laurentromary commented 2 years ago

I see that all values of metalmark/@function are entryTypographiConvention. Is there a more fine-grained typology?

anacastrosalgado commented 2 years ago

<metamark function="entryTypographiConvention">,</metamark> or <metamark function="entryTypographiConvention">;</metamark> DOUBT: Probably "entry" is not a good choice.

Ex.:


            <form type="lemma">
               <orth>FE’DO</orth>
            </form>
            <metamark function="entryTypographiConvention">,</metamark>
            <gramGrp>```

```<metamark function="senseTypographiConvention">§</metamark>```

Ex.: 
```<sense xml:id="MORAIS_1.DLP.JANTAR_s3">
                  <def>Porção de dinheiro , que as Villas , e Cidades davão aos Reis , quando hião de correição para ſuſtento de ſua comitiva</def>
                  <pc>.</pc>
               </sense>```

```<metamark function="usgTypographiConvention">,</metamark>```

Ex.:
```<sense xml:id="MORAIS_1.DLP.FEDO_s1">
               <def>feio</def>
               <pc>.</pc>
               <cit type="example" xml:lang="pt">
                  <bibl type="attestation" source="#M._L._Monarchia_Luſitana">
                     <title>Luz da Medicina</title>
                  </bibl>
                  <pc>,,</pc>
                  <quote>lepra , e outros achaques fédos</quote>
               </cit>
               <metamark function="usgTypographiConvention">,</metamark>
                  <usg type="frequency">p. usado</usg>
                  <pc>.</pc>
            </sense>```
laurentromary commented 2 years ago

OK. This has to be fine-tuned!

anacastrosalgado commented 2 years ago

Basic lexicographic article structure without explicit part of speech and where the number appears before the gender (plur. femin.)

ABADERNAS_Morais_1

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>ABADERNAS</orth>
      <metamark function="entryTypographiConvention">,</metamark>
   </form>
   <gramGrp>
      <gram type="number">plur.</gram>
      <gram type="gender">femin.</gram>
   </gramGrp>
   <sense xml:id="MOR1.DLP.ABADERNAS_s.1">
      <usg type="domain">naut.</usg>
      <def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def>
      <pc>.</pc>
   </sense>
</entry>
anacastrosalgado commented 2 years ago

@laurentromary After talking with Toma, I agree on using Delimiter instead of Typographic Convention. In any case, I believe that both of you will still discuss this point together.

<metamark function="posDelimiter">,</metamark>
<metamark function="senseDelimiter">§</metamark>
<metamark function="usageDelimiter">,</metamark>
laurentromary commented 2 years ago

Basic lexicographic article structure without explicit part of speech and where the number appears before the gender (plur. femin.)

ABADERNAS_Morais_1

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>ABADERNAS</orth>
      <metamark function="entryTypographiConvention">,</metamark>
   </form>
   <gramGrp>
      <gram type="number">plur.</gram>
      <gram type="gender">femin.</gram>
   </gramGrp>
   <sense xml:id="MOR1.DLP.ABADERNAS_s.1">
      <usg type="domain">naut.</usg>
      <def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def>
      <pc>.</pc>
   </sense>
</entry>

Just thinking aloud about this example, shouldn't the metamark and pc elements be outside the form and sense elements respectively? That would be cleaner from a content point of view and definitely makes sense for the metalmark: it makes the boundary between for and the rest...

anacastrosalgado commented 2 years ago

@laurentromary You're totally right, and I regret having this kind of issue in my encoding :( Probably I have to change my glasses!

ABADERNAS_Morais_1

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MOR1.DLP.ABADERNAS" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>ABADERNAS</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="number">plur.</gram>
      <gram type="gender">femin.</gram>
   </gramGrp>
   <sense xml:id="MOR1.DLP.ABADERNAS_s.1">
      <usg type="domain">naut.</usg>
      <def>ganchos onde ſe fixão os colhedores, e outros cabos, quando ſe aperta a enxarcia</def>
   </sense>
   <pc>.</pc>
</entry>
anacastrosalgado commented 2 years ago

Instead of having posDelimiter, I'm just wondering if it is not better to have lemmaDelimiter. It delimits the lemma from the POS...

anacastrosalgado commented 2 years ago

The syntax is different (after)...

lemmaDelimiter: after a lemma

usageDelimiter: before a usage label

senseDelimiter before a new sense