anacastrosalgado / TEI

1 stars 0 forks source link

JACARE’ ou JACAREO [cayman; alligator], lemma followed by an orthographic variant #10

Open anacastrosalgado opened 1 year ago

anacastrosalgado commented 1 year ago

JACARE_morais_1

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS.1.DLP.JACARE" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth>JACARE’</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <metamark function="variantDelimiter">ou</metamark>
   <form type="variant" xml:id="JACAREO" xml:lang="pt">
      <orth>JACAREO</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <note>(o primeiro he mais commum no Brasil)</note>
   <sense xml:id="MORAIS.1.DLP.JACARE.s.1">
      <def>o mesmo , que o crocodilo</def>
   </sense>
   <pc>.</pc>
</entry>
laurentromary commented 1 year ago

We need to have a clear editorial strategy concerning the use of capitals: do we capitalise or keep the orthographic form as it appears in the source? If we also provide a normalised form in @norm that mays also allow us to be even more faithful to the source.

anacastrosalgado commented 1 year ago

@laurentromary We have to be faithful to the source for preservation purposes, so I changed the capitals. I'm keeping the attribute norm but probably, as Boris suggested, we can use the XSLT transformation later... Meanwhile, I was thinking and "o mesmo que crocodilo" [the same as alligator] is a cross-reference. Is it strange if I keep the <def>?

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS.1.DLP.JACARE" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth norm="JACARE’">JACARE</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <metamark function="variantDelimiter">ou</metamark>
   <form type="variant" xml:id="JACAREO" xml:lang="pt">
      <orth>Jacareo</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <note>(o primeiro he mais commum no Brasil)</note>
   <sense xml:id="MORAIS.1.DLP.JACARE.s.1">
      <def>o mesmo , que o <ref target="#MORAIS.1.DLP.CROCODILO" type="mainEntry">crocodilo</ref></def>
   </sense>
   <pc>.</pc>
</entry>
laurentromary commented 1 year ago

Not a <def>. Clearly an <xr>.

anacastrosalgado commented 1 year ago

It is very curious because these exercises highlight what we (lexicographers) do wrong in order to adapt to the tools we use. This is a case where we (ACL Portuguese team) would put the information inside the <def>, but only because we have no choice... And by the way, do you agree on using <seg>?

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS.1.DLP.JACARE" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth norm="JACARE’">JACARE</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <metamark function="variantDelimiter">ou</metamark>
   <form type="variant" xml:id="JACAREO" xml:lang="pt">
      <orth>Jacareo</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <note>(o primeiro he mais commum no Brasil)</note>
   <xr type="related">
      <seg>o mesmo , que o</seg>
      <ref target="#MORAIS.1.DLP.CROCODILO" type="mainEntry">crocodilo</ref>
   </xr>
   <pc>.</pc>
</entry>
anacastrosalgado commented 1 year ago

We can also classify the <note> as <note type="usageNote"> . Well... it's a frequency label by means of a note.

laurentromary commented 1 year ago

Why not <lbl> like any kind of similar.... label?

daliboris commented 1 year ago

I'm just working on it and I'm suggesting this solution:


<form type="lemma" xml:id="MORAIS.1.DLP.JACARE.f.1">
  <orth norm="JACARE’">JACARE</orth>
</form>
...
<usg type="geographic">
  (o <ref type="entry" target="#MORAIS.1.DLP.JACARE.f.1">primeiro</ref> he mais commum no Brasil)
</usg>

I've problem with the reference type, I'm referring to the lemma (orthography), but is not @type="entry" too wide?

laurentromary commented 1 year ago

We can also classify the <note> as <note type="usageNote"> . Well... it's a usage label by means of a note....

So it's a <usg>, isn't it? (type='geographic')

anacastrosalgado commented 1 year ago

I can't use <usg> (type='geographic') because it is not exclusively for Brazil... And why this, @daliboris ? Sorry, I don't understand.

  (o <ref type="entry" target="#MORAIS.1.DLP.JACARE.f.1">primeiro</ref>

The note means that JACARE was more common in Portugal and in Brazil JACAREO (in the 18th century!)

laurentromary commented 1 year ago

I can't use <usg> (type='geographic') because it is not exclusively for Brazil...

But <usg> does not mean exclusivity: it is a semantic qualification for what it is said here. Imagine that in another entry it issaid "exclusively in Basil": you would have used <usg>, but not here => we need consistant encoding.

daliboris commented 1 year ago

The note means that JACARE was more common in Portugal and in Brazil JACAREO (in the 18th century!)

Sorry, according to DeepL translation "o primeiro he mais commum no Brasil" means "the first is more common in Brazil". I thought that "first" means "first headword in the entry". But you suggest that "primeiro" points to the second variant in the entry, aren't you?

Now I see, that o primeiro can be also translated as "former"- now I understand.

I that case I can change my code:

(o <ref type="entry" target="#MORAIS.1.DLP.JACARE.v.JACAREO">primeiro</ref>

(for the value of @xml:id see next comment)

daliboris commented 1 year ago

Use unique @xml:ids as possible, so I suggest:

<form type="variant" xml:id="MORAIS.1.DLP.JACARE.v.JACAREO" xml:lang="pt">

Delimiter v. as for "variant".

anacastrosalgado commented 1 year ago

Done.

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS.1.DLP.JACARE" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth norm="JACARE’">JACARE</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <metamark function="variantDelimiter">ou</metamark>
   <form type="variant" xml:id="MORAIS.1.DLP.JACARE.v.JACAREO" xml:lang="pt">
      <orth>Jacareo</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <usg type="geographic">
      (o <ref type="entry" target="#MORAIS.1.DLP.JACARE.f.1">primeiro</ref> he mais commum no Brasil)
   </usg>

   <note>(o primeiro he mais commum no Brasil)</note>
   <xr type="related">
      <lbl>o mesmo , que o</lbl>
      <ref target="#MORAIS.1.DLP.CROCODILO" type="mainEntry">crocodilo</ref>
   </xr>
   <pc>.</pc>
</entry>
anacastrosalgado commented 1 year ago

Another example of an orthographic variant.

JEROGLIFICO_morais_1

<entry xmlns="http://www.tei-c.org/ns/1.0" xml:id="MORAIS.1.DLP.JEROGLIFICO" type="mainEntry" xml:lang="pt">
   <form type="lemma">
      <orth norm="JACARE’">JEROGLIFICO</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <metamark function="variantDelimiter">ou</metamark>
   <form type="variant" xml:id="MORAIS.1.DLP.JEROGLIFICO.v.JEROGLIPHICO" xml:lang="pt">
      <orth>JEROGLIPHICO</orth>
   </form>
   <metamark function="lemmaDelimiter">,</metamark>
   <gramGrp>
      <gram type="pos" norm="NOUN">ſ.</gram>
      <gram type="gen">m.</gram>
   </gramGrp>
   <sense xml:id="MORAIS.1.DLP.JEROGLIFICO.s.1">
      <def>pintura emblematica , e ſignificativa de conceitos , como hoje o são as palavras eſcritas , forão
         uſados pelos Egypcios ; ou reprefentavão
         ideias myſterioſas da ſua religião
      </def>
      <cit type="example" xml:lang="pt">
         <pc>.</pc>
         <bibl type="attestation" source="#Vieira">
            <title>Vieira</title>
            <!-- I don't know what is 4 and n. -->
            <citedRange unit="???">4</citedRange>
            <citedRange unit="???">n. 230</citedRange>
            <pc>,,</pc>
         </bibl>
         <quote>a eſte jeroglifico de Salomão</quote>
      </cit>
   </sense>
   <pc>.</pc>
</entry>
anacastrosalgado commented 1 year ago

@daliboris So for the variant delimiter I also have to add <hi>: <metamark function="variantDelimiter"><hi>ou</hi></metamark>