DARIAH-ERIC / lexicalresources

Data space of the DARIAH Lexical Resources Working Group
https://dariah-eric.github.io/lexicalresources/
BSD 2-Clause "Simplified" License
18 stars 24 forks source link

8.3 тур. vs. * #164

Open bansp opened 2 years ago

bansp commented 2 years ago

In the "incorrect" encoding in 8.3., the source language is identified as 'тур'. Below, in the "correct" encoding, the canonical form is given in the attribute(s), but the original version is replaced by '*', and that seems to be wrong (?).

Aside from that, it struck me that it needn't be immediately obvious how the attributes @value, @norm, and the element content interplay in

<lang value="tr" expand="турцизам" norm="tr">*</lang>

-- it would be great to have at least cursory info on that, and maybe

https://dariah-eric.github.io/lexicalresources/pages/TEILex0/spec.html#TEI.lang

is a good place to add the above example to the spec, and say one sentence about what all these animals do there.

Thanks for considering that! :-)

ttasovac commented 1 year ago

@bansp I'm so sorry I missed your post. I only see it now.

The asterisk is used as a symbol for Turkisms in some Serbian dictionaries, but I will have to look into these examples again and see what's up, because something, as you noticed, doesn't quite match up.

We also need a proper explanation of the attributes used — and probably with a better example than some obscure Serbian dictionary that nobody understands.

I've assigned this to myself and will get back to it.