DARIAH-ERIC / lexicalresources

Data space of the DARIAH Lexical Resources Working Group
https://dariah-eric.github.io/lexicalresources/
BSD 2-Clause "Simplified" License
18 stars 24 forks source link

Implement basic page layout elements #184

Open ttasovac opened 1 year ago

ttasovac commented 1 year ago

This is a placeholder from the Lexical Resources Summit.

After much discussion, and in the context of the ongoing work on the TEI Lex-0 Publisher, we agreed that:

xlhrld commented 1 year ago

Related to #4

daliboris commented 1 year ago

We also need to document properly what to do when breaks occur between two entries or divs. If it should be placed between these elements – as the thing itself appears in its nature: the page must start first, followed by the new entry -, or the pb and cb elements must be first child element of divs or entries.

See an example from TEI Guidlines:

 <p>
<!-- ... -->
 </p>
 <pb n="2" facs="page2.png"/>
<!-- similarly, for page 2 -->
 <p>

But I also know that some software (EVT, for example) needs that milestone elements should be always inside the block element with the text content.

daliboris commented 1 year ago

There is also a @break attribute that indicates if pb/cb/lb elements are breaking the word/token or not.

I must check if the indexing mechanism in eXist-db or BaseX can handle it, or it's better to use something similar to this (except that the <w> element is not allowed in the TEI Lex-0):

<quote>... sed <w>imp<lb/>erator</w> dixit ... </quote>
laurentromary commented 1 year ago

I think it would be ideal to keep to the tEI mechanism as much as we can.