Conal-Tuohy / TroveProxy

A transforming proxy and harvester for the National Library of Australia's Trove API
Apache License 2.0

How should a collection of Trove records be represented in TEI? #1

Closed Conal-Tuohy closed 10 months ago

Conal-Tuohy commented 1 year ago

A collection of texts could be represented either as a teiCorpus (conceptually, a collection of texts) or as a TEI/text/group (conceptually, a single text consisting of a compilation of texts).

From the TEI Guidelines, "Varieties of Composite Text": [...] In corpora, the component samples are clearly distinct texts, but the systematic collection, standardized preparation, and common markup of the corpus often make it useful to treat the entire corpus as a unit, too. Some corpora may become so well established as to be regarded as texts in their own right; the Brown and LOB corpora are now close to achieving this status. [...] The group element is provided to simplify the encoding of collections, anthologies, and cyclic works; as noted above, the group element can also be used to record the potentially complex internal structure of language corpora. For a full description, see chapter 4 Default Text Structure.
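To make the two options concrete, here is a rough Python sketch of the two document shapes. It is not TroveProxy code: the `(title, content)` record pairs, the helper names, and the placeholder header statements are all assumptions made for illustration. The point is only that teiCorpus gives every record its own TEI element and teiHeader, while group nests the records as text elements under a single TEI with one shared header.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)


def el(parent, name, text=None):
    """Append a TEI-namespaced child element to parent and return it."""
    child = ET.SubElement(parent, f"{{{TEI_NS}}}{name}")
    if text is not None:
        child.text = text
    return child


def header(parent, title):
    """Add a minimal teiHeader; publication and source text are placeholders."""
    file_desc = el(el(parent, "teiHeader"), "fileDesc")
    el(el(file_desc, "titleStmt"), "title", title)
    el(el(file_desc, "publicationStmt"), "p", "Unpublished sketch")
    el(el(file_desc, "sourceDesc"), "p", "Placeholder source description")


def as_corpus(records):
    """Option 1: teiCorpus, where each record is a TEI with its own header."""
    root = ET.Element(f"{{{TEI_NS}}}teiCorpus")
    header(root, "Collection")  # corpus-level header
    for title, content in records:
        tei = el(root, "TEI")
        header(tei, title)  # per-record header
        el(el(el(tei, "text"), "body"), "p", content)
    return root


def as_group(records):
    """Option 2: one TEI whose text holds a group of nested text elements."""
    root = ET.Element(f"{{{TEI_NS}}}TEI")
    header(root, "Collection")  # the only header in the document
    group = el(el(root, "text"), "group")
    for title, content in records:
        body = el(el(group, "text"), "body")
        el(body, "head", title)
        el(body, "p", content)
    return root


# Hypothetical records standing in for harvested Trove articles.
records = [("First article", "Text one."), ("Second article", "Text two.")]
corpus = as_corpus(records)
grouped = as_group(records)
print(ET.tostring(corpus, encoding="unicode"))
print(ET.tostring(grouped, encoding="unicode"))
```

In the teiCorpus shape, per-record metadata has a natural home in each record's own teiHeader; in the group shape there is only one header for the whole document, so record-level metadata would need to go somewhere else.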

Conal-Tuohy commented 10 months ago

Decided on teiCorpus, which better reflects the semantics of a set of texts that were published independently, transcribed independently, and later grouped together for analytical purposes (rather than a set of texts that were published together as a compendium or anthology and subsequently transcribed from that publication).