open-editions / corpus-joyce-portrait-TEI

The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man
GNU General Public License v3.0

The Open Scholarly Edition of James Joyce’s A Portrait of the Artist as a Young Man

Join the chat at https://gitter.im/open-editions/Lobby

This is the open scholarly edition of James Joyce's novel A Portrait of the Artist as a Young Man, richly annotated in semantic markup with TEI XML.

See a short presentation about it here, and try an XSL-transformed prototype, which contains a few of the text's features, here on Netlify. The transformed prototype is not meant to be a canonical representation of the edition's features; the main text of the edition is portrait.xml in this repository.

About this Project

We intend to aggregate much of the existing knowledge about the book in a single file: textual notes, critical commentary, interpretations, and more. We want to be able to answer questions such as these:

These questions are easily answerable with very simple computational analyses, but these analyses need a machine-readable edition of the novel. This is possible with TEI XML, a markup format that allows for semantic description of the text. Most existing TEI editions, however, don't use the full semantic features of the language. TEI features an epigraph tag, for instance, but epigraphs are often marked up as quotes, verse lines, or, even worse, italicized text. Those descriptions are all true, but not as useful to computational literary analysis as the epigraph tag.

When we see an epigraph, we mark it up as such. When we see dialog, we mark it as such and indicate who is speaking. When we correct an OCR or typographical error in the text, we document the reasons why. One error that appears in most print editions is the word "green": Joyce meant it to read "geen," indicating an infantile pronunciation of the color word. We correct errors like this by consulting the manuscripts.
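
Once dialog carries a speaker attribution, counting speeches per character becomes a short query. The following is a minimal sketch, not the project's own tooling: it assumes dialog is encoded with the TEI `<said who="...">` element, and the fragment, the placeholder speech text, and the speaker IDs (`#speakerA`, `#speakerB`) are all hypothetical. The fragment also omits the TEI namespace that a full edition file would normally declare.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical fragment: the <said who="..."> encoding, the speaker IDs,
# and the placeholder text are assumptions, not copied from portrait.xml.
SAMPLE = """
<div>
  <p><said who="#speakerA">First placeholder line.</said></p>
  <p><said who="#speakerB">Second placeholder line.</said></p>
  <p>Narration without speech.</p>
  <p><said who="#speakerA">Third placeholder line.</said></p>
</div>
"""


def count_speeches(xml_text: str) -> Counter:
    """Count <said> elements per value of their 'who' attribute."""
    root = ET.fromstring(xml_text)
    # Speeches with no 'who' attribute are grouped under "unknown".
    return Counter(said.get("who", "unknown") for said in root.iter("said"))


counts = count_speeches(SAMPLE)
print(counts.most_common())
```

The same pattern would work for any attribute-bearing element, which is the practical payoff of marking up who is speaking rather than only marking quotation marks.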

We also mark up:

Apart from markup features, one of the distinguishing features of this project is its infrastructure. We use distributed version control with Git and GitHub, which allows anyone to edit the edition in the browser and submit a pull request with the change.

Markup Conventions

See the issue tracker for a more complete list.

An Example Epigraph

<epigraph>
    <cit> 
        <quote xml:lang="lat"><hi rend="italic">Et ignotas animum dimittit in artes.</hi></quote> 
        <bibl>- Ovid, <hi rend="italic">Metamorphoses</hi>, VIII, 188</bibl> 
    </cit> 
</epigraph>
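
Because the epigraph is tagged as an epigraph, a program can retrieve it directly instead of guessing from italics or quotation markup. Here is a small sketch using Python's standard library; the function name `read_epigraph` is our own, and the snippet is parsed without the TEI namespace, which a query against the full portrait.xml would probably need to include.

```python
import xml.etree.ElementTree as ET

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

EPIGRAPH = """<epigraph>
    <cit>
        <quote xml:lang="lat"><hi rend="italic">Et ignotas animum dimittit in artes.</hi></quote>
        <bibl>- Ovid, <hi rend="italic">Metamorphoses</hi>, VIII, 188</bibl>
    </cit>
</epigraph>"""


def read_epigraph(xml_text: str) -> dict:
    """Return the quoted text, its language code, and its citation."""
    root = ET.fromstring(xml_text)  # root is the <epigraph> element
    quote = root.find("./cit/quote")
    bibl = root.find("./cit/bibl")
    return {
        # itertext() flattens nested <hi> elements into plain text.
        "text": "".join(quote.itertext()).strip(),
        "lang": quote.get(XML_LANG),
        "source": "".join(bibl.itertext()).strip(),
    }


epigraph = read_epigraph(EPIGRAPH)
print(epigraph)
```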

An Example of Language Markup

<p>The word now shone in his brain, clearer and brighter 
than any ivory sawn from the mottled tusks of elephants. 
<hi rend="italic">Ivory, <seg xml:lang="fra">ivoire</seg>, 
<seg xml:lang="ita">avorio</seg>, 
<seg xml:lang="lat">ebur</seg>.</hi> One of the first 
examples that he had learnt in Latin had run: 
<seg xml:lang="lat"><hi rend="italic">India mittit ebur</hi></seg>;
<!-- excerpt ends here; the rest of the paragraph is omitted -->
</p>
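
With every foreign phrase wrapped in a `<seg>` carrying `xml:lang`, listing the non-English phrases by language takes only a few lines. The sketch below is illustrative rather than part of the project: the function name `phrases_by_language` is hypothetical, a closing `</p>` is added so the excerpt parses on its own, and the TEI namespace is again left out.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# ElementTree exposes xml:lang under the predefined XML namespace.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# The excerpt from above, with </p> added so it is well-formed by itself.
PASSAGE = """<p>The word now shone in his brain, clearer and brighter
than any ivory sawn from the mottled tusks of elephants.
<hi rend="italic">Ivory, <seg xml:lang="fra">ivoire</seg>,
<seg xml:lang="ita">avorio</seg>,
<seg xml:lang="lat">ebur</seg>.</hi> One of the first
examples that he had learnt in Latin had run:
<seg xml:lang="lat"><hi rend="italic">India mittit ebur</hi></seg>;</p>"""


def phrases_by_language(xml_text: str) -> dict:
    """Group the text of each language-tagged <seg> by its xml:lang code."""
    root = ET.fromstring(xml_text)
    found = defaultdict(list)
    for seg in root.iter("seg"):
        lang = seg.get(XML_LANG)
        if lang:
            found[lang].append("".join(seg.itertext()).strip())
    return dict(found)


phrases = phrases_by_language(PASSAGE)
for lang, items in sorted(phrases.items()):
    print(lang, items)
```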

Editorial Conventions

How to contribute:

See CONTRIBUTING.md.