Open JonathanReeve opened 5 years ago
I'm trying out text matching between Gilbert and Ulysses. I can identify lots of matches, but the difficulty is coming up with the right XML structure. Here's my best guess so far:
<note resp="#authorOfSecondary">
<!-- Primary text quoted from -->
<quote type="primary" source="xmlidOfBibl">
<!-- Location in primary text -->
<bibl><biblScope unit="page" from="5" to="6"></biblScope></bibl>
<!-- The quoted snippet here, as appears in the primary text -->
</quote>
<quote type="secondary" source="xmlidOfBibl">
<!-- Location in critical discussion -->
<bibl><biblScope unit="page" from="5" to="6"></biblScope></bibl>
<!-- The text here, as appears in the critical text -->
</quote>
<interp>
<!-- Snippet of critical commentary, before and after the match location. -->
</interp>
</note>
<bibl>
<!-- Bibliographic details of secondary text -->
</bibl>
@bnays started working on this issue via WorksHub.
A user started working on this issue via WorksHub.
@bnays, happy to see you're interested in working on this one! So over in the analogous area for Finnegans Wake, where most of the work is taking place, I've already made a TEI file with matches between a few hundred articles and the Wake. That could serve as a guide if you start in on something here.
One thing I've been itching to do is text-match some of the more famous books about Ulysses, like Stuart Gilbert's, and make the location data available in the TEI.
An adjacent task is to make this data displayable in the HTML output. That's secondary, of course, but maybe something like that could be done in Elm or a Haskell framework?
A user started working on this issue via WorksHub.
@sciemesfin started working on this issue via WorksHub.
Hi, @sciemesfin! Welcome to the project! Could you tell me what you had in mind for this issue?
And @bnays, are you still working on this issue?
A user started working on this issue via WorksHub.
@draconid719 started working on this issue via WorksHub.
Hi, @draconid719! Welcome to the project! What did you have in mind for this issue?
@sciemesfin and @bnays, are you still working on this?
@cjmont started working on this issue via WorksHub.
Since it's not terribly hard to use Zotero to download lots of PDFs concerning Ulysses from common databases (JSTOR, Project Muse), it'd be relatively easy to text-match between our Ulysses text and some of the more well-known critical articles.
We could even match on books by scripting something to query Google Books or Hathi Trust for every five-gram in the novel. The process would take a while, but it'd be worth it.
There are also the "popular passages" listed on the Google Books "About this Book" pages, which link to other works that cite those passages. The only issue here is that the number of passages is limited to about 12.
I'm looking into how best to do this in TEI. This page describes citation formats. It looks like Zotero can export to TEI, so that really simplifies the process.
My guess now is that the text could have something like this in it:
<note target="#gilbert1982">Gilbert calls this word "eminently Joycean."</note>
which would then refer to abibliography.xml
which would contain a TEI bibliography.Input and opinions on this front would be very welcome.