CoEDL / nyingarn-workspace

The Nyingarn data ingest and preparation application
GNU General Public License v3.0
0 stars 0 forks source link

FtP ingestation with people and places marked up with data #123

Closed sophlew closed 1 year ago

sophlew commented 1 year ago

The mark up of Place and Person in FromthePage Howitt and Fison links to important Nyingarn hw0024 information and/or web links. The information is lost (or not discoverable) when ingested into Nyingarn. Attached are screenshots: 1. examples of PLace and Person data in FromthePage 2. how the mark up in currently presented in the Nyignarn workspace

Advice on how to manage, please FtP person data FtP person data2 H F FtP

Conal-Tuohy commented 1 year ago

NB the rs element ("referring string") identifies a kind of name (either a proper name or at least some kind of noun phrase that identifies something). There are several similar elements in TEI, grouped into a class of elements called nameLike. The rs element is the most general purpose element in the set, in that it can refer to any kind of object, not just a person (as persName does) or a place (placeName). The Nyingarn schema already includes placeName and persName among others, but not (yet) rs.

The element is a kind of hyperlink; it has a ref attribute whose value is the URL of information elsewhere which provides more information about the entity referred to.

In the case of the Howitt and Fison documents, these URLs point to elements in the teiHeader which provide information about the people, places, etc, identified by the rs elements. e.g. in hw0024.xml there's this rs element:

<rs ref="#S94966">A. W. Howitt</rs>

... which is a reference to the element which has the xml:id attribute S94966:

            <person xml:id="S94966">
              <persName>A. W. Howitt</persName>
                <note type="categorization">
                  Categories:
                    <ab>
                      <ptr ana="#category #leaf #root" target="#C410">People</ptr>
                    </ab>
                </note>
                <note type="article">
                    <p xml:id="OTPS94966P0">http://adb.anu.edu.au/biography/howitt-alfred-william-510</p>
            </note>
            </person>

Potentially these rs elements could be upgraded at ingestion time into persName and placeName elements based on the type of the element they link to (e.g. an rs that links to a person would be converted to a persName).

But as to how to deal with the person, place, etc metadata, that's a different question.

sophlew commented 1 year ago

@Conal-Tuohy Please upgrade ingestion as you suggest so <rs> will show in Nyingarn as <persName> and <placeName>. Perhaps the metadata needs to be added into Nyingarn under the metadata tab @marcolarosa I'll put it on the meeting agenda for Monday

Conal-Tuohy commented 1 year ago

PR #125 deals with this.

NB there are still a few rs which don't refer to people or to places; they refer instead to an element which in general doesn't identify what kind of entity is being referred to (in practice it seems they are either language groups or national groups). I have left these as rs.

sophlew commented 1 year ago

Screen Shot 2022-12-02 at 8 50 34 am

@Conal-Tuohy There a still some references to <ref> followed by numbers after Person name or Place name in the new H&F ingests, see screenshot. Is this correct?

sophlew commented 1 year ago

The ref=numbers refers to a person in the TEI metadata using a unique identifier. @nthieberger should we keep this? I think the user would need to look up the ID number on the H&F website example

Conal-Tuohy commented 1 year ago

I thought it was worth keeping those links, even if they're not currently usable.

In the fairly short term, there'll be the possibility for a user to download the full TEI from Nyingarn, and then be able to follow those cross-references using some other software.

At present there's no facility in Nyingarn's workspace to dereference those ref attributes and retrieve the person or place metadata, and it's not on the TODO list as such, but it's possible that at some point Nyingarn's HTML preview could be extended e.g. to replicate the functionality of the H&F website by displaying popup notes when you hover your mouse over those names.