Closed MikeJ1971 closed 4 years ago
With regard to compatibility with the infrastructure being developed at Trinity, I've asked the following questions:
Am I correct in thinking that the ‘compatibility’ between the TEI files and both the knowledge base and the database is just the case of ensuring that the same taxonomy terms are used?
Even if the TEI XML files are not stored in the database itself as BLOBs (or whatever) there will be a row in the database that will contain metadata about the file, including keywords? What metadata is needed? Can I/we write an XQuery script to extract the data from the XML file that is needed in the database?
So, we should be using FAST LoC headings (https://fast.oclc.org/searchfast/) here as well?
That would be my intention, yes.
@munnellg can you clarify for Mike what type of link we should encode. It looks to me like the LoC headings can be output in a number of different formats. Would we want a WorldCat permalink such as this: http://id.worldcat.org/fast/946068/
Or Would we want the RDF view: http://id.worldcat.org/fast/946068.rdf.xml
Or does it make any difference?
Declan O'Sullivan's advice was definitely to include the link, but I'm not sure what format would be preferable to future proof this approach to topical keywords.
Bear in mind that the keywords may prove useful for the IR.
I think the link for the RDF view would be preferable, especially if we want to be able to process whatever is on the other end of the link.
that makes sense to me. it's precisely the question about being able to process -- and what and how we want to process - - that I'm getting at.
On Fri, 4 Sep 2020 at 10:32, Gary Munnelly notifications@github.com wrote:
I think the link for the RDF view would be preferable, especially if we want to be able to process whatever is on the other end of the link.
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/ilrt/Beyond2022TeiTemplate/issues/14#issuecomment-687037082, or unsubscribe https://github.com/notifications/unsubscribe-auth/AM2V4XNYFAIPOFNTY4ATMHTSECX2LANCNFSM4PXSLOPA .
Updated to use https://fast.oclc.org/searchfast/ with the @ref pointing to the RDF version.
<!-- searchFast: https://fast.oclc.org/searchfast/ -->
<textClass>
<keywords scheme="https://fast.oclc.org/">
<term ref="http://id.worldcat.org/fast/1205427.rdf.xml">Ireland</term>
<term ref="http://id.worldcat.org/fast/1020301.rdf.xml">Middle Ages</term>
<term ref="http://id.worldcat.org/fast/924349.rdf.xml">Finance</term>
<term ref="http://id.worldcat.org/fast/924841.rdf.xml">Fines and
Recoveries</term>
</keywords>
</textClass>
</profileDesc>
I understand that Peter would like to use the Library of Congress subject heading: https://id.loc.gov/authorities/subjects.html
I've used the textClass, keywords and term elements:
See https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-textClass.html See https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-keywords.html See https://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-term.html