Conal-Tuohy / ARDC-CDL-HASS-Use-Cases

Researcher use cases for Trove etc.
Apache License 2.0

"The Inky Way" #1

Open Conal-Tuohy opened 1 year ago

Conal-Tuohy commented 1 year ago

Roger Osbourne from JCU has a dream to research the history of Australian journalism contained within a regular column of the Bulletin called "The Inky Way".

Can the content of this column be extracted automatically?

Can the content be parsed into a structured form such as a knowledge graph using some NLP technique?

The column I mentioned that has decades of information about the movement of journalists is the “Inky Way” in the Sydney Bulletin. It contains reports of the movement of journalists and editors across Australia, and so contains information about people, places and organisations/institutions over time. That information could be mapped and rendered in a variety of ways to say something about the profession of journalism in the early twentieth century, which, for me, has strong intersections with literary production in regional, metropolitan and national contexts. For example, here is a link to give you an idea of the content in each weekly column:

https://nla.gov.au/nla.obj-672504697/view?sectionId=nla.obj-686328556&partId=nla.obj-672547246#page/n35/mode/1up

The column is known as “The Inky Way” from 1901 until 1930, but references to the “inky way” continue in the “Personal Items” page until the 1950s, for example from the 1930s here:

https://nla.gov.au/nla.obj-558325842/view?sectionId=nla.obj-568739180&partId=nla.obj-558394469

Finding some way to extract this column and then analyse/compute/output a result in a variety of formats would be ideal. I like the idea of RDF graphs and the possibilities of visualising over time.
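As a rough illustration of what an RDF output might look like, here is a minimal sketch that serialises one "movement" record as N-Triples using only the Python standard library. Everything in it is an assumption made for illustration: the `http://example.org/inkyway/` namespace, the property names (`person`, `organisation`, `place`, `date`), and the idea of modelling each report as an event node. A real project would probably reuse an existing vocabulary and an RDF library.

```python
from urllib.parse import quote

# Hypothetical namespace; not an established vocabulary.
BASE = "http://example.org/inkyway/"


def iri(kind, label):
    """Build an IRI term such as <.../person/J._Smith> from a plain label."""
    return f"<{BASE}{kind}/{quote(label.replace(' ', '_'))}>"


def literal(value):
    """Build a plain N-Triples string literal, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def movement_to_ntriples(event_id, person, organisation, place, date):
    """Describe one reported movement as four triples hanging off an event node."""
    event = iri("event", event_id)

    def prop(name):
        # Illustrative property IRIs only.
        return f"<{BASE}vocab/{name}>"

    return [
        f"{event} {prop('person')} {iri('person', person)} .",
        f"{event} {prop('organisation')} {iri('organisation', organisation)} .",
        f"{event} {prop('place')} {iri('place', place)} .",
        f"{event} {prop('date')} {literal(date)} .",
    ]
```

Keeping the date on the event node, rather than on the person or organisation, is one way to leave the door open for the visualisations over time mentioned above.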

Conal-Tuohy commented 1 year ago

Would it be possible to craft a query that would reliably return the full set of relevant articles? Or would it be better to create a Trove List or similar data structure which could be generated automatically but still be manually editable to deal with special cases?

Note that the "Inky Way" column is not necessarily zoned as a single article in Trove but can appear as part of a larger article, following the "The Inky Way" heading.
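One way to handle the zoning problem might be to look for the heading inside each article's OCR text and keep only what follows it. The sketch below assumes the article text is already available as a plain string and that the heading sits on its own line; the regular expression and the optional `next_heading` pattern are guesses about the OCR layout, not something checked against Trove data.

```python
import re

# Assumed heading pattern: the heading on its own line, in any letter case,
# with optional quotation marks and an optional trailing full stop.
HEADING = re.compile(
    r'^\s*["“]?\s*the\s+inky\s+way\s*\.?\s*["”]?\s*$',
    re.IGNORECASE | re.MULTILINE,
)


def extract_column(article_text, next_heading=None):
    """Return the text following the "The Inky Way" heading, or None if absent.

    next_heading is an optional compiled pattern marking where the column
    ends; without it, everything up to the end of the article is returned.
    """
    match = HEADING.search(article_text)
    if match is None:
        return None
    body = article_text[match.end():]
    if next_heading is not None:
        end = next_heading.search(body)
        if end is not None:
            body = body[:end.start()]
    return body.strip()
```

Articles where `extract_column` returns `None`, or returns something suspiciously short, could be the "special cases" set aside for manual editing.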

Is this a case for using "To Be Continued"?