Community repository for documenting stories and use cases related to uses of the International Image Interoperability Framework.
21
stars
0
forks
source link
As a user I want to be able to browse through the semantic structure of a Newspaper resource or I want to be able to query a specific component of a Newspaper. #46
The degree of digitisation and OCR performed on a data provider side will determine the granularity of representation that will need to be retained. Digitised Newspapers can have an image representation for the title, issue, page, and article level, a non OCR text representation at the level of the issue (e.g. a PDF file); while the full-text could be at page level, article, lines or even words level.
The structure of the digitised Newspapers should be identifiable from the metadata describing it.
There are two options to address this requirement in the Europeana context:
either representing the different levels as a concept using a controlled vocabulary such as the MARC genre list, the ontology BIBO or RDA.
or representing the different levels as resources re-using classes from existing standards (using rdf:type).
The IIIF community can help defining a series of resources types. One key aspects as the discussion will be to decide whether these types are aligned with the current resources described by IIIF (Manifest, Sequence, Canvas…) or closer to the semantic structure of Newspapers (title, issue, pages…).
The degree of digitisation and OCR performed on a data provider side will determine the granularity of representation that will need to be retained. Digitised Newspapers can have an image representation for the title, issue, page, and article level, a non OCR text representation at the level of the issue (e.g. a PDF file); while the full-text could be at page level, article, lines or even words level. The structure of the digitised Newspapers should be identifiable from the metadata describing it.
There are two options to address this requirement in the Europeana context: