Open lobajuluwa opened 12 years ago
@chris-sleep i am starting with this upload folder
@chris-sleep @melitabirthaelmer we need some metadata, otherwise we can't make anything from these uploads.
Metadata needs to be harvested over OAI-PMH
The folder names are numbers like 27576. These numbers are the only ID-like information we currently have, so they are all we can use to harvest metadata over OAI-PMH. Can you provide me with a link to the UBER OAI-PMH service, so I can check whether these numbers are meaningful?
Thank you for your email. The folder numbers (= ${FOLDER_NAME}) do indeed match the catalog ID, which is also used as part of the OAI-PMH item identifier, so we can get the metadata records from (real URI obfuscated by xxx):
https://edoc.xxxxx.de/OAI-2.0?verb=GetRecord&identifier=oai:xxxx.de:${FOLDER_NAME}&metadataPrefix=oai_dc
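For illustration, the URL pattern above could be assembled per folder roughly as follows. This is only a sketch, not the contents of the actual script: the function name `oai_record_url`, the variables `BASE_URL` and `ID_PREFIX`, and the output file name in the commented-out `curl` call are assumptions, and the host stays obfuscated exactly as in the URL above.

```shell
#!/bin/sh
# Placeholders copied from the obfuscated URL in this thread; the real
# host and identifier prefix are not known here.
BASE_URL="https://edoc.xxxxx.de/OAI-2.0"
ID_PREFIX="oai:xxxx.de"

# Hypothetical helper: print the GetRecord URL for one folder.
# $1 = folder name (the catalog ID), e.g. 27576
oai_record_url() {
    printf '%s?verb=GetRecord&identifier=%s:%s&metadataPrefix=oai_dc\n' \
        "$BASE_URL" "$ID_PREFIX" "$1"
}

# Print the URL for one folder.
oai_record_url 27576

# Fetching would then look something like this (left commented out because
# the placeholder host does not resolve; the output file name is made up):
# curl -fsS "$(oai_record_url 27576)" -o 27576.oai_dc.xml
```

With the real host and prefix filled in, the same function can be called once per folder name.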
I created a script which fetches metadata from the OAI-PMH service:
/mnt/nfs-demeter/upload/providers/de-uber/fetch-metadata.sh
Running this script downloads, for each of the folders, an oai_dc (Dublin Core) file and a txt file which contains additional information such as structural information, page types, chapters, page numbers, etc.
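As a rough sketch of the "for each of the folders" step, one way to pick out only the numerically named upload folders is shown below. The function name `list_numeric_folders` is hypothetical, and the assumption that every relevant folder name consists purely of digits is taken from the example 27576 above; the real fetch-metadata.sh may select folders differently.

```shell
#!/bin/sh
# Hypothetical helper: list sub folders whose names consist only of digits.
# $1 = directory that contains the upload folders
list_numeric_folders() {
    for dir in "$1"/*/; do
        # Skip the unexpanded pattern when there are no sub folders at all.
        [ -d "$dir" ] || continue
        name=$(basename "$dir")
        case "$name" in
            # Ignore empty names and anything containing a non-digit
            # (for example the "work" sub folder).
            ''|*[!0-9]*) continue ;;
        esac
        printf '%s\n' "$name"
    done
}

# Usage sketch: loop over the numeric folders in the current directory.
# list_numeric_folders . | while read -r folder; do
#     echo "would fetch metadata for $folder"
# done
```

Filtering on digits keeps helper folders out of the harvest loop without maintaining an explicit exclude list.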
We seem to have sufficient metadata for 57 of the 59 monographs to create a proper folder structure. A script to create the structure is now available. The metadata files are in the work sub folder and have the file extension *.txt
@chris-sleep all folders are harmonized, except one folder which for some reason is not properly processed by the create-structure.sh script. Here is the corresponding snippet from the log file, which shows that the create-structure.awk program does not print any output, just "DONE":
processing 27594 ...
processing as Monograph
using structure from txt metadata file ./work/christ-farbtafeln-Mn016C962.txt
running create-structure.awk ...
DONE
Despite this one problem we should start the test ingest on UBER.
@akohlbecker thanks - I'll set up for a test ingest shortly
Task description: Align (UBER) upload data/structure with ingest tool needs
subtask: Ingest (UBER) data
59 folders with historic expedition reports (journals?) in
Actions to take:
fetch-metadata.sh
Summary: