Open myrmoteras opened 5 years ago
Meeting notes:
Get the data dictionary done by end of July. Get version 1.0 out 1.1. GGXML Puneet to provide feedback to what Guido has made 1.2. Guido to make changes in TB to use the agreed tags, and export XML
Agreed 3 formats of treatments. A Primary digital object: XHTML, version of record B Description: HTML with minimal formatting (
, may be italics, and some other elements for better viewing C Upload files: DWCA, TaxPub (should eventually become the default mime type; to be defined by TC), simplified GG Version ("puneet" version)
3, The treatment deposit is type"section". In the midterm we need to convince DataCite to create a type"taxonTreatment"
Include treatmentCitation in the upload, linking to existing treatment can be done later
Upload of 15-30K treatments in the sandbox, starting July 15 (GS). The documentation is here
DA, MG to run searches on the corpus and report back.
Next skype July 22/23. DA to organize
The issue of data workflow has been raised by Lars https://github.com/plazi/arcadia-project/issues/61 and needs our attention
1.1. Terry, Guido, Donat, Marcus develop an GGXML version that has minimal structural elements,focus on semantics and uses for this the tags that map to the agreed terms in the data dictionary 1.2. Guido to make changes in TB to use the agreed tags, and export XML
run treatment extraction from XMLs to test the data dictionary
final tweaks to the data dictionary and finalize v1.0
3.1 Guido to make changes in TB to use the agreed tags, and export XML
run treatment extraction from XMLs
set up auto-update process for processing treatments from new XMLs
load ~15K-30K treatments into Zenodo sandbox
test sandbox API