galterlibrary / digital-repository

DigitalHub - Institutional Repository for Galter Health Sciences
https://digitalhub.northwestern.edu/
5 stars 1 forks source link

Export Invenio Metadata Field #787

Closed Meowcenary closed 3 years ago

Meowcenary commented 3 years ago

Export the top level field metadata from the Sufia format to the Invenio format. Use the following as test data to verify the end result:

        "metadata": {
            "resource_type": {
                "type": "publication",
                "subtype": "publication-article"
            },
            "creators": [{
                "name": "Nielsen, Lars Holm",
                "type": "personal",
                "given_name": "Lars Holm",
                "family_name": "Nielsen",
                "identifiers": {
                    "orcid": "0000-0001-8135-3489"
                },
                "affiliations": [{
                    "name": "CERN",
                    "identifiers": {
                        "ror": "01ggx4157",
                        "isni": "000000012156142X"
                    }
                }]
            }],
            "title": "InvenioRDM",
            "additional_titles": [{
                "title": "a research data management platform",
                "type": "subtitle",
                "lang": "eng"
            }],
            "publisher": "InvenioRDM",
            "publication_date": "2018/2020-09",
            "subjects": [{
                "subject": "test",
                "identifier": "test",
                "scheme": "dewey"
            }],
            "contributors": [{
                "name": "Nielsen, Lars Holm",
                "type": "personal",
                "role": "other",
                "given_name": "Lars Holm",
                "family_name": "Nielsen",
                "identifiers": {
                    "orcid": "0000-0001-8135-3489"
                },
                "affiliations": [{
                    "name": "CERN",
                    "identifiers": {
                        "ror": "01ggx4157",
                        "isni": "000000012156142X"
                    }
                }]
            }],
            "dates": [{
                "date": "1939/1945",
                "type": "other",
                "description": "A date"
            }],
            "languages": ["da", "en"],
            "identifiers": [{
                "identifier": "1924MNRAS..84..308E",
                "scheme": "bibcode"
            }],
            "related_identifiers": [{
                "identifier": "10.1234/foo.bar",
                "scheme": "doi",
                "relation": "cites",
                "resource_type": {"type": "dataset"}
            }],
            "sizes": [
                "11 pages"
            ],
            "formats": [
                "application/pdf"
            ],
            "version": "v1.0",
            "rights": [{
                "rights": "Creative Commons Attribution 4.0 International",
                "scheme": "spdx",
                "identifier": "cc-by-4.0",
                "url": "https://creativecommons.org/licenses/by/4.0/"
            }],
            "description": "Test",
            "additional_descriptions": [{
                "description": "Bla bla bla",
                "type": "methods",
                "lang": "eng"
            }],
            "locations": [{
                "point": {
                    "lat": 1,
                    "lon": 2
                },
                "place": "home",
                "description": "test"
            }],
            "funding": [{
                "funder": {
                    "name": "European Commission",
                    "identifier": "1234",
                    "scheme": "ror"
                },
                "award": {
                    "title": "OpenAIRE",
                    "number": "246686",
                    "identifier": ".../246686",
                    "scheme": "openaire"
                }
            }],
            "references": [{
                "reference": "Nielsen et al,..",
                "identifier": "101.234",
                "scheme": "doi"
            }]
        }
fenekku commented 3 years ago

When doing the export of the resource_types subfield. Here are the reference documents to use:

Digitalhub 2 resource types: https://github.com/galterlibrary/digitalhub-2/blob/master/app_data/vocabularies/resource_types.csv

[UPDATED WITH CORRECT FILE] Mapping between Digitalhub resource types and Digitalhub 2 resource types: https://northwestern.app.box.com/file/694465344611?s=c5ktvsr5qoj4hxluciqn3xqwkhuc2dk2

Meowcenary commented 3 years ago

Just realized this got moved to the wrong column so shifting it back.

Meowcenary commented 3 years ago

It's been decided that we will break out each individual field of metadata into it's own ticket for the sake of organization. I'm working on entering those tickets now.

Meowcenary commented 3 years ago

This is basically our "Epic" now so I'm assigning it to Austin as well. I'll leave it in as a point of reference.

fenekku commented 3 years ago

This is out-of-date and no point in updating it. Closing.