The programs bundled in this repository intend to solve the problem of automatically retrieving metadata records for a given study submitted to NMDC through the NMDC Submission Portal, and converting the metadata into Excel spreadsheets that are accepted by DOE user facilities.
There are two components to keep in mind when trying to use this application.
JSON header configuration file
etl.py The command line application that can facilitate the conversion of metadata from the Submission Portal into user facility formats by consuming the above two files as inputs.
poetry install
etl.py
with options as follows:➜ metadata-template-transformation git:(main) ✗ poetry run python etl.py --help
Usage: etl.py [OPTIONS]
Options:
-o, --output TEXT Path to result output XLSX file. [required]
-m, --mapper PATH Path to user facility specific JSON file.
[required]
-h, --header / --no-header [default: no-header]
-u, --user-facility TEXT User facility to send data to. [required]
-s, --submission TEXT Metadata submission id. [required]
--help Show this message and exit.