LAAC-LSCP / zoo-babble-validation

Apache License 2.0
0 stars 0 forks source link

Upgraded metadata format #2

Closed lucasgautheron closed 3 years ago

lucasgautheron commented 3 years ago

child-project generates metadata with more information such as zooniverse_id which identifies each subject (i.e. each chunk)

to make analyses consistent with this new format, we provide a script that converts older metadata to that new format.

Instructions:

usage: python data_analyses/code/recover_zooniverse_metadata.py [-h] [--subjects SUBJECTS]
                                      [--links LINKS]
                                      source destination

positional arguments:
  source               path to the source metadata to convert
  destination          destination path

optional arguments:
  -h, --help           show this help message and exit
  --subjects SUBJECTS  path to the Zooniverse .csv subjects export
  --links LINKS        path to the dataframe linking ChildID to FileName

Example:

python data_analyses/code/recover_zooniverse_metadata.py data_analyses/metadata/Metadata_batch1b_20200809.csv test.csv