zooniverse / kade

Knowledge and Discovery Engine (KaDE)
Apache License 2.0

create a GZ Cosmic Dawn schema #55

Status: Closed (closed by camallen 1 year ago)

camallen commented 1 year ago

The data schema for the Cosmic Dawn workflow (production id 21802) differs from the DECaLS 8 schema. It needs a new label_extractors class to handle importing data from Caesar and exporting the catalogues used to re-train Zoobot.

Refactor the LabelExtractors::GalaxyZoo code in https://github.com/zooniverse/kade/blob/f4f32bf88ea0d5eeb50f83f371279369da0e8259/app/modules/label_extractors/galaxy_zoo.rb#L19-L86 to include the new GZ CD workflow aggregated labels:

'id_str'
'smooth-or-featured_smooth'
'smooth-or-featured_featured-or-disk'
'smooth-or-featured_problem'
'how-rounded_round'
'how-rounded_in-between'
'how-rounded_cigar-shaped'
'disk-edge-on_yes'
'disk-edge-on_no'
'edge-on-bulge_rounded'
'edge-on-bulge_boxy'
'edge-on-bulge_none'
'bar_strong'
'bar_weak'
'bar_no'
'has-spiral-arms_yes'
'has-spiral-arms_no'
'spiral-winding_tight'
'spiral-winding_medium'
'spiral-winding_loose'
'spiral-arm-count_1'
'spiral-arm-count_2'
'spiral-arm-count_3'
'spiral-arm-count_4'
'spiral-arm-count_more-than-4'
'spiral-arm-count_cant-tell'
'bulge-size_none'
'bulge-size_small'
'bulge-size_moderate'
'bulge-size_large'
'bulge-size_dominant'
'merging_merger'
'merging_major-disturbance'
'merging_minor-disturbance'
'merging_none'
'lensing_yes'
'lensing_no'
'clumps_yes'
'clumps_no'
'problem_star'
'problem_non-star'
'problem_zoom'
'artifact_non-star'
'artifact_other'
'artifact_satellite'
'artifact_scattered'
'artifact_diffraction'
'artifact_ray'
'artifact_saturation'
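
Each label column above follows a `<question>_<answer>` pattern, with hyphens inside each part and a single underscore between them. As a rough illustration of how a new extractor could organise these columns, here is a minimal sketch that groups a subset of them by question. The constant and method names (`COSMIC_DAWN_LABEL_COLUMNS_SUBSET`, `group_labels_by_question`) are hypothetical and are not taken from the kade codebase; the real `LabelExtractors` classes may be structured quite differently.

```ruby
# Hypothetical sketch only: names here are illustrative, not the kade API.
# A subset of the column names listed in this issue.
COSMIC_DAWN_LABEL_COLUMNS_SUBSET = %w[
  id_str
  smooth-or-featured_smooth
  smooth-or-featured_featured-or-disk
  smooth-or-featured_problem
  bar_strong
  bar_weak
  bar_no
  spiral-arm-count_1
  spiral-arm-count_2
  spiral-arm-count_more-than-4
  spiral-arm-count_cant-tell
].freeze

# Group label columns by their question prefix.
# 'id_str' is the subject identifier rather than a label, so it is skipped.
def group_labels_by_question(columns)
  label_columns = columns.reject { |column| column == 'id_str' }
  label_columns.each_with_object({}) do |column, grouped|
    # Split on the first underscore only: question and answer parts use hyphens.
    question, answer = column.split('_', 2)
    (grouped[question] ||= []) << answer
  end
end

grouped = group_labels_by_question(COSMIC_DAWN_LABEL_COLUMNS_SUBSET)
grouped['bar'] # => ["strong", "weak", "no"]
```

A lookup shaped like this could be used to check incoming aggregated labels against the expected answers for each question, assuming the imported data uses the same column naming.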

camallen commented 1 year ago

Closed by commit 7b17447ed232c4f13570b84fde31ae27743f47ab.