This repository contains all the code used to obtain and process the HSC DR1 data to produce tomographic measurements of the galaxy clustering power spectrum.
The folder data_query
contains all the code needed to download the PDR1 data from the HSC database. Follow the instructions in the README there to do so. The main raw products needed are:
The data volume is quite large (dominated by the photo-z pdfs), and downloading all of it may take some time.
The analysis pipeline that processes the raw data to produce power spectrum measurements is provided as a python module in the directory hsc_lss
. This module is made up of a series of pipeline stages that inherit from the PipelineStage
class of the ceci
software. To run the pipeline you therefore first need to install ceci.
The analysis pipeline runs on each of the 6 HSC WIDE fields (GAMA09H, GAMA15H, HECTOMAP, VVDS, WIDE12H, XMMLSS) individually, and therefore it needs to be run on each of them for each set of configuration parameters. This is done automatically by the script run_all.py
, which also runs the different versions of the pipeline used in our analysis.
The analysis pipeline consists of 6 stages:
The param and configuration files for the different HSC fields are stored in hsc_lss_params
. All fields use the same common set of configuration parameters, but different paths must be provided to their corresponding raw data files and output directories. See in_aegis.yml and config.yml to see the different parameters and options.
The different scripts, notebooks and previous versions of the pipeline that have contributed towards the final pipeline are stored in the directory legacy_code
.