hzovaro / spaxelsleuth

A package for analysing data from large integral field unit surveys such as the SAMI and Hector Galaxy Surveys.
MIT License

Hector integration #36

Closed hzovaro closed 4 months ago

hzovaro commented 8 months ago

Get spaxelsleuth working with data from the Hector Galaxy Survey from Gabby's spectral fitting pipeline.

BIG TO DOs

little to dos

Maybes

Getting stuff working on misfit:

Things to suggest/raise with Gabby

Questions for Hector team members

hzovaro commented 8 months ago

Stuff to do/keep in mind:

sarahsweet commented 7 months ago

24 Busy week feedback:

  1. Provide feedback on the FITS format

a. Is it intuitive and easy to use?
    • Yes, even though there are many extensions. I think this is helpful; it just means that the documentation needs to be super clear.
    • I suggest listing the extensions in the same order as on the column descriptions wiki page. Or perhaps even having a separate wiki page just for Hector, since there are many more column descriptions than extensions in the Hector FITS files.
    • Maybe avoid spaces and non-alphanumeric characters in the extension names? Not sure.
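As a rough illustration of the last suggestion, here is a minimal sketch of how extension names could be normalised so that they contain only letters, digits and underscores. The function name and the example input are hypothetical and are not taken from the Hector pipeline; upper-casing the result is just one possible choice.

```python
import re


def sanitise_extname(name: str) -> str:
    """Return a name containing only A-Z, 0-9 and underscores.

    Hypothetical helper: runs of any other characters (spaces, brackets,
    slashes, ...) are collapsed into a single underscore.
    """
    cleaned = re.sub(r"[^0-9A-Za-z]+", "_", name.strip())
    # Drop underscores left over at either end, then upper-case.
    return cleaned.strip("_").upper()


# Made-up extension name, for illustration only.
example = sanitise_extname("H alpha (flux)")
print(example)
```

Names in this form can be used directly as dictionary keys or table column names without quoting or escaping.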

b. Are there any missing header keywords that would be useful?

c. Are there any typos/mistakes/etc. in the FITS headers?

d. Should data be split up into multiple FITS files grouping e.g. emission line measurements, stellar kinematics, etc.? Or all-in-one?

e. Should the data cubes also be stored in the same FITS file?

  2. Inspect the data products

a. Do some basic science (e.g. recreating the mass-metallicity relation): do the values look reasonable?
    • I plotted the continuum from some random frames and found spatial offsets between the red and blue. This might be just a feature of CVD but would be worth checking the input data.
    • I made some plots, e.g. the MZR, and found sensible values. A few had missing or spurious data:
        • extensions 3-7 (v_gas etc.): always missing data
        • ext 8: the missing components flag is always (or nearly always?) 0 in the nocuts sample
        • 9-12: always missing data
        • the line flux errors are sometimes zero; is this sensible? Should they be NaN instead?
        • 41, 46, 49, 50: S/N on derived lines is often missing, perhaps hinting at a problem with error propagation? E.g. 42, 47, ..., which they are derived from, usually have defined S/N.
        • 60-68: flags always 0 in nocuts
        • 69-79, 81-82: always 0 in nocuts and cuts
        • 83-96: missing flux flags always 0
        • 136, 137: missing v*, sigma* flags always 0
        • 138, 139: D4000 error > D4000
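To make the point about zero flux errors concrete, here is a minimal sketch of what treating them as undefined could look like when computing S/N. It assumes the fluxes and errors are plain NumPy arrays; the function name and the sample values are made up for illustration, and whether a zero error should really be read as "no measurement" is a question for the pipeline authors.

```python
import numpy as np


def signal_to_noise(flux, flux_err):
    """Return flux / error, treating non-positive errors as undefined (NaN).

    Assumption: an error of exactly zero (or below) is not a real
    measurement, so the resulting S/N is NaN rather than infinite.
    """
    flux = np.asarray(flux, dtype=float)
    err = np.asarray(flux_err, dtype=float).copy()  # leave the input untouched
    err[err <= 0] = np.nan
    return flux / err


# Made-up values: one good spaxel, one with zero error, one with NaN error.
flux = np.array([10.0, 5.0, 2.0])
flux_err = np.array([2.0, 0.0, np.nan])
sn = signal_to_noise(flux, flux_err)
print(sn)
```

With this convention a spaxel with zero error is excluded from any S/N cut instead of passing it with an infinite S/N.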

b. Are there any additional data products people would like for their science? E.g. extra metallicity measurements, stellar indices, etc.

  3. Other questions

a. Should measurements come pre-masked if they fail S/N and/or DQ criteria? E.g. should line fluxes be NaN’d if the S/N < 3? Or should this be up to the user?
    • I strongly suggest this should be up to the user, since they may want to choose a different S/N limit, or use S/N cut in a different line or continuum or binning scheme. They can create their own masks based on flux and noise.
    • Having both cuts and nocuts available is probably a good idea, since the S/N_Ha >= 3 cut will be useful for many as you expect!
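For reference, a user-side mask of the kind described above could be as short as the sketch below. It assumes the unmasked ("nocuts") fluxes and errors have already been loaded as NumPy arrays; the function name, the default threshold and the sample values are hypothetical and are not part of spaxelsleuth or the Hector data products.

```python
import numpy as np


def mask_low_sn(values, flux, flux_err, sn_min=3.0):
    """NaN out `values` wherever flux / flux_err is below `sn_min`.

    Entries with undefined or infinite S/N (e.g. zero or NaN error)
    are masked as well.
    """
    values = np.asarray(values, dtype=float)
    with np.errstate(divide="ignore", invalid="ignore"):
        sn = np.asarray(flux, dtype=float) / np.asarray(flux_err, dtype=float)
        good = np.isfinite(sn) & (sn >= sn_min)
    return np.where(good, values, np.nan)


# Made-up H-alpha fluxes and errors for three spaxels.
ha_flux = np.array([30.0, 4.0, 12.0])
ha_err = np.array([5.0, 2.0, 0.0])
masked = mask_low_sn(ha_flux, ha_flux, ha_err, sn_min=3.0)
print(masked)
```

The same function can mask any other quantity (a metallicity map, say) on the S/N of a chosen line by passing that quantity as `values`, which is the flexibility that shipping unmasked data preserves.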

b. Should data products from all pipelines be stored in a single FITS file?

hzovaro commented 4 months ago

Issue moved to Hector-Galaxy-Survey fork.