PolymathicAI / AstroCLIP

Multimodal contrastive pretraining for astronomical data
MIT License

Morphology classification dataset #21

Open Eason-nuosen opened 1 month ago

Eason-nuosen commented 1 month ago

I'm trying to run your downstream tasks, but the Legacy Survey image files ('images_npix152_0%02d000000_0%02d000000.h5') are too large for me. Could you share the file produced after cross-matching? Thanks very much!

lsarra commented 1 month ago

Hi!

Do you mean this dataset (56 GB)? https://github.com/PolymathicAI/AstroCLIP/blob/e6c23e0b54b19595e28fd9155e6fdf044c5f5060/astroclip/data/dataset.py#L35

Or are you looking only for the embeddings produced by AstroCLIP (i.e. OBJECT_ID, EMBEDDING)? Those are here

Eason-nuosen commented 1 month ago

Thanks! But what I mean is the downstream task (morphology classification): https://github.com/PolymathicAI/AstroCLIP/tree/main/downstream_tasks/morphology_classification In cross_match.py, you mention that you used the full DESI-LS survey, which is too large for me. I would really appreciate it if you could share the result file produced by cross_match.py.