askap-vast / vast-pipeline

This repository holds the code of the Radio Transient detection pipeline for the VAST project.
https://vast-survey.org/vast-pipeline/
MIT License

High memory usage during sky region ideal coverage #554

Closed marxide closed 3 years ago

marxide commented 3 years ago

When attempting to perform a full-scale run, I encountered OOM errors during the sky region ideal coverage stage in vast_pipeline.pipeline.utils.get_src_skyregion_merged_df.

The current implementation creates a Cartesian product of all sources and sky regions, then filters the result by a separation threshold. For a full-scale VAST-P1 run with almost 1 million sources and almost 1,000 sky regions, that is on the order of 1 billion source/sky-region pairs that must be held in memory before any filtering happens, which is what triggers the OOM errors.
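One possible way to bound the memory use is to process the sources in chunks, so that only a chunk-sized slice of the Cartesian product exists at any time and only the pairs that pass the threshold are kept. The following is a minimal NumPy sketch of that idea, not the pipeline's code and not necessarily the fix that was adopted. The function names, the argument names, the single `max_sep_deg` threshold, and the `chunk_size` default are all assumptions made for illustration.

```python
import numpy as np


def angular_sep_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula); inputs in degrees."""
    ra1, dec1, ra2, dec2 = map(np.radians, (ra1, dec1, ra2, dec2))
    a = (
        np.sin((dec2 - dec1) / 2) ** 2
        + np.cos(dec1) * np.cos(dec2) * np.sin((ra2 - ra1) / 2) ** 2
    )
    # Clip guards against rounding pushing `a` slightly outside [0, 1].
    return np.degrees(2 * np.arcsin(np.sqrt(np.clip(a, 0.0, 1.0))))


def match_sources_to_regions(src_ra, src_dec, reg_ra, reg_dec,
                             max_sep_deg, chunk_size=10_000):
    """Return (source_index, region_index) arrays for all pairs within max_sep_deg.

    Hypothetical helper: only a (chunk_size x n_regions) separation matrix
    is materialised at a time, instead of the full sources x regions product.
    """
    src_ra = np.asarray(src_ra, dtype=float)
    src_dec = np.asarray(src_dec, dtype=float)
    reg_ra = np.asarray(reg_ra, dtype=float)
    reg_dec = np.asarray(reg_dec, dtype=float)

    # Seed with empty arrays so concatenate also works for zero sources.
    src_parts = [np.empty(0, dtype=np.intp)]
    reg_parts = [np.empty(0, dtype=np.intp)]

    for start in range(0, len(src_ra), chunk_size):
        stop = start + chunk_size
        # Broadcasting: (chunk, 1) against (1, n_regions) -> (chunk, n_regions).
        sep = angular_sep_deg(
            src_ra[start:stop, None], src_dec[start:stop, None],
            reg_ra[None, :], reg_dec[None, :],
        )
        s, r = np.nonzero(sep <= max_sep_deg)
        src_parts.append(s + start)  # shift back to global source indices
        reg_parts.append(r)

    return np.concatenate(src_parts), np.concatenate(reg_parts)


# Toy example: three sources, two sky-region centres, 1 degree threshold.
src_idx, reg_idx = match_sources_to_regions(
    src_ra=[10.0, 10.5, 50.0], src_dec=[-30.0, -30.0, 20.0],
    reg_ra=[10.0, 50.0], reg_dec=[-30.0, 20.0],
    max_sep_deg=1.0, chunk_size=2,
)
print(list(zip(src_idx.tolist(), reg_idx.tolist())))
```

With 1,000 sky regions and a chunk size of 10,000, each intermediate float64 array holds 10 million values (about 80 MB), regardless of the total number of sources. A spatial index, such as the k-d tree matching provided by astropy's `SkyCoord.search_around_sky`, would be another way to avoid building the full product.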