Closed DebajyotiS closed 6 months ago
Most of the catalogs should have direct download links to the original sources on the about page. For GAIA DR3, you can find full download information here: https://sdsc-users.flatironinstitute.org/~gaia/dr3/README.txt This will be much more efficient than pulling all the data out of the database.
However, the memory issue is just a limitation of the python client, which downloads into a numpy array in memory. You can download directly to a file through the API if you really want to, but it will be very slow.
Hello all, I am trying to download the GDR3 release onto my cluster using flathub, however I keep running into memory issues. Here's my py code snippet.
This runs into
Is there a way to do this in chunks while making sure I get all of the 1.8+ billion sources?