Closed jchiang87 closed 6 years ago
The schema for the Project table is at the pserv repo.
The imsim undithered coadd catalogs have been ingested into the DESC_DC1_Level_2 database:
MySQL [DESC_DC1_Level_2]> select count(*) from Coadd_Object;
+----------+
| count(*) |
+----------+
| 11288832 |
+----------+
1 row in set (1 min 14.25 sec)
MySQL [DESC_DC1_Level_2]>
I will drop the corresponding table in the DESC_Twinkles_Level_2 db.
I've ingested the imsim dithered coadd catalogs from /global/cscratch1/sd/descdm/DC1/DC1-imsim-dithered at NERSC into the DESC_DC1_Level_2 database:
MySQL [DESC_DC1_Level_2]> select * from Project;
+-----------+----------------------+
| projectId | projectName          |
+-----------+----------------------+
|         0 | DC1 imsim undithered |
|         1 | DC1 imsim dithered   |
+-----------+----------------------+
2 rows in set (0.01 sec)
MySQL [DESC_DC1_Level_2]> select count(*) from Coadd_Object where projectId=1;
+----------+
| count(*) |
+----------+
| 12311932 |
+----------+
1 row in set (3 min 5.85 sec)
The phosim coadd catalog loading has finished.
After the imSim background jobs finish we should re-ingest with analysis flags also decoded/ingested.
@jchiang87 Are we ready to re-ingest the corrected dithered and undithered imSim runs? I sent you a notebook last week with several ways to get at the flags. Did you get that, or are you already clear anyway on what we need to do?
@jchiang87 I think we can close this now, correct? Also, since you have made the new versions with the flags etc., I think you put them in the new project area at NERSC, correct? So I think I can delete mine in the old project area, but I just wanted to check.
yes, sure we can close this.
I've done this for the imsim undithered data, the Level 2 output of which is available at NERSC:
For now, I've put the data in the DESC_Twinkles_Level_2 database on scidb1.nersc.gov. NERSC has provided a DESC_DC1_Level_2 database on nerscdb04.nersc.gov, but that db isn't configured to accept loading via csv files (which is faster by O(10) than insert commands), so I've used the Twinkles db until the DC1 db is reconfigured.
The schema for the Coadd_Object table is derived from the FITS tables in the coadd merged object catalogs, e.g., /global/cscratch1/sd/descdm/DC1/full_focalplane_undithered/deepCoadd-results/merged/0/10,10/ref-0-10,10.fits. I've added columns (as part of the primary key along with the id column) to contain the patch and projectId, respectively. The patch can serve as part of a rough spatial query, in addition to identifying the coadd image where an object can be found. projectId will be used to differentiate the Level 2 catalog results between the imsim undithered, imsim dithered, and phosim DC1 datasets.
Here is example code showing how to obtain query results as a pandas DataFrame:
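A minimal, self-contained sketch of this kind of query follows. It uses an in-memory SQLite database as a stand-in for the MySQL server at NERSC and does not use the pserv API. The cut-down table definitions, the sample rows, the patch value format, and the names make_demo_db and run_query are all assumptions made for illustration.

```python
import sqlite3

# Hypothetical, cut-down stand-ins for the two tables. The real Coadd_Object
# schema has many more columns derived from the FITS catalogs.
SCHEMA = """
CREATE TABLE Project (
    projectId INTEGER PRIMARY KEY,
    projectName TEXT
);
CREATE TABLE Coadd_Object (
    id INTEGER,
    patch TEXT,
    projectId INTEGER,
    PRIMARY KEY (id, patch, projectId)
);
"""

# Select objects by project name, which requires a join with Project.
QUERY_WITH_JOIN = """
SELECT co.id, co.patch
FROM Coadd_Object co
JOIN Project p ON co.projectId = p.projectId
WHERE p.projectName = 'DC1 imsim dithered' AND co.patch = '10,10'
ORDER BY co.id
"""

# Same selection without the join, usable when the projectId is known.
QUERY_BY_PROJECT_ID = """
SELECT id, patch
FROM Coadd_Object
WHERE projectId = 1 AND patch = '10,10'
ORDER BY id
"""


def make_demo_db():
    """Build an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO Project VALUES (?, ?)",
        [(0, "DC1 imsim undithered"), (1, "DC1 imsim dithered")],
    )
    # The same object id can appear under different projects because
    # projectId is part of the primary key.
    conn.executemany(
        "INSERT INTO Coadd_Object VALUES (?, ?, ?)",
        [(1, "10,10", 0), (1, "10,10", 1), (2, "10,10", 1), (3, "10,11", 1)],
    )
    conn.commit()
    return conn


def run_query(conn, query):
    """Return all rows of the query as a list of tuples."""
    return conn.execute(query).fetchall()


conn = make_demo_db()
query = QUERY_WITH_JOIN
# query = QUERY_BY_PROJECT_ID
rows = run_query(conn, query)
print(rows)
```

With pandas installed, pandas.read_sql_query(query, conn) should return the same rows as a DataFrame; against the real database the connection would instead come from a MySQL client or from pserv.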
The latter, commented-out query can be used to avoid the join with the Project table if you know the projectId. Instructions for setting up and using the pserv package are available at the pserv repo.