Closed cmarshak closed 1 year ago
Here are the 5/15 jobs that completed successfully:
https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/53fa596a-f299-435e-b9f7-d1c685cd2a90/S1-GUNW-A-R-064-tops-20220206_20220125-015020-00119W_00032N-PP-b0e2-v3_0_0.nc
https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/3b38ff13-78b8-4523-b1c9-0308907ee471/S1-GUNW-A-R-064-tops-20220218_20220125-015020-00119W_00032N-PP-0670-v3_0_0.nc
https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/4a8b6892-8ad1-40f3-a50c-e28c8b2edebc/S1-GUNW-A-R-064-tops-20220125_20220113-015021-00119W_00032N-PP-1fc8-v3_0_0.nc
https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/40e00903-9a89-4ece-bfc9-55e17fef7912/S1-GUNW-A-R-064-tops-20220218_20220206-015020-00119W_00032N-PP-c915-v3_0_0.nc
https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/b7803fb8-2969-4a6f-9491-01369af89423/S1-GUNW-A-R-064-tops-20220206_20220113-015020-00119W_00032N-PP-a1b7-v3_0_0.nc
(1 / 15 jobs is still running)
Here is one of the job dicts for a successful job:
{'job_id': '53fa596a-f299-435e-b9f7-d1c685cd2a90',
'job_type': 'INSAR_ISCE_TEST',
'request_time': '2023-06-08T17:59:33+00:00',
'status_code': 'SUCCEEDED',
'user_id': 'cmarshak',
'name': 'Los-Angeles-0_64_HRES_0608',
'job_parameters': {'compute_solid_earth_tide': True,
'estimate_ionosphere_delay': True,
'frame_id': 9847,
'granules': ['S1A_IW_SLC__1SDV_20220206T015006_20220206T015035_041786_04F91C_FABC'],
'secondary_granules': ['S1A_IW_SLC__1SDV_20220125T015006_20220125T015036_041611_04F317_D09A'],
'weather_model': 'HRES'},
'files': [{'filename': 'S1-GUNW-A-R-064-tops-20220206_20220125-015020-00119W_00032N-PP-b0e2-v3_0_0.nc',
's3': {'bucket': 'hyp3-a19-jpl-contentbucket-1wfnatpznlg8b',
'key': '53fa596a-f299-435e-b9f7-d1c685cd2a90/S1-GUNW-A-R-064-tops-20220206_20220125-015020-00119W_00032N-PP-b0e2-v3_0_0.nc'},
'size': 46763184,
'url': 'https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/53fa596a-f299-435e-b9f7-d1c685cd2a90/S1-GUNW-A-R-064-tops-20220206_20220125-015020-00119W_00032N-PP-b0e2-v3_0_0.nc'}],
'logs': [],
'browse_images': ['https://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b.s3.us-west-2.amazonaws.com/53fa596a-f299-435e-b9f7-d1c685cd2a90/S1-GUNW-A-R-064-tops-20220206_20220125-015020-00119W_00032N-PP-b0e2-v3_0_0.png'],
'thumbnail_images': [],
'expiration_time': '2023-12-06T00:00:00+00:00',
'processing_times': [6576.498, 3373.337]}
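For reading the per-step timing out of a job dict like the one above, here is a minimal sketch. It assumes `processing_times` is in seconds and ordered by step, with RAiDER as the second step (as stated in this ticket). The step labels and the `step_minutes` helper are hypothetical names used only for illustration, not part of the HyP3 API.

```python
# Hypothetical labels: the ticket only states that RAiDER is the second step;
# "ISCE" for the first step is an assumption based on the job_type.
STEP_NAMES = ("ISCE", "RAiDER")

# Trimmed copy of the job dict above (only the fields used here).
job = {
    "job_id": "53fa596a-f299-435e-b9f7-d1c685cd2a90",
    "status_code": "SUCCEEDED",
    "processing_times": [6576.498, 3373.337],
}


def step_minutes(job):
    """Map each step label to its run time in minutes (assumes seconds in)."""
    return {
        name: round(seconds / 60, 1)
        for name, seconds in zip(STEP_NAMES, job["processing_times"])
    }


print(step_minutes(job))
```

Under those assumptions, the second entry is the time spent in the RAiDER step, which includes any waiting on the weather model download.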
@jlmaurer - looking at the log files from RAiDER, is there any way to tell how long the request waited before the script started downloading from ECMWF? In the case below, it looks like about a 10 minute lag time. Or is that the time for the complete download over HTTPS?
Using this log
2023-06-08 22:01:49 In case of problems, please check https://confluence.ecmwf.int/display/WEBAPI/Web+API+FAQ or contact servicedesk@ecmwf.int
2023-06-08 22:01:50 Request submitted
2023-06-08 22:01:50 Request id: 64824fcee4fb1eb8b842b7a3
2023-06-08 22:01:50 Request is submitted
2023-06-08 22:01:52 Request is queued
2023-06-08 22:09:53 Request is active
2023-06-08 22:10:21 Calling 'nice mars /tmp/20230608-2200/b8/tmp-_marsI1oAF1.req'
2023-06-08 22:10:21 mars - WARN -
2023-06-08 22:10:21 mars - WARN -
2023-06-08 22:10:21 MIR environment variables:
2023-06-08 22:10:21 MIR_CACHE_PATH=/data/ec_coeff
2023-06-08 22:10:21 Using MARS binary: /usr/local/apps/mars/versions/6.33.15.2/bin/mars.bin
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Welcome to MARS
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - MARS Client build stamp: 20230328082615
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - MARS Client bundle version: 6.33.15.2
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package mars-client version: 6.33.15
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package mir version: 1.16.2
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package odc version: 1.4.6
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package fdb version: 5.11.7
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package metkit version: 1.10.4
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package eckit version: 1.22.0
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - package eccodes version: 2.28.1
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Maximum retrieval size is 75.00 G
2023-06-08 22:10:21 retrieve,levelist=all,stream=oper,area=35.16216216216216/-120.0/32.7/-115.53783783783784,levtype=ml,expver=1,dataset=hres,padding=0,step=0,grid=0.08108108108108109/0.08108108108108109,param=129/130/133/152,time=06:00,date=2022-01-25,resol=av,type=an,class=odmars - INFO - 20230608.220924 - Automatic split on dates is on
2023-06-08 22:10:21
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Processing request 1
2023-06-08 22:10:21
2023-06-08 22:10:21 RETRIEVE,
2023-06-08 22:10:21 DATASET = hres,
2023-06-08 22:10:21 CLASS = OD,
2023-06-08 22:10:21 TYPE = AN,
2023-06-08 22:10:21 STREAM = OPER,
2023-06-08 22:10:21 EXPVER = 0001,
2023-06-08 22:10:21 REPRES = SH,
2023-06-08 22:10:21 LEVTYPE = ML,
2023-06-08 22:10:21 LEVELIST = ALL,
2023-06-08 22:10:21 PARAM = 129/130/133/152,
2023-06-08 22:10:21 TIME = 0600,
2023-06-08 22:10:21 STEP = 0,
2023-06-08 22:10:21 DOMAIN = G,
2023-06-08 22:10:21 RESOL = AV,
2023-06-08 22:10:21 AREA = 35.16216216216216/-120.0/32.7/-115.53783783783784,
2023-06-08 22:10:21 GRID = 0.08108108108108109/0.08108108108108109,
2023-06-08 22:10:21 PADDING = 0,
2023-06-08 22:10:21 DATE = 20220125
2023-06-08 22:10:21
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Web API request id: 64824fcee4fb1eb8b842b7a3
2023-06-08 22:10:21 mars - WARN - 20230608.220924 - Cannot compute number of fields from request.
2023-06-08 22:10:21 mars - WARN - 20230608.220924 - Try to avoid the use of the value 'ALL'
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Setting SO_SNDBUF to 33554432 (32.00 M)
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Current value is 8192 (8.00 K)
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Setting SO_RCVBUF to 33554432 (32.00 M)
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Current value is 43690 (42.67 K)
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Calling mars on 'fdbprod', local port is 59743
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Server task is 420 [ATOS FDB]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Retrieving from FDB [ATOS FDB]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Looking up FDB indexes: 0.004183 second elapsed, 0.003815 second cpu [ATOS FDB]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Calling mars on 'fdbbc', local port is 51629
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Server task is 499 [ATOS FDB BC]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Retrieving from FDB [ATOS FDB BC]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Looking up FDB indexes: 0.000566 second elapsed, 0.000566 second cpu [ATOS FDB BC]
2023-06-08 22:10:21 mars - INFO - 20230608.220924 - Calling mars on 'marsod-core', local port is 53305
2023-06-08 22:10:21 mars - INFO - 20230608.220927 - Server task is 713 [marsod]
2023-06-08 22:10:21 mars - INFO - 20230608.220927 - Request cost: 276 fields, 2.11128 Gbytes online, nodes: mvr003 [marsod]
2023-06-08 22:10:21 mars - INFO - 20230608.220927 - The efficiency of your requests in the last 12 hours is 100% [marsod]
2023-06-08 22:10:21 mars - INFO - 20230608.220927 - Transfering 2266967084 bytes
2023-06-08 22:10:21 mars - INFO - 20230608.220927 - ShToGridded: loading Legendre coefficients '/data/ec_coeff/mir/legendre/4/local-T1279-GaussianN1280-OPT4189816c2e.leg'
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - 276 fields retrieved from 'marsod'
2023-06-08 22:10:21 mars - WARN - 20230608.221014 - Visiting database marsod : expected 0, got 276
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - 276 fields have been interpolated
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Request time: wall: 50 sec cpu: 42 sec
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Processing in marsod: wall: 3 sec
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Visiting marsod: wall: 50 sec
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Read from network: 2.11 Gbyte(s) in 7 sec [313.03 Mbyte/sec]
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Post-processing: wall: 39 sec cpu: 39 sec
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Writing to target file: 1.26 Mbyte(s) in < 1 sec [115.74 Mbyte/sec]
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - Memory used: 375.55 Mbyte(s)
2023-06-08 22:10:21 mars - INFO - 20230608.221014 - No errors reported
2023-06-08 22:10:21 Process '['nice', 'mars', '/tmp/20230608-2200/b8/tmp-_marsI1oAF1.req']' finished
2023-06-08 22:10:21 Calling 'nice grib_to_netcdf /data/scratch/private/blue/02/20230608-2200/5d/_mars-bol-webmars-private-svc-blue-008-4a73a881a8d5eead47db9eff2f9935a4-bSsvPl.grib -o /data/scratch/private/blue/02/20230608-2210/0b/_grib2netcdf-bol-webmars-private-svc-blue-003-4a73a881a8d5eead47db9eff2f9935a4-BbQMRA.nc -utime'
2023-06-08 22:10:21 grib_to_netcdf: Version 2.26.0
2023-06-08 22:10:21 grib_to_netcdf: Processing input file '/data/scratch/private/blue/02/20230608-2200/5d/_mars-bol-webmars-private-svc-blue-008-4a73a881a8d5eead47db9eff2f9935a4-bSsvPl.grib'.
2023-06-08 22:10:21 grib_to_netcdf: Found 276 GRIB fields in 1 file.
2023-06-08 22:10:21 grib_to_netcdf: Ignoring key(s): method, type, stream, refdate, hdate
2023-06-08 22:10:21 grib_to_netcdf: Creating netCDF file '/data/scratch/private/blue/02/20230608-2210/0b/_grib2netcdf-bol-webmars-private-svc-blue-003-4a73a881a8d5eead47db9eff2f9935a4-BbQMRA.nc'
2023-06-08 22:10:21 grib_to_netcdf: NetCDF library version: 4.3.3.1 of Dec 10 2015 16:44:18 $
2023-06-08 22:10:21 grib_to_netcdf: Creating large (64 bit) file format.
2023-06-08 22:10:21 grib_to_netcdf: Defining variable 'z'.
2023-06-08 22:10:21 grib_to_netcdf: Defining variable 't'.
2023-06-08 22:10:21 grib_to_netcdf: Defining variable 'q'.
2023-06-08 22:10:21 grib_to_netcdf: Defining variable 'lnsp'.
2023-06-08 22:10:21 grib_to_netcdf: Done.
2023-06-08 22:10:21 Process '['nice', 'grib_to_netcdf', '/data/scratch/private/blue/02/20230608-2200/5d/_mars-bol-webmars-private-svc-blue-008-4a73a881a8d5eead47db9eff2f9935a4-bSsvPl.grib', '-o', '/data/scratch/private/blue/02/20230608-2210/0b/_grib2netcdf-bol-webmars-private-svc-blue-003-4a73a881a8d5eead47db9eff2f9935a4-BbQMRA.nc', '-utime']' finished
2023-06-08 22:10:21 Request is complete
2023-06-08 22:10:21 Transfering 1.81736 Mbytes into /home/raider/weather_files/HRES_2022_01_25_T06_00_00.nc
2023-06-08 22:10:21 From https://apps.ecmwf.int/api/streaming/private/blue/02/20230608-2210/0b/_grib2netcdf-bol-webmars-private-svc-blue-003-4a73a881a8d5eead47db9eff2f9935a4-BbQMRA.nc
2023-06-08 22:10:26 Transfer rate 413.797 Kbytes/s
2023-06-08 22:10:26 Done
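One rough way to separate queue time from transfer time in a log like the one above is to difference the timestamps on the status lines. The sketch below copies only the status lines from that log. The helper names `stage_times` and `elapsed_seconds` are hypothetical. It also assumes the leading timestamp is when the client observed each status, so the results are approximate.

```python
from datetime import datetime

# Status lines copied from the log above.
LOG = """\
2023-06-08 22:01:50 Request is submitted
2023-06-08 22:01:52 Request is queued
2023-06-08 22:09:53 Request is active
2023-06-08 22:10:21 Request is complete
2023-06-08 22:10:26 Done
"""


def stage_times(log_text):
    """Return the first timestamp seen for each request status line."""
    stages = {}
    for line in log_text.splitlines():
        stamp, message = line[:19], line[20:].strip()
        try:
            when = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")
        except ValueError:
            continue  # not a timestamped line
        if message.startswith("Request is ") or message == "Done":
            stages.setdefault(message, when)
    return stages


def elapsed_seconds(stages, start, end):
    """Seconds between two recorded status messages."""
    return (stages[end] - stages[start]).total_seconds()


stages = stage_times(LOG)
print("queued  :", elapsed_seconds(stages, "Request is queued", "Request is active"))
print("active  :", elapsed_seconds(stages, "Request is active", "Request is complete"))
print("transfer:", elapsed_seconds(stages, "Request is complete", "Done"))
```

Read this way, most of the elapsed time in this log falls between "Request is queued" and "Request is active", and the final transfer over HTTPS takes only a few seconds.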
This is also an interesting issue. We should follow up about this:
Downloading s3://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b/3454a837-7f14-47c6-a663-2ea960e2cce5/S1-GUNW-D-R-048-tops-20220216_20220204-235953-00090E_00040N-PP-ff95-v3_0_0.nc to S1-GUNW-D-R-048-tops-20220216_20220204-235953-00090E_00040N-PP-ff95-v3_0_0.nc
Downloading s3://hyp3-a19-jpl-contentbucket-1wfnatpznlg8b/3454a837-7f14-47c6-a663-2ea960e2cce5/S1-GUNW-D-R-048-tops-20220216_20220204-235953-00090E_00040N-PP-ff95-v3_0_0.json to S1-GUNW-D-R-048-tops-20220216_20220204-235953-00090E_00040N-PP-ff95-v3_0_0.json
Writing /home/raider/.ecmwfapirc locally!
Downloading products: 0%| | 0/1 [00:00<?, ?product/s]
Downloading S1A_OPER_AUX_POEORB_OPOD_20220309T081630_V20220216T225942_20220218T005942.EOF: 0%| | 0.00/4.41M [00:00<?, ?B/s]
Downloading S1A_OPER_AUX_POEORB_OPOD_20220309T081630_V20220216T225942_20220218T005942.EOF: 24%|██■| 1.05M/4.41M [00:00<00:03, 1.11MB/s]
Downloading S1A_OPER_AUX_POEORB_OPOD_20220309T081630_V20220216T225942_20220218T005942.EOF: 48%|████▊ | 2.10M/4.41M [00:01<00:01, 2.19MB/s]
Downloading S1A_OPER_AUX_POEORB_OPOD_20220309T081630_V20220216T225942_20220218T005942.EOF: 95%|█████████▌| 4.19M/4.41M [00:01<00:00, 4.60MB/s]
Downloading S1A_OPER_AUX_POEORB_OPOD_20220309T081630_V20220216T225942_20220218T005942.EOF: 100%|██████████| 4.41M/4.41M [00:01<00:00, 3.51MB/s]
MD5 checksumming: 0%| | 0.00/4.41M [00:00<?, ?B/s]
Downloading products: 100%|██████████| 1/1 [00:01<00:00, 1.91s/product]
Wrote new cfg file: GUNW_20220216-20220204_235953.yaml
WARNING: Weather model only extends to the surface topography; height levels below the topography will be interpolated from the surface and may be inaccurate.
Invalid extension GTiff for cube. Defaulting to .nc
Invalid extension GTiff for cube. Defaulting to .nc
Output cube spacing: 0.1 degrees
Output SNWE: [39.8, 42.0, 89.7, 93.5]
Starting to run the weather model calculation
Date: 20220217
Beginning weather model pre-processing
Weather model HRES is available from 1983-04-20 to Present
2023-06-08 20:27:58 ECMWF API python library 1.6.3
2023-06-08 20:27:58 ECMWF API at https://api.ecmwf.int/v1
2023-06-08 20:27:59 Welcome David Bekaert
2023-06-08 20:28:01 In case of problems, please check https://confluence.ecmwf.int/display/WEBAPI/Web+API+FAQ or contact servicedesk@ecmwf.int
2023-06-08 20:28:02 Request submitted
2023-06-08 20:28:02 Request id: 648239d202015829529e8117
2023-06-08 20:28:02 Request is submitted
ERROR: 'ecmwf.API error 1: ERROR 102 (USER_QUEUED_LIMIT_EXCEEDED): Too many queued requests. Max allowed queued requests per user is 20.'
Weather model point bounds are 39.80/42.00/89.70/94.12
Query datetime: 2022-02-17 00:00:00
ERROR: [Errno 2] No such file or directory: '/home/raider/weather_files/HRES_2022_02_17_T00_00_00.nc'
ERROR: Downloading and/or preparation of HRES failed.
ERROR: No weather model data was successfully obtained.
Traceback (most recent call last):
File "/opt/conda/envs/RAiDER/bin/raider.py", line 8, in <module>
sys.exit(main())
File "/opt/conda/envs/RAiDER/lib/python3.10/site-packages/RAiDER/cli/__main__.py", line 42, in main
process_entry_point.load()()
File "/opt/conda/envs/RAiDER/lib/python3.10/site-packages/RAiDER/cli/raider.py", line 524, in calcDelaysGUNW
cube_filenames = calcDelays([path_cfg])
File "/opt/conda/envs/RAiDER/lib/python3.10/site-packages/RAiDER/cli/raider.py", line 277, in calcDelays
raise RuntimeError
RuntimeError
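As a possible follow-up to the queue-limit failure above, here is a minimal sketch of retrying a weather model fetch with exponential backoff when the error text contains `USER_QUEUED_LIMIT_EXCEEDED`. This is not how RAiDER or the ECMWF client currently behaves. `fetch_with_backoff` and the `fetch` callable are hypothetical, and the sketch assumes the queue-limit error reaches the caller as an exception whose message includes that marker.

```python
import time

# Marker taken from the error text in the log above.
QUEUE_LIMIT_MARKER = "USER_QUEUED_LIMIT_EXCEEDED"


def fetch_with_backoff(fetch, max_attempts=5, base_delay=60.0, sleep=time.sleep):
    """Call fetch(); on a queue-limit error, wait and retry with doubling delays.

    `fetch` is any zero-argument callable that performs the download
    (hypothetical here). Errors without the queue-limit marker are re-raised
    immediately, as is the last queue-limit error once attempts run out.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fetch()
        except Exception as err:
            if QUEUE_LIMIT_MARKER not in str(err) or attempt == max_attempts:
                raise
            # Delays of base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

Backoff only spreads the requests out in time. With many jobs submitted at once under a single ECMWF account, capping the number of concurrent jobs may also be needed to stay under the limit of 20 queued requests per user.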
Describe the bug
Had a conversation with @jlmaurer about HRES availability. He mentioned it has far fewer challenges than ERA5. I submitted some HyP3 jobs to take it for a proverbial test drive.
9/15 failed. As can be seen below, all of the jobs entered the RAiDER step (see processing_times; RAiDER is the second step). The job parameters and logs of those that failed are at the end of this ticket. For a few I inspected, here is the log: