aria-jpl / coseismic_enumerator

Submits a given AOI's enumeration jobs for all intersecting POEORBs
Apache License 2.0

Illegal characters in _id field for S1-COSEISMIC-GUNW-acq-list-event-iter dataset entries in ES #7

Closed riverma closed 3 years ago

riverma commented 3 years ago

Describe the bug The _id field sent to Elasticsearch for the S1-COSEISMIC-GUNW-acq-list-event-iter dataset type sometimes contains space characters. These interfere with HySDS internal processing, which expects only alphanumerics, hyphens, and underscores. We need to address this.

To Reproduce Steps to reproduce the behavior:

  1. Log in to GRQ and filter the view to show only S1-COSEISMIC-GUNW-acq-list-event-iter datasets
  2. Submit an On Demand "purge" job for this dataset
  3. Notice that some of the jobs fail, and that every failed job has a space in its _id field (e.g. S1-COSEISMIC-GUNW-acq-list-event-iter_3 3--AOITRACK_eq_usgs_neic_pdl_us7000d20e_70_Pondaguitan_Philippines_90--acquisition-S1A_IW_SLC__1SDV_20210224T211648_20210224T211717_036737_045171_0973-esa_scihub)

Expected behavior No spaces or special characters within the _id field; only alphanumerics, hyphens, and underscores (a-z0-9-_).
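
As a rough illustration of the expected behavior, the check could look something like the sketch below. This is not the enumerator's actual code: the function names `is_valid_id` and `sanitize_id` are hypothetical, and the sketch assumes uppercase letters are also acceptable, since the example _id above contains them. Replacing each illegal character with an underscore is only one possible fix; the real fix may be to stop the space from being generated in the first place.

```python
import re

# Assumption: legal characters are ASCII letters, digits, hyphen and underscore.
# Uppercase is allowed here because the example _id in this issue contains it.
_ILLEGAL_CHARS = re.compile(r"[^A-Za-z0-9_-]")


def is_valid_id(dataset_id: str) -> bool:
    """Return True if the id is non-empty and contains only legal characters."""
    return bool(dataset_id) and _ILLEGAL_CHARS.search(dataset_id) is None


def sanitize_id(dataset_id: str, replacement: str = "_") -> str:
    """Replace every illegal character (hypothetical policy) with `replacement`."""
    return _ILLEGAL_CHARS.sub(replacement, dataset_id)


# Shortened form of the failing _id from the reproduction steps.
bad_id = "S1-COSEISMIC-GUNW-acq-list-event-iter_3 3--AOITRACK"
fixed_id = sanitize_id(bad_id)
```

Running `is_valid_id` on each _id before it is sent to Elasticsearch would flag entries like the one above without altering them, which may be preferable if silently rewritten ids could collide.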

Screenshots n/a

Environment (please complete the following information): n/a
