simpeg / aurora

software for processing natural source electromagnetic data
MIT License

Wide-scale testing on Earthscope #252

Closed kkappler closed 11 months ago

kkappler commented 1 year ago

Scripts are being kept on the earthscope_tests branch, in aurora/test_utils/earthscope, for now.

Tasks include

Karl ToDo

kkappler commented 12 months ago

This task has completed an entire first pass, with aurora results in reasonable agreement with spud in most cases.

A follow-up task is to take the six stages of testing:

and wrap them in a common framework. Towards this, I forked a widescale_test branch off of earthscope_tests, where each of the six steps can be wrapped as an instance of a WideScaleTest class. The idea is that each test has an output table (defined by a schema); the table is prepared as a dataframe, and dask then iterates over the dataframe.
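A minimal sketch of that pattern, not the code on the widescale_test branch: the class body, the column names, the `review_station` function, and the station ids are all hypothetical. To keep it dependency-free, a list of dicts stands in for the dataframe and a standard-library thread pool stands in for dask.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List

Row = Dict[str, object]


@dataclass
class WideScaleTest:
    """Hypothetical stand-in for one stage of the wide-scale test."""

    name: str
    schema: Dict[str, type]            # column name -> expected type
    process_row: Callable[[Row], Row]  # per-row work for this stage

    def prepare_table(self, inputs: List[Row]) -> List[Row]:
        """Build the output table: every schema column exists, unset ones are None."""
        table = []
        for item in inputs:
            row: Row = {col: None for col in self.schema}
            row.update({k: v for k, v in item.items() if k in self.schema})
            table.append(row)
        return table

    def validate(self, row: Row) -> Row:
        """Check that every filled-in value matches the schema type."""
        for col, expected in self.schema.items():
            value = row.get(col)
            if value is not None and not isinstance(value, expected):
                raise TypeError(f"{self.name}: column {col!r} is not {expected.__name__}")
        return row

    def run(self, inputs: List[Row], max_workers: int = 4) -> List[Row]:
        """Prepare the table, then process its rows in parallel (input order is kept)."""
        table = self.prepare_table(inputs)
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            results = list(pool.map(self.process_row, table))
        return [self.validate(row) for row in results]


def review_station(row: Row) -> Row:
    """Hypothetical per-row task; a real stage would do the actual review here."""
    out = dict(row)
    out["n_chars"] = len(str(row["station_id"]))
    out["ok"] = True
    return out


stage = WideScaleTest(
    name="01_review_sketch",
    schema={"station_id": str, "n_chars": int, "ok": bool},
    process_row=review_station,
)
results = stage.run([{"station_id": "STA01"}, {"station_id": "STA002"}])
print(results)
```

Keeping the schema on the class means each stage's summary table has the same columns from run to run, which makes it easier to compare an earlier csv against an updated one.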

I am attaching previous result csv files from stages 00, 01, 02 here for comparison with the updated version.

00_spud_xml_scrape.csv

01_spud_xml_review_2023-09-07_203451.csv

kkappler commented 11 months ago

All stages have successfully executed on gadi, as well as on my local machine.

If we were to do this again, I would make the following updates:

  1. Test dask more thoroughly; it was not clear that dask was speeding things up much for stage 01.
  2. Merge the h5 files, either into one archive or at least into one archive per survey, rather than keeping one mth5 per station.
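For the first item, one way to check whether parallel execution is paying off is to time the same per-station task serially and in parallel. The sketch below is only an illustration under stated assumptions: `fake_station_task` and the station ids are hypothetical placeholders, and a standard-library thread pool is used in place of dask so the example runs without extra dependencies.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_station_task(station_id: str) -> str:
    """Hypothetical placeholder for one station's stage-01 work."""
    time.sleep(0.01)  # simulate I/O-bound work such as reading a file
    return station_id.lower()


def timed(fn):
    """Run fn() and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


stations = [f"STA{i:02d}" for i in range(8)]  # hypothetical station ids


def run_serial():
    return [fake_station_task(s) for s in stations]


def run_parallel():
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(fake_station_task, stations))


serial_result, serial_s = timed(run_serial)
parallel_result, parallel_s = timed(run_parallel)

print(f"serial:   {serial_s:.3f} s")
print(f"parallel: {parallel_s:.3f} s")
```

If the two timings are close on real stage-01 work, the per-station task is probably dominated by something the parallel scheduler cannot overlap, and that is worth knowing before scaling up.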

For reference, I zipped and attached the summary table csvs from gadi:

summary_tables_gadi_20230928.zip