geodesymiami / rsmas_insar

RSMAS InSAR code
https://rsmas-insar.readthedocs.io/
GNU General Public License v3.0
62 stars 23 forks source link

run_files to master transition: missing items #153

Closed falkamelung closed 5 years ago

falkamelung commented 5 years ago

Hi @mirzaees , here a few items:

Once these issues are resolved lets switch. I wish we would have had this before I got crazy with the codebase!

mirzaees commented 5 years ago

Hi @falkamelung,

please first update because I fixed some minor bugs that your error from execute_runfiles.py may have come from.

process_rsmas.py $TE/template                            or
process_rsmas.py $TE/template  --dostep  download        or
process_rsmas.py $TE/template  --dostep  process
/run_files/run_1_unpack_slc_topo_master
/run_files/run_2_average_baseline
/run_files/run_3_geo2rdr_resample
/run_files/run_4_extract_stack_valid_region
/run_files/run_5_merge_burst_igram
/run_files/run_6_filter_coherence
/run_files/run_7_merge_master_slave_slc
/run_files/run_8_unwrap
/run_files/run_9_pysar_small_baseline
/run_files/run_10_amplitude_ortho_geo
/run_files/run_11_email_pysar
/run_files/run_12_ingest_insarmaps
/run_files/run_13_email_insarmaps
falkamelung commented 5 years ago

@mirzaees Thank you. It seems to work now. I have to look at it in more detail. Can we we now specify the walltime for a specific job in the template file? How would I specify a longer wall time for run_unpack_slc_topo_master?

What I tried to achieve with vlong, long, short is scaling of the walltime with the job size. Instead of specifying 12 different wall times when we go from 5 to 20 bursts, we just need to give one parameter for the size of the area processed (number of bursts or numbers of pixels for squeezar), and the walltimes are calculated accordingly. If the default apples for 5 bursts, when we say 20 bursts, we would just multiply all wall times by a factor of 4.

mirzaees commented 5 years ago

Ok I see your point. It is not able to read from template right now but I will add it. I will add a parameter to the template as computation_time that accepts short, long and vlong to adjust the wall times.

On May 1, 2019, at 9:13 PM, Falk Amelung notifications@github.com<mailto:notifications@github.com> wrote:


WARNING: This email originated outside of the University of Miami. Do not click links or attachments unless you recognize the sender and know the content is safe.


@mirzaeeshttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fmirzaees&data=02%7C01%7Csara.mirzaee%40rsmas.miami.edu%7C64e16a406e334decf30608d6ce9b74c0%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636923564412155290&sdata=p89%2FMhRRvmCUMyp%2FkD8bnHjfCbM478E9dPdcbQXlYjw%3D&reserved=0 Thank you. It seems to work now. I have to look at it in more detail. Can we we now specify the walltime for a specific job in the template file? How would I specify a longer wall time for run_unpack_slc_topo_master?

What I tried to achieve with vlong, long, short is scaling of the walltime with the job size. Instead of specifying 12 different wall times when we go from 5 to 20 bursts, we just need to give one parameter for the size of the area processed (number of bursts or numbers of pixels for squeezar), and the walltimes are calculated accordingly. If the default apples for 5 bursts, when we say 20 bursts, we would just multiply all wall times by a factor of 4.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeodesymiami%2Frsmas_insar%2Fissues%2F153%23issuecomment-488527993&data=02%7C01%7Csara.mirzaee%40rsmas.miami.edu%7C64e16a406e334decf30608d6ce9b74c0%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636923564412155290&sdata=QOzW5ZO3LODoLZYchzbK1QV7ftt7dOiwZtD7anYRoYk%3D&reserved=0, or mute the threadhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAI4MCE2VY3AZMM3SWYE2J2TPTI55NANCNFSM4HJRRKNA&data=02%7C01%7Csara.mirzaee%40rsmas.miami.edu%7C64e16a406e334decf30608d6ce9b74c0%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636923564412165298&sdata=JQDsS3TliRXa9br7pJwsmj6X4nMwGvUpQY5iYSAERHg%3D&reserved=0.

falkamelung commented 5 years ago

But do it smarter than I did. What you write above does not sound right. I just allowed for 3 different walltimes, which seems enough. We can have a function job_submission_defaults = scale_job_submission_defaults(number_of_bursts=10), and we give the number of bursts with the template (default 5), alternatively we could say job_submission_defaults = scale_job_submission_defaults(area_processed='2000*2000') (default='1000*1000'). If you are short in time lets do for now just quick-and-dirty giving-the-walltime-on-template.

falkamelung commented 5 years ago
/login3/projects/scratch/insarlab/famelung/TESTBENCH1/unittestGalapagosSenDT128/run_files[1056] cat ./run_8_pysar_small_baseline_0_20500916.e
Traceback (most recent call last):
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/prep_isce.py", line 462, in <module>
    main() 
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/prep_isce.py", line 454, in main
    update_mode=inps.update_mode)
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/prep_isce.py", line 402, in prepare_stack
    raise FileNotFoundError('no file found in pattern: {}'.format(filePattern))
FileNotFoundError: no file found in pattern: filt_*.unw
Traceback (most recent call last):
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/pysarApp.py", line 1061, in <module>
    main()
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/pysarApp.py", line 1051, in main
    app.run(steps=inps.runSteps, plot=inps.plot)
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/pysarApp.py", line 980, in run
    self.run_load_data(sname)
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/pysarApp.py", line 338, in run_load_data
    load_complete, stack_file, geom_file = ut.check_loaded_dataset(self.workDir, print_msg=True)[0:3]
  File "/nethome/famelung/test/development/rsmas_insar/sources/PySAR/pysar/utils/utils.py", line 59, in check_loaded_dataset
    raise FileNotFoundError(errno.ENOENT, os.strerror(errno.ENOENT), './INPUTS/ifgramStack.h5')
FileNotFoundError: [Errno 2] No such file or directory: './INPUTS/ifgramStack.h5'
mirzaees commented 5 years ago

Hi Falk,

Good idea, I will fix it today

About the error from pysar, I think it is pysar bug. I run all steps with execute_runfiles.py and test data, it worked fine

Sara

On May 2, 2019, at 1:51 AM, Falk Amelung notifications@github.com<mailto:notifications@github.com> wrote:


WARNING: This email originated outside of the University of Miami. Do not click links or attachments unless you recognize the sender and know the content is safe.


But do it smarter than I did. What you write above does not sound right. I just allowed for 3 different walltimes, which seems enough. We can have a function job_submission_defaults = scale_job_submission_defaults(number_of_bursts=10), and we give the number of bursts with the template (default 5), alternatively we could say job_submission_defaults = scale_job_submission_defaults(area_processed='20002000') (default='10001000'). If you are short in time lets do for now just quick-and-dirty giving-the-walltime-on-template.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeodesymiami%2Frsmas_insar%2Fissues%2F153%23issuecomment-488559973&data=02%7C01%7Csara.mirzaee%40rsmas.miami.edu%7Cf9496deac4d24f49002c08d6cec2397d%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636923730921502472&sdata=XqUcdBtRMj%2FQbKrEjavPnJAb2roK6fruG%2BhxyhV6bVk%3D&reserved=0, or mute the threadhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAI4MCE7EHR5HQ3QT6ZGQCULPTJ6ODANCNFSM4HJRRKNA&data=02%7C01%7Csara.mirzaee%40rsmas.miami.edu%7Cf9496deac4d24f49002c08d6cec2397d%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636923730921502472&sdata=WS5JIepVvlgAIJ5pmcuUHqlWd9CQwxKupMrC8OKRR08%3D&reserved=0.

mirzaees commented 5 years ago

Hi @falkamelung here some update:

jobs = cb.submit_batch_jobs(batch_file=item, 
                            out_dir=os.path.join(inps.work_dir, 'run_files'),
                            memory=memorymax, walltime=walltimelimit, queue=queuename)

I am working on implementing Dask (same as ifgram inversion) for some scripts in rinsar and pysqsar. also on export_ortho_geo.py to make the main short and the script more clear.

falkamelung commented 5 years ago

Thank you for the update. Sounds good. I am looking forward to try. Two comments:

On May 5, 2019, at 12:42 AM, Sara Mirzaee notifications@github.com<mailto:notifications@github.com> wrote:

Hi @falkamelunghttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Ffalkamelung&data=02%7C01%7Cfamelung%40rsmas.miami.edu%7C4f3802dae639480686d508d6d0af7eaa%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636925849497197308&sdata=ByR%2FinxSZwMkh49edO%2BoG0apN0mbhHzXBY%2BY1gE68PI%3D&reserved=0 here some update:

jobs = cb.submit_batch_jobs(batch_file=item, out_dir=os.path.join(inps.work_dir, 'run_files'), memory=memorymax, walltime=walltimelimit, queue=queuename)

I am working on implementing Dask (same as ifgram inversion) for some scripts in rinsar and pysqsar. also on export_ortho_geo.py to make the main short and the script more clear.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeodesymiami%2Frsmas_insar%2Fissues%2F153%23issuecomment-489343186&data=02%7C01%7Cfamelung%40rsmas.miami.edu%7C4f3802dae639480686d508d6d0af7eaa%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636925849497207317&sdata=MGjERBHcHYaMoHULXulZaFxPFcj2Uz9XDyiBBFFTc1s%3D&reserved=0, or mute the threadhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FACVFHXBCQK5KBNEXFB5T7X3PTW4HFANCNFSM4HJRRKNA&data=02%7C01%7Cfamelung%40rsmas.miami.edu%7C4f3802dae639480686d508d6d0af7eaa%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636925849497207317&sdata=kcpOrxpmDmiC%2B%2BSCjTAbOJdCPZS5FuoXftLTgMhSwhE%3D&reserved=0.

mirzaees commented 5 years ago
falkamelung commented 5 years ago

Cool re 1! Re 2, yes for —submit. I sometimes give —submit —walltime 0:10 just to start something.

On May 5, 2019, at 9:13 AM, Sara Mirzaee notifications@github.com<mailto:notifications@github.com> wrote:

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fgeodesymiami%2Frsmas_insar%2Fissues%2F153%23issuecomment-489440378&data=02%7C01%7Cfamelung%40rsmas.miami.edu%7C502d3685ed4543e59a1a08d6d17495ed%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636926696015959159&sdata=TzjK8O8cYv3ubsLSCVxF3b7N2lP0L4Zn6jezyQPQEX0%3D&reserved=0, or mute the threadhttps://nam01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FACVFHXDAP6XQXWXPH3QWAO3PT4BR3ANCNFSM4HJRRKNA&data=02%7C01%7Cfamelung%40rsmas.miami.edu%7C502d3685ed4543e59a1a08d6d17495ed%7C2a144b72f23942d48c0e6f0f17c48e33%7C0%7C0%7C636926696015959159&sdata=W0YM3J4Y9mF71VIkfSwz%2BkYIpKlodKPGJpu99IBlbRc%3D&reserved=0.