CliMA / ClimaCoupler.jl

ClimaCoupler: bringing atmosphere, land, and ocean together
Apache License 2.0
25 stars 4 forks source link

debug 2 GPU error #863

Open juliasloan25 opened 3 months ago

juliasloan25 commented 3 months ago

Purpose

debug error detailed in https://github.com/CliMA/ClimaCoupler.jl/issues/687

Examples

To-do

Content

To reproduce (full driver)

run the following example on buildkite:

      - label: "2 GPU AMIP with diagnostic EDMF"
        key: "gpu_2_amip_diagedmf"
        command: "srun julia --threads=3 --color=yes --project=experiments/ClimaEarth/ experiments/ClimaEarth/run_amip.jl --config_file config/benchmark_configs/amip_diagedmf.yml --job_id gpu_2_amip_diagedmf"
        artifact_paths: "experiments/ClimaEarth/output/amip/gpu_2_amip_diagedmf_artifacts/*"
        agents:
          slurm_gpus_per_task: 1
          slurm_cpus_per_task: 4
          slurm_ntasks: 2
          slurm_mem: 16GB

or, to run interactively, enter a repl with two tasks having one gpu each, and run the driver experiments/ClimaEarth/run_amip.jl with the config file config/benchmark_configs/amip_diagedmf.yml

To reproduce (MRE)

in progress...