Closed schlunma closed 6 years ago
@schlunma are you sure that the data is correct (ie what the diag needs) given that the ensembles differ?
Yes, I am sure. I need the member r1i1p1
of historical tas
of HadGEM2-ES. The data structure looks like this:
/data/cmip5/output1/MOHC/HadGEM2-ES/historical/mon/atmos/Amon
.
├── r1i1p1
├── r2i1p1
├── r3i1p1
└── r4i1p1
and
/data/cmip5/output1/INPE/HadGEM2-ES/historical/mon/atmos/Amon
.
└── r5i1p1
In this case, I would expect the tool to find my data in the MOHC
directory. However, it does not because it also searches in the second directory (INPE
) where clearly no data is available and fails.
I am still puzzelated by this issue: if r1i1p1
data is needed and INPE
doesn't have it, then it should automatically grab it from MOHC
if it has it and shut up, if not, it will unambiguously crap out since data is unavailable in both places - it is very specific, once you tell it you need ensemble r1i1p1
then it will look only for that -- I have just run with HadGEM2-ES and there is no problem for me, I made sure the code won't fail if it doesn't find the data in one place but then it finds it in the next institution -- what variable is this you using, I need to first replicate your issue
oh wait...I think I know where the issue is...
I think
this should be solved by this https://github.com/ESMValGroup/ESMValTool/pull/580/commits/308872c41f07bf86d6b13442bd3ce0ee16607bd0 but unfortunately I can not test in a real case since BADC has not even a bit of INPE data
Fixed by #580.
While testing #580 I encountered another bug:
I have the following dataset in my recipe:
For this model, multiple institutes are available in in
config-developer.yml
:The problem is: On the cluster, data is available for the members
r1i1p1
-r4i1p1
in theMOHC
directory and onlyr5i1p1
in theINPE
directory. Even though the correct data is available, the tool fails with the following error because there is nor1i1p1
directory inINPE
:If I use the
r5i1p1
ensemble, the same problem occurs in theMOHC
directory.In the case of multiple institutes, the tool should check all institutes, and only fail when no data is available in any of them.