Closed pkrog closed 6 years ago
The error was first detected by @korseby (see https://phenomenal-h2020.slack.com/archives/C0R5FKERL/p1531301810000299 on Slack).
Fixes found upstream in https://github.com/ISA-tools/isa-rwval/pull/10
We just need a new package version number for isa-rwval, now, @djcomlab .
Yes, see last commit.
Just testing it now.
For some reason when loading MTBLS520, there are no factors listed, or assays and data files:
Great ! thanks. Tell me if you want me to test that.
Yes that is the effect I guess. Look at my patch. I've just transformed None values into empty lists. I don't know while those three variables were set to None.
So maybe the real issue is somewhere upper in the code, but I don't know how to identify it. I suggest we let it like this for Dalcotidine release, and open an issue in rwval so maybe you can find the real problem later.
OK sure. The real problem is likely that MTBLS520 is malformed. @proccaserra has reported various problems in MetaboLights ISA-Tabs to them already.
But if @korseby needs MTBLS520 to work urgently, I can prioritise fixing this.
Maybe at some point we could had a warning message in red inside the HTML saying something like "The input ISA-Tab archive was malformed, some information about the internal data cannot be displayed.".
On Metabolights there are listed several experimental factors, see screenshot:
I don't quite understand. Can you forward me the list of potential errors?
The challenge of the MTBLS520 data set is, that it has a different experiment design than traditional biomedicine studies for which Metabolights was designed for.
@korseby There are a range of errors in the ISA-Tabs that MetaboLights has stored in its database that sometimes cause loading problems. I'll check MTBLS520 myself through the validator and also try and see why it is missing displaying some of the metadata as per above.
MTBLS520 seems to load OK with the full isatools
ISA-Tab parser (Galaxy datatype uses isa-rwval
that has a stripped-back parser).
That sounds great. Are there any showstoppers left?
Yes, the Galaxy datatype can't use the full isatools
ISA-Tab parser...
I found the problem. It was indeed cause by a data issue! In the investigation file:
STUDY PUBLICATIONS
Study PubMed ID ""
Study Publication DOI ""
Study Publication Author List "Kristian Peters
Karin Gorzolka
Steffen Neumann
Helge Bruelheide"
Study Publication Title "Computational workflow to study the seasonal variation of secondary metabolites in 9 different bryophytes"
Study Publication Status ""
Study Publication Status Term Accession Number ""
Study Publication Status Term Source REF ""
We can see that the Study Publication Author List
value has line breaks, which the parser picks up as newlines obviously, then incorrectly continues. The investigation file is parsed like a CSV table, so for the author list only "Kristian Peters
would be picked up, and then the following lines go on to cause the parser to behave incorrectly.
This should now be fixed with changes in isa-rwval
ref https://github.com/ISA-tools/isa-rwval/issues/11
Using branch
release_18.01_plus_isa_runnerRestartJobs
, I've tried to upload study MTBLS520.zip (metadata only). The ISA dataset creation failed, displaying the messageUnable to finish job
. The error in the Galaxy log was: