Open mw201608 opened 1 year ago
Thanks @mw201608! @hsun3163, please correct me if I'm wrong, but that file came from the GTEx repository, right? I wonder whether we should change it and maintain a local copy along the lines @mw201608 suggested, or whether you can check if they have an update. I say we start by changing it locally and then send a PR to their repo as we see fit.
In the container, the RSEM aggregation step fails to extract the proper sample IDs when sample names contain the character `.`, and may even crash (error message shown below) when the partially extracted sample IDs are not unique. This is because the sample IDs are extracted by splitting the file name on `.`: `sample_id = filename.split('.')[0]`
It might be better to use a regular expression to extract the complete sample IDs, like `re.sub(".rsem.[a-z]+.results", "", filename)`
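To illustrate the difference, here is a minimal sketch comparing the two approaches. The file names are hypothetical and assume the pattern `<sample_id>.rsem.<level>.results`; the helper function names are made up for this example and are not taken from the aggregation script. The regular expression is a slightly tightened variant of the one suggested above, with the dots escaped and the match anchored to the end of the name.

```python
import re

# Hypothetical file names (assumption): <sample_id>.rsem.<level>.results,
# where the sample ID itself contains dots.
filenames = [
    "donor.1.tissueA.rsem.genes.results",
    "donor.1.tissueB.rsem.genes.results",
]


def sample_id_by_split(filename):
    """Current behaviour: keep only the text before the first dot."""
    return filename.split('.')[0]


def sample_id_by_regex(filename):
    """Proposed behaviour: strip only the trailing RSEM suffix.

    Dots are escaped so they match literal dots, and `$` anchors the
    pattern to the end of the file name.
    """
    return re.sub(r"\.rsem\.[a-z]+\.results$", "", filename)


split_ids = [sample_id_by_split(f) for f in filenames]
regex_ids = [sample_id_by_regex(f) for f in filenames]

print(split_ids)  # truncated IDs, which collide
print(regex_ids)  # complete IDs, which stay unique
```

With these inputs, the split-based IDs collapse to the same truncated value, which is the kind of duplication that can break the aggregation, while the regex-based IDs remain distinct.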
Minghui