Closed YoannPa closed 3 years ago
Have you got any success running this in serial mode on a small number of input samples, say 3? I am trying your code and it works on my end. But I am running on a mac.
@zwdzwd I will try on a smaller subset and check if it works under Ubuntu, and will come back to you with more info.
Could the problem come from the version of R I am using ? (R version 4.0.0 (2020-04-24))
@zwdzwd So, I tried on a smaller subset of IDATs with less cores, and everything worked well ! Now I am starting to suspect that the system goes out of memory for some reasons.
The warning message put me on this track thanks to this comment
The funny part is that I always keep an eye on htop when I parallel compute something on a cluster, to make sure the cores and the RAM I need are available. And at no time during the process the RAM nor the Swap got full...
I have send a mail to the admin sys of the cluster I use to clarify this point.
I am glad you figure it out!
Hi @zwdzwd , After several attempt I ruled out the memory issue. It is very likely that there is an issue in one of the IDAT files I use. The error is replicable from one computer to another under Linux (Ubuntu or CentOS) using the same data with the same faulty IDAT file. This is a first for me, as I previously made multiple analysis on these IDAT files without trouble. Could it be a IDAT version incompatibility ?
Would you be interested into checking the actual content of this / these problematic IDATs ? Otherwise do you know any tool to check the integrity of IDAT files from HM450K ?
Thank you for your support. Best,
Yoann.
I think it is possible. can you try manually install illuminaio from my fork https://github.com/zwdzwd/illuminaio let me know if this solves your problem. if not, you need to show me your error message.
I think it is possible. can you try manually install illuminaio from my fork https://github.com/zwdzwd/illuminaio let me know if this solves your problem. if not, you need to show me your error message.
I uninstalled my current version of illuminaio, and install the one from the github repo you linked. I restarted my R session, and reloaded packages, with the code. This did not solve the problem. So I believe that 1 IDAT is indeed faulty or corrupted. I runned without any trouble openSesame on other IDAT cohorts I had. I will test all IDATs and let you know if I found something similar between them.
I have tested all IDATs in my possession : only 1 is faulty. So, except if you are interested into finding why this specific IDAT doesn't work with openSesame, I can live without 1 sample out of a thousand. Thank you for the help !
OK. yeah, if not too much trouble can you send me that one IDAT I can look into it when I get a chance. Thanks for digging!
Hi,
I am currently trying to use sesame pipeline on 450K data I have as Idat files, in a folder as following:
This returns me an error and a warning (I have translated parts that are not in english):
I made multiple attempts with the exact same piece of code : note that following these attempts the lengh of "vector" in the error message is not always the same (9 or 4).
Using a smaller integer for number of workers resulted in the same kind of error message and warning.
What am I doing wrong ?
traceback gives:
My sessionInfo():
Thank you in advance for your help !