Open GreeshmaThulasi opened 6 years ago
Dear Greeshma,
I am afraid that your sample file size is too large to be processed by NIPTeR on the machine you are using. I am afraid that the only two options are to either downsample your files to a processable size (should work), or to use a more powerful machine.
Regards, Lennart
Hi, I can't downsample the files. Does the problem indicates less RAM space ? My system having a RAM space of 8GB. Should I increase the RAM space?
Dear Greeshma,
Yes, that could help, although your bam files are much larger than the ones we used during design of NIPTeR (< 1 Gb). This could mean that even with more RAM it will run out of memory.
I always like using SAMtools view -s for downsampling purposes.
Cheers, Lennart
Ok.
How many samples did you included in the control_group ?
I tried with 4 files, each around 3 GB and again getting the following error.
Loading Bam [bam_sort_core] merging from 22 files... BAM loaded Binning Binning done Loading Bam [bam_sort_core] merging from 20 files... BAM loaded Binning Binning done Loading Bam [bam_sort_core] merging from 17 files... BAM loaded Binning Binning done Loading Bam [bam_sort_core] merging from 15 files... BAM loaded Binning Binning done Loading Bam [bam_sort_core] merging from 33 files... Error in value[[3L]](cond) : cannot allocate vector of size 256.0 Mb file: C:\Users\admin\AppData\Local\Temp\RtmpeK4KRe\file17ec3507d0a.bam index: NA
Whether the downsampling decreases quality of the file?
Dear Greeshma,
Downsampling removes part of the reads, so from that perspective you could say the quality of the file decreases. However a 1 Gb file should have more than enough reads for a reliable prediction, given enough control samples. For all Z-score methods a normal distribution is needed. Generally a threshold of 30 samples is taken before a normal distribution can be obtained. However, the more control samples the better. I would recommend to use at least 100 control samples.
Dear Greeshma,
Another solution may be to change the memory limit settings.
More information can be found here: http://stat.ethz.ch/R-manual/R-devel/library/base/html/Memory-limits.html
Hi @ljohansson I executed with a set of reference data. I searched z-scores for all chromosomes based on chi-squared method. What's the threshold for aneuploidy? Is it 3?
Dear Greeshma, Given a normal distribution of the fraction of the chromosome of interest, 99.87% of samples are expected to have a Z-score below 3. Therefore, indeed often a Z-score of 3 is taken as a threshold for trisomy calling. However, the true trisomy risk is also determined by other factors, such as the CV of the control group (high CV is lower Z-score is low sensitivity), and the a priori risk of a woman to carry a child with a trisomy (high-risk group or low-risk group) as well as the percentage of foetal DNA present. To calculate the personalized post-test risk you could for instance use NIPTRIC: http://www.niptric.eu/
Cheers, Lennart
Hi, I was trying to do the binning for one of my bam files of size 6GB, I got the following error,
even-though I executed from a server. Please help me to rectify the problem.