Open popicka opened 4 years ago
Greetings, was there more to that error? I don't see a line number showing where in genReads.py it went wrong.
Would you be able to share your target BED file? I think the code might expect it to be very organized (sorted, no overlapping regions), and things can go wrong if it's not.
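For anyone who wants to test a target file against that expectation before sharing it, here is a minimal sketch of such a check. It is not part of NEAT: `check_bed` is a hypothetical helper, and it assumes that "organized" means each chromosome appears in one contiguous block, starts are ascending within a chromosome, and no region begins before the previous one ends. Header lines starting with `#`, `browser`, or `track` are skipped, which is also an assumption about how these files are laid out.

```python
def check_bed(lines):
    """Return (line_number, message) pairs for problems found in BED-like lines.

    Hypothetical helper, not part of NEAT. BED intervals are treated as
    half-open, so a region that starts exactly where the previous one ends
    is not counted as an overlap.
    """
    problems = []
    seen_chroms = set()
    last_chrom = None
    prev_start = prev_end = None
    for n, line in enumerate(lines, start=1):
        line = line.strip()
        # Skip blank lines and the header lines we assume may be present.
        if not line or line.startswith(("#", "browser", "track")):
            continue
        fields = line.split()
        if len(fields) < 3:
            problems.append((n, "fewer than 3 columns"))
            continue
        chrom = fields[0]
        try:
            start, end = int(fields[1]), int(fields[2])
        except ValueError:
            problems.append((n, "non-integer coordinates"))
            continue
        if start >= end:
            problems.append((n, "start is not less than end"))
        if chrom != last_chrom:
            if chrom in seen_chroms:
                problems.append((n, "chromosome block is not contiguous"))
            seen_chroms.add(chrom)
            last_chrom = chrom
            prev_start, prev_end = start, end
            continue
        if start < prev_start:
            problems.append((n, "not sorted by start"))
        elif start < prev_end:
            problems.append((n, "overlaps previous region"))
        prev_start = start
        prev_end = max(prev_end, end)
    return problems


if __name__ == "__main__":
    example = [
        "track name=example",
        "chr1\t100\t200",
        "chr1\t150\t250",
        "chr1\t50\t60",
        "chr2\t10\t20",
    ]
    for line_number, message in check_bed(example):
        print(line_number, message)
```

An empty result only means the file satisfies these particular assumptions; it does not prove that genReads.py will accept it.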
Hi, sorry for the delayed response. Here are all of the BED files that we used. We get the same error with every one of them:
S07604514_Regions_v6_hg38_Targets.txt
S31285117_Regions_v7_hg38_Targets.txt
S07604514_Regions_v6_hg38_Targets_removed_additional_info.txt
S04380110_Regions_v5_hg38_Targets.txt
S04380110_Regions_v5_hg38_Targets_removed_additional_info.txt
S04380110_Regions_v5_hg38_Targets_removed_additional_info.sorted.txt
Interestingly, we also have a problem with GRCh38 WGS simulations, where NEAT breaks without reporting an error. Thank you!
I was able to successfully simulate small test datasets using those BED files (minus the header lines) and a subset of chr1, so something must be going wrong further down the line. Do you have a log file showing where the simulation was failing? E.g., which chromosome and position did it reach before dying?
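Removing the header lines can be sketched as below. This is an illustration under assumptions, not the exact procedure used in the test above: `strip_bed_headers` is a hypothetical helper, and it assumes the headers are the lines beginning with `#`, `browser`, or `track`.

```python
def strip_bed_headers(lines):
    """Return only the data lines, dropping blanks and assumed header lines.

    Hypothetical helper: header lines are assumed to start with '#',
    'browser', or 'track'.
    """
    header_prefixes = ("#", "browser", "track")
    return [
        line
        for line in lines
        if line.strip() and not line.startswith(header_prefixes)
    ]


if __name__ == "__main__":
    raw = [
        "browser position chr1:1-1000\n",
        "track name=example\n",
        "chr1\t100\t200\n",
        "\n",
        "chr1\t300\t400\n",
    ]
    for line in strip_bed_headers(raw):
        print(line, end="")
```

To clean a real file, read it with `open(path).readlines()`, pass the list through the function, and write the result to a new file so the original stays untouched.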
Hi,
We have been using NEAT to successfully generate reads with the hg19 reference. However, when using GRCh38 with alternative sequences, plus decoys and HLA, we run into several problems.
and got this error:
/usr/local/lib/python2.7/dist-packages/numpy/core/fromnumeric.py:2909: RuntimeWarning: Mean of empty slice.
  out=out, **kwargs)
/usr/local/lib/python2.7/dist-packages/numpy/core/_methods.py:80: RuntimeWarning: invalid value encountered in double_scalars
  ret = ret.dtype.type(ret / rcount)
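For context on what these two warnings mean in general: NumPy emits them when `mean` is called on an array with no elements, and the result is `nan`. The sketch below reproduces that behavior in isolation. It does not show which array inside NEAT ends up empty, which cannot be determined from the message alone, and `mean_or_none` is a hypothetical guard used only for illustration.

```python
import warnings

import numpy as np


def mean_or_none(values):
    """Return the mean as a float, or None when there is nothing to average.

    Hypothetical guard, not part of NEAT.
    """
    arr = np.asarray(values, dtype=float)
    if arr.size == 0:
        return None
    return float(arr.mean())


def mean_of_empty():
    """Average an empty array and collect the warnings NumPy raises."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        result = np.mean(np.array([]))
    return result, caught


if __name__ == "__main__":
    result, caught = mean_of_empty()
    print(result)
    for w in caught:
        print(w.category.__name__, w.message)
    print(mean_or_none([]))
    print(mean_or_none([1, 2, 3]))
```

The second warning's exact wording varies between NumPy versions. Since these are warnings rather than exceptions, they may not be the point at which the run actually stops.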
Do you know what might have caused this error? When running neat-genreads, should we use the original FASTA or the FASTA file used for computeGC? Are there any known bugs related to the GRCh38 reference?
Thank you, Ana