Closed shwong-tw closed 1 year ago
Hi, can you share the version of Mosek you are using? If it is version 8, please upgrade to Mosek version 9 or 10 (via pip or conda). This error may also occur if the bam file is from targeted sequencing (e.g. Circle-Seq), as opposed to whole-genome sequencing.
Thanks, Jens
HI Jens,
Sorry for the missing info, I ran AA with docker option and the Mosek version it used was 9.2.49. The data is from whole-genome sequencing, and the previous AA run (when specifying other seed intervals) with the same sample was successful. Please let me know if furhter information would be helpful. Thanks a lot!
Cheers, Siao-Han
Thanks for clarifying Siao-Han,
How are you providing seed intervals to the tool? What is the source of the copy number estimates given to the seeding regions? Are you using AmpliconSuite-pipeline to generate the seed regions? If not, how are you selecting the seeds? It's possible AmpliconAchitect is being given some problematic regions of the genome.
If you are able to provide
I would be happy to take a look.
Thanks, Jens
Hi Jens,
Thanks for the prompt reply. I did select seed regions with my own criteria (therefore tried different runs on the same sample). It's simply differnet copy-number cutoffs, without filtering by interval size. If there are some rule of thumbs (e.g. size) indicating bad intervals I can check whether this is the case in my seed intervals.
Edit: I've just checked that the smallest interval sizes were around 50kbp in the failed samples. However, other successfully finished samples contain smaller intervals (down to 25kbp)
The exact command would be argstring=" --bam /home/bam_dir/tumor.bam --bed /home/bed_dir/tumor.bed --out /home/output/H059-5DFS_tumor7 --downsample 0 --ref GRCh37" /home/run_aa_script.sh
And an example log is (hopefully) attached. Example.log
But to be honest, please don't spend too much time on looking into the log. The only thing I hope for is that AA can finish the run and I wouldn't mind there were one or two amplicons skipped/not resolved.
Thanks a lot!
Cheers, Siao-Han
Hi Siao-Han,
Thanks for clarifying. We cannot guarantee that AA will work when deployed on any custom set of intervals - they may contain unfiltered repeat elements and other problematic parts of the genome. A standardized method for selecting these intervals is available in AmpliconSuite-pipeline, which uses databases of these known problematic regions and other criteria to establish a reliable set of seed regions where focal amplifications may exist.
Thank, Jens
I understand, thanks for the feedback :)
Hi,
Thanks for making this software available to the community. I've done several rounds of AA (using different seed intervals) and it usually runs successfully.
In a recent run, some of the samples encountered optimization issue at the MOSEK step on specific amplicons (please find error msg below). As a result, AA terminated the run and reported error msg. The problematic amplicons ended up with cycles/graph.txt files empty, while the edges_cnseg/edges.txt files are only sometimes empty, amplicon log was not generated.
I wonder if it is possible to include an error-handling mechanism, so that upon MOSEK optimization failure, AA would report warning and skip this amplicon instead of terminating the entire AA run?
Thank you very much!
versions
software log:
cluster log: