Open Phillip-a-richmond opened 2 years ago
Hi Phil, It has been a while, but I would think that 4G would be more than enough. Maybe @ryanlayer has more recent results on this.
... and yes, it can work with streamed crams. samtools view -u
might be a bit faster.
On a single thread can you ballpark wall-time needed to run Excord on a 30x CRAM?
And if I fed extra threads to samtools view with -@ $Threads
would that improve speed for Excord?
I would guess 1.5 hours, but again, @ryanlayer might have more input. I think that excord would likely be the bottleneck so I don't think the additional samtools view threads would help.
Ok thanks! Very useful. I'll run some tests before scaling.
Cool tool and paper, hopefully can get closer to rare SVs in rare disease context with this approach.
Hey Brent,
I'm looking to run Excord on 1kG data from the NYU remap to GRCh38+alt.
Wondering 2 questions:
I'm trying to choose what kind of machines to run this on to be most efficient with the cloud resources. Essentially I'm pulling GRCh38 CRAM from S3-->onboard NVMe, then running this, saving output, and then I'll aggregate with stix over however many samples I can afford to run (assuming ~$3k).
Thanks, Phil