molikd / otb

Only The Best (Genome Assembly Tools)

ragtag patch step is hella slow. #51

Closed molikd closed 2 years ago

molikd commented 2 years ago

The ragtag patch step with the ec reads is hella slow; this is the step that runs before HiC. We either need to speed it up or remove it.

molikd commented 2 years ago

https://lh3.github.io/minimap2/minimap2.html

We need to set these minimap2 options:

-I NUM | Load at most NUM target bases into RAM for indexing [4G]. If there are more than NUM bases in target.fa, minimap2 needs to read query.fa multiple times to map it against each batch of target sequences. NUM may be ending with k/K/m/M/g/G. NB: mapping quality is incorrect given a multi-part index.

-2 | Use two I/O threads during mapping. By default, minimap2 uses one I/O thread. When I/O is slow (e.g. piping to gzip, or reading from a slow pipe), the I/O thread may become the bottleneck. Apply this option to use one thread for input and another thread for output, at the cost of increased peak RAM.

-K NUM | Number of bases loaded into memory to process in a mini-batch [500M]. Similar to option -I, K/M/G/k/m/g suffix is accepted. A large NUM helps load balancing in the multi-threading mode, at the cost of increased memory.
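A rough way to see why -I matters here: if the target holds more bases than NUM, the index is split into parts and the query is re-read once per part. The sketch below estimates that number of parts. The helper names are hypothetical, and it assumes the size suffixes are decimal multipliers (k = 1e3, m = 1e6, g = 1e9) and ignores that minimap2 splits batches at sequence boundaries, so treat the result as an estimate only.

```python
import math

# Assumption: size suffixes are decimal multipliers (k=1e3, m=1e6, g=1e9).
_SUFFIX = {"k": 1e3, "m": 1e6, "g": 1e9}


def parse_size(text):
    """Turn a size string such as '4G' or '500M' into a number of bases."""
    text = text.strip()
    multiplier = _SUFFIX.get(text[-1].lower())
    if multiplier is None:
        # No recognised suffix: treat the whole string as a plain number.
        return int(float(text))
    return int(float(text[:-1]) * multiplier)


def estimated_index_parts(target_bases, index_size):
    """Estimate how many index parts (and query passes) a given -I implies."""
    return max(1, math.ceil(target_bases / parse_size(index_size)))


# Hypothetical 6 Gbp target: the default 4G needs two passes, 8G needs one.
print(estimated_index_parts(6_000_000_000, "4G"))
print(estimated_index_parts(6_000_000_000, "8G"))
```

Raising -I so the whole target fits in one part also avoids the incorrect mapping quality noted above for multi-part indexes, at the cost of more RAM.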

molikd commented 2 years ago

--mm2-params '-x map-hifi -t <threads> -I 8G -2 -K 2G'
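For context, a minimal sketch of how that option string might be assembled into a full ragtag patch call. The function name, file names, and output directory are hypothetical placeholders, not taken from the otb pipeline, and the command is only printed, not executed.

```python
import shlex


def build_patch_command(target, query, threads, out_dir="ragtag_output"):
    """Build an argument list for ragtag patch with tuned minimap2 options."""
    # The minimap2 options travel as ONE string argument to --mm2-params.
    mm2_params = f"-x map-hifi -t {threads} -I 8G -2 -K 2G"
    return [
        "ragtag.py", "patch",
        "-o", out_dir,
        "--mm2-params", mm2_params,
        target,  # assembly to be patched (hypothetical file name below)
        query,   # sequences used for patching (hypothetical file name below)
    ]


cmd = build_patch_command("assembly.fa", "ec_reads.fa", threads=16)
# shlex.join quotes the mm2 string so the line can be pasted into a shell.
print(shlex.join(cmd))
```

Building the command as a list keeps the minimap2 string as a single argument, which matters because it contains spaces.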

molikd commented 2 years ago

Make the ragtag patch step optional.

molikd commented 2 years ago

pass through added in ec91272