chanzuckerberg / shasta

[MOVED] Moved to paoloshasta/shasta. De novo assembly from Oxford Nanopore reads
Other
272 stars 59 forks source link

If there is a way I can know which raw reads go to a specific contig? #294

Closed WenyuLiang closed 2 years ago

WenyuLiang commented 2 years ago

Hi! If there is a way I can know which raw reads go to a specific contig?

paoloczi commented 2 years ago

For haploid assembly, you can do that using the following command line option:

--Assembly.writeReadsByAssembledSegment

I have not tested this option in some time, so if you bump into problems please post here and I will look into it.

For diploid assembly, this functionality is not available.

paoloczi commented 2 years ago

A bit more information on that option. If you turn it on, the assembly directory will contain a csv file named ReadsBySegment.csv. The top of the file looks like this:

image

The meaning of the columns is as follows:

WenyuLiang commented 2 years ago

Thank you so much!!!

paoloczi commented 2 years ago

I am closing this due to lack of additional discussion. If other questions emerge, feel free to open another issue.