alekseyzimin / masurca

GNU General Public License v3.0
243 stars 35 forks source link

LHE_COVERAGE parameter #29

Open TimothyStephens opened 6 years ago

TimothyStephens commented 6 years ago

Hi,

I wish to confirm that the LHE_COVERAGE parameter is the estimated coverage of long reads given to the assembler. Currently its description is difficult to interpret.

Arkhaan commented 6 years ago

Hi,

I was asking myself the same question. I will try run it with LHE_COVERAGE = 6, as we have 6x of long reads. I hope it does mean that.

alekseyzimin commented 6 years ago

LHE_COVERAGE is the maximum longest read coverage the assembler will use. If your coverage is lower than LHE_COVERAGE then all reads will be used. Otherwise, the assembler will use LHE_COVERAGE of the longest reads, discarding the rest.

kfletcher88 commented 4 years ago

Hi, On this topic has anyone done any investigations to optimize? Is there evidence that too much LHE_Coverage could be detrimental or is it a case of the more, the better? Generally is it better to remove reads below a certain length threshold? Thanks

alekseyzimin commented 4 years ago

Hi,

Too much coverage (beyond 25-30) slows down the assembly and does not bring any improvement in the final contiguity or correctness. The assembler by default uses 25x coverage by the longest reads. You can increase theis parameter if you feel that the assembler detected the genome size incorrectly. The genome size is saved in ESTIMATED_GENOME_SIZE.txt

Best, Aleksey

On Fri, Mar 6, 2020 at 11:00 AM Kyle Fletcher notifications@github.com wrote:

Hi, On this topic has anyone done any investigations to optimize? Is there evidence that too much LHE_Coverage could be detrimental or is it a case of the more, the better? Generally is it better to remove reads below a certain length threshold? Thanks

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/alekseyzimin/masurca/issues/29?email_source=notifications&email_token=AGPXGHOW6GSAWXAZW6D4UK3RGEMZHA5CNFSM4FDSQ5R2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEOB3OAA#issuecomment-595834624, or unsubscribe https://github.com/notifications/unsubscribe-auth/AGPXGHJLDXVPXTFFQ4CL6H3RGEMZHANCNFSM4FDSQ5RQ .

-- Dr. Alexey V. Zimin Associate Research Scientist Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA (301)-437-6260 website http://ccb.jhu.edu/people/alekseyz/ blog http://masurca.blogspot.com