Closed gunjanpandey closed 1 week ago
It is a not a perfect fit, but it is within acceptable limits. One thing to note is the reported genome length will be the haploid length (e.g. human will be reported as 3Gbp and not 6Gbp). But I agree this is small for a fish genome. What is your cutoff for high frequency kmers? I would raise this to 100,000 or 1,000,000 to make sure the most common repeats are accounted for
Good luck! Mike
On Fri, Aug 23, 2024 at 10:06 PM gunjanpandey @.***> wrote:
Could you please help me understand why estimated genome size is almost half in this case? This is a fish genome, and I expect it to be around 1G.
Do I need to sequence more to increase the coverage?
linear_plot.png (view on web) https://github.com/user-attachments/assets/13db0696-2b77-4e0c-8bfe-cf9540c8ba02
— Reply to this email directly, view it on GitHub https://github.com/schatzlab/genomescope/issues/137, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABP34ZLLZR2IBWLCWCSOO3ZS7TDFAVCNFSM6AAAAABNBEPSQ2VHI2DSMVQWIX3LMV43ASLTON2WKOZSGQ4DIMJRGQYDQOA . You are receiving this because you are subscribed to this thread.Message ID: @.***>
Thanks for your help, Mike.
I ran the following command. Where should I set the cutoff.
meryl count k=19 output k19.meryl ${HiFi}
meryl histogram k19.meryl/ > k19_meryl.hist
Rscript ${genomescope} -i k19_meryl.hist -k 19 -o k19_genomescpe
Your other plot included the high frequency kmers (out past 1e8) so meryl must be catching these already (not all kmer counters do)
Good luck!
Mike
On Sun, Sep 1, 2024 at 10:02 AM gunjanpandey @.***> wrote:
Thanks for your help, Mike.
I ran the following command. Where should I set the cutoff.
meryl count k=19 output k19.meryl ${HiFi}
meryl histogram k19.meryl/ > k19_meryl.hist
Rscript ${genomescope} -i k19_meryl.hist -k 19 -o k19_genomescpe
— Reply to this email directly, view it on GitHub https://github.com/schatzlab/genomescope/issues/137#issuecomment-2323352886, or unsubscribe https://github.com/notifications/unsubscribe-auth/AABP346MOQEO5YFSLQET5M3ZUMM55AVCNFSM6AAAAABNBEPSQ2VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMRTGM2TEOBYGY . You are receiving this because you commented.Message ID: @.***>
Thank you
Could you please help me understand why estimated genome size is almost half in this case? This is a fish genome, and I expect it to be around 1G.
Do I need to sequence more to increase the coverage?