Those reads don't look high enough quality based on the 16-mer distribution, but I'd still say try the uncorrected option. The k-mer histogram may look different with the larger k-mer and the homopolymers compressed. If it still looks like the above, then your reads aren't high enough quality to skip correction. You can apply the suggestions in the FAQ to decrease the space used.
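If you want to preview that compressed-space histogram without launching a full Canu run, you can count homopolymer-compressed k-mers directly with meryl. A minimal sketch, assuming meryl is on your PATH and the reads are in reads.fastq.gz (a placeholder name); exact option order can vary between meryl versions:

# count homopolymer-compressed 22-mers from the reads
meryl count compress k=22 reads.fastq.gz output reads.hpc.k22.meryl

# print frequency/count pairs to inspect for a coverage peak
meryl histogram reads.hpc.k22.meryl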
@skoren thanks. I did what you suggested; what do you think (see below)? I can see there is a peak (at frequency 2), but I'm not sure whether it means the reads are of high enough quality to skip the correction step.
P.S.: I am copying only part of the entire file.
Number of 22-mers that are:
unique 0 (exactly one instance of the kmer is in the input)
distinct 4030482973 (non-redundant kmer sequences in the input)
present 32827508257 (...)
missing 17588155561443 (non-redundant kmer sequences not in the input)
             number of   cumulative   cumulative     presence
              distinct     fraction     fraction   in dataset
frequency        kmers     distinct        total       (1e-6)
--------- ------------ ------------ ------------ ------------
2 1254841656 0.3113 0.0765 0.000061
3 662503087 0.4757 0.1370 0.000091
4 433655786 0.5833 0.1898 0.000122
5 313489010 0.6611 0.2376 0.000152
6 237268518 0.7200 0.2809 0.000183
7 184059664 0.7656 0.3202 0.000213
8 145135980 0.8016 0.3556 0.000244
9 115845145 0.8304 0.3873 0.000274
10 93407136 0.8535 0.4158 0.000305
11 75998201 0.8724 0.4412 0.000335
12 62388033 0.8879 0.4641 0.000366
13 51678258 0.9007 0.4845 0.000396
14 43207931 0.9114 0.5029 0.000426
15 36441369 0.9205 0.5196 0.000457
16 31008624 0.9282 0.5347 0.000487
17 26607229 0.9348 0.5485 0.000518
18 22995514 0.9405 0.5611 0.000548
19 20007515 0.9454 0.5727 0.000579
20 17517691 0.9498 0.5833 0.000609
21 15425425 0.9536 0.5932 0.000640
22 13658537 0.9570 0.6024 0.000670
23 12145486 0.9600 0.6109 0.000701
24 10853354 0.9627 0.6188 0.000731
25 9738236 0.9651 0.6262 0.000762
26 8769288 0.9673 0.6332 0.000792
27 7927700 0.9693 0.6397 0.000822
28 7188911 0.9710 0.6458 0.000853
29 6544789 0.9727 0.6516 0.000883
30 5970442 0.9741 0.6571 0.000914
31 5464074 0.9755 0.6622 0.000944
32 5022570 0.9767 0.6671 0.000975
33 4618440 0.9779 0.6718 0.001005
34 4257592 0.9789 0.6762 0.001036
35 3942236 0.9799 0.6804 0.001066
Are these counted in homopolymer-compressed space?
@skoren It is the file ../0-mercounts/spp_sup.ms22.histogram, created after I ran Canu as: canu -d ID -p spp_sup minReadLength=5000 genomeSize=2.5g -untrimmed correctedErrorRate=0.12 maxInputCoverage=100 'batOptions=-eg 0.10 -sb 0.01 -dg 2 -db 1 -dr 3' -pacbio-hifi my.ont.fastq.gz
I don't see any peak there (other than the error peak), so I don't think skipping correction will work.
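For a mechanical version of that check, here is a rough awk sketch that scans the histogram table for a local maximum beyond the initial error slope (assuming the column layout shown above; a single noisy row can fool it):

# print the first frequency where the distinct-kmer count rises and then falls again;
# prints nothing if the counts only decrease (i.e. only the error slope is present)
awk '$1 ~ /^[0-9]+$/ { if (prev && $2 > prev) rising = 1;
                       if (rising && prev && $2 < prev) { print "peak near frequency", pf; exit }
                       prev = $2; pf = $1 }' ../0-mercounts/spp_sup.ms22.histogram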
@skoren, what about this new distribution (reads corrected with MaSuRCA)?
It looks better to me, but I wonder whether you think it's OK to skip the correction step.
Number of 22-mers that are:
unique 0 (exactly one instance of the kmer is in the input)
distinct 1827589066 (non-redundant kmer sequences in the input)
present 40744590960 (...)
missing 17590358455350 (non-redundant kmer sequences not in the input)
             number of   cumulative   cumulative     presence
              distinct     fraction     fraction   in dataset
frequency        kmers     distinct        total       (1e-6)
--------- ------------ ------------ ------------ ------------
2 39522448 0.0216 0.0019 0.000049
3 35123122 0.0408 0.0045 0.000074
4 51154993 0.0688 0.0095 0.000098
5 72721524 0.1086 0.0185 0.000123
6 94378455 0.1603 0.0324 0.000147
7 111619843 0.2213 0.0515 0.000172
8 121004121 0.2876 0.0753 0.000196
9 121485640 0.3540 0.1021 0.000221
10 114863871 0.4169 0.1303 0.000245
11 102955636 0.4732 0.1581 0.000270
12 90036682 0.5225 0.1846 0.000295
13 77830786 0.5651 0.2095 0.000319
14 67734055 0.6021 0.2328 0.000344
15 60347079 0.6351 0.2550 0.000368
16 55038408 0.6653 0.2766 0.000393
17 51238253 0.6933 0.2980 0.000417
18 48005332 0.7196 0.3192 0.000442
19 44818322 0.7441 0.3401 0.000466
20 41478396 0.7668 0.3604 0.000491
21 37879475 0.7875 0.3799 0.000515
22 34138346 0.8062 0.3984 0.000540
23 30243904 0.8227 0.4155 0.000564
24 26619910 0.8373 0.4311 0.000589
25 23277577 0.8500 0.4454 0.000614
26 20353915 0.8612 0.4584 0.000638
27 17818787 0.8709 0.4702 0.000663
28 15673425 0.8795 0.4810 0.000687
29 13896377 0.8871 0.4909 0.000712
30 12482389 0.8939 0.5001 0.000736
31 11268303 0.9001 0.5086 0.000761
32 10247712 0.9057 0.5167 0.000785
Yes, this looks better as you now have a peak in the 8-9x range, though that's still pretty low given you should have 30x+ coverage.
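As a rough reasoning step: a common approximation is that the k-mer coverage peak sits near (base coverage) × (1 − e)^k, where e is the per-base error rate. A back-of-the-envelope sketch, plugging in the ~9x peak and an assumed 30x base coverage (both numbers are ballpark):

# solve 30 * (1 - e)^22 = 9 for e
awk 'BEGIN { printf "implied per-base error rate: %.1f%%\n", 100 * (1 - exp(log(9/30)/22)) }'

That works out to roughly 5%, though heterozygosity and uneven coverage also flatten and shift the peak, so treat it as a ballpark figure only.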
I was wondering if I could skip the correction step of my assembly and jump directly to the trimming step and onward. My reads were basecalled with the SUP model in Guppy 5+, but I'm not sure whether the k-mer histogram of the reads indicates a low error rate. I mainly want to avoid the error correction because it used over 5 TB of disk in my previous runs and I don't have enough space.
The histogram does not show a peak as clear as the ones I have seen from other users who asked related questions. Also, I am thinking of using a minimum read length of 5 kbp so that the longest reads still give at least 25x coverage (see the quick check below): would that be OK?
I should mention the species is diploid, with a genome size of 2.5 Gb and 3.5% heterozygosity.
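As a quick check of how much coverage a 5 kbp cutoff would keep, here is a sketch assuming standard 4-line FASTQ records and the my.ont.fastq.gz file named elsewhere in this thread:

# total bases in reads >= 5 kbp, expressed as fold coverage of a 2.5 Gb genome
zcat my.ont.fastq.gz | awk 'NR % 4 == 2 && length($0) >= 5000 { sum += length($0) } END { printf "%.1fx retained\n", sum / 2.5e9 }'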
Thanks in advance. Here is the output of Meryl: