Closed dcopetti closed 2 years ago
I'm not sure why all the jobs landed on one node. They should have spread across the cluster (and caused even more havoc to the file server). Are you running canu
on the head node, or submitting it as a job to the grid? In the latter case, it will (probably) end up running on just a single node and ignoring the grid entirely. Post one of the log files from canu-scripts
.
A bit of what's going on here. The 'mhap' processes are computing half of the overlaps, writing (compressed) .ovb files. The other half of the overlaps are just the same as the first half, but 'flipped': if mhap computed "read A aligned to read B" the flip would be "read B aligned to read A" - it's the same alignment, but upside down. This flip is done when overlaps are copied into the ovlStore. The complication is that we need to group all overlaps for a single read together and so "building the ovlStore" is equivalent to "sort 2*45 TB of data" (in your case). This sort is done in two passes, roughly "flip overlaps and write to buckets" (each bucket will hold overlaps for some set of reads) then "sort each bucket". Based on the directory sizes, you're failing in the first pass.
The solution, as you correctly guessed, is to increase the minimum overlap length. In the assembly directory, there is a error_correct.report
that contains a read length histogram. Here's a (truncated) example from human:
-- In sequence store './asm.seqStore':
-- Found 11308761 reads.
-- Found 228039710962 bases (73.56 times coverage).
--
-- G=161318970111 sum of || length num
-- NG length index lengths || range seqs
-- ----- ------------ --------- ------------ || ------------------- -------
-- 00010 84952 155984 16131904946 || 1000-15446 8292104|---------------------------------------------------------------
-- 00020 67093 371392 32263812356 || 15447-29893 1304285|----------
-- 00030 54953 637754 48395707775 || 29894-44340 724114|------
-- 00040 45008 962310 64527623427 || 44341-58787 447442|----
-- 00050 36058 1362378 80659502478 || 58788-73234 262662|--
-- 00060 27456 1873889 96791390485 || 73235-87681 142416|--
-- 00070 19119 2575304 112923279581 || 87682-102128 73271|-
-- 00080 11707 3651832 129055187228 || 102129-116575 34932|-
-- 00090 5859 5586443 145187073272 || 116576-131022 15700|-
-- 00100 1000 11308760 161318970111 || 131023-145469 6512|-
-- 001.000x 11308761 161318970111 || 145470-159916 2916|-
-- || 159917-174363 1224|-
-- || 174364-188810 540|-
-- || 188811-203257 286|-
-- || 203258-217704 131|-
-- || 217705-232151 94|-
-- || 232152-246598 45|-
For this assembly, I did two things: increase the minimum read length from the default 1,000 to 10,000 and increase the minimum overlap from the default 500 to 8,000. You can use the NG table on the left as a guide for the minimum read length to use, then pick an overlap length a tad below there. Note that 'index' is the number of reads at this length or longer, and by throwing out around 19x of coverage (length 10,000) I threw out around 70% of the reads which should also get rid of (at least) 70% of the overlaps.
Now that the theory is out of the way....how to move forward?
1) Delete everything, restart from scratch after changing minReadLength and minOverlapLength. This is the simplest and 'cleanest', but most expensive option.
2) Filter short overlaps from the existing .ovb files. This is probably the correct way to go, but, surprisingly, I don't see an easy way to do it and I'll have to write a little bit of code. Give me a day or so to get this implemented and tested. The process will likely be to run a command on each of the .ovb files in 1-overlapper/results/.
The existing error_correct.ovlStore.BUILDING
(and config files) can be removed.
(Don't forget to post logging from canu-scripts/
and *.report
)
Well, to address your first paragraph: the 1 CPU jobs were submitted and running on all nodes, in all about 480 processes. It really slowed down the head node. You can see this started in the graph right in between the "Fri" and "Sun" marks, and the load went up very high. All of those processes were successful though except for the 128 running on n009. I believe something must have crashed on there because they never finished, but the other jobs running finished and canu went through all 700+ jobs for this step. The load on n009 went up to over 400 according to our monitoring, and I just rebooted it to recover it. Now canu seems to have spawned a bunch of new jobs, not sure if it is resuming or what, the current state of our SGE is below. Not sure what will happen now as we only have about 7.8T free on the working partition. It sounds like I should kill this canu job and wait for your code update (but how? it is spawned in the background, is there a command to issue to stop canu running?).
$ qstat
queuename qtype resv/used/tot. load_avg arch states
---------------------------------------------------------------------------------
all.q@n001.genome.arizona.edu BIP 0/25/25 13.60 lx26-amd64
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 190
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 191
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 192
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 193
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 194
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 195
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 196
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 197
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 198
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 199
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 200
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 201
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 202
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 203
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 204
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 205
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 206
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 207
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 208
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 209
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 210
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 211
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 212
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 213
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 214
---------------------------------------------------------------------------------
all.q@n002.genome.arizona.edu BIP 0/25/25 14.16 lx26-amd64
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 215
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 216
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 217
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 218
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 219
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 220
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 221
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 222
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 223
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 224
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 225
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 226
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 227
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 228
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 229
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 230
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 231
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 232
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 233
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 234
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 235
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 236
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 237
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 238
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 239
---------------------------------------------------------------------------------
all.q@n003.genome.arizona.edu BIP 0/0/25 0.57 lx26-amd64
---------------------------------------------------------------------------------
all.q@n004.genome.arizona.edu BIP 0/41/41 24.13 lx26-amd64
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 240
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 241
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 242
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 243
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 244
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 245
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 246
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 247
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 248
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 249
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 250
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 251
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 252
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 253
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 254
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 255
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 256
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 257
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 258
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 259
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 260
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 261
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 262
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 263
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 264
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 265
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 266
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 267
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 268
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 269
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 270
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 271
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 272
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 273
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 274
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 275
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 276
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 277
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 278
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 279
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 280
---------------------------------------------------------------------------------
all.q@n005.genome.arizona.edu BIP 0/41/41 20.41 lx26-amd64
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 281
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 282
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 283
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 284
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 285
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 286
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 287
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 288
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 289
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 290
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 291
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 292
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 293
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 294
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 295
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 296
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 297
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 298
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 299
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 300
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 301
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 302
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 303
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 304
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 305
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 306
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 307
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 308
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 309
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 310
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 311
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 312
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 313
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 314
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 315
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 316
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 317
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 318
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 319
342291 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 320
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 346
---------------------------------------------------------------------------------
all.q@n006.genome.arizona.edu BIP 0/41/41 19.50 lx26-amd64
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 388
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 389
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 390
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 391
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 392
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 393
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 394
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 395
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 396
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 397
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 398
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 399
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 400
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 401
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 402
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 403
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 404
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 405
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 406
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 407
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 408
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 409
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 410
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 411
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 412
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 413
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 414
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 415
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 416
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 417
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 418
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 419
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 420
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 421
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 422
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 423
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 424
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 425
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 426
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 427
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 428
---------------------------------------------------------------------------------
all.q@n007.genome.arizona.edu BIP 0/41/41 20.06 lx26-amd64
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 347
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 348
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 349
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 350
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 351
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 352
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 353
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 354
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 355
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 356
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 357
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 358
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 359
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 360
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 361
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 362
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 363
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 364
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 365
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 366
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 367
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 368
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 369
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 370
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 371
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 372
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 373
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 374
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 375
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 376
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 377
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 378
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 379
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 380
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 381
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 382
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 383
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 384
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 385
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 386
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 387
---------------------------------------------------------------------------------
all.q@n008.genome.arizona.edu BIP 0/116/131 55.40 lx26-amd64
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 429
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 430
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 431
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 432
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 433
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 434
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 435
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 436
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 437
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 438
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 439
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 440
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 441
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 442
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 443
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 444
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 445
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 446
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 447
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 448
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 449
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 450
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 451
342292 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 452
342293 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 454
342293 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 455
342294 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 457
342294 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 458
342295 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 460
342295 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 461
342296 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 463
342296 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 464
342297 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 466
342297 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 467
342298 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 469
342298 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 470
342299 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 472
342299 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 473
342300 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 475
342300 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 476
342301 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 517
342302 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 520
342303 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 524
342304 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 531
342305 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 537
342305 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 538
342306 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 544
342306 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 545
342307 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 552
342308 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 558
342309 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 562
342310 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 564
342311 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 568
342312 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 572
342313 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 578
342314 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 580
342315 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 590
342316 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 592
342317 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 596
342318 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 598
342319 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 600
342320 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 602
342321 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 604
342322 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 606
342323 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 608
342324 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 610
342325 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 618
342326 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 620
342327 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 628
342328 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 632
342329 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 634
342330 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 636
342331 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 638
342332 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 642
342333 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 644
342334 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 646
342335 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 656
342336 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 662
342337 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 664
342338 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 666
342339 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 668
342340 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 670
342341 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 672
342342 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 674
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 676
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 677
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 678
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 679
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 680
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 681
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 682
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 683
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 684
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 685
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 686
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 687
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 688
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 689
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 690
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 691
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 692
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 693
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 694
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 695
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 696
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 697
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 698
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 699
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 700
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 701
342343 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 702
342344 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 704
342344 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 705
342345 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 708
342346 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 710
342347 0.55500 ovB_error_ scar r 09/14/2021 10:52:47 1 737
---------------------------------------------------------------------------------
all.q@n009.genome.arizona.edu BIP 0/0/131 0.94 lx26-amd64
---------------------------------------------------------------------------------
all.q@pac.genome.arizona.edu BIP 0/0/20 34.10 lx26-amd64 d
############################################################################
- PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS - PENDING JOBS
############################################################################
342348 0.00000 canu_error scar hqw 09/14/2021 10:52:41 1
$
Sorry, also here are the last 2 log files from canu-scripts: Sep 4 15:29 canu.06.out Sep 14 10:52 canu.07.out
$ cat canu-scripts/canu.06.out
Found perl:
/usr/bin/perl
Found java:
/usr/bin/java
java version "1.8.0_66"
Found canu:
/usr/local/src/canu-2.1.1/build/bin/canu
canu 2.1.1
-- canu 2.1.1
--
-- CITATIONS
--
-- For 'standard' assemblies of PacBio or Nanopore reads:
-- Koren S, Walenz BP, Berlin K, Miller JR, Phillippy AM.
-- Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
-- Genome Res. 2017 May;27(5):722-736.
-- http://doi.org/10.1101/gr.215087.116
--
-- Read and contig alignments during correction and consensus use:
-- Šošic M, Šikic M.
-- Edlib: a C/C ++ library for fast, exact sequence alignment using edit distance.
-- Bioinformatics. 2017 May 1;33(9):1394-1395.
-- http://doi.org/10.1093/bioinformatics/btw753
--
-- Overlaps are generated using:
-- Berlin K, et al.
-- Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.
-- Nat Biotechnol. 2015 Jun;33(6):623-30.
-- http://doi.org/10.1038/nbt.3238
--
-- Myers EW, et al.
-- A Whole-Genome Assembly of Drosophila.
-- Science. 2000 Mar 24;287(5461):2196-204.
-- http://doi.org/10.1126/science.287.5461.2196
--
-- Corrected read consensus sequences are generated using an algorithm derived from FALCON-sense:
-- Chin CS, et al.
-- Phased diploid genome assembly with single-molecule real-time sequencing.
-- Nat Methods. 2016 Dec;13(12):1050-1054.
-- http://doi.org/10.1038/nmeth.4035
--
-- Contig consensus sequences are generated using an algorithm derived from pbdagcon:
-- Chin CS, et al.
-- Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.
-- Nat Methods. 2013 Jun;10(6):563-9
-- http://doi.org/10.1038/nmeth.2474
--
-- CONFIGURE CANU
--
-- Detected Java(TM) Runtime Environment '1.8.0_66' (from 'java') with -d64 support.
-- Detected gnuplot version '4.2 patchlevel 6 ' (from 'gnuplot') and image format 'png'.
-- Detected 24 CPUs and 126 gigabytes of memory.
-- Detected Sun Grid Engine in '/usr/share/gridengine/default'.
-- User supplied Parallel Environment 'smp'.
-- User supplied Memory Resource 'mem_free'.
--
-- Found 2 hosts with 128 cores and 1009 GB memory under Sun Grid Engine control.
-- Found 4 hosts with 24 cores and 126 GB memory under Sun Grid Engine control.
-- Found 4 hosts with 40 cores and 252 GB memory under Sun Grid Engine control.
--
-- (tag)Threads
-- (tag)Memory |
-- (tag) | | algorithm
-- ------- ---------- -------- -----------------------------
-- Grid: meryl 42.000 GB 8 CPUs (k-mer counting)
-- Grid: hap 16.000 GB 20 CPUs (read-to-haplotype assignment)
-- Grid: cormhap 60.000 GB 12 CPUs (overlap detection with mhap)
-- Grid: obtovl 24.000 GB 8 CPUs (overlap detection)
-- Grid: utgovl 24.000 GB 8 CPUs (overlap detection)
-- Grid: cor 24.000 GB 4 CPUs (read correction)
-- Grid: ovb 4.000 GB 1 CPU (overlap store bucketizer)
-- Grid: ovs 32.000 GB 1 CPU (overlap store sorting)
-- Grid: red 42.000 GB 8 CPUs (read error detection)
-- Grid: oea 8.000 GB 1 CPU (overlap error adjustment)
-- Grid: bat 1009.000 GB 64 CPUs (contig construction with bogart)
-- Grid: cns -.--- GB 8 CPUs (consensus)
--
-- In 'error_correct.seqStore', found Nanopore reads:
-- Nanopore: 5
--
-- Raw: 1
--
-- Generating assembly 'error_correct' in '/data/dario/canu_sikem/Sikem_ONT_EC':
-- - correct raw reads.
-- - trim corrected reads.
-- - assemble corrected and trimmed reads.
--
-- Parameters:
--
-- genomeSize 5300000000
--
-- Overlap Generation Limits:
-- corOvlErrorRate 0.3200 ( 32.00%)
-- obtOvlErrorRate 0.1200 ( 12.00%)
-- utgOvlErrorRate 0.1200 ( 12.00%)
--
-- Overlap Processing Limits:
-- corErrorRate 0.5000 ( 50.00%)
-- obtErrorRate 0.1200 ( 12.00%)
-- utgErrorRate 0.1200 ( 12.00%)
-- cnsErrorRate 0.2000 ( 20.00%)
--
--
-- BEGIN CORRECTION
--
--
-- OVERLAPPER (mhap) (correction) complete, not rewriting scripts.
--
-- Found 751 mhap overlap output files.
-- Finished stage 'cor-mhapCheck', reset canuIteration.
----------------------------------------
-- Starting command on Sat Sep 4 15:27:30 2021 with 59208.256 GB free disk space
cd correction
/usr/local/src/canu-2.1.1/build/bin/ovStoreConfig \
-S ../error_correct.seqStore \
-M 16-32 \
-L ./1-overlapper/ovljob.files \
-create ./error_correct.ovlStore.config \
> ./error_correct.ovlStore.config.txt \
2> ./error_correct.ovlStore.config.err
-- Finished on Sat Sep 4 15:29:05 2021 (95 seconds) with 59208.241 GB free disk space
----------------------------------------
--
-- Creating overlap store correction/error_correct.ovlStore using:
-- 751 buckets
-- 3838 slices
-- using at most 29 GB memory each
-- Finished stage 'cor-overlapStoreConfigure', reset canuIteration.
--
-- Running jobs. First attempt out of 2.
--
-- 'scripts/1-bucketize.jobSubmit-01.sh' -> job 342289 tasks 1-751.
--
----------------------------------------
-- Starting command on Sat Sep 4 15:29:05 2021 with 59208.241 GB free disk space
cd /data/dario/canu_sikem/Sikem_ONT_EC
qsub \
-hold_jid 342289 \
-pe smp 1 \
-l mem_free=4g \
-V \
-q all.q \
-S /bin/bash \
-cwd \
-N 'canu_error_correct' \
-j y \
-o canu-scripts/canu.07.out canu-scripts/canu.07.sh
Your job 342290 ("canu_error_correct") has been submitted
-- Finished on Sat Sep 4 15:29:05 2021 (in the blink of an eye) with 59208.241 GB free disk space
----------------------------------------
$
$ cat canu-scripts/canu.07.out
Found perl:
/usr/bin/perl
Found java:
/usr/bin/java
java version "1.8.0_66"
Found canu:
/usr/local/src/canu-2.1.1/build/bin/canu
canu 2.1.1
-- canu 2.1.1
--
-- CITATIONS
--
-- For 'standard' assemblies of PacBio or Nanopore reads:
-- Koren S, Walenz BP, Berlin K, Miller JR, Phillippy AM.
-- Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.
-- Genome Res. 2017 May;27(5):722-736.
-- http://doi.org/10.1101/gr.215087.116
--
-- Read and contig alignments during correction and consensus use:
-- Šošic M, Šikic M.
-- Edlib: a C/C ++ library for fast, exact sequence alignment using edit distance.
-- Bioinformatics. 2017 May 1;33(9):1394-1395.
-- http://doi.org/10.1093/bioinformatics/btw753
--
-- Overlaps are generated using:
-- Berlin K, et al.
-- Assembling large genomes with single-molecule sequencing and locality-sensitive hashing.
-- Nat Biotechnol. 2015 Jun;33(6):623-30.
-- http://doi.org/10.1038/nbt.3238
--
-- Myers EW, et al.
-- A Whole-Genome Assembly of Drosophila.
-- Science. 2000 Mar 24;287(5461):2196-204.
-- http://doi.org/10.1126/science.287.5461.2196
--
-- Corrected read consensus sequences are generated using an algorithm derived from FALCON-sense:
-- Chin CS, et al.
-- Phased diploid genome assembly with single-molecule real-time sequencing.
-- Nat Methods. 2016 Dec;13(12):1050-1054.
-- http://doi.org/10.1038/nmeth.4035
--
-- Contig consensus sequences are generated using an algorithm derived from pbdagcon:
-- Chin CS, et al.
-- Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.
-- Nat Methods. 2013 Jun;10(6):563-9
-- http://doi.org/10.1038/nmeth.2474
--
-- CONFIGURE CANU
--
-- Detected Java(TM) Runtime Environment '1.8.0_66' (from 'java') with -d64 support.
-- Detected gnuplot version '4.2 patchlevel 6 ' (from 'gnuplot') and image format 'png'.
-- Detected 24 CPUs and 126 gigabytes of memory.
-- Detected Sun Grid Engine in '/usr/share/gridengine/default'.
-- User supplied Parallel Environment 'smp'.
-- User supplied Memory Resource 'mem_free'.
--
-- Found 2 hosts with 128 cores and 1009 GB memory under Sun Grid Engine control.
-- Found 4 hosts with 24 cores and 126 GB memory under Sun Grid Engine control.
-- Found 4 hosts with 40 cores and 252 GB memory under Sun Grid Engine control.
--
-- (tag)Threads
-- (tag)Memory |
-- (tag) | | algorithm
-- ------- ---------- -------- -----------------------------
-- Grid: meryl 42.000 GB 8 CPUs (k-mer counting)
-- Grid: hap 16.000 GB 20 CPUs (read-to-haplotype assignment)
-- Grid: cormhap 60.000 GB 12 CPUs (overlap detection with mhap)
-- Grid: obtovl 24.000 GB 8 CPUs (overlap detection)
-- Grid: utgovl 24.000 GB 8 CPUs (overlap detection)
-- Grid: cor 24.000 GB 4 CPUs (read correction)
-- Grid: ovb 4.000 GB 1 CPU (overlap store bucketizer)
-- Grid: ovs 32.000 GB 1 CPU (overlap store sorting)
-- Grid: red 42.000 GB 8 CPUs (read error detection)
-- Grid: oea 8.000 GB 1 CPU (overlap error adjustment)
-- Grid: bat 1009.000 GB 64 CPUs (contig construction with bogart)
-- Grid: cns -.--- GB 8 CPUs (consensus)
--
-- In 'error_correct.seqStore', found Nanopore reads:
-- Nanopore: 5
--
-- Raw: 1
--
-- Generating assembly 'error_correct' in '/data/dario/canu_sikem/Sikem_ONT_EC':
-- - correct raw reads.
-- - trim corrected reads.
-- - assemble corrected and trimmed reads.
--
-- Parameters:
--
-- genomeSize 5300000000
--
-- Overlap Generation Limits:
-- corOvlErrorRate 0.3200 ( 32.00%)
-- obtOvlErrorRate 0.1200 ( 12.00%)
-- utgOvlErrorRate 0.1200 ( 12.00%)
--
-- Overlap Processing Limits:
-- corErrorRate 0.5000 ( 50.00%)
-- obtErrorRate 0.1200 ( 12.00%)
-- utgErrorRate 0.1200 ( 12.00%)
-- cnsErrorRate 0.2000 ( 20.00%)
--
--
-- BEGIN CORRECTION
--
--
-- Creating overlap store correction/error_correct.ovlStore using:
-- 751 buckets
-- 3838 slices
-- using at most 29 GB memory each
--
-- Overlap store bucketizer jobs failed, retry.
-- job correction/error_correct.ovlStore.BUILDING/bucket0190 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0191 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0192 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0193 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0194 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0195 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0196 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0197 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0198 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0199 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0200 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0201 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0202 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0203 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0204 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0205 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0206 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0207 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0208 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0209 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0210 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0211 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0212 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0213 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0214 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0215 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0216 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0217 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0218 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0219 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0220 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0221 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0222 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0223 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0224 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0225 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0226 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0227 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0228 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0229 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0230 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0231 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0232 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0233 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0234 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0235 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0236 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0237 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0238 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0239 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0240 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0241 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0242 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0243 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0244 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0245 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0246 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0247 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0248 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0249 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0250 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0251 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0252 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0253 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0254 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0255 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0256 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0257 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0258 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0259 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0260 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0261 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0262 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0263 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0264 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0265 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0266 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0267 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0268 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0269 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0270 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0271 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0272 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0273 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0274 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0275 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0276 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0277 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0278 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0279 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0280 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0281 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0282 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0283 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0284 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0285 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0286 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0287 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0288 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0289 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0290 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0291 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0292 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0293 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0294 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0295 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0296 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0297 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0298 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0299 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0300 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0301 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0302 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0303 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0304 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0305 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0306 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0307 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0308 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0309 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0310 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0311 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0312 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0313 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0314 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0315 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0316 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0317 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0318 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0319 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0320 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0346 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0347 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0348 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0349 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0350 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0351 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0352 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0353 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0354 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0355 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0356 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0357 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0358 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0359 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0360 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0361 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0362 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0363 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0364 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0365 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0366 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0367 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0368 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0369 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0370 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0371 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0372 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0373 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0374 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0375 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0376 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0377 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0378 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0379 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0380 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0381 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0382 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0383 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0384 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0385 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0386 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0387 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0388 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0389 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0390 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0391 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0392 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0393 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0394 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0395 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0396 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0397 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0398 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0399 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0400 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0401 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0402 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0403 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0404 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0405 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0406 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0407 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0408 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0409 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0410 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0411 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0412 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0413 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0414 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0415 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0416 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0417 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0418 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0419 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0420 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0421 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0422 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0423 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0424 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0425 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0426 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0427 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0428 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0429 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0430 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0431 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0432 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0433 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0434 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0435 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0436 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0437 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0438 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0439 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0440 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0441 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0442 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0443 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0444 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0445 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0446 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0447 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0448 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0449 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0450 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0451 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0452 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0454 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0455 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0457 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0458 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0460 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0461 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0463 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0464 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0466 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0467 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0469 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0470 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0472 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0473 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0475 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0476 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0517 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0520 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0524 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0531 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0537 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0538 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0544 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0545 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0552 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0558 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0562 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0564 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0568 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0572 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0578 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0580 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0590 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0592 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0596 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0598 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0600 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0602 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0604 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0606 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0608 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0610 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0618 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0620 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0628 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0632 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0634 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0636 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0638 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0642 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0644 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0646 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0656 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0662 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0664 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0666 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0668 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0670 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0672 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0674 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0676 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0677 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0678 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0679 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0680 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0681 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0682 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0683 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0684 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0685 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0686 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0687 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0688 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0689 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0690 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0691 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0692 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0693 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0694 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0695 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0696 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0697 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0698 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0699 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0700 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0701 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0702 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0704 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0705 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0708 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0710 FAILED.
-- job correction/error_correct.ovlStore.BUILDING/bucket0737 FAILED.
--
--
-- Running jobs. Second attempt out of 2.
--
-- 'scripts/1-bucketize.jobSubmit-01.sh' -> job 342291 tasks 190-320.
-- 'scripts/1-bucketize.jobSubmit-02.sh' -> job 342292 tasks 346-452.
-- 'scripts/1-bucketize.jobSubmit-03.sh' -> job 342293 tasks 454-455.
-- 'scripts/1-bucketize.jobSubmit-04.sh' -> job 342294 tasks 457-458.
-- 'scripts/1-bucketize.jobSubmit-05.sh' -> job 342295 tasks 460-461.
-- 'scripts/1-bucketize.jobSubmit-06.sh' -> job 342296 tasks 463-464.
-- 'scripts/1-bucketize.jobSubmit-07.sh' -> job 342297 tasks 466-467.
-- 'scripts/1-bucketize.jobSubmit-08.sh' -> job 342298 tasks 469-470.
-- 'scripts/1-bucketize.jobSubmit-09.sh' -> job 342299 tasks 472-473.
-- 'scripts/1-bucketize.jobSubmit-10.sh' -> job 342300 tasks 475-476.
-- 'scripts/1-bucketize.jobSubmit-11.sh' -> job 342301 task 517.
-- 'scripts/1-bucketize.jobSubmit-12.sh' -> job 342302 task 520.
-- 'scripts/1-bucketize.jobSubmit-13.sh' -> job 342303 task 524.
-- 'scripts/1-bucketize.jobSubmit-14.sh' -> job 342304 task 531.
-- 'scripts/1-bucketize.jobSubmit-15.sh' -> job 342305 tasks 537-538.
-- 'scripts/1-bucketize.jobSubmit-16.sh' -> job 342306 tasks 544-545.
-- 'scripts/1-bucketize.jobSubmit-17.sh' -> job 342307 task 552.
-- 'scripts/1-bucketize.jobSubmit-18.sh' -> job 342308 task 558.
-- 'scripts/1-bucketize.jobSubmit-19.sh' -> job 342309 task 562.
-- 'scripts/1-bucketize.jobSubmit-20.sh' -> job 342310 task 564.
-- 'scripts/1-bucketize.jobSubmit-21.sh' -> job 342311 task 568.
-- 'scripts/1-bucketize.jobSubmit-22.sh' -> job 342312 task 572.
-- 'scripts/1-bucketize.jobSubmit-23.sh' -> job 342313 task 578.
-- 'scripts/1-bucketize.jobSubmit-24.sh' -> job 342314 task 580.
-- 'scripts/1-bucketize.jobSubmit-25.sh' -> job 342315 task 590.
-- 'scripts/1-bucketize.jobSubmit-26.sh' -> job 342316 task 592.
-- 'scripts/1-bucketize.jobSubmit-27.sh' -> job 342317 task 596.
-- 'scripts/1-bucketize.jobSubmit-28.sh' -> job 342318 task 598.
-- 'scripts/1-bucketize.jobSubmit-29.sh' -> job 342319 task 600.
-- 'scripts/1-bucketize.jobSubmit-30.sh' -> job 342320 task 602.
-- 'scripts/1-bucketize.jobSubmit-31.sh' -> job 342321 task 604.
-- 'scripts/1-bucketize.jobSubmit-32.sh' -> job 342322 task 606.
-- 'scripts/1-bucketize.jobSubmit-33.sh' -> job 342323 task 608.
-- 'scripts/1-bucketize.jobSubmit-34.sh' -> job 342324 task 610.
-- 'scripts/1-bucketize.jobSubmit-35.sh' -> job 342325 task 618.
-- 'scripts/1-bucketize.jobSubmit-36.sh' -> job 342326 task 620.
-- 'scripts/1-bucketize.jobSubmit-37.sh' -> job 342327 task 628.
-- 'scripts/1-bucketize.jobSubmit-38.sh' -> job 342328 task 632.
-- 'scripts/1-bucketize.jobSubmit-39.sh' -> job 342329 task 634.
-- 'scripts/1-bucketize.jobSubmit-40.sh' -> job 342330 task 636.
-- 'scripts/1-bucketize.jobSubmit-41.sh' -> job 342331 task 638.
-- 'scripts/1-bucketize.jobSubmit-42.sh' -> job 342332 task 642.
-- 'scripts/1-bucketize.jobSubmit-43.sh' -> job 342333 task 644.
-- 'scripts/1-bucketize.jobSubmit-44.sh' -> job 342334 task 646.
-- 'scripts/1-bucketize.jobSubmit-45.sh' -> job 342335 task 656.
-- 'scripts/1-bucketize.jobSubmit-46.sh' -> job 342336 task 662.
-- 'scripts/1-bucketize.jobSubmit-47.sh' -> job 342337 task 664.
-- 'scripts/1-bucketize.jobSubmit-48.sh' -> job 342338 task 666.
-- 'scripts/1-bucketize.jobSubmit-49.sh' -> job 342339 task 668.
-- 'scripts/1-bucketize.jobSubmit-50.sh' -> job 342340 task 670.
-- 'scripts/1-bucketize.jobSubmit-51.sh' -> job 342341 task 672.
-- 'scripts/1-bucketize.jobSubmit-52.sh' -> job 342342 task 674.
-- 'scripts/1-bucketize.jobSubmit-53.sh' -> job 342343 tasks 676-702.
-- 'scripts/1-bucketize.jobSubmit-54.sh' -> job 342344 tasks 704-705.
-- 'scripts/1-bucketize.jobSubmit-55.sh' -> job 342345 task 708.
-- 'scripts/1-bucketize.jobSubmit-56.sh' -> job 342346 task 710.
-- 'scripts/1-bucketize.jobSubmit-57.sh' -> job 342347 task 737.
--
----------------------------------------
-- Starting command on Tue Sep 14 10:52:41 2021 with 0.005 GB free disk space
cd /data/dario/canu_sikem/Sikem_ONT_EC
qsub \
-hold_jid 342291,342292,342293,342294,342295,342296,342297,342298,342299,342300,342301,342302,342303,342304,342305,342306,342307,342308,342309,342310,342311,342312,342313,342314,342315,342316,342317,342318,342319,342320,342321,342322,342323,342324,342325,342326,342327,342328,342329,342330,342331,342332,342333,342334,342335,342336,342337,342338,342339,342340,342341,342342,342343,342344,342345,342346,342347 \
-pe smp 1 \
-l mem_free=5g \
-V \
-q all.q \
-S /bin/bash \
-cwd \
-N 'canu_error_correct' \
-j y \
-o canu-scripts/canu.08.out canu-scripts/canu.08.sh
Your job 342348 ("canu_error_correct") has been submitted
-- Finished on Tue Sep 14 10:52:41 2021 (fast as lightning) with 0.005 GB free disk space !!! WARNING !!!
----------------------------------------
$
with 0.005 GB free disk space
:-(
A bunch of random comments, then I'll get to how to fix things in the next reply.
You are in the first phase of constructing the store. This is (nearly) the maximum disk usage of canu. But it doesn't really matter since you're out of space anyway.
Killing the pending canu
job, then killing all the other jobs will stop canu. Canu tries to run jobs twice, so it would have stopped anyway after this batch. It's smart enough to not rerun successfully completed jobs.
Based on the logs, canu is finding the grid. Canu does tell the grid about memory and cpu requirements for each job ("-l mem_free=5g" in the qsub command). I recall that "mem_free=5g" just instructs SGE to start the job if the node currently has 5GB free memory. It does not reserve 5GB of memory for the job. There was likely some other large job on node 9 and the combined memory load was more than 1 TB.
It is possible to set up a 'consumable' for memory in SGE. Each node is configured (in SGE) with a resource called 'memory' set to however much memory that node physically has. A qsub with -l memory=5g
would then subtract 5 from this resource before starting the job. This works on the honor system - jobs can exceed the memory request with no penalty, but properly behaving jobs won't exhaust memory. For example, 100 jobs wanting 1 core and 100gb memory each will only be able to run 10 jobs on the 1 TB node, even though most of the CPUs are idle.
The poor man's substitute is to use qsub's -tc
option to limit the number of concurrently running jobs. Set this in either gridOptions="-tc 50"
to limit all canu jobs to 50 at once, or specifically for the store building gridOptionsovb="-tc 50" gridOptionsovs="-tc 50"
. This could still dump all jobs on one node, but I don't know of a way to prevent that without changing the grid configuration itself.
OK, onto fixing!
Clone and compile the latest canu code from github:
git clone https://github.com/marbl/canu.git
cd canu/src
gmake -j 12
In error_correct/correcvtion/1-overlapper:
mv results results-orig
chmod 444 results-orig/*
chmod 555 results-orig
mkdir results
For each ovb in results-orig
run:
${NEWCANU}/build/bin/overlapImport \
-ovb -raw \
-S ../../*.seqStore \
-minreadlength 3000 \
-minoverlaplength 2750 \
-o results/000001.ovb \
results-orig/000001.ovb
changing minreadlength and minoverlaplength as you see fit. These jobs use very little memory but will do lots of I/O. A summary of the processing performed is written to stderr:
Overlaps processed:
5469966
Overlaps discarded:
970159 17.74% - overlap length too short
166479 3.04% - read length too short
1305150 23.86% - both lengths too short
Overlaps output:
3028178 55.36%
Bye.
At the end, you should have smaller .ovb files; the .oc files will be the same size:
> ls -l results*/000001*
-r--r--r-- 1 bri bri 2332964 Sep 14 13:08 results-orig/000001.oc
-r--r--r-- 1 bri bri 106214194 Sep 14 13:08 results-orig/000001.ovb
-rw-r--r-- 1 bri bri 2332964 Sep 14 19:24 results/000001.oc
-rw-r--r-- 1 bri bri 57672580 Sep 14 19:24 results/000001.ovb
When they're all filtered, restart canu. Remember to remove the existing ovlStore config files!
Idle
Just to be helpful to other people who find this thread I will post my experience using @brianwalenz advice above. Thank you for the explanation, by the way, @brianwalenz.
My run with a 1n genome size of ~1.5Gb, probably around 3% heterozygosity, and 283Gb of PacBio CLR reads was slated to take up at least 45 TB of disk space just for overlaps using the command canu genomeSize=1.5g -pacbio *.fa.gz maxThreads=80
.
My histogram looked like this:
-- G=283702574793 sum of || length num
-- NG length index lengths || range seqs
-- ----- ------------ --------- ------------ || ------------------- -------
-- 00010 60495 382548 28370285892 || 1000-6975 4221867|---------------------------------------------------------------
-- 00020 48602 910725 56740558112 || 6976-12951 2576491|---------------------------------------
-- 00030 41243 1546784 85110796627 || 12952-18927 1976670|------------------------------
-- 00040 35785 2286700 113481045879 || 18928-24903 1556230|------------------------
-- 00050 31070 3137201 141851287466 || 24904-30879 1253885|-------------------
-- 00060 26236 4128970 170221548701 || 30880-36855 1054641|----------------
-- 00070 21270 5327427 198591809503 || 36856-42831 739232|------------
-- 00080 16136 6851446 226962071022 || 42832-48807 482898|--------
-- 00090 10245 9021094 255332325871 || 48808-54783 316295|-----
-- 00100 1000 14759347 283702574793 || 54784-60759 205808|----
-- 001.000x 14759348 283702574793 || 60760-66735 133923|--
-- || 66736-72711 86852|--
-- || 72712-78687 55617|-
-- || 78688-84663 35017|-
-- || 84664-90639 22320|-
-- || 90640-96615 14326|-
-- || 96616-102591 9140|-
-- || 102592-108567 6155|-
I will update when I try some new parameters to not use shorter reads and to increase the minimum overlap length.
Hello,
I am running canu 2.1.1 on a 2.7 Gb highly heterozygous plant genome, starting from ONT reads (N50 about 50 kb, 75x coverage, or 37x per allele). The command used is
canu -d Sikem_ONT_EC -p error_correct genomeSize=5.3g corMaxEvidenceErate=0.15 corMhapFilterThreshold=0.0000000002 corMhapOptions="--threshold 0.80 --num-hashes 512 --num-min-matches 3 --ordered-sketch-size 1000 --ordered-kmer-size 14 --min-olap-length 2000 --repeat-idf-scale 50" mhapMemory=60g mhapBlockSize=500 ovlMerDistinct=0.975 useGrid=true gridEngineResourceOption="-pe smp THREADS -l mem_free=MEMORY" gridOptions="-V -q all.q -S /bin/bash" -nanopore reads.fq.gz
(we added the parameters that suppress repeat hits)The goal is to at least error correct the ONT reads in an allele-specific way (thanks to the high sequencing coverage) - this is why we put the size of the diploid genome. After about 3 weeks running on our cluster, it stopped due to no space left on the device. This is the size of the correction folder:
and the *.ovlStore.BUILDING/1-bucketize.successs file is not there yet.
I wonder if we can change some parameters (also given the large read N50) to have a smaller footprint and complete the correction step. Maybe
--min-olap-length 2000
to 5000? What else would you tweak? Would it be possible to resume not from the very beginning?Also, all the jobs are now submitted to one node and this is what our sys admin saw: "It spawned a bunch of 1 CPU processes and really tanked the cluster. I think I may need to kill the canu run and/or reboot n009. " This is the usage for this job:
How can we kill those jobs? Thanks, Dario