PacificBiosciences / FALCON

FALCON: experimental PacBio diploid assembler -- Out-of-date -- Please use a binary release: https://github.com/PacificBiosciences/FALCON_unzip/wiki/Binaries
https://github.com/PacificBiosciences/FALCON_unzip/wiki/Binaries
Other
204 stars 103 forks source link

pb-assembly fails at 2-asm-falcon stage with unclear error #701

Closed charlesfeigin closed 4 years ago

charlesfeigin commented 4 years ago

Running pb-assembly locally using settings in cfg file in the attachment. Run went smoothly until 2-asm-falcon stage, but failed with an error that I don't really understand. End of the stderr file (also attached) says:

time python -m falcon_kit.mains.collect_pread_gfa --add-string-graph >| sg.gfa.json

python -m falcon_kit.mains.collect_pread_gfa --add-string-graph
falcon-kit 1.3.0
pypeflow 2.2.0
Traceback (most recent call last):
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/runpy.py", line 174, in _run_module_as_main
"main", fname, loader, pkg_name)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/runpy.py", line 72, in _run_code
exec code in run_globals
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/mains/collect_pread_gfa.py", line 243, in
main()
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/mains/collect_pread_gfa.py", line 240, in main
run(sys.stdout, **vars(args))
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/mains/collect_pread_gfa.py", line 196, in run
add_string_graph_to_gfa(gfa_graph, sg_edges_list, utg_data, ctg_paths, preads_dict, preads_overlap_dict, sg_edges_dict)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/mains/collect_pread_gfa.py", line 129, in add_string_graph_to_gfa
asm_graph = AsmGraph(sg_edges_list, utg_data, ctg_paths)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/fc_asm_graph.py", line 33, in init
self.build_node_map()
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/fc_asm_graph.py", line 169, in build_node_map
sg = self.get_sg_for_ctg(ctg_id)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/falcon_kit/fc_asm_graph.py", line 156, in get_sg_for_ctg
nx.add_path(sg, one_path)
AttributeError: 'module' object has no attribute 'add_path'
2019-11-21 08:21:02,683 - root - WARNING - Call '/bin/bash user_script.sh' returned 256.
2019-11-21 08:21:02,683 - root - INFO - CD: '/home/myusername/data/genomes/my_species/2-asm-falcon' -> '/home/myusername/data/genomes/my_species/2-asm-falcon'
2019-11-21 08:21:02,683 - root - INFO - CD: '/home/myusername/data/genomes/my_species/2-asm-falcon' -> '/home/myusername/data/genomes/my_species/2-asm-falcon'
2019-11-21 08:21:02,684 - root - CRITICAL - Error in /home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py with args="{'json_fn': '/home/myusername/data/genomes/my_species/2-asm-falcon/task.json',\n 'timeout': 30,\n 'tmpdir': None}"
Traceback (most recent call last):
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/runpy.py", line 174, in _run_module_as_main
"main", fname, loader, pkg_name)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/runpy.py", line 72, in _run_code
exec code in run_globals
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py", line 281, in
main() File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py", line 273, in main
run(**vars(parsed_args))
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py", line 267, in run
run_cfg_in_tmpdir(cfg, tmpdir, '.')
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py", line 242, in run_cfg_in_tmpdir
run_bash(bash_template, myinputs, myoutputs, parameters)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/do_task.py", line 201, in run_bash
util.system(cmd)
File "/home/myusername/.conda/envs/denovo_asm/lib/python2.7/site-packages/pypeflow/io.py", line 29, in syscall
raise Exception(msg)
Exception: Call '/bin/bash user_script.sh' returned 256.
+++ pwd
++ echo 'FAILURE. Running top in /home/myusername/data/genomes/my_species/2-asm-falcon (If you see -terminal database is inaccessible- you are using the python bin-wrapper, so you will not get diagnostic info. No big deal. This process is crashing anyway.)'
++ rm -f top.txt
++ which python
++ which top
++ env -u LD_LIBRARY_PATH top -b -n 1
++ env -u LD_LIBRARY_PATH top -b -n 1
++ pstree -apl

real 25m15.321s user 51m59.876s sys 4m7.425s

finish
echo 'finish code: 1'

Is this because of a missing python module? It seems like python can't find something it should be able to but I don't really know the language. Is it possible to resume this run without starting from the beginning? Thanks so much for your help, I'm happy to provide any other information that may be helpful.

my_pb-assembly_run.zip

pb-cdunn commented 4 years ago

Your networkx is inconsistent with your python code. We no longer support python2.7. Please switch to python3.7 and update all.

pb-cdunn commented 4 years ago

Also, we generally don't look at these old repos anymore. Please see https://github.com/PacificBiosciences/pbbioconda