Hi! I'm running older liftoff (the one that runs straight from the python file) and have encountered some issues trying to get separate executions to run in parallel on cluster computing services. for reference, the parallel commands are
i have used cluster commands to set maximum memory and disk space usage for each process, but I encounter this error:
[W::sam_hdr_create] Ignored @SQ line with missing SN: tag
[E::sam_hrecs_update_hashes] Header includes @SQ line with no SN: tag
[E::sam_parse1] failed to parse header
[W::sam_read1] Parse error at line 169448
Traceback (most recent call last):
File "/global/scratch/users/lojohnathan6/old_liftoff/liftoff.py", line 121, in
main()
File "/global/scratch/users/lojohnathan6/old_liftoff/liftoff.py", line 72, in main
feature_db, parent_features, intermediate_features, children_features, parent_order = liftover_types.lift_original_annotation(
File "/global/scratch/users/lojohnathan6/old_liftoff/liftover_types.py", line 15, in lift_original_annotation
aligned_segments = align_features.align_features_to_target(ref_chroms, target_chroms, processes, target_fasta,
File "/global/scratch/users/lojohnathan6/old_liftoff/align_features.py", line 29, in align_features_to_target
aligned_segments = parse_alignment(file, parent_dict, children_dict, unmapped_features, search_type)
File "/global/scratch/users/lojohnathan6/old_liftoff/align_features.py", line 95, in parse_alignment
for ref_seq in sam_file_iter:
File "pysam/libcalignmentfile.pyx", line 2187, in pysam.libcalignmentfile.IteratorRowAll.next
OSError: truncated file
among others indicating that the process was killed due to memory use. I have tried to rectify this using all sorts of memory configurations, however, the issue remains. Also, it runs fine when running only one liftoff process at a time, regardless of memory configuration.
Hi! I'm running older liftoff (the one that runs straight from the python file) and have encountered some issues trying to get separate executions to run in parallel on cluster computing services. for reference, the parallel commands are
python /global/liftoff/liftoff.py -t species/species.fna -r ref/ref.fna -db ref/ref.gff_db -o species/species.gff
i have used cluster commands to set maximum memory and disk space usage for each process, but I encounter this error:
[W::sam_hdr_create] Ignored @SQ line with missing SN: tag [E::sam_hrecs_update_hashes] Header includes @SQ line with no SN: tag [E::sam_parse1] failed to parse header [W::sam_read1] Parse error at line 169448 Traceback (most recent call last): File "/global/scratch/users/lojohnathan6/old_liftoff/liftoff.py", line 121, in
main()
File "/global/scratch/users/lojohnathan6/old_liftoff/liftoff.py", line 72, in main
feature_db, parent_features, intermediate_features, children_features, parent_order = liftover_types.lift_original_annotation(
File "/global/scratch/users/lojohnathan6/old_liftoff/liftover_types.py", line 15, in lift_original_annotation
aligned_segments = align_features.align_features_to_target(ref_chroms, target_chroms, processes, target_fasta,
File "/global/scratch/users/lojohnathan6/old_liftoff/align_features.py", line 29, in align_features_to_target
aligned_segments = parse_alignment(file, parent_dict, children_dict, unmapped_features, search_type)
File "/global/scratch/users/lojohnathan6/old_liftoff/align_features.py", line 95, in parse_alignment
for ref_seq in sam_file_iter:
File "pysam/libcalignmentfile.pyx", line 2187, in pysam.libcalignmentfile.IteratorRowAll.next
OSError: truncated file
among others indicating that the process was killed due to memory use. I have tried to rectify this using all sorts of memory configurations, however, the issue remains. Also, it runs fine when running only one liftoff process at a time, regardless of memory configuration.