Closed isaacovercast closed 4 years ago
This is fucking weird:
-rw-r--r-- 1 mporter columbuslab 33554432 Jun 11 01:40 15430_C.htemp
-rw-r--r-- 1 mporter columbuslab 33554432 Jun 11 02:54 15421_I.htemp
Different runs produce the same behavior with different samples?!?!
In looking at the *clust_0.8 folder I spotted two new files with identical size. As you pointed out, this is very unlikely. Note that this is not the same two files as lasttime, though one individual is the same.
-rw-r--r-- 1 mporter columbuslab 16777216 Jun 11 12:09 15421_I.utemp
-rw-r--r-- 1 mporter columbuslab 16777216 Jun 11 12:01 15424_J.utemp
/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS$ grep DEBUG ipyrad_log.txt | tail
2018-06-12 18:37:52,555 pid=6571 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15501_I_derep.fastq', '-strand', 'plus', '-query_cov', '0.75', '-id', '0.8', '-minsl', '0.5', '-userout', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15501_I.utemp', '-userfields', 'query+target+id+gaps+qstrand+qcov', '-maxaccepts', '1', '-maxrejects', '0', '-threads', '4', '-notmatched', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15501_I.htemp', '-fasta_width', '0', '-fastq_qmax', '100', '-fulldp', '-usersort']
2018-06-12 18:37:52,560 pid=6581 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15424_J_derep.fastq', '-strand', 'plus', '-query_cov', '0.75', '-id', '0.8', '-minsl', '0.5', '-userout', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15424_J.utemp', '-userfields', 'query+target+id+gaps+qstrand+qcov', '-maxaccepts', '1', '-maxrejects', '0', '-threads', '4', '-notmatched', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15424_J.htemp', '-fasta_width', '0', '-fastq_qmax', '100', '-fulldp', '-usersort']
2018-06-12 18:37:52,569 pid=6570 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15430_C_derep.fastq', '-strand', 'plus', '-query_cov', '0.75', '-id', '0.8', '-minsl', '0.5', '-userout', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15430_C.utemp', '-userfields', 'query+target+id+gaps+qstrand+qcov', '-maxaccepts', '1', '-maxrejects', '0', '-threads', '4', '-notmatched', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15430_C.htemp', '-fasta_width', '0', '-fastq_qmax', '100', '-fulldp', '-usersort']
2018-06-12 18:37:52,556 pid=34702 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15537_G_derep.fastq', '-strand', 'plus', '-query_cov', '0.75', '-id', '0.8', '-minsl', '0.5', '-userout', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15537_G.utemp', '-userfields', 'query+target+id+gaps+qstrand+qcov', '-maxaccepts', '1', '-maxrejects', '0', '-threads', '4', '-notmatched', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15537_G.htemp', '-fasta_width', '0', '-fastq_qmax', '100', '-fulldp', '-usersort']
2018-06-12 18:37:52,560 pid=2577 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15500_C_derep.fastq', '-strand', 'plus', '-query_cov', '0.75', '-id', '0.8', '-minsl', '0.5', '-userout', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15500_C.utemp', '-userfields', 'query+target+id+gaps+qstrand+qcov', '-maxaccepts', '1', '-maxrejects', '0', '-threads', '4', '-notmatched', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_clust_0.8/15500_C.htemp', '-fasta_width', '0', '-fastq_qmax', '100', '-fulldp', '-usersort']
2018-06-12 18:37:52,574 pid=24784 [cluster_within.py] DEBUG ['/rhome/mporter/miniconda2/lib/python2.7/site-packages/bin/vsearch-linux-x86_64', '-cluster_smallmem', '/bigdata/columbuslab/mporter/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80_edits/15426_J_derep.fastq', '-strand', 'p
ipyrad [v.0.7.19]
Interactive assembly and analysis of RAD-seq data
loading Assembly: opuntia_TEST80
from saved path: ~/bigdata/ddRAD/opuntia_ipyrad/TESTS/opuntia_TEST80.json
establishing parallel connection:
host compute node: [10 cores] on c08
host compute node: [10 cores] on c01
host compute node: [10 cores] on c05
host compute node: [10 cores] on c07
Step 3: Clustering/Mapping reads
[####################] 100% dereplicating | 0:00:36
[################ ] 83% clustering | 14:51:37
I am going to assume this is some kind of non-deterministic behavior from the old version which is not replicatable in the new version.