Closed DH1983 closed 5 years ago
@DH1983 It appears that PIPITS can't find your the uchime reference dataset.
Can you make sure you have uchime_reference_dataset_28.06.2017.fasta
in $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017
?
You can do so by:
ls $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
You probably missed a step or two, but please do have a look at "A3. Reference datasets" in the manual. https://github.com/hsgweon/pipits#a3-reference-datasets
Thank you very much for your quick answer.
I have checked that the reference dataset is placed in the directory that PIPITS are trying to find by doing what you have instructed me.
This is what i have got from the terminal in my computer (pipits_env) lh001@lh001-300E4C-300E5C-300E7C:~/Desktop/PIPITS_Trial$ ls $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta /home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
However, I still got the same error message...
I supposed the problem arises from the A4 stage.
Would please checked if I do correctly?
This is how I did. I edited the file that is located in /home/environment by adding the following commands export PIPITS_UNITE_RETRAINED_DIR=$HOME/pipits/refdb/UNITE_retrained export PIPITS_UNITE_REFERENCE_DATA_CHIMERA=$HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta export PIPITS_WARCUP_RETRAINED_DIR=$HOME/pipits/refdb/warcup_retrained_V2 and saved, and subsequently restarted the terminal.
Could please tell me where the problem is arising from?
Thank you again for your help.
DongHyeon
Can you try:
echo $PIPITS_UNITE_REFERENCE_DATA_CHIMERA
and see if you get:
/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
Hi, Hyun,
This is what I got from the terminal.
lh001@lh001-300E4C-300E5C-300E7C:~$ echo $PIPITS_UNITE_REFERENCE_DATA_CHIMERA $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
Can you try and let me know the result?
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
Here, What I got..
lh001@lh001-300E4C-300E5C-300E7C:~/pipits$ vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1 vsearch v2.7.1_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch Unable to open file out_process/intermediate/input_nr_otus_nonchimeras.fasta for writing
Let's start this from the beginning:
Can you try this again after running pipits_funits?
pipits_process -i out_funits/ITS.fasta -o out_process -v -r
We will get this sorted!!
Thank you for your time, I will try it out!
Hi Hyun, Below is what I got from the log file after I ran it
[91m2018-09-10 19:27:14[0m [0mpipits_process started[0m [91m2018-09-10 19:27:14[0m [0mGenerating a sample list from the input sequences[0m [91m2018-09-10 19:27:14[0m [0mDereplicating and removing unique sequences prior to picking OTUs[0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Reading file out_funits/ITS.fasta 100% 8584 nt in 52 seqs, min 144, max 275, avg 165 Dereplicating 100% Sorting 100% 15 unique sequences, avg cluster 3.5, median 2, max 12 Writing output file 100% 9 uniques written, 6 clusters discarded (40.0%) [91m2018-09-10 19:27:14[0m [0mPicking OTUs [VSEARCH][0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Reading file out_process/intermediate/input_nr.fasta 100% 1447 nt in 9 seqs, min 147, max 187, avg 161 Masking 100% Sorting by length 100% Counting k-mers 100% Clustering 100% Sorting clusters 100% Writing clusters 100% Clusters: 6 Size min 1, max 3, avg 1.5 Singletons: 4, 44.4% of seqs, 66.7% of clusters [91m2018-09-10 19:27:14[0m [0mRemoving chimeras [VSEARCH][0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Unable to open file for reading ($HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta) [91m2018-09-10 19:27:14[0m Error: None zero returncode: vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
Ok, now can you try:
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
Here what I got in this time.
(pipits_env) lh001@lh001-300E4C-300E5C-300E7C:~/Desktop/PIPITS_Trial$ vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1 vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Unable to open file for reading ($HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta)
What about:
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db ~/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
It has run it through, and the last block that I have from the terminal is below.
16786547 nt in 30555 seqs, min 146, max 2570, avg 549
Masking 100%
Counting k-mers 100%
Creating k-mer index 100%
Detecting chimeras 100%
Found 0 (0.0%) chimeras, 6 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 6 unique sequences.
Taking abundance information into account, this corresponds to
0 (0.0%) chimeras, 37 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 37 total sequences.
Hmm... are you on Ubuntu or macOS?
I am running in Ubuntu.
Can you try this, and see if this works?
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
This seems working too.
Here is what I got from the terminal
_reference_dataset_28.06.2017.fasta 100%
16786547 nt in 30555 seqs, min 146, max 2570, avg 549
Masking 100%
Counting k-mers 100%
Creating k-mer index 100%
Detecting chimeras 100%
Found 0 (0.0%) chimeras, 6 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 6 unique sequences.
Taking abundance information into account, this corresponds to
0 (0.0%) chimeras, 37 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 37 total sequences.
This is weird... can you confirm that the output from:
echo $PIPITS_UNITE_REFERENCE_DATA_CHIMERA
is the same as
ls $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
For the first command
and for the second one (pipits_env) lh001@lh001-300E4C-300E5C-300E7C:~/Desktop/PIPITS_Trial$ ls $HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta /home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
So, it is different each other..
For the second one, The directory is located in /home/lh001/... but the first one, /HOME/pipits/...
How crazy... Try this? (going around in circles now...)
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
In this time,
Unable to open file for reading ($HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta)
Can you try:
conda install vsearch=2.8.0 -y
and then try:
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
Hi, Hyun,
I try it with 'conda install vsearch=2.8.0 -y', and the message is # All requested packages already installed.
I have read one of the posts, and for the bug from vsearch, if I remember correctly, I downgraded as followed by your instructions.
And I still got the error message after I ran it...
Unable to open file for reading ($HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta)
Probably some really trivial thing that we are missing here...
Can you (in the same terminal):
export $PIPITS_UNITE_REFERENCE_DATA_CHIMERA=/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
And then try:
vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
And, please re-check that /home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
really exists, by seeing its content:
less /home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
If I put 'export ...' I got this error message.. (pipits_env) lh001@lh001-300E4C-300E5C-300E7C:~/Desktop/PIPITS_Trial$ export $PIPITS_UNITE_REFERENCE_DATA_CHIMERA=/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta bash: export: `$HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta=/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta': not a valid identifier
but If I try to find out the data really exists by putting less /home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
it DOES...
Ooops my bad. It should be:
export PIPITS_UNITE_REFERENCE_DATA_CHIMERA=/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
In this time, it has all run it through.
And the last block that I see is below.
_dataset_28.06.2017.fasta 100%
16786547 nt in 30555 seqs, min 146, max 2570, avg 549
Masking 100%
Counting k-mers 100%
Creating k-mer index 100%
Detecting chimeras 100%
Found 0 (0.0%) chimeras, 6 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 6 unique sequences.
Taking abundance information into account, this corresponds to
0 (0.0%) chimeras, 37 (100.0%) non-chimeras,
and 0 (0.0%) borderline sequences in 37 total sequences.
Great! how about the whole process? pipits_process? if it works, then might be useful to put corrected PATH into .bashrc as per instruction.
export PIPITS_UNITE_RETRAINED_DIR=/home/lh001/pipits/refdb/UNITE_retrained
export PIPITS_UNITE_REFERENCE_DATA_CHIMERA=/home/lh001/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta
export PIPITS_WARCUP_RETRAINED_DIR=/home/lh001/pipits/refdb/warcup_retrained_V2
Thank you very much, Hyun. It is now running completely. I am really appreciated for your time. Best regards, DongHyeon.
I don’t exactly know why you are seeing a such behaviour, but hope this solution will do for your data (well for now anyway). Good luck with the rest of your analysis and do let me know if you have any other issues.
From: DH1983 notifications@github.com Sent: Monday, September 10, 2018 12:35:00 PM To: hsgweon/pipits Cc: H. Soon Gweon; Comment Subject: Re: [hsgweon/pipits] Stucked in the stage 'PIPITS_PROCESS' (#18)
Thank you very much, Hyun. It is now running completely. I am really appreciated for your time. Best regards, DongHyeon.
— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://github.com/hsgweon/pipits/issues/18#issuecomment-419882467, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AJtTUh58ceZn1FB4zt3_2ZKThTYXCMN1ks5uZk5kgaJpZM4Wg9Uz.
Thank you for your note again, I will now try with my data. Hope you have a great rest of the day. Best, DongHyeon
Hi, Hyun,
I followed your instructions, but I can not move forward from the stage 'PIPITS_PROCESS'.
I am running a test with your test file, and my problem is indicated below.
pipits_process 2.2, the PIPITS Project https://github.com/hsgweon/pipits
[91m2018-09-10 17:27:16[0m [0mpipits_process started[0m [91m2018-09-10 17:27:16[0m [0mGenerating a sample list from the input sequences[0m [91m2018-09-10 17:27:16[0m [0mDereplicating and removing unique sequences prior to picking OTUs[0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Reading file out_funits/ITS.fasta 100% 8584 nt in 52 seqs, min 144, max 275, avg 165 Dereplicating 100% Sorting 100% 15 unique sequences, avg cluster 3.5, median 2, max 12 Writing output file 100% 9 uniques written, 6 clusters discarded (40.0%) [91m2018-09-10 17:27:16[0m [0mPicking OTUs [VSEARCH][0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Reading file out_process/intermediate/input_nr.fasta 100% 1447 nt in 9 seqs, min 147, max 187, avg 161 Masking 100% Sorting by length 100% Counting k-mers 100% Clustering 100% Sorting clusters 100% Writing clusters 100% Clusters: 6 Size min 1, max 3, avg 1.5 Singletons: 4, 44.4% of seqs, 66.7% of clusters [91m2018-09-10 17:27:16[0m [0mRemoving chimeras [VSEARCH][0m vsearch v2.8.0_linux_x86_64, 7.6GB RAM, 4 cores https://github.com/torognes/vsearch
Unable to open file for reading ($HOME/pipits/refdb/uchime_reference_dataset_28.06.2017/uchime_reference_dataset_28.06.2017.fasta) [91m2018-09-10 17:27:16[0m Error: None zero returncode: vsearch --uchime_ref out_process/intermediate/input_nr_otus.fasta --db $PIPITS_UNITE_REFERENCE_DATA_CHIMERA --nonchimeras out_process/intermediate/input_nr_otus_nonchimeras.fasta --threads 1
I am a beginner, so would you please let me know in details, so that I can follow it.
Many thanks in advance.
DongHyeon