ridgelab / JustOrthologs

35 stars 16 forks source link

use test human.fa and bonbo.fa cannot use JustOrthologs #4

Closed wangjie1201 closed 5 years ago

ridgelab commented 5 years ago

Hello, The error seems to indicate that justOrthologs.py is an html file for you. Did you you clone JustOrthologs into your local directory using git clone? If that doesn't work, do you mind sending me the command that you used and the justOrthologs.py script so I can ensure that you have the correct version?

wangjie1201 commented 5 years ago

Hello, Thank you for give me the solution.The error had solved.But i have another question, if I want to use your gff3parser.py to make each CDS region followed by an asterisk(*),i need to give the FASTA header lines must be formatted in one of two ways:>NC###### or >gi|####|ref|NC_#####? If i have the fasta header lines,such as Potri####,what should i do? Thanks again for your reply

------------------ 原始邮件 ------------------ 发件人: "ridgelab"notifications@github.com; 发送时间: 2019年6月25日(星期二) 凌晨1:59 收件人: "ridgelab/JustOrthologs"JustOrthologs@noreply.github.com; 抄送: "675103467"675103467@qq.com;"State change"state_change@noreply.github.com; 主题: Re: [ridgelab/JustOrthologs] use test human.fa and bonbo.fa cannotuse JustOrthologs (#4)

Hello, The error seems to indicate that justOrthologs.py is an html file for you. Did you you clone JustOrthologs into your local directory using git clone? If that doesn't work, do you mind sending me the command that you used and the justOrthologs.py script so I can ensure that you have the correct version?

— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub, or mute the thread.

ridgelab commented 5 years ago

I'm glad that worked for you. Currently we don't support Potri numbers, but you should be able to use >Potri#### as the header line as long as the Potri# is used the same way as the NC_# would be used in the gff3 file. The purpose of the gff3 parser is to extract the CDS regions from the fasta file and combine CDS regions belonging to the same gene by putting an * between each CDS region. For example, if gene 1 had one CDS region and gene 2 had three CDS regions, the output file would look something like this:

gene 1 CDS1 gene 2 CDS1CDS2CDS3

If you are able to get an output file in the same format (with or without the gff3_parser.py) then you can sort the file by the number of CDS regions (using sortFastaBySeqLen.sh) and run JustOrthologs.py on it. Hope that helps.

ridgelab commented 5 years ago

Looks like the * italicized the text. Let me try again for the output file:

`>gene 1 CDS1[asterisk]

gene 2 CDS1[asterisk]CDS2[asterisk]CDS3[asterisk]`

wangjie1201 commented 5 years ago

Thank you I will try it.You're so good

------------------ 原始邮件 ------------------ 发件人: "ridgelab"notifications@github.com; 发送时间: 2019年6月27日(星期四) 凌晨3:59 收件人: "ridgelab/JustOrthologs"JustOrthologs@noreply.github.com; 抄送: "675103467"675103467@qq.com;"State change"state_change@noreply.github.com; 主题: Re: [ridgelab/JustOrthologs] use test human.fa and bonbo.fa cannotuse JustOrthologs (#4)

Looks like the * italicized the text. Let me try again for the output file:

`>gene 1 CDS1[asterisk]

gene 2 CDS1[asterisk]CDS2[asterisk]CDS3[asterisk]`

— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub, or mute the thread.

wangjie1201 commented 5 years ago

I'm sorry to bother your again. Others can do it ,but i have another one can not do it.The file is attached. link: https://pan.baidu.com/s/1MLQK1bBzYZZPbC-DYi9Bkg Extract the code:3fbg I don't know what went wrong.

------------------ 原始邮件 ------------------ 发件人: "ridgelab"notifications@github.com; 发送时间: 2019年6月27日(星期四) 凌晨3:59 收件人: "ridgelab/JustOrthologs"JustOrthologs@noreply.github.com; 抄送: "675103467"675103467@qq.com;"State change"state_change@noreply.github.com; 主题: Re: [ridgelab/JustOrthologs] use test human.fa and bonbo.fa cannotuse JustOrthologs (#4)

Looks like the * italicized the text. Let me try again for the output file:

`>gene 1 CDS1[asterisk]

gene 2 CDS1[asterisk]CDS2[asterisk]CDS3[asterisk]`

— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub, or mute the thread.

wangjie1201 commented 5 years ago

I tried again, and it worked. Thank you very much for your help

------------------ 原始邮件 ------------------ 发件人: "ridgelab"notifications@github.com; 发送时间: 2019年6月27日(星期四) 凌晨3:59 收件人: "ridgelab/JustOrthologs"JustOrthologs@noreply.github.com; 抄送: "675103467"675103467@qq.com;"State change"state_change@noreply.github.com; 主题: Re: [ridgelab/JustOrthologs] use test human.fa and bonbo.fa cannotuse JustOrthologs (#4)

Looks like the * italicized the text. Let me try again for the output file:

`>gene 1 CDS1[asterisk]

gene 2 CDS1[asterisk]CDS2[asterisk]CDS3[asterisk]`

— You are receiving this because you modified the open/close state. Reply to this email directly, view it on GitHub, or mute the thread.

ridgelab commented 5 years ago

I'm glad that you were able to get it to work. Good luck with your analyses.

wangjie1201 commented 5 years ago

I use gff3_parser.py ,but the output.fasta like this: gene 1 CDS1 gene 2 CDS1 gene 2 CDS2 gene 2 CDS3 I donnot know why