loosolab / TOBIAS

Transcription factor Occupancy prediction By Investigation of ATAC-seq Signal
MIT License
191 stars 41 forks source link

Input files for BINDetect #8

Closed msbentsen closed 4 years ago

msbentsen commented 4 years ago

Hi Mette, I am using the function of Bindetect. Where can I get my motif file? and also what is the bed file that containg all the peaks? how can I get? I am sorry. I am quit new to coding.

Best wishes Xiaohan

Originally posted by @xiaohanhe2020 in https://github.com/loosolab/TOBIAS/issues/6#issuecomment-601449811

msbentsen commented 4 years ago

Hi @xiaohanhe2020,

I just created a new issue to keep the questions separate. To answer your question:

Hope these answers help you out!

xiaohanhe2020 commented 4 years ago

Thank you very much for your reply Mette, It is very helpfu. May I ask you another question? How can you get the annotated peaks.bed file?

Best wishes Xiaohan

xiaohanhe2020 commented 4 years ago

Hi Mette. Can I use homer instead of UROPA?

Best wishes Xiaohan

msbentsen commented 4 years ago

Hi Xiaohan,

Yes, you can use any annotation software you like (I used UROPA for the file in test_data). For BINDetect, the peaks do not have to be annotated - but I recommend to annotate since the additional columns from --peaks are transferred to the output TFBS. That makes it easier to interpret the biological significance of the differential binding sites.

Best Mette

xiaohanhe2020 commented 4 years ago

Dear matte, Thank you so much. Could you please do me a help to use UROPA to annotate the my file for me? I just learn coding for two weeks. I don’t think I will get this work without help.

Many thanks Xiaohan

On 20 Mar 2020, at 13:01, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:

Hi Xiaohan,

Yes, you can use any annotation software you like (I used UROPA for the file in test_data). For BINDetect, the peaks do not have to be annotated - but I recommend to annotate since the additional columns from --peaks are transferred to the output TFBS. That makes it easier to interpret the biological significance of the differential binding sites.

Best Mette

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601688132&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7C9bcff14ba9f349e911f108d7cccec16b%7C8370cf1416f34c16b83c724071654356%7C0&sdata=fxaI0%2FS5O9t%2FzPv6P6cTQTHp73TBFiD9IiRMQuqzsnc%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X52CG6Y2GUJZLAU7MTRINSJFANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7C9bcff14ba9f349e911f108d7cccec16b%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6ORP80TTRJQprIGT7vfycrb0KTrweAYGkXU%2FkcQN1%2B8%3D&reserved=0.

msbentsen commented 4 years ago

Hi Xiaohan,

I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is here). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.

Best Mette

xiaohanhe2020 commented 4 years ago

Hi Mette, Thank you for your email. I try UROPA. This is my code. But it seems not work well.

download GTF file

wget ftp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gzhttp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gz gunzip Homo_sapiens.GRCh37.75.gtf.gz

download BED file

wget https://www.encodeproject.org/files/ENCFF966LMJ/@@download/ENCFF966LMJ.bed.gz gunzip ENCFF966LMJ.bed.gz

config.json

''' {"queries": [ {"feature":"gene", "distance":5000, "feature.anchor": "start", “show.attribute s":"gene_name"}, {"feature": "gene","distance":5000, "feature.anchor":"center"}],"priority" : "False","gtf": "/home/soft/uropa/Homo_sapiens.GRCh37.75.gtf","bed": "/home/soft/research/uropa/ENCFF966LMJ.bed"} '''

uropa -I config.json

Do you know the reasons?

Best wishes Xiaohan

On 20 Mar 2020, at 15:22, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:

Hi Xiaohan,

I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is herehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FUROPA&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=K9flv4q%2FeNID%2FJtQRgbXFt6iGZYbwTlSyDt7pvy6qeA%3D&reserved=0). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.

Best Mette

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601756194&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6M43YZ0dFYLnQKNpWoy4tCdFQrCJAg85RS%2Ft%2F2QjLN4%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X3IZUMZ7GS5TEE2JCTRIOC4ZANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=1IEuNYwjx%2FleLy1KybHC6yoi0FNpwEkAnOe4BHogvOU%3D&reserved=0.

xiaohanhe2020 commented 4 years ago

Dear Matte, Sorry to disturb you. Could you please send me the config.json file for annotated peaks.bed in the test dataset of Tobias?

Best wishes Xiaohan

On 21 Mar 2020, at 10:40, He, Xiaohan xiaohan.he@kcl.ac.uk<mailto:xiaohan.he@kcl.ac.uk> wrote:

Hi Mette, Thank you for your email. I try UROPA. This is my code. But it seems not work well.

download GTF file

wget ftp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gzhttp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gz gunzip Homo_sapiens.GRCh37.75.gtf.gz<x-msg://3/Homo_sapiens.GRCh37.75.gtf.gz>

download BED file

wget https://www.encodeproject.org/files/ENCFF966LMJ/@@download/ENCFF966LMJ.bed.gz gunzip ENCFF966LMJ.bed.gz

config.json

''' {"queries": [ {"feature":"gene", "distance":5000, "feature.anchor": "start", “show.attribute s":"gene_name"}, {"feature": "gene","distance":5000, "feature.anchor":"center"}],"priority" : "False","gtf": "/home/soft/uropa/Homo_sapiens.GRCh37.75.gtf<x-msg://3/Homo_sapiens.GRCh37.75.gtf>","bed": "/home/soft/research/uropa/ENCFF966LMJ.bed"} '''

uropa -I config.json

Do you know the reasons?

Best wishes Xiaohan

On 20 Mar 2020, at 15:22, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:

Hi Xiaohan,

I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is herehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FUROPA&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=K9flv4q%2FeNID%2FJtQRgbXFt6iGZYbwTlSyDt7pvy6qeA%3D&reserved=0). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.

Best Mette

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601756194&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6M43YZ0dFYLnQKNpWoy4tCdFQrCJAg85RS%2Ft%2F2QjLN4%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X3IZUMZ7GS5TEE2JCTRIOC4ZANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=1IEuNYwjx%2FleLy1KybHC6yoi0FNpwEkAnOe4BHogvOU%3D&reserved=0.

msbentsen commented 4 years ago

Hi Xiaohan, Please write an issue on the UROPA github issues page if you are having problems with the use of UROPA - thank you! Closing as this is not related to TOBIAS.