Closed msbentsen closed 4 years ago
Hi @xiaohanhe2020,
I just created a new issue to keep the questions separate. To answer your question:
The motif file is a file containing any TF motifs in common formats (jaspar/pfm/meme). You can download this file from public databases such as JASPAR (link to download).
The peaks file is a .bed-file of ATAC-seq peaks. You can use a peak-caller such as MACS2 to get these per sample/condition. The important thing is that the peaks cover all conditions you want to compare. So if you did peak-calling separately per sample/condition, you can use bedtools merge
to merge the different peak sets.
Hope these answers help you out!
Thank you very much for your reply Mette, It is very helpfu. May I ask you another question? How can you get the annotated peaks.bed file?
Best wishes Xiaohan
Hi Mette. Can I use homer instead of UROPA?
Best wishes Xiaohan
Hi Xiaohan,
Yes, you can use any annotation software you like (I used UROPA for the file in test_data). For BINDetect, the peaks do not have to be annotated - but I recommend to annotate since the additional columns from --peaks
are transferred to the output TFBS. That makes it easier to interpret the biological significance of the differential binding sites.
Best Mette
Dear matte, Thank you so much. Could you please do me a help to use UROPA to annotate the my file for me? I just learn coding for two weeks. I don’t think I will get this work without help.
Many thanks Xiaohan
On 20 Mar 2020, at 13:01, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:
Hi Xiaohan,
Yes, you can use any annotation software you like (I used UROPA for the file in test_data). For BINDetect, the peaks do not have to be annotated - but I recommend to annotate since the additional columns from --peaks are transferred to the output TFBS. That makes it easier to interpret the biological significance of the differential binding sites.
Best Mette
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601688132&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7C9bcff14ba9f349e911f108d7cccec16b%7C8370cf1416f34c16b83c724071654356%7C0&sdata=fxaI0%2FS5O9t%2FzPv6P6cTQTHp73TBFiD9IiRMQuqzsnc%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X52CG6Y2GUJZLAU7MTRINSJFANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7C9bcff14ba9f349e911f108d7cccec16b%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6ORP80TTRJQprIGT7vfycrb0KTrweAYGkXU%2FkcQN1%2B8%3D&reserved=0.
Hi Xiaohan,
I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is here). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.
Best Mette
Hi Mette, Thank you for your email. I try UROPA. This is my code. But it seems not work well.
wget ftp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gzhttp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gz gunzip Homo_sapiens.GRCh37.75.gtf.gz
wget https://www.encodeproject.org/files/ENCFF966LMJ/@@download/ENCFF966LMJ.bed.gz gunzip ENCFF966LMJ.bed.gz
config.json
''' {"queries": [ {"feature":"gene", "distance":5000, "feature.anchor": "start", “show.attribute s":"gene_name"}, {"feature": "gene","distance":5000, "feature.anchor":"center"}],"priority" : "False","gtf": "/home/soft/uropa/Homo_sapiens.GRCh37.75.gtf","bed": "/home/soft/research/uropa/ENCFF966LMJ.bed"} '''
uropa -I config.json
Do you know the reasons?
Best wishes Xiaohan
On 20 Mar 2020, at 15:22, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:
Hi Xiaohan,
I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is herehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FUROPA&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=K9flv4q%2FeNID%2FJtQRgbXFt6iGZYbwTlSyDt7pvy6qeA%3D&reserved=0). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.
Best Mette
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601756194&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6M43YZ0dFYLnQKNpWoy4tCdFQrCJAg85RS%2Ft%2F2QjLN4%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X3IZUMZ7GS5TEE2JCTRIOC4ZANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=1IEuNYwjx%2FleLy1KybHC6yoi0FNpwEkAnOe4BHogvOU%3D&reserved=0.
Dear Matte, Sorry to disturb you. Could you please send me the config.json file for annotated peaks.bed in the test dataset of Tobias?
Best wishes Xiaohan
On 21 Mar 2020, at 10:40, He, Xiaohan xiaohan.he@kcl.ac.uk<mailto:xiaohan.he@kcl.ac.uk> wrote:
Hi Mette, Thank you for your email. I try UROPA. This is my code. But it seems not work well.
wget ftp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gzhttp://ftp.ensembl.org/pub/release-75/gtf/homo_sapiens/Homo_sapiens.GRCh37.75.gtf.gz gunzip Homo_sapiens.GRCh37.75.gtf.gz<x-msg://3/Homo_sapiens.GRCh37.75.gtf.gz>
wget https://www.encodeproject.org/files/ENCFF966LMJ/@@download/ENCFF966LMJ.bed.gz gunzip ENCFF966LMJ.bed.gz
config.json
''' {"queries": [ {"feature":"gene", "distance":5000, "feature.anchor": "start", “show.attribute s":"gene_name"}, {"feature": "gene","distance":5000, "feature.anchor":"center"}],"priority" : "False","gtf": "/home/soft/uropa/Homo_sapiens.GRCh37.75.gtf<x-msg://3/Homo_sapiens.GRCh37.75.gtf>","bed": "/home/soft/research/uropa/ENCFF966LMJ.bed"} '''
uropa -I config.json
Do you know the reasons?
Best wishes Xiaohan
On 20 Mar 2020, at 15:22, Mette Bentsen notifications@github.com<mailto:notifications@github.com> wrote:
Hi Xiaohan,
I suggest you first spend some time to take a look at the input for UROPA, and try it out yourself (link to the GitHub repository is herehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FUROPA&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=K9flv4q%2FeNID%2FJtQRgbXFt6iGZYbwTlSyDt7pvy6qeA%3D&reserved=0). If you are then still unsure of the usage of the tool, you are welcome to open an issue at the UROPA page. It is not that I don't want to help, but it takes longer than 2 weeks to learn bioinformatics - and this is unfortunately not anything I can help you with.
Best Mette
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Floosolab%2FTOBIAS%2Fissues%2F8%23issuecomment-601756194&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=6M43YZ0dFYLnQKNpWoy4tCdFQrCJAg85RS%2Ft%2F2QjLN4%3D&reserved=0, or unsubscribehttps://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAO4D5X3IZUMZ7GS5TEE2JCTRIOC4ZANCNFSM4LQFNNSQ&data=01%7C01%7Cxiaohan.he%40KCL.ac.uk%7Cd82acc3c9e464628ff4a08d7cce2902e%7C8370cf1416f34c16b83c724071654356%7C0&sdata=1IEuNYwjx%2FleLy1KybHC6yoi0FNpwEkAnOe4BHogvOU%3D&reserved=0.
Hi Xiaohan, Please write an issue on the UROPA github issues page if you are having problems with the use of UROPA - thank you! Closing as this is not related to TOBIAS.
Hi Mette, I am using the function of Bindetect. Where can I get my motif file? and also what is the bed file that containg all the peaks? how can I get? I am sorry. I am quit new to coding.
Best wishes Xiaohan
Originally posted by @xiaohanhe2020 in https://github.com/loosolab/TOBIAS/issues/6#issuecomment-601449811